BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy10434
         (593 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
          Length = 648

 Score =  648 bits (1672), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 326/587 (55%), Positives = 409/587 (69%), Gaps = 21/587 (3%)

Query: 13  CTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
           C ++F  + + + A  KKP+II+I+ADD+G+NDV FHGS++IPTPNIDALAYNG+ILN H
Sbjct: 5   CKIIFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSH 64

Query: 72  YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           Y QALCTPSRSA +TGKYPIH+GMQH VILE EPWGLPL E +LPQYLK  GYATHAIGK
Sbjct: 65  YTQALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGK 124

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLGFFR+ YTPT+RGFDSH+GYWQGLQDYY H+  ATF P  G DMR NM VD    G 
Sbjct: 125 WHLGFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVHATFTPEHGYDMRRNMTVDWSAQGK 184

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           YST L+T+ A+ +I EHN   PMF+YLAHLA H+GN  +P QAPDEE+AKF  I+DPERR
Sbjct: 185 YSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERR 244

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            YA MVS LD+SVG+VIAALR   MLENSI++FM+DNGA   GIH+N GSN+PLRG K++
Sbjct: 245 IYAAMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNS 304

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
            W+G MR VAAIWSP +K+ Q+VS+ L HISDWLPT   AAG+   +   +DGV+ W  +
Sbjct: 305 AWEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASI 364

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKY 429
           ++G  + R+E+LHNID + N    Y ALRV + KY+ G+  NG+SD WYG +  + +  Y
Sbjct: 365 SEGKDSPRTELLHNIDEIYN----YGALRVGNWKYLYGSTTNGKSDGWYGSSGRDPLYTY 420

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC- 488
               VL S+ G T   L T  QIK+KH  D     ++    +L  E I  LR  A V+C 
Sbjct: 421 DDSAVLASQTGSTLAGLTTYQQIKEKHQGD-----TNFTHKLLDSETIKTLRGAAEVKCP 475

Query: 489 --NYDN--KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
             N++   +   CN+   PCLFNI +DPCEQ NLA  +  ++  +E  LA +K T +P  
Sbjct: 476 RVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLNMEMALARFKQTALPIR 535

Query: 545 NKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITL 591
           N P D  ADPA+WNN WV W D  D +K     +++   LSP  I L
Sbjct: 536 NVPRDPNADPAKWNNTWVNWQDYEDVKK----QKIYFNTLSPLAIGL 578


>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 646

 Score =  639 bits (1649), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 324/587 (55%), Positives = 407/587 (69%), Gaps = 23/587 (3%)

Query: 13  CTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
           C ++F  + + + A  KKP+II+I+ADD+G+NDV FHGS++IPTPNIDALAYNG+ILN H
Sbjct: 5   CKIIFFGSVVASFAQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSH 64

Query: 72  YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           Y QALCTPSRSA +TGKYPIH+GMQH VILE EPWGLPL E +LPQYLK  GYATHAIGK
Sbjct: 65  YTQALCTPSRSAFLTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGK 124

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLGFFR+ YTPT+RGFDSH+GYWQGLQDYY H+    F P  G DMR NM VD    G 
Sbjct: 125 WHLGFFRKEYTPTYRGFDSHFGYWQGLQDYYKHTVH--FTPEHGYDMRRNMTVDWSAQGK 182

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           YST L+T+ A+ +I EHN   PMF+YLAHLA H+GN  +P QAPDEE+AKF  I+DPERR
Sbjct: 183 YSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERR 242

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            YA MVS LD+SVG+VIAALR   MLENSI++FM+DNGA   GIH+N GSN+PLRG K++
Sbjct: 243 IYAAMVSMLDKSVGSVIAALRDKHMLENSIIVFMSDNGAKPDGIHANHGSNYPLRGNKNS 302

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
            W+G MR VAAIWSP +K+ Q+VS+ L HISDWLPT   AAG+   +   +DGV+ W  +
Sbjct: 303 AWEGAMRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELPKMDGVDMWASI 362

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKY 429
           ++G  + R+E+LHNID + N    Y ALRV + KY+ G+  NG+SD WYG +  + +  Y
Sbjct: 363 SEGKDSPRTELLHNIDEIYN----YGALRVGNWKYLYGSTTNGKSDGWYGSSGRDPLYTY 418

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC- 488
               VL S+ G T   L T  QIK+KH  D     ++    +L  E I  LR  A V+C 
Sbjct: 419 DDSAVLASQTGSTLAGLTTYQQIKEKHQGD-----TNFTHKLLDSETIKTLRGAAEVKCP 473

Query: 489 --NYDN--KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
             N++   +   CN+   PCLFNI +DPCEQ NLA  +  ++  +E  LA +K T +P  
Sbjct: 474 RVNFEEIPESKKCNAVESPCLFNIKEDPCEQINLAAERPMIVLNMEMALARFKQTALPIR 533

Query: 545 NKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITL 591
           N P D  ADPA+WNN WV W D  D +K     +++   LSP  I L
Sbjct: 534 NVPRDPNADPAKWNNTWVNWQDYEDVKK----QKIYFNTLSPLAIGL 576


>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 710

 Score =  616 bits (1589), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 308/559 (55%), Positives = 395/559 (70%), Gaps = 27/559 (4%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+I++I+ADD+GWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSAL+TGKY
Sbjct: 44  PNIVMIIADDMGWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKY 103

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PIH GMQH VILE EP GLPL EK+LPQYLKEAGYATHAIGKWH GF R  YTPT+RGFD
Sbjct: 104 PIHTGMQHLVILEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFD 163

Query: 150 SHYGYWQGLQDYYDHSCKAT--FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           SH+GYWQGLQDYY H   ++   E + G DMR NM +   T G YSTDL+T+ A+ +I E
Sbjct: 164 SHFGYWQGLQDYYTHEVGSSNPKEGFLGFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEE 223

Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           H  ++ PMFLYLAHLA H+GN  EP QAPDEEVAKF  + DPERR YA M+S+LD+SVG 
Sbjct: 224 HRPEAGPMFLYLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGE 283

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V++ALR+  ML+NSIV+FMADNGA + GIH N+GSN+PLRG+K++ W+G +RG AA+WSP
Sbjct: 284 VVSALRRKNMLQNSIVVFMADNGAATQGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSP 343

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIE---INDTSLDGVNQWDVLTKGAKT-KRSEIL 382
            +++ +++ +EL  I+DWLPTL +A+G+        ++DGV+QW  ++  A +  R+EIL
Sbjct: 344 LIQRPKRIYNELMSIADWLPTLLSASGLRDVVRVSANIDGVDQWPAISGVAPSPPRNEIL 403

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK--------YSPKEV 434
            NID + N    Y+ALR  + KYV GT  NG+  EWYG+T    ++        Y P+ V
Sbjct: 404 VNIDPIFN----YSALRRGEFKYVLGTVGNGE--EWYGETGRPENQGLEGASPTYDPETV 457

Query: 435 LYSKAGITFNALKTKLQIKQKHAA--DPKANSSD---ALRTILTDEKILELREFARVRCN 489
           L SKAG     L T  Q+ +  A   D      D   A   +LT +++L+LR  A +RC 
Sbjct: 458 LMSKAGTAIAGLLTAKQVSEIRAVRKDTYGRVKDDEMATSKLLTVDELLKLRSSASLRCT 517

Query: 490 Y-DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
             +++   C+    PCLFNI +DPCEQ NLA S+  +L  LE+ L  Y+ T +PP N P 
Sbjct: 518 VPESERVACHPLQSPCLFNIKEDPCEQRNLAASRAMILATLEEALLKYRVTALPPSNVPN 577

Query: 549 DKRADPARWNNIWVPWYDE 567
           D +A+PA WN+ WV W D+
Sbjct: 578 DPKANPAFWNHTWVNWQDD 596


>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 671

 Score =  616 bits (1588), Expect = e-173,   Method: Compositional matrix adjust.
 Identities = 301/561 (53%), Positives = 394/561 (70%), Gaps = 23/561 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           +TA +KPHI+IILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSA
Sbjct: 22  STAKQKPHIVIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSA 81

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           LMTGK PIH+GMQH V+   EP GLPL+EKLLP+YL+E GY THA+GKWHLG+F++ YTP
Sbjct: 82  LMTGKNPIHLGMQHSVLFPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTP 141

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           T+RGFDSH+GYW GLQDYY H  +     +  Y+G DMR N+ V   T+G YSTDL+T  
Sbjct: 142 TYRGFDSHFGYWNGLQDYYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNE 201

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A+ +I EHN  +PMFLYLAHLA H GN  + F+APDEE+AKF  I DPERR  A +VS+L
Sbjct: 202 AVRLINEHNIDQPMFLYLAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKL 261

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVG+V+ ALR  GMLENSIV+FM+DNGAPS G+ SN+GSN+PLRG+K++PW+GG RGV
Sbjct: 262 DQSVGDVMDALRNRGMLENSIVVFMSDNGAPSQGLLSNEGSNYPLRGIKNSPWEGGTRGV 321

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
           AAIWSP +K++++VS+++  +SDWLPTL +AAG++     ++DG + W  L     + RS
Sbjct: 322 AAIWSPLIKKSKRVSNQMMFMSDWLPTLLSAAGVDRRQLGNIDGYDLWSALVLNKISSRS 381

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEVL 435
           E++ NID++ +    YAA R  D KY+ G    G +  W G + +  +    +Y P ++L
Sbjct: 382 EVMLNIDDLSD----YAAFRRGDFKYIIGRTETGSA--WLGASGDPSEGISPQYDPHKIL 435

Query: 436 YSKAGITFNAL---KTKLQIKQKHAADPKANSSDALRT-----ILTDEKILELREFARVR 487
           YSK G+  + +   K  +++K++           AL+T     IL+ E I ELR  A+++
Sbjct: 436 YSKTGVAISGIITAKQAMELKEERRKRSVKIYDTALKTNFEEKILSVENISELRRKAQIK 495

Query: 488 CNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
           CN   K    C     PCLFNI  DPCE  NLAE +  ++  LE  L  Y+ T++PP N 
Sbjct: 496 CNVKEKDRIPCEPMTAPCLFNIEKDPCEMVNLAEKRPMIMAILERILMKYRLTVIPPSNL 555

Query: 547 PFDKRADPARWNNIWVPWYDE 567
             D RA+P+ WNN W  WY++
Sbjct: 556 DGDPRANPSLWNNTWTSWYED 576


>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 713

 Score =  612 bits (1579), Expect = e-172,   Method: Compositional matrix adjust.
 Identities = 304/559 (54%), Positives = 389/559 (69%), Gaps = 24/559 (4%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT  +KPHIIIILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRS
Sbjct: 65  STTTKQKPHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRS 124

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTGK PIH+GMQH V+   EP GLPL+EKLLP+YL+E GY THA+GKWHLG+F++ YT
Sbjct: 125 ALMTGKNPIHLGMQHSVLFPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYT 184

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
           PT+RGFDSH+GYW GLQDYY H  +     F  ++G DMR N+ V   T+G YSTDL+T 
Sbjct: 185 PTYRGFDSHFGYWNGLQDYYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTN 244

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            AI +I EH+  +PMFLYLAHLAVH GN  + F+APDEE+AKF  I DPERR  A +VS+
Sbjct: 245 EAIRLINEHDTDRPMFLYLAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSK 304

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD+SVG+V+ ALR  GMLENSIVLFM+DNGAPS GI +NKGSN+PLRG+K++PW+GG RG
Sbjct: 305 LDQSVGDVMDALRNRGMLENSIVLFMSDNGAPSQGILNNKGSNYPLRGIKNSPWEGGTRG 364

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR 378
           VAAIWSP +K++++VS+++  ISDWLPTL +AAG+      ++DG + W  L     + R
Sbjct: 365 VAAIWSPLIKKSKRVSNQMMFISDWLPTLLSAAGVNRKQLGNIDGFDLWSALVLNKVSSR 424

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEV 434
           SE++ NID++ +    YA+ R  + KY+ G    G   +W G + +  +    +Y P ++
Sbjct: 425 SEVVLNIDDLSD----YASFRRGNFKYIIGRTETGS--DWLGASGDPSEGISPQYDPYKI 478

Query: 435 LYSKAGITFNAL---KTKLQIKQKHAADPKANSSDALRT-----ILTDEKILELREFARV 486
           LYSK G+  + +   K  +++K++     K   + AL T     IL+ E I ELR  A++
Sbjct: 479 LYSKTGVAISGIITAKQAMELKERRKRSIKIYDT-ALETKFEEKILSVENISELRRKAQI 537

Query: 487 RCNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           +CN   K    C     PCLFNI  DPCE  NLAE +  ++  LE  L  Y+ T +P  N
Sbjct: 538 KCNVQEKDRIPCEPMTAPCLFNIEKDPCEMVNLAERRPVIMAILERILMKYRITAIPASN 597

Query: 546 KPFDKRADPARWNNIWVPW 564
              D RADP  WNN W  W
Sbjct: 598 LDGDPRADPTLWNNTWTSW 616


>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 689

 Score =  607 bits (1566), Expect = e-171,   Method: Compositional matrix adjust.
 Identities = 303/548 (55%), Positives = 382/548 (69%), Gaps = 15/548 (2%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KPHIIIILADDLGWNDVSFHGS QIPTPNIDA+AYNG+ILN HYV ALCTPSR+ALMTG
Sbjct: 57  QKPHIIIILADDLGWNDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTG 116

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K PIH+GMQH V+L  EP GLPL+EKLLP+YL+E GY THA+GKWHLG+FR  YTPTFRG
Sbjct: 117 KNPIHLGMQHSVLLPSEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRG 176

Query: 148 FDSHYGYWQGLQDYYDHSCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           FD+H+GYW GLQDYY H  +     F  + GLDMR N+     T G YSTDL+TE A+ +
Sbjct: 177 FDTHFGYWNGLQDYYTHITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRL 236

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I EH+K  PMFLYLAHLA H GN     +A DE++A+F  I DPERR  A +VS++D+SV
Sbjct: 237 INEHDKDDPMFLYLAHLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSV 296

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+ ALRK GMLENSIVLFM+DNGAP+ G  SN+GSN+PLRG+K +PW+GG RGVAAIW
Sbjct: 297 GEVMDALRKRGMLENSIVLFMSDNGAPTQGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIW 356

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
           SP +K++Q+VS+++  I+DWLPTL +AAG++     ++DG + W  L     + RS+++ 
Sbjct: 357 SPLIKESQRVSNQMMFIADWLPTLLSAAGVDSKQLGNIDGFDLWPALRSDKISPRSDVVL 416

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYSPKEVLYSKA 439
           NID++      YA++R  D KYV G    G++  W G T    +    KY P +VLYSKA
Sbjct: 417 NIDDISK----YASIRRGDFKYVIGETETGRA--WVGATGAPSEGIPPKYDPDKVLYSKA 470

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCN 498
           G+  + + T  Q K+       ++  D    IL+ EKI +LR  ARV+CN   +    C+
Sbjct: 471 GVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAEKIFQLRNDARVQCNVKEENRIPCD 530

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
            T  PCLFNI  DPCE  NLA+ +  +L  +E  L  Y+ T VPP N   D RA+PA WN
Sbjct: 531 PTAAPCLFNIEKDPCEMVNLADKRPLILAIMERVLMNYRLTTVPPSNVNGDPRANPALWN 590

Query: 559 NIWVPWYD 566
           N W  W D
Sbjct: 591 NTWTTWDD 598


>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 570

 Score =  553 bits (1425), Expect = e-155,   Method: Compositional matrix adjust.
 Identities = 272/570 (47%), Positives = 387/570 (67%), Gaps = 47/570 (8%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           +T ++ F +        ++P+IIIILADDLGWNDVSFHGS+QI TPNIDALAYNG+ILN 
Sbjct: 31  ITLSIFFQNV---VDGIERPNIIIILADDLGWNDVSFHGSNQIQTPNIDALAYNGIILNS 87

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           HYV ALCTPSR++LMTGKYP  +GMQH VIL  EPWGLPL E L+P+Y  + GYATHA+G
Sbjct: 88  HYVPALCTPSRASLMTGKYPTSLGMQHLVILSPEPWGLPLNETLMPEYFNKNGYATHAVG 147

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLGFF++ YTP +RGFDSH+G+W G QDYYDH+  +  +  +G DMR N +VD    G
Sbjct: 148 KWHLGFFKKEYTPIYRGFDSHFGHWNGFQDYYDHTTMS--DSLKGYDMRRNFEVDYSYQG 205

Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           +Y+TD++T+ AI +I  HN  K P+FLYL+HLA H+GN   PFQAP++E++K   I+DP 
Sbjct: 206 MYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPEDEISKHECINDPG 265

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R+ YA MV++LDESVG V++AL K+ ML NSI++FM+DNGA ++G+HSN+GSN+PLRG+K
Sbjct: 266 RKIYAAMVTKLDESVGQVVSALEKNKMLNNSIIIFMSDNGAATYGLHSNRGSNYPLRGLK 325

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL----DGVN 365
            +PW+GG+RG AAIWSP+L +T++VS +L H+SDWLPTL  AAG+  + T L    DG++
Sbjct: 326 ESPWEGGVRGTAAIWSPFLNKTKRVSKQLMHMSDWLPTLLTAAGLNYSSTQLINKIDGID 385

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDN 424
            W+VL+    + R E+ +N D ++N    Y++L +D  KYV GT   G++D W+ + + N
Sbjct: 386 MWNVLSNDLPSPRKEVFNNYDEIEN----YSSLMIDSWKYVEGTAQEGKADYWFEEPSRN 441

Query: 425 EIDKY--SPKEVLYSKAGITF----NALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
              +Y  S +++   +   T         + L I + +  D K    +  + +LT + +L
Sbjct: 442 NCSEYRVSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVK----NKTKYVLTCDPLL 497

Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
           +                      + CLFN+ DDPCE+ NLA+   D++K+++++L   K 
Sbjct: 498 K----------------------RFCLFNLNDDPCERLNLADVFPDVVKRIKNRLLELKK 535

Query: 539 TMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
           ++V P NKP D  ++P  +N  WV W D++
Sbjct: 536 SVVKPLNKPEDPYSNPMFYNGTWVNWKDDI 565


>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 583

 Score =  553 bits (1424), Expect = e-154,   Method: Compositional matrix adjust.
 Identities = 279/549 (50%), Positives = 367/549 (66%), Gaps = 36/549 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKPHIIIILADDLGWNDVSFHGS+QIPTPNIDALAY G++L  HYV ALCTPSR+AL+TG
Sbjct: 39  KKPHIIIILADDLGWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTG 98

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KYPIH+GMQH  I   EP GLPL EKLLPQYLK+  Y TH +GKWHLG+++  YTP +RG
Sbjct: 99  KYPIHLGMQHEAIFPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRG 158

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           FD+H+GYW GLQDYY H    T EPY    G+DMR N  V   T+G YS DLYT+ A+ +
Sbjct: 159 FDTHFGYWNGLQDYYSHK---TAEPYTLNIGMDMRRNFTVAWDTMGKYSVDLYTDEAVRL 215

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  HN   PMFLYLA +A HAGN  +  QA  EE+ KF  I DP+R+ YA +VS+LDESV
Sbjct: 216 INTHNTDNPMFLYLAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESV 275

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+ AL+   ML NSI+LFM+DNGAP+ G  SN GSN+P RG+K T W+GG+RG AAIW
Sbjct: 276 GKVVEALKNRNMLSNSIILFMSDNGAPTSGFLSNGGSNYPFRGIKKTLWEGGVRGAAAIW 335

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
           SP +K+ ++VS  L H++DWLPTL +AAG    +  ++DG++ W+ +  G  + R E+L 
Sbjct: 336 SPLIKKRERVSYHLMHVTDWLPTLLSAAGGNKEEIGTIDGLDMWETIVYGDCSPRKEVLI 395

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK----YSPKEVLYSKA 439
           NID++ + +    A+R+   KYV G   + ++++W GD+     +    YSP+ VL SK 
Sbjct: 396 NIDDICDSE----AIRIGKYKYVRG--ESKKNNDWVGDSGRNSSEQRPPYSPETVLNSKV 449

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY-DNKGAHCN 498
           GI   +L             P  N+       LT  K+LELR+ A V CN  + +   C+
Sbjct: 450 GIAIASL-------------PGNNN-----VALTSAKMLELRQQAEVHCNVTEEEKVQCD 491

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
           + + PCLF++  DPCE  N+ + +  + K+L++ L  ++STM+ P N+  D RA P  WN
Sbjct: 492 TQLAPCLFDLERDPCEMINIIKQEPLVAKELQEVLLQHRSTMLLPNNRIIDVRASPILWN 551

Query: 559 NIWVPWYDE 567
             WV W D+
Sbjct: 552 CTWVCWMDD 560


>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
          Length = 544

 Score =  548 bits (1412), Expect = e-153,   Method: Compositional matrix adjust.
 Identities = 271/561 (48%), Positives = 363/561 (64%), Gaps = 30/561 (5%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           A+    L      + TA ++PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+
Sbjct: 6   AVLWAFLIVPFLRHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLD 65

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
           ++YV  +CTPSRSALMTGKYPIH GMQHGV+   EP GLPL EKLLP+YLKE GY TH +
Sbjct: 66  RYYVSPICTPSRSALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIV 125

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GKWHLGF+ + YTPT+RGFDSH G+W G QDY+DH+  A   PY GLDMR  M+      
Sbjct: 126 GKWHLGFYTKQYTPTYRGFDSHIGFWSGHQDYFDHT--AVESPYWGLDMRRGMEAAWDLH 183

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G YSTD++T  A+ +I  HN SKP+FLYLAH AVH+GN Y+P  APD +VAKF +I D  
Sbjct: 184 GQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKFTNIFDYN 243

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           RR +A M+S+LDESVG V+ AL++ GML+NSI++F  DNG P  G + N  SN PLRG K
Sbjct: 244 RRRFAAMLSKLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFNLNAASNWPLRGTK 303

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
           +T W+GG+RG   IWSP L +  +VS +LFHISDWLPTL  AAG + ++ S+DG++ W  
Sbjct: 304 NTLWEGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPSNLSIDGLDLWHA 363

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
           L++  ++ R  +LHNID++       + + + + KY+ G+   GQ D WYG         
Sbjct: 364 LSEDTESPRQSVLHNIDDIFG----VSGITIGEWKYIQGSTYEGQWDGWYGP-------- 411

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
           S +E +Y   G+                  P   +  ++   L+ E +  LRE A ++C 
Sbjct: 412 SGREWVYDIGGVI---------------GSPTGRAVASVGYSLSSENVRRLREQAMIKCR 456

Query: 490 YDNKG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
             N     C   + PCLF++  DPCE NNL     ++ + + + L  + +T + PGN P+
Sbjct: 457 PRNDTLPLCKPLIAPCLFHVLKDPCEDNNLINEYPEIARAMREDLKKWNATALLPGNLPW 516

Query: 549 DKRADPARWNNIWVPWYDELD 569
           DKRA+P  W++ W+ + D LD
Sbjct: 517 DKRANPELWDHTWINFGDHLD 537


>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 545

 Score =  541 bits (1395), Expect = e-151,   Method: Compositional matrix adjust.
 Identities = 274/548 (50%), Positives = 360/548 (65%), Gaps = 30/548 (5%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +  A  +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV  +CTPSRS
Sbjct: 20  HVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRS 79

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTGK+PIH GMQHGV+   EP GLPL EKLLPQYL+E GY+TH +GKWHLGF+ + YT
Sbjct: 80  ALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYT 139

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P +RGFDSH G+W G  DY+DHS  A   PY GLDMR  +       G YSTD++T+ A+
Sbjct: 140 PMYRGFDSHIGFWSGHHDYFDHS--AVESPYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            +I +HN S+PMFLYL+H AVH+GN+Y P  APD++VAKF +I + ERR +AGM+S+LDE
Sbjct: 198 KLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAGMLSKLDE 257

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG V+ ALRK GML+NS+++F  DNG P  G + N  SN PLRG K+T W+GG+RG   
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEGGVRGAGL 317

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +WSP L +  +VS +LFHISDWLPTL  AAG   ++ S+DG++ W  L++  ++ R  +L
Sbjct: 318 LWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNLSIDGMDLWYALSEDTESPRKMVL 377

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           HNID++       A +   D KY+ G+  NGQ D WYG +  E   Y    V+ S AG  
Sbjct: 378 HNIDDIFG----VAGITYGDWKYIQGSTYNGQWDGWYGPSGREW-VYDIGGVIGSTAGRA 432

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTV 501
             ++   L             S+DA+R          LRE A ++C   N     C    
Sbjct: 433 VASVGLAL-------------SADAIR---------RLREDAMIKCPPKNDSLPLCKPLE 470

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
            PCLFN+  DPCE NNL      L+ +L+DKL    +T + PGN P+D +A+P  W++ W
Sbjct: 471 APCLFNVQQDPCEDNNLINKYPSLVTELQDKLRKLNATAILPGNLPWDNKANPDYWDHTW 530

Query: 562 VPWYDELD 569
             + D L+
Sbjct: 531 TNFGDYLN 538


>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
          Length = 545

 Score =  539 bits (1389), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 273/548 (49%), Positives = 357/548 (65%), Gaps = 30/548 (5%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +  A  +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV  +CTPSRS
Sbjct: 20  HVAAAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRS 79

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTGK+PIH GMQHGV+   EP GLPL EKLLPQYL+E GY+TH +GKWHLGF+ + YT
Sbjct: 80  ALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYT 139

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P +RGFDSH G+W G  DY+DHS  A   PY GLDMR  +       G YSTD++T+ A+
Sbjct: 140 PMYRGFDSHIGFWSGHHDYFDHS--AVESPYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            +I +HN S+PMFLYL H AVH+GN+Y P   PD++VAKF +I + ERR +AGM+S+LDE
Sbjct: 198 KLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAGMLSKLDE 257

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG V+ ALRK GML+NS+++F  DNG P  G + N  SN PLRG K+T W+GG+RG   
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFNLNAASNWPLRGTKNTLWEGGVRGAGL 317

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +WSP L +  +VS +LFHISDWLPTL  AAG   ++ S+DG++ W  L++  ++ R  +L
Sbjct: 318 LWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNLSIDGMDLWYALSEDTESPRKMVL 377

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           HNID++       A +   D KY+ G+  NGQ D WYG +  E   Y    V+ S AG  
Sbjct: 378 HNIDDIFG----VAGITYGDWKYIQGSTYNGQWDGWYGPSGREW-VYDIGGVIGSTAGRA 432

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTV 501
             ++   L             S+DA+R          LRE A ++C   N     C    
Sbjct: 433 VASVGLTL-------------SADAIR---------RLREDAMIKCPPKNDSLPLCKPLE 470

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
            PCLFN+  DPCE NNL      L+ +L D+L    +T + PGN P+D RA+P  W++ W
Sbjct: 471 APCLFNVQQDPCEDNNLINKYPSLVTELRDRLRKLNATAILPGNLPWDNRANPDYWDHTW 530

Query: 562 VPWYDELD 569
             + D L+
Sbjct: 531 TNFGDYLN 538


>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
 gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
          Length = 573

 Score =  538 bits (1385), Expect = e-150,   Method: Compositional matrix adjust.
 Identities = 272/571 (47%), Positives = 365/571 (63%), Gaps = 34/571 (5%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           + +T  LL+     +T   ++P+I+II+ADDLGWNDVSFHGS+QIPTPNIDALAY+G+IL
Sbjct: 7   YLVTLWLLWLGGGASTAHQERPNIVIIVADDLGWNDVSFHGSNQIPTPNIDALAYDGIIL 66

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           N+HYV  LCTPSR++LMTGK+P++IGMQ  VI+  EPWGL L +KL+PQY +EAGY TH 
Sbjct: 67  NRHYVPPLCTPSRASLMTGKHPMNIGMQDHVIISDEPWGLGLDQKLMPQYFREAGYRTHL 126

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +GKWHLGFFR  YTPT+RGFDSH+GY     DY+DHS +      +GLDMR N  V+   
Sbjct: 127 VGKWHLGFFRRAYTPTYRGFDSHFGYLGPYIDYWDHSLQMNETSARGLDMRRNTAVNYDA 186

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
            G Y+TDL+ + A+ +I  HN+SKP+FL L HLA H GN  +P QAP +E+AKF  I DP
Sbjct: 187 NGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPADEIAKFDYIQDP 246

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           +RRT A MVSR+D  VG +  +L +  ML N+I+LF ADNGAP+ GIH+N GSN+PLRG 
Sbjct: 247 KRRTLAAMVSRIDTGVGRIYRSLERRRMLNNTIILFYADNGAPTLGIHANSGSNYPLRGQ 306

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQ 366
           K +PW+G +RG A IWS  L +   VS++  H+SDWLPTL  AAGI I  N++ +DG NQ
Sbjct: 307 KESPWEGAVRGAALIWSRLLPRKGIVSNQWLHVSDWLPTLGHAAGIRISPNNSPIDGQNQ 366

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
           W  LT  + + R+ +++N  N    +  Y++      KYV GT   G  D W G   +  
Sbjct: 367 WSTLTSSSNSGRTVVMNNAHN----EFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKDG- 421

Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
           ++ S +E  Y K  I    + + +Q                    LT + +  LR  A V
Sbjct: 422 EQISHEE--YYKRLIASQTIGSSMQ--------------------LTRDDVKRLRSKAMV 459

Query: 487 RCNYDNKGAH-----CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
            C+      +     C   VKPCLFN+ +DPCE+ NLA+   +LLK L+  +  +K   +
Sbjct: 460 ECSSAQGTVYVQAIPCEPLVKPCLFNVVEDPCERVNLADQYPELLKDLQSDVERFKRDAL 519

Query: 542 PPGNKPFDKRADPARWNNIWVPWYDELDKQK 572
           PP N+P D R+DPA  N  W  W DE+D+Q+
Sbjct: 520 PPRNQPSDSRSDPALHNYTWTWWQDEMDEQQ 550


>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
          Length = 546

 Score =  533 bits (1372), Expect = e-148,   Method: Compositional matrix adjust.
 Identities = 268/558 (48%), Positives = 360/558 (64%), Gaps = 30/558 (5%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
           C  L      +  A  +PHI+ ILADDLGWNDV FHG SQIPTPNIDALAY GL+L+++Y
Sbjct: 11  CICLTVFVVDHVVASARPHIVFILADDLGWNDVGFHGLSQIPTPNIDALAYTGLLLDRYY 70

Query: 73  VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
           V  +CTPSRSALMTGK+PIH GMQHGV+   EP GLPL EKLLP+YL+  GY+TH +GKW
Sbjct: 71  VSPICTPSRSALMTGKHPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKW 130

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
           HLGF+ + YTPT+RGFDSH G+W G  DY+DH+  A  EPY GLDMR  ++      G Y
Sbjct: 131 HLGFYTKEYTPTYRGFDSHLGFWSGHHDYFDHT--AVEEPYWGLDMRRGLEPAWDLHGQY 188

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           STD++T+ A+ +I  HN S+PMFLYL+H AVH+GN Y P  A D +VAKF  I D  RR 
Sbjct: 189 STDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRRR 248

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           +AGM+S+LDESVG V+ ALRK  MLENS+++F  DNG P  G + N  SN PLRG K+T 
Sbjct: 249 FAGMLSKLDESVGLVVEALRKRKMLENSVIVFSTDNGGPPAGFNLNAASNFPLRGTKNTL 308

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           W+GG+RG   +WSP L    ++S +L HISDWLPTL  A G + ++ ++DGV+ W  L +
Sbjct: 309 WEGGVRGTGLVWSPKLINPGRISRQLIHISDWLPTLLTAVGGDPSNLAIDGVDMWRALNE 368

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
              + R  +LHNID++       A +   D K++ G+  NGQ D+WYG +  +   Y P 
Sbjct: 369 DTISPRKMVLHNIDDIYG----IAGITYGDWKFIQGSTYNGQWDDWYGPSGRDW-AYDPY 423

Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
            V+ S AG                    +A +S  L   LT   I +L++ A ++C+  N
Sbjct: 424 AVMESAAG--------------------RAAASVGLS--LTAGNIEKLQQGAAIKCSTRN 461

Query: 493 KG-AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
           +    C     PCLFN+ +DPCE+NNL E    ++  L++KL++  ++ + PGN P+DKR
Sbjct: 462 ENLPLCKPLEAPCLFNVREDPCEKNNLIEKYPSIVSDLKEKLSLLNASAILPGNLPWDKR 521

Query: 552 ADPARWNNIWVPWYDELD 569
            +P  W++ W  + D L+
Sbjct: 522 GNPDYWDHTWTNFGDYLN 539


>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
          Length = 543

 Score =  526 bits (1355), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 261/565 (46%), Positives = 353/565 (62%), Gaps = 30/565 (5%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
           L+    F+   A ++P++I ILADDLGWNDV FHGS QIPTPNIDALAY+GLIL  +YV 
Sbjct: 8   LILFPLFVTIRASERPNVIFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVT 67

Query: 75  ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
            +CTPSRSALMTGKYPIH GMQH V+   EP GLPLTEK+LP+YL+E GY    +GKWHL
Sbjct: 68  PICTPSRSALMTGKYPIHTGMQHTVLFGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHL 127

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
           G + + YTP +RGFDSH GYW G QDYYDH+  A   P  G DMR NM +     G YST
Sbjct: 128 GSYTKEYTPLYRGFDSHLGYWTGHQDYYDHT--AVENPGWGFDMRRNMDLAYDLHGQYST 185

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           D++T+ A+ +I  HN + P+FLYLAH+AVH+ N Y P  APDE V KF +I   +R+ +A
Sbjct: 186 DVFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFA 245

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            M+S+LD+SVG V+ +L K  ML+NSI++F  DNG  + G + N  SN PLRG+K+T W+
Sbjct: 246 AMLSKLDDSVGAVVESLSKRKMLKNSIIIFSTDNGGAAAGFNLNAASNFPLRGVKNTVWE 305

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
           GG+RG   +WSP +KQ Q+V+ +  HISDWLPTL +A G   +  ++DG++ W+ L++  
Sbjct: 306 GGVRGAGLLWSPLIKQPQRVAQQFMHISDWLPTLLSAVGANSSLKNIDGLDLWESLSQNK 365

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            + RSEILHNID++       AAL V   K V GT  NG  D WYG +  +   Y    V
Sbjct: 366 NSPRSEILHNIDDIYG----NAALTVGAWKLVKGTTYNGAWDGWYGPSGRDY-TYDLSLV 420

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           + S  G+   ++                         L  ++I +LR  +R+ C  D + 
Sbjct: 421 VNSPTGVALKSINMS----------------------LVPDEITKLRNASRIDCATD-QP 457

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
             C    + CLFN+  DPCE  NLA+    ++  LE+ L  Y +T +PPGN P D R +P
Sbjct: 458 IPCKPLEETCLFNVIQDPCELRNLAKENPKIVMNLEETLNKYNNTALPPGNLPLDPRGNP 517

Query: 555 ARWNNIWVPWYDELDKQKAIETMRL 579
             WNN W  + D + ++ ++  + +
Sbjct: 518 KYWNNTWTNFGDYIKEKSSLYRVEI 542


>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
 gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
          Length = 585

 Score =  525 bits (1352), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 270/580 (46%), Positives = 359/580 (61%), Gaps = 48/580 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I++I+ADDLGWNDVSFHGS QIPTPNIDALAY+G+ILN+HY   LCTPSR++LMTGK
Sbjct: 30  RPNIVVIIADDLGWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGK 89

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +PI+IGMQH VI   EPWGL L +KL+P+Y +EAGY T  +GKWHLGFFR+ YTPT RGF
Sbjct: 90  HPINIGMQHHVIEVDEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGF 149

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DSHYGY     DY+DHS + +    +GLDMR N+QVD    G Y+TDL+T  A+ +I +H
Sbjct: 150 DSHYGYIGPYIDYWDHSLQMSNTSTRGLDMRRNLQVDYSARGTYATDLFTREAVRLIHDH 209

Query: 209 NKSK--PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           N++   P+FL + HLA H GN  +P QAP+E+V  F  I DP+RR  A M+S+LD+ VG 
Sbjct: 210 NQTSANPLFLVVTHLAPHTGNEDDPMQAPEEDVELFSFIKDPKRRVLAAMISKLDDGVGQ 269

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++ AL+K GML N++VLF ADNGAP+ G HSN GSN PLRG K +PW+G +R  AAIWSP
Sbjct: 270 IVQALKKSGMLNNTVVLFYADNGAPTIGKHSNGGSNFPLRGQKYSPWEGAVRTAAAIWSP 329

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEILHN 384
            L  T +VS++  H+SDWLPTL  AAGIE N   + +DG +QW+ L     T R+ ++HN
Sbjct: 330 LLNNTGRVSNQWIHVSDWLPTLARAAGIEANFSGSEIDGQDQWEALNNPNLTGRTTVMHN 389

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           ID      ++Y++   +  K V GT   G  D+W G  D E +    +  +   + +   
Sbjct: 390 IDQF----QHYSSYTKNGWKLVNGTSWEGAFDDWMGSLDEEDELSEEEYSVRLISSVVGR 445

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGA-HCNSTVKP 503
             K  L                        + +  +R  A V C Y   G   C     P
Sbjct: 446 LTKVDL------------------------DDVARIRRDATVHCPYIPGGTIDCEPQKSP 481

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           CLFNI +DPCE NN+A  + DLLKQL   +  Y+ +   P N+P D  +DP  +N  W  
Sbjct: 482 CLFNILEDPCETNNIANQRPDLLKQLGQDVERYRLSATEPRNQPADPYSDPRYYNRTWTW 541

Query: 564 WYDELDKQKAIE---------------TMRLHKPLLSPTT 588
           W DELD+ + I+                + + KP+L P+T
Sbjct: 542 WQDELDEARDIKFPLAIFIVSITFIAIMLIVVKPILCPST 581


>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 543

 Score =  523 bits (1348), Expect = e-146,   Method: Compositional matrix adjust.
 Identities = 264/551 (47%), Positives = 360/551 (65%), Gaps = 30/551 (5%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
            F+      +PHI+ ILADDLGWNDV FHGS QIPTPNIDALAY+GL+L+++YV  +CTP
Sbjct: 15  VFVVGHVAARPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTP 74

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SRSALMTGK+PIH GMQHGV+   EP GLPL EKLLP+Y ++ GY+TH +GKWHLGF+ +
Sbjct: 75  SRSALMTGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTK 134

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            YTP +RGFDSH G+W G  DY+DHS  A  EPY GLDMR  ++      G YSTD++T+
Sbjct: 135 EYTPMYRGFDSHIGFWSGHHDYFDHS--AVEEPYWGLDMRRGLEPAWDLHGQYSTDVFTK 192

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A+ +I  HN S+PMFLYLAH AVH+GN Y P  A D++VAKF +I +  RR +AG++S+
Sbjct: 193 EAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRRRFAGILSK 252

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LDESVG V+ ALR+  MLENS+++F  DNG    G + N  SN PLRG K+T W+GG+RG
Sbjct: 253 LDESVGLVVEALRRRKMLENSVIVFSTDNGGAPAGFNLNAASNWPLRGTKNTLWEGGVRG 312

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
           V  +WSP L +  +V  +L HISDWLPTL  AAG + ++ ++DG++ W  L++  ++ R 
Sbjct: 313 VGLLWSPKLIKPGRVGRQLIHISDWLPTLLTAAGGDPSNLTIDGMDIWHALSEDTESPRK 372

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
            +LHNID++       A +   D K++ G+  NGQ D WYG         S +E +Y   
Sbjct: 373 MVLHNIDDIYG----VAGITYGDWKFIQGSTYNGQWDGWYGP--------SGREWVYDVG 420

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCN 498
           G+  +A               +A +S  L   L+ + I +LRE A ++C   N     C 
Sbjct: 421 GVIGSAAG-------------RAVASVGLS--LSADNIRKLREDAMIKCPPRNDNLPLCK 465

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
               PCLFN+  DPCE NNL E    ++  L+++L +  ++ V PGN P+DKRA+P  W+
Sbjct: 466 PLEAPCLFNVRQDPCEDNNLIEKYPSVVSDLKERLRVLNASAVLPGNLPWDKRANPDYWD 525

Query: 559 NIWVPWYDELD 569
           + W+ + D L+
Sbjct: 526 HTWINFGDYLN 536


>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
          Length = 881

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 262/570 (45%), Positives = 358/570 (62%), Gaps = 34/570 (5%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
           M   R     +   L+    F      + PHI+ I+ADD+G NDV FHGS+QIPTPNIDA
Sbjct: 5   MMGKRITIVTMVALLISGPTFAEPQTTESPHIVFIMADDMGLNDVGFHGSNQIPTPNIDA 64

Query: 61  LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
           LAY+G+ILN+HY   +CTPSR+ALMTG++P+++GMQH VI   EPWGL L EKLLPQY +
Sbjct: 65  LAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVIDSDEPWGLGLQEKLLPQYFR 124

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
           EAGY TH IGKWHLGF+ E Y PT RGFD+H GY     DY+ +  K     ++G D+R 
Sbjct: 125 EAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGYLGPYIDYWSYISKMDSATFEGYDLRQ 184

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           N+ V+ K  G Y+TD +TEAA+ +I  HN++ + M L L HLA H GN   P QAP+E +
Sbjct: 185 NLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLLVLNHLAPHTGNDDAPLQAPEETI 244

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            KF  I D +RRTYA M+S+LD+SVG V  AL++HGMLENSI++F++DNG  + G+HSN 
Sbjct: 245 EKFAYIRDTDRRTYAAMMSKLDDSVGQVYRALQQHGMLENSIIVFISDNGGVTRGMHSNS 304

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--- 356
           GSN+PLRG K +PW+G +R  A IWSP LK  Q+VS++ FHISDWLPTL +AAGI     
Sbjct: 305 GSNYPLRGQKHSPWEGAVRTAALIWSPLLKDRQRVSNQWFHISDWLPTLASAAGIPAYRG 364

Query: 357 -NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
            N + +DG++QW+ L  G    R  +++N+D +      Y +   + LKYV GT  +G +
Sbjct: 365 DNLSDIDGIDQWEALAYGTGNPRQRVMNNLDEIFG----YTSYVDNGLKYVNGTTLDGVN 420

Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDE 475
           D WYG  D+        + L S A      L+T +             +S   R    DE
Sbjct: 421 DGWYGAIDD-------TDTLPSNA----EYLRTVM-------------ASHVGRLTKPDE 456

Query: 476 KILE-LREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLA 534
           K++  LRE AR+ C  D     CN   +PCLF+I +DPCE ++L E   D ++Q+++++ 
Sbjct: 457 KVVSYLREHARIDCGNDIGNRDCNPLRRPCLFDIINDPCEMHDLFERYPDKVRQIQERVE 516

Query: 535 IYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
            Y+     P N+P D  ADP+++  +W  W
Sbjct: 517 KYRLRAAKPRNRPHDPAADPSKFGGVWNWW 546


>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
          Length = 519

 Score =  519 bits (1336), Expect = e-144,   Method: Compositional matrix adjust.
 Identities = 262/523 (50%), Positives = 349/523 (66%), Gaps = 31/523 (5%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GWNDV FHGS QIPTPNIDALAY+GL+L+++YV ++CTPSRSALMTGKYPIH GMQH ++
Sbjct: 10  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGMQHSIL 69

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EP GLPL EK+LP+YL+E GY+TH +GKWHLGF++  YTPT+RGFD+H GYW G  D
Sbjct: 70  KGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYWTGHHD 129

Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
           YYDH+  A   PY GLDMR  M+      G YSTD++T+ A+ +I  HN S+PMFLYLAH
Sbjct: 130 YYDHT--AVENPYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAH 187

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            AVH+GN Y P  APDEEVAKF +I D  RR +AG++S+LD+SVG V+ AL K+ ML++S
Sbjct: 188 AAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALYKNNMLQDS 247

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           +++F  DNG P+ G + N  SN PLRG+K+T W+GG+RG   IWSP L +  +VS ++ H
Sbjct: 248 VIVFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPRLVRAGRVSRQMIH 307

Query: 341 ISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
           I+DWLPTL  AAG    N TS+DGVN WD L++  ++ R+ ILHNID++       +A+ 
Sbjct: 308 ITDWLPTLITAAGGNASNLTSIDGVNLWDALSEDTESPRTNILHNIDDIYG----VSAIT 363

Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAAD 459
           V D K++ G+  NG  D WYG +  E   Y    V+ S AG                   
Sbjct: 364 VGDWKFIQGSTYNGVWDGWYGPSGREW-IYDADGVISSTAG------------------- 403

Query: 460 PKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNL 518
            +A +S  L   +T E I  LRE A ++C   N     C    +PCLFNI  DPCE NNL
Sbjct: 404 -RAIASVGLS--VTTEIISSLRENAMIKCPPRNDSLPICRPLQEPCLFNIYQDPCEDNNL 460

Query: 519 AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
            +    ++++L+++L  + +T + PGN P+D +ADP  W++ W
Sbjct: 461 VKQFPTIVRKLQEELKKFNNTAILPGNLPWDSKADPNLWDHTW 503


>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 532

 Score =  515 bits (1326), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 255/522 (48%), Positives = 349/522 (66%), Gaps = 30/522 (5%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GWNDV FHGS QIPTPNIDALAY+GL+L+++YV  +CTPSRSALMTGKYPIHIGMQHGV+
Sbjct: 26  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EP GLPL EK+LP+YL++ GY+TH +GKWHLGF+ + YTPT+RGF SH G+W G QD
Sbjct: 86  KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145

Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
           Y+DH+  A   PY GLDMR +M+      G YSTD++T+ A+ +I  HN S+P+FLYLAH
Sbjct: 146 YFDHT--AVESPYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAH 203

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            AVH+GN Y P  APDEEVAKF +I D  RR +AG++S+LD+SVG V+  LRK GML +S
Sbjct: 204 AAVHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGMLRDS 263

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           +++F  DNG P+ G + N  SN PLRG+K+T W+GG+RG   IWSP L ++ +VS +L H
Sbjct: 264 VIIFSTDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSGRVSRQLLH 323

Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRV 400
           I+DWLPTL  AAG + ++ ++DG++ W  L+   ++ R+ +LHNID++       +A+ V
Sbjct: 324 ITDWLPTLITAAGGDPSNLNIDGMDLWSALSDDTESPRTSVLHNIDDIYG----VSAVTV 379

Query: 401 DDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADP 460
            D K V G+  NG  D WYG +  E   Y    V+ S+AG                    
Sbjct: 380 GDWKLVQGSTYNGAWDGWYGPSGREW-VYDVGGVIGSEAG-------------------- 418

Query: 461 KANSSDALRTILTDEKILELREFARVRCNYDNKG-AHCNSTVKPCLFNITDDPCEQNNLA 519
           +A +S  L   +T E +  LRE A ++C   N     C    +PCLFN+  DPCE NNL 
Sbjct: 419 RAVASVGLG--ITAETVRMLREKAMIKCPPKNDSLPLCKPLQEPCLFNVYQDPCEDNNLV 476

Query: 520 ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
                ++++L+++L  + S+ + PGN P+D + DP+ W++ W
Sbjct: 477 RQSPTIVRKLQEELKKFNSSAIYPGNLPWDNKGDPSLWDHTW 518


>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
 gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
          Length = 675

 Score =  514 bits (1325), Expect = e-143,   Method: Compositional matrix adjust.
 Identities = 256/557 (45%), Positives = 357/557 (64%), Gaps = 38/557 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++PHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY+G+ILN+HY   +CTPSR++LMTG
Sbjct: 30  QRPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTG 89

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K PI+IGMQH VI+  EPWGL L +K++P+Y KEAGY TH +GKWHLGF  + YTPT RG
Sbjct: 90  KNPINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRG 149

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           FD+H GY     DY+D++ K  F P   +QG DMR+N+ VD  + G Y+TD +T+AA ++
Sbjct: 150 FDTHVGYLGPYVDYWDYTLK--FSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSI 207

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H+   P+FL + HLA HA N  +P QAP+E++ KF  ISD  RR YA MVS+LD+SV
Sbjct: 208 IERHDTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSV 267

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G +  +LR   ML+NSI+LFM+DNGAP+  +H+N GSN+PLRG+KS PW+   R VAAIW
Sbjct: 268 GQIFNSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIW 327

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSE 380
           SP L++ Q+VS++  HISDWLPTL +AAGI+I    + + +DG +QW+ L+      R  
Sbjct: 328 SPLLQERQRVSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRV 387

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV--LYSK 438
           +L+ ID +      Y++   +  K+V GT +NG  D WYG  +      S  +   L  +
Sbjct: 388 VLNMIDEIYG----YSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQYIDLVLQ 443

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
             IT  A +T                       ++ + I  LR+ ARV CN+  +   CN
Sbjct: 444 TEITRWAGET-----------------------ISRDTIKYLRKHARVNCNHQPEANKCN 480

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
              +PCLF+I +DPCE N+L+       ++L   +  Y+     P NKP D  A+PA + 
Sbjct: 481 PLKRPCLFDIINDPCELNDLSHKFPMKFRELRSTVQTYRRLATKPRNKPADPAANPANFG 540

Query: 559 NIWVPWYDELDKQKAIE 575
            +W  W  +  +++ I+
Sbjct: 541 GVWTWWRPDQTREELIK 557


>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
 gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
          Length = 607

 Score =  513 bits (1320), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 256/557 (45%), Positives = 357/557 (64%), Gaps = 38/557 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++PHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY+G+ILN+HY   +CTPSR++LMTG
Sbjct: 30  QRPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTG 89

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K PI+IGMQH VI+  EPWGL L +K++P+Y KEAGY TH +GKWHLGF  + YTPT RG
Sbjct: 90  KNPINIGMQHYVIVSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRG 149

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           FD+H GY     DY+D++ K  F P   +QG DMR+N+ VD  + G Y+TD +T+AA ++
Sbjct: 150 FDTHVGYLGPYVDYWDYTLK--FSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSI 207

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H+   P+FL + HLA HA N  +P QAP+E++ KF  ISD  RR YA MVS+LD+SV
Sbjct: 208 IERHDTKDPLFLVVNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSV 267

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G +  +LR   ML+NSI+LFM+DNGAP+  +H+N GSN+PLRG+KS PW+   R VAAIW
Sbjct: 268 GQIFNSLRSKNMLDNSIILFMSDNGAPTAALHANTGSNYPLRGIKSVPWEAATRCVAAIW 327

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSE 380
           SP L++ Q+VS++  HISDWLPTL +AAGI+I    + + +DG +QW+ L+      R  
Sbjct: 328 SPLLQERQRVSNQFIHISDWLPTLASAAGIDIPFSKDHSEIDGQDQWEALSYDTGNPRRV 387

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV--LYSK 438
           +L+ ID +      Y++   +  K+V GT +NG  D WYG  +      S  +   L  +
Sbjct: 388 VLNMIDEIYG----YSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQYIDLVLQ 443

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
             IT  A +T                       ++ + I  LR+ ARV CN+  +   CN
Sbjct: 444 TEITRWAGET-----------------------ISRDTIKYLRKHARVNCNHQPEANKCN 480

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
              +PCLF+I +DPCE N+L+       ++L   +  Y+     P NKP D  A+PA + 
Sbjct: 481 PLKRPCLFDIINDPCELNDLSHKFPMKFRELRSTVQTYRRLATKPRNKPADPAANPANFG 540

Query: 559 NIWVPWYDELDKQKAIE 575
            +W  W  +  +++ I+
Sbjct: 541 GVWTWWRPDQTREELIK 557


>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
          Length = 528

 Score =  512 bits (1319), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 254/530 (47%), Positives = 351/530 (66%), Gaps = 30/530 (5%)

Query: 33  IIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIH 92
           +I+L    GWNDV FHGS QIPTPNIDALAY+GL+L+++YV  +CTPSRSALMTGKYPIH
Sbjct: 13  LILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIH 72

Query: 93  IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY 152
            GMQHGV+   EP GLPL EKLLP+YL+E GY TH +GKWHLGF+++ YTPT+RGFD+H 
Sbjct: 73  TGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTHI 132

Query: 153 GYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK 212
           G+W G  DY+DH+  A   PY GLD+R  MQ      G YSTD++T+ A+ +I  HN S+
Sbjct: 133 GFWTGHHDYFDHT--AVENPYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSR 190

Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
           PMFLYLAH AVH+GN Y P  APDEEVAKF +I D  RR +AGM+S+LD+SVG V+ ALR
Sbjct: 191 PMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVVDALR 250

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           K  ML +SI++F +DNG P+ G + N  SN PLRG+K+T W+GG+RG   IWSP L +  
Sbjct: 251 KRDMLRDSIIIFSSDNGGPAAGFNLNAASNWPLRGVKNTLWEGGVRGTGLIWSPRLVRPG 310

Query: 333 KVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
           +VS ++ HI+DWLPTL  AAG + ++ ++DG++ W+ L++  ++ R+ +LHNID++    
Sbjct: 311 RVSRQMLHITDWLPTLITAAGGDSSNLTVDGMDLWNALSEDTESPRANVLHNIDDIYG-- 368

Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQI 452
              +A+ V D K + G+   G  D WYG         S +E +Y   G+           
Sbjct: 369 --VSAITVGDWKLIRGSTYKGVWDGWYGP--------SGREWVYDVDGVI---------- 408

Query: 453 KQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDD 511
                +   A++  ++   +T + +L LRE A ++C   N     C    + CLFNI  D
Sbjct: 409 -----SSTTAHAIASVGLGITTDVVLMLRENALIKCPLRNDSLPICKPLEEACLFNIYQD 463

Query: 512 PCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           PCE NNL +    ++++L+D+L  + S+ + PGN P+D + DP  W++ W
Sbjct: 464 PCEDNNLVKQFPTIVRKLQDELKKFNSSALLPGNLPWDSKGDPNLWDHTW 513


>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
 gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
          Length = 634

 Score =  511 bits (1317), Expect = e-142,   Method: Compositional matrix adjust.
 Identities = 255/566 (45%), Positives = 357/566 (63%), Gaps = 33/566 (5%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           PK+PHI++I ADD+GWNDV FHGS+QI TP+IDALAY+G+ILN+HY   +CTPSR+ALMT
Sbjct: 31  PKRPHIVLITADDMGWNDVGFHGSNQIATPHIDALAYDGVILNRHYSAPMCTPSRAALMT 90

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G++PI++GMQH VI   EPWGL L ++++PQY + AGY TH IGKWHLGFF E Y PT R
Sbjct: 91  GRHPINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHMIGKWHLGFFTEHYIPTNR 150

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GFD+H GY     DY+ +  K     ++G DMR N  V+    G Y+TD +T AA ++IA
Sbjct: 151 GFDTHIGYLGPYVDYWSYVSKMNSGTFEGYDMRQNQFVNYAANGTYATDYFTSAARDIIA 210

Query: 207 EHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           +H KS +PM L + HLA HAGN  +P QAP E + +F  I + +RRTYA M+S+LD+SVG
Sbjct: 211 QHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAYIGNRDRRTYAAMMSKLDDSVG 270

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            V  +LR +GML+NSI++F++DNG  + G+HSN GSN+P RG K +PW+G +R  A IWS
Sbjct: 271 AVYESLRANGMLDNSIIVFLSDNGGVTRGMHSNTGSNYPFRGQKHSPWEGAVRTAALIWS 330

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS---LDGVNQWDVLTKGAKTKRSEIL 382
           P LK TQ+VS++ FHISDWLPTL +AAGI + ++S   +DG++QW+ L  G    R  ++
Sbjct: 331 PLLKDTQRVSNQWFHISDWLPTLASAAGIALGNSSFSDIDGIDQWEALAYGTGNPRQRLM 390

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           +N+D +      Y +   +  KY+ GT  NG +D WYG+TD  +D+    E   S    T
Sbjct: 391 NNLDEIFG----YTSYMENGFKYINGTTLNGVNDAWYGETD-PLDQLPDDEQYLSSVMAT 445

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE-LREFARVRCNYDNKGAHCNSTV 501
             A   +L                       DEK++  LR  A V C        CN  V
Sbjct: 446 RIARAGRL-----------------------DEKLISYLRRHATVNCGNSIATKPCNPLV 482

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           KPCLF+I +DPCE +++++     L+ + ++L +Y+     P N+P D  ADP+++  +W
Sbjct: 483 KPCLFDIINDPCEMHDISDEYPKRLRGIRNRLELYRLRAAKPRNRPHDPEADPSKFGGVW 542

Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
             W  +  K  +++       L  PT
Sbjct: 543 NWWRTDPTKAASLQDEGDDSALPPPT 568


>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 545

 Score =  508 bits (1308), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 266/556 (47%), Positives = 353/556 (63%), Gaps = 34/556 (6%)

Query: 11  LTCTLLFNDAFLNTTAPKK--PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           L   LL    ++   AP+   PHII I+ADDLGWNDV FHGS QIPTPNIDALAY+GLIL
Sbjct: 7   LFSYLLAARNWIVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGLIL 66

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           N++YV  +CTPSRSALMTGKYPIH GMQ GV+   EP GLPL EKLLP+YL+E GY TH 
Sbjct: 67  NRYYVSPICTPSRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRTHI 126

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +GKWHLGF+ + YTPT+RGF+SH GYW G QDYYDHS  A  EPY G+DMR NM+     
Sbjct: 127 VGKWHLGFYTKEYTPTYRGFESHLGYWTGHQDYYDHS--AVEEPYWGMDMRRNMEPAWDL 184

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
            G YSTD++T+ A+ +I  HN S+PMFLYLAH AVH+ N Y P    DE V K + I + 
Sbjct: 185 HGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDK-IKIPEY 243

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           +RR +AGM S +D SVG V+ AL K  ML + +++F  DNG P+ G + N  SN PLRG+
Sbjct: 244 KRRRFAGMASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFNLNAASNWPLRGV 303

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQW 367
           K T ++GG+RGV  +WS  L +  ++S   FHI+DWLPTL  AAG  +N T  +DG++ W
Sbjct: 304 KDTLFEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAGGYVNATGKIDGLDAW 363

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
           + L    ++ R  +LHNID+V       AA+ VD  K + G+   G+ ++WYG +  +  
Sbjct: 364 EALADDKESPRRVVLHNIDDVLGT----AAISVDQWKLIKGSTYRGRWNQWYGPSGRDY- 418

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
            Y  + V  ++  +   ALK                    L   L   K++ELR  A + 
Sbjct: 419 PYDVEAV--AEGCLVAKALKR-------------------LGRGLDRGKLVELRNQATIH 457

Query: 488 CNYDNKGAH--CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           C    +     CN     CL ++  DPCE+ NLA+ Q D++++LE++LA   ++ VPPGN
Sbjct: 458 CGEQPQQPQKACNPLEAACLIDVRADPCERRNLADEQPDVVRRLEEELARLNASAVPPGN 517

Query: 546 KPFDKRADPARWNNIW 561
            P+D +ADPA W++ W
Sbjct: 518 LPWDSQADPALWDHTW 533


>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 657

 Score =  503 bits (1296), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 251/549 (45%), Positives = 339/549 (61%), Gaps = 48/549 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KPHI+II+ADD+GWNDV FHGS+QIPTPNIDALAY G+ILN+HY   +CTPSR+A+MTG
Sbjct: 29  QKPHIVIIMADDMGWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTG 88

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           + PI +GMQH VI   EPWGL L +K++P+Y +EAGY TH +GKWHLGFF + YTPT RG
Sbjct: 89  RNPISVGMQHYVIDSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRG 148

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDSH  Y                    G DMRHN+ VD    G Y+TD +T+AA  +I +
Sbjct: 149 FDSHTNY-------------------TGYDMRHNLAVDYDANGTYATDHFTDAASRIIDK 189

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           HN S+P+FL + HLA H GN  +P QAP+E + KF  ISD  RR YA MVS+LD+SVG +
Sbjct: 190 HNPSEPLFLMVNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSVGAI 249

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
             +LR   M++NSI+LF++DNGAPSFG+HSN GSN PLRG K++PW+G  R VAAIWSP 
Sbjct: 250 FKSLRAKNMIQNSIILFISDNGAPSFGLHSNSGSNFPLRGQKNSPWEGATRNVAAIWSPL 309

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           L++ Q+VS++  HISDWLPT  + AGI +    N + +DGV+QW+ L+      R  IL+
Sbjct: 310 LEERQRVSNQYIHISDWLPTFASIAGIRLPQARNSSEVDGVDQWEALSYDTGYPRRVILN 369

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
            ID +      Y++   +  KYV GT   G  D WYG  +NE    S +E          
Sbjct: 370 MIDEIFG----YSSYMENGFKYVNGTTGGGNFDGWYGQLNNEDPHPSDEEY--------- 416

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
             +KT L+ +    +            +L+ E I  LR+ +RV C+       CN   + 
Sbjct: 417 --IKTVLETEIAQLSG----------DVLSSELIKHLRKHSRVVCHKPLHPTTCNPLKRA 464

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           CLF+I +DPCE N+++  +    ++L  K+   +     P NKP D   +PA    +W  
Sbjct: 465 CLFDIVNDPCELNDISAREPFKFRELRTKVETLRRLAKKPRNKPHDPAGNPANHGGVWTW 524

Query: 564 WYDELDKQK 572
           W  ++  Q+
Sbjct: 525 WRKDISAQE 533


>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 567

 Score =  501 bits (1291), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 255/562 (45%), Positives = 351/562 (62%), Gaps = 35/562 (6%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L  T   KP+I+ ILADDLGWND+SFHGS +IPTPNIDALA+NG++LN  Y Q +CTPSR
Sbjct: 20  LARTQSSKPNIVFILADDLGWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSR 79

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
            ALMTGKYPI +GMQ       EP GLPL+EKLLP+YL+E GY T AIGKWHLGF+++ Y
Sbjct: 80  VALMTGKYPIKLGMQGPPTYGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAY 139

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
           TPT RGFDSH+GY+ G   YYD+  +     F  +QG DMR N  +    +G Y+TD++T
Sbjct: 140 TPTRRGFDSHFGYYTGYVSYYDYLLQDVYQNFGEFQGFDMRRNDTIAWDVVGKYATDVFT 199

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           + A+ +I E   ++P+F+YLAH+AVH GN  +  +AP  EV KF  I DP RR YA MVS
Sbjct: 200 DEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQSEVNKFNHILDPNRRIYAAMVS 259

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHSNKGSNHPLRG 307
           +LDESVG V+ AL +  ML+N+I++FM+DNG+PSF           G+ +N GSN P RG
Sbjct: 260 KLDESVGRVVEALTEKKMLQNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFPYRG 319

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVN 365
           +K+T W+GG++  + IW+P+ ++  +VS +L HI+DWLPTL +AAG        +LDGV+
Sbjct: 320 IKNTLWEGGVKSASFIWAPYFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLDGVD 379

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
           QW  LT    ++R+ +L NID     ++ YA +  D+ K + G+  NG  D ++G T   
Sbjct: 380 QWTSLTLNLPSQRNFVLLNIDE----KQRYAGILKDNWKLIVGSTANGTLDGFFGATRPR 435

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
              Y+   VLYS AG     L   L             ++     + +  ++L +R  + 
Sbjct: 436 T-PYNATAVLYSPAGRALTKLSDSLPF-----------ATAGFGPMSSVRELLAMRYEST 483

Query: 486 VRCNYDNKGAHCNSTV---KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
           +RCN    G    S     + CLF++  DPCE NN+A+    +  QL + L  Y+  +VP
Sbjct: 484 IRCNPSAAGPRTQSPCPAGEACLFDLVADPCETNNVAKKYVAVSGQLYEALKYYRRLLVP 543

Query: 543 PGNKPFDKRADPARWNNIWVPW 564
             N+PFD  A+PAR+NN W  W
Sbjct: 544 QTNRPFDPAANPARFNNTWSTW 565


>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
 gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
          Length = 579

 Score =  501 bits (1290), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 266/593 (44%), Positives = 362/593 (61%), Gaps = 37/593 (6%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L  + +F+ AF  +    KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+
Sbjct: 10  LVVSTIFSLAF-GSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNK 68

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           HYV  LCTPSR+ L+TGKYPIH GMQH VI+  EPWGLP  E+L+P+  ++AGY+TH +G
Sbjct: 69  HYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVG 128

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTI 189
           KWHLGF+R+  TPT RGFD H+GY+ G  DYYDH  +     Y  GLD R +++   +  
Sbjct: 129 KWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEAN 188

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G Y+T+ +T  A  +I +H+KSKP+F+ L+HLAVH GN   P QAP+EEVAKF  I DP+
Sbjct: 189 GTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPK 248

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           RRTYAGMVS LD+SV   I AL+ +GML NSI+L  +DNGAP+ GIHSN GSN+P RG K
Sbjct: 249 RRTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQK 308

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
            +PW+GG+R   A+WSP LK+   VS++  H  DWLPTL  AAG+ +  D  LDG+N W 
Sbjct: 309 ESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWP 368

Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEID 427
           +L+   + K   ++H +D V      Y++   D LKYV G+   G+ D+W G+ + NE D
Sbjct: 369 MLSGNEEPKPRTMIHVLDEVFG----YSSYMRDTLKYVNGSSFEGRYDQWLGELETNEDD 424

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
             S                    +  ++H       S    R  LT+++I ++R  A   
Sbjct: 425 PLS--------------------ESYEQHVLASDVQSLLGNRG-LTEDRIRQMRSEATET 463

Query: 488 C------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
           C      N       C     PC F++  DPCE+ NLA+     L+QL D++   + T +
Sbjct: 464 CPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQLQQLADEVEQIRKTAI 523

Query: 542 PPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
           P    P  D RA+P   N  W  W++  D Q +  T  ++  L   + I L+Y
Sbjct: 524 PSARVPHSDSRANPTFHNGNW-EWWNNTDTQSSSGTFSINLWLAMGSIIGLVY 575


>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
 gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
          Length = 550

 Score =  500 bits (1288), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 260/559 (46%), Positives = 351/559 (62%), Gaps = 43/559 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           T   KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+ YV  LCTPSR+ L
Sbjct: 22  TPSTKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRATL 81

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TGKYPIH GMQH VI+  EPWGLP  E+L+P++ ++AGY+T  IGKWHLGF+ +  TPT
Sbjct: 82  LTGKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLTPT 141

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD HYGY+ G  DYYDH+     + Y +G+D R ++    +  G Y+TD +T  A  
Sbjct: 142 MRGFDHHYGYYNGYIDYYDHTLHMLTKNYTKGVDFRRDLDPCPQDNGTYATDAFTAEAKR 201

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           VI +H+KS+P+F+ L+HLAVH GN   P QAP+EEVAKF  I+DP+RRTYAGM+S LD+S
Sbjct: 202 VIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGMISSLDKS 261

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VG  + AL  +GML NSI+L  +DNGAP+ GIHSN+GSN+P RG K +PW+GG+R   AI
Sbjct: 262 VGQTVKALSDNGMLNNSIILLYSDNGAPTVGIHSNQGSNYPFRGQKESPWEGGIRSAGAI 321

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
           WSP L+Q   VS+++ H  DWLPTL AAA +E+  +  LDGVN W  L+   + +   +L
Sbjct: 322 WSPLLQQRGYVSNQIIHAIDWLPTLAAAANVELPENLKLDGVNMWPSLSGNIEPQPRRLL 381

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD----TDNEIDKYSPKEVLYSK 438
           H +D+V      Y++   D+LKYV G+   GQ D+W GD     D+ +  Y  ++VL S+
Sbjct: 382 HVMDDVFG----YSSYMRDNLKYVNGSSFEGQYDQWLGDLESGQDDPLSPYYAQQVLSSE 437

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-- 496
                 +                          +T+E+I ++RE A  RC   N+  H  
Sbjct: 438 VHAVIGSQS------------------------VTNERIEQMREEATHRCPL-NEQDHLQ 472

Query: 497 ----CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKR 551
               C     PC FN+  DPCE+ NLA      L+ L  ++  ++   VP    P  D+R
Sbjct: 473 EIYLCEPLKAPCFFNLAKDPCERFNLANLYPLQLQFLSQEVEQFRQNAVPSARVPISDER 532

Query: 552 ADPARWNNIWVPWYDELDK 570
           A+P      W  W++E  K
Sbjct: 533 ANPKLHGGYW-EWWNETAK 550


>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
 gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
          Length = 579

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 265/593 (44%), Positives = 362/593 (61%), Gaps = 37/593 (6%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L  + +F+ AF  +    KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+
Sbjct: 10  LVVSTIFSLAF-GSGYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNK 68

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           HYV  LCTPSR+ L+TGKYPI+ GMQH VI+  EPWGLP  E+L+P+  ++AGY+TH +G
Sbjct: 69  HYVPNLCTPSRATLLTGKYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVG 128

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTI 189
           KWHLGF+R+  TPT RGFD H+GY+ G  DYYDH  +     Y  GLD R +++   +  
Sbjct: 129 KWHLGFWRKDLTPTMRGFDHHFGYYNGYIDYYDHQVRLLDRNYSAGLDFRRDLEPCPEAN 188

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G Y+T+ +T  A  +I +H+KSKP+F+ L+HLAVH GN   P QAP+EEVAKF  I DP+
Sbjct: 189 GTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPK 248

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           RRTYAGMVS LD+SV   I AL+ +GML NSI+L  +DNGAP+ GIHSN GSN+P RG K
Sbjct: 249 RRTYAGMVSSLDKSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQK 308

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
            +PW+GG+R   A+WSP LK+   VS++  H  DWLPTL  AAG+ +  D  LDG+N W 
Sbjct: 309 ESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWP 368

Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEID 427
           +L+   + K   ++H +D V      Y++   D LKYV G+   G+ D+W G+ + NE D
Sbjct: 369 MLSGNEEPKPRTMIHVLDEVFG----YSSYMKDTLKYVNGSSFEGRYDQWLGELETNEDD 424

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
             S                    +  +KH       S    R  LT+++I ++R  A   
Sbjct: 425 PLS--------------------ESYEKHVLASDVQSLLGNRG-LTEDRIRQMRSEATET 463

Query: 488 C------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
           C      N       C     PC F++  DPCE+ NLA+     L++L D++   + T +
Sbjct: 464 CPPVEGQNPLEPHFKCEPLKAPCFFDLAKDPCERYNLAQIYPLQLQELADEVEQIRKTAI 523

Query: 542 PPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
           P    P  D RA+P   N  W  W++  D Q +  T  ++  L   + I L+Y
Sbjct: 524 PSARVPHSDSRANPTFHNANW-EWWNNTDTQSSSGTFSINLWLAMVSIIGLVY 575


>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
 gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
          Length = 579

 Score =  498 bits (1283), Expect = e-138,   Method: Compositional matrix adjust.
 Identities = 261/575 (45%), Positives = 351/575 (61%), Gaps = 36/575 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+HYV  LCTPSR+ L+TGK
Sbjct: 27  KPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGK 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH GMQH VI+  EPWGLP  E+L+P+  ++AGY+TH +GKWHLGF+R+  TPT RGF
Sbjct: 87  YPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGF 146

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D H+GY+ G  DYYDH  +     Y  GLD R +++   +  G Y+T+ +T  A  +I +
Sbjct: 147 DHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQ 206

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+KSKP+F+ L+HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SV   
Sbjct: 207 HDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQT 266

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           I AL+ +GML NSI+L  +DNGAP+ GIHSN GSN+P RG K +PW+GG+R   A+WSP 
Sbjct: 267 IGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPL 326

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           LK+   VS++  H  DWLPTL  AAG+ +  D  LDG+N W +L+   + K   ++H +D
Sbjct: 327 LKERGYVSNQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLD 386

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGITFNA 445
            V      Y++   D LKYV G+   G+ D+W G+ + NE D                  
Sbjct: 387 EVFG----YSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG--------------- 427

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAHCNS 499
                +  ++H       S    R  LT ++I ++R  A   C      N       C  
Sbjct: 428 -----ESYEQHVLASDVQSLLGNRG-LTKDRIRQMRSEATETCPPIEGQNPLESHFKCEP 481

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWN 558
              PC F++  DPCE+ NLA+     L+QL D+L   + T +P    P  D RA+P   N
Sbjct: 482 LKAPCFFDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHN 541

Query: 559 NIWVPWYDELDKQKAIETMRLHKPLLSPTTITLIY 593
             W  W++  D Q    T  ++  L   + I L+Y
Sbjct: 542 GNW-EWWNNTDTQSGSGTFSINLWLAMGSIIGLVY 575


>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
 gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
          Length = 542

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 258/564 (45%), Positives = 353/564 (62%), Gaps = 36/564 (6%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
           +G NDVSFHGS+QI TPNIDALAYNG++LN+HYV  LCTPSR+ L+TGKYPIH GMQH V
Sbjct: 1   MGMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWV 60

Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
           I+  EPWGLP  E+L+P+  +EAGY+TH +GKWHLGF+R+  TPT RGFD HYGY+ G  
Sbjct: 61  IITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYI 120

Query: 160 DYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
           DYYDH  +     Y  GLD R + + + K  G Y+T+ +T  A  +I EH+KSKP+F+ L
Sbjct: 121 DYYDHQVRLLGTNYSAGLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVL 180

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           +HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SVG  + AL+ + ML 
Sbjct: 181 SHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDNDMLN 240

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           N+IVL  +DNGAP+ GIHSN GSN+P RG K +PW+GG+R V A+WSP L++   VS++ 
Sbjct: 241 NTIVLLYSDNGAPTVGIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLEKQGYVSNQA 300

Query: 339 FHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAA 397
            H  DWLPTL AAAG+ +  D  LDG N W  L++ A+ K  + +H +D V      Y++
Sbjct: 301 IHAIDWLPTLAAAAGVSLPQDLELDGENLWPSLSEKAEPKPRKFIHVLDEVFG----YSS 356

Query: 398 LRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQ-IKQKH 456
              D LKYV G+  +GQ D W GD   E ++  P   +Y +       L +++Q +   H
Sbjct: 357 YMSDTLKYVNGSSFDGQYDYWLGDL--EANEDDPLSGIYGE-----QVLASEVQSVLGSH 409

Query: 457 AADPKANSSDALRTILTDEKILELREFARVRC----NYDNKGAH--CNSTVKPCLFNITD 510
                          LT+++IL++R  A  +C      D    H  C   V PC F++  
Sbjct: 410 G--------------LTEDRILQMRSEATQKCPAIEGQDPLQPHYRCEPLVAPCFFDLAQ 455

Query: 511 DPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWNNIWVPWYDELD 569
           DPCE+ NLA      L++L  ++  ++ T +P    PF D+RADP   N  W  W++  +
Sbjct: 456 DPCERYNLARLYPLQLEKLRQEVEEFRQTAIPSARVPFSDERADPIFHNGNW-EWWNNTE 514

Query: 570 KQKAIETMRLHKPLLSPTTITLIY 593
              +  T   +  ++  + + L+Y
Sbjct: 515 SDSSAGTCSRNLWIILGSIVALVY 538


>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 610

 Score =  493 bits (1268), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 263/554 (47%), Positives = 352/554 (63%), Gaps = 32/554 (5%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+II ILADDLGWND+SFHGS QI TPN+DALAYNG+ILN  YV  +CTPSR+ALMTG
Sbjct: 39  KQPNIIFILADDLGWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTG 98

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YPIH GMQ   +   EP  LP   KLLP YLK+ GY T  +GKWHLG++++ +TPT+RG
Sbjct: 99  MYPIHNGMQGLPLEASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRG 157

Query: 148 FDSHYGYWQGLQDYYDHSC-----KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           FDSH GYW G+  YYD+       +       G DMR N+       G Y+T+++T+ A+
Sbjct: 158 FDSHLGYWNGIVSYYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAV 217

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           ++I  HNK+ P+FLY++HLAVHAGN  +  +AP E + KFL I+DP RRT+AGMV +LDE
Sbjct: 218 HLIRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDE 277

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG +IAAL +  ML NSI++F ADNGAP+ G + N GSN+PLRG+K T W+GG RGV+ 
Sbjct: 278 SVGRLIAALEESNMLHNSIIVFSADNGAPTIGGYPNWGSNYPLRGIKETLWEGGTRGVSF 337

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEI 381
           IWSP +K T +VS+EL HISDWLPTL AAAG  + N  +LDGVNQW  +++   ++R+EI
Sbjct: 338 IWSPLIKNTPRVSNELIHISDWLPTLYAAAGGNMENVKNLDGVNQWPSISENLPSERNEI 397

Query: 382 LHNIDNVDNPQKYYAALRVDDLKY----VAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           L NID +       AA+R++  KY    V GT NNG  DE++G+  N+  + +P      
Sbjct: 398 LLNIDEILQT----AAIRMNSRKYHWKLVVGTVNNGTYDEYFGEGTNKFPENNPP----- 448

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH- 496
                FNA+       Q +    +      L    + EK+  LR  A +RC+Y     + 
Sbjct: 449 ---YDFNAVAN----SQVYRTLERLAQKRLLTYPTSKEKMKVLRRDATIRCDYKIINPNR 501

Query: 497 CNSTVKP---CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR-A 552
           CN +  P   CL++I  DPCE NNL     ++   L   L  ++ ++VP  +K   +  A
Sbjct: 502 CNPSDGPNQVCLYDIKRDPCELNNLTPFYPNVASYLFRALVQHRESLVPQESKELQQDLA 561

Query: 553 DPARWNNIWVPWYD 566
           DP  +N  W PW D
Sbjct: 562 DPVNFNGTWTPWID 575


>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
 gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
          Length = 577

 Score =  491 bits (1265), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 259/557 (46%), Positives = 344/557 (61%), Gaps = 34/557 (6%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+HYV  LCTPSR+ L+
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKYPIH GMQH VI+  EPWGLP  E+L+P+  ++AGYATH +GKWHLGF+R+  TPT 
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFD H+GY+ G  DYYD + +     Y  GLD R +++   +  G Y+T+ +T  A  V
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H+K++P+F+ L+HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SV
Sbjct: 204 IERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G  + AL  +GML NSIVL  +DNGAP+ GIHSN GSN+P RG K +PW+GG+R V A+W
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRGQKESPWEGGIRSVGALW 323

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           SP L+Q   VS++  H  DWLPTL AAAG+ + +D  LDG+N W  L+  A+ +   +LH
Sbjct: 324 SPLLQQRGYVSNQAIHAIDWLPTLAAAAGVSLPSDLRLDGLNLWPSLSASAQPQPRNLLH 383

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGIT 442
            +D+V      Y++   D LKYV G+   G+ D W G+  D E D  S       +A   
Sbjct: 384 VLDDVFG----YSSYTQDTLKYVNGSSFEGRYDHWLGELEDGEEDPNSRDYGQLVRASEV 439

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-----C 497
              L +                        TD++I ++R+ A  RC    +        C
Sbjct: 440 QAVLGS---------------------VAPTDDRIRQMRDEATQRCPIQERDPEDPVYLC 478

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP-FDKRADPAR 556
                 CLF++ +DPCE+ NLA      L+ L  ++   + T +P    P  D RADPA 
Sbjct: 479 EPLKAACLFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALPSARVPRADVRADPAL 538

Query: 557 WNNIWVPWYDELDKQKA 573
            +  W  W + L    A
Sbjct: 539 HDGHWEWWNNTLPGSSA 555


>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
 gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
 gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
          Length = 562

 Score =  490 bits (1261), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 268/570 (47%), Positives = 350/570 (61%), Gaps = 38/570 (6%)

Query: 7   YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +F  L C LL   DA     +P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2   WFLWLICLLLPIIDAAEVEKSPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL EK+LPQYL E GY 
Sbjct: 62  IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H  GKWHLG ++  YTP +RGF SH G+W G QDY DH+  A      GLDMR+  QV 
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQDYNDHT--AVENNQWGLDMRNGTQVA 179

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
               G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P   PD +V K   
Sbjct: 180 YDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 239

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I + +RR +A MVS++D SVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+P
Sbjct: 240 IPNYKRRKFAAMVSKMDNSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 299

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
           L+G+K+T W+GG+R    +WSP LK++Q+VS++  HI DWLPTL  AAG +   ++L   
Sbjct: 300 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGQPALSNLSKQ 359

Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
            DG + W  L +   + R  +LHNID++       AAL V D K V GT+  G  D WYG
Sbjct: 360 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 415

Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
                  + Y  + V  S+AG    ALK               + +D  R          
Sbjct: 416 PAGERDPRLYDWQLVGRSRAGKALEALKM------------LPSRADQQR---------- 453

Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
           +R  A V C    ++G  C +T    PCLF+I DDPCEQ NLA+   +++  L  +L  +
Sbjct: 454 IRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERF 513

Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
            +T VPP NKP D RADP  WN  W  + D
Sbjct: 514 NATAVPPSNKPADPRADPRFWNYTWTNFGD 543


>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
 gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
          Length = 577

 Score =  489 bits (1259), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 252/562 (44%), Positives = 343/562 (61%), Gaps = 42/562 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG++LN+HYV  LCTPSR+ L+TGK
Sbjct: 27  KPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGK 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH GMQH VI+  EPWGLP  E+L+P+  ++AGY+TH +GKWHLGF+R+  TPT RGF
Sbjct: 87  YPIHTGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGF 146

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D H+GY+ G  DYYDH  +     Y  GLD R +++   +  G Y+T+ +T  A  +I +
Sbjct: 147 DHHFGYYNGYIDYYDHQVRMLDRNYSHGLDFRRDLEPCPEANGTYATEAFTSEAKRIIEQ 206

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+KSKP+F+ ++HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SV   
Sbjct: 207 HDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQT 266

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           I AL+ + ML NSI+L  +DNGAP+ GIHSN GSN+P RG K +PW+GG+R   A+WSP 
Sbjct: 267 IGALKDNDMLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPL 326

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           LK+   VS++  H  DWLPTL  AAG+ +  D  LDG+N W +L+   + K   ++H +D
Sbjct: 327 LKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLD 386

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
            V      Y++   D LKY+ G+   G+ D W G+                        L
Sbjct: 387 EVFG----YSSYMKDTLKYINGSSFQGRYDHWLGE------------------------L 418

Query: 447 KTKLQIKQKHAADPKANSSDALRTI----LTDEKILELREFARVRC------NYDNKGAH 496
           KT        + +    +SD    +    LT+++I ++R  A   C      N       
Sbjct: 419 KTNEDHPLSESYEQHVLASDVQSILGNRWLTEDRIRQMRSEATEMCPPVEGLNPLEPQFK 478

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPA 555
           C     PC F++  DPCE+ NLAE+    L+QL D++   + T +P    P+ D RA+P 
Sbjct: 479 CEPLKAPCFFDLAKDPCERYNLAETYPLQLQQLADEVEQIRKTAIPSARVPYSDSRANPT 538

Query: 556 RWNNIWVPWYDELDKQKAIETM 577
             N  W  W++  D   +  T 
Sbjct: 539 FHNGNW-EWWNNTDTDSSSGTF 559


>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
 gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
          Length = 578

 Score =  488 bits (1256), Expect = e-135,   Method: Compositional matrix adjust.
 Identities = 255/583 (43%), Positives = 352/583 (60%), Gaps = 34/583 (5%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K+   L  + + + A + +    KP+I+IIL DD+G NDVSFHGS+QI TPNIDALAYNG
Sbjct: 5   KFHILLVVSAIVSLA-VGSAYSTKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNG 63

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           ++LN+HYV  LCTPSR+ L+TGKYPIH GMQH VI+  EPWGLP  E+L+P+  +EAGY+
Sbjct: 64  ILLNKHYVPNLCTPSRATLLTGKYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYS 123

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQV 184
           TH +GKWHLGF+ +  TPT RGFD H+GY+ G  DYYDH  +     Y  GLD R +++ 
Sbjct: 124 THLVGKWHLGFWHKDLTPTRRGFDHHFGYYNGYIDYYDHQVRMLDRNYSAGLDFRRDLEP 183

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
             +  G Y+T+ +T  A  +I +H+KSKP+F+ ++HLAVH GN   P QAP+EEVAKF  
Sbjct: 184 CPEANGTYATEAFTAEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPH 243

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I DP+RRTYAGM+S LD+SV   + AL+ +GML NSI+L  +DNGAP+ GIHSN GSN+P
Sbjct: 244 IRDPKRRTYAGMISSLDKSVAQTVGALKDNGMLNNSIILLYSDNGAPTIGIHSNAGSNYP 303

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
            RG K +PW+GG+R   A+WSP LK+   VS++  H  DWLPTL  AAG+ +  D  LDG
Sbjct: 304 YRGQKESPWEGGIRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAGVSLPQDLPLDG 363

Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           +N W +L+   + K   ++H +D V      Y++   D LKYV G+   G+ D+W G+ +
Sbjct: 364 INLWPLLSGNEQPKPRTMIHVLDEVFG----YSSYMKDTLKYVNGSSFQGRYDQWLGELE 419

Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
              D   P    Y                 ++H       S    R  LT+++I ++R  
Sbjct: 420 TSAD--DPLSESY-----------------EQHVLASDVQSILGNRG-LTEDRIRQMRSE 459

Query: 484 ARVRC------NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
           A   C      N       C     PC F++ +DPCE+ NLA +    L+QL D++ + +
Sbjct: 460 ATELCPPVEGQNPLEPHFKCEPLKAPCFFDLANDPCERYNLARTYPLQLQQLADEVVLIR 519

Query: 538 STMVPPGNKPF-DKRADPARWNNIWVPWYDELDKQKAIETMRL 579
            T +        D RA+PA  N  W  W +      A  ++ L
Sbjct: 520 KTAISSARVAHSDSRANPAYHNGNWEWWNNTPSSSSATFSINL 562


>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
 gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
          Length = 564

 Score =  483 bits (1242), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 258/545 (47%), Positives = 332/545 (60%), Gaps = 36/545 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II ILADDLGWNDV FHGS+QIPTPN+DALAY+G+ILN++YV  +CTPSRSALMTGK
Sbjct: 38  RPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVNPICTPSRSALMTGK 97

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH GMQH V+   EP GLPL+EKLLPQYLK+ GY+ H +GKWHLG ++  +TP  RGF
Sbjct: 98  YPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSNHIVGKWHLGHYQLRFTPMQRGF 157

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DSH G+W G     DH+  A    + GLDMR    V     G Y+T +    AI ++  H
Sbjct: 158 DSHTGFWTGHHHMNDHT--AVEHGHWGLDMRRGYDVAYDLHGQYTTHVLGAEAIAIVQGH 215

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           NKS P+FLY+AH AVH+ N Y+   APDE VA    I +  RR +A M+  LD +VG+++
Sbjct: 216 NKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLGHIENYRRRKFAAMMVELDRTVGSLV 275

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            AL    MLEN+I++F +DNG P+ G + N  SN PLRG+K+T W+GG+R    IWSP L
Sbjct: 276 DALHARDMLENTIIVFSSDNGGPADGFNDNAASNWPLRGVKNTLWEGGLRAAGFIWSPLL 335

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           +   +VS ++  + DWLPTL  AAG  ++   T LDG++ W  L+ GA T+R EILHNID
Sbjct: 336 QNVSRVSHQMVQVCDWLPTLYEAAGGNVSALPTDLDGISVWHELSSGAPTRRIEILHNID 395

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
           ++       AAL V + K V G+  N   D WYG                  AGI     
Sbjct: 396 DIWGT----AALTVGNWKLVKGSHYNRTWDGWYG-----------------PAGIR---- 430

Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH----CNSTVK 502
             K     K A  P     + L  + T E+I +LR  A V C     GAH    C+   +
Sbjct: 431 DEKAYALDKLANSPAGKVMEELSLLPTPERITQLRREATVSC---GAGAHMASECDPLER 487

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
           PCL+++  DPCE NNLAE    +L+ L  +LA Y ST VPP N   D R +P RWN  W 
Sbjct: 488 PCLYDVESDPCEYNNLAEEHLHILQSLLARLADYNSTAVPPSNLEDDPRGEPQRWNFTWH 547

Query: 563 PWYDE 567
            + DE
Sbjct: 548 NFGDE 552


>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 552

 Score =  481 bits (1239), Expect = e-133,   Method: Compositional matrix adjust.
 Identities = 253/565 (44%), Positives = 339/565 (60%), Gaps = 32/565 (5%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
              +   +L     L T++   P+II ILADDLGWNDV FHGS+QIPTPN+DALAY+G+I
Sbjct: 3   LVVIAVLILAFTIVLVTSSKPPPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGII 62

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LN++YV  +CTPSR+ALMTG+YPIH GMQH V+   EP GLPL EKLLP+YL+E GY  H
Sbjct: 63  LNRYYVTPICTPSRAALMTGRYPIHTGMQHAVLYGMEPRGLPLEEKLLPEYLRELGYKNH 122

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLG +   YTP  RGFDSH G+W G    +DHS   T     GLDMR    V   
Sbjct: 123 IVGKWHLGHYTRRYTPLERGFDSHVGFWTGHHHMFDHSAVET--ETWGLDMRRGYDVAYD 180

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G Y+T +  + A+  I  H+   P+FLY+AH AVH+ N Y+   APD  VA    +  
Sbjct: 181 LHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDVTVAGLEHVEP 240

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR +A M+S LDESVG ++ AL+  GML+N+I++F +DNG P+ G +SN  SN PLRG
Sbjct: 241 YPRRKFAAMLSELDESVGAIVEALKIKGMLDNTIIVFSSDNGGPAEGFNSNAASNWPLRG 300

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVN 365
           +K+T W+GG+R    IWSP + ++++VS ++ HISDWLPTL  AAG  +E    +LDG+N
Sbjct: 301 VKNTLWEGGVRAAGFIWSPLIAESRRVSQQMIHISDWLPTLLDAAGYNVETLPKTLDGIN 360

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DN 424
            W  L  G  T+R EILHNID++       AAL V++ K + GT+  GQ D WYG   D 
Sbjct: 361 VWKSLLNGNSTERREILHNIDDIWGS----AALTVNNWKILKGTNYKGQWDNWYGPPGDR 416

Query: 425 EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
           + + Y+   V +                       P   +   +  + T+  I  LR  A
Sbjct: 417 DPEAYNITAVQHC----------------------PTGKALSEMNLLPTEATIRNLRLNA 454

Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
            V C+ + + A CN   +PCL+++  DPCE +NLAE    +L  +  KL  Y +T VPPG
Sbjct: 455 TVHCSKEVR-ATCNPLEQPCLYDVLQDPCELDNLAERFPAILDSMLKKLDAYNATAVPPG 513

Query: 545 NKPFDKRADPARWNNIWVPWYDELD 569
           N   D R DP  W   W  + D+L+
Sbjct: 514 NLALDTRGDPRFWGYTWHNFGDDLE 538


>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
 gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
          Length = 533

 Score =  476 bits (1226), Expect = e-131,   Method: Compositional matrix adjust.
 Identities = 258/550 (46%), Positives = 349/550 (63%), Gaps = 38/550 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHII I+ADDLGWNDV +HGS +IPTPNIDALAYNG+ILN++YV  +CTPSRSALMTG
Sbjct: 8   KPPHIIFIMADDLGWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTG 67

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++PIH GMQH V+   E  GLPLTEKLLP+YL++ GY+TH +GKWHLGF+++ YTP +RG
Sbjct: 68  RHPIHNGMQHRVLFGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRG 127

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           F+SH G+W G QDYYDH+ +   E   GLDMRH M+      G YST +YT  ++ +I  
Sbjct: 128 FESHIGFWTGHQDYYDHTAEE--ERLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKN 185

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           +N +KP+FLY+AH AVH+GN Y P  APD+ V K   I +  RR YA MVS LD SVGN+
Sbjct: 186 YNSTKPLFLYVAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNI 245

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           I+AL+   ML NSIV+F  DNG P+ G++ N  SN P+RG+K+T W+GG+RGV  +WS  
Sbjct: 246 ISALKDANMLSNSIVIFSTDNGGPANGLNYNYASNWPMRGVKNTLWEGGVRGVGVVWSSK 305

Query: 328 LKQTQ--KVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEILH 383
            K     +VS+ + HI DWLPTL AAAG ++ D   +LDG N W+ L     + R+EILH
Sbjct: 306 FKPNSGGRVSNRMMHICDWLPTLYAAAGGDVRDLGENLDGYNMWNSLLDDLPSPRNEILH 365

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGIT 442
           NID++       +AL +   K V GT  NG  D W+G T  N    Y+  +VL S +   
Sbjct: 366 NIDDIFGN----SALTMSQWKLVKGTTYNGFYDSWFGPTGRNPGYFYNIGQVLSSLSNKA 421

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAH 496
            + L+T                       L  E I  +RE A + C      + +++ ++
Sbjct: 422 LSTLQT---------------------VPLNPESIRRIREEATIDCGNHQLISSESRFSN 460

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
           CN   + CLFN+ +DPCE +N+A+S   ++K++E++L  + ST V P     D R +P  
Sbjct: 461 CNPLKESCLFNVDEDPCEIDNVAKSFPQVVKEMEERLLYWNSTAVKPAKVINDLRGNPKF 520

Query: 557 WNNIWVPWYD 566
           W  +W  + D
Sbjct: 521 WGYVWTNFGD 530


>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
          Length = 563

 Score =  471 bits (1211), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 239/562 (42%), Positives = 336/562 (59%), Gaps = 31/562 (5%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L   L    +  +    ++P+I++I+ADDLGWNDV FHGS+QIPTPNID +A++G+ L+ 
Sbjct: 30  LVLLLFIVTSLSDCECHERPNIVLIIADDLGWNDVGFHGSNQIPTPNIDIMAWSGVSLHN 89

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           +YV  +CTPSR+ALMTGKYPIH GMQH VI   EP GLPLTEK+LPQYLKE GY TH +G
Sbjct: 90  YYVTPICTPSRAALMTGKYPIHTGMQHTVIFAAEPRGLPLTEKILPQYLKELGYKTHLVG 149

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLG +++ Y P  RGFDSH G+W G  D YDH+ +   + Y G D R +    +   G
Sbjct: 150 KWHLGSYKKEYLPLNRGFDSHLGFWNGKIDMYDHTNQE--KGYWGFDFRRDFSTAHDLFG 207

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            Y+TD+YT  A+ +I  HN S P+FL L+H AVH GN  EP +AP++    F  I D +R
Sbjct: 208 QYATDVYTNEAVKIIKSHNTSSPLFLMLSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQR 267

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           R +A ++++LDESVG V+AAL+  G+L +SIV+F  DNG  + G + N  SN+PLRG+K+
Sbjct: 268 RKFAAVLTKLDESVGEVVAALKAKGVLNDSIVVFTTDNGGAAAGFNDNAASNYPLRGVKN 327

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDV 369
           T W+GG+RG   +WSP++ +  +V+++  H+ DWLPTL +AAG+ ++    +DGV+QW  
Sbjct: 328 TLWEGGVRGAGWLWSPFIDKRSRVATQRMHLVDWLPTLLSAAGMNVSSIKHIDGVDQWCA 387

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
           L++   + R  ++HNID+    +   A++     K   GT+  G  D WYG    E   Y
Sbjct: 388 LSQDLPSARESLVHNIDD----ESGSASITYKQWKVHKGTNYGGSWDGWYGPAGRE-GAY 442

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
               +L SKA                      A +   +  +   E IL LR  A V C 
Sbjct: 443 DTTRLLASKA----------------------AGALLDIGMLPDTEHILRLRSEATVECG 480

Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
            D +   C     PCLFNI +DPCE  NLA+   D+L+ +  +L     T VPP N+P  
Sbjct: 481 -DREALPCRPLEAPCLFNIDEDPCETRNLADIHPDVLQVMLKELDRVNRTAVPPNNQPLT 539

Query: 550 KRADPARWNNIWVPWYDELDKQ 571
              DP  W  +   + D ++ +
Sbjct: 540 PGGDPKYWGYVITNFGDYINNE 561


>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
          Length = 561

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 248/567 (43%), Positives = 344/567 (60%), Gaps = 25/567 (4%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F     LL     ++ T+P+ P+I+ ILADDLGWNDV FHGS Q+ TPN+DALAY+G+IL
Sbjct: 6   FRCVAFLLLQLTAVSATSPR-PNIVFILADDLGWNDVGFHGSEQVLTPNLDALAYDGVIL 64

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
             +YVQ +CTPSR+ALMTG++PIH GMQ  VI   EP+GL L E + PQYLK+ GY TH 
Sbjct: 65  ENYYVQPICTPSRAALMTGRHPIHTGMQQNVIYSAEPYGLGLNEIIFPQYLKQIGYKTHI 124

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMRHNMQV 184
           +GKWHLGFF E YTP  RGFDSHYGY+ G +DY+ H    + + +F    GLD R N +V
Sbjct: 125 VGKWHLGFFAEQYTPIERGFDSHYGYYMGAEDYWVHIAGNAHEVSFNASWGLDFRRNGEV 184

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY---EPFQAPDEEVAK 241
                G YST+L+T  A N+IA HN+S+P+F+Y+A  AVH+ N Y      +AP +   K
Sbjct: 185 VKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAELEAPFKYYEK 244

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           F  I + +RR +A MVS LD+SVGN+  AL   G+LEN+I+ F  DNG P+ G   N  S
Sbjct: 245 FPHIKNEKRRKFAAMVSALDDSVGNITKALHAKGILENTIICFSTDNGGPASGFDFNDAS 304

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
           N PLRG+K + W+GG+RG   +WSP +K +   S  +  I D+LPTL  AAG +++  S 
Sbjct: 305 NWPLRGVKESLWEGGVRGTGFVWSPLMKTSGYTSPAMMQIFDFLPTLLKAAGYDMSKLSP 364

Query: 361 -LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
            LDG++QW+++++     R+E+LHNIDN +  +    ALR++++K V     + +S   +
Sbjct: 365 DLDGIDQWEIISERKDAVRTEMLHNIDNTEGTR----ALRINEMKVVVSQGGSTRSWPLW 420

Query: 420 GDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
             TD     Y+  +   S        L  K  I  + A    ++ +  LR +    + L 
Sbjct: 421 YSTDQFYGGYNHTDHSVSA------DLSAKWDIDSEVAPRVNSDLTSVLRNL---GRPLR 471

Query: 480 LREFARVRCNYDNKGAH--CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
                ++ C      A   C+   KPCLFNIT DPCE +NLA+ + D+L  + D+L  Y+
Sbjct: 472 SGTPLKIHCGPVPPSARGSCDVGKKPCLFNITADPCEFHNLADERPDMLSTMLDRLDQYQ 531

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
           ST V P NKP D    P++   IW PW
Sbjct: 532 STAVAPRNKPVDPNGLPSKHGGIWEPW 558


>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
          Length = 557

 Score =  469 bits (1207), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 252/569 (44%), Positives = 336/569 (59%), Gaps = 25/569 (4%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           Y   +  ++L + ++ +    ++P++I ILADD GWNDVSFHGS QIPTPN+DALA++GL
Sbjct: 3   YLHIVALSVLVSSSWASGKQIQQPNLIFILADDFGWNDVSFHGSKQIPTPNLDALAFSGL 62

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           IL  +YV  LCTPSRSALMTGK+PIH GMQH V+     +GLPL+E  LP+YLK+ GY  
Sbjct: 63  ILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDVLYGYSRYGLPLSEITLPEYLKDLGYKN 122

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
           H +GKWHLG ++ VYTP FRGFDSHYGYW G QDYYDH+  A      G DMR N  VD 
Sbjct: 123 HIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQDYYDHT--AVEWNAWGYDMRRNHSVDW 180

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
              G Y+T L T+ A +VI +H+ S P+FLY+AHLAVH+ N Y P QAP+E V  F  I 
Sbjct: 181 SAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHLAVHSANPYSPLQAPEETVEMFSSIE 240

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           + +RR YA MV  LD SVG ++ +L  + MLEN++++F  DNG P+ G + N  SN PL+
Sbjct: 241 NLQRRRYAAMVHELDVSVGKIVKSLGDNNMLENTVIVFSTDNGGPAEGFNQNAASNWPLK 300

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIE--INDTS-- 360
           G+K+TPW+GG+R    IWSP + ++++  V S L  ISDWLPTL  AAG      D +  
Sbjct: 301 GVKNTPWEGGVRAAGLIWSPLIPKSRRGQVMSNLMDISDWLPTLFEAAGQSRICGDVATL 360

Query: 361 --LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
             LDGV+ W+ +     + R+ +LHNID+    Q  YAA+R ++ K V GT   G  D W
Sbjct: 361 KGLDGVSHWNTVLYDKPSARNHVLHNIDD----QLGYAAIRKENWKLVKGTTYEGSWDGW 416

Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
           YG          P       +    N++     I +    D  A    A+  +   + + 
Sbjct: 417 YG----------PSGRTNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLGMSATQNLY 466

Query: 479 ELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
           +    A + C      AH C      CLF+I  DPCE NNLAE    + + L   L+ Y 
Sbjct: 467 QQLLDAEISCGERPVDAHPCLPLDALCLFDIEQDPCEYNNLAEKMPHITEDLLSLLSWYN 526

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPWYD 566
           ST V P N   D  ++P  WN  +  W D
Sbjct: 527 STAVAPLNTSPDPMSNPKYWNYTYTNWAD 555


>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
 gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
          Length = 542

 Score =  468 bits (1205), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 260/570 (45%), Positives = 339/570 (59%), Gaps = 58/570 (10%)

Query: 7   YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +F  L C LL   DA  N   P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2   WFLWLICLLLPIIDAAENEKPPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL EK+LPQYL E GY 
Sbjct: 62  IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H  GKWHLG ++  YTP +RGF SH+                      GLDMR+  QV 
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTQVA 159

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
               G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P   PD +V K   
Sbjct: 160 YDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 219

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I + +RR +A MVS++D+SVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+P
Sbjct: 220 IPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
           L+G+K+T W+GG+R    +WSP LK++Q+VS++  HI DWLPTL  AAG +    +L   
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQPALANLSKQ 339

Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
            DG + W  L     + R  +LHNID++       AAL V D K V GT+  G  D WYG
Sbjct: 340 IDGQSIWRALVHDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395

Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
                  + Y  + V  S+AG    ALK               + +D  R          
Sbjct: 396 PAGERDPRLYDWQLVGRSRAGKALEALKM------------LPSRADQQR---------- 433

Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
           +R  A V C    ++G  C +T    PCLF+I DDPCEQ NLA+   +++  L  +L  +
Sbjct: 434 IRAAATVSCPGQSSQGTSCVATAFSAPCLFHIRDDPCEQYNLAKQYPEVVSALMAELERF 493

Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
            +T VPP NKP D RADP  WN  W  + D
Sbjct: 494 NATAVPPSNKPADPRADPRFWNFTWTNFGD 523


>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
 gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
          Length = 560

 Score =  465 bits (1196), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 249/549 (45%), Positives = 348/549 (63%), Gaps = 34/549 (6%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           TA K P+III++ADD+G++D+S  G+ +  TPNIDALAY+G +L++ Y  ++CTPSR AL
Sbjct: 28  TASKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGAL 87

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++GKYPIH G QH VIL  EPWGL L   L+P+  ++AGY+T+ +GKWHLGF R  YTPT
Sbjct: 88  LSGKYPIHTGTQHSVILNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPT 147

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           +RGFD HYGYW G  DYY    +   + Y  G D R NMQV+    G+Y TDL T  A  
Sbjct: 148 YRGFDYHYGYWGGYIDYYQRRSQMPSDNYSMGYDFRRNMQVECTDRGVYVTDLLTNEAER 207

Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           +I E   K +P+FL L+HLA H GN  +P QAP+EE+ KF+ I+DP RR YA MVS+LDE
Sbjct: 208 IIREQPAKEQPLFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDE 267

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG VI AL +   LENSIV+F +DNGAPS GI +N GSN PL+G K +PW+GG+R   A
Sbjct: 268 SVGRVITALARTDQLENSIVIFYSDNGAPSVGIFANTGSNWPLKGQKDSPWEGGLRVAGA 327

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
           IWSP LK    + ++  ++ D+LPTL  AAGIE++ +  LDG++ W  L+  K A     
Sbjct: 328 IWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIELDSSLHLDGIDLWPQLSGPKDAPHVPR 387

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
           EILH +D+V   Q    +L +   KYV GT + GQ D+    T  ++D   P+  +Y  A
Sbjct: 388 EILHQLDDVGRMQ----SLLLGQYKYVNGTTSKGQYDQLL--TYRKLDDLDPRASIYGMA 441

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
                          +++A  K  +S  ++ I T ++I  L   A V C+   +G  CN+
Sbjct: 442 --------------VRNSAVSKELASFDIQQI-TQDRIDILSRSAAVICSDMQRG--CNA 484

Query: 500 TVKPCLFNITDDPCEQNNLAESQ--TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-- 555
            ++ CL++++ DPCEQNNLA S+  +++L  +  ++   +S  V PGN+P ++ ADP+  
Sbjct: 485 LLEECLYDLSTDPCEQNNLAYSKQHSEILAIMRQRVQQLRSGAVAPGNRPNERDADPSLH 544

Query: 556 --RWNNIWV 562
              W+N  V
Sbjct: 545 ECTWDNFGV 553


>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 514

 Score =  464 bits (1195), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 253/541 (46%), Positives = 327/541 (60%), Gaps = 59/541 (10%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHIIIILADDLGWNDV FHGS+QIPTPNIDALA+ G+ILN +YV  +CTPSRSAL+TGKY
Sbjct: 31  PHIIIILADDLGWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKY 90

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PIH G+QHGVI    P+GL L EKLLP+YL+   Y T  +GKWHLG F++ YTP +RGFD
Sbjct: 91  PIHTGLQHGVIHGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFD 150

Query: 150 SHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           SHYGYW G QDYYDH+  A   P + G DMR  M V     G Y+TDL+T  A+ VI  H
Sbjct: 151 SHYGYWTGHQDYYDHT--AIENPGFWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGH 208

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           + +KP+FLYLAHLA H+GN Y P QAP E VAKF  I D  RR +AGM+S+LDESVG V+
Sbjct: 209 DSNKPLFLYLAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVV 268

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            AL    M+ N ++LF  DNG P+ G + N  SN PLRG+K T W+GG+RGV  IWSP+L
Sbjct: 269 EALADSNMINNCVILFSTDNGGPAGGFNLNAASNWPLRGVKDTLWEGGVRGVGFIWSPFL 328

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
             + KVS+ + HI+DWLPTL +      + + +DG+N W  L+KG  + R EIL NID  
Sbjct: 329 -PSSKVSNAMIHITDWLPTLLSLTNASNSISDIDGINVWPNLSKGLPSVRKEILLNIDT- 386

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
              ++  AALR  + KY  G  NN   ++WYG +  +++ Y   E+  S A      +K 
Sbjct: 387 ---ERKIAALRYKNWKYKKGNANNW--NKWYGPSGRDMN-YDWNELYNSDAAKAIKHIK- 439

Query: 449 KLQIKQKHAADPKANSSDALRTILTDEK-ILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
                                 +L ++K ++EL    +V CN   +    + +       
Sbjct: 440 ----------------------VLPNKKLVMELINATQVTCNVKEEEEEEDES------- 470

Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
              + CE+         +  Q+        S MVPP NKP D R++P  W+  W  W D 
Sbjct: 471 -DQNSCER--------VIWHQI--------SKMVPPRNKPVDIRSNPVYWDYTWTNWMDY 513

Query: 568 L 568
           L
Sbjct: 514 L 514


>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
          Length = 1007

 Score =  464 bits (1193), Expect = e-128,   Method: Compositional matrix adjust.
 Identities = 249/566 (43%), Positives = 335/566 (59%), Gaps = 52/566 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P++I+I+ADDLGWNDVSFH S QI TPNID LAY+G+ILN+HY     T S+ ALMTG +
Sbjct: 456 PNVIVIVADDLGWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSH 515

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P+ +G Q       +PW L    KL+P+Y ++AGYATH IGKW LGF R+ YTPT RGFD
Sbjct: 516 PLSVGTQSASNEPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFD 575

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           SH+G+     DY+DHS +      +GLDMR N+ VD    G Y+TDL+   A+ +I EH+
Sbjct: 576 SHFGFLGPYIDYWDHSLRLRNTSTRGLDMRRNLDVDYSVNGSYATDLFNGEAVRLIREHD 635

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           + KP+ L L HLA H GN  +P QAP EEV KF  I D +RR  A M+S++DE VG +I 
Sbjct: 636 QKKPLLLVLTHLAPHTGNEDDPMQAPAEEVEKFDYIRDEKRRVLAAMISKVDEGVGQIIQ 695

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM--------------------- 308
            L +  ML+NSIVLF ADNGAP+ G+HSN GSN PLRG+                     
Sbjct: 696 TLAERDMLDNSIVLFYADNGAPTVGMHSNSGSNFPLRGVCIDNWSTLHIRCTNGCFSFQQ 755

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAGIE--INDTSLDGVN 365
           K +PW+G +R VAAIWSP L  T  +VS++  H+SDWLPTL  AAGIE     + +DG N
Sbjct: 756 KYSPWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAHAAGIEGIPIGSEIDGRN 815

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
           QW+ L   A T R+ +++NID +     +Y++   +  KYV GT   G+ D W G+ ++E
Sbjct: 816 QWEALKNPAITVRNVVMNNIDEL----HHYSSYSRNGWKYVNGTSWEGKFDNWMGELEDE 871

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
            +    + V+     +    +   L                        E +  LR  A 
Sbjct: 872 DELSEEEYVVRLAGSVVGRMMPLDL------------------------EHVARLRRAAT 907

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           V C  + +   C     PCLFN+ DDPCE+NN+A  Q D+L++L +++  Y+ T V P N
Sbjct: 908 VECEVEGQVKACQPMESPCLFNLMDDPCERNNVALDQLDILEELREEVERYRQTAVEPRN 967

Query: 546 KPFDKRADPARWNNIWVPWYDELDKQ 571
           KP D R+DP  +NN W  W DE+D +
Sbjct: 968 KPADPRSDPGFYNNTWTWWLDEIDSR 993


>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
 gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
          Length = 545

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 258/551 (46%), Positives = 328/551 (59%), Gaps = 56/551 (10%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+G+ILN++YV  +CTPSRSALMT
Sbjct: 23  PPKPHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMT 82

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKYPIH GMQH V+   EP GLPL EK+LPQYL + GY +H  GKWHLG ++  YTP FR
Sbjct: 83  GKYPIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFR 142

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GF SH   W                   GLDMR+   V     G Y+TD+ T+ +++VIA
Sbjct: 143 GFSSHKNLW-------------------GLDMRNGTDVAYNLHGQYTTDVITKHSVSVIA 183

Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            H+ +K P+FLY+AH A H+GN Y P   PD++V K   I   +RR YA MV+++DESVG
Sbjct: 184 NHDAAKGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVG 243

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            ++  LRK  MLENSI++F +DNG P+ G + N  SN+PLRG+K+T W+GG+R    +WS
Sbjct: 244 VIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWS 303

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAG-----IEIN-DTSLDGVNQWDVLTKGAKTKRS 379
           P L++ Q+V  E  HISDWLPTL  AAG        N   SLDG++ W  L     + R 
Sbjct: 304 PLLEKRQRVVDETMHISDWLPTLIEAAGGIPALANANLSQSLDGMSIWQTLVHAYPSPRK 363

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-DTDNEIDKYSPKEVLYSK 438
            +LHNID++       AAL V D K + GT+ NG+ D WYG   + +   Y    VL S 
Sbjct: 364 TVLHNIDDIWGS----AALTVGDWKMIQGTNYNGRWDGWYGPGGERDPHLYDWNLVLKSS 419

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHC 497
           +G     LK               + +D  R          LRE A V C     +G  C
Sbjct: 420 SGKAMTGLKM------------LPSRADQQR----------LRETATVTCQGQSTQGTSC 457

Query: 498 NSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
            +T    PCLF++ DDPCEQ NLAE    +L  L  +L    ST VPP N+P D RA P 
Sbjct: 458 VATAFSAPCLFHVRDDPCEQFNLAEQYPKVLDLLMAELQHINSTAVPPSNQPDDPRASPM 517

Query: 556 RWNNIWVPWYD 566
            +N  W  + D
Sbjct: 518 YFNYTWTNFGD 528


>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
 gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
          Length = 545

 Score =  462 bits (1188), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 258/551 (46%), Positives = 328/551 (59%), Gaps = 56/551 (10%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+G+ILN++YV  +CTPSRSALMT
Sbjct: 23  PPKPHIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMT 82

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKYPIH GMQH V+   EP GLPL EK+LPQYL + GY +H  GKWHLG ++  YTP FR
Sbjct: 83  GKYPIHTGMQHTVLYAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFR 142

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GF SH   W                   GLDMR+   V     G Y+TD+ T+ +++VIA
Sbjct: 143 GFSSHKNLW-------------------GLDMRNGTDVAYNLHGQYTTDVITKHSVSVIA 183

Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            H+ +K P+FLY+AH A H+GN Y P   PD++V K   I   +RR YA MV+++DESVG
Sbjct: 184 NHDAAKGPLFLYVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVG 243

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            ++  LRK  MLENSI++F +DNG P+ G + N  SN+PLRG+K+T W+GG+R    +WS
Sbjct: 244 VIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPLRGVKNTLWEGGVRAAGLVWS 303

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAG-----IEIN-DTSLDGVNQWDVLTKGAKTKRS 379
           P L++ Q+V  E  HISDWLPTL  AAG        N   SLDG++ W  L     + R 
Sbjct: 304 PLLEKRQRVVDETMHISDWLPTLIEAAGGIPALANANLSQSLDGMSIWQTLVHAYPSPRK 363

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-DTDNEIDKYSPKEVLYSK 438
            +LHNID++       AAL V D K + GT+ NG+ D WYG   + +   Y    VL S 
Sbjct: 364 TVLHNIDDIWGS----AALTVGDWKMIQGTNYNGRWDGWYGPGGERDPHLYDWNLVLKSS 419

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHC 497
           +G     LK               + +D  R          LRE A V C     +G  C
Sbjct: 420 SGKAMTGLKM------------LPSRADQQR----------LRETATVTCQGQSTQGTSC 457

Query: 498 NSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
            +T    PCLF++ DDPCEQ NLAE    +L  L  +L    ST VPP N+P D RA P 
Sbjct: 458 VATAFSAPCLFHVRDDPCEQFNLAEQYPKVLDLLVAELQHINSTAVPPSNQPDDPRASPM 517

Query: 556 RWNNIWVPWYD 566
            +N  W  + D
Sbjct: 518 YFNYTWTNFGD 528


>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
 gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
          Length = 531

 Score =  461 bits (1187), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 256/557 (45%), Positives = 331/557 (59%), Gaps = 56/557 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHII ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++YV  +CTPSRSALMTGK
Sbjct: 20  QPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGK 79

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH GMQH V+   EP GLPL  K+LPQYL + GY +H  GKWHLG ++ VYTP +RGF
Sbjct: 80  YPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGF 139

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
            SH+GYW                   GLDMR+  +V     G YSTD+ T+ ++NVI++H
Sbjct: 140 SSHHGYW-------------------GLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKH 180

Query: 209 NKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           N +K P+FLY+AH AVH+GN Y P    D+ V +   I   +RR YA +V+ +DESVG +
Sbjct: 181 NATKGPLFLYVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKI 240

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  LRK  MLE SI++F  DNG P+ G +SN  SN+PLRG+K+T W+GG+RG A +WSP 
Sbjct: 241 VEQLRKSRMLETSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAALLWSPQ 300

Query: 328 LKQTQKVSSELFHISDWLPTLC-----AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           L +  + + +  HI DWLPTL       AA  ++    LDG++ W  L K   + R  +L
Sbjct: 301 LTKRPRTAEQTMHIVDWLPTLVEAAGGEAALAKLGTAKLDGISLWQALLKNETSPRKSVL 360

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGI 441
           HNID++       AAL V D K V GT+ NG  D WYG         Y  + V  S  G 
Sbjct: 361 HNIDDIWGS----AALSVGDWKLVKGTNYNGSWDGWYGPAGMRNPHDYDWQSVAKSATGQ 416

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKGAHC 497
               LK            PK  ++D +R          LRE A V C      D     C
Sbjct: 417 AMQQLKML----------PK--TADMIR----------LREAANVICKDANPKDAPHIAC 454

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
           N   +PCLFN+ +DPCEQ N A+    +L+ L  +L +  +T VPP NKP D RADP  W
Sbjct: 455 NPLSEPCLFNVREDPCEQRNQAKQHPKVLELLLKELQLLNATAVPPANKPDDPRADPRLW 514

Query: 558 NNIWVPWYDELDKQKAI 574
           N+ W  + D ++    I
Sbjct: 515 NHTWTNFGDYIETNDCI 531


>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 526

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 247/552 (44%), Positives = 334/552 (60%), Gaps = 56/552 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+II ILADDLGW+DVSFHGS QIPTPNID LAY+G++L+ +YVQ +CTP+R AL+TG
Sbjct: 25  KQPNIIFILADDLGWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YP+H+G+QH VI   EP GLPL E  LP YLK+ GY+TH +GKWHLGFF + YTPT+RG
Sbjct: 85  RYPMHLGLQHFVITPNEPVGLPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRG 144

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDSHYGY+ G QDYY H+  A +    G D+RHNM +   T G Y  +L+T  A  +I +
Sbjct: 145 FDSHYGYFLGHQDYYTHN--ALWNNQWGFDLRHNMDLQRSTFGEYGPELFTTQAEKLIYD 202

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEP---FQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           H+  KP+FLY AH AVH GN+       +AP +  ++F  I+D +RR YAGMVS LD+SV
Sbjct: 203 HDHKKPLFLYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADHQRRIYAGMVSALDDSV 262

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           GN+  AL+K G+  NSI++F  DNG P    H  +  N PLRG K+T W+GG+RG A I 
Sbjct: 263 GNITRALKKSGLYNNSIIVFSTDNGGP----HECEAFNTPLRGTKNTLWEGGIRGAAFIH 318

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILH 383
           S  L++ ++V   + H+ DWLPT+   AG  ++D  +LDG N WD L+ G  + R+EILH
Sbjct: 319 SVLLEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDLHNLDGFNMWDTLSSGVSSPRTEILH 378

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           NID    P    AALRV +LK + G              +N+ D + P ++ + K     
Sbjct: 379 NID----PITQTAALRVGNLKVIVG--------------ENKSDWFVPPQITFGK----- 415

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
                              N ++  +   +D + + +   +     Y N    C+   KP
Sbjct: 416 -------------------NGTNLTKQFGSDHRGIIVHCSSVPPVAYKN----CDPKQKP 452

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           CL+NI +DPCE  NLA+   D+L+ L  KL  Y ST V P   P D R++P+  N+ W P
Sbjct: 453 CLYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNSTSVTPWVIPPDPRSNPSLHNDTWSP 512

Query: 564 WYDELDKQKAIE 575
           W D  D     E
Sbjct: 513 WIDLADIAPDFE 524


>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
 gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
          Length = 544

 Score =  461 bits (1186), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 253/581 (43%), Positives = 342/581 (58%), Gaps = 58/581 (9%)

Query: 7   YFFALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +F  L C LL    +      P +P+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2   WFLWLICLLLPTLISGEEEIRPAQPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL EK+LPQYL E GY 
Sbjct: 62  IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H  GKWHLG ++  YTP  RGF SH+                      GLDMR+  QV 
Sbjct: 122 SHIAGKWHLGHWKLKYTPLHRGFSSHW----------------------GLDMRNGTQVA 159

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
               G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P   PD +V K   
Sbjct: 160 YDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVLKMSH 219

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I   +RR +A MVS++D+SVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+P
Sbjct: 220 IPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG----IEINDTS 360
           L+G+K+T W+GG+R    +WSP LK++Q+VS++  HI DWLPTL  AAG    +   +  
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGPPAVANLNQQ 339

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           +DG + W  L +   + R  +LHNID++       AAL V D K V GT+  G  D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395

Query: 421 DT-DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
              + +  +Y  + V  S+ G                         DAL+ + +      
Sbjct: 396 PAGERDPRQYDWQLVAKSRVGKAL----------------------DALKMLPSRADQQR 433

Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
           +R  A V C    ++G  C +T    PCLF+I DDPCEQ NLA+   +++  L  +L  +
Sbjct: 434 IRAAATVTCPAQSSQGTSCLATAFSAPCLFHIRDDPCEQYNLAKQYPEVVNALMTELERF 493

Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETM 577
            +T VPP NKP D RADP  WN  W  + D  + +  ++T+
Sbjct: 494 NATAVPPSNKPADPRADPRFWNYTWTNFGDYQNTRNDLDTL 534


>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
 gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
          Length = 542

 Score =  460 bits (1184), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 253/564 (44%), Positives = 334/564 (59%), Gaps = 57/564 (10%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
           C + F      +T   +P+II ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++Y
Sbjct: 18  CPISFALVACQST---QPNIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYY 74

Query: 73  VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
           V  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL  K+LPQYL E GY +H  GKW
Sbjct: 75  VNPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLDLKILPQYLNELGYTSHIAGKW 134

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
           HLG ++ VYTP +RGF SH+GYW                   GLDMR+  ++     G Y
Sbjct: 135 HLGHWKRVYTPLYRGFSSHHGYW-------------------GLDMRNGTEIAYDLHGQY 175

Query: 193 STDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           +TD+ T+ ++ VIA H  +K P+FLY+AH AVH+GN Y P  A D+ V +   I   +RR
Sbjct: 176 TTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRR 235

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            YA +V+ +DESVG ++  LRK  MLENSI++F  DNG P+ G +SN  SN+PLRG+K+T
Sbjct: 236 KYAALVTAMDESVGKIVEQLRKSRMLENSIIVFSTDNGGPAEGFNSNFASNYPLRGVKNT 295

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-----NDTSLDGVNQ 366
            W+GG+RG A +WSP L +  + + +L H+ DWLPTL  AAG ++         LDGV+ 
Sbjct: 296 LWEGGVRGAALLWSPQLTKRPRTAEQLMHMVDWLPTLVEAAGGKVALSKLEAAKLDGVSL 355

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
           W  L +   + R+ ILHNID++       AAL V   K V G+  NG  D WYG      
Sbjct: 356 WQALVRNEASPRNSILHNIDDIWGS----AALSVGVWKLVQGSHYNGSWDGWYG------ 405

Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
                       AG+  N      Q+  K AA     +   L+ +     I+ LR+ A V
Sbjct: 406 -----------PAGLR-NPHDYNWQLVTKCAA---GQAMQQLKMLPQQADIIRLRKAATV 450

Query: 487 RCN----YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
            C       +    CN   + CLFN+ DDPCEQ NLA+    +L  L ++L ++ +T +P
Sbjct: 451 SCKEAKVKPSPDIACNPLKEICLFNVLDDPCEQRNLAKQHPKVLALLLNELQLFNATAIP 510

Query: 543 PGNKPFDKRADPARWNNIWVPWYD 566
           P NKP D R DP  WN+ W  + D
Sbjct: 511 PANKPDDPRGDPRLWNHTWTNFGD 534


>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
 gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
          Length = 541

 Score =  460 bits (1183), Expect = e-126,   Method: Compositional matrix adjust.
 Identities = 258/584 (44%), Positives = 343/584 (58%), Gaps = 60/584 (10%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           + F       F    +  ++PK P+II ILADDLG+NDV FHGS+QIPTPNIDALAY+G+
Sbjct: 5   WLFIFLILPQFLKGEVGNSSPK-PNIIFILADDLGFNDVGFHGSAQIPTPNIDALAYSGI 63

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GL L EK+LPQYL + GY +
Sbjct: 64  ILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVLYAAEPRGLSLKEKILPQYLNDLGYTS 123

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
           H  GKWHLG ++  YTP FRGF SH+                      GLDMR+  +V  
Sbjct: 124 HIAGKWHLGHWKLKYTPLFRGFSSHW----------------------GLDMRNGTEVAY 161

Query: 187 KTIGIYSTDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
              G Y+TD+ T+ A+ VIA HN  S P+FLY+AH A H+ N Y P   PD EV K   I
Sbjct: 162 DLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPDNEVMKLGHI 221

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
              +RR +A MV+++DESVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+PL
Sbjct: 222 PHYKRRKFAAMVTKMDESVGLIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYPL 281

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL---- 361
           +G+K+T W+GG+R    IWSP LK  Q+VS++  HISDWLPTL  AAG     ++L    
Sbjct: 282 KGVKNTLWEGGVRAAGLIWSPLLKSRQRVSTQTIHISDWLPTLLEAAGGAAALSNLSKGI 341

Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD 421
           DG + W  L     + R +++HNID++       A L V D K V GT+  G+ D WYG 
Sbjct: 342 DGQSIWQALVHDEPSPRLKVIHNIDDIWGS----AGLSVGDWKLVKGTNYYGKWDGWYGP 397

Query: 422 T-DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
             + +   Y  +++  S++G   N LK                       + +  +   L
Sbjct: 398 AGERDPHLYDWQQIPKSRSGKALNGLK----------------------MLPSQAEQQRL 435

Query: 481 REFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
           R  A V+C     +G+ C +T    PCLFNI DDPCEQ NLAE    +LK L ++L    
Sbjct: 436 RAAATVKCLGQSPQGSSCKATAFSAPCLFNIHDDPCEQFNLAEQYPKVLKALMEELKYQN 495

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW--YDELDKQKAIETMRL 579
           ST VPP NKP D+RA P  WN  W  +  YD L+ +  +  +++
Sbjct: 496 STAVPPANKPGDRRASPKYWNYTWTNFGDYDLLNNKPVLGDLKV 539


>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
 gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
          Length = 556

 Score =  456 bits (1173), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 254/548 (46%), Positives = 328/548 (59%), Gaps = 55/548 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHII ILADDLG+NDV FHGS+QIPTPNIDALAY+GLILN++YV  +CTPSRS+LMTGK
Sbjct: 25  KPHIIYILADDLGFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGK 84

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH GMQH V+   EP GLPL EKLLPQYL + GY +H  GKWHLG ++  YTP  RGF
Sbjct: 85  YAIHTGMQHTVLYGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGF 144

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           +SH+G+W                   GLDMR+  +V  +  G Y+TD+ T+ +I+VIA H
Sbjct: 145 NSHHGFW-------------------GLDMRNGSRVAYELHGQYTTDVITQHSIDVIANH 185

Query: 209 NKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
             SK P+FLY+AH A H+GN Y P    D+EV K   I   +RR YA MVS+LDESVG +
Sbjct: 186 PVSKGPLFLYVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRYAAMVSKLDESVGII 245

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  LR+  MLENSI++F  DNG  + G + N  SN+PLRG+K+T W+GG+RGV  +WS  
Sbjct: 246 VDQLRRSNMLENSIIVFATDNGGAAEGFNLNFASNYPLRGVKNTLWEGGVRGVGLLWSQR 305

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAG----IEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
             +  +V+ +  HI+DWLPT    AG    +    TSLDG + W  L +   + R  +LH
Sbjct: 306 YLKRPRVAEQTMHITDWLPTFVEVAGGKEALANFTTSLDGQSIWQALVQDEASPRKSVLH 365

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGIT 442
           NID++ N     +AL V D K + GT+  G  D WYG   + +   Y+ ++V  S AG  
Sbjct: 366 NIDDIWNS----SALTVGDWKLIKGTNYKGAWDGWYGPAGERDPQLYNWQQVAKSLAG-- 419

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC----NYDNKGAHCN 498
               K  LQ+K         + +D  R          LR  A V C    +   KG+ CN
Sbjct: 420 ----KAMLQLKV------LPSRADQQR----------LRSAATVNCPNQQSNTQKGSVCN 459

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
               PCLFN+ DDPCEQ NLA     +L  L  +L  + ST +PP N P D R +P  WN
Sbjct: 460 PLQTPCLFNLRDDPCEQYNLATQYPLVLNSLLSELKHFNSTAIPPSNLPDDPRGNPKFWN 519

Query: 559 NIWVPWYD 566
             W  + D
Sbjct: 520 YTWTNFGD 527


>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
 gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
          Length = 565

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 241/545 (44%), Positives = 333/545 (61%), Gaps = 30/545 (5%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR A
Sbjct: 21  SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGA 80

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+YPIH G QH VI   EPW L L   L+P+  KEAGY+T+ +GKWHLGF R  YTP
Sbjct: 81  LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTP 140

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD H+GYW    DY+    K     Y  G D R NM+++ +  G+Y TDL T  A 
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAE 200

Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +I  H  K +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA M+S+LD
Sbjct: 201 RLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLD 260

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVG ++++L     LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   
Sbjct: 261 QSVGRIMSSLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAG 320

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
           AIWS  L+    + S+  +++DWLPTLC AAGI+++ T  LDG++ W  L+  A      
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLCRAAGIKLDSTLKLDGIDLWAELSGSADAPHVP 380

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            EILH +D+V       AAL++   KYV GT   G+ D     T  E+D   P+E  Y+ 
Sbjct: 381 REILHILDDVWR----LAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
                        +  +++A  +A S   LR  LT ++I   R  A VRC    +   CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477

Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
             ++ CL++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADP  
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSEKHSDVLAALRRRVQELRASASRPGNRASMPEADPTL 537

Query: 557 WNNIW 561
               W
Sbjct: 538 HTCAW 542


>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
 gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  455 bits (1171), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 245/554 (44%), Positives = 326/554 (58%), Gaps = 57/554 (10%)

Query: 16  LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
           L +  F+  TA KKPHI+ I+ADDLGW+DVSFHGS QIPTPNID LA  G+ILN +YV  
Sbjct: 10  LMSLPFVALTANKKPHIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSP 69

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +CTP+RSA+MTGKYPIH GMQH VIL  +P+GL L E L+PQYLK  GYATH +GKWHLG
Sbjct: 70  ICTPTRSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLG 129

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
           FF+  YTP  RGFDS++GYW G  DY+DHS    +    GLD+  + Q      G YS+D
Sbjct: 130 FFKYEYTPIQRGFDSYFGYWCGKGDYWDHSNNEKYG--WGLDLHDSEQDVWTEWGHYSSD 187

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L+ E A+NVI+ HN S P+FLYL   AVH+ N  +P QAP + + KF +I D  RR +A 
Sbjct: 188 LFAEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAA 247

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MVS +D ++  V+ +L+   M  NSI++F  DNG P+ G  SN  SN PLRG+K T W+G
Sbjct: 248 MVSSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDSNMASNFPLRGVKRTLWEG 307

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGA 374
           G+RG A I SP + +  +V +EL H+SDWLPTL   AG +I+D  +LDG + WD ++  A
Sbjct: 308 GIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHDLQNLDGFDLWDSISTDA 367

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            + R E++HNID    P  + AA R  + K V   +       WY               
Sbjct: 368 MSPREEMVHNID----PVNWEAAYRFREWKIV--VNQTKYMSGWYP-------------- 407

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK- 493
                          ++ ++ H A                     LR+ A V+C    + 
Sbjct: 408 ------------LPNIEEREPHPA--------------------TLRD-AVVKCGPPPEI 434

Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
             +C ++  PCLFNI +DPCE  NLA+ + ++L  +   L  YK  MVP  N P D  A+
Sbjct: 435 PVNCTASDGPCLFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMVPIRNTPLDPSAN 494

Query: 554 PARWNNIWVPWYDE 567
           PA +  +W  W D+
Sbjct: 495 PANYGGVWTAWKDQ 508


>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
 gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
          Length = 554

 Score =  452 bits (1164), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 240/546 (43%), Positives = 333/546 (60%), Gaps = 30/546 (5%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR 
Sbjct: 20  QSAAARRPNIIIIMADDMGFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRG 79

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           AL++G+YPIH G QH VI   EPW L L   L+P+  KEAGY+T+ +GKWHLGF R  YT
Sbjct: 80  ALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYT 139

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           PT RGFD H+GYW    DY+    K     Y  G D R NM ++ +  G+Y TDL T  A
Sbjct: 140 PTRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEA 199

Query: 202 INVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
             +I +H +K +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA M+S+L
Sbjct: 200 ERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKL 259

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVG +I+ L     L+NSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R  
Sbjct: 260 DQSVGRIISTLSSTDQLDNSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVA 319

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR- 378
            AIWS  L+    + S+  +++DWLPTL  AAGIE++ +  LDG++ W  L+  A     
Sbjct: 320 GAIWSSKLQARGSIFSQPLYVADWLPTLSRAAGIELDSSLKLDGIDLWPELSGSADAPHV 379

Query: 379 -SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
             EILH +D+V       +AL++   KYV GT  +G+ D     T  E+D   P+E  Y+
Sbjct: 380 PREILHILDDV----WRLSALQMGQWKYVNGTTADGRYDSVL--TYRELDDLDPRESRYA 433

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
                         +  +++A  +A S   LR  LT ++I   R  A VRC    +   C
Sbjct: 434 --------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--C 476

Query: 498 NSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
           N  ++ CL++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADP 
Sbjct: 477 NPLLEECLYDISTDPCEQNNLVYSERHSDVLTALRRRVRELRASASRPGNRASMPEADPT 536

Query: 556 RWNNIW 561
                W
Sbjct: 537 LHTCAW 542


>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
 gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
          Length = 559

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 249/557 (44%), Positives = 330/557 (59%), Gaps = 56/557 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  KP+IIIIL DD+G+NDVSFHGS+QI TPNIDALAYNG++LN+HYV  LCTPSR+ L+
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKYPIH GMQH VI+  EPWGLP  E+L+P+  ++AGYATH +GKWHLGF+R+  TPT 
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFD H+GY+ G  DYYD + +     Y  GLD R +++   +  G Y+T+ +T  A  V
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYSTGLDFRRDLEPCREAEGTYATEAFTTEARKV 203

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H+KS+P+F+ L+HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SV
Sbjct: 204 IERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISSLDKSV 263

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G  + AL  +GML NSIVL  +DNGAP+ GIHSN GSN+P RG                 
Sbjct: 264 GQTMRALADNGMLNNSIVLLYSDNGAPTVGIHSNAGSNYPFRG----------------- 306

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
                Q   VS++  H  DWLPTL AAAG+ + +D  LDG+N W  L+  A+ +   +LH
Sbjct: 307 -----QRGYVSNQAIHAIDWLPTLAAAAGVSLPSDLRLDGLNLWPSLSASAQPQPRNLLH 361

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGIT 442
            +D+V      Y++   D LKYV G+   G+ D W G+  D E D  S       +A   
Sbjct: 362 VLDDVFG----YSSYTQDTLKYVNGSSFEGRYDHWLGELEDGEEDPNSKDYGQLVRA--- 414

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-----C 497
                +++Q      A              TD++I ++R+ A  RC    +        C
Sbjct: 415 -----SEVQAVLGSVAP-------------TDDRIRQMRDEATQRCPIQERDPEDPVYLC 456

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP-FDKRADPAR 556
                 CLF++ +DPCE+ NLA      L+ L  ++   + T +P    P  D RADPA 
Sbjct: 457 EPLKAACLFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALPSARVPRADVRADPAL 516

Query: 557 WNNIWVPWYDELDKQKA 573
            +  W  W + L    A
Sbjct: 517 HDGHWEWWNNTLPGSSA 533


>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 562

 Score =  451 bits (1160), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 239/580 (41%), Positives = 357/580 (61%), Gaps = 55/580 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNT--------TAPKKPHIIIILADDLGWNDVSFHGSSQIPTP 56
           RK FF L  +++F    +             K+P+II+I+ADDLGWNDVSFHG++QIPTP
Sbjct: 6   RKTFFLLATSVIFVTGCIYAWIQDSKVWERVKRPNIIVIMADDLGWNDVSFHGANQIPTP 65

Query: 57  NIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLP 116
           NIDALAYNG+IL +HYV  +CTPSR+A +TG+YPI  GMQ   +  GEP  +PL   LLP
Sbjct: 66  NIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPLKAGEPRAIPLNNTLLP 125

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
           +YL++ GYATH +GKWH+G++ + +TPT RGFD+ +GY+ G   Y++H+ K   + + G 
Sbjct: 126 EYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYISYFNHTIKQ--DDHIGY 183

Query: 177 DMRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
           D+ ++   ++K + I     Y+TDL TE A N+I  H++ KP++L L HLA H+ +  E 
Sbjct: 184 DLHYD---NSKNLSIDYNFEYTTDLITERAENIIKNHDRRKPLYLQLCHLAAHSSDAKEV 240

Query: 232 FQAPDEEV--AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
            +  DE+   A    I D  RR YAG+V+ +DESVG VI AL +  MLENSI++F++DNG
Sbjct: 241 MEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALGQSSMLENSIIVFISDNG 300

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
           A + G+  N GSN+PLRG+K T ++GG+RGVA ++S  ++ + ++S+EL HI+DWLPT  
Sbjct: 301 AQTEGLLENYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQNSSRISNELMHITDWLPTFY 360

Query: 350 AAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
           +AAG  +E  + ++DGV+QWD +  G ++KR  +L NID V++     ++  +   K + 
Sbjct: 361 SAAGGNLENLEENMDGVDQWDTIVSGKESKRESVLLNIDEVED----VSSALIGKYKLII 416

Query: 408 GTDNNGQSDEWYGDTDNEID--KYSPKEVLYSKAGIT-FNALKTKLQIKQKHAADPKANS 464
               N Q +++YGD    +   +Y+   VL+S AG T F+A    L IK           
Sbjct: 417 -NGKNIQYNDYYGDNGTSVSYPEYNVSSVLHSIAGSTIFHATNFTLNIKD---------- 465

Query: 465 SDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTD 524
                       +++LR  ARV CN     + C   V  CLF+I +DPCE  +L+    +
Sbjct: 466 ------------VIKLRNKARVVCNNFTSYSKC---VDKCLFDIYNDPCETTDLSNEHPE 510

Query: 525 LLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           ++K+L   +A Y+  ++   N P D R+ P  +N  W+PW
Sbjct: 511 IVKKLNYFIAEYEKVLLHQTNAPVDPRSYPENFNGTWMPW 550


>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
 gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
          Length = 528

 Score =  451 bits (1159), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 251/560 (44%), Positives = 327/560 (58%), Gaps = 59/560 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +PHII ILADDLG+NDV F GS+QIPTPNIDALAY+GLILN++YV  +CTPSRSALM
Sbjct: 17  AIAQPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALM 76

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           T KYPIH GMQH V+   EP GLPL  K+LPQYL + GY +H  GKWHLG ++ VYTP +
Sbjct: 77  TAKYPIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPLY 136

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGF SH+                      GLDMR+  ++     G Y+TD+ T+ ++NVI
Sbjct: 137 RGFSSHW----------------------GLDMRNGTELAYDLHGQYTTDVITQHSLNVI 174

Query: 206 AEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           A+HN +K P+FLY+AH AVH+GN Y P  A D+ V +   I D +RR YA +V+ +D+SV
Sbjct: 175 AKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAMDDSV 234

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G ++  LRK  ML+NSI++F  DNG P+ G +SN  SN+PLRG+K+T W+GG+RG   +W
Sbjct: 235 GRIVEQLRKSHMLDNSIIIFSTDNGGPAEGFNSNFASNYPLRGVKNTLWEGGVRGAGLVW 294

Query: 325 SPWLKQTQKVSSELFHISDWLPT-----LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
           S  L +  +V+ +  HI+DWLPT        AA  ++N+  LDG++ W  L K   + R 
Sbjct: 295 SAQLSKRPRVADQTMHIADWLPTLLEAAGGEAALAQLNNAQLDGMSVWQALLKNEVSPRK 354

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE-IDKYSPKEVLYSK 438
            +LHNID++       AAL V D K V GT+ NG  D WYG      +  Y    V  S 
Sbjct: 355 SVLHNIDDIWGS----AALSVGDWKLVKGTNYNGSWDGWYGPAGTRNLRDYDWYLVAKSL 410

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKG 494
           AG     LK            PK  ++D +R           RE A V C      D   
Sbjct: 411 AGQALQQLKVL----------PK--TADMMRA----------REAANVICKDARLKDAPN 448

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
             CN    PCLFNI +DPCEQ N A+    +L+ +  +L +  ST VPP NKP D    P
Sbjct: 449 ISCNPLKAPCLFNIREDPCEQRNQADVYPKVLELMLKELQLINSTAVPPANKPDDPLGYP 508

Query: 555 ARWNNIWVPWYDELDKQKAI 574
             WN  W  + D +D    I
Sbjct: 509 RFWNYTWTNFGDYVDTNDCI 528


>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
 gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
 gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
          Length = 564

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 239/545 (43%), Positives = 331/545 (60%), Gaps = 30/545 (5%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR A
Sbjct: 21  SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGA 80

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+YPIH G QH VI   EPW L L   L+P+  KEAGY+T+ +GKWHLGF R  YTP
Sbjct: 81  LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTP 140

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD H+GYW    DY+    K     Y  G D R NM+++ +  G+Y TDL T  A 
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAE 200

Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +I +H +K +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA M+S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLD 260

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVG +I AL     LENSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R   
Sbjct: 261 QSVGRIITALSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVAG 320

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
           AIWS  L+    +  +  +++DWLPTL  AA IE++ +  LDG++ W  L+  A      
Sbjct: 321 AIWSSGLQARGSIFRQPLYVADWLPTLSRAADIELDSSLKLDGIDLWPELSGSADAPHVP 380

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            EILH +D+V       +AL++   KYV GT  +G+ D     T  E+D   P++  Y+ 
Sbjct: 381 REILHILDDV----WRLSALQMGQWKYVNGTTASGRYDSVL--TYRELDDLDPRDSRYA- 433

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
                        +  +++A  +A S   LR  LT ++I   R  A VRC    +   CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--CN 477

Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
             ++ CL++I  DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADP  
Sbjct: 478 PLLEECLYDILSDPCEQNNLVYSERHSDVLTALRRRVQELRASASRPGNRASMPEADPTL 537

Query: 557 WNNIW 561
               W
Sbjct: 538 HTCAW 542


>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
 gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
          Length = 565

 Score =  449 bits (1154), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 238/545 (43%), Positives = 332/545 (60%), Gaps = 30/545 (5%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR A
Sbjct: 21  SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGA 80

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+YPIH G QH VI   EPW L L   L+P+  K AGY+T+ +GKWHLGF R  YTP
Sbjct: 81  LLSGRYPIHTGTQHFVISNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTP 140

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD H+GYW    DY+    K     Y  G D R NM+++ +  G Y TDL T  A 
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAE 200

Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +I +H +K +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA ++S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLD 260

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVG +++ L     LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   
Sbjct: 261 QSVGRIMSTLAATDQLENSIVIFYSDNGAPSLGMFANTGSNFPLRGQKNTPWEGGVRVAG 320

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
           AIWS  L+    + S+  +++DWLPTL  AAGIE++ +  LDG++ W  L+  A      
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVP 380

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            EILH +D++       AAL++   KYV GT   G+ D     T  E+D   P+E  Y+ 
Sbjct: 381 REILHILDDIWR----VAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
                        +  +++A  +A S   LR  LT ++I   R  A VRC    +   CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477

Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
             ++ CL++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADPA 
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPAL 537

Query: 557 WNNIW 561
               W
Sbjct: 538 HTCAW 542


>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
 gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
          Length = 619

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 243/549 (44%), Positives = 339/549 (61%), Gaps = 34/549 (6%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           TA K P+III++ADD+G++D+S  G+ +  TPNIDALAY+G +L++ Y  ++CTPSR AL
Sbjct: 28  TANKLPNIIIVMADDMGFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGAL 87

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++GKYPIH G QH VI   EPWGL L   L+P+  +EAGY+T+ +GKWHLGF R  YTPT
Sbjct: 88  LSGKYPIHTGTQHYVIGNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPT 147

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD+HYGYW    DYY    +     Y  G D R NMQV+    G+Y TDL T  A  
Sbjct: 148 HRGFDNHYGYWGAYIDYYQRRSQMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAER 207

Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           +I E   K +P+FL L+HLA H GNT +P QAP+EE+ KF  I DP RR YA MVS+LDE
Sbjct: 208 IIREQPAKEQPLFLILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDE 267

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG VI AL +   LENSIV+F +DNGAPS G+ SN GSN PL+G K++PW+GG+R   A
Sbjct: 268 SVGRVITALARTDQLENSIVIFYSDNGAPSVGMFSNTGSNWPLKGQKNSPWEGGLRVAGA 327

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
           IWSP LK    + ++  ++ D+LPTL  AAGIE++ +  LDG++ W  L+  K A     
Sbjct: 328 IWSPLLKARGNLFTQPIYVGDFLPTLAHAAGIELDSSLHLDGIDLWPQLSGPKDAPHVPR 387

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
           EILH +D+V   Q  +        KYV GT   GQ D+    T  E+D   P+   Y   
Sbjct: 388 EILHQLDDVWRVQSLFQG----QYKYVNGTTLAGQFDQVL--TYRELDDLDPRASRYV-- 439

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
                       ++ +++   +A +   L+ I T  +I  +   A V C+   +G  CN+
Sbjct: 440 ------------VEVRNSPASRALARYDLQRI-TQARINTISRSAAVICSDMQRG--CNA 484

Query: 500 TVKPCLFNITDDPCEQNNLAESQ--TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-- 555
            ++ CL++++ DPCEQNNLA S+  +++L  +  ++   ++  + P N+P ++ ADP+  
Sbjct: 485 LLEECLYDLSTDPCEQNNLAYSKQHSEILTSMRQRVQQLRAGAMAPNNRPSERDADPSLH 544

Query: 556 --RWNNIWV 562
              W+N  V
Sbjct: 545 ECTWDNFGV 553


>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 559

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 214/407 (52%), Positives = 293/407 (71%), Gaps = 10/407 (2%)

Query: 16  LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
           + ++  L T++  +PHII+I+ADDLGW+DVSFHGS QIPTPNID LAY+G++L+ +YVQ 
Sbjct: 16  ILSECCLATSS--QPHIIVIIADDLGWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQP 73

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +CTP+R+ALMTG++PIH+G+Q GVI+   P+GLPL E ++PQYLK  GY TH +GKWHLG
Sbjct: 74  ICTPTRAALMTGRHPIHLGLQDGVIVASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLG 133

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYST 194
           FF   YTP +RGFD+H+GY+ G + YYDH+ +   EP Y GLD R+N ++     G YST
Sbjct: 134 FFAWQYTPLYRGFDTHFGYYNGEEGYYDHTAE---EPKYIGLDFRNNTELFKSAYGEYST 190

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           +L+T  A  +I  HNK+KP+FLYLAH AVH+GN+Y P +AP +  ++F  I D  RRT+A
Sbjct: 191 ELFTSYAEKIIHNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFA 250

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
           GMVS LD+SVGN+  AL+  G+ EN I+LF  DNG P+ G  +N  SN PLRG+K T W+
Sbjct: 251 GMVSALDDSVGNITRALKHSGLYENCIILFSTDNGGPAAGFDANYASNWPLRGIKHTLWE 310

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTK 372
           GG+RG   I SP L++  ++S+++ H+ DWLPT+   AG  ++   T+LDG N WD L++
Sbjct: 311 GGVRGDGFIHSPLLEKPGRISTDMIHVCDWLPTIYHIAGGNVSSLPTNLDGFNVWDTLSR 370

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
           G  + R+EILHNID   N     AALRV + K + GT+ +G+ D WY
Sbjct: 371 GTPSPRTEILHNIDPHKNHNA--AALRVGEFKIILGTEEHGEWDGWY 415


>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
 gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
          Length = 575

 Score =  448 bits (1152), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 240/566 (42%), Positives = 337/566 (59%), Gaps = 43/566 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+IIIILADD+G++DVSF G  +  TPNIDALA++G IL++ Y  A+CTPSR AL++G+
Sbjct: 36  RPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGR 95

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH G QH VI   EPWGL L   L+P+  ++AGY+T+ IGKWHLGF R  YTPT RGF
Sbjct: 96  YPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGF 155

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D HYGYW    DYY    K     Y  G D R NM+++ +  G+Y TDL T  A  VI E
Sbjct: 156 DYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIRE 215

Query: 208 H-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
              + +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA MVS+LD+SVG 
Sbjct: 216 REGQEEPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGR 275

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +++AL   G LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   AIWS 
Sbjct: 276 IVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSA 335

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILH 383
            L+    + ++  +++DWLP+L  AAGIE+  T  LDG++ W  L           EILH
Sbjct: 336 QLQARGNIFTQPIYVADWLPSLAHAAGIELPHTLELDGIDLWPQLAGPVDAPHVPREILH 395

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
            +D+    +    + ++   KY+ GT + GQ D+    T  E+D   P+E  Y       
Sbjct: 396 ILDD----EWRVTSFQMGHWKYINGTTSAGQYDQVL--TYRELDDLDPRESRYV------ 443

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
                   +  +++   +A +   +R  +T ++++ LR  A VRC    +G  CN  V+ 
Sbjct: 444 --------VTVRNSPTSRALARHDIRR-MTQQRVMTLRRSAAVRCGDLQRG--CNPLVEE 492

Query: 504 CLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           C+++I DDPCE+NNL  S   +++L  L  ++ + +     PGN+   +R+DP      W
Sbjct: 493 CVYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQLLREGASAPGNREGIRRSDPTLHKCTW 552

Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
                        +T  + KP L+P 
Sbjct: 553 -------------DTFEVEKPALAPV 565


>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
 gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
          Length = 554

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 239/546 (43%), Positives = 331/546 (60%), Gaps = 30/546 (5%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR 
Sbjct: 20  QSAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRG 79

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           AL++G+YPIH G QH VI   EPW L L   L+P+  KEAGY+T+ +GKWHLGF R  YT
Sbjct: 80  ALLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYT 139

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           PT RGFD H+GYW    DY+    K     Y  G D R NM ++ +  G+Y TDL T  A
Sbjct: 140 PTRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEA 199

Query: 202 INVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
             +I +H +K +P+FL L+HLA H  N  +P QAP+EE+ KF  I D  RR YA M+S+L
Sbjct: 200 ERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAMISKL 259

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVG +I+ L     LENSIV+F +DNGAPS G+ SN GSN PLRG K+TPW+GG+R  
Sbjct: 260 DQSVGRIISTLSSTDQLENSIVIFYSDNGAPSVGMFSNTGSNFPLRGQKNTPWEGGVRVA 319

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR- 378
            AIW+  L+    + S+  +++DWLPTL  AAGIE++ +  LDG++ W  L+  A     
Sbjct: 320 GAIWNSKLQARGSIFSQPLYVADWLPTLSRAAGIELDSSLKLDGIDLWPELSGTADAPHV 379

Query: 379 -SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
             EILH +D+V       +AL++   KYV GT   G+ D     T  E+D   P+E  Y+
Sbjct: 380 PREILHILDDV----WRLSALQMGQWKYVNGTTAAGRYDSVL--TYRELDDLDPRESRYA 433

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
                         +  +++A  +A S   LR  LT ++I   R  A VRC    +   C
Sbjct: 434 --------------VTVRNSATSRALSRYDLRR-LTQQRISLTRRLAAVRCGDLQRS--C 476

Query: 498 NSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
           N  ++ CL++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADP 
Sbjct: 477 NPLLEECLYDISTDPCEQNNLVYSERHSDVLTALRRRVQELRASASRPGNRASMPEADPT 536

Query: 556 RWNNIW 561
                W
Sbjct: 537 LHTCAW 542


>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
 gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
          Length = 498

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 244/541 (45%), Positives = 322/541 (59%), Gaps = 49/541 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+++ADDLGW+D+SFHGS QIPTPN+DALA +G+ILN +YV  + TPSR++ MTGKY
Sbjct: 1   PHIIMVVADDLGWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKY 60

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PIH+G+QH  +   +P+G+PLTEK LP++L+E GY THA+GKW LGFF + YTPT+RGFD
Sbjct: 61  PIHMGVQHDTLHNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFD 120

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S +G+W   +DYY+H      +   G+D+R N+ V N   G+Y T+L    A  VI  H+
Sbjct: 121 SFFGFWTSHEDYYNHVAN---DGGYGIDLRRNLDVSNDHTGVYGTELLAREADEVIENHS 177

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             KP+FLYLAH AVH GN  EP QAP   V KF  I+D  RRT+AGMVS LDES+  ++ 
Sbjct: 178 GDKPLFLYLAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVT 237

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           +L++ G+ +NSI++F  DNG  + G+  + GSN PLRG K+T W+GG+RGVA + SP +K
Sbjct: 238 SLKRKGLYQNSIIIFTTDNGGAAGGLDMSAGSNFPLRGNKNTLWEGGVRGVAFVHSPLIK 297

Query: 330 QTQKVSSELFHISDWLPTL-CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           +  +V   L H SDWLPTL   A G       +DG + WD ++KG  + R E+LH ID +
Sbjct: 298 RPGRVYDGLMHASDWLPTLHLLAGGSPKLLGQIDGYDLWDGISKGTPSPRYEVLHGIDTM 357

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG--DTDNEIDKYSPKEVLYSKAGITFNAL 446
              Q   AALRV D K +   D    SD WY   + ++E+D Y  K  L S A +     
Sbjct: 358 ---QGNRAALRVGDYKVILNQDLKFYSD-WYKRPEGEHELD-YIRKPRLLSNATV----- 407

Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
                IK  H              +L     L                  CN   KPCLF
Sbjct: 408 --DCTIKSPHP-------------LLYTHAPL------------------CNPQKKPCLF 434

Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
           NI  DPCE +NLA    + LK L D+L  Y   M P      D  A+P R   +W PW D
Sbjct: 435 NIKWDPCEYHNLAAFMPNTLKVLLDRLKFYAEGMKPVIYPRADDLANPDRLGGVWTPWKD 494

Query: 567 E 567
           +
Sbjct: 495 D 495


>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
 gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
          Length = 565

 Score =  447 bits (1151), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 238/545 (43%), Positives = 332/545 (60%), Gaps = 30/545 (5%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +L++ Y  A+CTPSR A
Sbjct: 21  SAAARRPNIIIIMADDMGFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGA 80

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+YPIH G QH VI   EPW L L   L+P+  K AGY+T+ +GKWHLGF R  YTP
Sbjct: 81  LLSGRYPIHTGTQHFVISNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTP 140

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD H+GYW    DY+    K     Y  G D R NM+++ +  G Y TDL T  A 
Sbjct: 141 TRRGFDYHFGYWGAYIDYFQRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAE 200

Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +I +H +K +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA ++S+LD
Sbjct: 201 RLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLD 260

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVG +++ L     LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   
Sbjct: 261 QSVGRIMSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAG 320

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR-- 378
           AIWS  L+    + S+  +++DWLPTL  AAGIE++ +  LDG++ W  L+  A      
Sbjct: 321 AIWSSKLQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVP 380

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            EILH +D++       AAL++   KYV GT   G+ D     T  E+D   P+E  Y+ 
Sbjct: 381 REILHILDDIWR----VAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA- 433

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
                        +  +++A  +A S   LR  LT ++I   R  A VRC    +   CN
Sbjct: 434 -------------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CN 477

Query: 499 STVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
             ++ CL++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADPA 
Sbjct: 478 PLLEECLYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPAL 537

Query: 557 WNNIW 561
               W
Sbjct: 538 HTCAW 542


>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
 gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
          Length = 575

 Score =  442 bits (1138), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 238/566 (42%), Positives = 335/566 (59%), Gaps = 43/566 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+IIIILADD+G++DVSF G  +  TPNIDALA++G IL++ Y  A+CTPSR AL++G+
Sbjct: 36  RPNIIIILADDMGFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGR 95

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPIH G QH VI   EPWGL L   L+P+  ++AGY+T+ IGKWHLGF R  YTPT RGF
Sbjct: 96  YPIHTGTQHFVISNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGF 155

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D HYGYW    DYY    K     Y  G D R NM+++ +  G+Y TDL T  A  VI E
Sbjct: 156 DYHYGYWGAYIDYYQRRSKMPARNYSLGYDFRRNMELECRDRGVYVTDLLTNEAERVIRE 215

Query: 208 H-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
              + +P+FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA MVS+LD+SVG 
Sbjct: 216 REGQEQPLFLVLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGR 275

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +++AL   G LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   AIWS 
Sbjct: 276 IVSALNSTGQLENSIVIFYSDNGAPSVGMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSA 335

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILH 383
            L+    + ++  +++DWLP+L  AAGI +  T  LDG++ W  L           EILH
Sbjct: 336 QLQARGNIFTQPIYVADWLPSLAHAAGIGLPHTLELDGIDLWPQLAGPVDAPHVPREILH 395

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
            +D+    +    + ++   KY+ GT + GQ D+    T  E+D   P+E  Y       
Sbjct: 396 ILDD----EWRVTSFQMGHWKYINGTTSAGQYDQVL--TYRELDDLDPRESRYV------ 443

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
                   +  +++   +A +   +R  +T ++++ LR  A VRC    +G  CN  V+ 
Sbjct: 444 --------VTVRNSPTSRALARHDIRR-MTQQRVMTLRRSAAVRCGDLQRG--CNPLVEE 492

Query: 504 CLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           C+++I DDPCE+NNL  S   +++L  L  ++ + +     PGN+   +R+DP      W
Sbjct: 493 CVYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQMLREGASAPGNRQGIRRSDPTLHKCTW 552

Query: 562 VPWYDELDKQKAIETMRLHKPLLSPT 587
                        +   + KP L+P 
Sbjct: 553 -------------DVFEVEKPALAPV 565


>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
          Length = 500

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 241/558 (43%), Positives = 324/558 (58%), Gaps = 70/558 (12%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           + C LL +   L  +    P+++ ILADDLGW+D+S HGS +IPTPNID LA +G++LN 
Sbjct: 8   IACFLLLSS--LRNSFAGHPNVVFILADDLGWDDISLHGSQEIPTPNIDLLATDGILLNN 65

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           +YVQ +CTPSR+ALMTG++P+H+G+QH VI+  +P+GL L E LLPQYLK  GY+TH +G
Sbjct: 66  YYVQPICTPSRAALMTGRHPVHLGLQHDVIVWAQPYGLGLNETLLPQYLKTLGYSTHMVG 125

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLGF+ + +TPT RGFDSH GY+ G +DYYDH+   T + + GLD  H+ +VD    G
Sbjct: 126 KWHLGFYDKEHTPTKRGFDSHLGYYTGCEDYYDHTWGFTKQDW-GLDFWHDREVDRSAFG 184

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP--FQAPDEEVAKFLDISDP 248
            YST+++T  A  VIAEH+ SKP+FLYLA  AVH+GN       QAP + V  F+ I   
Sbjct: 185 QYSTEVFTSEAERVIAEHDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSE 244

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           ERR +AGMVS LD+SV NV  AL + G+L ++I++F  DNG P+     N   N PLRG 
Sbjct: 245 ERRVFAGMVSALDDSVSNVTKALHEKGILNDTIIIFSTDNGGPANRQDYNDACNWPLRGS 304

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT--SLDGVNQ 366
           K T W+GG+RG   IWSP L+ +  VS  L  I DW+PT+  AAG +++    SLDG++Q
Sbjct: 305 KRTMWEGGVRGNGFIWSPLLENSGYVSEHLMQIVDWVPTVLEAAGFDMSKLPPSLDGLSQ 364

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
           W  L++   + R  +LHNID ++       ALR  D+K V G           GDT+   
Sbjct: 365 WKALSRHESSPRKVLLHNIDPINGD----GALRTTDMKIVFGG----------GDTERYP 410

Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
             Y P+E L                         K  SS A                   
Sbjct: 411 GWYEPEEALIRSPS--------------------KRPSSTA------------------- 431

Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
                     C   +KPC+FNIT DPCE  N+A+   D+L++L  K+  +  T VPP N 
Sbjct: 432 ----------CKPWLKPCVFNITADPCEYYNIADFHPDILQELLQKVDEFNQTAVPPRNV 481

Query: 547 PFDKRADPARWNNIWVPW 564
           P D +  P   N +W PW
Sbjct: 482 PSDPKGLPINNNGVWGPW 499


>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 546

 Score =  441 bits (1135), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 230/548 (41%), Positives = 338/548 (61%), Gaps = 48/548 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II+I+ADDLGWNDVSFHG++QIPTPNIDALAYNG+IL +HYV  +CTPSR+A +TG+
Sbjct: 23  RPNIIVIMADDLGWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGR 82

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI  GMQ   +  GEP  +PL   LLP+YL++ GYATH +GKWH+G++ + +TPT RGF
Sbjct: 83  YPIRTGMQGYPLKAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGF 142

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-----YSTDLYTEAAIN 203
           D+ +GY+ G   Y++H+ K     + G D+ ++   ++K + +     Y TDL TE A N
Sbjct: 143 DTFFGYYNGYISYFNHTIKQN--NHVGYDLHYH---NSKNLSVAYNFEYITDLITERAEN 197

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLD 261
           +I  H++ KP++L L+HLAVH+ +  E  +  DE+   A    I D  RR YAG+V+ +D
Sbjct: 198 IIKNHDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMD 257

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ESVG VI AL +  MLENSI++FM+DNGA + G+  N GSN+PLRG+K T ++GG+RGVA
Sbjct: 258 ESVGRVIKALGQSSMLENSIIVFMSDNGAQTEGLLENYGSNYPLRGLKFTLFEGGIRGVA 317

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRS 379
            ++S  ++ + ++S+EL HI+DWLPT  +AAG  +E  + ++DGV+QWD +  G ++KR 
Sbjct: 318 CVYSRLIQNSSRISNELIHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIVSGRESKRG 377

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID--KYSPKEVLYS 437
            +L NID V+N             KY      N Q  ++YGD    +   +Y+   VL+S
Sbjct: 378 SVLLNIDEVENVSSALLG------KYKLINGKNIQYSDYYGDNGTSVSYPEYNVSSVLHS 431

Query: 438 KAGIT-FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
             G T F+     L I                      E +++LR  ARV CN       
Sbjct: 432 FVGSTIFHTTNFTLNI----------------------EDVIKLRNKARVVCNNFTSYPK 469

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
           C   V  CLF++ +DPCE  +L++   +++K+L   +  Y+  ++   N P D  + P  
Sbjct: 470 C---VDKCLFDVYNDPCETTDLSDEYPEIVKKLNYFITEYEKVLLHQTNAPVDPHSYPEN 526

Query: 557 WNNIWVPW 564
           +N  W+PW
Sbjct: 527 FNGTWMPW 534


>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
 gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
          Length = 525

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 241/537 (44%), Positives = 325/537 (60%), Gaps = 24/537 (4%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
           +ADDLG++DVSF GS+   TPNIDALAY+G+ILN  Y  A+CTPSRSAL+TGKYPI  GM
Sbjct: 1   MADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGM 60

Query: 96  QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
           QH VI+  +PWGLPL E  + +  ++ GY T  +GKWHLG  +  +TPT RGFD+H GY 
Sbjct: 61  QHYVIVNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYL 120

Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN-KSKP 213
               DYYD +   + + Y +G D R N++  +  +G Y TD+ ++AA+ +I +HN  +KP
Sbjct: 121 GAYIDYYDQTYLQSSQNYSRGHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180

Query: 214 MFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRK 273
           +FL L+HLA HA N  +P QAP EE+++F  I +   R YA MVSRLD+SVG VI AL  
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240

Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
             ML NSIVLF++DNG PS G H+N  SN+PLRG K TPW+GG+R  AAIWS   ++   
Sbjct: 241 REMLRNSIVLFLSDNGGPSVGEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEFERLGS 300

Query: 334 VSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
           V  +  +I D LPTLC+AAGI ++ +  LDG+N W  L  G ++   EI+H+ID VD PQ
Sbjct: 301 VWKQRIYIGDLLPTLCSAAGISLDPSMHLDGLNLWSALKYGYESVEREIVHSIDAVD-PQ 359

Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQI 452
            + A +R    K + GT N+GQ D W G      D+  P+ V Y +  I    +  +LQ+
Sbjct: 360 PHLAYMR-GKWKIINGTTNSGQYDGWLG--QRITDEVDPRSVHYDQL-IRNTTVWQQLQL 415

Query: 453 KQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDP 512
               A  P+            +  I ELR+ A++ C        C   + PC F+I  DP
Sbjct: 416 L---ADKPQ-----------QERNITELRQQAQILCPTPANAKPCMPLIAPCFFDIEVDP 461

Query: 513 CEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
           CEQNNL +    T +   L +++  Y      P NKP D + DP  + N W  W DE
Sbjct: 462 CEQNNLYDEYRNTSIFLDLWERIDQYARNAHLPNNKPADPKCDPRYYRNEWTWWQDE 518


>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 588

 Score =  441 bits (1134), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 241/563 (42%), Positives = 327/563 (58%), Gaps = 34/563 (6%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F AL   LL +      +  ++PHI+ ILADDLGWNDVS++G  QI TPNIDALA+NG+ 
Sbjct: 47  FIALAAMLLTS----RRSPVRQPHIVFILADDLGWNDVSYNGCPQIRTPNIDALAWNGIR 102

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L ++Y Q +CTPSR+ALMTG+YPIH GMQH VIL+ EP GLPL  KLLPQ+L + GY + 
Sbjct: 103 LQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVILQNEPRGLPLKFKLLPQWLGDLGYVSQ 162

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVD 185
            +GKWHLGF+++ YTPT RGF  H G W G  DYY H    K  F  + GLD R  +   
Sbjct: 163 MLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVDYYSHIRFNKIGFS-HSGLDFRQGLSEG 221

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
            +  G Y T+  TEAA  VI  H   KP+FLYLAHLA H  N ++P Q P +   K+ DI
Sbjct: 222 REFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQVPKKYSDKYHDI 281

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
               R  YAGMVS LDESVG V+ AL K GML +++++F +DNG  + G + N  S+ P 
Sbjct: 282 GHWNRTMYAGMVSALDESVGAVVEALGKRGMLSDTVLVFSSDNGGDTNGENPNYASSWPF 341

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-SELFHISDWLPTLCAAAGIEINDTS-LDG 363
           +G K T W+GG+     IWSP     +    + +FHISDWLPTL   AG + +D   +DG
Sbjct: 342 KGQKRTLWEGGIHVPGFIWSPLFSGMRGFDYNNIFHISDWLPTLYQLAGGDPSDLGDIDG 401

Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           ++  D L++ ++T R E+L NID ++N     +A+     K V+G    G  DEW+    
Sbjct: 402 ISHLDSLSRRSETPRKELLINIDPIEN----VSAIIEGHFKLVSGVVRGGVLDEWFQVPG 457

Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
           N                IT++  + + + +    A    N+   +     D         
Sbjct: 458 N----------------ITWDYNRARQECETSLVARVLRNAGHDVACGSEDGSFP----- 496

Query: 484 ARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
             ++C   +    C  TV PCLF+++ DPCE NN+AE   ++L++L  KL  Y+ T VPP
Sbjct: 497 TPIKCGKRDPSKPCVPTVAPCLFDLSKDPCEYNNIAEQHNEVLQRLLGKLEGYRETSVPP 556

Query: 544 GNKPFDKRADPARWNNIWVPWYD 566
           GN P D++AD A  NN+W  W D
Sbjct: 557 GNLPSDEQADAALHNNVWSSWGD 579


>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
 gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
          Length = 567

 Score =  439 bits (1129), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 245/551 (44%), Positives = 334/551 (60%), Gaps = 35/551 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           TT  K+P+IIIILADD+G++DVSF G  +  TPNIDALAY+G IL+  Y  A+CTPSR A
Sbjct: 22  TTTTKRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPAMCTPSRGA 81

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+YP H G QH VI   EPWGL L   L+P+  KEAGY+T+ IGKWHLGF    YTP
Sbjct: 82  LLSGRYPAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLGFASPEYTP 141

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD HYGYW    DYY    +     Y  G D R NM ++ +  G+Y TDL T+ A 
Sbjct: 142 TRRGFDYHYGYWGAYIDYYQRRSQMPVANYSMGYDFRRNMDLECQNRGVYITDLLTQEAE 201

Query: 203 NVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +VI E   + +  FL L+HLA H  N  +P QAP+EE+ KF  I DP RR YA MV++LD
Sbjct: 202 HVIREKAAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRRKYAAMVAKLD 261

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVG +I  L +   LENSIV+F +DNGAPS G  +N GSN PLRG K+TPW+GG+R   
Sbjct: 262 QSVGRIITTLAQTEQLENSIVIFYSDNGAPSVGQFANTGSNWPLRGQKNTPWEGGIRVAG 321

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVL--TKGAKTK 377
           AIWS  L+      ++  +++DWLPTL  AAGI++ ++S  LDG++ W  L  +K A   
Sbjct: 322 AIWSTQLQARGNKFTQPIYVADWLPTLAHAAGIDLPESSLQLDGLDLWPQLAGSKDAAHV 381

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
             EILH +D+V     + +++ +   KY+ GT  +G  D+    T  E+D   P+   Y 
Sbjct: 382 PREILHALDDV----WHVSSMLLGHWKYINGTTASGTYDQVL--TYRELDDLDPRNSRYV 435

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
                       ++++   A+   A+  D  R  L  +KI  LR  A VRC    +G  C
Sbjct: 436 ------------VEVRNSPASRALAH-FDLQR--LGQQKITSLRHSASVRCGNLQRG--C 478

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTD--LLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
           N  V+ CL++I  DPCEQNNL  S+    +L  L  +L  ++ + +PPGN+P  +++DP+
Sbjct: 479 NPLVEECLYDIAADPCEQNNLIYSKQHEAVLMALRKRLQEFRLSALPPGNRPSMRQSDPS 538

Query: 556 ----RWNNIWV 562
                W+N  V
Sbjct: 539 LHECTWDNFLV 549


>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
 gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
          Length = 583

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 244/561 (43%), Positives = 330/561 (58%), Gaps = 34/561 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           T    KP+IIII+ADDLG++DVSF GS+   TPNIDALAY+G+ILN  Y   +CTPSR+A
Sbjct: 29  TEISGKPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAA 88

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TGKYPI+ GMQH VI+  +PWGLPL E  +    +  GY T  IGKWHLG  +  YTP
Sbjct: 89  LLTGKYPINTGMQHYVIVNDQPWGLPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTP 148

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           T RGFD H GY     DYY+ S +   + Y+G D R N++ +++ +  Y TD+ T+AA+ 
Sbjct: 149 TLRGFDYHLGYLGAYVDYYNQSYEQVSKGYRGHDFRENLKPNHEHVDKYVTDILTDAAVR 208

Query: 204 VIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
            I +H   N SKP+FL L HLA HA N  +P QAP +E++ F  I D   R YA MVSRL
Sbjct: 209 EIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQAPADELSGFEYIRDETHRYYAAMVSRL 268

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVG+V+ AL +  ML+NSI+LF++DNG P+ G H+   SN+PLRG K++PW+GG+R  
Sbjct: 269 DKSVGHVVDALARQDMLQNSILLFLSDNGGPTLGEHATTASNYPLRGQKNSPWEGGIRSS 328

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRS 379
           AAIWS   ++   V  +  +I D LPTL AAAGI  + D  LDG+N W  L  G ++   
Sbjct: 329 AAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPDLRLDGLNLWSALKYGYESVER 388

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPK-EVLYS 437
           EI+H ID  + P+ + A  R    K V+GT N+G+ D W G+    E+D  S + E L  
Sbjct: 389 EIVHVIDE-EAPEPHLAYTR-GKWKVVSGTTNSGRYDGWLGERVTTEVDPRSQEYEALIQ 446

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH- 496
              +        LQ+++                   +  I  LR+ AR+ C    +  H 
Sbjct: 447 NTSVW-------LQLQKVSGG---------------ERNISGLRQQARIECPATTEDEHT 484

Query: 497 -CNSTVKPCLFNITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
            C     PCLF+I  DPCEQNNL E    T + + L  ++  + +   PP NKP D   +
Sbjct: 485 PCLPLEAPCLFDIEADPCEQNNLYEEYKNTTVFQDLWQRVQQFTAQAHPPNNKPGDPSCN 544

Query: 554 PARWNNIWVPWYDELDKQKAI 574
           PA ++N W  W DE     A 
Sbjct: 545 PAFYHNEWTWWQDETPSNSAF 565


>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 536

 Score =  437 bits (1123), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 239/548 (43%), Positives = 337/548 (61%), Gaps = 30/548 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+++ I+ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN HY Q+  TPSR+AL+TGK
Sbjct: 7   KPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGK 66

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP+ +G+Q   I   E   LP   K++ +Y K+ GYATH +GKWHLG  R   TPTFRGF
Sbjct: 67  YPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGF 125

Query: 149 DSHYGYWQGLQDYYDHSC--KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           D  +G++ G   YYD+    K   + Y G D+R +        G Y+TDL+ E A++VI 
Sbjct: 126 DHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQ 185

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           +HN + P+F+ +AHLAVH GN  +  +AP E V KF  I DP RRTYA MVS+LD+S+G 
Sbjct: 186 KHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGA 245

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V  AL    ML+N+IV+F++DNGAP+ G H N GSN+PLRG+K T ++GG+R VA IWSP
Sbjct: 246 VFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSP 305

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
            L Q+ +VS++L HI+DWLPTL  A G  + + D  LDG++QW  L     + R++I  N
Sbjct: 306 LLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDIPLN 365

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           ID     +   AALR    K + GT  NG  + ++G   NE    + +E  Y+ + I  +
Sbjct: 366 IDE----KTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNE----NIEEQQYNTSAINDS 417

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHCN-STVK 502
            +    +I +K   +P           L++     LR  A ++C +   K   C+ ++  
Sbjct: 418 PVG---RIAKKINYNP-----------LSETDFDGLRRVATLKCLDAKAKRNPCDPASGA 463

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWNNIW 561
            CL+NI +DPCE+N+LA+    ++++++  L  Y+  +VP      D + ADP  +   W
Sbjct: 464 VCLYNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAW 523

Query: 562 VPWYDELD 569
            PW D  D
Sbjct: 524 NPWRDCSD 531


>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
          Length = 558

 Score =  435 bits (1119), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 239/551 (43%), Positives = 339/551 (61%), Gaps = 30/551 (5%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +P+++ I+ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN HY Q+  TPSR+AL+
Sbjct: 26  ATGQPNVVFIVADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALL 85

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKYP+ +G+Q   I   E   LP   K++ +Y K+ GYATH +GKWHLG  R   TPTF
Sbjct: 86  TGKYPMKLGLQGPSITPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTF 144

Query: 146 RGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           RGFD  +G++ G   YYD+  + K   + Y G D+R +        G Y+TDL+ E A++
Sbjct: 145 RGFDHFFGFYNGFTSYYDYVSNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVD 204

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           VI +HN + P+F+ +AHLAVH GN  +  +AP E V KF  I DP RRTYA MVS+LD+S
Sbjct: 205 VIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDS 264

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +G V  AL    ML+N+IV+F++DNGAP+ G H N GSN+PLRG+K T ++GG+R VA I
Sbjct: 265 IGAVFEALEAKNMLQNTIVVFISDNGAPTVGPHHNWGSNYPLRGIKDTLFEGGVRTVACI 324

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRSEI 381
           WSP L Q+ +VS++L HI+DWLPTL  A G  + + D  LDG++QW  L     + R++I
Sbjct: 325 WSPLLVQSSRVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWSSLVYDLPSARNDI 384

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
             NID     +   AALR    K + GT  NG  + ++G   NE    + +E  Y+ + I
Sbjct: 385 PLNIDE----KTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNE----NIEEQQYNTSAI 436

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC-NYDNKGAHCN-S 499
             + +    +I +K   +P           L++     LR  A ++C +   K   C+ +
Sbjct: 437 NDSPVG---RIAKKINYNP-----------LSETDFDGLRRVATLKCLDAKAKRNPCDPA 482

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWN 558
           +   CL+NI +DPCE+N+LA+    ++++++  L  Y+  +VP      D + ADP  + 
Sbjct: 483 SGAVCLYNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQ 542

Query: 559 NIWVPWYDELD 569
             W PW D  D
Sbjct: 543 FAWNPWRDCSD 553


>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
 gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
          Length = 570

 Score =  434 bits (1116), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 241/563 (42%), Positives = 323/563 (57%), Gaps = 32/563 (5%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L C            A ++P+IIII+ADD+G++DVSF G  +  TPNIDALAY+G +
Sbjct: 8   FLLLLCLRGSKANGAEMEATRRPNIIIIMADDMGFDDVSFRGGREFITPNIDALAYHGRL 67

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L++ Y  A+CTPSR AL++G+YPIH G QH VI   EPW L     L+P+  + AGY+T+
Sbjct: 68  LDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVISNEEPWALDSNATLMPEIFQRAGYSTN 127

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDN 186
            +GKWHLGF R  YTPT RGFD HYGYW    DYY    K     Y  G D R NM+++ 
Sbjct: 128 LVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYIDYYQRRSKMPVANYSLGYDFRRNMELEC 187

Query: 187 KTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
           +  G Y TDL T  A  +I E   K KP+FL L+HLA H  N  +P QAP+EE+ KF  I
Sbjct: 188 RDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPEEEIRKFSYI 247

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
            DP RR YA MVS+LD+SVG +I+ L     LENSIV+F +DNGAPS G+ +N GSN PL
Sbjct: 248 KDPNRRKYAAMVSKLDQSVGRIISTLASTDQLENSIVIFYSDNGAPSVGMFANTGSNWPL 307

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND---TSLD 362
           RG K+TPW+GG+R   A+WS  L+    + ++  ++ DWLPTL  AA IE+       LD
Sbjct: 308 RGQKNTPWEGGVRVAGAVWSSKLEARGSIFTQPIYVGDWLPTLAHAADIELEPKILAGLD 367

Query: 363 GVNQWDVLTKGAKTKR--SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           G++ W  L+  A       EILH++D+V        +L +   KYV GT + GQ D    
Sbjct: 368 GIDLWPQLSGAADAPHVPREILHSLDDVWK----VGSLLMGHWKYVNGTTSAGQYDSVL- 422

Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
            T  E D   P+E  Y+              +  +++   +A +   LR  LT ++I  L
Sbjct: 423 -TYREFDDIDPRESRYA--------------VTIRNSPASRALTRLDLRR-LTQQRIAGL 466

Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKS 538
           R  A V C    +   CN  ++ CLF+I+ DPCEQNNLA  E  +D+L  L  +    ++
Sbjct: 467 RRSAAVHCGDFQRS--CNPLLEDCLFDISADPCEQNNLAYSERHSDVLAALRKRFRELRA 524

Query: 539 TMVPPGNKPFDKRADPARWNNIW 561
           +   PGN+     ADP      W
Sbjct: 525 SASRPGNRASMPSADPTWHTCTW 547


>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
 gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
          Length = 574

 Score =  433 bits (1113), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 235/549 (42%), Positives = 331/549 (60%), Gaps = 34/549 (6%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           T  K P++III+ADD+G++D+S  G  +  TPNIDAL ++G +L++ Y  A+CTPSR AL
Sbjct: 30  TGNKLPNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYAPAMCTPSRGAL 89

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++GKYPIH G QH VI   EPW L L   L+P+  + AGY+T+ +GKWHLGF R  YTPT
Sbjct: 90  LSGKYPIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWHLGFARPEYTPT 149

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD HYGYW    DYY    +   + Y  G D R NM+V+    G+Y TDL T  A  
Sbjct: 150 HRGFDYHYGYWGAYIDYYQRRSQMPEKTYIVGYDFRRNMEVECTDRGVYVTDLLTNEAER 209

Query: 204 VIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           +I E   K KP+FL + HLA H  N  +P QAP+EEV KFL I DP  R YA MVS+LD+
Sbjct: 210 IIQETAAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPNHRKYAAMVSKLDQ 269

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG VI AL +   LENSIV+F +DNGAPS G+ +N GSN PL+G K++PW+GG+R    
Sbjct: 270 SVGRVITALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGVRVAGV 329

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL--TKGAKTKRS 379
           IWS  L     + ++  ++ D+LPTL  AAGIE+ ++  LDG++ W  L  +K A     
Sbjct: 330 IWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIELPESLQLDGIDLWPQLAGSKDAPHVPR 389

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
           EILH++D+V     ++++L +   KYV GT   GQ D+    T  E+D+  P+   Y   
Sbjct: 390 EILHSLDDV----SHFSSLLLGQWKYVNGTTLAGQFDQQL--THRELDELDPRARRYV-- 441

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
                       ++ +++   KA +   L+  LT  ++  LR  A VRC    +G  CN+
Sbjct: 442 ------------VEVRNSPASKALARYDLQR-LTQHRMHTLRRLATVRCGDMQRG--CNA 486

Query: 500 TVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP--- 554
            ++ CL++++ DPCE NNLA  E   ++L  +  ++   ++  +PPGN+P    A+P   
Sbjct: 487 LLEECLYDLSVDPCELNNLAYSERHAEILATMRQRVQQLRAEALPPGNRPGVSYANPNLH 546

Query: 555 -ARWNNIWV 562
              W+N  V
Sbjct: 547 ECTWDNFEV 555


>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
 gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
          Length = 478

 Score =  431 bits (1107), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 240/524 (45%), Positives = 313/524 (59%), Gaps = 58/524 (11%)

Query: 7   YFFALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +F  L C LL N  A  +   P +P+II ILADDLG+NDV FHGS+ IPTPNIDALAY+G
Sbjct: 2   WFLWLICLLLPNMIAGEDEKPPAQPNIIFILADDLGFNDVGFHGSADIPTPNIDALAYSG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL EK+LPQYL E GYA
Sbjct: 62  IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYA 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H  GKWHLG ++  YTP +RGF SH+                      GLDMR+  +V 
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTEVA 159

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
               G Y+TD+ T+ ++ VIA HN +K P+FLY+AH A H+ N Y P   PD +V K   
Sbjct: 160 YDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMAH 219

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I   +RR +A MVS++D+SVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+P
Sbjct: 220 IPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG----IEINDTS 360
           L+G+K+T W+GG+R    +WSP LK++Q+VS++  HI DWLPTL  AAG    +      
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIIDWLPTLLEAAGGPAAVANLSKQ 339

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           +DG + W  L +   + R  +LHNID++       AAL V D K V GT+  G  D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AALSVGDWKLVKGTNYRGSWDGWYG 395

Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
                  + Y  + V  S+AG    ALK               + +D  R          
Sbjct: 396 PAGERNPRLYDWQMVARSRAGKALEALKM------------LPSQADQQR---------- 433

Query: 480 LREFARVRC-NYDNKGAHCNSTV--KPCLFNITDDPCEQNNLAE 520
           +R  A V C    +KG  C +T    PCLF+I DDPCEQ NLA+
Sbjct: 434 IRAAATVSCPAQSSKGTSCLATAFSAPCLFHIRDDPCEQYNLAK 477


>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
 gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
 gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
 gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
 gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
 gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
          Length = 585

 Score =  430 bits (1105), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 242/547 (44%), Positives = 322/547 (58%), Gaps = 31/547 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+IIII+ADDLG++DVSF GS+   TPNIDALAY+G+ILN  YV  +CTPSR+AL+TGK
Sbjct: 34  KPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI+ GMQH VI+  +PWGLPL E  + +  +E GY T  +GKWHLG  +  +TPT RGF
Sbjct: 94  YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D H GY     DYY  S +   + Y G D R +++  +  +G Y TDL T+AA+  I +H
Sbjct: 154 DRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDH 213

Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
              N S+P+FL L HLA HA N  +P QAP EEV++F  IS+   R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVG 273

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           +VI AL +  ML+NSI+LF++DNG P+ G HS   SN+PLRG K++PW+G +R  AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
              ++   V  +  +I D LPTL AAAGI  +    LDG+N W  L  G ++   EI+H 
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393

Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           ID +V  P   Y   +    K ++GT N G  D W G    E  +  P+ V Y +     
Sbjct: 394 IDEDVAEPHLSYTRGK---WKVISGTTNQGLYDGWLG--HRETSEVDPRAVEYEE---LV 445

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
                 LQ++Q                   +  I ELR+ +R+ C     G   C     
Sbjct: 446 RNTSVWLQLQQ---------------VSFGERNISELRDQSRIECPDPATGVKPCLPLEG 490

Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
           PCLF+I  DPCE++NL AE Q + +   L  ++  +     PP NKP D   DP  ++N 
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWSRIQQFAKQAHPPNNKPGDPNCDPRFYHNE 550

Query: 561 WVPWYDE 567
           W  W DE
Sbjct: 551 WTWWQDE 557


>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 502

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 239/557 (42%), Positives = 327/557 (58%), Gaps = 68/557 (12%)

Query: 14  TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
           T +     +      KPHII+I+ADDLGWND+SFHGS++IPTPNID LA NG+IL+ +YV
Sbjct: 4   TFVLTSFIMIVIHADKPHIIMIVADDLGWNDISFHGSNEIPTPNIDRLANNGVILDNYYV 63

Query: 74  QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
             +CTPSRSA+MTG+YPIH GMQ   I    P+G+ L EK LPQYLK+ GY TH +GKWH
Sbjct: 64  LPICTPSRSAIMTGRYPIHTGMQQDTIFGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWH 123

Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
           LGFF + YTPT+RGFDS+YG + G  DY++HS   T   Y GLD+  N        G YS
Sbjct: 124 LGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTET---YSGLDLHDNENGVFSQDGNYS 180

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPERRT 252
           T++YT  AI+ I  HN S+P+FLYLA+ AVH+ NT E P QAP E + KF  I   +RR 
Sbjct: 181 TEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRK 240

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           YA M+  +D  VG V  AL K  ML+NSI++F  DNG P+ G   N  +N PLRG+K+T 
Sbjct: 241 YAAMLGYMDYGVGRVHDALAKKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKATL 300

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           ++GG+RGV+ ++S  L ++ +VS EL HI+DWLPTL   AG +++D  LDG +QW  L  
Sbjct: 301 FEGGVRGVSFVYSK-LIESPRVSHELIHITDWLPTLVNLAGGKVSDGFLDGFDQWATLQN 359

Query: 373 GAKTKRSEILHNIDNVDNPQKYY--AALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
              ++R+E+L NID     +K +   ALRV   K +       + + W        D + 
Sbjct: 360 KQSSQRNEVLLNID-----EKVWKNEALRVGSWKIIK------EGNYW--------DGWY 400

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P      ++  +F+ L +                                     V+C +
Sbjct: 401 PPPSFNEQSNNSFSYLSST------------------------------------VKCGH 424

Query: 491 DNKGA--HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKP 547
           D      HC+S    CLF+I +DPCE N+L++   ++L +L ++L  Y+ +MVPP  N  
Sbjct: 425 DIPIVINHCDSY---CLFHIDEDPCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMT 481

Query: 548 FDKRADPARWNNIWVPW 564
            D R+DP   N +W PW
Sbjct: 482 IDPRSDPKLHNGVWQPW 498


>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
           purpuratus]
          Length = 514

 Score =  429 bits (1103), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 230/543 (42%), Positives = 321/543 (59%), Gaps = 56/543 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PHII I+ADDLGW+DVS HGSSQIPTPNID LA +G+ L  +YV  LCTPSRSA+MTG
Sbjct: 9   QPPHIIFIVADDLGWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTG 68

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++PIH G+Q GVI    P+GL L EK + QYLK  GY+THA+GKWHLG+F + YTPT+RG
Sbjct: 69  RHPIHTGLQFGVISPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRG 128

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS +G++ G  DYY H  +       G D+  N +V     G YSTD++ + A  +I  
Sbjct: 129 FDSFFGFYNGRGDYYTH--EEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKA 186

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           HN S+P+FLYLAH AVHAG   +   QAPD+   +F  I    RR YA MVS LD+SVG+
Sbjct: 187 HNASQPLFLYLAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGS 246

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSF----GIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           +   LR  G+ +NSI++F  DNG  ++    G H+   SN PLRG K T W+GG+RG A 
Sbjct: 247 ISQTLRDAGLYDNSIIVFTTDNGGATYDFFDGTHA---SNWPLRGGKHTLWEGGVRGTAF 303

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEI 381
           + SP +K+ ++ S ++ H+ DWLPTL + AG +  +  + DG +QWD L+  A + R+EI
Sbjct: 304 VNSPLIKKPRRFSDQMMHVCDWLPTLHSVAGGDPKELKNSDGFDQWDALSNNAPSPRNEI 363

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
           LHNI+    P + YAA+RV D K   G        +++         Y P+    + A  
Sbjct: 364 LHNIE----PIEKYAAIRVGDFKLHYGAFGGKIGSKYFSGW------YPPEGATSNMAEY 413

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
             N+ + +   KQ +A+                             C+Y ++        
Sbjct: 414 VPNSFRVRCPSKQANAS---------------------------TNCDYGDR-------- 438

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           KPCL+NI DDPCE NN+A+   D+++ +  +L  Y+ + V P N+P D +++P    N+W
Sbjct: 439 KPCLYNIRDDPCEYNNIADWNQDIVEAMTARLREYRRSAVEPRNQPLDPQSNPKLHGNVW 498

Query: 562 VPW 564
            PW
Sbjct: 499 EPW 501


>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
          Length = 512

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 231/539 (42%), Positives = 319/539 (59%), Gaps = 39/539 (7%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GWNDVS+HG  QI TPNIDALA+NG+ L ++Y Q LCTPSR+AL+TG+YPI++G+QH VI
Sbjct: 1   GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EP GLPL++ LLPQ+L + GY TH +GKWH+GFF++ YTPT RGF+ H G+W    D
Sbjct: 61  YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120

Query: 161 YYDHSCKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
           YY H  KA   P +  GLDMR N+ +     G Y T L T+ A+ VI  H   KP+FLYL
Sbjct: 121 YYKHE-KAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYL 179

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           AHLA H+    +P Q PD+ + ++ DI   ER  YAGMVS LDESVG V  AL K+GML 
Sbjct: 180 AHLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGMLS 239

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE- 337
           N++ +F +DNGA + G ++N  S+ PLRG K TPW+GG+R  A +WS  L    +   + 
Sbjct: 240 NTLFVFTSDNGADAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHALGPRVRAGGDY 299

Query: 338 --LFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTK-----RSEILHNIDNVD 389
             LFHI+DWLPTL   AG +  +   +DGV+  + L   A  +     R+E+L NID +D
Sbjct: 300 EHLFHITDWLPTLYQLAGGDPAHLGDIDGVSHVNALVGDASPQPPQPPRTEVLVNIDPID 359

Query: 390 NPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTK 449
           N    Y+A+     K+VAG    G  ++WY    N +D  S + +   +A      LK+ 
Sbjct: 360 N----YSAIIDGRFKFVAGVAERGIYEQWYPILGN-VDWNSTEPLEQCEASPVARVLKS- 413

Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNIT 509
                + A  P   +                   A + C   +    C  TV PCLF++ 
Sbjct: 414 --FGPEPACGPSGGA-------------------APIDCGVRDPAKACQPTVAPCLFDLR 452

Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
           +DPCE N+++  + +L+K L  KL  Y+   +PP N P D  +DP  ++N W+PW ++L
Sbjct: 453 EDPCEYNDVSAERPELVKSLLTKLRRYEDASMPPANIPHDPLSDPGLYDNAWIPWREKL 511


>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
 gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
          Length = 585

 Score =  427 bits (1098), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 241/547 (44%), Positives = 322/547 (58%), Gaps = 31/547 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+IIII+ADDLG++DVSF GS    TPNIDALAY+G+ILN  YV  +CTPSR+AL+TGK
Sbjct: 34  KPNIIIIMADDLGFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI+ GMQH VI+  +PWGLPL E  + +  +E GY T  +GKWHLG  +  +TPT RGF
Sbjct: 94  YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D H+GY     DYY  S +   + Y G D R N++  +  +G Y TD+ T+AA+  I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDH 213

Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
              N S+P+FL L HLA HA N  +P QAP EEV++F  IS+   R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVG 273

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           +VI AL +  ML+NSI+LF++DNG P+ G HS   SN+PLRG K++PW+G +R  AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
              ++   V  +  +I D LPTL AAAGI  +    LDG+N W  L  G ++   EI+H 
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393

Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           ID +V  P   Y   +    K ++GT N G  D W G    E  +  P  V Y +     
Sbjct: 394 IDEDVAVPHLSYTRGK---WKVISGTTNEGLYDGWLG--HRETSEVDPGAVEYEE---LV 445

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
                 LQ++Q  +                +  I ELR+ +R+ C     G   C     
Sbjct: 446 RNTSVWLQLQQVTSG---------------ERNISELRDKSRIECPDPATGVKPCLPLEG 490

Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
           PCLF+I  DPCE++NL AE Q + +   L +++  +      P NKP D   DP  ++N 
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWERIQQFAKQAHSPNNKPGDPNCDPRFYHNE 550

Query: 561 WVPWYDE 567
           W  W DE
Sbjct: 551 WTWWQDE 557


>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
 gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
          Length = 584

 Score =  426 bits (1096), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 248/575 (43%), Positives = 329/575 (57%), Gaps = 38/575 (6%)

Query: 2   TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
           TW    F  + C  L N       A  +P+IIII+ADDLG++DVSF GS+   TPNIDAL
Sbjct: 12  TWLLAGF--VLCIALSNGII---AASDRPNIIIIMADDLGFDDVSFRGSNNFLTPNIDAL 66

Query: 62  AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
           AY+G+ILN  YV  +CTPSR+AL+TGKYPI+ GMQH VI+  +PWGLPL E  + +  +E
Sbjct: 67  AYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRE 126

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
            GY T  +GKWHLGF +  +TPT RGFD H GY     DYY  S +   + Y G D R N
Sbjct: 127 NGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVDYYTQSYEQQSKGYNGHDFRDN 186

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
           ++  +  +G Y TD+ T+AA+  I +H   N S+P+FL L HLA HA N   P QAP EE
Sbjct: 187 LKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAANDDNPMQAPAEE 246

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
           V++F  I +   R YA MVS+LD+SVG+VI AL +  ML+NSIVLF++DNG P+ G HS 
Sbjct: 247 VSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQEMLQNSIVLFLSDNGGPTQGQHST 306

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
             SN+PLRG K +PW+G +R  AAIWS   ++   V  +  +I D LPTL AAAGI  + 
Sbjct: 307 TASNYPLRGQKDSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDP 366

Query: 359 T-SLDGVNQWDVLTKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
              LDG+N W  L  G ++   EI+H ID +V  P   Y   +    K ++GT N G  D
Sbjct: 367 ALHLDGLNLWSALKYGYESVEREIVHVIDEDVAVPHLSYTRGK---WKVISGTTNQGLYD 423

Query: 417 EWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEK 476
            W G    E  +  P+ V Y +       L     + Q+            L+ + + E+
Sbjct: 424 GWLG--HRETSEVDPRAVEYEE-------LVRNTSVWQQ------------LQQVTSGER 462

Query: 477 -ILELREFARVRCNYDNKGAH-CNSTVKPCLFNITDDPCEQNNL-AE-SQTDLLKQLEDK 532
            I ELR+ A + C     G   C     PCLF+I  DPCE++NL AE   +     L  +
Sbjct: 463 NISELRDQASIECPNPATGVRPCLPLEAPCLFDIEADPCERSNLYAEYKNSTTFLDLWAR 522

Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
           +  +     PP NKP D   DP  ++N W  W DE
Sbjct: 523 IQQFAKQAQPPNNKPADPDCDPRFFHNEWTWWQDE 557


>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
 gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
          Length = 585

 Score =  426 bits (1094), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 240/547 (43%), Positives = 321/547 (58%), Gaps = 31/547 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+IIII+ADDLG++DVSF GS    TPNIDALAY+G+ILN  YV  +CTPSR+AL+TGK
Sbjct: 34  KPNIIIIMADDLGFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI+ GMQH VI+  +PWGLP+ E  + +  +E GY T  +GKWHLG  +  +TPT RGF
Sbjct: 94  YPINTGMQHYVIVNDQPWGLPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGF 153

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D H+GY     DYY  S +   + Y G D R N++  +  +G Y TD+ T+AA+  I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDH 213

Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
              N S+P+FL L HLA HA N  +P QAP EEV++F  IS+   R YA MVSRLD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVG 273

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           +VI AL +  ML+NSI+LF++DNG P+ G HS   SN+PLRG K++PW+G +R  AAIWS
Sbjct: 274 SVIDALARQEMLQNSIILFLSDNGGPTEGQHSTTASNYPLRGQKNSPWEGALRSSAAIWS 333

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
              ++   V  +  +I D LPTL AAAGI  +    LDG+N W  L  G ++   EI+H 
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393

Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           ID +V  P   Y   +    K ++GT N G  D W G    E  +  P+ V Y +     
Sbjct: 394 IDEDVAVPHLSYTRGK---WKVISGTTNEGLYDGWLG--HRETSEVDPRAVEYEE---LI 445

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
                 LQ++Q  +                +  I ELR  +R+ C     G   C     
Sbjct: 446 RNTSVWLQLQQVSSG---------------ERNISELRYQSRIECPDPATGVKPCLPLEG 490

Query: 503 PCLFNITDDPCEQNNL-AESQ-TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
           PCLF+I  DPCE++NL AE Q + +   L  ++  +      P NKP D   DP  ++N 
Sbjct: 491 PCLFDIEADPCERSNLYAEYQNSTIFLDLWARIQQFAKQAHSPNNKPGDPNCDPRFYHNE 550

Query: 561 WVPWYDE 567
           W  W DE
Sbjct: 551 WTWWQDE 557


>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 599

 Score =  424 bits (1090), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 244/585 (41%), Positives = 334/585 (57%), Gaps = 47/585 (8%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +    L  +LL + +FL     K+PHII+I+ADDLGWNDV FHGS QIPTPNIDALAYNG
Sbjct: 10  RILIILVSSLLSDCSFLRGPNKKQPHIILIVADDLGWNDVGFHGSIQIPTPNIDALAYNG 69

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN+HYVQ  CTPSR+AL+TGKYPI  G+Q   I+ G P  LPL EK+LPQYLK+ GY+
Sbjct: 70  VILNRHYVQPTCTPSRAALLTGKYPIRYGLQGFPIIAGVPLALPLNEKILPQYLKDLGYS 129

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-V 184
           TH +GKWHLG  +  +TP  RGFDSH+GYW G   Y + +         G D R   +  
Sbjct: 130 THLVGKWHLGANKNQHTPIKRGFDSHFGYWNGFISYRNSTHSTGL--MVGKDARRGFERA 187

Query: 185 DNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAG----NTYEPFQAPDEEV 239
            ++ +  Y+TD++T+ A  VI    N  KPMFL ++HLAVH G    N  E       ++
Sbjct: 188 GDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSHLAVHTGVPGPNILEVSNKTHNDI 247

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIH 296
            +F  I + ERR YAGM++ LDESVG++I +L  +GMLE+SIVLF++DNGAP+      +
Sbjct: 248 -RFDYIENKERRLYAGMLTSLDESVGSIIESLDNNGMLEDSIVLFISDNGAPADDPIWGY 306

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            N GSN PLRG K    DGG+RGVAAIWSPWLK+  ++   LFHI+DWLPTL  AAG + 
Sbjct: 307 GNSGSNWPLRGEKGAVLDGGVRGVAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAGGDF 366

Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ-------KYYAALRVDDLKYVAG 408
            D   +DGV+QW+ LT+ +K  RS +L NID     +       K   + R  D+ Y   
Sbjct: 367 EDLGQIDGVDQWESLTETSKRTRSMVLINIDETSGEEALIFNHWKVVKSNRTSDIAYYL- 425

Query: 409 TDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL 468
                   ++ G+  N    Y+   V  S AG   + +   +            + +D +
Sbjct: 426 --------KYSGEPGNVGPDYNMTAVAESLAGSRLSKINCLV------------SKADCM 465

Query: 469 RTILTDEKILELREFARV------RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
            T+ T +    +R  A++      R +  +  A       PCLF+I  DPCE  N+A   
Sbjct: 466 DTLTTYDLFYNVRSQAKIDGKCSERISDPDPNARYECFDSPCLFDIQRDPCEYQNIASRN 525

Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDE 567
            + L    D LA +K+ M        D ++DP  ++  W  W ++
Sbjct: 526 PEALNMTADILAQFKNEMTMQNYPETDPKSDPRFFDGYWDTWLEK 570


>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
 gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
          Length = 591

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 237/548 (43%), Positives = 316/548 (57%), Gaps = 31/548 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+IIII+ADDLG++DVSF GS+   TPNIDALAY+G+ILN  YV  +CTPSR+AL+TGK
Sbjct: 34  KPNIIIIMADDLGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGK 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI+ GMQH VI+  +PWGLPL E  + +  +E GY T  +GKWHLGF +  +TPT RGF
Sbjct: 94  YPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGF 153

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D H+GY     DYY  S +   + Y G D R N++  +  +G Y TD+ T+AA+  I +H
Sbjct: 154 DRHFGYLGAYVDYYTQSYEQQNKGYNGHDFRDNLKSTHDHVGRYITDVLTDAAVKEIEDH 213

Query: 209 ---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
              N S+P+FL L HLA HA N   P QAP EEV++F  I +   R YA MVS+LD+SVG
Sbjct: 214 GSKNSSQPLFLLLNHLAPHAANDDNPMQAPAEEVSRFEYIGNKTHRYYAAMVSKLDQSVG 273

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            VI AL +  ML+NSIVLF++DNG P+ G HS   SN+PLRG K +PW+G +R  AAIWS
Sbjct: 274 YVIDALARQEMLQNSIVLFLSDNGGPTQGQHSTTASNYPLRGQKDSPWEGALRSSAAIWS 333

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
              ++   V  +  +I D LPTL A AGI  +    LDG+N W  L  G ++   EI+H 
Sbjct: 334 TEFERLGSVWKQQIYIGDLLPTLAAGAGISPDPALHLDGLNLWSALKYGYESVEREIVHV 393

Query: 385 ID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           ID +V  P   Y   +    K ++GT N G  D W G    E  +  P+  +Y +     
Sbjct: 394 IDEDVAMPHLSYTRGK---WKVISGTTNQGLYDGWLG--HRETSEVDPRAAVYEELVRNT 448

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVK 502
           +  +   Q+                     +  I ELR+ A + C     G   C     
Sbjct: 449 SVWRQLQQVS------------------FGERNISELRDQASIECPDTANGVRPCLPLEA 490

Query: 503 PCLFNITDDPCEQNNL-AE-SQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
           PCLF+I  DPCE++NL AE   +     L  ++  +     PP NKP D   DP  ++N 
Sbjct: 491 PCLFDIEADPCERSNLYAEYKNSTTFLDLWARIQQFAKQSHPPNNKPGDPDCDPRFYHNE 550

Query: 561 WVPWYDEL 568
           W  W DE+
Sbjct: 551 WTWWQDEM 558


>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 579

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 234/550 (42%), Positives = 325/550 (59%), Gaps = 54/550 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+ +ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN+HYV    TPSR A  TG Y
Sbjct: 31  PHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLY 90

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IGMQ   I  GEP GLPL  K+LP++L+  GY T  IGKWH+G+    YTP  RGFD
Sbjct: 91  PIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFD 150

Query: 150 SHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           + +G++     YYD+         ++ ++G D  H ++ +      Y TDL+T+ AI +I
Sbjct: 151 TFFGFYNSHITYYDYEYSNQNMTGYDMHRGDDPAHGIKRE------YVTDLFTKEAIKII 204

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-KFLDISDPERRTYAGMVSRLDESV 264
             H   +P++L ++HLAVHA     P + PD+  + + + I +P RR YA MVS+LDESV
Sbjct: 205 ENHELPRPLYLQISHLAVHA-----PIEQPDDSSSDEIIQIREPNRRKYAKMVSKLDESV 259

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G ++ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVAA+W
Sbjct: 260 GRIVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALW 319

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
           S  L++  +V  +L HI+DWLPTL +AAG ++ D   +DG++QW VL++     R ++L 
Sbjct: 320 SSRLEKAARVFKKLIHITDWLPTLYSAAGGDLKDLGEIDGIDQWRVLSEEQGRGREKLLL 379

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA---G 440
           NID V    K   A+     K + G+  NG  D++YGD+   ++     EV+   A    
Sbjct: 380 NIDEV---MKTEGAI-YSRFKLLRGSIGNGYYDKYYGDSGRTLETPPYTEVVLKSAVSQS 435

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN------YDNKG 494
           IT+            H   P    S           +++LR  A V+C+      Y +  
Sbjct: 436 ITY------------HLGGPVTQPS----------TMIQLRREATVQCHPNVTYYYRHSF 473

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
             CN  V  CLF+I +DPCE  N+AE+   + + L+  L  Y   ++    KP D  ADP
Sbjct: 474 TFCN--VTECLFDIVNDPCETKNIAEAYARIARDLDLYLEQYGRVLMKQIRKPVDWLADP 531

Query: 555 ARWNNIWVPW 564
            R NN W PW
Sbjct: 532 RRRNNTWEPW 541


>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 575

 Score =  422 bits (1086), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 253/578 (43%), Positives = 327/578 (56%), Gaps = 29/578 (5%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F L   L   D  L     K+PHI++I+ADDLGWNDV FHGS QIPTPNIDALAYNG I
Sbjct: 11  IFILIVNLQLTDYVLLRELKKRPHIVVIVADDLGWNDVGFHGSIQIPTPNIDALAYNGAI 70

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LN+HYVQ  CTPSR+AL+TGKYPI  G+Q   I  G+   LP  EK+LPQYLKE GY+TH
Sbjct: 71  LNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPIASGKASALPTNEKILPQYLKELGYSTH 130

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLG +++ +TPT RGFDSH+GYW G   Y + S  AT     G+D R   +    
Sbjct: 131 LVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYISYRN-STHAT-RTMSGIDARRGFERAGN 188

Query: 188 TI--GIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNT----YEPFQAPDEEVA 240
            +    Y+TD++TE A  +I +   ++  MFL ++HLAVH+GN+     E       + A
Sbjct: 189 EMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEA 248

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIHS 297
            F  I +  RR YAGMV+ LD+SVG +I +L +H MLENSIV+F++DNGAP+      H 
Sbjct: 249 -FGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFISDNGAPTDDPLWGHD 307

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
           N GSN PLRG K++  +GG+RGVAAIWSPWLK+   VS+ LFHI+DWLPTL  AAG    
Sbjct: 308 NFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAGGNFE 367

Query: 358 DTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG--TDNNGQ 414
           D   +DGVNQW  LT  +K  RS+IL NID     +    AL  +  K V    T N   
Sbjct: 368 DLGKIDGVNQWKSLTDTSKQVRSKILINIDETRGEE----ALIFNQWKVVKSNRTSNMVF 423

Query: 415 SDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTD 474
           + ++ GD  N   KY+   V+ S  G   + +    Q       D   N+S A       
Sbjct: 424 NLKYAGDPGNAGPKYNMSGVVASTVGKRLSKINCLTQ-------DSCMNASTARDVFYVS 476

Query: 475 EKILELR-EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
             + ++  +    R +  N           CLF+I  DPCE  N+A+     L      L
Sbjct: 477 RSLAKVDGQTCPERVSNPNPDQQYECFDGYCLFDILQDPCEYRNVAKQNPQTLDATIHML 536

Query: 534 AIYKSTMVPPGNKP-FDKRADPARWNNIWVPWYDELDK 570
             +K  M      P  D  ADP R+   W PW +  DK
Sbjct: 537 EQFKKEMTMQAPDPIIDPDADPRRFAGYWEPWLEPFDK 574


>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
 gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
          Length = 576

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 232/544 (42%), Positives = 323/544 (59%), Gaps = 30/544 (5%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           T  K P+IIII+ADD+G++D+S  G  +  TPNIDALA++G +L++ Y  A+CTPSR AL
Sbjct: 30  TGSKLPNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAMCTPSRGAL 89

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++GKYPIH G QH VI   EPW L L   L+P+  + AGY+T+ +GKWHLG+ R  +TPT
Sbjct: 90  LSGKYPIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGYARPEFTPT 149

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD HYGYW    DYY    +   + Y  G D R NM+V+    G+Y TDL T  A  
Sbjct: 150 HRGFDYHYGYWGAYIDYYQRRSQMPDKTYIMGYDFRRNMEVECADRGVYMTDLLTNEAER 209

Query: 204 VIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           VI E   K +P+FL + HLAVH GN  +P Q P+EE+ KF  I DP  R YA MVS+LD+
Sbjct: 210 VIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRKYAAMVSKLDK 269

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG VI AL +   LENSIV+F +DNGAPS G+ +N GSN PL+G K++PW+GG+R   A
Sbjct: 270 SVGRVIGALARAEQLENSIVIFYSDNGAPSVGMFANTGSNWPLKGQKNSPWEGGLRVAGA 329

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLT--KGAKTKRS 379
           IWSP LK    V ++  ++ D+LPTL  AAGIE+ D+  LDG++ W  L+  K A     
Sbjct: 330 IWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIELPDSLQLDGIDLWPQLSGPKEAPHVPR 389

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
           EILH++D+V N     ++L +   KYV GT   GQ D+       E+D   P+   Y   
Sbjct: 390 EILHSLDDVWN----VSSLLLGHWKYVNGTTLAGQYDQQL--VQRELDDLDPRANRY--- 440

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
                 ++  L  +     D +  +   ++ I+ D         A+V C    +G  CN+
Sbjct: 441 ---VVEVRNSLASRALARFDQQRLTQHRMKNIMID---------AKVDCGDMQRG--CNA 486

Query: 500 TVKPCLFNITDDPCEQNNLA--ESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
            ++ CL++++ DPCE+NNLA  E   ++L  +  ++   ++  VP  N P    A+P   
Sbjct: 487 LLEECLYDMSVDPCERNNLAYSEQHAEILATMRQRIKQLRAGAVPIQNLPGVAYANPTLH 546

Query: 558 NNIW 561
              W
Sbjct: 547 ECTW 550


>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
          Length = 564

 Score =  422 bits (1085), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 237/549 (43%), Positives = 323/549 (58%), Gaps = 52/549 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+ +ADDLGWNDV FHGS+QIPTPNIDALAYNG+ILN+HYV    TPSR A  TG Y
Sbjct: 31  PHIIVFMADDLGWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLY 90

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IGMQ   I  GEP GLPL  K+LP++L+  GY T  IGKWH+GF    YTP  RGFD
Sbjct: 91  PIRIGMQGDGIRGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFD 150

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
           + +G++     YYD+  + + +   G DM      D+   G+   Y+TDL+T  AI +I 
Sbjct: 151 TFFGFYNSHITYYDY--EYSNQNMTGYDMHCG---DDPAYGMKREYATDLFTNEAIKIIE 205

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-KFLDISDPERRTYAGMVSRLDESVG 265
            H   +P++L ++HLAVHA     P + PD+    + + I +P RR YA MVS+LDESVG
Sbjct: 206 NHELPRPLYLQISHLAVHA-----PIEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVG 260

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            V+ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVAA+WS
Sbjct: 261 RVVHALGEKGMLRDSLILFLTDNGAASIGRYRNYGSNYPLRGTKYTLYEGGVRGVAALWS 320

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
             L++  +V  +L HI+DWLPTL +AAG ++ D   +DG++QW VL++G    R ++L N
Sbjct: 321 SRLEKGARVFKKLIHITDWLPTLYSAAGGDLKDLGKIDGIDQWRVLSEGQGHGREKLLLN 380

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA---GI 441
           ID V   +           K + G+  NG  D++YGD+   ++     EV+   A    I
Sbjct: 381 IDEVMITE----GAIYSRFKLLRGSVGNGYYDKYYGDSGRTLETPPYTEVVLKSAVSQSI 436

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN------YDNKGA 495
           T+            H   P    S           +++LR  A V+C+      Y +   
Sbjct: 437 TY------------HLGGPVTQPS----------TMVQLRREATVQCHPNMSYYYRHSFT 474

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
            CN  V  CLF+I +DPCE  N+AE+   + + L+  L  Y   ++    KP D  ADP 
Sbjct: 475 FCN--VTECLFDIVNDPCETKNIAEAYARIARDLDLYLEHYGRVLMKQIRKPVDWLADPK 532

Query: 556 RWNNIWVPW 564
           R NN W PW
Sbjct: 533 RRNNTWEPW 541


>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
 gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
          Length = 558

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 229/566 (40%), Positives = 324/566 (57%), Gaps = 43/566 (7%)

Query: 14  TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
           +L+ N + +      +PHII+I+ DD+G ND+    ++QIPTPNIDAL YNG++L+++YV
Sbjct: 11  SLIQNSSQVCRVRRPRPHIIVIMGDDMGHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYV 69

Query: 74  QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
           Q  CTPSR+A +TG YPI   MQ   I+ GE   LPL   L+PQ+LK  GY TH +GKWH
Sbjct: 70  QNACTPSRAAFLTGNYPIRSAMQGLPIVAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWH 129

Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS 193
           LG      TPT +GFDSH+GYW G   YYD+         +G D+    + +    G Y+
Sbjct: 130 LGSAYRSSTPTEKGFDSHFGYWNGFTGYYDYFTDFNSTAIEGFDLHDRFETERGYQGQYA 189

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK--FLDISDPERR 251
           T ++TE A+++I  HN ++P+FL + HLA HAG        P+E  A+  +  I DP RR
Sbjct: 190 TRVFTERALDIIEGHNTTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRR 249

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            YA +V+ LD S+G V+  L +  MLENSI+LF +DNGAP+ G ++N GSN PLRG+K T
Sbjct: 250 LYAEIVAELDRSIGQVVRKLSERQMLENSIILFFSDNGAPTVGPYTNSGSNWPLRGIKLT 309

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVL 370
            ++GG+RG A I+SP LK+   V+ EL H+SDWLPT  AAAG  + D   +DGVNQW  L
Sbjct: 310 NFEGGIRGTATIFSPLLKKRGYVNKELIHVSDWLPTFYAAAGGNLADLGPIDGVNQWPTL 369

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDD--LKYVAGTDNNGQSDEWYGDTDNEIDK 428
           +    + RSEIL NI+  DN      ++  D+   K V G    G  D ++GD+    D 
Sbjct: 370 SLDTPSPRSEILVNINEQDN----TTSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDT 425

Query: 429 --YSPKEVLYSKAGITFNALK----TKLQIKQKHAA-DPKANSSDALRTILTDEKILELR 481
             Y P  VL S+  I    L     T+ QI+   A  D     +D+ R  L         
Sbjct: 426 PPYDPFAVLQSETNIAIQELTQTPITRQQIRVTRAQIDLSWCRNDSFRPPL--------- 476

Query: 482 EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
                         +C+   +PCLF++ +DPCE  N+ E   ++ ++L  ++A +   +V
Sbjct: 477 --------------NCS---QPCLFDLENDPCETTNIIEENPEIAERLLRRIADFFEVLV 519

Query: 542 PPGNKPFDKRADPARWNNIWVPWYDE 567
           P G+   D  A+P R+NN W  W D+
Sbjct: 520 PQGDSTIDPNANPQRYNNTWCTWLDD 545


>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 529

 Score =  421 bits (1083), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 239/557 (42%), Positives = 316/557 (56%), Gaps = 32/557 (5%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           +   L    A  + ++   P+II ILADDLGW DVSF G  QIPTPN+D LA  G+ILN 
Sbjct: 1   MCLILALQTAVFSKSSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNN 60

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           +YVQ LC PSR ALM+G YPIH G+QH V   GEPWGLP    ++P+YLK  GYATH IG
Sbjct: 61  YYVQPLCAPSRGALMSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIG 120

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLG+ +E YTPT RGFDS YGY  G +DYYDH+    +    GLD   N         
Sbjct: 121 KWHLGYHKESYTPTRRGFDSFYGYLNGGEDYYDHTI--LWSNASGLDFWENTTPVRNEGN 178

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            YST+L+T+ A ++I  H+ +KPMFLY +H AVH G+     +AP   +A F  I +  R
Sbjct: 179 HYSTELFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNR 238

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
             +AG V  LD+SVG V+ AL K GML NSIV+F  DNG   +G+  N G N PLRG K 
Sbjct: 239 SIHAGAVYELDKSVGLVMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGYNWPLRGSKE 298

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDV 369
           T W+GG RG A +WSP L ++ ++S+++ HI+DWLPTL +AAG  ++   ++DG + W  
Sbjct: 299 TNWEGGARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAGGNVSTLGNIDGKDMWKA 358

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG--DTDNEID 427
           L++  ++ R E+L NID ++N     +AL V   K V G+ N G  D           +D
Sbjct: 359 LSEDLESPRQEVLINIDPIENS----SALIVGRHKVVLGSFNEGSHDMRMKAPGGSRPVD 414

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
               + +L S+ G     LK    ++Q                 LT       R  A VR
Sbjct: 415 GLD-QMMLSSRTG---KVLKDFYNVRQ-----------------LTVRP--NWRNEAVVR 451

Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP 547
           C+      +  +   P  F++  DPCE NNLA S    L++L  K+  Y   MVPP NKP
Sbjct: 452 CDRYAPPNNFVAASPPYYFDLEHDPCELNNLAASNVTELEELIKKIKEYAKGMVPPANKP 511

Query: 548 FDKRADPARWNNIWVPW 564
            D R  P   + +W PW
Sbjct: 512 LDPRGLPKYNHGLWGPW 528


>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  421 bits (1082), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 233/551 (42%), Positives = 323/551 (58%), Gaps = 56/551 (10%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N    + P+I++I+ADDLGWNDVSFHG+ +IPTPNIDALAYNG+ILN+HYV  LCTPSR+
Sbjct: 7   NDVPKQPPNIVVIMADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRT 66

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           A +TGKYPI  GMQ  V+   EP G+PL + LLP+YL++ GYATH +GKWH+G+  + YT
Sbjct: 67  AFLTGKYPIRTGMQGYVLQPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYT 126

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTE 199
           PT RGFD+  GY+ G   Y++H+     + Y G D  H +  +N+TI     Y TD+ T+
Sbjct: 127 PTRRGFDTFLGYYNGYIHYFNHTILDEEQKYLGYDF-HRIVGENRTIEYRYDYITDIITD 185

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMV 257
              N+I  HN +KP++L ++H A H+G      Q  D  E  A    I D  RR YA +V
Sbjct: 186 EVENIIFSHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVV 245

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           + LDESVG +I ALRK  ML+NSI++F++DNGA + G   N GSN+PLRG+K + ++GG+
Sbjct: 246 ATLDESVGRIIDALRKTDMLKNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGV 305

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKT 376
           RG A I+SP + +  +VS++LFHI+DWLPTL +AAG   +D   LDG +QW  +     +
Sbjct: 306 RGAAYIYSPLIDRLSRVSTQLFHITDWLPTLYSAAGGNSSDLKQLDGFDQWSAIKSAEDS 365

Query: 377 KRSEILHNIDNVDNP--QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
           KR  IL NID  D    QKYY                                 YS    
Sbjct: 366 KRKSILINIDERDTSEYQKYY--------------------------------NYSGNNA 393

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT-ILTDEKILELREFARVRCNYDNK 493
           LY K    +NA+           A P A++  ++ T +L   KI++LR+ A + C   N 
Sbjct: 394 LYPK----YNAINI--------LASPVASAIASISTSMLNANKIMQLRKEATIVCK--NF 439

Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
               N T + CLF++ +DPCE  +L+      +++L   +  YKS +V   N P D    
Sbjct: 440 MDFSNCTNRSCLFDVKEDPCETTDLSAKYPKEVERLNMFIDRYKSVLVKQPNTPIDPAGY 499

Query: 554 PARWNNIWVPW 564
              +NN W+PW
Sbjct: 500 AYHFNNTWIPW 510


>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 579

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 233/548 (42%), Positives = 322/548 (58%), Gaps = 48/548 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+ +ADDLGWNDVSFHGS+QIPTPNIDAL YNG+ILN+HYV    TPSR+A  TG+Y
Sbjct: 31  PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 90

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IGMQ   I  GEP GLPL  K+LP++L+  GY T  IGKWH+G++   YTP  RGFD
Sbjct: 91  PIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFD 150

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
           +  G++     YYD++     +   G DM      D+   G+   Y+TD++T  AIN+I 
Sbjct: 151 TFLGFYNSYISYYDYNYSN--QNMSGYDMHRG---DDPAYGMNREYATDMFTSEAINIIE 205

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPERRTYAGMVSRLD 261
            H  ++P++L L+HLAVH+     P + P     D E    + I +P RR YA MVS+LD
Sbjct: 206 NHELNRPLYLQLSHLAVHS-----PLEQPANVYNDREP---IHIREPNRRKYAKMVSKLD 257

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ESVG +  AL + GML++S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVA
Sbjct: 258 ESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYPLRGTKYTLYEGGVRGVA 317

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
           A+WS  L++  +VS+EL HI+DWLPTL +AAG ++ D   +DG++QW +L++G    R +
Sbjct: 318 ALWSSRLQRAARVSNELVHITDWLPTLYSAAGGDLKDLGEIDGIDQWRLLSEGRGRGRDK 377

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
           +L NID V    K   A+     K + GT  +G  D +YGD+   ++       +  K+ 
Sbjct: 378 LLLNIDEV---LKTEGAI-YSRFKLLRGTIESGYYDGYYGDSGKTLETLPTYTDIVMKST 433

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKGAH 496
           ++        +    H   P    S  +R          LR+ A V+CN    Y  +   
Sbjct: 434 VS--------ESLTYHLGGPVTQPSTMVR----------LRQDATVQCNANFTYYRRQML 475

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
               V  CLF+I +DPCE  N+A++   + + L   L  Y   +V    KP D  ADP  
Sbjct: 476 TPCNVTECLFDIVNDPCETRNIADTYVRIARDLNYYLEQYSQILVKQTKKPIDWSADPRN 535

Query: 557 WNNIWVPW 564
            N  W PW
Sbjct: 536 KNGTWGPW 543


>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 533

 Score =  420 bits (1080), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 234/553 (42%), Positives = 319/553 (57%), Gaps = 48/553 (8%)

Query: 22  LNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           L  +A KKP +II +L DDLGW+DVSFHGS QIPTPN+D LA +G+ILN +YVQ  CTPS
Sbjct: 17  LGVSASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPS 76

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R+ALMTG YPIH G+Q  VI   +P+GLPL  K++P++ K+ GY TH IGKWHLG+FR+ 
Sbjct: 77  RAALMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKE 136

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           YTPT+RGFDS YGY+ G +DYY+HS        Q  ++R    V  K I  Y  +  TE 
Sbjct: 137 YTPTYRGFDSFYGYYNGAEDYYNHSITKVIS--QSYNIRQG-SVTKKRIENYIKN--TEL 191

Query: 201 AINVIAEHNK---SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
            +  +  +     S+P+FLYLA+ +VH      P +AP+E + KF  I +  R  +AG V
Sbjct: 192 VLLSLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAV 246

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD+SVG V+  L +  ML+N+I++   DNG   FG+ SN+G N PLRG K T W+GG 
Sbjct: 247 DALDQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQSNRGYNFPLRGAKGTLWEGGT 306

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL---DGVNQWDVLTKGA 374
           RG A IWSP L Q ++VS ++ HI+DWLPTL +AAG+  N  SL   DG + W  L+   
Sbjct: 307 RGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYDMWTQLSYNL 366

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            + R E+L+NID    P    +ALR  + K V GT  +G+ D+ +  T      Y   E+
Sbjct: 367 PSSRYEVLYNID----PTDLNSALRYTNYKLVLGTYEDGEFDQRFRTTGGS-RPYGDLEI 421

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL---ELREFARVRCNYD 491
                                  A  ++ ++  L+        L     R+ A V C  +
Sbjct: 422 -----------------------AMAQSKAARVLKGFYNRGNFLLPRNWRQKATVNCVSN 458

Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
           N   +  S   P LF++T DPCE NN+A ++  L+  L+ +L  Y S+ + P NKP D R
Sbjct: 459 NGPKNFVSQAPPYLFDLTKDPCELNNIASAEAGLVDFLKQRLQSYASSSIAPLNKPTDPR 518

Query: 552 ADPARWNNIWVPW 564
             P   N  W PW
Sbjct: 519 GYPENLNGTWGPW 531


>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
          Length = 563

 Score =  419 bits (1078), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 223/545 (40%), Positives = 330/545 (60%), Gaps = 40/545 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+II+I+ADDLGWNDVSFHG+ QIPTPNIDALAYNG+IL +HYV  +CTPSR+A +TG
Sbjct: 37  QRPNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTG 96

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YPI  GMQ   +  GE   +PL   LLP+YL++ GYATH +GKWH+G++ + +TP +RG
Sbjct: 97  RYPIRTGMQGYPLKAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRG 156

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM--QVDNKTIGIYSTDLYTEAAINVI 205
           FD+  GY+ G   Y+ H+ +     + G D+ +++   +  K    Y TDL TE A ++I
Sbjct: 157 FDTFLGYYSGYITYFKHTIEQNL--HVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDII 214

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDES 263
             HN+SKP++L L+H+A H+ +     +  DEE   A    I D +RR  AG+V+ +DES
Sbjct: 215 FNHNRSKPLYLQLSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDES 274

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VG V+ ALR+  MLENSI++FM+DNGA + G+ +N GSN+PLRG+K T ++GG+RGVA +
Sbjct: 275 VGRVVQALRQTNMLENSIIVFMSDNGAQTVGLLNNYGSNYPLRGLKFTLFEGGVRGVACV 334

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEI 381
           +SP +K   ++S++L H++DWLPT  +AAG  + D   +LDGV+QW  +    KT+R  +
Sbjct: 335 YSPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIVSEKKTRRKNV 394

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD--TDNEIDKYSPKEVLYSKA 439
           L NI   D  Q    AL +   K + G   N    ++YGD  T     +Y+   VL+S A
Sbjct: 395 LLNI---DEKQALSGAL-MGRYKLINGA--NVSYGDYYGDSGTSESYPEYNVSNVLHSAA 448

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
           G                     A+ S+A       E+I+ELR  A V C+       C  
Sbjct: 449 GSAL------------------ASVSNA---TFNSERIIELRRAATVVCDNFTSYPRC-- 485

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
            +  CLF++  DPCE  +++    +++ +L   +A Y + +    +   D R+ P  +N 
Sbjct: 486 -LDRCLFDVYKDPCETTDVSREHPEVVDELNRFIAEYFAVLKTQTSGLVDPRSYPEHFNG 544

Query: 560 IWVPW 564
            W+PW
Sbjct: 545 TWMPW 549


>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
 gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
          Length = 548

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 243/560 (43%), Positives = 327/560 (58%), Gaps = 33/560 (5%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
           +ADD+G++DVSF GS+   TPNIDALAY+G+ILN  Y  A+CTPSR+AL+TGKYPI+ GM
Sbjct: 1   MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60

Query: 96  QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
           QH VI+  +PWGLP  EK + +  +E GY T  +GKWHLG  +  +TPT RGFD H GY 
Sbjct: 61  QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120

Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKS 211
               DYYD + +   + Y +G D R N+ V +  +G Y TD+ ++AA+ +I +H   N S
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
           +P+FL L+HLA HA N  +P QAP EE+AKF  I +   R YA MVSRLD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240

Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
            +  ML++SIVLF++DNG P+ G HS+  SN+PLRG K++PW+GG+R  AAIWS   ++ 
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSSTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300

Query: 332 QKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
             V  +  +I D LPTL AAAGI  + D  LDG+N W  L  G ++   EI+HNID   +
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHNIDM--D 358

Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGITFNALKTK 449
           P   Y A      K V GT N    D W G+    E+D   P+   Y +  I   ++   
Sbjct: 359 PSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREVD---PRSANYEEL-IRNTSVWQH 414

Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH--CNSTVKPCLFN 507
           LQ+               L     +  I  LR+ A++ C  +  GA   C   + PCLF+
Sbjct: 415 LQL---------------LMGSQNERNITSLRQQAQIECP-EPPGATKPCLPLLAPCLFD 458

Query: 508 ITDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWY 565
           I  DPCE+NNL E    T +   L  ++  + S    P N P D   DP  ++N W  W 
Sbjct: 459 IDADPCERNNLYEDYRNTTIFLDLLQRIEQFASQAHAPNNMPEDPDCDPRFYHNEWTWWQ 518

Query: 566 DELDKQKAIE-TMRLHKPLL 584
           DE     AI  +++   PLL
Sbjct: 519 DEQASGAAIMWSVKRLNPLL 538


>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 593

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 242/582 (41%), Positives = 325/582 (55%), Gaps = 55/582 (9%)

Query: 4   ARKYFFALTCTLLFNDA--FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
            R     L C LLF DA  F +T     PHII+ + DDLGWNDVSFHGS+QIPTPNIDAL
Sbjct: 7   GRSNLLVLCCILLFGDALGFYDTA----PHIIVFMVDDLGWNDVSFHGSNQIPTPNIDAL 62

Query: 62  AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
            YNG+ILN+HYV    TPSR+A  TG YPI IGMQ   I  GEP GLPL  ++LP+YL+ 
Sbjct: 63  GYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGIQGGEPRGLPLNIRILPEYLRG 122

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
            GY T  IGKWH+G+    +TP  RGFD+  G++     YYD+  + +++   G DM   
Sbjct: 123 LGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVSYYDY--RYSYQNMSGYDMHRG 180

Query: 182 MQVDNKTIGI---YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
              DN   G+   Y+TDL+T+ A+ +I  H   +P++L ++HLAVHA     P ++PD++
Sbjct: 181 ---DNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHA-----PIESPDDD 232

Query: 239 -----VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
                  +F  I +  RR YA MV+RLDESVG ++  L   GML +S++LF+ DNGAP+ 
Sbjct: 233 HRNSNRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGMLRDSLILFLTDNGAPTI 292

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           G   N GSN PLRGMK T ++GG+RGVA +WSP L++  +V +EL HI+DWLPT  +AAG
Sbjct: 293 GKFRNWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAARVCNELVHITDWLPTFYSAAG 352

Query: 354 IEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
            ++ D   +DGVNQW +L  G    R  +L NID V   +           K + G+   
Sbjct: 353 GDLKDLGEIDGVNQWRMLNDGEPVARQSLLLNIDEVSRTE----GAIYKSFKLLRGSIEG 408

Query: 413 GQSDEWYGDTDN------EIDKYSPKEVLYSKAGITFNALKTKL-QIKQKHAADPKANSS 465
           G  D ++   +       E  K  P EV       T   LK+ + Q    H   P    S
Sbjct: 409 GHYDGYHTIREPSWIEHVEHKKNEPTEV----PSYTETVLKSLVSQSITYHLGGPVTQPS 464

Query: 466 DALRTILTDEKILELREFARVRCNYD---NKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
                      I +LR+ A V C  +   N+   CN  V  CLF+I +DPCE  N+A+  
Sbjct: 465 ----------TINQLRQEATVHCRSNGSYNRFTTCN--VTECLFDINNDPCETKNIAQQY 512

Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
             + ++L+  L  Y   +      P D  ADP R N  W PW
Sbjct: 513 PRIARELDLFLEKYGRNVTRQMKVPVDWLADPRRTNGTWEPW 554


>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
          Length = 563

 Score =  419 bits (1076), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 218/544 (40%), Positives = 326/544 (59%), Gaps = 40/544 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II+I+ADDLGWNDVSFHG+ QIPTPNIDALAYNG+IL +HYV  +CTPSR+A +TG+
Sbjct: 38  RPNIIVIMADDLGWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGR 97

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YPI  GMQ   +  GE   +PL   LLP+YL++ GYATH +GKWH+G++ + +TP +RGF
Sbjct: 98  YPIRTGMQGHPLDPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGF 157

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM--QVDNKTIGIYSTDLYTEAAINVIA 206
           D+  GY+ G   Y++H+ +     + G D+ +++   +  K    Y TDL TE A ++I 
Sbjct: 158 DTFLGYYSGFMTYFNHTIEQNH--HVGYDLHYDVAGNLSVKYSHEYMTDLITERAEDIIL 215

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDESV 264
            HN SKP++L L+H+A H+ N  +  +  DEE   A    I D +RR  AG+V+ LDESV
Sbjct: 216 NHNHSKPLYLQLSHIAAHSSNINKTVEVRDEEETNATLGYIEDFDRRKLAGVVTALDESV 275

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+ AL++  MLENSI+LFM+DNGA + G+  N GSN+PLRGMK T ++GG+RG A ++
Sbjct: 276 GRVVQALKQANMLENSIILFMSDNGAQTVGLLDNSGSNYPLRGMKFTLFEGGVRGAACVY 335

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEIL 382
           SP +K   ++S++L H++DWLPT  +AAG  + D   +LDGV+QW  +     T+R  IL
Sbjct: 336 SPAIKNPSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIVSEENTRRKNIL 395

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT--DNEIDKYSPKEVLYSKAG 440
            NID  ++     +   +   K + G     Q  ++YGD+       +Y+   VL+S AG
Sbjct: 396 LNIDEKED----LSGALMGRYKLINGA--KLQYGDYYGDSGMSESYPEYNVSNVLHSAAG 449

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
               ++                         L  E+I+ELR  A V C+       C   
Sbjct: 450 FAVASVSN---------------------VTLNSERIIELRRAAMVVCDNFTSYPKC--- 485

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
           +  CLF++ +DPCE  +++    +++ ++   +A Y + ++       D R+ P  +N  
Sbjct: 486 LDRCLFDVYNDPCETTDVSREHPEVVDEINRFIAEYFTVLMTQTPTKTDPRSYPENFNGT 545

Query: 561 WVPW 564
           W+PW
Sbjct: 546 WMPW 549


>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 575

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 235/547 (42%), Positives = 324/547 (59%), Gaps = 48/547 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+I+ADDLGWNDV FHGS+QIPTPNIDAL YNG+ILN+HYV    TPSRSA +TG Y
Sbjct: 27  PHIIVIMADDLGWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLY 86

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IGMQ   I  GEP GLPL  K+LP++L++ GY T  IGKWH+G+    YTP  RGFD
Sbjct: 87  PIRIGMQGDGIRGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFD 146

Query: 150 SHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
              G++     YYD+       + ++ ++G D  H +  +      Y+TDL+T+ A+ +I
Sbjct: 147 FFLGFYNSHISYYDYHYSNQNMSGYDLHRGDDPAHGINRE------YATDLFTKEAVRMI 200

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV--AKFLDISDPERRTYAGMVSRLDES 263
             H   +P++L ++HLAVHA     P + P +E    +F  I +P RR YA MVSRLD+S
Sbjct: 201 ETHELPRPLYLQISHLAVHA-----PLEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDS 255

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VG ++ AL + GML +S++LF+ DNGA S G + N GSN+PLRG K T ++GG+R VAA+
Sbjct: 256 VGRIVHALGEKGMLRDSLILFLTDNGAASVGRYRNWGSNYPLRGTKYTLYEGGVRSVAAL 315

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
           WSP L++  +VS++L HI+DWLPTL +AAG ++ D   +DG++ W VL++G    R ++L
Sbjct: 316 WSPRLQKVARVSNQLIHITDWLPTLYSAAGGDLKDLGQIDGIDLWRVLSEGYGRGRDKLL 375

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDN--EIDKYSPKEVLYSKAG 440
            NID +   QK   A+     K + G+  NG  D +YGD+    EI  Y+   VL S   
Sbjct: 376 LNIDEI---QKTEGAI-YSRFKLLRGSVENGYYDGFYGDSGRTLEIPPYT-DVVLRSSVS 430

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN---YDNKGAHC 497
                     Q    H   P    S           +++LR  A V+C    Y  + A  
Sbjct: 431 ----------QSITYHLGGPVTQPS----------TMIQLRRDATVQCQNMTYHYRHAFT 470

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARW 557
              V  CLF+I +DPCE  N+A++   + + L+  L  Y   ++     P D  ADP R 
Sbjct: 471 TCNVTECLFDIVNDPCETKNVADAYPRIARDLDLYLEHYGHILMRQIKHPVDWLADPRRR 530

Query: 558 NNIWVPW 564
           NN W PW
Sbjct: 531 NNTWEPW 537


>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
 gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
          Length = 548

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 238/548 (43%), Positives = 318/548 (58%), Gaps = 30/548 (5%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
           +ADD+G++DVSF GS+   TPNIDALAY+G+ILN  Y  A+CTPSR+AL+TGKYPI+ GM
Sbjct: 1   MADDMGFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGM 60

Query: 96  QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
           QH VI+  +PWGLP  EK + +  +E GY T  +GKWHLG  +  +TPT RGFD H GY 
Sbjct: 61  QHYVIVNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYL 120

Query: 156 QGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKS 211
               DYYD + +   + Y +G D R N+ V +  +G Y TD+ ++AA+ +I +H   N S
Sbjct: 121 GAYVDYYDQTYQQNGKNYARGHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
           +P+FL L+HLA HA N  +P QAP EE+AKF  I +   R YA MVSRLD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240

Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
            +  ML++SIVLF++DNG P+ G HS   SN+PLRG K++PW+GG+R  AAIWS   ++ 
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTTASNYPLRGQKNSPWEGGLRSSAAIWSTEFERL 300

Query: 332 QKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
             V  +  +I D LPTL AAAGI  + D  LDG+N W  L  G ++   EI+HNID   +
Sbjct: 301 GTVWKQQTYIGDLLPTLAAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHNIDM--D 358

Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWYGD-TDNEIDKYSPKEVLYSKAGITFNALKTK 449
           P   Y A      K V GT N    D W G+    E+D   P+   Y +  I   ++   
Sbjct: 359 PSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREMD---PRSANYEEL-IRNTSVWQH 414

Query: 450 LQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH-CNSTVKPCLFNI 508
           LQ+               L     +  I  LR+ A++ C   +     C   + PCLF+I
Sbjct: 415 LQL---------------LMGSQNERNITSLRQQAQIECPEPSGATKPCLPLLAPCLFDI 459

Query: 509 TDDPCEQNNLAES--QTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
             DPCE+NNL E    T +   L  ++  + S    P N P D   DP  ++N W  W D
Sbjct: 460 DADPCERNNLYEDYRNTTIFLDLLQRIEQFASQAHAPNNMPEDPDCDPRFYHNEWTWWQD 519

Query: 567 ELDKQKAI 574
           E     AI
Sbjct: 520 EQASGAAI 527


>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
          Length = 542

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 237/566 (41%), Positives = 326/566 (57%), Gaps = 51/566 (9%)

Query: 16  LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
               +F N+  P  PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q 
Sbjct: 12  FLTGSFGNSHRP--PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQP 69

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           LCTPSR +L+TGK+PI IG+Q GVI   +P+GL L EKLLP+YLK  GY +H +GKWHLG
Sbjct: 70  LCTPSRGSLLTGKHPIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLG 129

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
           FF + YTP  RGFDSHYG++   +DY  H          GLD   N Q D    G YST 
Sbjct: 130 FFADEYTPMRRGFDSHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTT 184

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP---FQAPDEEVAKFLDISDPERRT 252
           L+T  A  ++AEHN+++PMFLY +H AVH  +  E      AP +   KF  I+D E + 
Sbjct: 185 LFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQH 244

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMK 309
            AGM+S LDESVGN+  +L  +GMLEN+I++F +DNGAP    H N     SN+PLRG K
Sbjct: 245 MAGMISALDESVGNLTKSLHTNGMLENTIIIFSSDNGAP----HENTDVCSSNYPLRGAK 300

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQW 367
           ++ W+GG R  A +WSP LK++  VS ++ HISDWLPTL  AAG  ++    D  GV+QW
Sbjct: 301 TSVWEGGTRVPAFVWSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQW 360

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV----AGTDNNGQSDEWYGDTD 423
             L +   + R+ +L+   N D+ ++   ALRV D+K +    A   ++G    +  +  
Sbjct: 361 QALQENGPSARNSMLY---NADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQ 417

Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
           +++    P    Y             +Q +Q   A         +R   T       + F
Sbjct: 418 SQLPSNEPNNTNY-------------VQEEQSQVASILEELGRGMRNSTT-------KYF 457

Query: 484 ARVRCNYDNKGA---HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
             V C + + G     C  T +PCL+N+T DPCE +NLA+   +++++LE  L       
Sbjct: 458 --VNCGFISSGVTSHSCIDTGRPCLYNVTADPCEYHNLAKDLPEVVERLEASLDDLLKMY 515

Query: 541 VPPGNKPFDKRADPARWNNIWVPWYD 566
           + P NKP D  A P     +W+PW +
Sbjct: 516 IEPINKPADPAAYPKNHGGVWMPWIE 541


>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
          Length = 581

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 241/595 (40%), Positives = 340/595 (57%), Gaps = 53/595 (8%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           A+ C L +         P  PHI+IILADD+GWNDVSFHG+++IPTPNIDALAYNG+ILN
Sbjct: 15  AIVCFLGYGTLAWGKAIPLPPHIVIILADDMGWNDVSFHGANEIPTPNIDALAYNGVILN 74

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
           ++Y   +CTPSRSALMTG+YPI  GMQ   +   EP G+PL   L+P+ ++  GY T  +
Sbjct: 75  KYYTMPICTPSRSALMTGRYPIRDGMQGTPMRPAEPRGIPLNVSLMPEQMRRLGYETRLV 134

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNK 187
           GKWHLG+  E YTP  RGFD+ +GY+ G   YYD+      T E   G D+ H  + D+ 
Sbjct: 135 GKWHLGYTTEDYTPVRRGFDTFFGYYNGFISYYDYWIGWNDTNE-VTGYDL-HRDESDSF 192

Query: 188 TIG---IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY--EPFQA--PDEEVA 240
            +     Y TDL T+ A  +I  +  +KP+FL ++HLAVHAG+    +P +    D+  A
Sbjct: 193 ELAHSSEYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGSKVHDDPLEVRRTDDVNA 252

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
            F  I D + R YAGM++ LDESVG V+ AL++  MLENSI++FM+DNGAP+ G+++N G
Sbjct: 253 SFPYIEDYQHRKYAGMMAALDESVGRVVKALKEAEMLENSIIIFMSDNGAPTVGLYNNTG 312

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
           SN+P+RG+K   ++G  R  A I+SP +K   +VS EL HI DWLPTL  AAG    D  
Sbjct: 313 SNYPMRGIKGGMFEGAARAAACIFSPLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQ 372

Query: 361 --------LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
                   LDGV+QW  +  G  + R  +L NID     Q + AA+ +   K V G    
Sbjct: 373 SQFDGALPLDGVSQWSSIVAGGPSSRQSLLVNIDEA---QGFEAAI-IGRHKLVKGMT-- 426

Query: 413 GQSDEWYGDTDNE--IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
            + D +YG++ N+     Y+ K+VL S AG +   L           A P A        
Sbjct: 427 -KEDGYYGNSGNDPSFPAYNVKKVLSSTAGASIGKLAG--------FASPSA-------- 469

Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
                + L LR+ + + C      A+C+ T   CLF+++ DPCE  +L+     ++K+LE
Sbjct: 470 ----RRALWLRQKSVITCKPFTSAANCSGT---CLFDLSKDPCETRDLSSKLPLIVKKLE 522

Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD-ELDKQKAI-ETMRLHKPL 583
             L  Y+  ++P  N P D    P  +N +++PW   E D    +  +   H+PL
Sbjct: 523 SFLGEYRRVLMPQTNSPQDACGLPKYFNGVYMPWRTIEFDNVFGLTNSTTCHRPL 577


>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
          Length = 549

 Score =  416 bits (1068), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 238/563 (42%), Positives = 330/563 (58%), Gaps = 40/563 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q LCTPSR +L+TGK+
Sbjct: 1   PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IG+Q GVI   +P+GL L EKLLP+YLK  GY +H +GKWHLGFF + YTP  RGFD
Sbjct: 61  PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           SHYG++   +DY  H          GLD   N Q D    G YST L+T  A  ++AEHN
Sbjct: 121 SHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHN 175

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQ---APDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           +++PMFLY +H AVH  +  E      AP +   KF  I+D E +  AGM+S LDESVGN
Sbjct: 176 QTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGN 235

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAI 323
           +  +L  +GMLEN+I++F +DNGAP    H N     SN+PLRG K++ W+GG R  A +
Sbjct: 236 LTKSLHTNGMLENTIIIFSSDNGAP----HENTDVCSSNYPLRGAKTSVWEGGTRVPAFV 291

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKRSEI 381
           WSP LK++  VS ++ HISDWLPTL  AAG  ++    D  GV+QW  L +   + R+ +
Sbjct: 292 WSPLLKKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQWQALQENGPSARNSM 351

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYV----AGTDNNGQSDEWYGDTDNEIDKYSPKEVLY- 436
           L+   N D+ ++   ALRV D+K +    A   ++G    +  +  +++    P    Y 
Sbjct: 352 LY---NADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQSQLPSNEPNNTNYV 408

Query: 437 ----SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV------ 486
               S+        +T L  K++H A     +S   + +     I + R+  R       
Sbjct: 409 QEEQSQVASILEEHET-LAGKRRHEATFWL-ASTLCKQLAEPAVIQQARQNKRRSGRTVP 466

Query: 487 RCNYDNKGA---HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
            C + + G     C +T +PCL+N+T DPCE  NLAE   +++++LE +L       + P
Sbjct: 467 NCGFASTGVAPRSCVNTGRPCLYNVTADPCEYYNLAEYLPEVVERLETRLDDLLKRSIEP 526

Query: 544 GNKPFDKRADPARWNNIWVPWYD 566
            NKP D  A P     +W+PW +
Sbjct: 527 INKPADPAAYPKNHGGVWMPWIE 549


>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
          Length = 613

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 240/599 (40%), Positives = 340/599 (56%), Gaps = 54/599 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + P+II ILADDLGWNDVSFHGSSQIPTP+IDALA  G+IL  +YV  +CTP+RSA+MTG
Sbjct: 26  QPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTG 85

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK-------------WHL 134
           K+PIH G++HGVI    P+GL L EKL+PQYL+E GY TH +GK             + L
Sbjct: 86  KHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGVSHPYGL 145

Query: 135 G--------FFREV-----------YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
           G        + RE+            TP+ RGF+S YGY+ G+ DYY H   +      G
Sbjct: 146 GLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSDGN-MTG 204

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQA 234
            D   +  V     G YST+++TE    +I +HN  +P+++YLAH AVH+ N   +  QA
Sbjct: 205 FDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQRLQA 264

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           P E   +F +I+   RR YA MV+ LD+S+GN+  +L++  +  N++++F  DNG P+ G
Sbjct: 265 PHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSLKESSLYNNTLIVFTTDNGGPANG 324

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
              N  SN PLRGMK T W+GG+RGV  +W   +++  ++S  + H+ DW+PTL   AG 
Sbjct: 325 FDFNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKPGRMSDGMMHVCDWVPTLYGLAGG 384

Query: 355 EINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
             +    LDG++ W +L++G ++ R EILH ID    P    +A+R+ D K V G    G
Sbjct: 385 NTSTLQHLDGIDVWPMLSRGKQSPRQEILHTID----PLLDLSAIRIGDYKLVQGQRGTG 440

Query: 414 QSDEWYGDTDNEIDKYSPK-EVLYS-KAGITFNALKT---KLQIKQKHAADPKANSSDAL 468
           ++  WY      + K SP+ E+L++    +  +A++    KL   Q+          + +
Sbjct: 441 RNG-WY--PPEGVGKQSPRQEILHTIDPLLDLSAIRIGDYKLVQGQRGTGRNGWYPPEGV 497

Query: 469 RTILTDEKILELREFARVRCNYD--NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLL 526
                D K +    F  V C Y   N   +CN   KPCLFNI  DPCE NN+A    D++
Sbjct: 498 SLADLDSKPVP-NAFV-VSCPYKPANASTNCNPYKKPCLFNIRHDPCEFNNIANWNQDIV 555

Query: 527 KQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHKPLLS 585
           + L  ++  Y++TMVP  NK  D R+ P   +N WVPW D  D   A  T R  KP  +
Sbjct: 556 ELLLTRIEEYRATMVPIRNKEPDPRSYPKYHDNCWVPWVD--DTSTAPGT-RFTKPFFN 611


>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
          Length = 502

 Score =  414 bits (1065), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 232/525 (44%), Positives = 309/525 (58%), Gaps = 35/525 (6%)

Query: 11  LTCTLLFNDAFL-----NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           L C  +   +F      N+  P  P+I+ ILADDLGW+DVSFHGS Q+PTPNIDALA +G
Sbjct: 2   LRCAFVLTLSFFLECSGNSEIP--PNIVFILADDLGWDDVSFHGSRQVPTPNIDALASDG 59

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +IL+ +YV  LC+PSR ALMTGK+PI IG+Q GVI+  +P GL L EKLLP+YL   GY 
Sbjct: 60  IILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVIMPAQPSGLGLKEKLLPEYLNTLGYK 119

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H +GKWHLG   E YTP  RGF+SH+G++QG + Y  H C  +     GLD   N + D
Sbjct: 120 SHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCESYTTHMCGNS-----GLDFWLNEEPD 174

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
           +   G YST L+T  A  ++AEH+ + PMFLYLAH AVH GN  + F APD+   K   I
Sbjct: 175 HSAGGQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFI 234

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK-GSNHP 304
           SD  RR  AGM++ LDESVG V  AL   GMLEN+I++F +DNGAP  G   +  GSN P
Sbjct: 235 SDDRRRQMAGMLTALDESVGKVTKALHAKGMLENTIIVFSSDNGAPHIGNSDHTVGSNFP 294

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLD 362
           LRG K + W+GG R  A +WSP  K++  +S +L HISDWLPTL  AAG + +   T+L 
Sbjct: 295 LRGGKPSIWEGGSRVPAFVWSPLFKKSGYISKQLMHISDWLPTLLQAAGYDTSALPTNLY 354

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT 422
           GV+QW  L +   + R  +L+N+  +   +    ALRV D+K +     + ++  WY   
Sbjct: 355 GVSQWQTLQENGPSPRDSVLYNV--IAGCKDKSWALRVGDMKLLRAQGKSKKNAGWYQPF 412

Query: 423 DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
            ++I    PK    S+     + +  +L   Q          +  LR +   EK      
Sbjct: 413 SSDILS-QPK----SRDPAIISVVPAELLQSQ---------VTPILRELGRGEKSSSTEH 458

Query: 483 FARVRCNYDNKGA--HCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
             R+ C     G   +C    KPCLFNIT DPCE +NLA    ++
Sbjct: 459 --RIDCGAPPAGVVYNCIGMDKPCLFNITADPCEYHNLANDMPEI 501


>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 578

 Score =  414 bits (1063), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 237/574 (41%), Positives = 335/574 (58%), Gaps = 48/574 (8%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           RK+   LT   L   +  +     +PHII+ +ADDLGWNDV FHG++QIPTPNIDAL YN
Sbjct: 2   RKFIALLTIAGLLGSSCADFEG--RPHIIVFMADDLGWNDVGFHGATQIPTPNIDALGYN 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           G+ILN+HYV   C+P+R+A +TGKYPI +GMQ   I  GEP GLP+  + LP+YL+  GY
Sbjct: 60  GIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGIAGGEPRGLPVHVQTLPEYLQGLGY 119

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
            T+ IGKWH+G+    + P  RGFD  YG++     YYD+  + +     G DM  N + 
Sbjct: 120 ETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIGYYDY--RYSQGNMSGFDMHINGET 177

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA---K 241
              T G+Y+TD +T+AAI+VI  H+   PM+L ++HLA HA     P   P E+     +
Sbjct: 178 AYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQVSHLAPHA-----PMDVPFEDNPYDDE 232

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           F  IS+P+RR YA MV+RLD+S+G ++++L   GML+NS+++FM+DNG  + G   N  S
Sbjct: 233 FRHISEPKRRAYAKMVARLDDSLGRIVSSLGDRGMLKNSVIVFMSDNGGATIGKFRNWAS 292

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
           N PLRG K T ++GG+R VAA+WSP L    +VS +LFH++DWLPTL   AG ++ D   
Sbjct: 293 NWPLRGTKYTLFEGGVRSVAAMWSPKLLSKGRVSEQLFHVTDWLPTLFQVAGGDMRDLGP 352

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           +DGV+QWDVL+ G    R +IL NID     +   AA+     K V G+  +G    +Y 
Sbjct: 353 IDGVSQWDVLSIGTGNVRDKILLNIDEYTQTE---AAID-KRFKIVRGSLYHG----YYD 404

Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL-E 479
           + + EI +   K  +Y+ + I  +A+   +   ++H   P           +T E ++ E
Sbjct: 405 NVEGEIGR-GHKNPMYNTSRILKSAVSDAI---REHLGVP-----------VTQESVMWE 449

Query: 480 LREFARVRCNYDNKGAH---------CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
           L+  A V C  +              CN T   CLF++  DPCE NN+A+S    L++LE
Sbjct: 450 LQRQATVLCRPNMTNMQYTSRYTLPSCNQTA--CLFDLGIDPCETNNVAKSYPAQLQELE 507

Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
             L  Y + +      P D  ADP RWN  W PW
Sbjct: 508 LFLDKYSNLLTRQIRMPVDYMADPRRWNYTWQPW 541


>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 502

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 217/503 (43%), Positives = 295/503 (58%), Gaps = 29/503 (5%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PHI+ ILADDLGWNDVS+HGS QI TPNIDALA+NG+ LN++Y Q LCTPSRSA +TG
Sbjct: 24  KQPHIVFILADDLGWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTG 83

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP++ GMQH VIL  EP GLPL  KLLPQ+L + GY +  +GKWHLG+++E YTPT RG
Sbjct: 84  CYPMNTGMQHSVILTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRG 143

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPY--QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           F SH G W+G  DYY H    +++ +   G D R +MQ   +  G Y T + TE A+ +I
Sbjct: 144 FQSHVGSWEGFSDYYSHIMDFSWQTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKII 203

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            +H   KP+FLY+AHLAVH+GN  EP +AP +    ++DI  P R  YAGM+S LDESVG
Sbjct: 204 KDHPNEKPLFLYIAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVG 263

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            V  AL + G+L  +IV+   DNGA      S+ GS  P RG K T W+GG+R  A +WS
Sbjct: 264 AVFEALGRRGILNETIVVLTTDNGAGITTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWS 323

Query: 326 PWLKQTQ-KVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILH 383
           P   + +  V   LFH+SDWLPT    AG +  N   +DG++Q D L    K  R+EI+ 
Sbjct: 324 PLFSEHRGAVVESLFHVSDWLPTFYELAGGDPSNLGEIDGISQLDTLRHREKLPRTEIVL 383

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           NID ++N     +A+ V   K + G D  G  D+WY    N + +         +A + +
Sbjct: 384 NIDPIEN----VSAIIVYPFKLMQGDDQGGHYDDWYPFMGN-VTRTQKDAREGCEASVVY 438

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKP 503
             +        KH+        D +    +            V+C   +    C   VKP
Sbjct: 439 RVM--------KHSGFVVTCGQDPVTYATS------------VKCGEKDPTKACKPIVKP 478

Query: 504 CLFNITDDPCEQNNLAESQTDLL 526
           CLF+++ DPCE N+++    ++ 
Sbjct: 479 CLFDLSKDPCEYNDISAENIEVF 501


>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 594

 Score =  411 bits (1056), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 227/536 (42%), Positives = 312/536 (58%), Gaps = 29/536 (5%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           ++++ILADD+GWNDVSF GS QIPTPN+DALA  G+IL  HYVQ  CTPSR+AL+TG YP
Sbjct: 84  NVVVILADDMGWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYP 143

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH GMQH VI   EPWGLPL  KLLPQ+LK+ GY TH IGKWHLG F++ + PT RGFDS
Sbjct: 144 IHSGMQHYVIRSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDS 203

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           H GY+ G  DY+ H+   T++    LD  +  +   ++    Y+T L+T+ A  +I +H+
Sbjct: 204 HLGYYNGYIDYFTHN--HTYKRDSALDFFKDEVPYHSEE---YATRLFTDRAEEIIRDHD 258

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
              P+FLY AHLAVH     +PFQAP E + KF  + D  R T+A M++ LD SVG V+ 
Sbjct: 259 VDNPLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVE 318

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL K G+L+N+I+LF +DNG  +     N G+N PLRG K T ++GG R  A +WSP ++
Sbjct: 319 ALAKKGILDNTIILFSSDNGGQATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIR 378

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++V  ++ HI DWLPT+ + AG +  N   +DG+N W  ++ G  + R+EIL+NID  
Sbjct: 379 RPRRVFYDMVHIVDWLPTIYSLAGGDPRNLGRIDGMNVWGAVSDGMASPRTEILYNID-- 436

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
             P +   A+RV   K + G   N    EWY +       +S  E    K+ +       
Sbjct: 437 --PVQRDLAIRVGRYKLLYGPMYN--LTEWYDNRGFGYASHSVLERYMDKSAVA------ 486

Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
             ++ +      K    D  R         + RE  RV C   +    CN + +PCLF++
Sbjct: 487 --RVLRDLGYWWKGREEDYSR-----RPTWQWREETRVHCGNPSSATACNPSEQPCLFDL 539

Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
             DPCE NNL    T + + L  +L  +  +  P  NKP D    P   N  W PW
Sbjct: 540 ATDPCEYNNLYGQLTQVARYLLSRLEAHARSARPINNKPDDYSFQP---NATWGPW 592


>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
 gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
          Length = 633

 Score =  410 bits (1055), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 218/448 (48%), Positives = 283/448 (63%), Gaps = 33/448 (7%)

Query: 7   YFFALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +F  L C LL   DA  N   P KP+II ILADDLG+NDV FHGS++IPTPNIDALAY+G
Sbjct: 2   WFLWLICLLLPIIDAAENEKPPAKPNIIFILADDLGFNDVGFHGSAEIPTPNIDALAYSG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN++YV  +CTPSRSALMTGKYPIH GMQH V+   EP GLPL EK+LPQYL E GY 
Sbjct: 62  IILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVLYAAEPRGLPLEEKILPQYLNELGYT 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +H  GKWHLG ++  YTP +RGF SH+                      GLDMR+  QV 
Sbjct: 122 SHIAGKWHLGHWKLKYTPLYRGFSSHW----------------------GLDMRNGTQVA 159

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
               G Y+TD+ TE ++ VIA HN +K P+FLY+AH A H+ N Y P   PD +V K   
Sbjct: 160 YDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPDNDVIKMSH 219

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I + +RR +A MVS++D+SVG ++  LRK  MLENSI++F +DNG P+ G + N  SN+P
Sbjct: 220 IPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNMLENSIIIFSSDNGGPAQGFNLNFASNYP 279

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
           L+G+K+T W+GG+R    +WSP LK++Q+VS++  HI DWLPTL  AAG +    +L   
Sbjct: 280 LKGVKNTLWEGGVRAAGLMWSPLLKKSQRVSNQTMHIVDWLPTLLEAAGGQPALANLSKQ 339

Query: 362 -DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
            DG + W  L +   + R  +LHNID++       AA+ V D K V GT+  G  D WYG
Sbjct: 340 IDGQSIWRALVQDKASPRLNVLHNIDDIWGS----AAISVGDWKLVKGTNYRGSWDGWYG 395

Query: 421 DTDNEIDK-YSPKEVLYSKAGITFNALK 447
                  + Y  + V  S+AG     LK
Sbjct: 396 PAGERDPRLYDWQLVGRSRAGKALEGLK 423


>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
          Length = 581

 Score =  410 bits (1054), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 234/555 (42%), Positives = 323/555 (58%), Gaps = 50/555 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+ +ADDLGWNDVSFHGS+QIPTPNIDAL YNG+ILN+HYV    TPSR+A  TG+Y
Sbjct: 31  PHIIVFMADDLGWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 90

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IGMQ   I  GEP GLPL  K+LP++L+  GY T  IGKWH+G++   YTP  RGFD
Sbjct: 91  PIRIGMQGADIRGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFD 150

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIA 206
           +  G++        +    + +   G DM      D+   G+   Y+TD++T  AIN+I 
Sbjct: 151 TFLGFYNSYIS--YYDYSYSNQNMSGYDMHRG---DDPAYGMNREYATDMFTREAINIIE 205

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPERRTYAGMVSRLD 261
            H  ++P++L L+HLAVHA     P + P     D E    + I +P RR YA MVS+LD
Sbjct: 206 NHELNRPLYLQLSHLAVHA-----PLEQPMNVYNDREP---IHIREPNRRKYAKMVSKLD 257

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ESVG +  AL + GML++S++LF+ DNGA S G + N GSN+PLRG K T ++GG+RGVA
Sbjct: 258 ESVGRIFHALGEKGMLKDSLILFLTDNGAASVGRYRNYGSNYPLRGTKYTLYEGGVRGVA 317

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRS- 379
           A+WS  L++  +VS+EL HI+DWLPTL +AAG ++ D   +DG++QW +L++G+   R  
Sbjct: 318 ALWSSRLQKAARVSNELVHITDWLPTLYSAAGGDLKDLGKIDGIDQWRLLSEGSGRGRGR 377

Query: 380 -EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            ++L NID V    K   A+     K + GT+ NG  D +YGD+   ++       +  K
Sbjct: 378 DKLLLNIDEV---LKTEGAI-YSRFKLLRGTNENGYYDGYYGDSGKTLETLPTYTDIVMK 433

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN----YDNKG 494
           + ++        +    H   P    S  +R          LR+ A V+CN    Y  + 
Sbjct: 434 STVS--------ESLTYHLGGPVTQPSTMVR----------LRQDATVQCNANFTYYRRQ 475

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                 V  CLF+I +DPCE  N+AE+   + + L   L  Y   +V    KP D  ADP
Sbjct: 476 MLTPCNVTECLFDIVNDPCETRNIAETYVRIARDLNYFLEEYSHILVKQMKKPIDWSADP 535

Query: 555 ARWNNIWVPWYDELD 569
              N+ W PW    D
Sbjct: 536 RNRNDTWGPWVKSGD 550


>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
 gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
          Length = 472

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 217/487 (44%), Positives = 289/487 (59%), Gaps = 32/487 (6%)

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           MTGKYPIH GMQH V+   EP GLPLTEKLLPQYLKE GY  H  GKWHLG +   +TP 
Sbjct: 1   MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            RGFDSH G+W G    +DH+   T     GLDMR    V     G Y+T +  + ++  
Sbjct: 61  ERGFDSHVGFWTGHHHMFDHTAVETNA--WGLDMRRGFDVAYDLHGYYTTHVIRDESVAA 118

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  HN S+PMFLY++H A H+ N Y+   APDE V +   IS+  RR +A M++ LD+SV
Sbjct: 119 IRAHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTELDQSV 178

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G +I AL + GML+NSI++F  DNG P+ G ++N  SN PLRG K+T W+GG+RG   IW
Sbjct: 179 GAIIVALSERGMLDNSIIVFSTDNGGPAEGFNNNAASNWPLRGTKNTLWEGGVRGAGCIW 238

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAKTKRSEIL 382
           SP +++ ++VS ++ HISDWLPTL  AAG ++N   ++LDG++ W  L  G  T+R EIL
Sbjct: 239 SPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLNKLPSNLDGISVWPELRNGDVTRRHEIL 298

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGI 441
           HNID++       AAL VDD K V GT+  G+ D WYG   D++   Y+   +     G 
Sbjct: 299 HNIDDIWGS----AALTVDDWKVVKGTNYEGKWDAWYGPAGDHDTKSYNLSSIFGCPTGK 354

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
             + LK                       +  +E+I++LR  A V+C    +   CN   
Sbjct: 355 ALSKLK----------------------MLPPEEEIIKLRREATVQCANGTRNK-CNPLE 391

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
            PCLF++ +DPCE  NLA+   D+L+ L  KL+ Y +T VPPGN P D R DP  W   W
Sbjct: 392 SPCLFDLFNDPCEFENLADQYPDILEVLLKKLSDYNATAVPPGNMPLDSRGDPRFWGYTW 451

Query: 562 VPWYDEL 568
             + DEL
Sbjct: 452 HNFGDEL 458


>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 554

 Score =  409 bits (1051), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 227/541 (41%), Positives = 302/541 (55%), Gaps = 30/541 (5%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +PHI+ ILADDLGW+DVSFHGSSQIPTPN+D LA +G+ILN +YV   CTPSR+ALM
Sbjct: 32  AKMRPHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALM 91

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG YPI  GMQ   I   EPWGLP   ++LPQYLKE GY TH +GKWHLG ++E  TPT 
Sbjct: 92  TGLYPIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTC 151

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGFDS YGY+ G  DY+ H+   ++E + GLD   N +     IG YST ++T+ A  +I
Sbjct: 152 RGFDSFYGYYYGESDYFAHTI--SYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYII 209

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
               KSKP+ L + H A H     E  QAP E + KF  I +  R  YAGMV  LD+SVG
Sbjct: 210 ENRTKSKPLLLVITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVG 269

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            V+ AL   GMLEN+I++F +DNG   +G H+++G N PLRG K T W+GG R  A +WS
Sbjct: 270 QVVKALSDAGMLENTIIVFSSDNGGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWS 329

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHN 384
           P L + ++VS +L HI+DWLPT  + AG        LDG + W  L+ G ++ R E+L+N
Sbjct: 330 PLLARRRRVSHQLMHITDWLPTFYSIAGGNAAKLVELDGHDMWRPLSHGRRSPRVELLYN 389

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
            D      KY  +  + + +Y    D  G+    + D         P+  L   A  +  
Sbjct: 390 YD-----YKYTKSAALRNHRYKLVLDETGR----FADRYKVPGGSRPRHDLDYLAAKSTA 440

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
           A   +   K+K    PK+                  R  AR+ C    K     S     
Sbjct: 441 ASVLRDFYKKKQLNFPKS-----------------WRRKARLTCGKQKKINFSWSATSVY 483

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKPFDKRADPARWNNIWVP 563
           LF++  DPCE NNLA    +++  L+ +L  Y +  +P   N  +D R+ P      W P
Sbjct: 484 LFDVVTDPCELNNLASKLPNVVASLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAP 543

Query: 564 W 564
           W
Sbjct: 544 W 544


>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
          Length = 591

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 223/545 (40%), Positives = 317/545 (58%), Gaps = 30/545 (5%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII+I+ADDLGW+DV FHGS QI TPNIDAL YNG+ILN+HYV    TPSR+A  TG+Y
Sbjct: 30  PHIIVIMADDLGWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQY 89

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI +GMQ   I  GEP G+PL  ++LP++L++ GY T  IGKWHLG++   +TP  RGFD
Sbjct: 90  PIRMGMQGEDIQGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFD 149

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S  G++     YY++  K +F+   G DM         +   Y TD +T+ AI +I  ++
Sbjct: 150 SFLGFYNSHVSYYNY--KYSFQNMSGYDMHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYD 207

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEV---AKFLDISDPERRTYAGMVSRLDESVGN 266
            S+P++L ++HLAVHA     P + P +     ++FL I +  RR YA MV RLD SVG 
Sbjct: 208 PSRPLYLQISHLAVHA-----PLEGPQDYNHYDSQFLHIREINRRKYAKMVLRLDNSVGR 262

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++ AL   GML+NS++LF+ DNGA   G   N GSN+PLRGMK T ++GG+RGVA +WSP
Sbjct: 263 IVQALGNKGMLKNSLILFLTDNGAAPIGKFRNYGSNYPLRGMKYTLYEGGVRGVALLWSP 322

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            L++T +VS EL H++DWLPTL + AG ++ D   +DG++QW +L     + R  +L NI
Sbjct: 323 RLRKTARVSDELMHVTDWLPTLYSIAGGDVRDLGEIDGIDQWCMLNGSLPSARDRLLLNI 382

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
           D V   +           K V G+   G  D ++ D    I++  P  ++      T   
Sbjct: 383 DEVSKTE----GAIYKQFKLVRGSIEGGLYDGYFYD---HIERNVPIMLIMPSDPKTI-- 433

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTD-----EKILELREFARVRCNYDNKG-AHCNS 499
              +  +        ++  S+ + + L D       I++LR  A V C   N+    CN 
Sbjct: 434 --VQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREATVNCRSRNRSFLTCN- 490

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
            V  CLF++ +DPCE NN+AE  + + ++L+  L  Y   +      P D  ADP + NN
Sbjct: 491 -VTECLFDVNNDPCETNNIAEQYSRISRELDKSLEQYWHIVKRQMKIPVDWLADPRKTNN 549

Query: 560 IWVPW 564
            W PW
Sbjct: 550 TWEPW 554


>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 557

 Score =  406 bits (1044), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 235/570 (41%), Positives = 342/570 (60%), Gaps = 49/570 (8%)

Query: 8   FFALTCTLLFNDAFLNTTAPKK--PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
            F   C ++  +   N+ A  K  P+I++I+ADDLGWNDVSF+G  +IPTPNID+LAYNG
Sbjct: 13  LFIFVCDVILKNISRNSQATSKQPPNIVVIMADDLGWNDVSFNGGDEIPTPNIDSLAYNG 72

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +ILN+HYV  +CTPSR+A  TG+YPI  GMQ   +   EP  +PL   LLPQYL++ GYA
Sbjct: 73  VILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPLQGAEPRSIPLNNILLPQYLRKLGYA 132

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           TH +GKWH+G+    +TPT RGFD+  GY+ G  +Y+ H+     +   G D+ H    D
Sbjct: 133 THLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIEYFSHNLVENGQ--SGYDI-HRSVGD 189

Query: 186 NKTIGI---YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT-----YEPFQAPDE 237
           N TI     Y TDL T+ A N+I+ HN +KP++L LAHLA HA           ++A ++
Sbjct: 190 NHTIEYRYDYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATND 249

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG-IH 296
            +    DI+   RR +A  V  +DES+G V+ ALR+  ML+NSI++FM+DNGAP+   I 
Sbjct: 250 TLGYIRDIN---RRKFASAVVTMDESIGRVVDALRRADMLKNSIIVFMSDNGAPTKDQIL 306

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            N GSN+PLRGMK + ++G +RGVA I+SP +    +VS++LFHI+DWLPTL AAAG + 
Sbjct: 307 YNFGSNYPLRGMKQSFFEGAVRGVACIYSPLIDFPSRVSTQLFHITDWLPTLYAAAGGDA 366

Query: 357 ND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
           +D  +LDG++QW ++      +R  +L NI+   + +   AAL +   K +    +  + 
Sbjct: 367 SDLKALDGIDQWSMIKNADNGRRQFLLVNINEKTDSK---AAL-IGRYKLIR---DQSEY 419

Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANS-SDALRTILTD 474
            ++YG T         K+  Y +  +T N L           A P A++ +D  R+ L  
Sbjct: 420 TKYYGSTG--------KDPSYCEYNVT-NVL-----------ASPVASAIADISRSSLDI 459

Query: 475 EKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLA 534
           +KI +LRE +RV C   ++ + C  T + CLF+I  DPCE  N++     +LK+L   + 
Sbjct: 460 DKIKQLREKSRVVCRV-SRFSRC--TKRTCLFDIKKDPCETTNVSSKYPKILKKLNALID 516

Query: 535 IYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
            Y+S +V   NKP D    P  +N+ W+PW
Sbjct: 517 SYQSILVNQTNKPVDPAGFPCNFNDTWMPW 546


>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  405 bits (1042), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 234/558 (41%), Positives = 329/558 (58%), Gaps = 41/558 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + P+I++I+ADDLGWNDVSFHG+ +IPTPNIDALAYNG+ILN+HYV  +CTPSR+A +TG
Sbjct: 2   QPPNIVVIIADDLGWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTG 61

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KYPI  GMQ   +   EP G+ L   LLP+YL++ GYATH +GKWH+G+    Y PT RG
Sbjct: 62  KYPIRTGMQGYPLQGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRG 121

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGI---YSTDLYTEAAIN 203
           FD+  GY+ G   Y++H+    +E  Q G D+ H +  D+  I     Y TDL T+ A N
Sbjct: 122 FDTFAGYYNGYIQYFNHTL---YESEQLGYDL-HRIIGDDHKIEYRYDYMTDLITDEAEN 177

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMVSRLD 261
           +I+ HN +KP++L +AHLA H+ +  E  +  +  E  A    I D  RR YA +V+ LD
Sbjct: 178 IISSHNPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLD 237

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ESVG VI AL+K  ML NSI++F++DNGA + G   N GSN+PLRG+K + ++GG+RG A
Sbjct: 238 ESVGRVIDALKKTDMLNNSIIVFISDNGAQTEGFLQNYGSNYPLRGLKFSLFEGGIRGAA 297

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSE 380
            I+SP + +  +VS++LFHI+DWLPTL +AAG   +D   LDG +QW  +     +KRS 
Sbjct: 298 CIYSPLIDRPSRVSNQLFHITDWLPTLYSAAGGNSSDLKQLDGFDQWSAIKSAKNSKRSS 357

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
           IL NID  +N +   AAL +   K V  T    +  ++Y         YS    LY K  
Sbjct: 358 ILINIDEKENSE---AAL-IGHYKLVTDT---SEYQKYY--------NYSGNNALYPKYN 402

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
           +T   L +         +    N+S          KI++LR+ A + C   N     N T
Sbjct: 403 VT-TILASPAASAIASISISTLNAS----------KIMQLRKEATIICK--NFTDFSNCT 449

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
            + CLF++  DPCE  +L       +K+L   +  YKS ++   N P D    P  +N+ 
Sbjct: 450 NRSCLFDLNKDPCETTDLFSKHPKDVKRLNMFIDRYKSVLMNQTNAPVDPAGFPRNFNDT 509

Query: 561 WVPWYDELDKQKAIETMR 578
           W+PW    D  ++I  +R
Sbjct: 510 WMPWLP--DDYQSISILR 525


>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 510

 Score =  405 bits (1041), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 233/545 (42%), Positives = 304/545 (55%), Gaps = 41/545 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+I+ IL DDLGW DVSFHGS+QIPTPNID LA +G+ILN +YV   CTPSR+ALMTG
Sbjct: 1   KRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTG 60

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YPIH GMQ  +I    PWGLPL  K+LPQ+ ++ GY  + IGKWHLGFF+  Y P  RG
Sbjct: 61  LYPIHTGMQSDIIEPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRG 120

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD+ +G++ G  DYY+H+  +     QG+ +   +      I      L T++ +NV  +
Sbjct: 121 FDTFFGFYTGSNDYYNHTSGSE-NNEQGVSLVATLTAAWGRIAQGVEPLLTDSPLNVYPQ 179

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
                P F Y +H AVH+    EPFQAP   V KF  I +  R  YAGM+  LDESVG V
Sbjct: 180 -----PFFCYFSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRV 234

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           + AL   GMLE++I++F +DNG    G  SN+G N PLRG K T W+G +R  A +WSP 
Sbjct: 235 VEALDNAGMLEDTIIVFSSDNGGAPCGEASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPK 294

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
             +  +VS++L HISDWLPTL  AAG +++    +DGV+ W  LT  + +KR EIL NID
Sbjct: 295 FLKQSRVSNQLMHISDWLPTLYTAAGGDVSALGVIDGVDMWHSLTHSSASKRKEILLNID 354

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDT--DNEIDKYSPKEVLYSKAGI 441
              N     + LR    K V G   +G+ ++ Y   G T   N ID+   +         
Sbjct: 355 ASFN----ISGLRYKQYKLVVGGGFDGELNDHYYFTGGTRPTNNIDELRNES-------- 402

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
           T   +  K   KQ+ +  P                 L   E   V C  D    +     
Sbjct: 403 TMARVLKKFYEKQRRSWSP-----------------LTWSENVAVDCRRDRLTENFVPMQ 445

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
            P LFNI  DPCE NNLA+++   L+ L  KL  Y ST+VP   +P D RA P     IW
Sbjct: 446 PPYLFNIEQDPCELNNLAKTKVHTLRFLMKKLDAYNSTVVPSLFQPDDPRAYPEYHGGIW 505

Query: 562 VPWYD 566
            PW D
Sbjct: 506 SPWLD 510


>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
          Length = 545

 Score =  405 bits (1040), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 221/550 (40%), Positives = 309/550 (56%), Gaps = 45/550 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I+ ILADDLGW+D+S HGS QIPTPNIDALA +G++LN +YVQ +CTPSR+AL+TG
Sbjct: 25  QRPNIVYILADDLGWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K+P+H+G+QH  I     +GL L E++LP+YL   GY +H +GKWHLG+F   +TPT+RG
Sbjct: 85  KHPVHLGLQHNTIPAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRG 144

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           F SH+GY  G +DY DH+    F        GLD  ++ ++   + G YST+++T  A  
Sbjct: 145 FKSHFGYLNGCEDYLDHTLAYDFSTLGMDGWGLDFWNDTKIHRTSFGQYSTEIFTTRAEE 204

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGN---TYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           +I   N  +P+FLY++H AVH+GN        QAP +   KF  I   ERR  AGM+S L
Sbjct: 205 LI-RSNTGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNYIQSDERRRLAGMISAL 263

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVGN+   L   GML N+I++F  DNG P+     N  SN PLRG K T W+GG+RG 
Sbjct: 264 DDSVGNITRTLHAKGMLNNTIIVFSTDNGGPADRFDDNCASNWPLRGGKRTMWEGGLRGN 323

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKR 378
             +WSP LK +  VS     I DWLPTL  AAG ++     D  G++QW  L++ + + R
Sbjct: 324 GFVWSPLLKHSNYVSEHFMSIMDWLPTLLDAAGYDLGKLPRDVEGMSQWKSLSENSPSPR 383

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
             +LHNID    P      LR+ D+K V G  N      WY           P E+ +  
Sbjct: 384 DYLLHNID----PLTGEGGLRLGDMKVVFGGGNTKAHRGWY-----------PPEMRWGN 428

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTD--EKILELREFARVRCN--YDNKG 494
             ++                 P       L  I  D     ++ +   +V C     +  
Sbjct: 429 ETVS----------------APSHLFETDLWDIFHDLGRPPVQKKISVQVECGPAPPDVT 472

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
            +C S ++PCLFNI+ DPCE +NL+E Q D+L +L   +    +T+ PP +   D    P
Sbjct: 473 TNCKSWIEPCLFNISADPCEYHNLSEDQPDMLGKLLSIVKAINATVTPPYDNSDDPNGYP 532

Query: 555 ARWNNIWVPW 564
            R   +W PW
Sbjct: 533 GRHGGVWKPW 542


>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 571

 Score =  403 bits (1035), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 228/573 (39%), Positives = 329/573 (57%), Gaps = 58/573 (10%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F+   T +F   +       +PHI++I+ADD+GWNDV FHGS++IPTPNIDALAY G+I
Sbjct: 10  LFSFILTPIFGPIY------GQPHIVVIIADDMGWNDVGFHGSNEIPTPNIDALAYGGVI 63

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LN+HY    CTPSR+A +TG++PI +G+Q   +   EP G+PL E+LLP+YL+E GY T 
Sbjct: 64  LNRHYALPTCTPSRTAFLTGRHPIRMGLQGIPMNVAEPRGVPLHERLLPEYLRELGYVTR 123

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---- 183
            +GKWHLG++ + +TPT RGFDS  GY+ G+  Y++H+   T + + G+D   +      
Sbjct: 124 LVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVITYFNHT--VTKDKHTGIDYHWDTSGKIE 181

Query: 184 -VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD-EEVAK 241
             DN     Y TD  ++ A  VI  H++ KP+FL LAH+A HA    +P +  +  EV  
Sbjct: 182 PFDNDQ---YVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNMTEVND 238

Query: 242 FLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNK 299
            L  I D  RR YAG+V+ +D+SVG V+ AL+   ML NSI++FM+DNG+P+    ++N 
Sbjct: 239 TLSYIPDINRRKYAGVVTAMDDSVGRVVKALKDANMLSNSIIIFMSDNGSPTAEAPYTNY 298

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
           GSN+PLRG+K+T ++GG+R  A ++SP LK   +VS ELFHI+DW PTL   AG +++  
Sbjct: 299 GSNYPLRGIKATVFEGGVRVPACVFSPRLKDRFRVSDELFHITDWFPTLYKLAGGDLSKI 358

Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
             LDGV+QW  ++   K+ R  +L NID V NP+   +       K + G +   + D++
Sbjct: 359 QDLDGVDQWSSISGSQKSNRESLLVNIDEVSNPEAAISGY----YKLIRGIN---RYDDY 411

Query: 419 YGDTDNEIDKYSPKE-----VLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILT 473
           YG   N+   YSPK      VL S AG    +L  +    QK                  
Sbjct: 412 YGKDGND---YSPKTYDVTGVLSSLAGRAIASLGNQYLPPQK------------------ 450

Query: 474 DEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
             +I ELR  A +RC   +    C  T   CLF+I  DPCE  N+ +   ++   L   +
Sbjct: 451 --RITELRNKATLRCEKKDDRPSCRDT---CLFDIVKDPCETTNIIDEHPEMEIVLNTYI 505

Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
             YK+ ++   N   D    P  +N  W+PW D
Sbjct: 506 QGYKAVLMNQTNTYIDPAGYPENFNGYWMPWMD 538


>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 527

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 220/536 (41%), Positives = 310/536 (57%), Gaps = 34/536 (6%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II +L D LGW+DVSFHGS QIPTPN+DALA +G+ILN +YVQ +CTPSR+ALMTG Y
Sbjct: 23  PNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALMTGMY 82

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PIH G+QHGV+L  EP GLPL  K++P+Y K+ GY TH IGKW+LG++ + YTPT+RGFD
Sbjct: 83  PIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTYRGFD 142

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S YG++   +DY+ H+ +        ++  + M     +  +Y T L  +   ++++   
Sbjct: 143 SFYGFYNYEEDYFTHNLEF-------VNQSNAMVWRPSSFCVYLT-LSPDTGFDLLSASI 194

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           +  P FLYL+H +VH  +  +P QAP+E +AKF  I D  R  YAGMV  LDES G+V+ 
Sbjct: 195 ERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDALDESAGDVLE 254

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL + GML N+I++  + NG  S G+ SN GSN PLRG K   W+GG R  A IWSP L 
Sbjct: 255 ALYEAGMLANTIIVMSSANGGLSSGVESNAGSNFPLRGGKGALWEGGTRASAFIWSPLLY 314

Query: 330 QTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           Q  +VS ++ HI+DWLPTL AAA G   N  +LDG N W+ L+    + R E+L+NID +
Sbjct: 315 QKNRVSDQMMHITDWLPTLYAAAQGNPFNLGTLDGQNLWNHLSYNLPSPRYELLYNIDPI 374

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
           D      +A+R  + K V G  +  + +E    T      Y   ++  +++       K 
Sbjct: 375 DQT----SAIRFLNYKLVLGESSGSEENEHRLKTPGNSRPYRDLDIAMAQS-------KA 423

Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
              ++  H            R IL  +     R+ A + C  D++  +      P LF++
Sbjct: 424 ARVLRNFHN-----------RGILFMDS--NWRQKAALNCG-DDERENVVPGAPPYLFDL 469

Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
             DPCE  N+   +   +  L  +L IY+ T + P NK  D R  P   N  W PW
Sbjct: 470 AKDPCEMRNIVLIEPGWVSILTLRLHIYEQTSIAPSNKQIDPRGFPENLNGTWAPW 525


>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 476

 Score =  402 bits (1034), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 209/515 (40%), Positives = 296/515 (57%), Gaps = 58/515 (11%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           FF L C L      + T  P  P+II ++ADDLGWNDVSFHGSSQIPTP+IDALA  G+I
Sbjct: 13  FFLLGCML----ECVYTAQP--PNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YV  +CTP+RSA+MTGK+PIH G+Q+ VI+  EP+GL   E ++PQYL+  GY TH
Sbjct: 67  LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLGFF+E  TP+ RGF+S+YGY+ G+QDY+ H  ++T     G D   N  +   
Sbjct: 127 MVGKWHLGFFKESLTPSHRGFESYYGYYGGMQDYFTH--ESTEHTLTGFDFHVNGSIYKP 184

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQAPDEEVAKFLDIS 246
             G YST++YTE    +I  HN  +P+++YLAH AVH+ N   +  QAP +   +F +I+
Sbjct: 185 VFGQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNIT 244

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           +  RR +A MVS LD+S+GN+   L++  +  N++++F  DNG P+ G  +N  +N PLR
Sbjct: 245 NENRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPLR 304

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
           G+K T W+GG+RG   +W   +++  + S  + H+ DW+PTL   AG   +    LDG++
Sbjct: 305 GVKDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGID 364

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
            W +L++   + R EILHNID V N     +A+R+ D K V G + NG   +WY      
Sbjct: 365 VWPMLSRAEPSPREEILHNIDPVRN----VSAIRIGDYKLVQGQNYNGSWSDWY------ 414

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
                                             P+  SS     +  D K +       
Sbjct: 415 ---------------------------------PPEGESS-----VDVDSKPVPNAFVVS 436

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
                 N   +C+   KPCLFNI  DPCE NN+A+
Sbjct: 437 CPSKPANASTNCDPKEKPCLFNIRHDPCEFNNIAD 471


>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 470

 Score =  401 bits (1030), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 206/486 (42%), Positives = 282/486 (58%), Gaps = 34/486 (6%)

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P+    QH VILE EPWGLPL E LLP++L + GY +HA+GKWHLG+F++ YTPT+RGF 
Sbjct: 5   PLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFK 64

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S YG+W G QDYY H  +ATF  ++G DMR ++  D  ++G YST L+T+ A ++I +HN
Sbjct: 65  SFYGFWNGYQDYYTHMVQATFASFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIITKHN 124

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
            S P+FLYLAHLA HAG    P QAP E++  F  I D  RR YAGM+  LD SV  V  
Sbjct: 125 NSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAEVFN 184

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL  + ML+N+I +F++DNGAP+ GIH N GSN PL+G K+TPW+GG+R  A +WS  L 
Sbjct: 185 ALHTNNMLDNTIFVFVSDNGAPTNGIHRNYGSNWPLKGEKATPWEGGIRTAAFVWSKLLS 244

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + + + + L HISDWLPTL  AAG    D  ++DG++ W+  +   K+ R +ILHNID++
Sbjct: 245 KKKTLPNPLMHISDWLPTLYQAAGGNTQDLGAIDGISMWESFSNNIKSPRKQILHNIDDI 304

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
                 YAA+R  + KY+ G+   G  D W G       +Y+   +L S          T
Sbjct: 305 TG----YAAIRDTNFKYIKGSTFLGYLDYWSGSLSPSTIRYNVDAILNS----------T 350

Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
              I  +   D            LT   I  LR  + VRC+++     C   +KPCLF+I
Sbjct: 351 TAGILSQINGDS-----------LTPSLIESLRNSSTVRCDFEKDMKPCKPFIKPCLFHI 399

Query: 509 TDDPCEQNNLAESQTDLLKQ--------LEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
             DPCE  NL       +++         E  L  ++ +   P N    K A+PA +NN 
Sbjct: 400 VKDPCELVNLNYKPNSKMRKFVEAKIDYFETSLGKFRESASKPMNVRGTKDANPALYNNT 459

Query: 561 WVPWYD 566
           W+ W D
Sbjct: 460 WISWDD 465


>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
 gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
          Length = 485

 Score =  400 bits (1028), Expect = e-108,   Method: Compositional matrix adjust.
 Identities = 195/384 (50%), Positives = 258/384 (67%), Gaps = 9/384 (2%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GWNDVSFHGS QIPTPN+D+LAY+G+IL  +YV  +CTP+RSA+MTG++PIH G+QHGVI
Sbjct: 3   GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
               P+GLPL E +LPQYLK  GYATH +GKWHLG     +TPTFRGFDS++GY  G  +
Sbjct: 63  SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122

Query: 161 YYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFL 216
           YYDH+   +  P    Y+GLD+R+  +      G YST+L+   A  +I  H+ SKP+FL
Sbjct: 123 YYDHTDDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFL 182

Query: 217 YLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
           YL H AVH+GN   P QAP + + KF  I  P RRT+A MVS LD++VGNV  AL   GM
Sbjct: 183 YLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARGM 242

Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           LENS+++F  DNG P+ G   N  SN PLRG+K+T W+GG+ G   + SP +KQ ++ + 
Sbjct: 243 LENSVIIFTTDNGGPAAGFDQNYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQPKRTTH 302

Query: 337 ELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYY 395
           EL H+ D LPT+   AG +  +  +LDG N W+ +++G ++ R E+LHNID    P++  
Sbjct: 303 ELLHVCDLLPTIYELAGGDSTELKNLDGTNVWETISRGVQSPRVEVLHNID----PKRKT 358

Query: 396 AALRVDDLKYVAGTDNNGQSDEWY 419
           AALR  D K + G    G  D WY
Sbjct: 359 AALRYGDYKIILGEAYKGAWDGWY 382



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 32/60 (53%)

Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELD 569
           +DPCE NN+A+    L+  L  +L  Y  T VP  N+P D R +P     +W PW +  D
Sbjct: 398 EDPCEFNNIADWNKPLVNFLMGRLEAYSETAVPIRNQPTDPRGNPKYHGGVWSPWINLTD 457


>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
          Length = 531

 Score =  397 bits (1019), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 224/581 (38%), Positives = 326/581 (56%), Gaps = 60/581 (10%)

Query: 11  LTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           + C L     F+ T++     KKP+IIII+ADDLG+NDVSFHGSSQIPTPN+  +A  G+
Sbjct: 1   MVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRGI 60

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           IL++ Y Q+ CTPSR+AL+TG+YPI  GMQ   +  GE   LPL    +P + +  GY T
Sbjct: 61  ILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYKT 120

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQ 183
           H +GKWHLG   +  TP  +GFDSH+GYW G   Y+D+   +  +     +GLD+    +
Sbjct: 121 HLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQFE 180

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA--K 241
               + G Y+T+L+TE +++VI  H+   P+FL ++HLA H G        PD +    +
Sbjct: 181 PVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHE 240

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           F  I DP RR YAG+VS LD S+G ++A L +  ML+NSIVLF +DNGA + G++ N GS
Sbjct: 241 FSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSGS 300

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS- 360
           N PLRG+K + ++GG+R  A I+SP   +   VS  L HISDWLPTL +AAG ++     
Sbjct: 301 NWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAGGDVAHLGQ 360

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           +DG++QWD LT    + R+EIL NID VD   + +A +R D  K + G+ + G  D++YG
Sbjct: 361 IDGIDQWDALTNNNPSNRTEILINIDEVD---ENFAIIR-DKFKLIQGSYHEGTFDQYYG 416

Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
           D+    +  +P                           +P   ++D              
Sbjct: 417 DSGRGPENPTP---------------------------NPNHTTTDL------------- 436

Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
                  C   ++    N T K CLF++  DPCE  N+ ES+ ++  QL +K+A +   +
Sbjct: 437 -----SWCRAPDQTPILNCT-KGCLFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKEL 490

Query: 541 VPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHK 581
           VP  NK  D ++DP  +NN W  W ++    K     R+ +
Sbjct: 491 VPQRNKDTDPKSDPIFYNNTWCTWLEDASCHKTTNIERISR 531


>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 593

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 228/578 (39%), Positives = 304/578 (52%), Gaps = 65/578 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +PHI+ ILADDLGW+DVSFHGSSQIPTPN+D LA +G+ILN +YV   CTPSR+ALM
Sbjct: 32  AKMRPHIVFILADDLGWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALM 91

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG YPI  GMQ   I   EPWGLP   ++LPQYLKE GY TH +GKWHLG ++E  TPT 
Sbjct: 92  TGLYPIRTGMQGMPIDVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTC 151

Query: 146 RGFDSHYGYWQGLQDYYDHS------------------CKA------------------- 168
           RGFDS YGY+ G  DY+ H+                  C+                    
Sbjct: 152 RGFDSFYGYYYGESDYFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTI 211

Query: 169 TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
           ++E + GLD   N +     IG YST ++T+ A  +I    KSKP+ L + H A H    
Sbjct: 212 SYENHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALE 271

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
            E  QAP E + KF  I +  R  YAGMV  LD+SVG V+ AL   GMLEN+I++F +DN
Sbjct: 272 RERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDN 331

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
           G   +G H+++G N PLRG K T W+GG R  A +WSP L + ++VS +L HI+DWLPT 
Sbjct: 332 GGGPWGEHNSRGFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTF 391

Query: 349 CAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
            + AG        LDG + W  L+ G ++ R E+L+N D      KY  +  + + +Y  
Sbjct: 392 YSIAGGNAAKLVELDGHDMWRPLSHGRRSPRVELLYNYD-----YKYTKSAALRNHRYKL 446

Query: 408 GTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDA 467
             D  G+    + D         P+  L   A  +  A   +   K+K    PK+     
Sbjct: 447 VLDETGR----FADRYKVPGGSRPRHDLDYLAAKSTAASVLRDFYKKKQLNFPKS----- 497

Query: 468 LRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLK 527
                        R  AR+ C    K     S     LF++  DPCE NNLA    +++ 
Sbjct: 498 ------------WRRKARLTCGKQKKINFSWSATSVYLFDVVTDPCELNNLASKLPNVVA 545

Query: 528 QLEDKLAIYKSTMVPP-GNKPFDKRADPARWNNIWVPW 564
            L+ +L  Y +  +P   N  +D R+ P      W PW
Sbjct: 546 SLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAPW 583


>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 511

 Score =  395 bits (1016), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 221/492 (44%), Positives = 285/492 (57%), Gaps = 47/492 (9%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GW+DVSFHGS+QIPTPN+D LA +G+ILN +YVQ  CTPSR+ALMTG YPIH GMQHGV+
Sbjct: 2   GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EP+GLPL+  ++PQYLK  GY TH +GKW+LG ++  YTPTFRGFDS YGY+  ++D
Sbjct: 62  SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121

Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
           YY+H+    ++   GLD   N Q      GIYST LYTE    +I   + SKP FLYL +
Sbjct: 122 YYNHT--VLWDNQTGLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFLYLPY 179

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            AVH GN  +P QAP E + KF  I +  R  +AGM   LDES+G V  AL    ML+N+
Sbjct: 180 QAVHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADMLDNT 239

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           I++  ADNG   FG  SN+G N PLRG K T W+GG R  A +WSP LK+   VS ++ H
Sbjct: 240 IIVLSADNGGLPFGFESNRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRKGVVSDQMMH 299

Query: 341 ISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
           I+DW+PTL +AAG        LDG + W  L  G+ + R EIL+NID +D      +ALR
Sbjct: 300 ITDWVPTLYSAAGGNPQTLGKLDGTDMWVPLAYGSASPRYEILYNIDPIDR----NSALR 355

Query: 400 VDDLKYVAGTDNNGQSDEW---------YGDTDNEIDKYSPKEVLYSKAGITFNALKTKL 450
             + K V G  N G  D+          YGD D  + +    +VL    G  +  LK   
Sbjct: 356 YTNYKLVLGIANGGAYDQRYLTTGGSRPYGDLDLLMAQSEAAQVLREYYGTGY--LKFPD 413

Query: 451 QIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITD 510
           Q +Q+                            A + C  +    +  S   P LF++T 
Sbjct: 414 QWRQR----------------------------ATLTCG-NKVPKNFVSLSPPYLFDLTA 444

Query: 511 DPCEQNNLAESQ 522
           DPCE NN+A SQ
Sbjct: 445 DPCEINNIANSQ 456


>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 548

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 230/560 (41%), Positives = 317/560 (56%), Gaps = 42/560 (7%)

Query: 13  CTLLFNDAFLNTTAPKK---PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           C+L+F    L T A  K   PHI+ ILADDLGW+DVSFHGSSQIPTPNID LA +G+ L+
Sbjct: 21  CSLVF----LATVALAKKMPPHIVFILADDLGWDDVSFHGSSQIPTPNIDVLAADGITLH 76

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
            +YVQ +CTPSR+ALMTG YPI  GMQH VI   EPWGLPL  KL+P++LK+ GY+TH +
Sbjct: 77  NYYVQPMCTPSRAALMTGLYPIRTGMQHWVIRSPEPWGLPLELKLMPEHLKDLGYSTHLV 136

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GK      + + +      +                    ++ + GLD R+  +  +   
Sbjct: 137 GKVLFDLKKFIVSVNRLCINESTEVCHTFVSAVTLCIYFVYKSHAGLDFRNGEEPFHNDT 196

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G Y+T L+T+ AI++I +HN++KP+FLYL+HLA H     EP QAPDE V KF  I + +
Sbjct: 197 GQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIGEED 256

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R  YAGMV  LD SVG V+ AL + GMLENSIV F +DNGA  FG  SN+G N PLRG+K
Sbjct: 257 RTIYAGMVDALDVSVGRVVEALSRAGMLENSIVAFSSDNGAVPFGFRSNRGFNWPLRGIK 316

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWD 368
           ++ W+GG+R    +WSP L+++ ++S+++ HI+DWLPTL AAAG    N   LDG N W+
Sbjct: 317 ASVWEGGVRVPGFVWSPLLRKSARLSTQMMHITDWLPTLYAAAGGNTKNLGQLDGHNMWE 376

Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
            L++G ++ R+EIL NID    P    A LR    K +    N        G TD  +  
Sbjct: 377 ALSRGWRSPRTEILLNID----PITGSAGLRFGKHKLLIPGVN--------GTTDAHVPT 424

Query: 429 YS-PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF---A 484
              P+             + T +Q         ++ ++  LR     + I   R +   A
Sbjct: 425 TGYPRPT---------EDMDTMMQ---------QSLAAKVLRRFYNVDGIWFRRGWENSA 466

Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPG 544
            V C      ++  S     LF+I  DPCE  NLA ++   ++ +  KL  Y + M+PP 
Sbjct: 467 EVDCGPAEWQSNFVSGRVYNLFDIERDPCEMINLAATRRGTVRMMMRKLQSYAAAMIPPR 526

Query: 545 NKPFDKRADPARWNNIWVPW 564
           NK  D R+ P +   +W  W
Sbjct: 527 NKEEDPRSFPEKHGGVWTTW 546


>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 528

 Score =  393 bits (1009), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 218/561 (38%), Positives = 304/561 (54%), Gaps = 83/561 (14%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A   P+I++I+ DDLGW+D+S HGS QIPTPNID LA  G++L  +Y QA+CTPSR ALM
Sbjct: 17  AASSPNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALM 76

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKYPIH+G+Q+ VI   +P+GLP   K++PQYL    Y +H IGKWHLG  R    PT 
Sbjct: 77  TGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTR 136

Query: 146 RGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGF SH+G+  G  DY++H    S     E Y GLD+  N     K  G Y+ DL+T+ A
Sbjct: 137 RGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRA 196

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DISDPERRTYAGMVSRL 260
           I+++  HNK+ P+FLYLAH AVH G+   P QAP   V KF   I D +R+ YA MVS +
Sbjct: 197 ISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDEKRKHYAAMVSAM 256

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D S+G ++  +  +G  EN+I+    DNG P+ G+ S+  SNHPLRG K T W+GG RG 
Sbjct: 257 DASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGSSNHPLRGCKYTLWEGGTRGS 316

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
           A  W P  K        + HI+D LPTL  A+G  +   + DG + WD L+  A+ ++  
Sbjct: 317 ALFWYP-KKLGAGTYKGMAHITDVLPTLLTASGNAVAG-NFDGYDLWDALSTNAQPRKET 374

Query: 381 I---------------LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
           +               L+NID    P  +  A+R  D K +     NG+   WY      
Sbjct: 375 VEPQRTYGNEHDFHSLLYNID----PVTHSWAVRSGDFKLITDNALNGEYSGWY------ 424

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
                                             P   S +  RT               
Sbjct: 425 ----------------------------------PTVESVNLTRT--------------- 435

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           V+C   +K   C+ + +PCLFN+  DPCE+N++ + + D++++L  KL  Y+++M P  N
Sbjct: 436 VKCGSASKA--CDGSKRPCLFNLAADPCEENDIFDERPDMVRKLMVKLNEYQTSMRPMEN 493

Query: 546 KPFDKRADPARWNNIWVPWYD 566
           +P    ADPAR++ +W+PW D
Sbjct: 494 RPNSAAADPARFDFVWMPWED 514


>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 513

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 221/569 (38%), Positives = 313/569 (55%), Gaps = 72/569 (12%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           +  F L C   F    ++  +   K P++++I+ DDLGW+D+  HGSSQIPTPNID LA 
Sbjct: 3   RILFFLFCLRSFGTCDISGPSGETKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAE 62

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
            G++L+ +Y Q +CTPSR++LMTGKYP+ +G+QH VI    P+GLP   K++PQYL +  
Sbjct: 63  EGVVLDNYYTQPICTPSRASLMTGKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKN 122

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEPYQGLDMR 179
           Y  H +GKWHLG  R  + PT RGF  H+GY  G  D+Y H            + GLD+ 
Sbjct: 123 YDCHIVGKWHLGHSRSEFLPTRRGFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLW 182

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           HN     +  G YSTD+YT  + +++  HNKS+P+FLYLA+ AVHAGN  +  QAP   V
Sbjct: 183 HNEVPAKEFNGKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIV 242

Query: 240 AKF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
            +F   I +  RR YA MVS +D ++GNV+ A+R +G   N++V F  DNG P      +
Sbjct: 243 DRFSSSIRNDRRRRYAAMVSAVDTAIGNVMGAIRANGFAGNTLVFFTNDNGGPINANDRS 302

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
            GSN+PLR  K T W+GG+RG    W+P + +  K    L HI D LPT+ +AAG+  + 
Sbjct: 303 PGSNYPLRAGKFTLWEGGVRGTGIFWAPQVLKPGKFGG-LSHIVDVLPTILSAAGMS-SV 360

Query: 359 TSLDGVNQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
             LDGV+ W  L+ K ++  R+E+L NID VD    + AA R+ D K +    +NG  D 
Sbjct: 361 PELDGVSLWKSLSEKRSENPRTELLLNIDPVD----HTAAYRLGDFKLIK---SNGLMDG 413

Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKI 477
           W+   D                                      AN S   RTI      
Sbjct: 414 WFRTPD-------------------------------------AANHS---RTI------ 427

Query: 478 LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
                    +C  D+    C    +PCLFNI +DPCEQ+++     D++K++ ++L  Y+
Sbjct: 428 ---------KCKLDDPVFICRGVDRPCLFNIAEDPCEQDDVYADNPDIVKEIMERLHRYR 478

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPWYD 566
             M P  +K   + ADPAR++ IW+PW D
Sbjct: 479 HEMHPVEDKNVTRLADPARFDYIWMPWED 507


>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 479

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 205/516 (39%), Positives = 292/516 (56%), Gaps = 57/516 (11%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           FF L C L      + T  P  P+II ++ADDLGWNDVSFHGSSQIPTP+IDALA  G+I
Sbjct: 13  FFLLGCML----ECVYTAQP--PNIIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YV  +CTP+RSA+MTGK+PIH G+Q+ VI+  EP+GL   E ++PQYL+  GY TH
Sbjct: 67  LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDN 186
            +GKWHLGF+ + +TP  RGF+S +GY+ G QDY+ H  +   +    G D   N  +  
Sbjct: 127 MVGKWHLGFYSKEHTPIERGFESTFGYYLGQQDYFTHETQVKRKHTLTGFDFHVNGSIYK 186

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGN-TYEPFQAPDEEVAKFLDI 245
              G YST++YTE    +I  HN  +P+++YLAH AVH+ N   +  QAP +   +F +I
Sbjct: 187 PVFGQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNI 246

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
           ++  RR +A MVS LD+S+GN+   L++  +  N++++F  DNG P+ G  +N  +N PL
Sbjct: 247 TNENRRKFAAMVSALDDSLGNITQTLKESSLYNNTVIVFTTDNGGPAHGFDANYANNWPL 306

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
           RG+K T W+GG+RG   +W   +++  + S  + H+ DW+PTL   AG   +    LDG+
Sbjct: 307 RGVKDTTWEGGLRGAGFLWGALIEKPGRTSDGMMHVCDWVPTLYGLAGGNTSTLQHLDGI 366

Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDN 424
           + W +L++   + R EILHNID V N     +A+R+ D K V G + NG   +WY     
Sbjct: 367 DVWPMLSRAEPSPREEILHNIDPVRN----VSAIRIGDYKLVQGQNYNGSWSDWY----- 417

Query: 425 EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
                                              P+  SS     +  D K +      
Sbjct: 418 ----------------------------------PPEGESS-----VDVDSKPVPNAFVV 438

Query: 485 RVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
                  N   +C+   KPCLFNI  DPCE NN+A+
Sbjct: 439 SCPSKPANASTNCDPKEKPCLFNIRHDPCEFNNIAD 474


>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
          Length = 563

 Score =  391 bits (1004), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 226/599 (37%), Positives = 326/599 (54%), Gaps = 77/599 (12%)

Query: 10  ALTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
            + C L     F+ T++     KKP+IIII+ADDLG+NDVSFHGSSQIPTPN+  +A  G
Sbjct: 15  VMVCVLGLLLYFILTSSKPSTAKKPNIIIIIADDLGYNDVSFHGSSQIPTPNLAKMATRG 74

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +IL++ Y Q+ CTPSR+AL+TG+YPI  GMQ   +  GE   LPL    +P + +  GY 
Sbjct: 75  IILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPLKAGENRSLPLNMPTMPLHFQNLGYK 134

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNM 182
           TH +GKWHLG   +  TP  +GFDSH+GYW G   Y+D+   +  +     +GLD+    
Sbjct: 135 THLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVGYFDYVSFSKMDNGTLVKGLDLHDQF 194

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-- 240
           +    + G Y+T+L+TE +++VI  H+   P+FL ++HLA H G        PD +    
Sbjct: 195 EPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNH 254

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
           +F  I DP RR YAG+VS LD S+G ++A L +  ML+NSIVLF +DNGA + G++ N G
Sbjct: 255 EFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQMLDNSIVLFFSDNGAQTVGMYENSG 314

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
           SN PLRG+K + ++GG+R  A I+SP   +   VS  L HISDWLPTL +AAG ++    
Sbjct: 315 SNWPLRGVKFSDFEGGVRVAATIYSPLFHKKGYVSEHLIHISDWLPTLYSAAGGDVAHLG 374

Query: 361 -LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN-------- 411
            +DG++QWD LT    + R+EIL NID VD   + +A +R D  K + GT N        
Sbjct: 375 QIDGIDQWDALTNNNPSNRTEILINIDEVD---ENFAIIR-DKFKLIQGTPNYYYQQTLL 430

Query: 412 ---------NGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKA 462
                     G  D++YGD+    +  +P                           +P  
Sbjct: 431 FNCKSGSYHEGTFDQYYGDSGRGPENPTP---------------------------NPNH 463

Query: 463 NSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQ 522
            ++D                     C   ++    N T K CLF++  DPCE  N+ ES+
Sbjct: 464 TTTDL------------------SWCRAPDQTPILNCT-KGCLFDLDKDPCETTNIIESE 504

Query: 523 TDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELDKQKAIETMRLHK 581
            ++  QL +K+A +   +VP  NK  D ++DP  +NN W  W ++    K     R+ +
Sbjct: 505 PEIANQLYEKIAQFWKELVPQRNKDTDPKSDPIFYNNTWCTWLEDASCHKTTNIERISR 563


>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
 gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
          Length = 380

 Score =  389 bits (998), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 189/365 (51%), Positives = 255/365 (69%), Gaps = 7/365 (1%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  KPHII+I+ADDLGW+DVSFHGS QIPTPN+D LA  G+ILN +YV  +CTP+R++LM
Sbjct: 1   AQNKPHIIVIVADDLGWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLM 60

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKYPIH+GMQH VI   +P+GLPL E  LPQYL+  GY T  IGKWHLGFF + YTPT+
Sbjct: 61  TGKYPIHLGMQHFVIYAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTY 120

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFDS YG W    DY++H+   +FE  + G DMR+NM+        Y+T+++T  A+ V
Sbjct: 121 RGFDSFYGMWSAKADYWNHT---SFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKV 177

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ERRTYAGMVSRLDES 263
           I  HNKS+P+FLY+AH A H+ N ++P QAP+++V KF  + D  ER+ YA MV+ +D+S
Sbjct: 178 IENHNKSEPLFLYIAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDS 237

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +G V  AL K+ ML NS++LF  DNG    G + N+GSN+PLRG K   W+GG+RG A I
Sbjct: 238 IGEVFRALEKNRMLNNSVILFTTDNGGAPHGFNRNQGSNYPLRGGKDMMWEGGVRGTAFI 297

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN--DTSLDGVNQWDVLTKGAKTKRSEI 381
           +S  +K   +VS++L  ++DW+PTL   AG      +  +DG N W+ ++    + R E+
Sbjct: 298 YSDLIKHKGRVSTDLIDVTDWVPTLYYLAGGTPGYLEPHMDGKNVWETISNKEPSPRKEV 357

Query: 382 LHNID 386
           LHNID
Sbjct: 358 LHNID 362


>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
          Length = 784

 Score =  386 bits (991), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 220/552 (39%), Positives = 303/552 (54%), Gaps = 70/552 (12%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GW+DVSFHGS QIPTPNID+LA +G+ILN +YV      +++  MTGKY  H+G QHGV+
Sbjct: 1   GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL-- 158
              +P+GLP TEK+LPQYLKEAGY  +A+GKW LG+++E   P  RGFD  YG   GL  
Sbjct: 61  HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYG---GLTS 117

Query: 159 --QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFL 216
             +DYY HS    F+   GLD+R N +V +   G Y T++YT  A+N+I  +N +KP+FL
Sbjct: 118 SGKDYYTHSA---FDENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFL 174

Query: 217 YLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGM 276
           Y+AH AVH GN  +P QAP+  + K   I + +R+ +AGMV  LDESV N+  AL + G+
Sbjct: 175 YVAHQAVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGL 234

Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           L+N++++F  D G    G   +  SN PLRG K T W+GG+R VA ++S  +K   +VS 
Sbjct: 235 LDNTVIVFTTDAGGAVGGQELSSASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNKGRVSL 294

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA 396
           E++H +DWL T    AGI      +DG + W  +++G  + R EILH  D +    K   
Sbjct: 295 EMYHSTDWLLTTLGLAGINTKGIKVDGFDVWKSISEGEVSPRFEILHQFDPL---TKNLC 351

Query: 397 ALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKH 456
           +LRV D K V   D  G   +WY                                     
Sbjct: 352 SLRVGDYKLVINQD-IGFYGDWY------------------------------------- 373

Query: 457 AADPKANSSDALRTILTDEKILELREFARVRC--NY-----DNKGAHCNSTVKPCLFNIT 509
              P+      LR +    K +E    A+V C  NY     +     C+   KPC+FNI 
Sbjct: 374 ---PRPAEIGELRYL----KRIETLPNAKVTCDKNYPHPFLEMHAPPCDPMKKPCMFNIQ 426

Query: 510 DDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDELD 569
            DPCE +NLAE   + LK L D+L  Y+ T+V P     D +ADP   + +W PW +   
Sbjct: 427 WDPCEFHNLAEFMPNTLKVLLDRLQFYRLTLVKPTYPSVDSQADPENSDGVWSPWVNS-- 484

Query: 570 KQKAIETMRLHK 581
              +I  M  H+
Sbjct: 485 ---SIALMTFHQ 493


>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 557

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 208/497 (41%), Positives = 302/497 (60%), Gaps = 29/497 (5%)

Query: 27  PKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           PKKP ++++ILADD+GWND SFHGS++IPTPN+DALA +G+IL  HY Q +CTP+R+AL+
Sbjct: 76  PKKPTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALL 135

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG YP H GMQ+ VI  GEPWGLPL  K+LP YL EA Y +H +GKWHLG     + PT 
Sbjct: 136 TGLYPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTA 195

Query: 146 RGFDSHYGYWQGLQDYYDHS-CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           R F++H GY+ G  DY+ H       +   GLD  H  + +    G Y+T L+T+ A+N+
Sbjct: 196 RHFNTHVGYYNGFIDYFTHEHISPGNDSLIGLDW-HINEENENEEG-YATHLFTKRAVNL 253

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H  ++P+F+ L+HLA HAG   +PFQAP E + KF  I D  R+ YA MV  LD+SV
Sbjct: 254 IENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQSV 313

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+ AL +  +L++++++F++DNG  + G+ +N GSN+P RG K T ++GG R  A +W
Sbjct: 314 GQVVEALYRKRLLDSTMIVFLSDNGGQTTGVMNNTGSNYPFRGQKRTLFEGGTRVSAFVW 373

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
           S  + +  ++ S L H++DWLPT+   AG+      LDG +QW +L++GA + R+E+L+N
Sbjct: 374 STDIVKKPRIESGLMHVTDWLPTILKRAGLSY-PRELDGRDQWSMLSEGAPSARAEVLYN 432

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQS--DEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           ++ VD      AALR D  K + G   NG S  ++W  D  ++      +        +T
Sbjct: 433 LNYVDQS----AALRNDRFKLIVGP-RNGVSLLNDW--DQLHKGKHSRAQLDGMMDESLT 485

Query: 443 FNALK-TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
              LK   L  K +H     A+ S               R+ A ++C   +K   C+   
Sbjct: 486 ARTLKEAGLWWKMRHGDTLSADHS-------------TWRDKAEIKCGSHDKTL-CDVHR 531

Query: 502 KPCLFNITDDPCEQNNL 518
            PCLF++ +DPCE NN+
Sbjct: 532 SPCLFDLENDPCELNNV 548


>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
 gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
          Length = 584

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 220/569 (38%), Positives = 305/569 (53%), Gaps = 100/569 (17%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKPHIIII+ADDLGW+DVSFHGS QIPTP ID  A  G+ILN +YV  +CTPSR+++MTG
Sbjct: 25  KKPHIIIIVADDLGWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KYPI++GM                                    WHLGFF + YTP +RG
Sbjct: 85  KYPINLGM------------------------------------WHLGFFTKEYTPVYRG 108

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS YG+W    DY++HS       + G+D+R NM+      G Y T+L+T  A+ VI  
Sbjct: 109 FDSFYGFWNAKTDYWNHSSYEN--NFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEA 166

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---------ISDPERRTYAGMVS 258
           H+ S P+FLY+AH AVH  N  EP QAP +++   L          I D +R+ YA MV+
Sbjct: 167 HDTSTPLFLYVAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVT 226

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD+SVG++ AAL K  ML +S+V+F  DNG   +G++ N+GSN PLRG K   W+GG++
Sbjct: 227 SLDQSVGDIFAALSKRHMLRDSVVIFTTDNGGAPYGLNWNRGSNFPLRGGKDMLWEGGVK 286

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKT 376
           GVA ++S  +KQ  +VS EL  ++DW+PT+   AG   E    ++DG N W  +++GA +
Sbjct: 287 GVAFVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDGKNVWSTISEGAPS 346

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R EILHNID    P + +A LR    K V G D+  +   WY       D+Y       
Sbjct: 347 PRDEILHNID----PWRKFAGLRKGKYKIVQGMDDTYKGVGWY-------DRYP------ 389

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC--NYDNKG 494
                              HA              L+  K  EL   A + C   +D + 
Sbjct: 390 ------------------GHA--------------LSSMKQPELLPGAVIDCKKTFDEER 417

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
              +S  K CLF++ +DPCE ++L+    ++L +++ +L  YK+  +PP   P +K A+P
Sbjct: 418 KCDSSDGKFCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFPPINKAANP 477

Query: 555 ARWNNIWVPWYDELDKQKAIETMRLHKPL 583
           A +   W PW    DK   I   R  K L
Sbjct: 478 ANFGGFWSPWKRLKDKSSVINVKRALKKL 506


>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
          Length = 513

 Score =  383 bits (984), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 219/539 (40%), Positives = 308/539 (57%), Gaps = 42/539 (7%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM 95
           +ADD+G NDV F+GS QIPTP+IDALAYNG+IL++ Y Q  CTPSR+AL+TG+YPI +GM
Sbjct: 1   MADDVGRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGM 60

Query: 96  QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
           Q   I  GE   LPL    +PQYLK  GY TH +GKWHLG+     TP  RGFDSH+GYW
Sbjct: 61  QGLPIRAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYW 120

Query: 156 QGLQDYYDHSCKATFE-----PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK 210
            G   Y++++  A +E       +G D+   +    +  G Y+TDL+T  A+ +I EHN 
Sbjct: 121 NGFVGYFNYT--AVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNS 178

Query: 211 SKPMFLYLAHLAVHAGNTYEPFQAPDEEVA--KFLDISDPERRTYAGMVSRLDESVGNVI 268
            KP+FL LAHLA H G        PD   A  +F  I +P+RR YA +VSRLD+SVG ++
Sbjct: 179 EKPLFLVLAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIV 238

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             L +  +L+NSI+LF++DNGA + G++ N GSN PLRG+K T ++GG+R  AAI+SP  
Sbjct: 239 EKLDEKNLLDNSIILFLSDNGAQTIGVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLF 298

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
            Q   VS EL HI+D+LPT  AAAG  +     +DGVNQW+VL++    KR EIL N + 
Sbjct: 299 HQKGYVSKELIHITDFLPTFYAAAGGNLEYLGKIDGVNQWEVLSQQKPGKRDEILVNFNE 358

Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
           +DN     +A+ +   K V G     +  +++GD+       +P++  Y+     F A+ 
Sbjct: 359 IDN----VSAIIMGQYKLVQGRLAFPEYGDYHGDSGR-----NPEDPPYN-----FTAVL 404

Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
           +K  I Q++                   KI  LR    +      K        + CLFN
Sbjct: 405 SK-NINQQNR-----------------NKIESLRNQTDLSWCRGEKNQPLIDCTEGCLFN 446

Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
           +  DPCE  N+ + +  +  +L  +L  +   +VP     FD  ++P  +N  W  W D
Sbjct: 447 LESDPCETTNIIQEEKIIAGKLLGRLFEFSQILVPQQVAIFDPESNPEFYNGTWCTWLD 505


>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
          Length = 531

 Score =  377 bits (969), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 225/556 (40%), Positives = 312/556 (56%), Gaps = 46/556 (8%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F ++ C      + + T   K PHI+ IL DDLGW D S HGS+QIPTPN+DALA  G++
Sbjct: 7   FISVACV---AGSSVKTPFRKPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTGVL 63

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LN +YVQ LC+PSR ALMTG YP H G++  ++   +  GLPL  K+LP++LK+ GY TH
Sbjct: 64  LNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV-GAQVAGLPLQFKILPEHLKDLGYETH 122

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLG+F   YTPT+RGFD+ +G+  G  DYY    +   E + GLD  +       
Sbjct: 123 IVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPIDYYRGIMEQ--EGHVGLDFWNGTSALPL 180

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
               Y+T      A ++IA  N SKP+FLYLAH AVH+  + E  QAP E   KF  I D
Sbjct: 181 KERTYATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRD 240

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLR 306
             R+  AGM+  LD+SVG ++  L K G+LE++I++F  DNG  P      N+G N PLR
Sbjct: 241 SSRKILAGMMDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRADPNRGFNWPLR 300

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
           G+K T W+GG+RG A IWSP L+ +++VS++L HISDWLPTL +AAG ++     LDG++
Sbjct: 301 GIKGTVWEGGIRGTAFIWSPRLEVSRRVSTQLMHISDWLPTLYSAAGGDVQTLGPLDGID 360

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
            W  L+ G  + R E+L   +N    +   AALR  + K V GT ++  S  +       
Sbjct: 361 MWQSLSTGGPSPRIEVLLEFNN----ETDRAALRYLNHKIVLGTFSDEVSKRY------- 409

Query: 426 IDKYSPKEVLYS-KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL------ 478
                  EV+   + GI  +AL+              + +   L+    + ++       
Sbjct: 410 -------EVIGGVRPGIDLDALQR------------NSKAGRVLKRFYGNPQLFLHGFDK 450

Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
             R  A VRCN   +  + NS+ K  LF++  DPCE  NLA+S   LL  L  KL  Y  
Sbjct: 451 SWRRRATVRCNEHGR-TNFNSSSKVFLFDLESDPCELRNLADSNVTLLTDLFVKLGAYLR 509

Query: 539 TMVPPGNKPFDKRADP 554
           T+ P  NK  D +  P
Sbjct: 510 TLRPEINKEIDPKGFP 525


>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
 gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
          Length = 324

 Score =  373 bits (958), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 177/304 (58%), Positives = 223/304 (73%), Gaps = 1/304 (0%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  +   LLF       T P  P+IIIIL DD+G+NDVSFHGS+QI TPNIDA AYNG+I
Sbjct: 6   FTTILSALLFLRQVAGVTQPTNPNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVI 65

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LN++YV  LCTPSR+AL+TGKYPIH GMQH V +  EPWGLPL E+L+PQ+ ++AGY+T 
Sbjct: 66  LNRYYVPNLCTPSRAALLTGKYPIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQ 125

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDN 186
            +GKWHLGF+R+ +TP  RGFD H+GY+ G  DYYDH+       Y  G D R ++Q  +
Sbjct: 126 LVGKWHLGFWRQDHTPIMRGFDHHFGYYNGYIDYYDHTHYMLDRNYTAGADFRRDLQRCH 185

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
              G Y+T+ +T+ A  +I +H+ S+P+F+ L+HLAVH GN  +P QAP EEVAKF+ IS
Sbjct: 186 SDNGTYATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHIS 245

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           DP+RRTYAGMVS LD+SVG  I AL   GML NSIVL  +DNGAP+ GIHSN GSNHP R
Sbjct: 246 DPKRRTYAGMVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPTLGIHSNSGSNHPFR 305

Query: 307 GMKS 310
           G+ +
Sbjct: 306 GVST 309


>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
          Length = 500

 Score =  371 bits (953), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 221/526 (42%), Positives = 307/526 (58%), Gaps = 34/526 (6%)

Query: 11  LTCTLLFNDAFLNTTA-----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           L+C L  + A +  T+     P  PHI+ IL DDLGW D S HGS+QIPTPN+DALA  G
Sbjct: 2   LSCLLFISVACVTGTSAKTPYPTPPHIVFILVDDLGWADTSLHGSAQIPTPNLDALASTG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           ++LN +YVQ LC+PSR+ALMTG YP H G++   ++  +  GLPL  K+LP++LK+ GY 
Sbjct: 62  VLLNNYYVQPLCSPSRAALMTGLYPAHNGIRM-PLMGAQVAGLPLQFKILPEHLKDLGYE 120

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           TH +GKWHLG     YTPT+RGFD+ +G++ G  DYY    +   E + GLD  +  +  
Sbjct: 121 THMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPIDYYHGIMEQ--EGHIGLDFWNGTRAL 178

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
                IY+T  + + A  +IA  N SKP+FLYLAH AVH+    E  QAP E   KF  +
Sbjct: 179 PLEERIYATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFL 238

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHP 304
            D  R++ AGMV  LD+SVG ++  L K G+LE++I++F  DNG  P      N+G N P
Sbjct: 239 GDASRKSLAGMVDALDQSVGELVEELEKAGILEDTILVFSTDNGGHPYTRSDPNRGFNWP 298

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
           LRG+K T W+GG+RG A IWSP L+ + +VS++L HISDWLPTL +AAG  + N   LDG
Sbjct: 299 LRGIKGTVWEGGIRGTAFIWSPRLELSSRVSTQLMHISDWLPTLYSAAGGNVQNLGRLDG 358

Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           ++ W  L+ G  + R+E+L   +N    +   AALR  + K V G  +N  S+ +     
Sbjct: 359 IDMWQSLSTGGPSPRTEVLLEFNN----ETDRAALRYLNHKIVLGIYSNEVSERY----- 409

Query: 424 NEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREF 483
                   + V   + GI  + L+   +   +  A    N    LR    D+     R  
Sbjct: 410 --------EVVGGVRPGIDLDVLQRNSK-AGRVLARFYGNPQLFLRGF--DKS---WRSR 455

Query: 484 ARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQL 529
           A VRCN +N   + NS+ +  LF++  DPCE +NLA+S   LL  L
Sbjct: 456 ATVRCN-ENGPTNFNSSSQVFLFDLETDPCELHNLADSNLTLLTDL 500


>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
 gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
          Length = 488

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 219/556 (39%), Positives = 304/556 (54%), Gaps = 76/556 (13%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           AL+  LL +   L+      PHI+ IL DDLGW D+ +HGS  I TPNI+ LA +G+IL+
Sbjct: 6   ALSLVLL-SAQILSEANAIPPHIVFILVDDLGWFDLGYHGSV-IRTPNINQLAGDGIILD 63

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
            +YVQ LCTP+RSALMTGKYPIH+G QHGVIL G+P GLPL    LP+ LK+ GYATH +
Sbjct: 64  NYYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQGYATHIV 123

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GKWHLGF++E + PT RGFDS YGY+ G +D++ H+          LD R N  +     
Sbjct: 124 GKWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNVLGF------LDFRDNDLIVKDQK 177

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           G Y T  +T+ A++ I  HN S P+FLYL    VH      P QAP E + K+  I D  
Sbjct: 178 GTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYIDKYSFIKDKT 232

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           RRT+A MV  +DE++GNV +AL+  G+ EN++++F  DNG    GIH+  G N+PLRG K
Sbjct: 233 RRTHAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNG----GIHTAGGYNYPLRGEK 288

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
           +T W+GG+RG   +  P   +   + + L H++DW PTL   AG  + D SLDGV+ WD 
Sbjct: 289 NTLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGGSMQD-SLDGVDLWDA 347

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
           L +   + R EILHNID           L++D                            
Sbjct: 348 LKQDTASPRKEILHNID-----------LKID---------------------------- 368

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
            P   ++   G+   +   KL +K  +A            T L      ELRE   +  N
Sbjct: 369 VPTAFVFE--GVALRSRDMKLLLKVPNA------------TWLVPP---ELRE-PGITAN 410

Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
                A     ++  L+NIT DP E+++L+    D++ +++ ++  Y+  +VPP  K  D
Sbjct: 411 EGLGSAKMQEVIEVALYNITADPTERHDLSGKFPDIVNEMKKRVDFYRGGLVPPIIKKND 470

Query: 550 KRAD-PARWNNIWVPW 564
            +A   A  N  W PW
Sbjct: 471 PKAVLTAIKNGAWSPW 486


>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 490

 Score =  367 bits (941), Expect = 1e-98,   Method: Compositional matrix adjust.
 Identities = 216/521 (41%), Positives = 292/521 (56%), Gaps = 50/521 (9%)

Query: 11  LTCTLLFNDAFLNTTA----PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           + C +L +  FL  T      K+P+I+ IL DDLGW DVSFHGS+QIPTPNID LA +G+
Sbjct: 3   VLCRILLDAVFLLGTILQCHAKRPNIVFILGDDLGWGDVSFHGSTQIPTPNIDVLAGDGV 62

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           ILN +YV  LCTPSR+ALMTG YPI  GMQ   I    PWGLPL  K+LPQ+ K+ GY  
Sbjct: 63  ILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSIQAAGPWGLPLENKILPQHFKDLGYDV 122

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
           + IGKWHLGFF+  Y P  RGFD+ +G++ G  DYY+H+  ++    + L +  ++QV+ 
Sbjct: 123 NMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSNDYYNHTSGSSHR--KILAVTSSVQVNT 180

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPM-------FLYLAHLAVHAGNTYEPFQAPDEEV 239
              G  S  L+    ++V   H    P+       F Y++H AVH     E FQAP   V
Sbjct: 181 LEKGRLS--LWGPRELSVCFLHQIYSPLNFYLQPFFCYISHQAVHHALNAEMFQAPARNV 238

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            KF  I +P R  YAGM+  LDES+G V+ AL   G+LE++I++F +DNG   +G+ SN+
Sbjct: 239 LKFSYIGEPNRTIYAGMLDALDESMGRVVEALSSAGILEDTIIVFSSDNGGSPYGLESNR 298

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
           G N PLRG K T W+G +R  A +W+P L +  +VS++L HISDWLPTL  AAG   +  
Sbjct: 299 GYNWPLRGAKFTLWEGSVRVPAFVWNPKLLKKSRVSNQLMHISDWLPTLYTAAGGNASAL 358

Query: 360 -SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
             +DGV+ W  LT G+ +KR EIL NID++ N     + LR    K V G   +G+ D+ 
Sbjct: 359 GGIDGVDMWHSLTHGSASKRKEILLNIDSLYN----ISGLRYKQYKLVVGGGFDGELDDH 414

Query: 419 Y---GDT--DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILT 473
           Y   G T   N ID+   +         T   +  +L  KQ  +  P A S +       
Sbjct: 415 YYFPGGTRPTNNIDQLRNQS--------TMARVLKELYEKQDRSWSPLAWSKN------- 459

Query: 474 DEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCE 514
                       V C  +    +      P LFNI +DPCE
Sbjct: 460 ----------VAVDCRRNRLTENFVPMQPPYLFNIEEDPCE 490


>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
          Length = 354

 Score =  365 bits (937), Expect = 3e-98,   Method: Compositional matrix adjust.
 Identities = 188/363 (51%), Positives = 243/363 (66%), Gaps = 17/363 (4%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHII ILADD GW+DV FHGS +IPTPN+DALA +G+IL+ HY Q LCTPSR +L+TGK+
Sbjct: 1   PHIIFILADDFGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKH 60

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI IG+Q GVI   +P+GL L EKLLP+YLK  GY +H +GKWHLGFF + YTP  RGFD
Sbjct: 61  PIQIGLQRGVIYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFD 120

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           SHYG++   +DY  H          GLD   N Q D    G YST L+T  A  ++AEHN
Sbjct: 121 SHYGFYGASEDYMTH-----IGGMGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHN 175

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQ---APDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           +++PMFLY +H AVH  +  E      AP +   KF  I+D E +  AGM+S LDESVGN
Sbjct: 176 QTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGN 235

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAI 323
           +  +L  +GMLEN+I++F +DNGAP    H N     SN+PLRG KS+ W+GG R  A +
Sbjct: 236 LTKSLHANGMLENTIIIFSSDNGAP----HGNTDVCSSNYPLRGAKSSVWEGGTRVPAFV 291

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD--GVNQWDVLTKGAKTKRSEI 381
           WSP LK++  VS ++ HISDWLPTL  AAG +++    D  GV+QW  L +   + R+ +
Sbjct: 292 WSPLLKKSGYVSKQMMHISDWLPTLLKAAGYDMSALPGDLYGVSQWQALQENGPSARNSM 351

Query: 382 LHN 384
           L+N
Sbjct: 352 LYN 354


>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
 gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
          Length = 492

 Score =  364 bits (934), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 213/544 (39%), Positives = 305/544 (56%), Gaps = 73/544 (13%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KPHI+ ILADD GWND+ +H SS I TPN+D LA  G+ L  +Y+Q +C+PSR+ LMTG
Sbjct: 14  RKPHIVFILADDYGWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTG 72

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y I  GMQH V++   P GLPL E  LPQ LKE+GYATH +GKWHLG F++ Y PT+RG
Sbjct: 73  RYQIRYGMQHSVLMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRG 132

Query: 148 FDSHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+ +G+  G +DY+ H        T E Y+  D     +      G YST ++   +I+
Sbjct: 133 FDTFFGFLGGGEDYFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSID 192

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I+ H+K KPMFLYL   AVHA     P +AP+E + K+  I     RTYAG+ + +DE+
Sbjct: 193 LISRHDKDKPMFLYLPFQAVHA-----PLEAPEEFINKYTHIRSKNMRTYAGVTTAMDEA 247

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGN+  AL+ HGM EN++++F  DNGA      S  GSN PLRG K+T W+GG+RGV  +
Sbjct: 248 VGNITRALKDHGMWENTVLIFSTDNGA------SKAGSNWPLRGFKNTLWEGGVRGVGFV 301

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
            SP LK  Q++S  L HISDW PT+   AG     T  LDG + WD +++G  + R+EIL
Sbjct: 302 SSPLLKVKQRISDALLHISDWFPTIVRIAGGSTFGTKPLDGYDVWDTISEGTPSPRTEIL 361

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           HNID +                 +A T+++           ++I  +S           T
Sbjct: 362 HNIDPL-----------------IAATNSDA----------SKIHGHS-----------T 383

Query: 443 FN-ALKTKLQIKQ-KHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
           FN ++K  ++++  K       N    L+T   D KI E     R +  +          
Sbjct: 384 FNTSVKAAIRVRDWKLLTGNPGNGDWQLQTF--DGKIKEKSTGVRGKQLW---------- 431

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
               LFNI  DP E+ ++++   +++++L  KL+ Y  T VP      D R++PA   ++
Sbjct: 432 ----LFNIRSDPQERKDVSDEYPEVVQRLLTKLSAYNQTAVPVYWPDIDPRSNPALHGDL 487

Query: 561 WVPW 564
             PW
Sbjct: 488 VGPW 491


>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
 gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
          Length = 528

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 222/567 (39%), Positives = 301/567 (53%), Gaps = 60/567 (10%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            A    LL    FL    P  PHI+ ILADD GWNDV +HGS +I TPN+D L+  G+ L
Sbjct: 10  IATAIILLSCPLFLEGKQP--PHIVFILADDFGWNDVGYHGS-EIKTPNLDKLSAKGVRL 66

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
             +YVQ LCTPSR+ LMTG+Y I  GMQH +I   +P+ +PL EKLLPQ ++EAGYATH 
Sbjct: 67  ENYYVQPLCTPSRNQLMTGRYQIRTGMQHQIIWPCQPYCVPLDEKLLPQLMREAGYATHM 126

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQV 184
           +GKWHLG +R+   PT RGFDS++GY  G +DY+ H   +   P       +D+R   +V
Sbjct: 127 VGKWHLGMYRKDCLPTRRGFDSYFGYLTGSEDYFSHQRCSYVPPLNVTRCAVDLREGEEV 186

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
                G YST L+T+   ++IA+ N  KP+FLY+A  AVHA     P Q P+  VA +  
Sbjct: 187 ATGYTGTYSTQLFTDRVTSIIAKQNSKKPLFLYVALQAVHA-----PLQVPERYVAPYSF 241

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I DP RR YAGMVS +DE+VGN+  AL++ G+ +N++++F  DNG  +       GSN P
Sbjct: 242 IKDPNRRLYAGMVSAMDEAVGNITLALQETGLWDNTVLVFSTDNGGQTLA----GGSNWP 297

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDG 363
           LRG K + W+GG+RGV  + SP LKQ   VS +L HISDWLPTL   AG   N T  LDG
Sbjct: 298 LRGRKWSLWEGGVRGVGFVTSPLLKQPGTVSRQLIHISDWLPTLVGLAGGSTNGTKPLDG 357

Query: 364 VNQWDVLTKGAKTKRSEILHNIDNV-----DNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
            N W+ ++KG  + R E+LHNID +       P      L V +     G          
Sbjct: 358 YNVWNAISKGYASPRLELLHNIDPLYVDIAPCPGSASQRLPVSNAPVSGG---------- 407

Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
            G T N      P     S   ++ +A    +Q K                 +LT     
Sbjct: 408 -GSTSN------PWSWSNSGFNVSIHAAIRSMQWK-----------------LLTGYPGC 443

Query: 479 ELREFARVRCNYDNKGAHCNSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
           + R F R      +      + +KP  LF++  DP E++ +++   +++  L  +L  Y 
Sbjct: 444 D-RWFTRPGLENTSGTIRSVNPLKPVMLFDVEADPEERDEVSDQHPNVVDNLLRRLGHYL 502

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
               P      D R DP      W PW
Sbjct: 503 RRSQPITFPDDDPRCDPG--TGAWGPW 527


>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  361 bits (926), Expect = 6e-97,   Method: Compositional matrix adjust.
 Identities = 185/369 (50%), Positives = 242/369 (65%), Gaps = 15/369 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 40  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 99  LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
              + SP LKQ    S EL HISDWLPTL   AG   N T  LDG N W  +++G  + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389

Query: 379 SEILHNIDN 387
            E+LHNID 
Sbjct: 390 VELLHNIDQ 398


>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
          Length = 520

 Score =  360 bits (925), Expect = 9e-97,   Method: Compositional matrix adjust.
 Identities = 210/546 (38%), Positives = 298/546 (54%), Gaps = 54/546 (9%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           T A  +PHI+ I+ADD GWNDV +H +S+I TPN+D L+  G+ L  +YVQ LC+PSR+ 
Sbjct: 23  TLAAPRPHIVFIMADDFGWNDVGYH-NSEIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQ 81

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           LMTG+Y IH GMQH +I   +P+ +PL EKLLPQ LKEAGYATH +GKWHLG +++   P
Sbjct: 82  LMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLP 141

Query: 144 TFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFDS++GY+ G +DYY H+      A       LD+R   +V     G YST+L+++
Sbjct: 142 THRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCALDLRDGEEVATAYKGAYSTELFSQ 201

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A++VIA+HN S+P+FLY+A  AVH     EP Q P+  V  +  I D  RR YAGMVS 
Sbjct: 202 RAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVSRRKYAGMVSA 256

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGN+  ALR+ G+ +N++++F  DNG    G     GSN PLRG K   W+GG+RG
Sbjct: 257 MDEAVGNISLALRQAGLWDNTVLVFSTDNG----GQTRAGGSNWPLRGRKWALWEGGIRG 312

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
           V  + SP L+    V++EL HISDWLPTL   AG   + T  LDG N W+ ++KG  + R
Sbjct: 313 VGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGSTSGTKPLDGFNMWNTISKGFASPR 372

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            E+LHNID +      Y        +    T   G   E +  +   +  ++    L  K
Sbjct: 373 LELLHNIDTL------YEDTSPCPGRQQQLTLAQGFRGETWARSSFNVSIHAAIRYLNWK 426

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
               +    T     + + ++  + S D L+ ++                          
Sbjct: 427 LLTGYPGCPTWFPRPEHNTSESSSTSVDPLKPVM-------------------------- 460

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
                 LF+I  DP E+N ++     ++  L  +L  +    +P      D R DP   +
Sbjct: 461 ------LFDIQKDPEERNEVSAQFPAVVDYLLTRLQSHLKNSLPINFPDDDPRCDPGP-S 513

Query: 559 NIWVPW 564
             W PW
Sbjct: 514 GAWGPW 519


>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
 gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
          Length = 491

 Score =  360 bits (923), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 199/478 (41%), Positives = 288/478 (60%), Gaps = 48/478 (10%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
           L  N   L ++A  KPH++ +LADDLGW+DV FHGS +I TPNID LA NG+IL+ +YVQ
Sbjct: 12  LCLNVVVLQSSA--KPHLLFVLADDLGWSDVGFHGS-KIQTPNIDRLAANGVILDNYYVQ 68

Query: 75  ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
            +CTP+R++LMTGKYPIH G+QHG+I  G P+GLPL   LLPQ L++AGY+TH +GKWHL
Sbjct: 69  PVCTPTRASLMTGKYPIHTGLQHGIIHNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHL 128

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
           GF+    TPT+RGFD+ YG++ G +++Y H           LD+R N ++     G YS 
Sbjct: 129 GFYNWESTPTYRGFDTFYGFYSGAENHYTHVQD------HYLDLRDNEEIVRDQNGTYSA 182

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
            L+T+ A  ++  H+ S P+F+Y+A   VH+     P QAP E + ++  I DP RRTYA
Sbjct: 183 HLFTKRAEQIVRAHDPSTPLFMYMAFQNVHS-----PVQAPKEYIDRYSFIKDPLRRTYA 237

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            MV+ +D+++GN+  A  K G+ EN+I++F  DNG    G+  N G ++PLRG K T W+
Sbjct: 238 AMVTIMDDALGNLTRAFDKAGLWENTILIFSTDNG----GVPKNGGYDYPLRGRKDTLWE 293

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKG 373
           GG+RGVA +    L+Q+      L H++DW PTL + AG  ++ D  LDG + W+ ++ G
Sbjct: 294 GGVRGVAFVHGVALEQSGVKCKALMHVTDWYPTLVSLAGGSLDEDEDLDGYDVWESISHG 353

Query: 374 AKTKRSEILHNIDNVDNP--------QKYYAALRVDDLK----------YVAGTDNNGQS 415
            ++ R E+LHNID ++ P              LRV D+K          ++   D NG  
Sbjct: 354 VESPRKELLHNIDTINIPPGDGSLGFSTTGIGLRVGDMKLLMAVPNISYFIPPEDRNGSV 413

Query: 416 DEWYGDTDN-----EIDKYSPKEVLYSKAGI--TFNALKTKLQIKQKH---AADPKAN 463
           D WY  ++N     E+  Y+     Y K  +      + T+LQ++ +H    A P AN
Sbjct: 414 D-WYIHSNNKVPMVEVALYNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPAN 470



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 40/65 (61%), Gaps = 1/65 (1%)

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA-DPARWNN 559
           V+  L+NIT DP E+++L +   D++ +L+ ++  Y+ T VPP NKP D  A   A+ N 
Sbjct: 426 VEVALYNITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAKQNG 485

Query: 560 IWVPW 564
            W PW
Sbjct: 486 AWTPW 490


>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
 gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
 gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
          Length = 534

 Score =  359 bits (922), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 40  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 99  LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
              + SP LKQ    S EL HI+DWLPTL   AG   N T  LDG N W  +++G  + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389

Query: 379 SEILHNIDN 387
            E+LHNID 
Sbjct: 390 VELLHNIDQ 398



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533


>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  359 bits (921), Expect = 2e-96,   Method: Compositional matrix adjust.
 Identities = 184/369 (49%), Positives = 242/369 (65%), Gaps = 15/369 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 40  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 99  LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
              + SP LKQ    S EL HI+DWLPTL   AG   N T  LDG N W  +++G  + R
Sbjct: 330 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 389

Query: 379 SEILHNIDN 387
            E+LHNID 
Sbjct: 390 VELLHNIDQ 398



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 475 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 533


>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
          Length = 491

 Score =  358 bits (919), Expect = 4e-96,   Method: Compositional matrix adjust.
 Identities = 187/405 (46%), Positives = 253/405 (62%), Gaps = 26/405 (6%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L+     KPHI++I+ADD G+ND+ +HGS +I TPN+D LA  G+ L  +YVQ +CTP+R
Sbjct: 16  LSKCQTTKPHILLIIADDFGYNDIGYHGS-EIKTPNLDKLAGEGVKLENYYVQPICTPTR 74

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S LM+G+Y IH G+QHGVI   +P GLPL   +LPQ LKE GY+THA+GKWHLGF++E Y
Sbjct: 75  SQLMSGRYQIHTGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEY 134

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            PT RGFDSH+GY  G +DY+ H     F    G D+R NM   N T G+YST L+ + A
Sbjct: 135 LPTNRGFDSHFGYLTGAEDYFKHD--RCFGAMCGTDLRDNMNPANYT-GVYSTHLFAQKA 191

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             V+  HN  KP+FLYL   +VHA     P Q P++    ++ I D +RRTYAGMVS +D
Sbjct: 192 AEVVNNHNTDKPLFLYLPFQSVHA-----PLQVPEQYTKPYMHIQDKQRRTYAGMVSAMD 246

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E++GN+    +K GM  N++++F  DNG     IH+  G+N PLRG K + W+GG+ GV 
Sbjct: 247 EAIGNLTDVFKKRGMWNNTLMVFSTDNGGQ---IHAG-GNNFPLRGWKDSLWEGGVHGVG 302

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT-KRS 379
            +    LK+   VS +L H+SDW PTL +  G  +N T  LDG++QW  + +G+    R 
Sbjct: 303 LVHGQMLKRKGSVSRDLIHVSDWFPTLISMVGGTLNGTKPLDGMDQWKTINEGSSPGPRQ 362

Query: 380 EILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNG 413
            ILHNID +  P            +  AA+R+ D K V G   NG
Sbjct: 363 SILHNIDPMYPPVGQRVPSSPFDNRIRAAVRMGDWKLVTGFPGNG 407



 Score = 43.9 bits (102), Expect = 0.27,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT+DP E N+++ +   ++  + + LA Y  T VP      D +A+P      W PW
Sbjct: 431 LFNITEDPNEHNDVSGTYQHVVINMLELLAQYNKTAVPCRYPAMDSKANPKYHGGYWGPW 490


>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
          Length = 516

 Score =  355 bits (912), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 220/565 (38%), Positives = 307/565 (54%), Gaps = 63/565 (11%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  +T   L    FL   A K+PHI+ ILADD GW DV +H  S+I TP +D L+  G+ 
Sbjct: 6   FLIITGLFLSFPGFL---AAKQPHIVFILADDFGWYDVGYH-LSEIRTPILDKLSSGGVR 61

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YVQ LCTPSR+ LMTG+Y IH GMQH +I   +P+ +PL EKLLPQ +KEAGYATH
Sbjct: 62  LENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATH 121

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD----HSCKATFEPYQGLDMRHNMQ 183
            +GKWHLG +++   PT RGFDS+ GY  G +DYY     H   A       LD+R    
Sbjct: 122 MVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSEDYYTHIRCHPISALNLTRCALDLREAEA 181

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           V     G YST+L+++ A+++I +H  ++P+FLY+A  AVHA     P Q P+  VA + 
Sbjct: 182 VARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPERYVAPYS 236

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
            I D  RR+YAGMVS +DE+VGN+  AL+K G+ EN++++F  DNG  +     + GSN 
Sbjct: 237 FIQDHSRRSYAGMVSAMDEAVGNITLALQKRGLWENTVLVFSTDNGGQTL----SGGSNW 292

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
           PLRG K + W+GG+RGV  + SP L+Q   +S +L HISDWLPTL   AG   N T  LD
Sbjct: 293 PLRGRKWSLWEGGVRGVGFVASPLLEQPGTISRQLIHISDWLPTLVGLAGGSTNGTKPLD 352

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD-DLKYVAGTDNNGQSDEWYGD 421
           G + W+ ++KG  + R E+LHNID + N        R    L  VAG     +++  +  
Sbjct: 353 GFDVWNTISKGFASPRLELLHNIDPLYNDIAPCPGTRRPLTLAQVAG--GASRTNSGFNV 410

Query: 422 TDNEIDKYSPKEVL--YSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
           + +   +YS  ++L  Y    I F                P  N SD      +     E
Sbjct: 411 SIHAAIRYSNWKLLTGYPGCDIWF--------------PRPGHNGSD------STPSEAE 450

Query: 480 LREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST 539
           L++  R                   L+NI  DP E+N ++     ++ +L  +L  ++ +
Sbjct: 451 LQKPVR-------------------LYNIEKDPEERNEVSAQFPAVVDRLLARLYHHQKS 491

Query: 540 MVPPGNKPFDKRADPARWNNIWVPW 564
            VP      D + DP      W PW
Sbjct: 492 AVPINFPDDDPKCDPGP-TGAWGPW 515


>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
 gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
          Length = 520

 Score =  353 bits (906), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 191/426 (44%), Positives = 258/426 (60%), Gaps = 31/426 (7%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L C L     F  + A  KP+I+ ILADD GWND+ +HGS  I TPN+D LA  G+ 
Sbjct: 21  FLLLVCVLSGGKTFAQS-ASGKPNIVFILADDYGWNDIGYHGSV-IRTPNLDRLAAEGVK 78

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +Y+Q +CTPSRS LMTG+Y IH G+QH +I   +P GLPL E  LPQ LKE GY+TH
Sbjct: 79  LENYYIQPICTPSRSQLMTGRYQIHFGLQHSIIWPPQPSGLPLDEVTLPQRLKEGGYSTH 138

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA---TFEP-YQGLDMRHNMQ 183
            +GKWHLGF++E YTP  RGFD+ YG+  G +++Y H        F P + GLD+R   +
Sbjct: 139 IVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSENHYSHRNSGGMPGFRPGWNGLDLRDQDR 198

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
                 G YST L+ + AI VIA+ +KSKPMFLYL   AVHA     P QAP + ++ + 
Sbjct: 199 PVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAPQKYISMYR 253

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
            I+D  RR YA M + +DE+VGN+  AL+++G+ +N++++F  DNG    G     G+N 
Sbjct: 254 HINDYNRRMYAAMTTAMDEAVGNITDALKQYGLWDNTVLVFSTDNG----GQVMAGGNNW 309

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
           PLRG K + W+GG+RGV  + SP +K   + S  L HISDW PTL   AG   N T  LD
Sbjct: 310 PLRGWKGSLWEGGIRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLAGGSTNGTKPLD 369

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDN------PQKY---------YAALRVDDLKYVA 407
           G + W+ +++G  + R EILHNID +        P ++         +AA+R  D K + 
Sbjct: 370 GHDVWEAISEGKPSPRKEILHNIDPLFRTILYLPPHQWGDDTFNTSMHAAIRSGDWKLLT 429

Query: 408 GTDNNG 413
           G   NG
Sbjct: 430 GYPGNG 435



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 38/60 (63%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E+ +L++    +++ L +KLA YK + VP    P D RA+PA   ++W PW
Sbjct: 460 LFNIREDPEERTDLSKKYPLVVQDLLEKLAAYKKSSVPIFYPPPDPRANPALHGDLWSPW 519


>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
          Length = 519

 Score =  352 bits (903), Expect = 3e-94,   Method: Compositional matrix adjust.
 Identities = 182/393 (46%), Positives = 248/393 (63%), Gaps = 27/393 (6%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
            +  ++ +  T + N A   T A ++PH++ ILADD GW D+ +HGS +I TPN+D L+ 
Sbjct: 6   GKAVYWIIASTFIVNLA--QTLAARQPHVVFILADDFGWYDIGYHGS-EIRTPNLDKLSA 62

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
            G+ L  +YVQ LCTPSR+ LMTG+Y IH GMQH +I   +P+ +PL EKLLPQ +KEAG
Sbjct: 63  GGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAG 122

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ--------- 174
           YATH +GKWHLG +++   PT RGFD++ GY  G +DY+ H     F  YQ         
Sbjct: 123 YATHMVGKWHLGMYKKDCLPTRRGFDTYLGYLTGSEDYFTH-----FRCYQSPSLNLSRC 177

Query: 175 GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
            LD+R   +V     G+YST+L ++ AI++I  H   KP+F+Y+A  AVHA     P Q 
Sbjct: 178 ALDLRDGEEVATGYKGVYSTELLSQRAISIIERHISQKPLFMYVALQAVHA-----PLQV 232

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           P+  V  +  I D  RR YAGMVS +DE+VGN+  AL+K G+  N++++F  DNG  +  
Sbjct: 233 PERYVTPYSFIKDTNRRLYAGMVSAMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTL- 291

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
              + GSN PLRG K + W+GG+RGV  + SP LKQ   V+ EL HISDWLPTL   AG 
Sbjct: 292 ---SGGSNWPLRGRKWSLWEGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAGG 348

Query: 355 EINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
               T  LDG + W+ ++KG  + R E+LHNID
Sbjct: 349 STTGTKPLDGFDVWNAISKGFASPRLELLHNID 381


>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
          Length = 429

 Score =  350 bits (897), Expect = 1e-93,   Method: Compositional matrix adjust.
 Identities = 178/366 (48%), Positives = 236/366 (64%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS+ I TP +DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFVLADDLGWNDVGFHGSA-IRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +IL  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++TE A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGM S +D
Sbjct: 220 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL + G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LK+    + EL HISDWLPTL   AG   N T  LDG + W+ +++G+ + R E
Sbjct: 331 FVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRME 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
          Length = 532

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 182/367 (49%), Positives = 239/367 (65%), Gaps = 15/367 (4%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           +A + PH++ +LADDLGWNDV FHGSS I TP++DALA  G+ L+ +YVQ LCTPSRS L
Sbjct: 39  SAGRPPHLVFLLADDLGWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQL 97

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++G+Y IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT
Sbjct: 98  LSGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPT 157

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEA 200
            RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++TE 
Sbjct: 158 RRGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTER 217

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A  +IA H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGMVS +
Sbjct: 218 ATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLM 272

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE+VGNV AAL+K G+ +N++ +F  DNG  +       G+N PLRG K T W+GG+RGV
Sbjct: 273 DEAVGNVTAALKKRGLWDNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGV 328

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRS 379
             + SP LKQ    + EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R 
Sbjct: 329 GFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTVSEGSPSPRV 388

Query: 380 EILHNID 386
           E+LHNID
Sbjct: 389 ELLHNID 395


>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 574

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 214/551 (38%), Positives = 295/551 (53%), Gaps = 81/551 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PHI+ IL DD G+ND+ +H  + I TP +D LA  G+ L  +YVQ +CTPSRS LMTG
Sbjct: 44  KQPHIVFILIDDQGFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTG 102

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LP     LP+ L++AGY+TH +GKWHLGF+R+   PT +G
Sbjct: 103 RYQIHTGLQHSIIRPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKG 162

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+ +G   G  D+Y++ SC        G D+     V     G YST L+T+ A  ++ 
Sbjct: 163 FDTFFGSLTGSVDHYNYLSCDGPG--VCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILE 220

Query: 207 EHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            HN + KP+FL L+  AVH      P Q P   +  + D+++  RR  A MVS +DE+V 
Sbjct: 221 SHNPTEKPLFLLLSLQAVHT-----PLQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVR 275

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           NV  ALRK+G   NS++++  DNGA  F      GSN PLRG K T W+GG+RGVA + S
Sbjct: 276 NVTYALRKYGFYRNSVIIYSTDNGAQPF----TGGSNWPLRGRKGTYWEGGIRGVAFVHS 331

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++VS  L HI+DW PTL   AG  I+ +  LDG + W  +++G  + R EILHN
Sbjct: 332 PLLKRRRRVSKALLHITDWFPTLVGLAGGNISQSRGLDGFDVWPTISEGRDSPRQEILHN 391

Query: 385 IDNVDN--------PQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
           ID +               AA+RV D K + G   +G               + P +VL 
Sbjct: 392 IDPLHKHVTSESLWDTSIQAAIRVGDWKLLTGYPGHGD--------------WVPPQVLP 437

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
           +  G  +                      D  R IL+  K                    
Sbjct: 438 TLPGRWW----------------------DLERNILSPYK-------------------- 455

Query: 497 CNSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
            NST K   LFNIT DPCE+++LA+ + D++ QL  +LA Y  T VP    P D RADP+
Sbjct: 456 -NSTYKNIWLFNITADPCERHDLADQRPDVVLQLLARLAYYNQTAVPVYFPPDDPRADPS 514

Query: 556 RWNNIWVPWYD 566
           +    WVPW +
Sbjct: 515 QHEGAWVPWVN 525


>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
 gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
          Length = 598

 Score =  349 bits (895), Expect = 3e-93,   Method: Compositional matrix adjust.
 Identities = 182/411 (44%), Positives = 253/411 (61%), Gaps = 30/411 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           +  ++  KP+I+ ILADD GWND+ +HGS  I TPN+D LA  G+ L  +YVQ LC+PSR
Sbjct: 114 IQESSSGKPNIVFILADDYGWNDIGYHGSV-IRTPNLDRLAAEGVKLENYYVQPLCSPSR 172

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
             LMTG+Y I  G+QH +I   +P GLPL E  LPQ LKE GY+TH +GKWHLGF+++ Y
Sbjct: 173 CQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKQDY 232

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATF----EPYQGLDMRHNMQVDNKTIGIYSTDLY 197
           TPT RGFD+ YGY  G +DY+ H  K       + + GLD+R   +      G YST L+
Sbjct: 233 TPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLRDQNRPVTDQNGTYSTHLF 292

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
              AI +IA+ +K+KPMFL+L+  AVH     +P QAP+E+++++  ISD  RR YA M 
Sbjct: 293 ANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDISRYSHISDTNRRVYAAMT 347

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           + +D++VGNV  AL+++G+ +N++++F  DNG    G     G N PLRG K + W+GG+
Sbjct: 348 TIMDQAVGNVTRALKQYGLWDNTVLIFSTDNG----GRVDRGGINWPLRGWKGSLWEGGV 403

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
           RGV  + SP +K   + S  L HISDW PTL   A    N T  LDG + W+ ++ G  +
Sbjct: 404 RGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASGSTNGTKPLDGHDVWEAISDGKPS 463

Query: 377 KRSEILHNID----NVDNPQKY-----------YAALRVDDLKYVAGTDNN 412
            R EILHNID     V +P+ +           +AA+R  D K + G   N
Sbjct: 464 PRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVHAAIRSGDWKLLTGYPGN 514



 Score = 54.7 bits (130), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 38/60 (63%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E+ +L++    ++++L +KLA Y  T VP     FD +A+PA   NIW PW
Sbjct: 538 LFNIREDPEERTDLSQKHPGVVQELLEKLARYNRTAVPVFYPSFDPQANPALHGNIWGPW 597


>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
 gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
          Length = 533

 Score =  348 bits (892), Expect = 6e-93,   Method: Compositional matrix adjust.
 Identities = 178/366 (48%), Positives = 236/366 (64%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS+ I TP +DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFVLADDLGWNDVGFHGSA-IRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +IL  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIHTGLQHQIILPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++TE A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGM S +D
Sbjct: 220 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL + G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LK+    + EL HISDWLPTL   AG   N T  LDG + W+ +++G+ + R E
Sbjct: 331 FVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRME 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
          Length = 520

 Score =  347 bits (891), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 179/364 (49%), Positives = 238/364 (65%), Gaps = 15/364 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PH++++LADDLGWNDV FHGS ++ TP++DALA  G+ L+ +Y Q LCTPSRS L+TG
Sbjct: 30  QPPHLVLVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTG 88

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   PT RG
Sbjct: 89  RYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRG 148

Query: 148 FDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+++GY  G +DY+ H      KA       LD R   +V  +   +YS +++   AI+
Sbjct: 149 FDTYFGYLLGSEDYFSHEHCVFIKALNVTRCALDFRDGEEVATEYKNMYSANIFANRAIS 208

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +IA H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGMVS +DE+
Sbjct: 209 LIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEA 263

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGNV AAL  HG+  N++ +F  DNG  +       G+N PLRG K T W+GG+RGV  +
Sbjct: 264 VGNVTAALESHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVGFV 319

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
            SP LKQ    + EL HISDWLPTL   AG     T +LDG N W+ +++G+ + R E+L
Sbjct: 320 ASPLLKQKGVRTRELIHISDWLPTLVKLAGGSTQGTKALDGFNVWETISEGSPSPRVELL 379

Query: 383 HNID 386
           HNID
Sbjct: 380 HNID 383


>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
 gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
 gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
 gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
          Length = 528

 Score =  347 bits (891), Expect = 8e-93,   Method: Compositional matrix adjust.
 Identities = 179/366 (48%), Positives = 241/366 (65%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A   PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS L+
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LK+AGYATH +GKWHLG +R+   PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +IA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS LD
Sbjct: 215 TTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 269

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV  AL+  G+  N++++F  DNG    G   + G+N PLRG K T W+GG+RG  
Sbjct: 270 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 325

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    S EL HI+DWLPTL   AG   + T  LDG + W+ +++G+ + R E
Sbjct: 326 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 385

Query: 381 ILHNID 386
           +L NID
Sbjct: 386 LLLNID 391



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 469 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 527


>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
          Length = 633

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 203/565 (35%), Positives = 306/565 (54%), Gaps = 38/565 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + P II+ + DD+GW+DVSFH + QIPTPNIDA+  + ++LN HYVQA CTPSR AL+TG
Sbjct: 55  RAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTG 114

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KYPI IG+Q  VI  G    L L  +LLPQYL++ GYATH +GKWHLGF+ E Y P  RG
Sbjct: 115 KYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRG 174

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS YG++ G   YY+HS  +  +   G D   N + D    G Y+TD+ T+   ++I  
Sbjct: 175 FDSFYGFYNGAGTYYNHSA-SDADGRIGYDWHLNKESDPDAHGKYATDIITQRVKHLIQS 233

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE--------RRTYAGMVSR 259
            +  KPMFL ++H+A H G+  +     D +      I DPE        R  YAGM+  
Sbjct: 234 RDPEKPMFLMISHMAPHGGDNEDELFEVDRQW-----IEDPEIAHIMVESRTKYAGMIRA 288

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGM 317
           LD++ G  + AL++  ML+NS++LF +DNG PS    S K   SN PLRG K + W+GG+
Sbjct: 289 LDDAFGQTVVALKEAEMLQNSVILFTSDNGGPSIVDESYKTGASNWPLRGQKFSMWEGGV 348

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND--TSLDGVNQWDVLTKGAK 375
           R  + +WSP LK  + ++  L+HI+DWLPT    AG E+ D    +DG + WD + +   
Sbjct: 349 RVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELRDLPIDIDGHDIWDSIDRNTD 408

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT---DNNGQSDEWYGDTDNEIDKYSPK 432
           + R  +L N D          A+R  + K + G    + +   D WY      +D +   
Sbjct: 409 SPRKNLLINYDTGGTEN---FAVRRGNFKLMHGKPPPEVDPWYDSWYATAGGTMDDFMTL 465

Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL--RTILTDEKILELREFA---RVR 487
           +    K+ +T++ L  +   +++     +A        R +    + L+ R +     ++
Sbjct: 466 QKAMKKS-VTYDILSRRPDFEERRDDWREAGKVKCKINRGMADSPRGLDYRTYTVDDMLK 524

Query: 488 CNYDNKGAHC-NSTVKPCLFNITDDPCEQNNLAESQTD-LLKQLEDKLAIY---KSTMVP 542
            +       C NS  +PCLF++ +DPCE    + +  + ++++L +    Y   K   V 
Sbjct: 525 ASVIKNDEICTNSNEQPCLFDLLEDPCELRQYSGADREQIIRELSEIAEQYLRLKEAAVA 584

Query: 543 PGNKPF---DKRADPARWNNIWVPW 564
           P +      D  ADP ++N  W+P+
Sbjct: 585 PWSSRMEHTDADADPQKFNGTWMPY 609


>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
          Length = 491

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 208/546 (38%), Positives = 301/546 (55%), Gaps = 67/546 (12%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PH++++LADDLGW DV +HGS+ I TP +DAL   G+ L ++Y Q LCTPSRS L++G+Y
Sbjct: 1   PHLVLVLADDLGWGDVGWHGSA-IRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  LPL EKLLP+ L+EAGY TH +GKWHLG +R+   PT RGFD
Sbjct: 60  QIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFD 119

Query: 150 SHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           +++GY  G +DYY H      KA       LD R   +V      +YST+L+TE AI+VI
Sbjct: 120 TYFGYLLGSEDYYTHDRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H   KP+FLYLA  +VH     EP + P++ V  +  I D +RR YAGMV+ +DE+VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+ +G+ +N++ +F  DNG  +       G+N PLRG K T W+GG+RGV  + S
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFVAS 290

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+    S EL HISDWLPTL   AG   N T  LDG + W+ +++G  + R E+LHN
Sbjct: 291 PLLKRKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWNTISEGRPSPRVELLHN 350

Query: 385 IDN--VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           ID   VD+    +  L +D    +  + ++   D ++  + +   ++   ++L    G +
Sbjct: 351 IDPMFVDD----FPCLGIDKRTSLPQSSSSPWDDTFFNISIHAAIRHGKWKLLTGYPGCS 406

Query: 443 F----NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
                 +L  + QI+   ++DP                                      
Sbjct: 407 HWFPPPSLYNESQIQ---SSDP-------------------------------------- 425

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
           ST +  LF+I  DP E+ +++E    ++++L  +L  Y    VP      D   DPA   
Sbjct: 426 STKRLWLFDIVHDPEEKYDVSEKYPHVVEKLLSRLQYYHKRSVPVFYPDEDPACDPAA-T 484

Query: 559 NIWVPW 564
             W PW
Sbjct: 485 GAWGPW 490


>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
          Length = 431

 Score =  346 bits (887), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 177/353 (50%), Positives = 231/353 (65%), Gaps = 15/353 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 40  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 98

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 99  LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 158

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 159 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 218

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 219 RATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 273

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 274 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 329

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
              + SP LKQ    S EL HISDWLPTL   AG   N T  LDG N W  ++
Sbjct: 330 TGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTKPLDGFNMWKTIS 382


>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
          Length = 484

 Score =  345 bits (886), Expect = 3e-92,   Method: Compositional matrix adjust.
 Identities = 200/538 (37%), Positives = 290/538 (53%), Gaps = 84/538 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP+I++ILADDLGWNDV ++ + +I TPN+D LA NG+ILN  Y  + C+PSR+AL+TG
Sbjct: 26  QKPNILLILADDLGWNDVGWN-NPEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YP  +G+QHGV+ +G+P+GLPL   LLPQ LK  GY+THAIGKWHLGF R  YTPTFRG
Sbjct: 85  RYPFKLGLQHGVVKKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRG 144

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS YG++ G +DYY     A     +G D R N +V    I  YST  Y   A+ +I  
Sbjct: 145 FDSFYGFYSGSEDYYKRKTAAI----RGYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQA 200

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H +++P+FLY+   AVH+     P Q P     K+ +I D  RR ++GMVS +D ++G  
Sbjct: 201 HKRTEPLFLYMPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMT 255

Query: 268 IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           I AL++  ML N++++F++DNG  P FG     G+N PLRG K+T W+GG R  + +W  
Sbjct: 256 IKALKQRNMLTNTMIVFLSDNGGTPFFG-----GNNWPLRGSKATLWEGGTRVPSFVWGK 310

Query: 327 WLKQTQK-VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            L Q     S+E+ H  DW PTL + AG    +  +DGV+QW +L++   ++R E ++ I
Sbjct: 311 MLLQKAGYTSNEMIHAVDWFPTLVSLAG-GTPEKDVDGVDQWQMLSQRQPSRRKEFVYGI 369

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
           +++       AA+R    K + GT   G    WY    + + K                 
Sbjct: 370 NSITG----RAAIRRGKYKLIEGTA--GHYSGWYPAPSSRVVK----------------- 406

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCL 505
           +K   Q+ ++H                                         NST+   L
Sbjct: 407 MKRMKQLNKRH----------------------------------------VNSTL---L 423

Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           +NI  DP E N+L+     L + +  +L  YK+  V   N   ++ + P  W   W P
Sbjct: 424 YNIQVDPIESNDLSAKYPHLTQFMLRRLNRYKAQEVEAFNPRKERYSQPKFWGGNWSP 481


>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
 gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
          Length = 535

 Score =  345 bits (885), Expect = 4e-92,   Method: Compositional matrix adjust.
 Identities = 213/548 (38%), Positives = 292/548 (53%), Gaps = 75/548 (13%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           H++ +LADDLGW+DV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+TG+Y 
Sbjct: 48  HLVFVLADDLGWHDVGFHGS-RIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RGFD+
Sbjct: 107 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      +        LD R   +V      +YST+++TE A  +I+
Sbjct: 167 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIS 226

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGMVS +DE+VGN
Sbjct: 227 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGN 281

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP
Sbjct: 282 VTAALKSHGLWNNTVFVFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 337

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LK+    S EL HISDWLPTL   AG     T  LDG + W  +++G+ + R E+LHNI
Sbjct: 338 LLKRKGVKSRELVHISDWLPTLVGLAGGSTKGTKPLDGFDVWRTISEGSPSPRMELLHNI 397

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
                           D  +V  +   GQS              +P +   S  G  + +
Sbjct: 398 ----------------DPNFVDISPCPGQS-------------LAPAKDDSSHPG--YFS 426

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC- 504
             T L    +H                 + K+L    +    C +     H N +V P  
Sbjct: 427 FNTSLHAAIRHG----------------NWKLLT--GYPGCGCWFPPPSQH-NVSVIPSS 467

Query: 505 --------LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
                   LF+I  DP E+++L+     ++KQL  +L  Y    VP      D R DP +
Sbjct: 468 DPPTKTLWLFDIDQDPEERHDLSRDHPHVVKQLLSRLQFYHKHSVPVYFPAQDPRCDP-K 526

Query: 557 WNNIWVPW 564
               W PW
Sbjct: 527 GTGAWGPW 534


>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
          Length = 431

 Score =  344 bits (883), Expect = 6e-92,   Method: Compositional matrix adjust.
 Identities = 176/351 (50%), Positives = 230/351 (65%), Gaps = 15/351 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS L+
Sbjct: 42  ATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             VIA H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS +D
Sbjct: 221 TTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRIYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG  
Sbjct: 276 EAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGTG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
            + SP LKQ    S EL HI+DWLPTL   AG   N T  LDG N W  ++
Sbjct: 332 FVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTIS 382


>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 537

 Score =  344 bits (883), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 210/568 (36%), Positives = 295/568 (51%), Gaps = 64/568 (11%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           ALT T+L     +     K PHI+ I+ADD GW DV +H S+ I TPN+D LA  G+ L 
Sbjct: 20  ALTVTVLICTGLIKGATGKPPHIVFIVADDYGWFDVGYHNST-IKTPNLDLLASRGVKLE 78

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
            +YVQ +C+PSRS LMTG+Y IH G+QH VI+  +P  LPL E  LPQ LKE+GYATH +
Sbjct: 79  NYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLKESGYATHLV 138

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ------GLDMRHNMQ 183
           GKWHLGF++    P  RGFDS +GY  G+QDY+ H    +F  +       G+D   N +
Sbjct: 139 GKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGFPEGNHWLGIDFWDNNR 198

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           V  +  G YS  ++TE A  VI +HN ++P+FLYL   +VH      P Q P++ +  + 
Sbjct: 199 VAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQVPEKYMKPYA 253

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
              D  R+TYAGMV+ +DE+VG V+ +L++ G+  +++++F  DNG    G     G+N 
Sbjct: 254 HFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNG----GTPGKSGNNW 309

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTS 360
           PLRG K+T W+GG+ GV  I  P +    +  VS    HISDW PTL    AG      +
Sbjct: 310 PLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPTLIEGVAGGNTAGLA 369

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNID---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
           LD  N W+ +TKG  + R E+LHNID     D+P  Y                       
Sbjct: 370 LDSYNMWNSITKGTPSPRKELLHNIDPYIRADHPFGY----------------------- 406

Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNA-LKTKLQIKQKHAADPKANSSDALRTILTDEK 476
                D E D   P   LY K    F+  ++  +++ +          S        +  
Sbjct: 407 ---GYDEETDMIYPLSGLYPKMAAEFSTDMRAAIRVGEWKLLTGFPGRSGWYPPPEWNIH 463

Query: 477 ILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
            ++  E A       NK           LFNIT DPCE+N+L+    +++ +L  +L  Y
Sbjct: 464 PIDPVEAA-------NKVT--------WLFNITADPCEKNDLSYQHPEVVTELVGRLEAY 508

Query: 537 KSTMVPPGNKPFDKRADPARWNNIWVPW 564
             T VP        +ADPA  + +W PW
Sbjct: 509 YKTSVPVRFPNQTVKADPALHHGVWGPW 536


>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
           intestinalis]
          Length = 522

 Score =  344 bits (882), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 208/571 (36%), Positives = 306/571 (53%), Gaps = 79/571 (13%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
           C L+F    L  +  +KP+I++I+ADDLG+NDV +H +  I TPNI+ LA +G+IL  +Y
Sbjct: 11  CFLVFVQV-LYVSGHQKPNILLIVADDLGFNDVGYH-NPDIYTPNINKLAKDGVILESYY 68

Query: 73  VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
           VQ +CTPSRS LMTG+Y IH G+QH VI   +P  LP+ E +LPQ LKEAGY THA+GKW
Sbjct: 69  VQPICTPSRSQLMTGRYQIHTGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKW 128

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF------EPYQGLDMRHNMQVDN 186
           HLGF+++   PT RGFD+ YGY+ G +DYY     A F          G D   N + + 
Sbjct: 129 HLGFYKKECLPTSRGFDTFYGYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEW 188

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
           +  G YS+ LY + A+ +I  HN S P+F+YL   +VH      P Q P + + ++  I 
Sbjct: 189 EANGTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHF-----PLQVPAKYIKRYRHIK 243

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPL 305
           D +RRT++ MV+ +DE++G+V+ AL+  G+ ++++V+F  DNG  + FG     G+N PL
Sbjct: 244 DRKRRTFSAMVTAMDEAIGSVVDALKWKGIWQDTLVVFTTDNGGQTLFG-----GNNWPL 298

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
           RG K++ W+GG+RGV  +    ++   + S+EL HISDW PTL   AG   + T  LD  
Sbjct: 299 RGRKASLWEGGVRGVGLVRGYGIRDKGRSSNELVHISDWFPTLLYIAGANASGTKPLDSY 358

Query: 365 NQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           N WD +  K   + R E+LHNID +    + Y                            
Sbjct: 359 NVWDAIANKDVVSPRVELLHNIDPLFT--RPYTP-------------------------- 390

Query: 424 NEIDKYSPKEVLY-SKAGITFNALKTKLQIK--QKHAADPKANSSDALRTILTDEKILEL 480
                 +PK + Y +K G+        L+ K  + H  DP  +  D   T          
Sbjct: 391 ------TPKLIPYPNKHGVNTTKGHAALRYKHWKLHTGDPGYDKWDLPPT---------- 434

Query: 481 REFARVRCNYDNKGAHCNSTVKPC-------LFNITDDPCEQNNLAESQTDLLKQLEDKL 533
               +  CN      + NS  K         LF+I +DP E+ NLA+    ++  L  K+
Sbjct: 435 ----KSYCNLSGMHTNTNSKYKSTHLVPSVHLFDIENDPQERKNLAQIYPKVVDLLLGKI 490

Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           A+Y  + VP    P+D +++P+    +W PW
Sbjct: 491 AMYNKSAVPINFPPYDPKSNPSLHGGVWEPW 521


>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
          Length = 547

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 207/552 (37%), Positives = 306/552 (55%), Gaps = 40/552 (7%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           A L  +A  KPH+I+I+ADD+GW+D S HGS  + TPN+D L  +G+ L+++Y  ALC+P
Sbjct: 13  AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
           +R+A++TGKY   +GMQ   +   E  G+PL E+L+ QYL++AGY T  +GKWH+G  FF
Sbjct: 73  ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
            ++  PT+RGF++H+G   G  DYY+++ +   +  P  GL +  ++Q D  T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           +YTE +  +I  HN S+P++L L H A H GN     QAP EEV     +    RR +A 
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV +LD+S+G ++A L K GMLEN+I+ F  DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309

Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           G+RG A IW     +P      KV     H +DW+PTL  A G +I    LDG+  W  +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
            +   + R+EI   ID+  N    ++++ +   K V GT +   S   YG+    I    
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P               K KL+       D KA  S     I  D  ++  R+ A V C  
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
                   S    CL++I +DPCE  +L+E    L + L  +L   ++ ++P  G    D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVAD 522

Query: 550 KRADPARWNNIW 561
            ++ P  +N  W
Sbjct: 523 PKSAPKYFNYTW 534


>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
          Length = 514

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 209/544 (38%), Positives = 296/544 (54%), Gaps = 63/544 (11%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PH++++LADDLGW DV +HGS+ I TP +DAL   G+ L ++Y Q LCTPSRS L++G+Y
Sbjct: 24  PHLVLVLADDLGWGDVGWHGSA-IRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRY 82

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  LPL EKLLP+ LK+AGY TH +GKWHLG +R+   PT RGFD
Sbjct: 83  QIHTGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFD 142

Query: 150 SHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           +++GY  G +DYY H      KA       LD R   +V      +YST+L+TE AI++I
Sbjct: 143 TYFGYLLGSEDYYSHDHCVLIKAKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLI 202

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H   KP+FLYLA  +VH     EP +   E +  +  I D +RR YAGMVS +DE+VG
Sbjct: 203 ANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVG 257

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G+  N++++F  DNG  +       G+N PLRG K T W+GG+RGV  + S
Sbjct: 258 NLTDALKEYGLWNNTVLVFSTDNGGQTMA----GGNNWPLRGRKWTLWEGGVRGVGFVAS 313

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
           P LKQ    S EL HISDWLPTL   AG   N T  LDG + W  +++G  + R E+LHN
Sbjct: 314 PLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWKTISEGRPSPRVELLHN 373

Query: 385 IDN--VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL--YSKAG 440
           ID   VD+    Y  L +D+   +   +++   +  +  + +   ++   ++L  Y   G
Sbjct: 374 IDPMFVDD----YPCLGIDNRTSLPQINSSPWDNTLFNISVHAAIRHGKWKLLTGYPGCG 429

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
             F                P  +S   +++     K L L                    
Sbjct: 430 HWF--------------PPPSLSSESQIQSYDPPMKRLWL-------------------- 455

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
                F+I  DP E+  L+E    ++K+L  +L  Y    VP      D + DPA    +
Sbjct: 456 -----FDIVHDPEEKYELSEKYPHVVKKLLSRLQYYYKRSVPVFYPDEDPQCDPAA-TGV 509

Query: 561 WVPW 564
           W PW
Sbjct: 510 WGPW 513


>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
          Length = 484

 Score =  343 bits (880), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 203/538 (37%), Positives = 289/538 (53%), Gaps = 82/538 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHI+ I+ADDLGWNDV +H  + + TPNID LA  GLILNQ Y Q LC+PSR A MTG
Sbjct: 24  KSPHILFIVADDLGWNDVGYHNPA-MKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTG 82

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP   G+QH VI+  +P   PL  K LPQ LK+ GYATH +GKWHLG      TPT+RG
Sbjct: 83  YYPYRAGLQHLVIMPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRG 142

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS +G++    DYY H        Y+ LD R N +      G YST  +T  A ++I +
Sbjct: 143 FDSFFGFYHAKADYYSHIS------YKYLDYRDNEKPVKNLNGTYSTFTFTSRAQDIIKK 196

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           HN S+P+FLY+A    H     EP Q P +    + +     RR Y+GMVS LDE++GN+
Sbjct: 197 HNSSQPLFLYMAFPIPH-----EPLQVPQQYEDMYPNTMSKGRRQYSGMVSALDEAIGNI 251

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
              LR+ GM ++++++F +DNG    G   + G+N+PLRG K T ++GG R V+ +    
Sbjct: 252 TDQLRESGMYDDTLIVFTSDNG----GSPMDYGNNYPLRGAKFTVYEGGTRVVSFVHGAG 307

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           L++T +    + H  DW  TL AAAG E  D   DGV+QW  + + +   R+E+++N D 
Sbjct: 308 LQKTGEAYESMMHAVDWSDTLVAAAGGE-PDMRDDGVSQWTAIAQNSPGPRTELVYNFDP 366

Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
              P++ +AA+RV D K + G    G   +WY                            
Sbjct: 367 DSAPEQGHAAIRVGDYKLIHGYP--GLFPDWY---------------------------- 396

Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
              ++ Q+H  +             T+  +LE+ E A                 +  LF 
Sbjct: 397 ---KVDQQHEFN-------------TEPSVLEMLEHAE----------------RYQLFY 424

Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWVP 563
           I DDP EQN+LA+S+ D++K+L  +L +     +PP N P   D ++DP  ++N+W P
Sbjct: 425 IKDDPTEQNDLAQSKPDIVKKLSQRLLMLTKNALPP-NYPLVPDPKSDPTHFDNVWSP 481


>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
 gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
          Length = 535

 Score =  342 bits (878), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 178/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDVSFHGS+ I TP++D LA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ADRPPHLVFVLADDLGWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  +        LD R   QV      +YST+++TE A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCALIDSLNVTRCALDFRDGEQVATGYKNMYSTNIFTERA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 TALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A G       LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTKGTKPLDGFDVWKTISEGSPSPRKE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
          Length = 503

 Score =  342 bits (878), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 177/364 (48%), Positives = 238/364 (65%), Gaps = 15/364 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PH++ +LADDLGWNDV FHGS ++ TP++DALA  G+ L+ +Y Q LCTPSRS L+TG
Sbjct: 14  RPPHLVFVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTG 72

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  +PL EKLLPQ L+EAGYATH +GKWHLG +++   PT RG
Sbjct: 73  RYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRG 132

Query: 148 FDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+++GY  G +DYY H      KA       LD R   +V  +   +YST+++T  A +
Sbjct: 133 FDTYFGYLLGSEDYYTHEHCVFIKALNVTRCALDFRDGEEVATQYKNLYSTNIFTNRATS 192

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +IA H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+
Sbjct: 193 LIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEA 247

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGNV AAL+ HG+  N++ +F  DNG  +       G+N PLRG K T W+GG+RGV  +
Sbjct: 248 VGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFV 303

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
            SP LKQ    + EL HISDWLPTL   AG + + T  LDG + W  +++G+ + R E+L
Sbjct: 304 ASPLLKQKGVKNRELIHISDWLPTLVKLAGGDTHGTKPLDGFDVWRTISEGSPSPRMELL 363

Query: 383 HNID 386
           HNID
Sbjct: 364 HNID 367


>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
          Length = 534

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 TTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRVE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
          Length = 532

 Score =  341 bits (875), Expect = 6e-91,   Method: Compositional matrix adjust.
 Identities = 177/369 (47%), Positives = 236/369 (63%), Gaps = 15/369 (4%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            + A + PH++ +LADDLGWNDV FHGSS I TP++DALA  G++L+ +Y Q LCTPSRS
Sbjct: 37  GSGAGRPPHLVFVLADDLGWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRS 95

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L+TG+Y I +G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   
Sbjct: 96  QLLTGRYQIRMGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECL 155

Query: 143 PTFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
           PT RGFD+++GY  G +DYY H       A       LD R   +V      +YST+++T
Sbjct: 156 PTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFT 215

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           E A  +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS
Sbjct: 216 ERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVS 270

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            +DE++GNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+R
Sbjct: 271 LMDEAIGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVR 326

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTK 377
           GV  + SP LKQ    + EL HISDWLPTL   AG   N T  LDG + W+ +++G+ + 
Sbjct: 327 GVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWETISEGSPSP 386

Query: 378 RSEILHNID 386
           R E+LHNID
Sbjct: 387 RVELLHNID 395


>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
          Length = 521

 Score =  341 bits (874), Expect = 8e-91,   Method: Compositional matrix adjust.
 Identities = 176/364 (48%), Positives = 235/364 (64%), Gaps = 15/364 (4%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PH++ +LADDLGW DV +HGS QI TP +DAL+  G+ L ++Y+Q LCTPSRS L+TG+Y
Sbjct: 33  PHLVFVLADDLGWQDVGWHGS-QIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRY 91

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  +PL EKLLP+ LKEAGY TH +GKWHLG +R    PT RGFD
Sbjct: 92  QIHTGLQHEIIWPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFD 151

Query: 150 SHYGYWQGLQDYYDHS-CKATFEP---YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           +++GY  G +DYY H  C            LD+R   ++ +    +YST+++T+ A ++I
Sbjct: 152 TYFGYLLGSEDYYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDLI 211

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H   KP+FLYLA  +VH     EP Q P++ V  +  I D +RR YAGMVS LDE+VG
Sbjct: 212 ANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAVG 266

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+  G+  N++++F  DNG  +       G+N PLRG K T W+GG+RG   + S
Sbjct: 267 NITEALKTRGLWNNTVLIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGAGFVAS 322

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
           P LKQ    S EL HISDWLPTL   AG   NDT  LDG + W  L++G  + R E+LHN
Sbjct: 323 PLLKQRGVESHELIHISDWLPTLVKLAGGSTNDTKPLDGFDVWKTLSEGEPSPRKELLHN 382

Query: 385 IDNV 388
           ID +
Sbjct: 383 IDPI 386


>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
          Length = 414

 Score =  340 bits (873), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   AG  IN T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
 gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  340 bits (873), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 206/549 (37%), Positives = 305/549 (55%), Gaps = 40/549 (7%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           A L  +A  KPH+I+I+ADD+GW+D S HGS  + TPN+D L  +G+ L+++Y  ALC+P
Sbjct: 13  AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
           +R+A++TGKY   +GMQ   +   E  G+PL E+L+ QYL++AGY T  +GKWH+G  FF
Sbjct: 73  ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
            ++  PT+RGF++H+G   G  DYY+++ +   +  P  GL +  ++Q D  T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           +YTE +  +I  HN S+P++L L H A H GN     QAP EEV     +    RR +A 
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV +LD+S+G ++A L K GMLEN+I+ F  DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309

Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           G+RG A IW     +P      KV     H +DW+PTL  A G +I    LDG+  W  +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
            +   + R+EI   ID+  N    ++++ +   K V GT +   S   YG+    I    
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P               K KL+       D KA  S     I  D  ++  R+ A V C  
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
                   S    CL++I +DPCE  +L+E    L + L  +L   ++ ++P  G    D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVAD 522

Query: 550 KRADPARWN 558
            ++ P  +N
Sbjct: 523 PKSAPKYFN 531


>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 206/549 (37%), Positives = 305/549 (55%), Gaps = 40/549 (7%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           A L  +A  KPH+I+I+ADD+GW+D S HGS  + TPN+D L  +G+ L+++Y  ALC+P
Sbjct: 13  AALCVSAATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSP 72

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG--FF 137
           +R+A++TGKY   +GMQ   +   E  G+PL E+L+ QYL++AGY T  +GKWH+G  FF
Sbjct: 73  ARTAVLTGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFF 132

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE--PYQGLDMRHNMQVDNKTIGIYSTD 195
            ++  PT+RGF++H+G   G  DYY+++ +   +  P  GL +  ++Q D  T G Y TD
Sbjct: 133 EQL--PTYRGFENHFGVRGGFIDYYEYNAQEQLDGRPVTGLCLFDDLQPDWTTEG-YITD 189

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           +YTE +  +I  HN S+P++L L H A H GN     QAP EEV     +    RR +A 
Sbjct: 190 VYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAA 249

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV +LD+S+G ++A L K GMLEN+I+ F  DNGAP+ G+ +N GSN+PLRG+K +PW+G
Sbjct: 250 MVKKLDDSIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANSGSNYPLRGVKKSPWEG 309

Query: 316 GMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           G+RG A IW     +P      KV     H +DW+PTL  A G +I    LDG+  W  +
Sbjct: 310 GIRGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIGEKI-PAGLDGIPMWSHI 368

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
            +   + R+EI   ID+  N    ++++ +   K V GT +   S   YG+    I    
Sbjct: 369 IENKPSPRTEIF-EIDDYFN----HSSVTLGRHKLVKGTIDESLSKH-YGEDLRGIIGTP 422

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P               K KL+       D KA  S     I  D  ++  R+ A V C  
Sbjct: 423 PD-------------YKQKLR-------DSKAWESLETIGIPLDADVMADRDEAIVTCGN 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP-PGNKPFD 549
                   S    CL++I +DPCE  +L+E    L + L  +L   ++ ++P  G    D
Sbjct: 463 VVPKPCSPSAESWCLYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREGQYVSD 522

Query: 550 KRADPARWN 558
            ++ P  +N
Sbjct: 523 PKSAPKYFN 531


>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
          Length = 481

 Score =  340 bits (873), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 184/415 (44%), Positives = 255/415 (61%), Gaps = 23/415 (5%)

Query: 8   FFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           F  L CT+L      +  + A +  HI+ I+ADDLGWND+ FH +  I TPNID LA  G
Sbjct: 2   FVQLLCTVLVIINLCDDVSAAGRPRHIVFIVADDLGWNDIGFH-NPDIITPNIDKLAREG 60

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           L+LN HYVQ LC+PSR+A M+G YP   G+QH VILE +P  LPL   +LPQ LKE GYA
Sbjct: 61  LLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVILENQPVCLPLNITILPQKLKELGYA 120

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           TH +GKWH GF     TPT+RGFDS +GY+  ++DYY H  +        LD R+N    
Sbjct: 121 THIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMEDYYTHVIRGF------LDYRNNTTPV 174

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
               G YST  +T+ A ++I  HN+S+P+FLYLA+ AV     Y P + P +  A + +I
Sbjct: 175 WTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQAV-----YGPIEVPAKYEAMYPNI 229

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
               RR ++GMVS LDE+VGNV   LR+ G+++++++LF ADNG    G     G+N+PL
Sbjct: 230 KSENRRKFSGMVSALDEAVGNVTKTLRQRGLMDDTLILFTADNG----GGVDESGNNYPL 285

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGV 364
           RG K T ++GG R V  ++   L++T  V   + H  DWLPTL AAA G  ++D   DG+
Sbjct: 286 RGSKFTVYEGGTRAVGFMYGSGLQKTGTVFDGMIHAVDWLPTLTAAAGGTPVSDR--DGI 343

Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
           N W  L+  + + R+E+++N D+   P + +AA+RV D K + G    G   +WY
Sbjct: 344 NLWPSLSTASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLIDGYP--GPFPDWY 396



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 27/61 (44%), Positives = 40/61 (65%), Gaps = 3/61 (4%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWV 562
           LFN+ DDP E+N+L+  + D++K+L  +LA YK   VPP N P   D  ++PA + N+W 
Sbjct: 419 LFNLKDDPNERNDLSNFRPDMVKKLAARLAWYKKQAVPP-NFPETPDDLSNPALYGNVWS 477

Query: 563 P 563
           P
Sbjct: 478 P 478


>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 516

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 185/450 (41%), Positives = 261/450 (58%), Gaps = 38/450 (8%)

Query: 8   FFALTCTLLFNDA---FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +FA+   L F       L   +P+  HI+ ILADD G++D+ +H  S I TPN+D LA  
Sbjct: 15  WFAMDFRLFFTTLIVLLLRPVSPEPTHIVFILADDYGFHDIGYH-DSIIKTPNLDRLASE 73

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           G+ L  +YVQ +CTP+RS LM+G+Y IH G+QHG+I   +P  LP+ E  +PQ LKE+GY
Sbjct: 74  GVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGY 133

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ- 183
           ATH +GKWHLG +++   PT RGFD+ +GY  G +DYY H+   +++ + G+D R NMQ 
Sbjct: 134 ATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSEDYYTHN--RSYDKFHGMDFRENMQI 191

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           V  +  G YST ++ E A N+I  H+   P+FLYL   AVH      P Q PD+    + 
Sbjct: 192 VQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHG-----PLQVPDQYEKPYA 246

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
           +I++ +RRTYAGMVS +DE+VGN+   L+  G+ +NS+++F  DNG          G+N 
Sbjct: 247 NITNKQRRTYAGMVSCMDEAVGNITQTLKDTGLYDNSVIIFSTDNGGQVIA----GGNNW 302

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS- 360
           PLRG K + W+GGM GV  + SP +    +   S EL H++DW PT+   AG  +N T  
Sbjct: 303 PLRGWKGSLWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVNGTQP 362

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNID------NVDNPQKYY----------AALRVDDLK 404
           LDG NQW+ ++ G K+ R E+LHNID         N    Y          AALRV D K
Sbjct: 363 LDGYNQWETISNGKKSPRKELLHNIDPNYIALQTSNNNMIYRTDMFNASIRAALRVGDWK 422

Query: 405 YVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            + G   N     W    ++ I    P ++
Sbjct: 423 ILTGDPGN---SSWIPPPESSITSPHPTDI 449



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 31/60 (51%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E  +L+  + D + +L +KL  Y  T VP      D   DP + +  W PW
Sbjct: 456 LFNIAEDPNEYKDLSLFRPDKVNELLEKLQQYYKTSVPVRYPDNDPACDPGKHSGAWGPW 515


>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
          Length = 534

 Score =  340 bits (871), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   AG  IN T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
          Length = 506

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/366 (48%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS  I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 14  ASRPPHLVFVLADDLGWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 72

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 73  TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 132

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 133 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 192

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 193 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 247

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 248 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 303

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   AG  IN T  LDG + W  +++G+ + R E
Sbjct: 304 FVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHINGTKPLDGFDVWKTISEGSPSPRVE 363

Query: 381 ILHNID 386
           +LHNID
Sbjct: 364 LLHNID 369


>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
 gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
          Length = 478

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 183/403 (45%), Positives = 244/403 (60%), Gaps = 29/403 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI+ ILADD GWND+ +HGS  I TPN+D LA  G+ L  +YVQ +C+PSR  LMTG+
Sbjct: 2   KPHIVFILADDYGWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGR 60

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH VI    P GLPL E  LPQ LKE GY+TH +GKWHLGFFR+ Y P  RGF
Sbjct: 61  YQIHYGLQHSVITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGF 120

Query: 149 DSHYGYWQGLQDYYDHSCKATF----EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           D+ YG+  G +DY+ H     +      Y GLD+R   +      G YST L+   AI++
Sbjct: 121 DTFYGFLTGGEDYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDI 180

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           IA H+++KPMFLYL+  AVHA     P QAP+  + K+  + +   R YA MV+ +DE+V
Sbjct: 181 IAHHDRNKPMFLYLSFQAVHA-----PLQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAV 235

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           GNV  AL+  G+ +N++++F  DNGA       + GSN PLRG K+T W+GG+RGV  + 
Sbjct: 236 GNVTDALKGSGLWDNTVLIFSTDNGA-----RRSAGSNWPLRGWKNTLWEGGIRGVCFVT 290

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
           S  L++    S  L HISDW PTL   A    +D+  LDG + W+ ++ GAK+ R EIL 
Sbjct: 291 SNLLERKGTKSDALIHISDWFPTLIHLAQGSTHDSKPLDGFDVWNSISTGAKSTREEILL 350

Query: 384 NIDNVDNPQKY-------------YAALRVDDLKYVAGTDNNG 413
           NID +     +             YAA+R  D K + G  + G
Sbjct: 351 NIDPLKCKIGFTCASGGGNFNTLTYAAIRSRDWKLLTGFQDQG 393



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 36/60 (60%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E+ +L+E   D+++ L  KL  Y  T VPP   P D RA+P    +++ PW
Sbjct: 418 LFNIRNDPQERTDLSEKYPDVVQDLLMKLKTYNRTAVPPFWPPRDPRANPVLHGDVFGPW 477


>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
          Length = 556

 Score =  339 bits (870), Expect = 2e-90,   Method: Compositional matrix adjust.
 Identities = 176/369 (47%), Positives = 234/369 (63%), Gaps = 24/369 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + A + PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS 
Sbjct: 71  SGATQPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQ 129

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   P
Sbjct: 130 LLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLP 189

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTE 199
           T RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+
Sbjct: 190 TRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALDLRDGEEPAKEYNNIYSTNIFTK 249

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A  VIA H   K              + ++P Q P+E +  +  I D  RR YAGMVS 
Sbjct: 250 RATTVIANHPPEK--------------SVHDPLQVPEEYMEPYGFIQDKHRRIYAGMVSL 295

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+VGNV  AL+ HG+  N++ +F  DNG    G   + G+N PLRG K T W+GG+RG
Sbjct: 296 MDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRG 351

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKR 378
              + SP LKQ    S EL HI+DWLPTL   AG   N T  LDG N W  +++G  + R
Sbjct: 352 TGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTKPLDGFNMWKTISEGHPSPR 411

Query: 379 SEILHNIDN 387
            E+LHNID 
Sbjct: 412 VELLHNIDQ 420



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 497 LFDINQDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSHFPPLDPRCDP-KSTGVWSPW 555


>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
          Length = 408

 Score =  338 bits (867), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 234/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H + KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
          Length = 522

 Score =  338 bits (867), Expect = 5e-90,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 236/366 (64%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +PHI+ +LADDLGWNDV +H  S I TP++DAL+  G+ L  +Y Q LCTPSRS L+
Sbjct: 30  ATGQPHIVFVLADDLGWNDVGYH-DSNIFTPHLDALSAQGVRLENYYTQPLCTPSRSQLL 88

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  +PL EKLLP+ L+EAGY TH +GKWHLG FR+   PT 
Sbjct: 89  TGRYQIHTGLQHQIIWPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTR 148

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+ +GY  G +DYY H      +  +     LD R    +      +YST+++TE A
Sbjct: 149 RGFDTFFGYLLGSEDYYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERA 208

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +N+IA H   KP+FLYLA  +VH     EP Q P+E +  +  I +  R+ YAGMV+ +D
Sbjct: 209 VNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMD 263

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV  AL+K+G+  N++ +F  DNG  +       G+N PLRG K T W+GG+RGV 
Sbjct: 264 EAVGNVTRALQKYGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVG 319

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LK+ +  SSEL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E
Sbjct: 320 FVSSPLLKRKRVKSSELIHISDWLPTLVNLAGGRTNGTKPLDGFDVWKTISEGSPSPRME 379

Query: 381 ILHNID 386
           +LHNID
Sbjct: 380 LLHNID 385


>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 374

 Score =  337 bits (865), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 172/354 (48%), Positives = 228/354 (64%), Gaps = 4/354 (1%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           FL   +  K    ++   + GW+DVS HGSSQI TPNID LA  G+ L  +YV  +CTP+
Sbjct: 23  FLKHDSADKFDGCVMSEIEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPT 82

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           RSA+MTGK+PIH GMQH  I   EPWGL L EK + Q+LK  GY+THA+GKWHLG+F E 
Sbjct: 83  RSAIMTGKHPIHTGMQHDTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAED 142

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           Y PT RGFDS +GY+ G  DYY H  + T   + G D+  N +V     G YST+++TE 
Sbjct: 143 YIPTRRGFDSFFGYYNGRGDYYTH--EDTEGGFGGYDLHKNGEVHWPDFGQYSTEIFTEE 200

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
           A  +I  HN S+P+FLYLAH AVHAG    +  +AP +    F +I    RR +A MV+ 
Sbjct: 201 AQQIIKTHNASQPLFLYLAHQAVHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAG 260

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD+SVGN+   L+  G+  NSI++F  DNG P+ G   N  SN PLRG K T W+GG+RG
Sbjct: 261 LDDSVGNISQTLQDAGLYNNSIIVFTTDNGGPTNGYDGNHASNWPLRGCKHTLWEGGVRG 320

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTK 372
            A + SP +++ ++ S  + H+ DWLPTL   AG +  + T+LDG +QWD L+K
Sbjct: 321 TAFVNSPLIEKPRRFSDRMMHVCDWLPTLYGVAGGDPKELTNLDGFDQWDALSK 374


>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
          Length = 415

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
          Length = 413

 Score =  337 bits (863), Expect = 1e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
 gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
          Length = 415

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
          Length = 533

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDMWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
          Length = 535

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
 gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
 gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
 gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
          Length = 413

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
          Length = 301

 Score =  336 bits (862), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 157/230 (68%), Positives = 184/230 (80%), Gaps = 3/230 (1%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KPHIIIILADDLGWNDVSFHGS QIPTPNIDALAYNG+ILN HYV ALCTPSRSALMTG
Sbjct: 72  QKPHIIIILADDLGWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTG 131

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K PIH+GMQH V+   EP GLPL+EKLLPQYL+E GY THA+GKWHLG+F++ YTPT+RG
Sbjct: 132 KNPIHLGMQHSVLYPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRG 191

Query: 148 FDSHYGYWQGLQDYYDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           FDSH+GYW GL+DYY H  +   + +  Y+G DMR N+ V   T G Y+TDL+T  A+ +
Sbjct: 192 FDSHFGYWNGLEDYYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRL 251

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           I EH+  +PMFLYLAHLAVH GN  +  +APDEE+AKF  I DPERR  A
Sbjct: 252 INEHDTERPMFLYLAHLAVHKGNENQLLRAPDEEIAKFSYILDPERRIQA 301


>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
 gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
 gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
 gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
 gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  336 bits (861), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
 gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  336 bits (861), Expect = 2e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
 gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
 gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
 gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
          Length = 533

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
 gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 43  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 101

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 102 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 161

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 162 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 221

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 222 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 276

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 277 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 332

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 333 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 392

Query: 381 ILHNID 386
           +LHNID
Sbjct: 393 LLHNID 398


>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
 gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
 gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
 gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
 gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  335 bits (860), Expect = 3e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 41  ASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 100 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 159

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 160 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 219

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 220 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 274

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 275 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVG 330

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 331 FVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 390

Query: 381 ILHNID 386
           +LHNID
Sbjct: 391 LLHNID 396


>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
          Length = 534

 Score =  335 bits (859), Expect = 4e-89,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 233/366 (63%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASQPPHLVFLLADDLGWNDVGFHGS-RIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           I +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 IALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV 
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVG 331

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E
Sbjct: 332 FVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIE 391

Query: 381 ILHNID 386
           +LHNID
Sbjct: 392 LLHNID 397


>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
 gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
 gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
          Length = 599

 Score =  335 bits (858), Expect = 6e-89,   Method: Compositional matrix adjust.
 Identities = 206/552 (37%), Positives = 292/552 (52%), Gaps = 70/552 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           + TA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 70  SVTALSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 128

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 129 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 188

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   G+YST +YT
Sbjct: 189 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GVYSTQMYT 244

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 245 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 299

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 300 CLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 354

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
           R +  + SP LK    V  EL HI+DW PTL + A  +I++   LDG + W+ +++G ++
Sbjct: 355 RAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRS 414

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R +ILHNID +    K                  NG     Y                 
Sbjct: 415 PRVDILHNIDPIYTKAK------------------NGSWAAGY----------------- 439

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
              GI   A+++ +++K           SD +            + F+ +  N + N+  
Sbjct: 440 ---GIWNTAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 486

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P 
Sbjct: 487 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 546

Query: 556 RWNNIWVPWYDE 567
               +W PWY E
Sbjct: 547 LNGGVWGPWYKE 558


>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
          Length = 598

 Score =  334 bits (857), Expect = 7e-89,   Method: Compositional matrix adjust.
 Identities = 205/549 (37%), Positives = 291/549 (53%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +T +  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STASTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPSRYFEHYRSIVNINRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           R +ILHNID +    K                  NG                     L +
Sbjct: 415 RVDILHNIDPIYTKAK------------------NGS--------------------LAA 436

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ Y  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQYNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 547 NGGVWGPWY 555


>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
          Length = 502

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 175/366 (47%), Positives = 226/366 (61%), Gaps = 15/366 (4%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A   PHI+ +LADDLGWNDV FHGS +I TP +D L+  G+ L  +Y Q LCTPSRS L+
Sbjct: 9   ATSPPHIVFVLADDLGWNDVGFHGS-EILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLL 67

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +G+Y IH G+QH +I   +P   PL +KLLP+ LKE GY TH +GKWHLG ++    PT 
Sbjct: 68  SGRYQIHTGLQHQIIWPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTR 127

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFDS++GYW G +DYY H               LD R            YST L+T+ A
Sbjct: 128 RGFDSYFGYWTGGEDYYSHERCYLITTLNITRCALDFRDGEVPATDYQMKYSTHLFTDRA 187

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +++I  HN  KP+FLYLA+ AVH+     P Q PD+ +  +  I D  RR YAGMVS LD
Sbjct: 188 VDLITNHNPEKPLFLYLAYQAVHS-----PLQVPDQYIEPYTSIHDKNRRLYAGMVSVLD 242

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+V NV  AL+K G+  N++ +F  DNG  +     + G+N PLRG K T W+GG+RGV 
Sbjct: 243 EAVANVTNALKKQGLWNNTVFIFSTDNGGQTL----SGGNNWPLRGRKWTLWEGGIRGVG 298

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + S  LKQ    S EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E
Sbjct: 299 FVTSSLLKQKGVTSKELIHISDWLPTLVNLAGGSTNSTKPLDGFDVWKTISEGSPSPRVE 358

Query: 381 ILHNID 386
           +LHNID
Sbjct: 359 LLHNID 364


>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
          Length = 483

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 201/544 (36%), Positives = 288/544 (52%), Gaps = 80/544 (14%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           FL   + K PHI+ I+ADD GWNDV F   S + TPNID LA +G+ILN  YV  +CTPS
Sbjct: 16  FLPYVSAKPPHIVFIVADDYGWNDVGFRNPSVL-TPNIDKLARSGMILNSSYVMPVCTPS 74

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R++ MTG+Y    G+QH VIL  +    PL    LPQ LKE GYATHAIGKWHLGF +  
Sbjct: 75  RNSFMTGQYAFKSGLQHIVILPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWE 134

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            TPT+RGFD+ YGY+ G +DYY+ S         G D R N    N T G YST LY   
Sbjct: 135 CTPTYRGFDTFYGYYNGAEDYYNLSIAG------GKDFRDNRTPVNAT-GEYSTILYARR 187

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK-FLDISDPERRTYAGMVSR 259
           A ++I +H+ SKPM++YL   +VH     EP Q   E  +  +  +    RRTY GMV+ 
Sbjct: 188 AESIIKDHDASKPMYMYLPFQSVH-----EPLQVSLESSSDIYSKVHTESRRTYLGMVTA 242

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE++GN+  AL+  GM  +++++F ADNG    G  +  G+N+PLRG K+T ++GG R 
Sbjct: 243 MDEAIGNITKALKTKGMFNDTLIVFTADNG----GWITYAGNNYPLRGGKTTVFEGGTRA 298

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
            A +    ++++  V   + H  DW+PT+ +AAG    D +LDG++QWD L    ++KR+
Sbjct: 299 TAFVSGAGIQKSNYVYPGMIHAVDWMPTVLSAAG-GTPDPTLDGIDQWDSLRTAGESKRT 357

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
           E ++NID++      +AA+R+ D K + G    G   +W          Y P +V+    
Sbjct: 358 EFIYNIDDLRPEICGHAAIRMGDYKLIDGYP--GAFQDW----------YKPDQVV---- 401

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNS 499
                                    SD    +  D  + + R F      +D        
Sbjct: 402 -------------------------SD----LYEDVSLNDTRPF------FD-------- 418

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
             K  LFN+ DDP E  +L++   D++K+L+ +L  Y   +VP         + P  +  
Sbjct: 419 --KMMLFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSAPTNYGG 476

Query: 560 IWVP 563
            W P
Sbjct: 477 FWTP 480


>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
 gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
 gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
 gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
 gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
          Length = 573

 Score =  331 bits (849), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 202/554 (36%), Positives = 289/554 (52%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 44  PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 102

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 220

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A HN   P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 221 ASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 332 PLLKKKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 392 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L S  G  +N                              E++  +R+        
Sbjct: 438 PPQTLASFPGSWWNL-----------------------------ERMASIRQAV------ 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + 
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANP 509

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 510 RAHPDFNGGAWGPW 523


>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
          Length = 503

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 204/551 (37%), Positives = 289/551 (52%), Gaps = 96/551 (17%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I+ +LADD G++DV +HGS +I TP +DAL+ +G+ L  +YVQ +CTP+RS LM+G+
Sbjct: 33  QPNIVFVLADDFGFHDVGYHGS-EIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGR 91

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QHG+I   +P  LP     L   LKE+GYATH +GKWHLGF+++ Y P  RGF
Sbjct: 92  YQIHTGLQHGIINSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGF 151

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D+++GY    +DY++H+    +   + LD+R N        G YS  L+T  AI+V+  H
Sbjct: 152 DTYFGYLNAAEDYFNHNVP--WRQVRYLDLRDNNGPVRNETGQYSAHLFTGKAIDVVQSH 209

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           N SKP+FLYLA+ +VHA     P + P++   K+ +I+D  RRT+AGMVS LDE V N+ 
Sbjct: 210 NTSKPLFLYLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLT 264

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            AL+  G+  N++++F  DNG     IH+  G+N+PLRG K++ W+GG  GV  +    L
Sbjct: 265 QALKDKGLWNNTVLIFSTDNGGQ---IHAG-GNNYPLRGWKASLWEGGFHGVGFVSGGAL 320

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
           K++  VS  L H+SDW PTL   AG  +N T  LDG NQWD ++    + R  +LHNID 
Sbjct: 321 KRSGAVSKGLIHVSDWFPTLVTLAGGNLNGTKPLDGFNQWDTISNETPSPREILLHNID- 379

Query: 388 VDNPQK------------YYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL 435
           +  PQK              AA+RV D K + G   NG    W    D  +  Y   E+ 
Sbjct: 380 ILYPQKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNG---SWVPPPDGHL--YFVPEIQ 434

Query: 436 YSKAGIT--FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK 493
            S A     FN             ADP  ++       L+ EK LE+    ++   ++N 
Sbjct: 435 ESAAKNVWLFNI-----------TADPNEHND------LSSEKPLEVLRLLQILVQFNN- 476

Query: 494 GAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRAD 553
                                                        T VPP     D R D
Sbjct: 477 ---------------------------------------------TAVPPRYPAPDPRCD 491

Query: 554 PARWNNIWVPW 564
           PA   ++W PW
Sbjct: 492 PALHGDVWGPW 502


>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score =  331 bits (849), Expect = 6e-88,   Method: Compositional matrix adjust.
 Identities = 173/364 (47%), Positives = 231/364 (63%), Gaps = 15/364 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q L TPSRS L+TG
Sbjct: 2   RPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTG 60

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RG
Sbjct: 61  RYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRG 120

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ AI 
Sbjct: 121 FDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIA 180

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+
Sbjct: 181 LITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEA 235

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  +
Sbjct: 236 VGNVTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFV 291

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEIL 382
            SP LKQ    + EL HISDWLPTL   A    N T  LDG + W  +++G+ + R E+L
Sbjct: 292 ASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTKPLDGFDVWKTISEGSPSPRIELL 351

Query: 383 HNID 386
           HNID
Sbjct: 352 HNID 355


>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
          Length = 569

 Score =  331 bits (848), Expect = 7e-88,   Method: Compositional matrix adjust.
 Identities = 201/554 (36%), Positives = 290/554 (52%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 44  PQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLT 102

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHIL 220

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 221 ASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 332 PLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 392 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L +  G  +N                              E++  +R+        
Sbjct: 438 PPQTLATFPGSWWNL-----------------------------ERMASVRQAV------ 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + 
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP 509

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 510 RAHPEFNGGAWGPW 523


>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
 gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
 gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
          Length = 569

 Score =  330 bits (846), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 201/554 (36%), Positives = 290/554 (52%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 44  PQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLT 102

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT R
Sbjct: 103 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 162

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY + A +++
Sbjct: 163 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHIL 220

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 221 ASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 275

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 276 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 331

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 332 PLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHN 391

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 392 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 437

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L +  G  +N                              E++  +R+        
Sbjct: 438 PPQTLATFPGSWWNL-----------------------------ERMASVRQAV------ 462

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + 
Sbjct: 463 -------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP 509

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 510 RAHPDFNGGAWGPW 523


>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
 gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
          Length = 478

 Score =  330 bits (846), Expect = 1e-87,   Method: Compositional matrix adjust.
 Identities = 180/405 (44%), Positives = 240/405 (59%), Gaps = 33/405 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+I+ ILADD GW+D+ +HGS  I TP +D LA  G+ L  +YVQ +C+PSR  LMTG+Y
Sbjct: 1   PNIVFILADDYGWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            I  G+QH VI    P GLPL E  LPQ LKE GY+T+ +GKWHLGFFR+ + P  RGFD
Sbjct: 60  QIRYGLQHSVITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFD 119

Query: 150 SHYGYWQGLQDYYDHSCKATFE----PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
             YGY  G +DY+ H     +      + GLD+R   +      G YST L+ + AI +I
Sbjct: 120 KFYGYLTGGEDYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEII 179

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
             H +SKPMFLYL   AVH+     P +AP E V K+ D+ +P  +TYA MVS +DE+VG
Sbjct: 180 QNHERSKPMFLYLPFQAVHS-----PLEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVG 234

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           NV  AL++  + +N++++F  DNGA       N GSN PLRG K+T W+GG+RGV  + S
Sbjct: 235 NVTDALKRTRLWDNTVLIFSTDNGA-----RRNAGSNWPLRGWKNTLWEGGVRGVGFVTS 289

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHN 384
             L++ ++    L HISDW PTL   +    N T  LDG + WD ++ G  + R EILHN
Sbjct: 290 KLLRKGKRKCDALIHISDWFPTLLRISRSVSNGTKPLDGFDIWDTISSGTPSPRKEILHN 349

Query: 385 I----------------DNVDNPQKYYAALRVDDLKYVAGTDNNG 413
           I                D++ NP   YAA+R  D K + G    G
Sbjct: 350 IDPLHPKTDLSSLDSLHDHIFNPS-VYAAIRSGDWKLLTGDQGLG 393



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 38/60 (63%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI  DP E+N+L+    D++++L ++LA Y  T +PP     D RA+PA   +++ PW
Sbjct: 418 LFNIRSDPQERNDLSVQYPDVVQELLERLAAYNKTALPPFWPSRDPRANPALHGDLFGPW 477


>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 532

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 207/551 (37%), Positives = 294/551 (53%), Gaps = 47/551 (8%)

Query: 7   YF-FALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           YF F     ++ N A +  +TA + PHII IL DDLGW D  FHGS QI TPN+DALAY+
Sbjct: 3   YFKFQQIFIIVLNLAVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYS 62

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           G+ILN+HYV   CTPSRSAL+TG YPI  GMQ   +  G+   LPL+ KL P++LK  GY
Sbjct: 63  GMILNRHYVLPSCTPSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGY 122

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT-----FEPYQGLDMR 179
            TH +GKWHLG+    + P  RGFDS +GY+ G  DY+      T      E + G D+ 
Sbjct: 123 RTHLVGKWHLGYRTINHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLH 182

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEE 238
            N ++       Y+T L+T  A  +I  HN+S+P++LY +HLA H G+     + P D +
Sbjct: 183 RNGEIYQTDKDTYATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDAD 242

Query: 239 VAK-FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH- 296
           V K +  I    RR +AG +  LD+SVG V+ AL++  ML+NSI+L M+DNG  +  +  
Sbjct: 243 VNKTYGHIKHYGRRAFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDL 302

Query: 297 -SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
             N  SN PL G K T ++GG+R VA IWSP L +   ++ +  HI+DWLPTL +AAG  
Sbjct: 303 PPNWSSNWPLGGTKFTLFEGGVRSVALIWSPLLPKGV-INDDFIHITDWLPTLYSAAGGN 361

Query: 356 INDTSL-DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
             +  + DG++QW+ L + +   R  IL NID+    +      R   LK   G    G 
Sbjct: 362 PKELGIFDGIDQWEALKRNSAVARENILLNIDDKTGSEGVIIENRWKVLK--GGYKTYGS 419

Query: 415 SDEWYGDTDN--EIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
            D  +G  +N     KY P +++ S+ G   + LK                     +  L
Sbjct: 420 YDFHWGKPENIHPNPKYDPNDIINSQTGQIVSNLK-------------------HYKNSL 460

Query: 473 TDEKILELREFARVRCN--YDNKGAHCNSTVKP---------CLFNITDDPCEQNNLAES 521
               I  LR+ + V C    +N   +    ++P         CLF+I +D CE N +  +
Sbjct: 461 NHSSIETLRKKSTVTCQPLVNNIPMNDYGFIEPSPEVRCKDFCLFDIINDQCEVNPVENN 520

Query: 522 QTDLLKQLEDK 532
           +      +E +
Sbjct: 521 ELIKFDNVESR 531


>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
          Length = 569

 Score =  329 bits (843), Expect = 3e-87,   Method: Compositional matrix adjust.
 Identities = 200/556 (35%), Positives = 291/556 (52%), Gaps = 90/556 (16%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           + P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L
Sbjct: 42  SPPQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQL 100

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT
Sbjct: 101 LTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPT 160

Query: 145 FRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD+  G   G  DYY + +C        G D+     V     G YST LY + A +
Sbjct: 161 RRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASH 218

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           ++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+
Sbjct: 219 ILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEA 273

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  +
Sbjct: 274 VRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFV 329

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
            SP LK+ ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EIL
Sbjct: 330 HSPLLKRKRRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEIL 389

Query: 383 HNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
           HNID + N  ++               AA+RV + K + G       D  YGD       
Sbjct: 390 HNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD------- 435

Query: 429 YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC 488
           + P + L +  G  +N                              E++  +R+      
Sbjct: 436 WIPPQTLATFPGSWWNL-----------------------------ERMASVRQAV---- 462

Query: 489 NYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF 548
                           LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      
Sbjct: 463 ---------------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAE 507

Query: 549 DKRADPARWNNIWVPW 564
           + RA P      W PW
Sbjct: 508 NPRAHPDFNGGAWGPW 523


>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  328 bits (842), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 175/402 (43%), Positives = 244/402 (60%), Gaps = 32/402 (7%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HI+ ILADD G++D+ +H  S I TPN+D LA  G+ L  +YVQ  CTP+RS LM+G+Y 
Sbjct: 15  HIVFILADDYGFHDIGYH-DSIIKTPNLDRLASEGVKLENYYVQPKCTPTRSQLMSGRYQ 73

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QHG+I   +P  LP+ E  +PQ LKE+GYATH +GKWHLG +++   PT RGFD+
Sbjct: 74  IHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDT 133

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHN 209
            +GY  G +DYY H+   +++ + G+D R NMQ V  +  G YST ++ E A N+I  H+
Sbjct: 134 FFGYLTGSEDYYTHN--RSYDKFHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHD 191

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
              P+FLYL   AVH      P Q PD+    + +I++ +RRTYAGMVS +DE+VGN+  
Sbjct: 192 PQIPLFLYLPLHAVHG-----PLQVPDQYEKPYTNITNKQRRTYAGMVSCMDEAVGNITQ 246

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
            L   G+ +NS+++F  DNG    G  +  G+N PLRG K + W+GGM GV  + SP + 
Sbjct: 247 TLEDAGLYDNSVIIFSTDNG----GQVTAGGNNWPLRGWKGSLWEGGMHGVGFVNSPLIN 302

Query: 330 QTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
              +   S EL H++DW PT+   AG  ++ T  LDG NQW+ ++ G K+ R E+LHNID
Sbjct: 303 HQARGTTSKELIHVTDWFPTIVHLAGGSVSGTQPLDGYNQWETISNGKKSPRKELLHNID 362

Query: 387 ------NVDNPQKYY----------AALRVDDLKYVAGTDNN 412
                    N    Y          AA+RV D K + G   N
Sbjct: 363 PKRTTLQTSNNNMIYRKDMFDASIQAAMRVGDWKILTGNPGN 404



 Score = 40.4 bits (93), Expect = 2.8,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E N+L+ S+ D + +L + L  Y  T VP      +   DP + +  W PW
Sbjct: 430 LFNIAEDPNEYNDLSLSRPDKVNELLEILQQYYKTSVPVHYPDDEPACDPDKNDGAWGPW 489


>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
 gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
 gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
 gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
 gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
          Length = 599

 Score =  328 bits (842), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
          Length = 599

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
 gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
          Length = 598

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
          Length = 569

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 205/578 (35%), Positives = 296/578 (51%), Gaps = 92/578 (15%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
           WA+  F A          F+  + P  PHII IL DD G++DV +HGS  I TP +D LA
Sbjct: 22  WAKPSFVADGPGEAXXXXFIYPSRP--PHIIFILTDDQGYHDVGYHGS-DIETPTLDRLA 78

Query: 63  YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
             G+ L  +Y+Q +CTPSRS L+TG+Y IH G+QH +I   +P  LPL +  LP  L+EA
Sbjct: 79  AKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPHKLQEA 138

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHN 181
           GY+TH +GKWHLGF+R+   PT RGFD+  G   G  DYY + +C        G D+   
Sbjct: 139 GYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEG 196

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
             V     G YST LY + A +++A H+  +P+FLY+A  AVH      P Q+P E + +
Sbjct: 197 ENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYR 251

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           +  + +  RR YA MV+ +DE+V N+  AL+++G   NS+++F +DNG  +F    + GS
Sbjct: 252 YRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGS 307

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-S 360
           N PLRG K T W+GG+RG+  + SP LK+ Q+ S  L HI+DW PTL   AG   +    
Sbjct: 308 NWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADG 367

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYV 406
           LDG + W  +++G  + R+EILHNID + N  ++               AA+RV + K +
Sbjct: 368 LDGYDVWPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLL 427

Query: 407 AGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSD 466
            G       D  YGD       + P + L +  G  +N                      
Sbjct: 428 TG-------DPGYGD-------WIPPQTLATFPGSWWNL--------------------- 452

Query: 467 ALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLL 526
                   E++  +R+                      LFNI+ DP E+ +LA  + D++
Sbjct: 453 --------ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVV 485

Query: 527 KQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           + L  +LA Y  T +P      + RA P      W PW
Sbjct: 486 RTLLARLAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 523


>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
          Length = 517

 Score =  328 bits (841), Expect = 5e-87,   Method: Compositional matrix adjust.
 Identities = 171/366 (46%), Positives = 233/366 (63%), Gaps = 24/366 (6%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A   PH++ +LADDLGWND+ FHGS  I TP++DALA  G++L+ +YVQ LCTPSRS L+
Sbjct: 36  AAPPPHVVFVLADDLGWNDLGFHGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLL 94

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH+G+QH +I+  +P  +PL EKLLPQ LK+AGYATH +GKWHLG +R+   PT 
Sbjct: 95  TGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRKECLPTR 154

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H   A  E   G    LD+R   +   +   IYST+++T+ A
Sbjct: 155 RGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRA 214

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +IA H   K              + ++P Q P+E +  +  I D  RR YAGMVS LD
Sbjct: 215 TTLIANHPPEK--------------SVHDPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLD 260

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+VGNV  AL+  G+  N++++F  DNG    G   + G+N PLRG K T W+GG+RG  
Sbjct: 261 EAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAG 316

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            + SP LKQ    S EL HI+DWLPTL   AG   + T  LDG + W+ +++G+ + R E
Sbjct: 317 FVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVE 376

Query: 381 ILHNID 386
           +L NID
Sbjct: 377 LLLNID 382



 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 458 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 516


>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
          Length = 596

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWYDELDKQK 572
              +W PWY E  K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562


>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 199/559 (35%), Positives = 297/559 (53%), Gaps = 73/559 (13%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F+L   LL   A L ++A   P+II+ + DDLGWND S+ GS  I TPNID LA  G+ L
Sbjct: 4   FSLVLFLLAFSAPLVSSA-NSPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRL 61

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
            Q+YVQ LC+PSRSAL+ GKYP H+G+ HGVI  G P+GL L E  +  +LK+ GY+THA
Sbjct: 62  KQYYVQPLCSPSRSALLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHA 121

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +GKW LG  +  +TPT+RGFD+ YGY+   +DYY H           LD R+N       
Sbjct: 122 VGKWDLGMHKWEFTPTYRGFDTFYGYYDADEDYYTHKVGGY------LDFRNNTDPVKDE 175

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
            G YST L+T+A  + I   + S P F+Y A+ +VH+     P +APD  + K      P
Sbjct: 176 DGTYSTFLFTKAIEDAINAKSDS-PFFIYGAYQSVHS-----PLEAPDTYLEK-CHSPYP 228

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            R+ + GM+  LDE + N+ + L+  G+L+++I++   DNG    G  +   SN PLRG 
Sbjct: 229 NRKIFCGMMLALDEGISNITSLLQTKGLLDDTIIILTTDNG----GQTALGSSNWPLRGN 284

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA-AGIEINDTSLDGVNQW 367
           K+T ++GG+RG++ +WS  L+++   ++ + HI+DW PT+    AG+ ++   LDG N W
Sbjct: 285 KATVFEGGVRGISFVWSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMTLDTKGLDGFNMW 344

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
           D +     + R+EILH +D    P KY      D+     G     Q D W         
Sbjct: 345 DTINDDTPSPRTEILHQLD----PPKY------DERTQFIGQAALRQGD-W--------- 384

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
                                KL I Q + ++            L    + +L     + 
Sbjct: 385 ---------------------KLIIGQPNCSEK-----------LGPHTVYDLCPIGWIH 412

Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKP 547
            N   +    N ++   LFN+ DDP E+N L+ S  +++ +L++ +  Y +T +   + P
Sbjct: 413 LNGTIEEPEVNPSLT-WLFNVKDDPNERNELSASHPEIVSKLKELIEAYNATHIVQMDPP 471

Query: 548 FDKRADPARWNNIWVPWYD 566
            D R+DP ++ NIW PW D
Sbjct: 472 IDPRSDPKKFGNIWTPWLD 490


>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
          Length = 382

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 165/375 (44%), Positives = 235/375 (62%), Gaps = 15/375 (4%)

Query: 12  TCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
           T TL+F   +L  +   KPH++ I+ADDLGWNDV F  + Q+ TP++D LA  G+ILN  
Sbjct: 9   TFTLIF--VYLYGSQAAKPHVVFIVADDLGWNDVGFR-NPQVLTPHLDKLAKAGVILNSS 65

Query: 72  YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           YVQ LC+PSR+  M+G +P H G+Q GVI    P  +P+    LPQ LKE GY+THA+GK
Sbjct: 66  YVQPLCSPSRNCFMSGYFPYHTGLQDGVIRPASPGFVPIKFTFLPQKLKELGYSTHAVGK 125

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLGF    YTPT+RGFD+  GY+ G +DYY H+ +  ++ + G D+R N  V  +  G 
Sbjct: 126 WHLGFCNLKYTPTYRGFDTFVGYYIGAEDYYKHTRE--YDKFSGYDLRFNTSVYTEAKGK 183

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           YST ++ E A+++I  H+   P++LYL   AVHA     P + P E    +  I D  RR
Sbjct: 184 YSTRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPEYENLYKHIHDLPRR 238

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
           TY GM+S LDE+V N+  AL + G+++N +++F  DNG P  G      +N PLRG K+T
Sbjct: 239 TYCGMISALDEAVANITNALEETGLIDNLLLVFTTDNGGPFNG----AANNLPLRGCKAT 294

Query: 312 PWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            W+GG +G   I+S   LK+T  +++ + H  DW PT    AG E  D ++DG++Q+D+L
Sbjct: 295 LWEGGTKGAGFIYSKTLLKKTGYLNTGMMHAVDWYPTFVELAGGENTDPNMDGISQYDML 354

Query: 371 TKGAKTKRSEILHNI 385
             G  +KR E ++NI
Sbjct: 355 INGGPSKRKEFVYNI 369


>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
          Length = 596

 Score =  328 bits (840), Expect = 6e-87,   Method: Compositional matrix adjust.
 Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWYDELDKQK 572
              +W PWY E  K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562


>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
          Length = 597

 Score =  328 bits (840), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD           +  + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
          Length = 596

 Score =  328 bits (840), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 68/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWYDELDKQK 572
              +W PWY E  K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562


>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
          Length = 575

 Score =  328 bits (840), Expect = 7e-87,   Method: Compositional matrix adjust.
 Identities = 196/548 (35%), Positives = 288/548 (52%), Gaps = 63/548 (11%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           A L  T  + PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTP
Sbjct: 39  AKLQGTPSRPPHIIFILTDDQGFHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTP 97

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SRS L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++
Sbjct: 98  SRSQLITGRYQIHTGLQHSIIRPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKK 157

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
              PT RGFD+  G   G  DYY + +C        G D+     V     G YST LY 
Sbjct: 158 ECLPTRRGFDTFLGSLTGNVDYYSYDNCDG--PGVCGFDLHEGENVAWDQAGKYSTLLYA 215

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN  +P+F+Y+A  AVH      P Q+P E +  +  + +  RR YA MV+
Sbjct: 216 QRVNQILASHNPQQPIFIYVAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVT 270

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            +D +V N+  AL+K+G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+R
Sbjct: 271 CMDAAVKNITRALKKYGYYSNSVIVFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVR 326

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
           G+  + SP +K+ ++ S  L HI+DW PTL   AG  I+    LDG + W  +++G ++ 
Sbjct: 327 GLGFVHSPLIKKKRRTSRALMHITDWYPTLVKLAGGNISSAEGLDGYDVWGAISEGKESP 386

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           R+EILHNID + N  K             +G+   GQ                       
Sbjct: 387 RTEILHNIDPLYNLAK-------------SGSLEEGQ----------------------- 410

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
             G+   A++  +++K           SD +            +       ++ N   H 
Sbjct: 411 --GVWNTAVQASIRVKDFKLLTGDPGYSDWIPP----------QTLTNFPGSWWNLERHT 458

Query: 498 NSTVKPC-LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
           +   K   LFNIT DP E+++L+ S+ +++K+L  +L  Y  T +P      D R +P  
Sbjct: 459 DGARKSLWLFNITADPFERHDLSSSKPEVVKELLIRLGYYNRTAIPVRYPAEDPRGNPEL 518

Query: 557 WNNIWVPW 564
               W PW
Sbjct: 519 NGGAWGPW 526


>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
          Length = 599

 Score =  327 bits (839), Expect = 8e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
          Length = 599

 Score =  327 bits (839), Expect = 9e-87,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 293/557 (52%), Gaps = 70/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 414 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 435

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD           +  + F+ +  N + N+  
Sbjct: 436 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 546 LNGGVWGPWYKEETKKK 562


>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
          Length = 601

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 204/553 (36%), Positives = 290/553 (52%), Gaps = 68/553 (12%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  +
Sbjct: 74  ATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFI 132

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT 
Sbjct: 133 TGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTK 192

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAA 201
           RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT+  
Sbjct: 193 RGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDSAAWDHDN---GIYSTQMYTQRV 248

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S LD
Sbjct: 249 QQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLD 303

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+V NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R V 
Sbjct: 304 EAVSNVTLALKTYGFYNNSILIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIRAVG 359

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSE 380
            + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++    
Sbjct: 360 FVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP--- 416

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
                             RVD L  +       ++  W                  +  G
Sbjct: 417 ------------------RVDILHNIDPIYTKAKNGSWA-----------------AGYG 441

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNS 499
           I   A+++ ++++           SD           +  + F+ +  N + N+    ++
Sbjct: 442 IWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERITLST 491

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
                LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P     
Sbjct: 492 GKSIWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGG 551

Query: 560 IWVPWYDELDKQK 572
           +W PWY E +K+K
Sbjct: 552 VWGPWYKEENKKK 564


>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
          Length = 593

 Score =  327 bits (838), Expect = 1e-86,   Method: Compositional matrix adjust.
 Identities = 203/548 (37%), Positives = 291/548 (53%), Gaps = 70/548 (12%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  +T
Sbjct: 67  PSQPHLIFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFIT 125

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT R
Sbjct: 126 GKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKR 185

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAI 202
           GFD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+   
Sbjct: 186 GFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQRVQ 241

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            ++A HN +KP+FLY+A+ AVH+     P QAP +    +  I +  RR YA M+S LDE
Sbjct: 242 QILASHNPTKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDE 296

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+R V 
Sbjct: 297 AIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVG 351

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSE 380
            + SP LK    V  EL HI+DW PTL + A  +I+ DT LDG + W+ +++G ++ R  
Sbjct: 352 FVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDTQLDGYDVWETISEGLRSPR-- 409

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
               +D + N    Y   +  +  + AG                               G
Sbjct: 410 ----VDILHNIDPIYT--KAKNGSWAAGY------------------------------G 433

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNS 499
           I   A+++ ++++           SD +            + F+ +  N + N+    ++
Sbjct: 434 IWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERITLST 483

Query: 500 TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNN 559
                LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P     
Sbjct: 484 GKSIWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGG 543

Query: 560 IWVPWYDE 567
           +W PWY E
Sbjct: 544 VWGPWYKE 551


>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
 gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
 gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
          Length = 573

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 200/550 (36%), Positives = 288/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             KP+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQKPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 KKRRTSRALVHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L S  G  +N                              E++  +R+            
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASIRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA+ + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
          Length = 569

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDMWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
          Length = 601

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 203/556 (36%), Positives = 292/556 (52%), Gaps = 70/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 71  STTSASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 129

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 130 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 189

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT
Sbjct: 190 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDSDNGIYSTQMYT 245

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLYLA+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 246 QRVQQILASHNPTKPLFLYLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 300

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 301 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 355

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  +DG + W+ +++G ++
Sbjct: 356 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQIDGYDIWETISEGLRS 415

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 416 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 437

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 438 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 487

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 488 TLSTGKSIWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 547

Query: 556 RWNNIWVPWYDELDKQ 571
               +W PWY E  K+
Sbjct: 548 LNGGVWGPWYKEETKK 563


>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
          Length = 571

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 199/559 (35%), Positives = 290/559 (51%), Gaps = 92/559 (16%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +   P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS
Sbjct: 40  SVAPPQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRS 98

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   
Sbjct: 99  QLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECL 158

Query: 143 PTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           PT RGFD+  G   G  DYY + +C        G D+     V     G YST LY +  
Sbjct: 159 PTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRV 216

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +D
Sbjct: 217 SHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMD 271

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+ 
Sbjct: 272 EAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLG 327

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--IEINDTSLDGVNQWDVLTKGAKTKRS 379
            + SP LK+ ++ S  L HI+DW PTL   AG  + + D  LDG + W  +++G  + R+
Sbjct: 328 FVHSPLLKRKRRTSRALVHITDWYPTLVGLAGGTVSVAD-GLDGYDVWPAISEGRASPRT 386

Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
           EILHNID + N  ++               AA+RV + K + G       D  YGD    
Sbjct: 387 EILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD---- 435

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
              + P + L +  G  +N                              E++   R+   
Sbjct: 436 ---WIPPQTLAAFPGSWWNL-----------------------------ERMASARQAV- 462

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
                              LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P   
Sbjct: 463 ------------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRY 504

Query: 546 KPFDKRADPARWNNIWVPW 564
              + RA P      W PW
Sbjct: 505 PAENPRAHPDFNGGAWGPW 523


>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
 gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
 gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
 gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
 gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
          Length = 569

 Score =  325 bits (834), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
 gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
          Length = 569

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + Q+ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
          Length = 578

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 202/556 (36%), Positives = 291/556 (52%), Gaps = 68/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 415 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 436

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 437 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++      FNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWPFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWYDELDKQK 572
              +W PWY E  K+K
Sbjct: 547 NGGVWGPWYKEETKKK 562


>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
          Length = 605

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 202/556 (36%), Positives = 292/556 (52%), Gaps = 70/556 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 75  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 133

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 134 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 193

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 194 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 249

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 250 QRVQKILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 304

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 305 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 359

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  +DG + W+ +++G ++
Sbjct: 360 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQIDGYDIWETISEGLRS 419

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 420 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 441

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 442 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 491

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 492 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 551

Query: 556 RWNNIWVPWYDELDKQ 571
               +W PWY E  K+
Sbjct: 552 LNGGVWGPWYKEETKK 567


>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
          Length = 597

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 204/557 (36%), Positives = 294/557 (52%), Gaps = 71/557 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 68  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 126

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 127 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 186

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 187 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 242

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 243 QRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 297

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 298 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 352

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 353 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 412

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 413 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 434

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 435 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 484

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP  + P D R++P 
Sbjct: 485 TLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPV-SIPQDPRSNPR 543

Query: 556 RWNNIWVPWYDELDKQK 572
               +W PWY E  K+K
Sbjct: 544 LNGGVWGPWYKEETKKK 560


>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
          Length = 565

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 199/554 (35%), Positives = 287/554 (51%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 38  PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 96

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+E GY+TH +GKWHLGF+R+   PT R
Sbjct: 97  GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 156

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY +   +++
Sbjct: 157 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 214

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 215 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 269

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL++HG   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 270 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 325

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++ S  L HI+DW PTL A AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 326 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 385

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 386 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 431

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L +  G  +N                              E++   R+        
Sbjct: 432 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 456

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + 
Sbjct: 457 -------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP 503

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 504 RAHPDFNGGAWGPW 517


>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
          Length = 574

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 199/554 (35%), Positives = 287/554 (51%), Gaps = 90/554 (16%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 47  PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 105

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+E GY+TH +GKWHLGF+R+   PT R
Sbjct: 106 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 165

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY +   +++
Sbjct: 166 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 223

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 224 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 278

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL++HG   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 279 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 334

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++ S  L HI+DW PTL A AG   +    LDG + W  +++G  + R+EILHN
Sbjct: 335 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 394

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 395 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 440

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L +  G  +N                              E++   R+        
Sbjct: 441 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 465

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + 
Sbjct: 466 -------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP 512

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 513 RAHPDFNGGAWGPW 526


>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
          Length = 476

 Score =  325 bits (833), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 178/397 (44%), Positives = 248/397 (62%), Gaps = 28/397 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+I++I+ADDLG +DV +HGS  I TPNID LAY G+ L  +YVQ +CTPSRS LMTG+Y
Sbjct: 1   PNILLIVADDLGHHDVGYHGSV-IKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  +PL   +LP+ LK+ GY+TH +GKWHLGF+++   P  RGFD
Sbjct: 60  QIHTGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFD 119

Query: 150 SHYGYWQGLQDYYDH-SCKA---TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           S+YGY  G +DY+ H  C A     +   G+D+R++ +VD    G YST L+ E A +V+
Sbjct: 120 SYYGYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVV 179

Query: 206 ---AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
              A H   +P+FLY+A  AVHA     P Q P+E +  + DI DP+RR  AGMV+ +DE
Sbjct: 180 RKHAVHQPDQPLFLYVAFQAVHA-----PNQVPNEYLKPY-DIDDPKRRLLAGMVTCMDE 233

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           +VGN+ AA +  G+ +N++V+F  DNG     ++ N G+N PLRG K + W+GGMRGV  
Sbjct: 234 AVGNITAAFKDTGLWDNTVVVFTTDNGG---DVNENSGNNWPLRGWKHSLWEGGMRGVGF 290

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEI 381
           + SP LK      + L H+SDW PTL   AG   +  S LDG + W  + +   + R E+
Sbjct: 291 VNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSISGLDGHDIWKSIAEREDSPRKEL 350

Query: 382 LHNIDNVDNPQ----------KYYAALRVDDLKYVAG 408
           LHNID +  P+          +  +ALRV D K + G
Sbjct: 351 LHNIDPLYPPRGSKLNHTYDVRVRSALRVGDFKIITG 387



 Score = 42.7 bits (99), Expect = 0.54,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFN+ +DPCE  +L+ +   +L  + DKL  Y +T VP      D R++P      + PW
Sbjct: 417 LFNVHEDPCETTDLSVALPGVLTMMLDKLLAYNATAVPCVFPDDDPRSNPEEHGGAFGPW 476


>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 491

 Score =  325 bits (832), Expect = 5e-86,   Method: Compositional matrix adjust.
 Identities = 193/552 (34%), Positives = 291/552 (52%), Gaps = 87/552 (15%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
            ++   P+II+ + DDLGWND S+ GS  I TPNID LA  G+ L Q+YVQ LC+PSRSA
Sbjct: 18  VSSANNPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSA 76

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+ GKYP H+G+ HGVI  G P+GL L E  +  +LK+ GY+THA+GKW LG  +  +TP
Sbjct: 77  LLAGKYPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTP 136

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           T+RGFD+ YGY+   +DYY H           LD R+N        G YST L+T+A  +
Sbjct: 137 TYRGFDTFYGYYDADEDYYTHKVGGY------LDFRNNTDPVKDEDGTYSTFLFTKAIED 190

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I   + S P F+Y A+ +VH      P +APD  + K  +I  P R+ + GM+  LDE 
Sbjct: 191 AINAKSDS-PFFIYGAYQSVHG-----PLEAPDIYLNK-CNIPYPNRKIFCGMMLALDEG 243

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           + N+ + L+  G+L+++I++   DNG    G  +   SN PLRG K+T ++GG+RG++ +
Sbjct: 244 ISNITSLLQTKGLLDDTIIILTTDNG----GQTALGSSNWPLRGNKATVFEGGVRGISFV 299

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAA-AGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           WS  L+++   ++ + HI+DW PT+    AG+ ++   LDG N WD +     + R+EIL
Sbjct: 300 WSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMTLDTKGLDGFNMWDTINDDTPSPRTEIL 359

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
           H +    +P +Y                            DNE + +  +  L       
Sbjct: 360 HQL----DPPRY----------------------------DNERNPFIGQAALRQG---- 383

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVK 502
                 KL I Q        N S+ +   L   K               N   H N T++
Sbjct: 384 ----DWKLIIGQ-------PNCSEKIGPHLVGNKC-------------PNGWVHLNGTIE 419

Query: 503 P--------CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFN+ DDP E+N L+ S  +++ +L++++  Y +T +   + P D R+DP
Sbjct: 420 EPEVNPSLTWLFNVKDDPNERNELSASHPEIVSKLKERIEAYNATHIVQMSPPIDPRSDP 479

Query: 555 ARWNNIWVPWYD 566
            ++ N+W PW D
Sbjct: 480 KKFGNVWTPWLD 491


>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
          Length = 571

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKRRTSRALVHITDWYPTLVGLAGGSASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L S  G  +N                              E++  +R+            
Sbjct: 442 LASFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
          Length = 573

 Score =  324 bits (831), Expect = 7e-86,   Method: Compositional matrix adjust.
 Identities = 197/553 (35%), Positives = 286/553 (51%), Gaps = 90/553 (16%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG
Sbjct: 47  QQPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTG 105

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L++ GYATH +GKWHLGF+R+   PT RG
Sbjct: 106 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRG 165

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +    ++A
Sbjct: 166 FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILA 223

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 224 GHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRN 278

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 279 ITGALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 334

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            LK+T++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNI
Sbjct: 335 LLKRTRRTSRALLHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNI 394

Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
           D + N  ++               AA+RV + K + G       D  YGD       + P
Sbjct: 395 DPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 440

Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
            + L +  G  +N                              E++   R+         
Sbjct: 441 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 464

Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
                        LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + R
Sbjct: 465 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 512

Query: 552 ADPARWNNIWVPW 564
           A P      W PW
Sbjct: 513 AHPDFNGGAWGPW 525


>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
          Length = 572

 Score =  324 bits (831), Expect = 8e-86,   Method: Compositional matrix adjust.
 Identities = 198/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKRRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LATFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLAAYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 EFNGGAWGPW 523


>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
          Length = 599

 Score =  324 bits (830), Expect = 9e-86,   Method: Compositional matrix adjust.
 Identities = 203/552 (36%), Positives = 290/552 (52%), Gaps = 70/552 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TTA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N +     DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDRAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M++
Sbjct: 244 QRVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLA 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL  +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 299 CLDEAINNVTLALETYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 353

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL   A  +I+ D  LDG + W+ +++G ++
Sbjct: 354 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLITLAEGQIDEDVQLDGYDIWETISEGLRS 413

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R      +D + N    Y   +  +  + AG                            
Sbjct: 414 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 438

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
              GI   A+++ ++++           SD           +  + F+ +  N + N+  
Sbjct: 439 ---GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERI 485

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 486 TLSTGKTIWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 545

Query: 556 RWNNIWVPWYDE 567
               +W PWY E
Sbjct: 546 LNGGVWGPWYKE 557


>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
          Length = 573

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 200/543 (36%), Positives = 285/543 (52%), Gaps = 68/543 (12%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  +TGK
Sbjct: 49  QPHIVFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGK 107

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R    PT RGF
Sbjct: 108 YQIHTGLQHSIIRPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 167

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
           D+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    +
Sbjct: 168 DTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 223

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A HN  KP+FLY+A+ AVH+     P QAP +    +  I++  RR YA M++ LDE++
Sbjct: 224 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAI 278

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
            NV  AL+K+G  +NSI+++ +DNG          GSN PLRG K T W+GG+R V  + 
Sbjct: 279 NNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 334

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILH 383
           SP LK    V  EL HI+DW PTL   A  +I+ D  LDG + W+ +++G ++ R     
Sbjct: 335 SPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSPR----- 389

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
            +D + N    Y   +  +  + AG                               GI  
Sbjct: 390 -VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIWN 416

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
            A+++ +++            SD +            + F+ V  N + N+    ++   
Sbjct: 417 TAIQSAIRVNHWKLLTGNPGYSDWVPP----------QAFSNVGPNRWHNERVSWSAGKT 466

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
             LFNIT DP E+ +L+    D++KQL  +L+ +  T VP    P D R++P     +W 
Sbjct: 467 VWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWG 526

Query: 563 PWY 565
           PW+
Sbjct: 527 PWF 529


>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
          Length = 358

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 166/348 (47%), Positives = 223/348 (64%), Gaps = 15/348 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PH+++ILADDLGWNDV FHGS +I TP++D LA  G+ L+ +YVQ LCTPSR+ LMTG
Sbjct: 21  RPPHVVLILADDLGWNDVGFHGS-EIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTG 79

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y I  G+QH +I   +P+ +PL EKLLPQ L+E GY TH +GKWHLG F++   PT RG
Sbjct: 80  RYQIRTGLQHQIIWPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRG 139

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           F S +GY  G +DYY H   +   P       LD+R    V     G YST+L TE A +
Sbjct: 140 FQSFFGYLTGSEDYYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATH 199

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I +H   +P+FLY+A  AVHA     P Q PD  +A +  I DP RR YAGMVS +DE+
Sbjct: 200 IITQHTPDQPLFLYVALQAVHA-----PLQVPDHYIAPYSFIQDPHRRRYAGMVSAMDEA 254

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGN+   L++ G+ +N++++F  DNG  +       G+N PLRG K T W+GG+RGVA +
Sbjct: 255 VGNITHTLQETGLWDNTVLIFSTDNGGQTL----YGGNNWPLRGRKWTLWEGGVRGVAFV 310

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVL 370
             P +++   V+ EL HISDWLPT+   AG   N T  LDG + W  +
Sbjct: 311 SGPLIEKPGAVNRELIHISDWLPTIAGLAGASTNGTKPLDGFDVWQSI 358


>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
          Length = 607

 Score =  323 bits (829), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 286/550 (52%), Gaps = 68/550 (12%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  +TGK
Sbjct: 83  QPHIVFILADDQGFRDVGYHGS-EIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGK 141

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT RGF
Sbjct: 142 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 201

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
           D+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    +
Sbjct: 202 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 257

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A H+  KP+FLY+A+ AVH+     P QAP +    +  I +  RR YA M+S LDE++
Sbjct: 258 LASHDPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 312

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
            NV  AL+++G  +NSI+++ +DNG          GSN PLRG K T W+GG+R V  + 
Sbjct: 313 NNVTLALKRYGFYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 368

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           SP LK    +  EL HI+DW PTL   A G    DT LDG + W+ +++G ++       
Sbjct: 369 SPLLKNKGTICKELVHITDWYPTLITLAEGWIDEDTQLDGYDIWETISEGRRSP------ 422

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
                          RVD L  +       ++  W                  +  GI  
Sbjct: 423 ---------------RVDILHNIDPIYTKAKNGSWA-----------------AGFGIWN 450

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
            A+++ ++++           SD           +  + F+    N + N+    ++   
Sbjct: 451 TAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNAGPNRWHNERISWSTGKS 500

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
             LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P     +W 
Sbjct: 501 VWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWG 560

Query: 563 PWYDELDKQK 572
           PWY E DK++
Sbjct: 561 PWYKEDDKKR 570


>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
          Length = 573

 Score =  323 bits (828), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 286/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 50  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 168

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 169 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHS 226

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 227 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 281

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NSI++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 282 ALKRYGFYNNSIIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 337

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 338 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 397

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 398 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 443

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 444 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 464

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 465 ---------WLFNISADPYEREDLAGQRPDVVRALLARLAYYNRTAIPVRYPAENPRAHP 515

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 516 DFNGGAWGPW 525


>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
          Length = 573

 Score =  323 bits (828), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 201/550 (36%), Positives = 286/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 49  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDT 167

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 225

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  +S+  RR YA MV+ +DE+V N+  
Sbjct: 226 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITW 280

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NSI++F +DNG  +F    + GSN PLRG K T W+GG+RG+  I SP LK
Sbjct: 281 ALKRYGFYNNSIIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLK 336

Query: 330 QTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   A G       LDG + W  +++G  + R+EILHNID +
Sbjct: 337 RKRRTSHALVHITDWYPTLVGLARGTVSAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 397 YNHARHGSLEGGFGIWNTAIQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHP 514

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 515 DFNGGAWGPW 524


>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 498

 Score =  323 bits (828), Expect = 1e-85,   Method: Compositional matrix adjust.
 Identities = 202/559 (36%), Positives = 300/559 (53%), Gaps = 77/559 (13%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
             A    +L + + +  +  +KPH++ I+ADDLGWND+ ++  S I TP +D LA  G+I
Sbjct: 11  MLAFALVVLASMSLVICSGSRKPHVVFIVADDLGWNDIGYNNPS-IFTPTLDKLAREGVI 69

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           LNQ YV  +CTP R++LM+G Y   +G+QH V+   EP GLPL   L+PQ +KE GY T+
Sbjct: 70  LNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVLDHAEPAGLPLNFTLIPQRMKEHGYTTY 129

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLGF +  YTPT+RGFD  YG++   +DY++H+        + LD+R+  +VD  
Sbjct: 130 MLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAEDYFNHTTS------KYLDLRNGKEVDWS 183

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G YST +Y E A   IA HNKS PM++YL   +VH        +AP + +  +  I D
Sbjct: 184 KNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHG-----VIEAPQKYLDMYTFIHD 238

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR  + MV+ +D++V  VIAAL ++ + +N++++F  DNG P+    +N G+N PLRG
Sbjct: 239 TNRRIKSAMVTAMDDAVKVVIAALERYELWDNTLLVFTTDNGGPA--NPNNAGNNWPLRG 296

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
            K T W+GG RGV  +    L++   V++E+ H++DW PTL   AG +  D+ +DG+N W
Sbjct: 297 SKLTLWEGGTRGVGFVHGKMLEKKGYVNNEMMHVTDWYPTLLHIAGGKA-DSDMDGLNVW 355

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
           D L KG  + R E ++NID ++      AA+RV D K + G    G  D W         
Sbjct: 356 DTLNKGYPSPRKEFIYNIDEIEPTG---AAIRVGDYKLIVG--KAGHPDGWI-------- 402

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
                                             A + D +      E+IL + E+  + 
Sbjct: 403 ---------------------------------PAPTIDGVW-----EEILPMNEYEGL- 423

Query: 488 CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP---PG 544
              D + +  N T    LFNI  DP E+NNLAE   + + +L  +L   K  +VP   P 
Sbjct: 424 --LDGRLSKDNDTY---LFNIKADPTERNNLAEQMPEKVAELSARLEELKQKLVPAIRPA 478

Query: 545 NKPFDKRADPARWNNIWVP 563
             P  K +DP ++   W P
Sbjct: 479 PDP--KGSDPRKFGGAWSP 495


>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 784

 Score =  323 bits (828), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 182/425 (42%), Positives = 254/425 (59%), Gaps = 37/425 (8%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L   L+     ++    +KPHII ILADDLGW+DV +HGS  I TPNIDALA  G+ L+ 
Sbjct: 23  LVYVLILEVTDIHVNGRQKPHIIFILADDLGWSDVGYHGSV-IKTPNIDALASEGVKLDN 81

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           +Y+  LCTPSR  L++G+Y IH G+QH  I   +P  LP+ E +LPQ LKE GYATH +G
Sbjct: 82  YYMSLLCTPSRGQLLSGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKERGYATHMVG 141

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTI 189
           KWHLGF+R+   P +RGFD+  G++QG+ DYY H+       Y G D R N  V   K  
Sbjct: 142 KWHLGFYRKECLPNYRGFDTFMGFYQGMADYYYHNISTGI--YHGWDFRRNNDVIAQKYA 199

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--EVAKFLDISD 247
           G YST ++ + A  +I +HN   P+FL+L+  A+H      P Q P    ++ K L  ++
Sbjct: 200 GQYSTHVFADEAQIIIMKHNPEVPLFLFLSFQAIHL-----PLQVPSRYADMYKTLIPNN 254

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
            +R+ YAGMV+ +DE+VG V+ AL+  G+ ENS+++F  DNG    GI    G N PLRG
Sbjct: 255 ADRQKYAGMVTCMDEAVGRVVTALKHTGLWENSVLVFSTDNG----GI---LGGNWPLRG 307

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGV 364
            K++ ++GG+RGV  + SP L    +  V+ EL H++DWLPTL   AG ++  T  LDG 
Sbjct: 308 SKASLYEGGVRGVGFVTSPLLPSRVRGTVNRELLHMTDWLPTLVRLAGGDLMGTKPLDGY 367

Query: 365 NQWDVLTKGAKTKRSEILHNIDNV--------DNPQK--------YYAALRVDDLKYVAG 408
           NQW+ +++G  + R E+L  ID +        D+P            AA+RV D K + G
Sbjct: 368 NQWETISQGTPSVRDEVLLTIDPLIAKAHHFKDDPYADNDSFNVLIRAAIRVGDWKLITG 427

Query: 409 TDNNG 413
              NG
Sbjct: 428 PPGNG 432



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/63 (33%), Positives = 35/63 (55%), Gaps = 5/63 (7%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP---PGNKPFDKRADPARWNNIW 561
           L+N+  DP E+N+ +  + D+++ L  ++  Y +T VP   P  +     ADPAR   +W
Sbjct: 456 LYNVVADPNERNDYSHDRPDIVRFLLARIGHYYNTSVPVLDPSQEIL--LADPARHGGVW 513

Query: 562 VPW 564
            PW
Sbjct: 514 SPW 516


>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
          Length = 573

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 199/547 (36%), Positives = 284/547 (51%), Gaps = 68/547 (12%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           T   +PHI+ ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  
Sbjct: 45  TPSSQPHIVFILADDQGFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQF 103

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R    PT
Sbjct: 104 ITGKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPT 163

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEA 200
            RGFD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+ 
Sbjct: 164 QRGFDTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQK 219

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
              ++A HN  KP+FLY+A+ AVH+     P QAP +    +  I++  RR YA M++ L
Sbjct: 220 VQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACL 274

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE++ NV  AL+K+G  +NSI+++ +DNG          GSN PLRG K T W+GG+R V
Sbjct: 275 DEAINNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAV 330

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRS 379
             + SP LK    V  EL HI+DW PTL   A  +I+ D  LDG + W+ +++G ++   
Sbjct: 331 GFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSP-- 388

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKA 439
                              RVD L  +       ++  W                  +  
Sbjct: 389 -------------------RVDILHNIDPIYTKAKNGSWA-----------------AGY 412

Query: 440 GITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCN 498
           GI   A+++ +++            SD +            + F+ +  N + N+    +
Sbjct: 413 GIWNTAIQSAIRVNHWKLLTGNPGYSDWVPP----------QAFSNMGPNRWHNERVSWS 462

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
           +     LFNIT DP E+ +L+    D++KQL  +L+ +  T VP    P D R++P    
Sbjct: 463 AGKTVWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNG 522

Query: 559 NIWVPWY 565
            +W PW+
Sbjct: 523 GVWGPWF 529


>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
          Length = 572

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 198/550 (36%), Positives = 287/550 (52%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 48  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 106

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 107 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 166

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 167 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 224

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 225 PQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 279

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 280 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 335

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 336 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 395

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 396 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 441

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++  +R+            
Sbjct: 442 LAAFPGSWWNL-----------------------------ERMASVRQAV---------- 462

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 463 ---------WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHP 513

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 514 DFNGGAWGPW 523


>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
          Length = 595

 Score =  323 bits (827), Expect = 2e-85,   Method: Compositional matrix adjust.
 Identities = 201/548 (36%), Positives = 288/548 (52%), Gaps = 68/548 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TTA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           R      +D + N    Y   +  +  + AG                             
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIS 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPW 564
              +W PW
Sbjct: 547 NGGVWGPW 554


>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
          Length = 599

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 202/552 (36%), Positives = 290/552 (52%), Gaps = 70/552 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           + TA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 70  SVTAVSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 128

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 129 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 188

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 189 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 244

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 245 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 299

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 300 CLDEAINNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 354

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I++   LDG + W+ +++G ++
Sbjct: 355 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRS 414

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R      +D + N    Y   +  +  + AG                            
Sbjct: 415 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 439

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
              GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 440 ---GIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 486

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P 
Sbjct: 487 TLSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 546

Query: 556 RWNNIWVPWYDE 567
               +W PWY E
Sbjct: 547 LNGGVWGPWYKE 558


>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
 gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
          Length = 598

 Score =  322 bits (826), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 202/549 (36%), Positives = 289/549 (52%), Gaps = 70/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TTA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  STTATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 297 CLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 351

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 352 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRS 411

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R      +D + N    Y   +  +  + AG                            
Sbjct: 412 PR------VDILHNIDPIYT--KAKNGSWAAGY--------------------------- 436

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
              GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 437 ---GIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERI 483

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P 
Sbjct: 484 TLSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPR 543

Query: 556 RWNNIWVPW 564
               +W PW
Sbjct: 544 LNGGVWGPW 552


>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
          Length = 600

 Score =  322 bits (825), Expect = 3e-85,   Method: Compositional matrix adjust.
 Identities = 201/548 (36%), Positives = 288/548 (52%), Gaps = 68/548 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TTA  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           R      +D + N    Y   +  +  + AG                             
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIS 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPW 564
              +W PW
Sbjct: 547 NGGVWGPW 554


>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
          Length = 503

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 177/404 (43%), Positives = 244/404 (60%), Gaps = 28/404 (6%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A K+P+I+ ILADD G++DV +HGS+ I TPNID LA+ G+ L  +YVQ +CTP+RS L+
Sbjct: 27  ASKQPNILFILADDYGYHDVGYHGSA-IRTPNIDRLAFEGVRLENYYVQPICTPTRSQLL 85

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +G+Y IH G+QH +I   +P  LP     L   L+E GYA H +GKWHLGF++E Y PT 
Sbjct: 86  SGRYQIHTGLQHSIIWAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTN 145

Query: 146 RGFDSHYGYWQGLQDYYDHS-CKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           RGFDS YGY  G + YY+ + C A        GLD R N +  N++   YST LY E   
Sbjct: 146 RGFDSFYGYLTGSEFYYNKTYCLAQINRSDVCGLDFRENDRSINES--EYSTHLYAERTK 203

Query: 203 NVIAEHNKS---KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            ++A+H  +   +P+FLYLA  +VH      P + P +    +  I D  R+ YAGMVS 
Sbjct: 204 QLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHIKDENRQIYAGMVSC 258

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +DE+V NV  A +K+G+ +N+I++F  DNG    G  +  G+N PLRG K + WDGGMRG
Sbjct: 259 MDEAVKNVTDAFKKYGLWDNTILVFSTDNG----GQVAEGGNNWPLRGWKGSLWDGGMRG 314

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
           V  + S  LKQ   V+ +L H+SDW PTL   AG   +   LDG +QW  ++ GA+  R+
Sbjct: 315 VGFVHSTLLKQKGAVAHQLMHVSDWFPTLLNQAGGNASSVKLDGFDQWAAISDGAEGPRT 374

Query: 380 EILHNIDNVDNPQ----------KYYAALRVDDLKYVAGTDNNG 413
           EILHNID ++  +             AALR++  K + G   NG
Sbjct: 375 EILHNIDPLNGKRGKRTEEVFDNTVRAALRMNKWKLLTGEPGNG 418



 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E +N+A    +++K +  KLA Y +T VPP   P+D R+ P     +W PW
Sbjct: 443 LFDIEADPYEISNVAAENPEVVKTMLLKLAAYNATAVPPFYPPYDPRSYPENHGGVWGPW 502


>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
           melanoleuca]
          Length = 573

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 197/553 (35%), Positives = 286/553 (51%), Gaps = 90/553 (16%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG
Sbjct: 46  QPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTG 104

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RG
Sbjct: 105 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 164

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +   +++A
Sbjct: 165 FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILA 222

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 223 SHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRN 277

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 278 ITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 333

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            LK+ ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNI
Sbjct: 334 LLKRKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNI 393

Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
           D + N  ++               AA+RV + K + G       D  YGD       + P
Sbjct: 394 DPLYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 439

Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
            + L +  G  +N                              E++   R+         
Sbjct: 440 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 463

Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
                        LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + R
Sbjct: 464 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 511

Query: 552 ADPARWNNIWVPW 564
           A P      W PW
Sbjct: 512 AHPDFNGGAWGPW 524


>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
          Length = 600

 Score =  322 bits (825), Expect = 4e-85,   Method: Compositional matrix adjust.
 Identities = 201/549 (36%), Positives = 288/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +T A  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDRLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 188 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIVYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           R      +D + N    Y   +  +  + AG                             
Sbjct: 415 R------VDILHNIDPIYT--KAKNGSWAAGY---------------------------- 438

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 439 --GIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERIT 486

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P  
Sbjct: 487 LSTGKSVWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRL 546

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 547 NGGVWGPWY 555


>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 311

 Score =  322 bits (824), Expect = 5e-85,   Method: Compositional matrix adjust.
 Identities = 161/316 (50%), Positives = 208/316 (65%), Gaps = 8/316 (2%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GW+D S HGSSQIPTPN+DA+A +G+ILNQHYVQ LCTPSR+ALMTG+YP H+GMQH VI
Sbjct: 1   GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EPW LPL   L+P+Y +  GY TH +GKWHLG++   Y P  RGFD+  G++    D
Sbjct: 61  KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120

Query: 161 YYDHSCKATFEPYQGLDMR---HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLY 217
           YY+ +   T   + G D R    N   + K    Y+T  YT   + +I  HNKS PMFL+
Sbjct: 121 YYNQNF--TGNNHTGHDFRCGDQNYWAEEKE---YATYYYTNKTVEIIRRHNKSAPMFLF 175

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
           L+H A H        Q P   V  F  I +  R  +AGMV  LD+SVG V+  L+K GML
Sbjct: 176 LSHQAPHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGML 235

Query: 278 ENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
            ++I++F +DNGA  +G  SN+G N PLRG K T ++GG+RG A +WS  LK T++VS++
Sbjct: 236 NDTIIVFCSDNGALPWGFKSNRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTKRVSNQ 295

Query: 338 LFHISDWLPTLCAAAG 353
           L HISDWLPTL +AAG
Sbjct: 296 LMHISDWLPTLYSAAG 311


>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
          Length = 545

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 195/553 (35%), Positives = 294/553 (53%), Gaps = 50/553 (9%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PHI+ I+ADDLGWNDV +H +  I TPN+D +A  G+ILN  YV  +CTPSR++ +
Sbjct: 24  AKRPPHILYIVADDLGWNDVGWH-NPDIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFL 82

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG YP  +G+    I   +   + L    LP+ LK+ GY+TH IGKWHLGF  E YTPT 
Sbjct: 83  TGVYPFRVGLSGTAITPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTR 142

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGFDS  G++ G QDYY H+        +G D R N  V       YST  Y + A+++I
Sbjct: 143 RGFDSFLGFYTGTQDYYKHTTA------KGYDFRFNQTVFYPPKKQYSTKTYAKRAVDII 196

Query: 206 AEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
            EH + K P+FLYLA  +VH      P Q P +   ++ +I + +RR Y+GMV+ +D+++
Sbjct: 197 TEHKRKKNPLFLYLAFQSVHT-----PLQVPKKYEKQYNNIKNKDRRVYSGMVTAMDDAI 251

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+AA RKH  + N IV+F  DNG  +  +    G+N PLRG K+T W+GG R V+ ++
Sbjct: 252 GAVMAAFRKHKFMGNLIVVFTTDNGGAAHIV----GNNFPLRGSKTTVWEGGTRAVSFVY 307

Query: 325 SP-WLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
           S   LK+T      LFH +DW PT+ A AG +   +  LDGVNQW ++  G  ++R+E +
Sbjct: 308 SKNLLKKTGYTHDGLFHATDWFPTILAVAGGKPKQEKQLDGVNQWPMIRSGKPSRRTEFV 367

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT----------DNEIDKYSPK 432
           + ID      +  +A+R+   K + G  N G  + WY             +NE D     
Sbjct: 368 YEIDK----SRPVSAIRMGKYKLIVG--NPGDYNNWYRIAKSNSFCPIYDENETDSDEDI 421

Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
            +  S   I  NA   K  +K   +  P              E+  +L++  + +  +  
Sbjct: 422 FIYESLNNIVRNA---KGYVKWLSSKLPGYRMVRKWHRYRQLERCRKLKKRDKAKWRFP- 477

Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY--KSTMVPPGNKPFDK 550
                    +  L+++  DP E+ N+A  +  L+++++ +L  Y  K+  V   +K    
Sbjct: 478 ---------RYLLYDLEADPSERKNIAGYKPVLVRKMKARLEHYQKKTKHVAGQSKRKVA 528

Query: 551 RADPARWNNIWVP 563
            ADP   + +W P
Sbjct: 529 AADPKHHHGVWTP 541


>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
          Length = 574

 Score =  321 bits (823), Expect = 6e-85,   Method: Compositional matrix adjust.
 Identities = 196/540 (36%), Positives = 285/540 (52%), Gaps = 63/540 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG
Sbjct: 48  RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++   PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +    ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILA 224

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+F+Y+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335

Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            +K+ ++ S  L HI+DW PTL + A G   N   LDG N W  +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
           D + N  K Y +L      +      + +  EW   T D     + P + L +  G  +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
             +                 +D LR  +                                
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L+E + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDIVRALLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527


>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
          Length = 573

 Score =  321 bits (822), Expect = 7e-85,   Method: Compositional matrix adjust.
 Identities = 197/548 (35%), Positives = 289/548 (52%), Gaps = 63/548 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG
Sbjct: 48  RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++   PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +    ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILA 224

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+F+Y+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335

Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            +K+ ++ S  L HI+DW PTL + A G   N   LDG N W  +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
           D + N  K Y +L      +      + +  EW   T D     + P + L +  G  +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
             +                 +D LR  +                                
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++ + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 468 LFNITADPYERYDLSDQRPDVVRTLLMRLVHYNQTAIPVRYPAENPRAHPDFNGGAWGPW 527

Query: 565 YDELDKQK 572
             E D ++
Sbjct: 528 ASEDDAEE 535


>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
           caballus]
          Length = 523

 Score =  321 bits (822), Expect = 8e-85,   Method: Compositional matrix adjust.
 Identities = 173/369 (46%), Positives = 228/369 (61%), Gaps = 19/369 (5%)

Query: 26  APKKP--HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           AP +P   ++ +L DD+G +DV  H  S+  TP +DALA  GL+L+ +Y Q LC PSRS 
Sbjct: 29  APGRPAAKLVYMLTDDMGXHDVGLH-ESRFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQ 87

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G+Y IH G+QH +I   +P  LPL EKLLPQ LKEAGY TH +GKWHLG +R+   P
Sbjct: 88  LLSGRYQIHTGLQHQIIWPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 147

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATF-----EPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
           T RGFD+++GY  G +DYY H  + TF          LD R   +V      +YST ++T
Sbjct: 148 TRRGFDTYFGYLLGSEDYYSHE-RCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFT 206

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           E A  +I  H   KP+FLYLA  +VH     +P Q P+E +  +  I D  R  YAGMVS
Sbjct: 207 ERATALITNHPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVS 261

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE+VGNV AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+R
Sbjct: 262 LLDEAVGNVTAALKSRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVR 317

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTK 377
           GV  + SP LKQ    + EL HISDWLPTL   AG   N T  LDG N W  +++G+ + 
Sbjct: 318 GVGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGSTNGTKPLDGFNVWKTISEGSPSP 377

Query: 378 RSEILHNID 386
           R E+LHNID
Sbjct: 378 RVELLHNID 386


>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
          Length = 573

 Score =  321 bits (822), Expect = 9e-85,   Method: Compositional matrix adjust.
 Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 49  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 107

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 108 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 167

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY +   +++A H+
Sbjct: 168 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 225

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+ +
Sbjct: 226 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 280

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 281 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 336

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 337 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 396

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 397 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 442

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 443 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 463

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + RA P
Sbjct: 464 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 514

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 515 DFNGGAWGPW 524


>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
          Length = 574

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 196/540 (36%), Positives = 285/540 (52%), Gaps = 63/540 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           + PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG
Sbjct: 48  RPPHIIFILTDDQGYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITG 106

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++   PT RG
Sbjct: 107 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 166

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +    ++A
Sbjct: 167 FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILA 224

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+F+Y+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 225 SHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKN 279

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 280 ITWALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSP 335

Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            +K+ ++ S  L HI+DW PTL + A G   N   LDG N W  +++G ++ R+EILHNI
Sbjct: 336 LIKRKRRTSWALVHITDWYPTLVSLARGNLSNVPGLDGYNVWPAISEGKESPRTEILHNI 395

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT-DNEIDKYSPKEVLYSKAGITFN 444
           D + N  K Y +L      +      + +  EW   T D     + P + L +  G  +N
Sbjct: 396 DPLYNHAK-YGSLEDGFGIWNTAVQASIRVGEWKLLTGDPGYSDWIPPQTLTNFPGSWWN 454

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
             +                 +D LR  +                                
Sbjct: 455 LERL----------------TDGLRKSV-------------------------------W 467

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L+E + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDVVRALLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527


>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
          Length = 555

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 199/551 (36%), Positives = 287/551 (52%), Gaps = 68/551 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWYDE 567
              +W PWY E
Sbjct: 545 NGGVWGPWYKE 555


>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
 gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
          Length = 575

 Score =  320 bits (821), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 196/550 (35%), Positives = 285/550 (51%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 51  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 109

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 110 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 169

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY +   +++A H+
Sbjct: 170 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHS 227

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+ +
Sbjct: 228 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITS 282

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 283 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 338

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 339 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 398

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 399 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 444

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 445 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 465

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + RA P
Sbjct: 466 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 516

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 517 DFNGGAWGPW 526


>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
          Length = 564

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 164/360 (45%), Positives = 229/360 (63%), Gaps = 13/360 (3%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ ILADD G+ D+ +HGS +I TP +D LA  G+ L  +YVQ +C+PSRS  +TGK
Sbjct: 46  QPHIVFILADDQGYRDIGYHGS-EIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGK 104

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LK+AGY TH +GKWHLGF+++   PT RGF
Sbjct: 105 YQIHTGLQHSIIRPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGF 164

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
           DS +G   G  DYY+H  K       G D+  N     ++  GIYST++YT+  + +++ 
Sbjct: 165 DSFFGSLLGSGDYYNH-YKCDSPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSS 223

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           HN +KP+FLY+A+ AVH+     P QAP   +  +  I +  RR YA M+S LD++V NV
Sbjct: 224 HNPNKPLFLYIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNV 278

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
             AL+K+G  +NSI+++ +DNG          G+N PLRG K T W+GG+R V  + SP 
Sbjct: 279 TTALKKYGFYDNSIIIYSSDNGGQPMA----GGNNWPLRGSKGTYWEGGIRAVGFVHSPI 334

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           LK    +  EL HI+DW PTL   AG EI+ +  LDG + W+ +++G ++ R EILHNID
Sbjct: 335 LKMKGYICKELVHITDWFPTLVTLAGGEIDEEIQLDGYDIWETISEGKRSPRQEILHNID 394



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 41/68 (60%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++   D++KQL  +LA +  T VP    P D R++P     +W PW
Sbjct: 466 LFNITADPYERIDLSDRYPDIVKQLLRRLAQFNKTAVPVRYPPRDPRSNPKLNGGVWGPW 525

Query: 565 YDELDKQK 572
           Y +  K+K
Sbjct: 526 YKDHSKRK 533


>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 524

 Score =  320 bits (820), Expect = 1e-84,   Method: Compositional matrix adjust.
 Identities = 215/579 (37%), Positives = 300/579 (51%), Gaps = 86/579 (14%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ---IPTPNIDALAYN 64
           F  + C  L      ++ A + P+++ ILADD G+NDV +HG S    I TPN+D LA  
Sbjct: 9   FVVIVCLTLLTSG--SSYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGE 66

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           G+ L  +YVQ +C+P+RS LM+G+Y IH G+QHGVI   +P  LPL E  LPQ LKE GY
Sbjct: 67  GVKLENYYVQPICSPTRSQLMSGRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGY 126

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQ 183
           AT+ +GKWH+GF+ +   PT RGFDS++ +    +D++  SC                  
Sbjct: 127 ATNMVGKWHIGFYLDACLPTERGFDSYFAW----EDHF--SCLPXXXXXXXXXXXXXXXX 180

Query: 184 VDNKTI-----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
             NKT      G YST L+T   I+VI  H+K+KP+F+YLA+ AVH      P + PD  
Sbjct: 181 XANKTPVFQYEGQYSTHLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSY 235

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
           +  +++I+D  RRTYAGMVS +DE VGNV  AL++ G+ +N+I++F  DNG P   I S 
Sbjct: 236 MDPYMNITDKNRRTYAGMVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGP---IQSG 292

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIE 355
             +N PLRG K + W+GG+ GV  + SP L  + K  V+ EL H+SDWLPT+ A  AG  
Sbjct: 293 -ANNWPLRGSKGSLWEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTIVAGIAGGA 351

Query: 356 INDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
           +N T  LDG N W  ++  A + R EILHNID++      Y A  +    +      +G+
Sbjct: 352 LNGTKPLDGYNVWSSISGTASSPRKEILHNIDSL------YIAPEILPSYF------DGE 399

Query: 415 SDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQ-KHAADPKANSSD------- 466
              W        DK+               +++  L++   K    P A++SD       
Sbjct: 400 DFLWRFLARVLEDKFD-------------TSIRAALRVGDWKIKTGPDASASDVWQPPAE 446

Query: 467 -ALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
             L TI   EK                         K  LFNIT DP E+ +L+    D+
Sbjct: 447 SGLHTIYPIEK----------------------PNQKVWLFNITADPLERTDLSGKYPDV 484

Query: 526 LKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           + +   +L  Y    VP      D  ADPA     W PW
Sbjct: 485 VDEFLQRLEEYYKGSVPVRYPDPDLEADPALHGGSWGPW 523


>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
          Length = 572

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 199/550 (36%), Positives = 287/550 (52%), Gaps = 70/550 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 41  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 99

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 100 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 159

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 160 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 215

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 216 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 270

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+++G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+
Sbjct: 271 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGI 325

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++
Sbjct: 326 RAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRS 385

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
                                 RVD L  +       ++  W                  
Sbjct: 386 P---------------------RVDILHNIDPIYTKAKNGSWA----------------- 407

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGA 495
           +  GI   A+++ ++++           SD +            + F+ +  N + N+  
Sbjct: 408 AGYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERI 457

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
             ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P 
Sbjct: 458 TLSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPR 517

Query: 556 RWNNIWVPWY 565
               +W PWY
Sbjct: 518 LNGGVWGPWY 527


>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  320 bits (819), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 185/454 (40%), Positives = 263/454 (57%), Gaps = 47/454 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
            +    +CTL+         +  +PHII ILADDLGW+DV +HGS  I TP+IDALA  G
Sbjct: 7   SFLLVASCTLV---------SSSRPHIIFILADDLGWSDVGYHGSV-IKTPHIDALASEG 56

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           + L+ +Y   LCTPSRS LMTG+Y IH G+QH  I   +P  LP+ E +LPQ LK+ GYA
Sbjct: 57  VKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTIDMMQPLCLPIDETILPQKLKDRGYA 116

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH-NMQV 184
           TH +GKWHLGF+R+   P  RGFD+  G++Q + DYY H+       + G D R  N  +
Sbjct: 117 THMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGDYYYHNVSTG--KFNGWDFRRDNDVI 174

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--EVAKF 242
             +  G YST ++ + A ++I++HN   P+FL+L+  A+H      P Q P    ++   
Sbjct: 175 AERYAGQYSTHVFADEARDIISKHNPDVPLFLFLSFQAIHF-----PLQVPSRYADIYNT 229

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
           L  +  +RRTYAGMV+ +DE+VG V+A L+  G+ ENS+++F  DNG    GIHS  G N
Sbjct: 230 LIPNSADRRTYAGMVTCMDEAVGGVVATLKHSGLWENSVLVFSTDNG----GIHS-LGGN 284

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI-NDT 359
            PLRG K++ ++GG+RGVA + SP L    +  ++ EL H++DW PTL   AG  +  + 
Sbjct: 285 WPLRGGKASLYEGGVRGVAFVTSPLLPSRVRGTINKELLHMTDWFPTLVRLAGGHLMGNK 344

Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDN--------VDNPQK--------YYAALRVDDL 403
            LDG NQWD +++   + R+EIL  I+         VD+P            AA+RV D 
Sbjct: 345 RLDGYNQWDTISQETPSLRNEILLTINPLKRKPNHFVDDPYADTDSFDILMRAAIRVGDW 404

Query: 404 KYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
           K + G   NG    W    +   +   P+E  +S
Sbjct: 405 KLITGPTGNG---SWVTPAEFGFNSLHPEERTHS 435



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/61 (36%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD-KRADPARWNNIWVP 563
           L+N+  DP E+++ +  + D++  L D++  Y  T VP  N P +  RADPA     W P
Sbjct: 438 LYNVIADPNERHDYSRDRPDIVGALLDRIGHYYKTSVPVLNPPQEVTRADPALHGGAWSP 497

Query: 564 W 564
           W
Sbjct: 498 W 498


>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
 gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
 gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
          Length = 597

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAVTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
          Length = 570

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 196/550 (35%), Positives = 284/550 (51%), Gaps = 90/550 (16%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 46  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 104

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 105 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 164

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY +   +++A H+
Sbjct: 165 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHS 222

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 223 PRRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 277

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 278 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 333

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 334 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 393

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 394 YNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 439

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 440 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 460

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + RA P
Sbjct: 461 ---------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPRAHP 511

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 512 DFNGGAWGPW 521


>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
 gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
 gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
 gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
          Length = 598

 Score =  319 bits (818), Expect = 2e-84,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
 gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
          Length = 470

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 173/400 (43%), Positives = 238/400 (59%), Gaps = 30/400 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II ILADD GW+D+ +H +  I TPN+D LA  G+ L  +YVQ +C+PSR  LMTG+Y
Sbjct: 1   PNIIFILADDYGWDDIGYH-NHFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QHGVI    P GLPL E  LPQ LK+ GY T+ +GKWHLGF ++ YTP +RGFD
Sbjct: 60  QIHYGLQHGVIRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFD 119

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             YG+  G +DY+ H     ++  +GLD+R   +      G YST L+   A ++I +H+
Sbjct: 120 KFYGFLTGSEDYWTHR---RYKGVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHD 176

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           +++PMFLYL   AVH      P Q P++ + +++ I+    R YA  V+ +DE+VGNV  
Sbjct: 177 QNQPMFLYLPFQAVHG-----PLQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTN 231

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+  G+ EN++++F  DNGA       + GSN PLRG K+T W+GG+RGV  + S  LK
Sbjct: 232 ALKNSGLWENTVLIFSTDNGA-----RRHTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLK 286

Query: 330 QTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           +  K S  L HISDW PTL   + G       LDG + WD ++KG  + R E+LHNID  
Sbjct: 287 RKGKTSDALIHISDWFPTLLRISDGSTTGTKPLDGFDVWDAISKGKTSPRKELLHNIDPF 346

Query: 389 DNPQ---------------KYYAALRVDDLKYVAGTDNNG 413
            + Q                 Y+ALR  D K + G    G
Sbjct: 347 YDDQTGSYSPLSYDNIFNTSSYSALRSGDWKLLTGYQGAG 386


>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
          Length = 614

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
          Length = 571

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 197/549 (35%), Positives = 285/549 (51%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            +T   +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 41  GSTGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 99

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 100 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 159

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 160 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 215

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 216 QRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 270

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG          GSN PLRG K T W+GG+R
Sbjct: 271 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIR 326

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 327 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 386

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 387 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 408

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 409 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 458

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 459 LSTGKSIWLFNITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 518

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 519 NGGVWGPWY 527


>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 190/458 (41%), Positives = 259/458 (56%), Gaps = 45/458 (9%)

Query: 20  AFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCT 78
            F  +  P++P HI+ ILADD G+ND+ +   + + TPN+D LA  G+ L+ +YVQ +CT
Sbjct: 48  GFQPSRNPRRPPHIVFILADDYGFNDIGYRNPA-MRTPNLDYLAAEGIKLDNYYVQPICT 106

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           PSR+ LM+GKY IH G+QH +I   +P  LPL    LPQ LKEAGYATH  GKWHLGF++
Sbjct: 107 PSRAQLMSGKYQIHTGLQHSIIWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYK 166

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPY------QGLDMRHNMQVDNKTIGIY 192
           +   PT RGFDS  G   G  D++ H+ +    PY      +GLD R  +Q  N   GIY
Sbjct: 167 KECWPTNRGFDSFLGILLGKGDHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIY 226

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERR 251
           ST +  E   N+I +H+K KP+FLY++  AVH      P Q P+  +  F   I D +RR
Sbjct: 227 STHVIAERVENIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRR 281

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            YAGM   +DE+VGN+   L+K G+ ++++++F +DNG    G      SN PLRG K+T
Sbjct: 282 IYAGMTYCMDEAVGNITKKLKKQGLWDDTVLVFSSDNG----GNIDQGASNWPLRGSKTT 337

Query: 312 PWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWD 368
            W+GG+R V  + SP L +  K  VS EL  ISDW PTL    AG  ++ T LDG N W+
Sbjct: 338 LWEGGVRAVGFVTSPLLSERMKGTVSRELIDISDWYPTLIEGVAGWTLSGTKLDGYNIWE 397

Query: 369 VLTKGAKTKRSEILHNIDNVDNP-----------------QKYYAALRVDDLKYVAG--T 409
            L  G  + R E+LHNID +  P                  + YAALR  D K V G  +
Sbjct: 398 TLRSGKPSARVELLHNIDPLITPPSTWPNESIAAAHNSFSTRTYAALRYKDWKIVTGYXS 457

Query: 410 DNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALK 447
            NNG    WY   ++     +  E+L  K+   FN  +
Sbjct: 458 INNG----WYSPAESSKQSVA-SEILPGKSVWLFNITR 490



 Score = 38.5 bits (88), Expect = 9.1,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E ++L+  +  ++  L ++L  Y+S   P      D +A+PA    IW  W
Sbjct: 485 LFNITRDPREFHDLSNQEPAIVNFLLERLESYQSGASPVLYPDIDTKANPAGHGGIWDSW 544


>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
          Length = 570

 Score =  319 bits (817), Expect = 3e-84,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            T    +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GTAGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGVCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD +            + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
          Length = 597

 Score =  318 bits (816), Expect = 4e-84,   Method: Compositional matrix adjust.
 Identities = 198/549 (36%), Positives = 286/549 (52%), Gaps = 68/549 (12%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            +T   +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 67  GSTGTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 125

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   
Sbjct: 126 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECM 185

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT
Sbjct: 186 PTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYT 241

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A H+ +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 242 QRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLS 296

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+++G   NSI+++ +DNG    G     GSN PLRG K T W+GG+R
Sbjct: 297 CLDEAIHNVTLALKRYGFYNNSIIIYSSDNG----GQPMAGGSNWPLRGSKGTYWEGGIR 352

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 353 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 412

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
                                RVD L  +       ++  W                  +
Sbjct: 413 ---------------------RVDILHNIDPIYTKAKNGSWA-----------------A 434

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAH 496
             GI   A+++ ++++           SD           +  + F+ +  N + N+   
Sbjct: 435 GYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLGPNRWHNERIT 484

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
            ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P  
Sbjct: 485 LSTGKSIWLFNITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRL 544

Query: 557 WNNIWVPWY 565
              +W PWY
Sbjct: 545 NGGVWGPWY 553


>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
          Length = 476

 Score =  318 bits (815), Expect = 5e-84,   Method: Compositional matrix adjust.
 Identities = 175/400 (43%), Positives = 235/400 (58%), Gaps = 28/400 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K P+I+ I++DD GW+D+ +HGS +I TP +D LAYNG+ L  +YVQ +C+P+RS  M+G
Sbjct: 6   KPPNILFIMSDDYGWHDIGYHGS-KIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSG 64

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            Y IH G+QH VI   +  GLPL    +   ++EAGYATH  GKWHLG+++E Y P  RG
Sbjct: 65  VYQIHTGLQHNVIWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRG 124

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD++YGY  G +DYYD   K+   PY G D R N  +    +  YST LY      ++  
Sbjct: 125 FDTYYGYLNGCEDYYD---KSYCHPYCGYDFRLNDDI-QWNLTDYSTYLYVSRVNEILLN 180

Query: 208 H---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           H   +  KP+FLYL   +VH     EP + P E   K+  I D  RRTYAGMV+ +DE+V
Sbjct: 181 HKIYSPDKPLFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAV 235

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G +    +K+ M + +++ F  DNG     IH   G+N PLRG K + W+GGMRGV  + 
Sbjct: 236 GKIRDLFKKYEMWDETVLAFSTDNGGQ---IHRG-GNNWPLRGWKISLWEGGMRGVGFVH 291

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
           SP L  +   S  L H+SDWLPTL   AG       LDG N WD+LTKG ++ R EILHN
Sbjct: 292 SPLLPNSGGKSKGLMHVSDWLPTLVHVAGGNTTGMKLDGYNVWDMLTKGTESPRKEILHN 351

Query: 385 IDNVDNPQ-----------KYYAALRVDDLKYVAGTDNNG 413
           ID +  P+           +  AA+R  D K + G   NG
Sbjct: 352 IDPLMEPKGHARPNSTFDTRVRAAIRYGDWKLITGDPGNG 391



 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFN+  DP E N+L+     ++ QL ++L  Y +T VP    P +  ++PA     + PW
Sbjct: 416 LFNVDADPNETNDLSAVYPQIVDQLLERLCQYNATSVPVRFPPLEPESNPAFHGGAFGPW 475


>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  318 bits (814), Expect = 7e-84,   Method: Compositional matrix adjust.
 Identities = 168/388 (43%), Positives = 236/388 (60%), Gaps = 18/388 (4%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           FL   + K PHI+ I+ADD GWNDV F  +  + TPNID LA++G++LN  YV  +CTPS
Sbjct: 16  FLPYVSAKPPHILFIVADDYGWNDVGFR-NPDVLTPNIDKLAHSGMVLNSSYVMPVCTPS 74

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R++ MTG Y    G+QH  IL  +    PL    LPQ LKE GYATHAIGKWHLGF +  
Sbjct: 75  RNSFMTGHYAFKSGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWE 134

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            TPT+RGFD+ +G++ G +DYY  S         G D R N    N T G YST LY   
Sbjct: 135 CTPTYRGFDTFFGFYNGQEDYYTLSVAG------GKDFRDNRTPVNAT-GEYSTFLYARR 187

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A ++I EH+ SKPM++YL   +VH     EP Q P +    +  +    RRTY GMV+ +
Sbjct: 188 AESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTYLGMVTAM 242

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE++GN+  AL+  GM  +++++F ADNG    G  +  G+N+PLRG K+T ++GG R  
Sbjct: 243 DEAIGNITKALKTKGMFNDTLIIFTADNG----GWITFAGNNYPLRGGKATVFEGGTRAT 298

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
           A +    ++++  V   + H  DW+PT+  AAG    D +LDG++QWD L    ++KR+E
Sbjct: 299 AFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAG-GTPDPTLDGIDQWDSLRTAGESKRTE 357

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAG 408
            ++NID++      +AA+R+ D K V G
Sbjct: 358 FIYNIDDLRPEICGHAAIRMGDYKLVDG 385



 Score = 42.7 bits (99), Expect = 0.52,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 31/62 (50%)

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           K  LFN+ DDP E  +L++   D++K+L+ +L  Y   +VP         + P  +   W
Sbjct: 418 KVMLFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSAPTNYGGFW 477

Query: 562 VP 563
            P
Sbjct: 478 TP 479


>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
          Length = 515

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 173/376 (46%), Positives = 229/376 (60%), Gaps = 19/376 (5%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           +TT+  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS
Sbjct: 69  STTSTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRS 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
             +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF R+   
Sbjct: 128 QFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECM 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT
Sbjct: 188 PTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDYDNGIYSTQMYT 243

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN +KP+FLY A+ AVH+     P QAP      +  I +  RR YA M+S
Sbjct: 244 QRVQQILASHNPTKPIFLYTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLS 298

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R
Sbjct: 299 CLDEAINNVTLALKTYGFYNNSIIIYSSDNG----GQPTAGGSNWPLRGSKGTYWEGGIR 354

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
            V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ 
Sbjct: 355 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSP 414

Query: 378 RSEILHNIDNVDNPQK 393
           R +ILHNID     QK
Sbjct: 415 RVDILHNIDPYTPRQK 430


>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
          Length = 597

 Score =  317 bits (811), Expect = 1e-83,   Method: Compositional matrix adjust.
 Identities = 198/554 (35%), Positives = 282/554 (50%), Gaps = 95/554 (17%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 75  PQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 133

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+E GY+TH +GKWHLGF+R+   PT R
Sbjct: 134 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRR 193

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY +   +++
Sbjct: 194 GFDTFLGSLTGNVDYYTYDNCDGPG--VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHIL 251

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 252 ASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 306

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL++HG   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + S
Sbjct: 307 NITWALKRHGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHS 362

Query: 326 PWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ ++ S  L HI+DW PTL A A G       LDG + W  +++G  + R+EILHN
Sbjct: 363 PLLKRKRRTSRALVHITDWYPTLVALAGGTASAADGLDGYDVWPAISEGRASPRTEILHN 422

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
           ID + N  ++               AA+RV + K + G       D  YGD       + 
Sbjct: 423 IDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WI 468

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P + L +  G  +N                              E++   R+        
Sbjct: 469 PPQTLAAFPGSWWNL-----------------------------ERMASARQAV------ 493

Query: 491 DNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDK 550
                         LFNI+ DP E+ +LA  + D       +L  Y  T +P      + 
Sbjct: 494 -------------WLFNISADPYEREDLAGQRPDXA-----RLVDYNRTAIPVRYPAENP 535

Query: 551 RADPARWNNIWVPW 564
           RA P      W PW
Sbjct: 536 RAHPDFNGGAWGPW 549


>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  317 bits (811), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 171/409 (41%), Positives = 243/409 (59%), Gaps = 22/409 (5%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           +TC++  +  FL   + K PHI+ I+ADDLGWNDV F  +  + TPNID LA +G+ILN 
Sbjct: 8   ITCSIFVH--FLLCVSAKPPHILFIVADDLGWNDVGFR-NPDVLTPNIDKLARSGMILNS 64

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
            YV  +CTPSR++ MTG Y    G+QH  I   +    PL    LPQ LKE GYATHAIG
Sbjct: 65  SYVMPVCTPSRNSFMTGHYAFKSGLQHLAINPQQATCAPLNYTFLPQKLKELGYATHAIG 124

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLGF +   TPT+RGFD+ +G++ G +DYY  S         G D R N    N T G
Sbjct: 125 KWHLGFCKWECTPTYRGFDTFFGFYNGQEDYYTLSVAG------GKDFRDNKVPVNAT-G 177

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            YST LY+  A ++I EH+ SKP+++YL   +VHA     P Q P+     + ++    R
Sbjct: 178 EYSTFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVHTKSR 232

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           RTY GMV+ +DE++GN+  AL+  G+  +++++F  DNG    G     G+N+PLRG K+
Sbjct: 233 RTYMGMVTAMDEAIGNITKALKTKGIFNDTLIIFTTDNG----GAVKFAGNNYPLRGGKA 288

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           T ++GG R  A +    +++     S L H  DW+PT+ +AAG    D+ LDG++QWD L
Sbjct: 289 TLFEGGTRATAFVTGAGIQKFNYEYSGLIHAVDWMPTVLSAAG-GTADSKLDGIDQWDSL 347

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
               ++KR+E ++NID+       +AA+R+ D K + G    G    WY
Sbjct: 348 RTARESKRNEFIYNIDDFRPEIGGHAAIRIGDYKLIDGYP--GLHQGWY 394



 Score = 42.0 bits (97), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 19/62 (30%), Positives = 31/62 (50%)

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           K  LFN+ DDP E  +L+    D++K+L+ +L  Y + +VP         + P  +   W
Sbjct: 418 KMMLFNLQDDPNEHVDLSRKHPDIVKKLKSRLEYYHNQIVPANFPKNSASSAPTSYGGFW 477

Query: 562 VP 563
            P
Sbjct: 478 SP 479


>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
          Length = 572

 Score =  316 bits (810), Expect = 2e-83,   Method: Compositional matrix adjust.
 Identities = 171/403 (42%), Positives = 243/403 (60%), Gaps = 28/403 (6%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P++PHII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+T
Sbjct: 45  PQRPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLT 103

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT R
Sbjct: 104 GRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRR 163

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GFD+  G   G  DYY + +C        G D+     V     G YST LY + A +++
Sbjct: 164 GFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHIL 221

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
           A HN  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V 
Sbjct: 222 ASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVR 276

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  I S
Sbjct: 277 NITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFIHS 332

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK+ +  S  L HI+DW PTL   AG  ++    LDG + W  +++G  + R+EILHN
Sbjct: 333 PLLKRKRWTSHALVHITDWYPTLVGLAGGTVSAADGLDGYDVWPAVSEGRASPRTEILHN 392

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
           ID + N  ++               AA+RV + K + G   +G
Sbjct: 393 IDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGSG 435



 Score = 43.1 bits (100), Expect = 0.37,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P      W PW
Sbjct: 465 LFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHPDFNGGAWGPW 524


>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
          Length = 661

 Score =  316 bits (809), Expect = 3e-83,   Method: Compositional matrix adjust.
 Identities = 180/412 (43%), Positives = 241/412 (58%), Gaps = 35/412 (8%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           T A  +PH+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS 
Sbjct: 132 TAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 190

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            +TGKY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   P
Sbjct: 191 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 250

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYTE 199
           T RGFD+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT+
Sbjct: 251 TKRGFDTFFGSLLGSGDYYTH-YKCDSPGMCGYDLYENDNAAWDYDN---GIYSTQMYTQ 306

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
               ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S 
Sbjct: 307 RVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSC 361

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LDE++ NV  AL+ +G   NSI+++ +DNG    G  +  GSN PLRG K T W+GG+R 
Sbjct: 362 LDEAIHNVTLALKAYGFYNNSIIIYSSDNG----GQPAAGGSNWPLRGSKGTYWEGGIRA 417

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
           +  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ R
Sbjct: 418 IGFVHSPLLKNKGTVCRELVHITDWYPTLISLAEGQIDEDIQLDGYDIWETISEGLRSPR 477

Query: 379 SEILHNIDNVDNPQK--------------YYAALRVDDLKYVAGTDNNGQSD 416
            +ILHNID +    K                +A+RV   K + G  N G SD
Sbjct: 478 VDILHNIDPIYTKAKNGSLAAGYGIWNTAIQSAIRVQHWKLLTG--NPGYSD 527



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 35/61 (57%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P     +W PW
Sbjct: 557 LFNITADPYERVDLSHRYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWGPW 616

Query: 565 Y 565
           Y
Sbjct: 617 Y 617


>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
          Length = 684

 Score =  315 bits (807), Expect = 5e-83,   Method: Compositional matrix adjust.
 Identities = 172/392 (43%), Positives = 236/392 (60%), Gaps = 19/392 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHI+ I+ADDLGWNDV FH  + + TPNID LA  GLILNQ Y+Q LC+PSR ALMTG
Sbjct: 224 KSPHILFIVADDLGWNDVGFHNPA-MKTPNIDKLAREGLILNQTYLQPLCSPSRHALMTG 282

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP H G+QH VIL  +P   PL  K LPQ LK+ GYATH +GKWHLGF     TPT+RG
Sbjct: 283 YYPYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNCTPTYRG 342

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS +GY+    D+Y H+       Y  LD R N +      G YST  +   A ++I +
Sbjct: 343 FDSFFGYYNAQGDHYSHTW------YNYLDYRDNEKPVKNLNGTYSTFTFVSRAQDIIKK 396

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           HN S+P+FLY+A   VH     +P Q P +    + +I    RR ++GMVS LDE++GN+
Sbjct: 397 HNSSQPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVSALDEAIGNI 451

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
              LR+ GM ++++++F +DNG    G     G+N+PLRG K T ++GG R V+ +    
Sbjct: 452 TDQLRESGMYDDTLIIFTSDNG----GWPKYYGNNYPLRGGKFTVYEGGTRVVSFVHGAG 507

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           L++T +    + H  DW  TL  AAG +  D   DGV+QW  +       R+E+++N D+
Sbjct: 508 LQKTGEAYESIMHAVDWSDTLVEAAGGKP-DLMGDGVSQWSSIANNFPGPRTELVYNFDS 566

Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
              P++ +AA+RV D K + G    G   +WY
Sbjct: 567 DLAPEEGHAAIRVGDYKLIDGYP--GLYPDWY 596



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 41/61 (67%), Gaps = 3/61 (4%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF--DKRADPARWNNIWV 562
           LFN+ DDP E N+L++S+ D++ ++  +L +     VPP N P   D +++P++++++W 
Sbjct: 622 LFNLKDDPTEHNDLSKSRPDIVTEMSQRLRLLTQNAVPP-NYPLVPDPKSNPSKFDDVWS 680

Query: 563 P 563
           P
Sbjct: 681 P 681


>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
          Length = 564

 Score =  314 bits (805), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 198/542 (36%), Positives = 285/542 (52%), Gaps = 70/542 (12%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           H+I ILADD G+ DV +HGS +I TP +D LA  G+ L  +YVQ +CTPSRS  +TG+Y 
Sbjct: 44  HLIFILADDQGFRDVGYHGS-EIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQ 102

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH VI   +P  LPL    LPQ LKEAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 103 IHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDT 162

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINVIA 206
            +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    ++A
Sbjct: 163 FFGSLLGSGDYYTHY-KCDSPRMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQILA 218

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            HN  KP+FLY+A+ AVH+     P QAP     ++  I++  RR YA M++ LDE++ N
Sbjct: 219 SHNPRKPIFLYIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINN 273

Query: 267 VIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           V  AL+++G   NS++++ +DNG  P+ G     G+N PLRG K T W+GG+R V  + S
Sbjct: 274 VTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GNNWPLRGSKGTYWEGGIRAVGFVHS 328

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHN 384
           P LK    V  EL HI+DW PTL   A  +I+ DT LDG + W+ ++   +++RS     
Sbjct: 329 PLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTLLDGYDVWETIS---ESRRSP---- 381

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
                         R+D L  +       ++  W                  +  GI   
Sbjct: 382 --------------RMDILHNIDPIYTKAKNGSWA-----------------AGFGIWNT 410

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR-CNYDNKGAHCNSTVKP 503
           A+++ +++            SD           +  + F+ V    + N+     +    
Sbjct: 411 AIQSAIRVNHWKLLTGNPGYSD----------WVPPQSFSNVGPSRWHNERVSWTAGKTV 460

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
            LFNIT DP E+ +L+    D++KQL  +L+ +  T VP    P D R++P     +W P
Sbjct: 461 WLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPNFNGGVWGP 520

Query: 564 WY 565
           W+
Sbjct: 521 WF 522


>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 568

 Score =  312 bits (799), Expect = 3e-82,   Method: Compositional matrix adjust.
 Identities = 172/404 (42%), Positives = 240/404 (59%), Gaps = 29/404 (7%)

Query: 27  PKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           PK+P HII IL DD G+ND+ +H S +I +P +D LA  G+ L  +YVQ LCTPSRS L+
Sbjct: 40  PKQPPHIIFILTDDQGFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLI 98

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  LPL    LPQ L+E GY+TH +GKWHLGF+R+   PT 
Sbjct: 99  TGRYQIHTGLQHSIIRPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTR 158

Query: 146 RGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGF +++G   G  DYY + SC    +   G D+     V     G YST LYT+    +
Sbjct: 159 RGFHTYFGSLTGSVDYYTYGSCDG--KSLCGFDLHEGESVAWGRAGKYSTHLYTQRVRKI 216

Query: 205 IAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +A H+  S+P+F++L+  AVH      P + P E +  + +I +  RR YA MVS +DE+
Sbjct: 217 LATHDPTSQPLFIFLSLQAVHT-----PLKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEA 271

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           V N+  ALRK+G   NS+V+F  DNGA         GSN PLRG K T W+GG+RGV  +
Sbjct: 272 VRNITYALRKYGFYRNSVVIFSTDNGAQPL----TGGSNWPLRGCKGTYWEGGIRGVGFV 327

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
            SP ++  +++S +L HI+DW PTL   AG  ++    LDG + W  +++G ++ R EIL
Sbjct: 328 HSPLIRHRRRISRDLIHITDWYPTLVGLAGGNVSQHQGLDGFDVWPTISEGKESPRLEIL 387

Query: 383 HNIDNVDNPQK-------------YYAALRVDDLKYVAGTDNNG 413
           HNID ++   +               AA+RV D K + G   NG
Sbjct: 388 HNIDPLNRRSRGSLKDGYGLWDTTVQAAIRVGDWKLLTGDPGNG 431



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 43/68 (63%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFN+T DPCE+++LA  + D++++L  +LA +  T +P    P D RA+P+     W PW
Sbjct: 462 LFNVTGDPCERHDLAVHRPDVVRELLARLAFHNRTAIPVRYPPDDTRANPSANGGAWRPW 521

Query: 565 YDELDKQK 572
             E D+++
Sbjct: 522 VGEDDEEE 529


>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
          Length = 613

 Score =  311 bits (798), Expect = 4e-82,   Method: Compositional matrix adjust.
 Identities = 168/399 (42%), Positives = 240/399 (60%), Gaps = 28/399 (7%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 93  HIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 151

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 152 IHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 211

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY + A +++A H+
Sbjct: 212 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHS 269

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 270 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 324

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 325 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 380

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG  ++    LDG + W  +++G  + R+EILHNID +
Sbjct: 381 RKRRTSRALVHITDWYPTLVGLAGGVVSAVDGLDGYDVWPAISEGRASPRTEILHNIDPL 440

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNG 413
            N  ++               AA+RV + K + G   NG
Sbjct: 441 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGNG 479



 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P      W PW
Sbjct: 505 LFNISADPYERKDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENPRAHPDFNGGAWGPW 564


>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
 gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
          Length = 487

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 177/413 (42%), Positives = 244/413 (59%), Gaps = 30/413 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI+ I+ADDLGWNDV +H +  + TP +D LA+ G+ILNQ YV  +CTPSR+A MTG 
Sbjct: 28  KPHILFIVADDLGWNDVGWH-NPDVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGY 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP H G QH V L  +  G+P     LP+ LK+ GYATH +GKWHLGF    YTPT+RGF
Sbjct: 87  YPYHAGSQHLVFLPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGF 146

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DS YGY+   +DYY H          GLD+R + +V N   G Y T  +T+  +++I +H
Sbjct: 147 DSFYGYYNADEDYYTHVVAG------GLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKH 200

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
               P+FLYL    VH     EP + P+     ++++ +  RR   GMVS LDE+VGNV 
Sbjct: 201 PADTPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVT 255

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            A++  G+ +N++V+F  DNG    G     G+N+PLRG K T W+GG RGVA +    L
Sbjct: 256 MAMKNAGLWDNTLVIFTTDNG----GWIIASGNNYPLRGGKVTLWEGGTRGVAFVHGKML 311

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           ++T   ++E+ H  DW PT+ AAA     D ++DGV+Q D +  G  + R+E ++NID+V
Sbjct: 312 QKTGYTNNEMIHAVDWFPTILAAA-GGTADDAMDGVSQLDTIMSGQPSPRTEFVYNIDDV 370

Query: 389 DNPQKYYA-------ALRVDDLKYVAGTDNNGQSDEWYGD---TDNEIDKYSP 431
             P K  A        LRV D K + G    G+ D W      TD+++D   P
Sbjct: 371 -FPTKQGAIRIICTLCLRVGDYKLIEGV--AGKPDGWIHPDNLTDSQLDSGDP 420



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 27/59 (45%), Positives = 38/59 (64%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+ DDP E +NLA+S  D LK+++ KLA Y+ ++VP  N P D ++ P  W   W P
Sbjct: 426 LFNLKDDPTEHHNLADSMPDKLKEMQAKLAEYRKSLVPAINPPPDPKSFPKNWGGAWSP 484


>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
          Length = 473

 Score =  311 bits (798), Expect = 5e-82,   Method: Compositional matrix adjust.
 Identities = 164/344 (47%), Positives = 223/344 (64%), Gaps = 15/344 (4%)

Query: 48  HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG 107
           HGS  I TP++DALA  G++L+ +YVQ LCTPSRS L+TG+Y IH+G+QH +I+  +P  
Sbjct: 3   HGSV-IRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNC 61

Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK 167
           +PL EKLLPQ LK+AG +TH +GKWHLG +R+   PT RGFD+++GY  G +DYY H   
Sbjct: 62  VPLDEKLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEAC 121

Query: 168 ATFEPYQG----LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAV 223
           A  E   G    LD+R   +   +   IYST+++T+ A  +IA H   KP+FLYLA  +V
Sbjct: 122 APIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSV 181

Query: 224 HAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           H     +P Q P+E +  +  I D  RR YAGMVS LDE+VGNV  AL+  G+  N++++
Sbjct: 182 H-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLI 236

Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
           F  DNG    G   + G+N PLRG K T W+GG+RG   + SP LKQ    S EL HI+D
Sbjct: 237 FSTDNG----GQTRSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITD 292

Query: 344 WLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
           WLPTL   AG   + T  LDG + W+ +++G+ + R E+L NID
Sbjct: 293 WLPTLVNLAGGSTHGTKPLDGFDVWETISEGSPSPRVELLLNID 336



 Score = 40.8 bits (94), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 414 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 472


>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
           carolinensis]
          Length = 580

 Score =  311 bits (797), Expect = 6e-82,   Method: Compositional matrix adjust.
 Identities = 191/538 (35%), Positives = 282/538 (52%), Gaps = 65/538 (12%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           H+I IL DD G++D  +HGS  I TP +D LA  G+ L  +Y++ +CT SRS L+TG+Y 
Sbjct: 59  HLIFILTDDQGFHDXGYHGSD-IXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQ 117

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LP  +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RGFD+
Sbjct: 118 IHTGLQHSIIRPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDT 177

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V  +  G YST LY +    ++A HN
Sbjct: 178 FLGSLTGNVDYYTYDNCDG--PGVCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHN 235

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+F+Y+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 236 PKEPIFIYIAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITW 290

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+  + SP +K
Sbjct: 291 ALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIK 346

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNV 388
             +++S  L HI+DW PTL   A   +++   LDG N W  +++G ++ R+EILHNID +
Sbjct: 347 HKRRISRALIHITDWYPTLVTLAQGNVSEADDLDGYNVWPAISEGKESPRTEILHNIDPL 406

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT 448
            N  K                         YG              L S  GI   A++ 
Sbjct: 407 YNHAK-------------------------YGS-------------LESGFGIWNTAVQA 428

Query: 449 KLQIK--QKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
            +++   +    DP  +     +T+ T         F     N +        +V   LF
Sbjct: 429 SVRVGDWKLLTGDPGYSDWIPPQTLAT---------FPGSWWNLERMTGGMRKSV--WLF 477

Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           NIT DP E+ +L+E + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 478 NITADPYERYDLSEQRPDVVRALLLRLVRYNQTAIPVRYPAENPRAHPDFNGGAWGPW 535


>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
 gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
          Length = 494

 Score =  311 bits (796), Expect = 8e-82,   Method: Compositional matrix adjust.
 Identities = 158/360 (43%), Positives = 224/360 (62%), Gaps = 12/360 (3%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI+ I+ADDLGWNDV +H +  + TP +D LA+ G+ILNQ YV  +CTPSR+A MTG 
Sbjct: 22  KPHIVFIVADDLGWNDVGWH-NPDVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGY 80

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +P H+G QH V    +P  +      LP+ LK  GYATH +GKWHLGF    +TPTFRGF
Sbjct: 81  FPYHVGSQHLVFRPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGF 140

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DS+YGY+ G +DY+ H  ++      G+D   N  V     G YS  L+++ A +++ +H
Sbjct: 141 DSYYGYYSGAEDYFTH-FRSIRNGTGGIDFHDNKDVVTDQNGTYSAYLFSQRAADIVNKH 199

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           + + P+FLYL    VHA     P + P      + ++ D  RR   GMVS LDE+VGNV 
Sbjct: 200 DPNTPLFLYLPFQNVHA-----PLEVPKRFEDMYANVQDENRRKLLGMVSALDEAVGNVT 254

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            A++K G+ +N++++F  DNG    G+    G+N+PLRG K+T W+GG RGVA      L
Sbjct: 255 MAMKKTGLWDNTLLIFSTDNG----GMVIQSGNNYPLRGGKTTLWEGGTRGVAFAHGKML 310

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           ++T   ++E+ H  DW PTL AAAG    D  +DGV+Q D +  G  + R+E ++NID++
Sbjct: 311 QKTGYTNNEMIHAVDWFPTLLAAAG-GTADPGMDGVSQLDTILSGKPSPRTEFVYNIDDI 369



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/78 (39%), Positives = 44/78 (56%), Gaps = 3/78 (3%)

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           +  ++ N G   N T    LFN+ DDP E +NLA+S  D LK+++ KL  Y+ T+VP  N
Sbjct: 417 MTASHPNSGDPINGT---WLFNLKDDPTEHHNLADSMPDKLKEMQAKLDGYRKTLVPAIN 473

Query: 546 KPFDKRADPARWNNIWVP 563
              D ++DP  W   W P
Sbjct: 474 PKSDPKSDPKNWGGAWSP 491


>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
 gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
          Length = 540

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 182/425 (42%), Positives = 251/425 (59%), Gaps = 35/425 (8%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAYNGLILNQH 71
           C      + L+   P  PHI+ IL DDLGW+DV +H  S  + TPNID LA  G+ L  +
Sbjct: 20  CCQWPGCSLLSMAGP--PHIMFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSY 77

Query: 72  YVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           Y Q +CTPSR ALMTGKYPIH+GMQH VI    PWG+P     +PQ L+  GY T  IGK
Sbjct: 78  YSQPMCTPSRGALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGK 137

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLGFF   YTP  RGFDS  G++ G QD++ HS K  F     LD R + +  N+  G 
Sbjct: 138 WHLGFFDWDYTPLRRGFDSFLGFFAGEQDHWRHS-KMGF-----LDFRRDEEPANEYGGQ 191

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           +STD++T+ AIN+   HN S+P+FL L++ AVH      P QA   +V K   +SD +R+
Sbjct: 192 HSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQ 246

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKS 310
            Y GM+   D S+G +I   +++G+  N+++++ +DNGA P  G     G N PLRG KS
Sbjct: 247 NYLGMMGAADWSIGRLIDVYKRNGLWNNTLMIWASDNGAQPGKG----GGYNWPLRGYKS 302

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           + ++GG+R  A +    L++     ++LFH++DW PTL   AG E+ +  +DGV+QW  L
Sbjct: 303 SLFEGGVRVPAFVHGEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEV-EPDIDGVDQWPTL 361

Query: 371 TKGAKTKRSEILHNIDNVDNPQK---------YY--AALRVDDLKYVAGTDNNGQSDEWY 419
           ++G  +KR EILHNID   N ++         YY  AALR   +K V    + G    WY
Sbjct: 362 SEGKPSKREEILHNIDIPANQEEERMAPRGFNYYSGAALRRGHMKLVYKMGDAG----WY 417

Query: 420 GDTDN 424
              +N
Sbjct: 418 QLPEN 422


>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 673

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 168/413 (40%), Positives = 245/413 (59%), Gaps = 30/413 (7%)

Query: 17  FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL 76
           FND     +  + PHII I+ DD G+ND+ +H SS I TP +D LA  G+ L  +Y+Q +
Sbjct: 38  FND---RASGVQPPHIIFIMTDDQGFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPI 93

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
           CTPSRS  +TG+Y IH G+QH +I   +P  LP  +  LPQ L++ GY+TH +GKWHLGF
Sbjct: 94  CTPSRSQFITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGF 153

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           +++   PT RGFD+++G   G  +YY +S     E   G D+     V     G YST L
Sbjct: 154 YKKECLPTRRGFDTYFGSLTGSVNYYTYSSCDGPE-LCGFDLHEGESVAWDQGGKYSTHL 212

Query: 197 YTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           +T+    ++A H+ +S+P+F++L+  AVH+     P Q+P E +  +  + +  RR YA 
Sbjct: 213 FTQRVRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAA 267

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MVS +DE+V N+  ALRK+G  +NS+++F  DNG    G   + GSN PLRG K T W+G
Sbjct: 268 MVSTVDEAVRNITYALRKYGYYQNSVLIFSTDNG----GQPLSGGSNWPLRGRKGTYWEG 323

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGA 374
           G+RG+  + SP LK+ ++VS  L HI+DW PTL   A G E     LDG N W+ +++G 
Sbjct: 324 GIRGLGFVHSPLLKKRKRVSKALVHITDWYPTLVGLAGGNESMTDGLDGYNVWEAISEGK 383

Query: 375 KTKRSEILHNIDNVDNPQK--------------YYAALRVDDLKYVAGTDNNG 413
           ++ R EILHNID + N  +                A++R+ D K + G   +G
Sbjct: 384 ESPRLEILHNIDPLFNHARSGSLQQGFGIWNTAVQASIRMGDWKLLTGNPGHG 436



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 32/72 (44%), Positives = 43/72 (59%), Gaps = 2/72 (2%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI++DP E+ +L+E + D++KQL  +LA Y  T VP      D RADP      WVPW
Sbjct: 466 LFNISEDPYERFDLSEQRPDVVKQLLARLAYYNRTAVPVRFPSEDPRADPHLNGGAWVPW 525

Query: 565 YDELDKQKAIET 576
               DK + IE+
Sbjct: 526 V--ADKMEEIES 535


>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 507

 Score =  310 bits (794), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 168/401 (41%), Positives = 240/401 (59%), Gaps = 31/401 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I+ ILADDLGW+DV +H  S I TPNID LA  G+ L  +YV  LCTP+R+ LMTG+
Sbjct: 28  RPNIVFILADDLGWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGR 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH GMQHGV++  EP  LP  E L+PQ LKE+GY TH +GKWHLGF++   TP  RGF
Sbjct: 87  YQIHTGMQHGVLMAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGF 146

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAE 207
           D+ +G +    DY++H+           D+R   Q V  + +G YST ++ E A  +I +
Sbjct: 147 DTFFGMYLAGGDYFNHTRLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKK 206

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DISDPERRTYAGMVSRLDESVGN 266
           H+ + P+FLYL+  AVHA     P Q P+  +  +  DI D  RR YAGM + +DE++GN
Sbjct: 207 HDPTNPLFLYLSFQAVHA-----PLQVPERYINMYKDDIRDESRRIYAGMTTCMDEAIGN 261

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V   L + G+L+N+I++F +DNG    G      SN+PLRG K T ++GG+R  A ++SP
Sbjct: 262 VTRTLARRGLLDNTIIVFSSDNG----GAKLYGASNYPLRGQKGTLYEGGVRAPAFVYSP 317

Query: 327 WLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
            L    +   + EL ++ DW PT    AG  ++    LDGVNQW  ++    +KR+EI+H
Sbjct: 318 LLSNRVRGTATHELMYVGDWFPTFLHLAGGNLHGVKPLDGVNQWHTVSAAMPSKRTEIVH 377

Query: 384 NIDNV--------DNP--------QKYYAALRVDDLKYVAG 408
           NI+ +        D P         +  A++RV + K + G
Sbjct: 378 NIEPLRGKPPRVKDEPWLTNPYFDIRVMASIRVGEWKLITG 418


>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 570

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 170/405 (41%), Positives = 244/405 (60%), Gaps = 29/405 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
            P+ PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L  +Y+Q +CTPSRS  +
Sbjct: 44  VPQPPHIIFIMTDDQGFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFI 102

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y IH G+QH +I   +P  LP  +  LPQ L+E GY+TH +GKWHLGF+++   PT 
Sbjct: 103 TGRYQIHTGLQHSIIRPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTR 162

Query: 146 RGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFD+++G   G  +YY +  C        G D+     V     G YST LYT+    +
Sbjct: 163 RGFDTYFGSLTGSVNYYTYDGCDGA--GLCGFDLHEGESVAWGQSGKYSTHLYTQRVRKI 220

Query: 205 IAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +A H+ +S+P+F++L+  AVH      P Q PDE +  +L + +  RR YA MVS +DE+
Sbjct: 221 LATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSAVDEA 275

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           V N+  ALRK+G  +NS+++F  DNG    G   + GSN PLRG K T W+GG+RG+  +
Sbjct: 276 VRNITYALRKYGYYQNSVIIFSTDNG----GQPLSGGSNWPLRGRKGTYWEGGIRGLGFV 331

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEIL 382
            SP L++ ++VS  L HI+DW PTL   AG + + T  +DG N W+ +++G ++ R EIL
Sbjct: 332 HSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESLTEGVDGYNVWEAISEGKESPRLEIL 391

Query: 383 HNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNG 413
           HNID + N       QK Y        A++R  D K + G    G
Sbjct: 392 HNIDPLYNHARTGSLQKGYGIWNTAIQASIRAGDWKLLTGDPGYG 436



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +L+E + D++KQL  +L  Y  T VP      D RADP      WVPW
Sbjct: 466 LFNISGDPYERYDLSEQRPDVVKQLLARLVYYNRTAVPVRYPSEDPRADPYLNGGAWVPW 525


>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
          Length = 1542

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 169/398 (42%), Positives = 232/398 (58%), Gaps = 29/398 (7%)

Query: 29   KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
            +PHII I+ DD G+ DV +HGS +I TP +D LA  G+ L  +YVQ LC+PSRS LMTG+
Sbjct: 1026 QPHIIFIMVDDQGFRDVGYHGS-EIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGR 1084

Query: 89   YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
            Y IH G+QH +I   +P  LPL    LPQ LK AGY+TH +GKWHLGF++    PT RGF
Sbjct: 1085 YQIHTGLQHSIIRPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGF 1144

Query: 149  DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
            D+ +G   G  DYY H  K       G D+    +    +  G+YST +YT+ A+N++A 
Sbjct: 1145 DTFFGSLLGSGDYYSH-YKCDSPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILAS 1203

Query: 208  HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
            HN  +P+FLYLA  AVH+     P Q P   + ++  I +  RR YA MVS LDE+V N+
Sbjct: 1204 HNPKRPIFLYLAFQAVHS-----PLQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNL 1258

Query: 268  IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
              AL+++G  +N ++++ +DNG  P  G     GSN PLRG K + W+GG+R V  + SP
Sbjct: 1259 TLALKQYGYYDNMVMVYSSDNGGQPMAG-----GSNWPLRGSKGSYWEGGIRAVGFVHSP 1313

Query: 327  WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
             L +    S  L HI+DW PTL     G    + +LDG + W+ +++G  + R +ILHNI
Sbjct: 1314 LLVKKGTKSKALIHITDWYPTLVMLGEGTLDENQNLDGYDVWEAISEGRPSPRLDILHNI 1373

Query: 386  DNVDNPQK--------------YYAALRVDDLKYVAGT 409
            D +    K                AALRV D K + G 
Sbjct: 1374 DPIYTKAKNGSWKAGYGIWNTAVQAALRVGDWKLLTGV 1411



 Score = 45.8 bits (107), Expect = 0.060,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 33/61 (54%)

Query: 505  LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
            LFNIT DP E+ +L+     ++K++  KL  Y    VP    P D R++P     +W PW
Sbjct: 1447 LFNITADPYERVDLSRQYPHVVKKMLFKLLRYSKNAVPCRYPPKDYRSNPQYNGGVWGPW 1506

Query: 565  Y 565
            Y
Sbjct: 1507 Y 1507


>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
          Length = 584

 Score =  308 bits (790), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 165/394 (41%), Positives = 237/394 (60%), Gaps = 28/394 (7%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 55  HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 113

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+E GY+TH +GKWHLGF+++   PT RGFD+
Sbjct: 114 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 173

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V  +  G YST LY + A  ++A H+
Sbjct: 174 FLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHS 231

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 232 PHQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 286

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 287 ALKRYGYYDNSVIIFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLK 342

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG  +++   LDG N W  +++G  + R EILHNID +
Sbjct: 343 RKRRTSRALVHITDWYPTLVTLAGGSVSEAEGLDGYNVWPAISEGQVSPRMEILHNIDPL 402

Query: 389 DNPQKY--------------YAALRVDDLKYVAG 408
            N  K+               A++RV + K + G
Sbjct: 403 YNHAKHGSLEGGFGIWNTAVQASIRVGEWKLLTG 436



 Score = 43.1 bits (100), Expect = 0.40,   Method: Compositional matrix adjust.
 Identities = 21/60 (35%), Positives = 31/60 (51%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 472 LFNISADPYEREDLAAQRPDVVRSLLARLVYYNRTAIPVRYPAENPRAHPDFNGGAWGPW 531


>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 648

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 178/422 (42%), Positives = 247/422 (58%), Gaps = 35/422 (8%)

Query: 20  AFLNTT--APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
           +F+++T  + K PHI+  LADDLGW+DV +H  S I TPNID LA  G+ L  +YV  +C
Sbjct: 16  SFVSSTVASTKPPHIVFFLADDLGWHDVGYH-DSIIRTPNIDKLAAEGVKLENYYVTPIC 74

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           TP+R+ LMTG+Y IH  MQHGV++  E   LP  E L+PQ LKE+GY+TH +GKWHLGF+
Sbjct: 75  TPTRAVLMTGRYQIHTTMQHGVLMAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFY 134

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH-NMQVDNKTIGIYSTDL 196
           +   TP  RGFD+ +G++    +Y+ H+ K         D+R  +  V  +  G YST L
Sbjct: 135 KWDCTPNHRGFDTFFGFYLAGGEYFTHTRKCHGHRLDAWDLRDGDKMVGPEYTGEYSTML 194

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAG 255
           Y   A  VI + + + PMFLY++  AVHA     P + PD     +  DI D  R+ YAG
Sbjct: 195 YARKAQEVIRKQDPNVPMFLYVSFQAVHA-----PLEVPDSYADAYGKDIYDQSRKLYAG 249

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWD 314
           M S +DE++GNV   L+  GM +N++V+F  DN GA  FG      SN+PLRG K   ++
Sbjct: 250 MTSCMDEAIGNVTQTLKDRGMWDNTVVVFSTDNGGATGFG-----ASNYPLRGQKCGHYE 304

Query: 315 GGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
           GG+RG A + SP LK   + +  +EL ++ DWLPT    AG   + T  LDGVNQW+ ++
Sbjct: 305 GGVRGPAFVSSPLLKPHVRGTKNTELMYVGDWLPTFVHLAGGTTHGTKPLDGVNQWEAIS 364

Query: 372 KGAKTKRSEILHNIDNVD-------------NPQ---KYYAALRVDDLKYVAGTDNNGQS 415
           +G  ++R EILHNID +              NP    +  AA+RV   K + G   N   
Sbjct: 365 EGLPSRRKEILHNIDPIRAKSPTLKTEPWMLNPYFDIRVMAAIRVGHWKLLTGQKGNANW 424

Query: 416 DE 417
            E
Sbjct: 425 QE 426


>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 558

 Score =  308 bits (789), Expect = 5e-81,   Method: Compositional matrix adjust.
 Identities = 171/403 (42%), Positives = 241/403 (59%), Gaps = 31/403 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHII I+ DD G+ND+ +H ++ I TP +D LA  G+ L  +Y+Q +CTPSRS  +TG+
Sbjct: 37  RPHIIFIMTDDQGFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGR 95

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LP   + LPQ L+EAGYATH +GKWHLGF++    PT RGF
Sbjct: 96  YQIHTGLQHSIIRSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGF 155

Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           ++++G   G  DYY + SC        G D+    +V     G YST LYT+    ++A 
Sbjct: 156 NTYFGSLTGSVDYYTYKSCDG--PKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAA 213

Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           H+  S+P+F++L+  AVH      P Q+P E V  +  + +  RR YAGMVS +DE+V N
Sbjct: 214 HDPSSQPLFIFLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRN 268

Query: 267 VIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           V  ALRK+G  +N+++ F  DNG  P FG     GSN PLRG K T W+GG+RG+  + S
Sbjct: 269 VTYALRKYGYYKNTVIFFSTDNGGQPLFG-----GSNWPLRGRKGTYWEGGVRGIGFVHS 323

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHN 384
           P L++ ++VS  L HI+DW PTL   AG  ++    LDG + WD ++ G ++ R EILHN
Sbjct: 324 PLLRRRRRVSRALIHITDWYPTLMRLAGGNVSQAEGLDGFDMWDTISDGKESPRFEILHN 383

Query: 385 IDNVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
           ID + N  ++               AA+RV D K + G    G
Sbjct: 384 IDPLYNAARHGSLKNGYGIWNTAVQAAVRVGDWKLLTGNPGYG 426



 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 39/69 (56%)

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
           H  +     LFN+ DDP E+ +LAE + D++KQL  +L+ Y  T VP      D RADP+
Sbjct: 447 HTEAKKSLWLFNVADDPYERFDLAERRPDVVKQLLARLSFYNRTAVPVRYPSEDPRADPS 506

Query: 556 RWNNIWVPW 564
           R    W PW
Sbjct: 507 RNGGAWGPW 515


>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 406

 Score =  308 bits (788), Expect = 7e-81,   Method: Compositional matrix adjust.
 Identities = 161/376 (42%), Positives = 228/376 (60%), Gaps = 6/376 (1%)

Query: 21  FLNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCT 78
           FL  T+   + P++++ILADDLGW+DVS+HGS QIPTPNID LA +G+IL  +YVQ L T
Sbjct: 18  FLEATSASSQPPNLVMILADDLGWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLST 77

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQY---LKEAGYATHAIGKWHLG 135
           P+R+AL+TG YPIH G Q   I   +P GL     LLPQ    L +   +  A   WHLG
Sbjct: 78  PTRAALLTGLYPIHTGTQRLDIGSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLG 137

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
           F ++ + PT RGFD+ YG + G  DY+ H  +       G  ++   +   +  G Y T 
Sbjct: 138 FCKDEFKPTKRGFDTFYGIYNGDSDYWTHFARDNNIDVSGHALKDEKRALVEEAGRYLTS 197

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L    A+ +I    K+KP FLY A  AVH G +    QAP E ++KF  ++D +R+ +AG
Sbjct: 198 LLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDRQLFAG 257

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            ++ LD+SVG ++ AL   G L N+++ F  DNG    G  +N   N PLRG K T  +G
Sbjct: 258 SLAELDKSVGLIVEALYVSGQLNNTVIAFSTDNGGAPVGFSANTSPNWPLRGTKGTVAEG 317

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGA 374
           G+RG   +WS  L    +V+ +LFH++DW+PT   AAG +  D T +DGV+QW+ LT+GA
Sbjct: 318 GVRGPGFLWSSSLTTRGRVTQQLFHVTDWMPTFYTAAGGQAKDLTMIDGVDQWESLTRGA 377

Query: 375 KTKRSEILHNIDNVDN 390
            + R E+L+NID + +
Sbjct: 378 PSPRKEVLYNIDPISD 393


>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 570

 Score =  307 bits (787), Expect = 8e-81,   Method: Compositional matrix adjust.
 Identities = 173/415 (41%), Positives = 243/415 (58%), Gaps = 36/415 (8%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           +  P  PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L  +Y+Q +CTPSRS 
Sbjct: 42  SPVPTSPHIIFIMTDDQGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQ 100

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L+TG+Y IH G+QH +I   +P  LP  +  LPQ L+E GY+TH +GKWHLGF+++   P
Sbjct: 101 LITGRYQIHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLP 160

Query: 144 TFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFD+++G   G  +YY + SC        G D+     V     G YST LYT+   
Sbjct: 161 TRRGFDTYFGSLTGSVNYYTYDSCDG--PGMCGFDLHEGESVAWSQKGKYSTHLYTQRVR 218

Query: 203 NVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            ++A H+ +S+P+F++L+  AVH      P Q P E +  +  + +  RR YA MVS +D
Sbjct: 219 KILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVD 273

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E+V N+   LRK+G  ENSI++F  DNG        + GSN PLRG K T W+GG+RG+ 
Sbjct: 274 EAVRNITYGLRKYGYYENSIMIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGLG 329

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
            I SP L++ ++VS  L HI+DW PTL   A G E +   +DG + W+ ++ G ++ R E
Sbjct: 330 FIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIEGVDGYDVWEAISDGRESPRLE 389

Query: 381 ILHNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNGQSDEWYGD 421
           ILHNID + N       QK Y        A++R  D K + G       D  YGD
Sbjct: 390 ILHNIDPLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG-------DPGYGD 437



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +L+E + D++KQL  +L  Y  T VP      D RADP      WVPW
Sbjct: 466 LFNISGDPYERLDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDLRADPHLNGGAWVPW 525


>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
          Length = 575

 Score =  306 bits (785), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 170/411 (41%), Positives = 244/411 (59%), Gaps = 30/411 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L  T  + PHII IL DD G++D+ +HGS  I TP +D LA  G+ L  +Y+Q +CTPSR
Sbjct: 42  LPPTFTRPPHIIFILTDDQGYHDIGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSR 100

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++  
Sbjct: 101 SQLITGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKEC 160

Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            PT RGFD+  G   G  DYY + +C        G D+     V     G YST LY + 
Sbjct: 161 LPTRRGFDTFLGSLTGNVDYYTYDNCDG--PDVCGYDLHEGEDVAWDQSGKYSTFLYAQR 218

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
              ++A H+  +P+F+Y+A  AVH      P Q+P E + ++  + +  RR YA MV+ +
Sbjct: 219 VSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCM 273

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE+V N+  AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG+RG+
Sbjct: 274 DEAVKNITWALKKYGYYDNSVMVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGI 329

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
             + SP +K+ ++ S  L HI+DW PTL + A G   N   LDG + W  +++G  + R+
Sbjct: 330 GFVHSPLIKRKRRTSWALVHITDWYPTLVSLARGNLSNVQGLDGYDVWPAISEGKDSPRT 389

Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSD 416
           EILHNID + N  KY               A++RV + K + G  + G SD
Sbjct: 390 EILHNIDPLYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSD 438



 Score = 45.4 bits (106), Expect = 0.074,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L+E + D+++ L  +L  Y  T +P      + RA P      W PW
Sbjct: 468 LFNITADPYERYDLSEQRPDVVRTLLTRLVHYNRTAIPVRYPAENPRAHPDFNGGAWGPW 527

Query: 565 YDELDKQK 572
             E D ++
Sbjct: 528 ASEDDGEE 535


>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
          Length = 438

 Score =  306 bits (784), Expect = 2e-80,   Method: Compositional matrix adjust.
 Identities = 178/451 (39%), Positives = 263/451 (58%), Gaps = 39/451 (8%)

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +CTPSR A ++G++P+  GMQ   +   EP GL L + LLP+YL++ GY TH +GKWH+G
Sbjct: 3   VCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHVG 62

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---Y 192
           +    Y PT RGFD+  GY+ G+  Y++H+ +   +   G D+ H +  DN T+     Y
Sbjct: 63  YLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEENEQV--GYDL-HRIVGDNHTVEYRYDY 119

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPER 250
            T+L TE A N+I+ HN  KPM+L LAHLA HA N  E  +  D  E  A    I D  R
Sbjct: 120 MTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDVNR 179

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG-IHSNKGSNHPLRGMK 309
           R +A +V+ LD+S+G V+ AL K  ML+N+I+LF++DNGA + G ++ N GSN+PLRG+K
Sbjct: 180 RKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQTEGPLYQNFGSNYPLRGLK 239

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWD 368
            + ++GG+RG A I+SP ++ + KVS++LFHI+DWLPTL +AAG   +D   LDG++QW 
Sbjct: 240 FSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDLYQLDGIDQWS 299

Query: 369 VLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
            +     +KR  ++ NI++VD     Y+A  +   K +    +  +    Y   D    K
Sbjct: 300 TIKTQEASKRKSVVINIEDVD-----YSAALMGRYKLIQDKSDFQKHYTNYKGNDPSYPK 354

Query: 429 YSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRC 488
           Y+   +L S                   AA   AN S+    I+T  KI ELR+ A V C
Sbjct: 355 YNVTNILESP------------------AASAIANISNH---IVTANKIEELRKEATVVC 393

Query: 489 NYDNKGAHCNSTVKPCLFNITDDPCEQNNLA 519
             +   + C S  + CLF+I +DPCE  +++
Sbjct: 394 E-NLMTSDCES--RTCLFDIYEDPCEVTDIS 421


>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 527

 Score =  306 bits (783), Expect = 3e-80,   Method: Compositional matrix adjust.
 Identities = 171/416 (41%), Positives = 244/416 (58%), Gaps = 36/416 (8%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
            ++ P  PHII I+ DD G+ND+ +H SS I TP +D LA +G+ L  +Y+Q +CTPSRS
Sbjct: 41  RSSVPTPPHIIFIMTDDQGFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRS 99

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L+TG+Y IH G+QH +I   +P  LP  +  LPQ L+E GY+TH +GKWHLGF+++   
Sbjct: 100 QLITGRYQIHTGLQHSIIRPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECL 159

Query: 143 PTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           PT RGFD+++G   G  +YY + SC        G D+     V     G YST LYT+  
Sbjct: 160 PTRRGFDTYFGSLTGSVNYYTYDSCDG--PGVCGFDLHEGESVAWSQRGKYSTHLYTQRV 217

Query: 202 INVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
             ++A H+ +S+P+F++L+  AVH      P Q P E +  +  + +  RR YA MVS +
Sbjct: 218 RKILATHDPQSQPLFIFLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAV 272

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE+V N+   LRK+G  +NSI++F  DNG        + GSN PLRG K T W+GG+RG+
Sbjct: 273 DEAVRNITYGLRKYGYYQNSIIIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGL 328

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
             + SP L++ ++VS  L HI+DW PTL   A G E +   +DG + W+ ++ G ++ R 
Sbjct: 329 GFVHSPLLRKKRRVSKALVHITDWYPTLVGLAGGDESHIEGVDGYDVWEAISDGKESPRL 388

Query: 380 EILHNIDNVDNP------QKYY--------AALRVDDLKYVAGTDNNGQSDEWYGD 421
           EILHNID + N       QK Y        A++R  D K + G       D  YGD
Sbjct: 389 EILHNIDPLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG-------DPGYGD 437



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/60 (45%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +L+E + D++KQL  +L  Y  T VP      D RADP      WVPW
Sbjct: 466 LFNISGDPYERFDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDLRADPHLNGGAWVPW 525


>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 425

 Score =  305 bits (781), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 184/451 (40%), Positives = 248/451 (54%), Gaps = 42/451 (9%)

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           LCTPSRSALMTG YPIH GMQ  VIL   PWGLPL  K+LPQ+ K+ GY  + IGKWHLG
Sbjct: 3   LCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWHLG 62

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
           FF+  Y P  RGFD+ +G + G  DYY+H+  +  +  + +     + V   T G+    
Sbjct: 63  FFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITGALCLTVGGITHGV--EP 120

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L T++ +N   +     P F Y +H AVH     E FQAP   V KF  I +  R  YAG
Sbjct: 121 LLTDSPLNFYPQ-----PFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIYAG 175

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           M+  LDES+G V+ AL   GMLE++I++F +DNG   +GI SN+G N PLRG K T W+G
Sbjct: 176 MLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIESNRGFNWPLRGAKFTVWEG 235

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGA 374
           G+R  A +WSP   +  +VS++L HISDWLPTL  AAG + +   ++DGV+ W+ LT  +
Sbjct: 236 GVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAGGDASALGTIDGVDMWNSLTHRS 295

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDT--DNEIDKY 429
            +KR EIL N +++ N     + LR    K V G   +G+ D+ Y   G T   N+ID+ 
Sbjct: 296 ASKRKEILLNSNSLYN----ISGLRYKQYKLVVGGGFDGELDDHYYFPGGTRPTNDIDQL 351

Query: 430 SPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN 489
             +         T   L+ K   KQ  +  P A S++                   V C 
Sbjct: 352 RNEST-------TARVLR-KFYEKQDRSWSPLAWSNN-----------------VAVDCR 386

Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAE 520
            +    +      P LFNI  DPCE NNLA+
Sbjct: 387 RNKLTENFVPMQPPYLFNIEKDPCELNNLAK 417


>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
 gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
          Length = 270

 Score =  305 bits (781), Expect = 5e-80,   Method: Compositional matrix adjust.
 Identities = 145/271 (53%), Positives = 187/271 (69%), Gaps = 6/271 (2%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKP+I++I+ADDLGW+DVSFHGSSQIPTP ID LA  G+ILN +YV  +CTP+R++LMTG
Sbjct: 1   KKPNIVMIVADDLGWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTG 60

Query: 88  KYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           K+P+++GM    H  +   +P+GLPL E   PQY+K  GY TH IGKWHLGFF + YTPT
Sbjct: 61  KHPMNLGMLIHTHATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPT 120

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           +RGFDS YG+W G +DY+DHS +   E   G D+R N +      G Y T+L+ E A  +
Sbjct: 121 YRGFDSFYGFWNGKEDYWDHSSQ---EDVWGTDLRDNEKPVRNESGHYGTELFAERAAQI 177

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  HN++KP++LYLA   VH+ N  EP QAP   + KF  IS P+RR YA MVS LDESV
Sbjct: 178 IHLHNQTKPLYLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESV 237

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
             V  AL + GML N++++F  DNG    G 
Sbjct: 238 ETVHKALSETGMLNNTVLVFTTDNGGAPRGF 268


>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 500

 Score =  304 bits (779), Expect = 8e-80,   Method: Compositional matrix adjust.
 Identities = 166/422 (39%), Positives = 246/422 (58%), Gaps = 25/422 (5%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F +   ++ N          KP+I+ IL DDLG+NDV +HGS +I TPNID LA  G+ 
Sbjct: 5   LFFINLIIISNVCSKKIIQSTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVR 63

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YVQ +CTP+RS L++G+Y IH G+QH  I   +P  LP         L+EAGYATH
Sbjct: 64  LENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATH 123

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-----SCKATFEPYQGLDMRHNM 182
           A+GKWHLGF+++   PT RGFDS++GY  G +DY+ H             + G+D+  N 
Sbjct: 124 AVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNE 183

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           +V  + +G YS  ++TE    ++A+H   +P  LYL   +VH+     P Q P     ++
Sbjct: 184 KVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERY 238

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
            +I +  RR YAGM++ LDE+VG ++ +L++ G+ +N++++F  DNG    G  +  G+N
Sbjct: 239 KNIKNTNRRIYAGMMTCLDEAVGKIVHSLQQAGVWDNTVLIFSTDNG----GEVAAGGNN 294

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDT 359
            PLRG K + W+GGMRGV  + S  L  + +  V+ +L H+SDW PTL    AG  +++ 
Sbjct: 295 WPLRGWKRSIWEGGMRGVGFVNSRLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDNI 354

Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYY-----AALRVDDLKYVAG--TDNN 412
           +LDG N W  ++    + R E+LHN+D   N  K +     AA+RV   K + G    NN
Sbjct: 355 TLDGFNMWKTISSNTSSPRQELLHNMDPWPNQPKEFNANISAAIRVGHWKLLTGDPGSNN 414

Query: 413 GQ 414
           GQ
Sbjct: 415 GQ 416



 Score = 42.0 bits (97), Expect = 0.92,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 34/60 (56%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E ++L+ S  +++++L  KL  Y +  VP       + A+PA    +W PW
Sbjct: 440 LFNITADPNEYHDLSTSHPEVVEELMAKLKKYYADSVPVQYPSEVEEANPAYHQGVWGPW 499


>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 172/409 (42%), Positives = 241/409 (58%), Gaps = 38/409 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHI+ IL DD G+ND+ +H  + I TP +D LA  G+ L  +YVQ +CTPSRS L+TG+Y
Sbjct: 44  PHIVFILIDDQGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRY 102

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  LP     LP+ L+ AGY+TH +GKWHLGF+R+   PT +GFD
Sbjct: 103 QIHTGLQHSIIRSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFD 162

Query: 150 SHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           + +G   G  DYY + SC        G D+  N +V     G YST L+T+ A  ++  H
Sbjct: 163 TFFGSLTGSVDYYSYGSCNG--PGVCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSH 220

Query: 209 NKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           + + +P+FL L+  AVH      P Q P   V  + D+++ +RR +A MVS +DE+V NV
Sbjct: 221 DPADRPLFLLLSLQAVHG-----PLQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNV 275

Query: 268 IAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
             ALRK+G  +NS++++  DNGA P  G     GSN PLRG K T W+GG+RGVA + SP
Sbjct: 276 TYALRKYGYYKNSVIIYSTDNGAQPYVG-----GSNWPLRGRKGTYWEGGVRGVAFVHSP 330

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            LK  ++VS+ L HI+DW PTL   AG  I+ +  LDG + W  +++G ++ R EILHNI
Sbjct: 331 LLKHRRRVSTALLHITDWFPTLVGLAGGNISQSPGLDGFDVWPTISEGKESPRKEILHNI 390

Query: 386 DNVDN-PQK--------------------YYAALRVDDLKYVAGTDNNG 413
           + +   P K                      AA+RV D K + G   +G
Sbjct: 391 NPLHRAPSKRGICNHSMLPSGLSAVWDTSVQAAIRVGDWKLLTGDPGDG 439



 Score = 52.0 bits (123), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 32/71 (45%), Positives = 43/71 (60%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++ + D++ QL  +LA Y  T VP    P D +ADP R    WVPW
Sbjct: 471 LFNITADPYERQDLSQQRPDVVLQLLGRLAFYNQTAVPVYFPPDDPQADPDRHQGAWVPW 530

Query: 565 YDELDKQKAIE 575
            DE D++   E
Sbjct: 531 VDEDDEEGKYE 541


>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
          Length = 481

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 170/416 (40%), Positives = 240/416 (57%), Gaps = 35/416 (8%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L    P       I+ DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSR
Sbjct: 27  LVADGPGXXXXXFIVTDDQGYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSR 85

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+  
Sbjct: 86  SQLLTGRYQIHTGLQHSIIRPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKEC 145

Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            PT RGFD+  G   G  DYY + +C        G D+     V     G YST LY + 
Sbjct: 146 LPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWGLSGQYSTMLYAQR 203

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A +++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +
Sbjct: 204 ASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 258

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE+V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+
Sbjct: 259 DEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGL 314

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
             + SP LK+ Q+ S  L HI+DW PTL   AG   +    LDG N W  +++G  + R+
Sbjct: 315 GFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYNVWPAISEGRASPRT 374

Query: 380 EILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGD 421
           EILHNID + N  ++               AA+RV + K + G       D  YGD
Sbjct: 375 EILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD 423


>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
          Length = 477

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 167/397 (42%), Positives = 233/397 (58%), Gaps = 19/397 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           +TA  KPHI+ I+ DDLGWNDV +  +  I TP +D LA +G+ILN  YVQ LC+PSR+ 
Sbjct: 18  STAADKPHIVFIVVDDLGWNDVGWI-NPDIKTPTLDRLARSGVILNSSYVQPLCSPSRNC 76

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            M+GK+P H G+Q  V+   +P  +P     +PQ LK  GY TH +GKWHLGF    YTP
Sbjct: 77  FMSGKFPYHTGLQDKVVFIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTP 136

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           T+RGFD+  GY+ G++DY+ H  +     Y G D R+   V     G YST ++   AI+
Sbjct: 137 TYRGFDTFMGYYAGMEDYFTH-VRDEVADYNGYDFRNMTDVYKGAQGEYSTYVFANRAID 195

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I  H+KSKPM+LYL   AVH      P Q P +   ++  I D  R+ Y+GMVS LD++
Sbjct: 196 IIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKYSDRYSHIHDLNRKVYSGMVSALDDA 250

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           V N+   L   G ++N +++F  DNG P     +   +N PLRG K+T W+GG RG A +
Sbjct: 251 VYNITTVLEDLGFMDNLLLVFTTDNGGPP----TRGANNWPLRGGKNTLWEGGTRGAAFV 306

Query: 324 WS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +S   LK+       LFH  DW PTL   AG + +   +DG++Q D++  G  + R++ +
Sbjct: 307 YSKTLLKKKGYTHPGLFHAVDWYPTLLDIAGGDSSQLDIDGLSQVDMILNGDSSVRNQFV 366

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
           +NI +      Y AALR  DLK + G  N G S  WY
Sbjct: 367 YNIYD----DHYAAALRWGDLKLIVG--NPGYS-GWY 396



 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 32/59 (54%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           L+NI+ DP E  ++++   DL+  L   LA    T VP  + P D R++P  ++  W P
Sbjct: 418 LYNISTDPTEHEDISKDYPDLVNFLLQMLARKNGTRVPAMDPPSDPRSNPVNFDGSWSP 476


>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 1410

 Score =  304 bits (778), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 154/368 (41%), Positives = 231/368 (62%), Gaps = 22/368 (5%)

Query: 32  IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
           I+ I+ DD+GWNDV +HGSS I TP++D LA  G  L  +YV  LC+PSR   +TGK+ I
Sbjct: 29  IVFIMIDDVGWNDVGYHGSS-ISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMI 87

Query: 92  HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
           H+GM+ G+I+  E   LP+ E  + Q LK   Y+THAIGKWH+G++++   P  RGFD+ 
Sbjct: 88  HLGMEGGIIMPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTF 147

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS 211
           +G   G  D+Y H     +E Y     R+N+ +  +  G YSTDLY   A NVI  H+ S
Sbjct: 148 FGIIGGCADHYTHKNTHWWELY-----RNNISIAQEYQGHYSTDLYAREATNVIRNHDAS 202

Query: 212 KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAAL 271
           KPMF+YLA  A H      P QAP + +  + +I DP+RR YA MV+ +D+++GNV+  L
Sbjct: 203 KPMFMYLAFQAAHL-----PLQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQL 257

Query: 272 RKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
           R+ G+  N++++   DNG          G+N PL+G K++ W+GG+RGVA + SP+L   
Sbjct: 258 REAGLWNNTVLIVSTDNGGAKIS-----GNNWPLKGSKASLWEGGVRGVAFVTSPFLADH 312

Query: 332 QKVSS--ELFHISDWLPTLCAAAGI---EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
            + +S  +L H++DW PT+   AG+    I   +LDGVNQW+ +++  +T+R+++L N+ 
Sbjct: 313 VRGTSNHQLMHVTDWFPTMLHVAGVYSDVIKSMTLDGVNQWNTISENRQTERNDVLINL- 371

Query: 387 NVDNPQKY 394
           NV+   K+
Sbjct: 372 NVNRKVKH 379


>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 585

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 169/385 (43%), Positives = 238/385 (61%), Gaps = 18/385 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHI+++LADD+GWNDV ++ +  +PTP ++ LA NG+ILN  Y Q  C+PSR+AL+TG
Sbjct: 106 KPPHIVLVLADDVGWNDVGWN-NDFMPTPILNELASNGVILNNTYSQPACSPSRAALLTG 164

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KYP + G+QH V+ E  P+ LPL   LL   LKE GY  HAIGKWHLGF    YTP +RG
Sbjct: 165 KYPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHLGFCNWKYTPLWRG 224

Query: 148 FDSHYGYWQG-LQDYYDHSCKATF--EP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FDS YG + G L DY  H   + F  EP   GLD+R N  V     G + T L+TE A  
Sbjct: 225 FDSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENGTHVTYLFTERAER 284

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I  HN + P+FLY A L  H      P  AP       ++++D  RR YAGM+S +DE+
Sbjct: 285 IIRNHNPAAPLFLYTALLVPHG-----PLDAP-PGFENSVEVNDTRRREYAGMISAMDEA 338

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VGN+  AL+  GM  +++ +F++DNG   F      G+N+P RG K++ W+G ++    I
Sbjct: 339 VGNITNALKDKGMWNDTLFIFLSDNGGDMF----YAGNNYPFRGNKASLWEGSVKIPGFI 394

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           +   L++    ++EL H +D   TL +AAG   +D  +DG+NQWD + +G  +KRSEIL 
Sbjct: 395 YGNMLEKKGYSNNELIHFTDLFATLLSAAGGTPDD-DIDGINQWDTVCRGDASKRSEILL 453

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAG 408
           ++D  +  +   AALR++D KY+ G
Sbjct: 454 HLDTFEATRG--AALRMNDYKYIEG 476


>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 732

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 162/370 (43%), Positives = 227/370 (61%), Gaps = 18/370 (4%)

Query: 23  NTTAPKK---PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
           + TA KK   PHI+ IL DD G+ND+ +H  + I TP +D LA  G+ L  +YVQ +CTP
Sbjct: 39  DETAQKKKNQPHIVFILTDDQGFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTP 97

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SRS L+TG+Y IH G+QH +I   +P  LP     LP+ L+EAGY TH +GKWHLGF+R+
Sbjct: 98  SRSQLITGRYQIHTGLQHSIIRPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRK 157

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
              PT +GFD+ +G   G  DYY + SC    +   G D+  +  V     G YST L+T
Sbjct: 158 ACLPTRKGFDTFFGSLTGSVDYYSYESCDG--KDLCGYDLHDDEGVAWGQEGKYSTTLFT 215

Query: 199 EAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           + A  ++  H+ + +P+FL L+  AVH      P   P+  +  F D+ +  RR +AGMV
Sbjct: 216 QRARKILESHDPAERPLFLLLSFQAVHT-----PLNPPESYIYPFRDMVNVARRNFAGMV 270

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S +DE+V NV  ALRK+G   NS++++  DNGA  F      G+N PLRG K T W+GG+
Sbjct: 271 STVDEAVRNVTYALRKYGYYRNSVIIYSTDNGAQPF----TGGNNWPLRGRKGTYWEGGV 326

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
           RGV  + SP L++ ++VS  L HI+DW PTL   A   I+ +  LDG + W  +++G ++
Sbjct: 327 RGVGFVHSPLLRRRRRVSKALMHITDWFPTLVGLARGNISQSQGLDGFDVWPAISEGRES 386

Query: 377 KRSEILHNID 386
            R EILHNID
Sbjct: 387 PRKEILHNID 396



 Score = 59.7 bits (143), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 28/60 (46%), Positives = 40/60 (66%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+++LA+ + D+++QL  +LA Y  T VP    P D RA+P++    WVPW
Sbjct: 622 LFNITGDPYERHDLADQRPDIVQQLLARLAYYNQTAVPVYFPPDDPRANPSQHRGAWVPW 681


>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 631

 Score =  303 bits (777), Expect = 1e-79,   Method: Compositional matrix adjust.
 Identities = 195/580 (33%), Positives = 304/580 (52%), Gaps = 56/580 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II++LADDLG++DVSFHG+ QIPTPN+DA+A +G+ILN+HY     TPSR A  TGK
Sbjct: 41  RPNIIVVLADDLGYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGK 100

Query: 89  YPIHIGMQHGVILEGE-PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            P+ IG+  G IL+G    GL L  ++LP Y+++ GY TH IG+W LGF++E   PT RG
Sbjct: 101 LPLRIGLNEGPILKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRG 160

Query: 148 FDSHYGYWQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+HYG +     Y  H  +     +       D+  + + D    G Y TDLY      
Sbjct: 161 FDTHYGPYSDSASYSSHLSREDAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFER 220

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRL 260
           +I +  + KP+F+ L+H   H G ++ P   P       ++FL I D  RR+YA  V  L
Sbjct: 221 IIEQ--RRKPLFIVLSHQTPH-GASFGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESL 277

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFG--IHSNKGSNHPLRGMKSTPWDGGMR 318
           D+S  +++  L+++ +L  S++LF +DNG  +F         SN PLRG K+T W+GG+R
Sbjct: 278 DDSFAHMVDVLQRNNILNESVILFSSDNGGTTFKDPYFKTGASNWPLRGQKNTQWEGGVR 337

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWD-VLTKGAKT 376
             + +WS  + +   VS EL+H  DWLPT     G +I D   +DG + WD ++ +   +
Sbjct: 338 VSSFVWSSQITEPY-VSEELYHFVDWLPTFRRLGGGDIGDLKDIDGFDIWDSIVNRTTAS 396

Query: 377 KRSEILHNIDN---VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
            R++IL   D    VD+P   +AA+ +   KYV+G        +W G   ++ + Y P  
Sbjct: 397 PRTKILLAYDRDFTVDSPG--FAAI-IGKFKYVSG--------KWSGR--DQTNHYVPAS 443

Query: 434 VLYSKAGITFNALKTKLQI----KQKHAADPKANSSDALRTI-----------LTDEKIL 478
             YS+       L+   ++    +      P A+  +  R             ++ +KI+
Sbjct: 444 FKYSEDSQVLERLRNDSKVWKNLRDIGRPGPAADWREQARIKCSWGAHLTCNGVSCDKIM 503

Query: 479 ELREFAR----VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLA----ESQTDLLKQLE 530
           E    +R    +R    N    C+   +PCLF++ +DPCE  N+A    + + D+   + 
Sbjct: 504 ETFRGSRGNVEIRVEPVNVSHACDFAHEPCLFDLENDPCETTNIANDMPQIRDDIGSHII 563

Query: 531 DKLAIYKSTMVPPGNKPFDKRADPA-RWNNIWVPWYDELD 569
           +     +ST     +  FD+ ++PA R   +W PW    D
Sbjct: 564 NDFDPVESTARLRYDACFDQNSNPANRDPPVWEPWLSGCD 603


>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
          Length = 551

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 159/363 (43%), Positives = 222/363 (61%), Gaps = 14/363 (3%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHII IL DD G+ DV +HGS  I TP +D LA  G+ L  +YVQ LC+PSRS LMTG+
Sbjct: 23  QPHIIFILVDDQGFRDVGYHGSD-IKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGR 81

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH VI   +P  LPL    LPQ LK AGY+TH +GKWHLGF++    PT RGF
Sbjct: 82  YQIHTGLQHSVIRAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGF 141

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
           D+ +G   G  DYY H  K       G D+    +    +  G+YST+++T+ A++++A 
Sbjct: 142 DTFFGSLLGSGDYYSHY-KCQGPSMCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSILAN 200

Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           HN + +P+FLYLA+ AVH+     P Q P   + ++  I +P RR YA MVS LDE++ N
Sbjct: 201 HNPRKQPLFLYLAYQAVHS-----PLQVPARYLERYKGIPNPYRRKYAAMVSCLDEAIHN 255

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+ +G  +N+++++ +DNG          GSN PLRG K+T W+GG+R V  + SP
Sbjct: 256 LTLALKHYGYYDNTVIVYSSDNGGQPLA----GGSNWPLRGSKATYWEGGIRTVGFVHSP 311

Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            L         L HI+DW PTL     G    D +LDG + W+ +++G  + R +ILHNI
Sbjct: 312 LLMNKGTKCRSLIHITDWFPTLVTLGEGTLDEDLNLDGYDVWETISEGRPSPRHDILHNI 371

Query: 386 DNV 388
           D +
Sbjct: 372 DPI 374



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 42/73 (57%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++    ++K++  +LA Y  T VP      D R++P     +W PW
Sbjct: 445 LFNITADPYERVDLSQRYPHIVKKMLMRLAQYNKTAVPVCYPTKDLRSNPQYNGGVWGPW 504

Query: 565 YDELDKQKAIETM 577
           + ++ +Q+  + M
Sbjct: 505 HKDVREQQEEDEM 517


>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 562

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 197/581 (33%), Positives = 287/581 (49%), Gaps = 93/581 (16%)

Query: 28  KKPHIIIILADDLGWNDVSFHGS---SQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           ++PH++ +LADD G+ND+ +H     S + TP +D+LA  G+IL  +YVQ +C+P+R  L
Sbjct: 32  QRPHVVFVLADDFGFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQL 91

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG+Y IH G+ HG+I   +P+GLPL   LL Q L++ GY T+ +GKWHLGFFRE Y P 
Sbjct: 92  LTGRYQIHTGLAHGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPW 151

Query: 145 FRGFDSHYGYWQ-GLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            RGF + +G+   G+  +  + C+      F  Y  +D R+     N T G YST+L+  
Sbjct: 152 NRGFQNFFGFLNGGVNHFTRYHCEPKKTRRFCGYDMIDSRYG--PTNATYGEYSTNLFIR 209

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            +  +I +HNK KPMFLYL+  AVH      P Q P++ + +F  I D  RR YAGMV  
Sbjct: 210 KSKEMIDKHNKQKPMFLYLSLQAVHG-----PLQVPNQYLKRFKHIRDKNRRIYAGMVYA 264

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +D  +  ++  L++  M +N+I +F  DNG    G  +  G+N PLRG K T W+GG+RG
Sbjct: 265 MDRGIRQLVKHLKRARMWKNTIFIFSTDNG----GQTTRGGNNWPLRGKKGTLWEGGIRG 320

Query: 320 VAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE--INDTSLDGVNQWDVLTKGAKT 376
           V  +   P    T +V+ EL H+SDW PT+ +A      +    +DG +QW+ L     +
Sbjct: 321 VGFVHGKPLQVTTPRVNKELLHVSDWYPTIMSATHCPYVVGTPPIDGYDQWETLRSNKTS 380

Query: 377 KRSEILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNE 425
           KR+EILHNID +                  AA+R  D K + G       +EW       
Sbjct: 381 KRTEILHNIDPLHKKLTESEFREGFDVSVRAAIRSGDWKLLTGYAG---MNEW------- 430

Query: 426 IDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR 485
                P E  +    I         QI+      P  NS+  +R     E   E RE   
Sbjct: 431 ---VVPPECTHGNITIG--------QIRDDFHVYP--NSTKNVRLFKITEDPHEERE--- 474

Query: 486 VRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
                                 ++D   E  NL      LL +L++    Y ++ +P   
Sbjct: 475 ----------------------VSDQYPEVVNL------LLARLKE----YNTSSLPVDY 502

Query: 546 KPFDKRADPARWNNIWVPW--YDELDKQKAIETMRLHKPLL 584
              DK++DP R    W+PW       K+K     R  KP L
Sbjct: 503 PHNDKKSDPKRHGGFWLPWRGVGRKYKRKKFRPHRARKPKL 543


>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 588

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 164/369 (44%), Positives = 234/369 (63%), Gaps = 17/369 (4%)

Query: 30  PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           PH+I+I+ADDLG+NDV +H   G S I TPNID +AY+G+ L  +YVQ +CTP+RS L+T
Sbjct: 94  PHVIMIIADDLGYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLIT 153

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH GMQH  +  G P  LPL E  L Q LK+ GY+THA+GKWHLG+  +   P+ R
Sbjct: 154 GRYQIHTGMQHLNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRR 213

Query: 147 GFDSHYGYWQGLQDYYDHSCKATF--EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           GF+S +G   G  D++ H+  A F  +   G  M +N ++  K  G +ST LYT  A  +
Sbjct: 214 GFESFFGNIMGSADHWSHNKTALFGDKLVMGKSMYYNERIYWKHEGTFSTTLYTNRARQL 273

Query: 205 IAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           I +  ++KP+FLYL++ AVH   N  E +  P E +     I + +RR YAG+V+ LDE+
Sbjct: 274 IRKQPRNKPLFLYLSYEAVHTPLNVPEQYAKPYEGI-----IHNSKRRRYAGLVNILDEA 328

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           V NV  AL+ +G+ +NS+++F  DNG        + G+N PLRG KST W+GG+RGV  +
Sbjct: 329 VRNVTEALKYNGLYDNSVIIFTTDNGGRP--KPRSVGNNWPLRGGKSTLWEGGIRGVGFV 386

Query: 324 WSPWLKQTQK--VSSELFHISDWLPTLCA--AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
            SP +    +  V+ +L H+SDW PT+    A G  + +  LDG +QW  ++KG ++ R 
Sbjct: 387 HSPLIPWELRGTVNRQLIHVSDWFPTIVXGIAGGKLVTNKPLDGXHQWKTISKGTESNRH 446

Query: 380 EILHNIDNV 388
           EILHNID +
Sbjct: 447 EILHNIDPI 455



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN-KPFDKRADPARWNNIWVP 563
           L+NI  DP E+ N+A  Q  ++ +L  +L  Y++T V P +  P D+R +P      WVP
Sbjct: 525 LYNIKKDPNERQNMAPYQKKIVYRLLKRLQDYQNTAVTPIHLGPKDERGNPKYHGGAWVP 584

Query: 564 WYDE 567
           W D 
Sbjct: 585 WMDR 588


>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 522

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 185/480 (38%), Positives = 254/480 (52%), Gaps = 78/480 (16%)

Query: 2   TWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
           TW R     L   LL     L T +  +PHII ILADDLGW+DV +HGS  + TP IDAL
Sbjct: 4   TWLR----CLLSILLGVSLSLATASATRPHIIFILADDLGWDDVGYHGSV-MKTPYIDAL 58

Query: 62  AYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE 121
           A  G+ L  +Y+ +LCTPSRS L+TG+Y IH G+QHG IL  +P  LPL E  LPQ LKE
Sbjct: 59  AAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTILMMQPLCLPLDEITLPQKLKE 118

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
            GY TH +GKWHLGF+R+   P  RGFDS  G++Q + D++ H+  A+   + G D R N
Sbjct: 119 EGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGDHFYHNISASPGHFNGFDFRRN 178

Query: 182 MQ-VDNKTIGIYSTDLYTEAAIN--------------------------VIAEHNKSKPM 214
              V ++  G YST ++    IN                          + + +N  +P+
Sbjct: 179 NDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVKGVYFRGYLSSFTPITSSYNPQQPL 238

Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI--SDPERRTYAGMVSRLDESVGNVIAALR 272
           FLYL+  AVH      P Q P      + D+  +D +RR YAGMV+ +DE+VG V   L+
Sbjct: 239 FLYLSFQAVHT-----PLQVPSRYAELYNDLIPNDEDRRIYAGMVTCMDEAVGYVTTVLK 293

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           +  + ENS+V+F                    LRG K T ++GG+RGV  + SP L +  
Sbjct: 294 QSALWENSVVIFSTX-----------------LRGGKGTLYEGGVRGVGFVTSPLLHEQV 336

Query: 333 K--VSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV- 388
           +  ++ EL HISDW PTL   AG ++     LDG NQW+ +++G  + R+EIL NID + 
Sbjct: 337 RGTINHELIHISDWFPTLTRLAGGDLMGGGRLDGYNQWETISQGTPSLRTEILLNIDPLY 396

Query: 389 -------DNP--------QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
                  D+P            AA+RV D K + G   NG    W   T++ I    P+ 
Sbjct: 397 TAVRNLRDDPWAESTCFNVSMRAAIRVGDWKLITGGAGNG---SWIPPTESGIAALHPER 453



 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 21/61 (34%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR-ADPARWNNIWVP 563
           L+N+  DP E+ + +  + D+ ++L  +L  Y  T VPP     + R ADPA +  +W P
Sbjct: 461 LYNVHVDPYERMDYSHERPDIAEELLQRLGHYYRTSVPPNFPQQEVRSADPANYGGVWSP 520

Query: 564 W 564
           W
Sbjct: 521 W 521


>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
          Length = 513

 Score =  300 bits (767), Expect = 2e-78,   Method: Compositional matrix adjust.
 Identities = 157/337 (46%), Positives = 210/337 (62%), Gaps = 14/337 (4%)

Query: 55  TPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKL 114
           TP++DALA  G++L+ +Y Q LCTPSRS L+TG+Y I  G+QH +I   +P  +PL EKL
Sbjct: 49  TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108

Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ 174
           LPQ LKEAGY TH +GKWHLG +R+   PT RGFD+++GY  G +DYY H      +   
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLIDALN 168

Query: 175 ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
                LD R   +V      +YST+++T+ AI +I  H   KP+FLYLA  +VH     E
Sbjct: 169 VTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----E 223

Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
           P Q P+E +  +  I D  R  YAGMVS +DE+VGNV AAL+  G+  N++ +F  DNG 
Sbjct: 224 PLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGG 283

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
            +       G+N PLRG K + W+GG+RGV  + SP LKQ    + EL HISDWLPTL  
Sbjct: 284 QTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVK 339

Query: 351 AAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
            A    N T  LDG + W  +++G+ + R E+LHNID
Sbjct: 340 LARGHTNGTKPLDGFDVWKTISEGSPSPRIELLHNID 376


>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
          Length = 1941

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 170/409 (41%), Positives = 252/409 (61%), Gaps = 28/409 (6%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           + FA+   ++F   +   T P  PHI++ILADDLGWND+ +  + QI TP +D LAY G+
Sbjct: 23  FLFAVAA-IMFITLYDKKTNP--PHIVLILADDLGWNDIGW-NNLQIKTPVLDKLAYEGV 78

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           I NQ YVQ LCTP+R+ALMTG YP  IGMQH ++L  +P GLPL  K+LPQ LK+AGY  
Sbjct: 79  IFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMVLPFQPSGLPLHLKILPQKLKQAGYIN 138

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP------YQGLDMRH 180
           H +GKWHLG+    YTP  RGFDS YG      +  +H+ K +  P      Y+G D R 
Sbjct: 139 HIVGKWHLGYCNWEYTPLNRGFDSFYG---SFSNSVNHNNKISQLPISDHSKYKGYDFRD 195

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-V 239
           N  V  +  G   T L+T+  +++I+ H+K  PMF+Y        G+       PDE+  
Sbjct: 196 NTGV-VQNDGQPLTKLFTQRVVDIISNHHKDYPMFMYFPINQPSKGS------EPDEKYT 248

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
           A + +++D + R +   +S +D++VG+VI AL+  GM E ++++F++DNGA  F     +
Sbjct: 249 ALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMWEETLLIFISDNGAFDFM----E 304

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
           G+N PLRG  +T ++GGMR  A      +K+T  +++EL HI+DW  T+ + AGI+  + 
Sbjct: 305 GTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTGYINNELNHITDWHSTILSLAGIQ-PEP 363

Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
            LDG+N W  +++G  + R+EI++NID +D  +   AALR+ D K + G
Sbjct: 364 DLDGINIWGTVSEGKASARNEIVYNIDEIDECRG--AALRLGDWKVITG 410



 Score = 41.2 bits (95), Expect = 1.4,   Method: Compositional matrix adjust.
 Identities = 18/59 (30%), Positives = 32/59 (54%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+ +DP E++N+AE   ++   L D+L  Y+  +VP  +    K  +P  +  +  P
Sbjct: 451 LFNLKEDPEERHNVAEKYPEVFSALLDRLNEYRKNVVPAIDTSLVKEGNPENYGGVLTP 509


>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 437

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 167/410 (40%), Positives = 241/410 (58%), Gaps = 18/410 (4%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            A+T  LL +  F  TT   KP+I+I+  DDLGW DV FHGS QIP PNIDALA +G+IL
Sbjct: 4   LAVTWVLLSHLEFSFTT---KPNIVIVTVDDLGWADVGFHGSRQIPVPNIDALAADGVIL 60

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           N++Y Q     SR  L+TG YP   G+   V+L  +P  LP   ++LP + K  GY TH 
Sbjct: 61  NKYYAQPWPLSSRIGLLTGIYPYRTGVGR-VMLPCQPVALPSVFRILPTFFKSLGYRTHF 119

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +G W+LGF+++ +TP  RGFD+ Y  W G  DY+ H  +      QG D+R N  +    
Sbjct: 120 VGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGDYWTHDMQTKM---QGFDLRLNDDLMWNQ 176

Query: 189 IGIYSTDLYTEAA--INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
            G+YST L+TE A     +      +P+ L L+H A+H  N +   Q P E +  F  I 
Sbjct: 177 TGVYSTRLFTERADPTRKLCFFVLHQPLLLILSHQALHTANYHGSLQCPLEHLDSFGFIR 236

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPL 305
           D +RR +AG ++ +D+S+G +  AL    +L N+I++F++ +G    G  +N  S N PL
Sbjct: 237 DRKRRIFAGALTEVDQSLGQIFEALHNRRLLNNTILVFVSASGGMPVGDFTNNLSFNFPL 296

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS----L 361
           RG K T ++GG+R  A +WSP L +T++ SS+L H++DWLPTL +AAG +I        L
Sbjct: 297 RGTKGTCFEGGVRVPAFVWSPLLNKTRRTSSQLMHVTDWLPTLFSAAGGDIKKLGAVFDL 356

Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
           D VN W+ L+   ++ R+EI+HNID       +  AL+  + K +A T N
Sbjct: 357 DSVNMWEALSGDLQSPRTEIIHNIDLAG----HNFALQEGNYKVIAVTKN 402


>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
          Length = 300

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 150/299 (50%), Positives = 197/299 (65%), Gaps = 4/299 (1%)

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +CTPSR+AL+TGKYPI+ GMQH VI+  +PWGLPL E  + +  +E GY T  +GKWHLG
Sbjct: 1   MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
             +  +TPT RGFD H GY     DYY  S +   + Y G D R +++  +  +G Y TD
Sbjct: 61  LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQNKGYNGHDFRDSLKSTHDHVGHYVTD 120

Query: 196 LYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           L T+AA+  I +H   N S+P+FL L HLA HA N  +P QAP EEV++F  IS+   R 
Sbjct: 121 LLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRY 180

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           YA MVSRLD+SVG+VI AL +  ML+NSI+LF++DNG P+ G HS   SN+PLRG K++P
Sbjct: 181 YAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTTASNYPLRGQKNSP 240

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
           W+G +R  AAIWS   ++   V  +  +I D LPTL AAAGI  +    LDG+N W  L
Sbjct: 241 WEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTLAAAAGISPDPALHLDGLNLWSAL 299


>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 517

 Score =  298 bits (764), Expect = 4e-78,   Method: Compositional matrix adjust.
 Identities = 195/567 (34%), Positives = 287/567 (50%), Gaps = 63/567 (11%)

Query: 8   FFALTCTLLFNDAFLNTTAP-KKPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAY 63
            + L   LLF    L   A   KP+I+ ILADD G+ND+ +H     S + TP +D+LA 
Sbjct: 1   MYLLRVVLLFGLHVLCVPAELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSLAM 60

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
            G+ L  +Y+Q +C+PSRS LM+G+Y IH G+QH VI   +  GLPL   +LP+ L + G
Sbjct: 61  AGVRLENYYIQPICSPSRSVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHKCG 120

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNM 182
           Y TH +GKWHLGF+++ Y P  RGF+S++GY  G +DYY    C        G DM    
Sbjct: 121 YDTHMVGKWHLGFYKDEYLPWKRGFNSYFGYLTGGEDYYTKWRCDGKL---CGYDMTSEK 177

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
              N T G YS +L+   A   I +H+K+KP+FLY+A  +VH+     P + P+     F
Sbjct: 178 GPTNATYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHS-----PMEVPESYAKPF 232

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
             I +  R+ Y GMV+ +DE+V N+   L+  G+ +N+I++F ADNG  +     + G+N
Sbjct: 233 DYIKNHNRKMYGGMVAAMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTL----SGGNN 288

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK---VSSELFHISDWLPTLCAAAGIEINDT 359
            PLRG K T W+GG++GV  +    L        V++E+ HISDW PT+  A      + 
Sbjct: 289 WPLRGRKLTLWEGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEATQCPYVEG 348

Query: 360 S--LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
           +  LDGVNQWD +     + R EIL NID +  P  +Y      +++   G D +  +  
Sbjct: 349 TQKLDGVNQWDTIFGKTVSARKEILLNIDPLFTPL-FYKETPKQNIQ--MGFDTSVHAGI 405

Query: 418 WYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKI 477
            YG+                        L T LQ   +    P+  +         +E+ 
Sbjct: 406 RYGEW----------------------KLLTGLQGDDRWIEPPEWKNK-------AEEEQ 436

Query: 478 LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
            E  E         + G    +T    L+NI +DP EQ  +++    ++  L  KLA Y 
Sbjct: 437 QEDTE---------HPGFQKFATKSVQLYNIVNDPYEQVEVSDLYPSVVNDLLSKLAAYN 487

Query: 538 STMVPPGNKPFDKRADPARWNNIWVPW 564
           +T VP    P D  +DP+     W PW
Sbjct: 488 ATAVPVNYPPSDPTSDPSLHGGFWGPW 514


>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
          Length = 775

 Score =  296 bits (758), Expect = 2e-77,   Method: Compositional matrix adjust.
 Identities = 185/483 (38%), Positives = 266/483 (55%), Gaps = 40/483 (8%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y +  GMQ   +   E  G+PL   LLP+YL+  GY TH +GKWH+G+  + + PT RGF
Sbjct: 295 YKLLRGMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGF 354

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGI---YSTDLYTEAAINV 204
           D+  GY+ G+  Y++H+    +E  Q G D+ H +  +N T+     Y TDL T+   ++
Sbjct: 355 DTFLGYYTGMIQYFNHTL---YESGQLGYDL-HRIVGENHTVEYRYDYMTDLLTDEVESI 410

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD--EEVAKFLDISDPERRTYAGMVSRLDE 262
           I+ HN  KPM+L L+HLA HA +  E  +  D  E    F  I D  RR YA +V+ LD+
Sbjct: 411 ISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYASVVATLDK 470

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SVG V+ AL++  +L N+I++FMADNGA + G  +N GSN+PLRG+K T ++G +RGVA 
Sbjct: 471 SVGRVVDALKRKDILNNTIIIFMADNGAQTEGYLANYGSNYPLRGVKFTLFEGAVRGVAC 530

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
           I+SP + ++ +VS++LFHI+DWLPTL +AAG   +D   LDG+NQW  + K    +R  I
Sbjct: 531 IYSPLIDRSSRVSTKLFHITDWLPTLYSAAGGNPSDLDQLDGINQWSTIKKVKIDERRSI 590

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
           L NI   DN     AAL     KY    D +        D     D YS   V Y K  +
Sbjct: 591 LVNIQE-DNE----AALMG---KYKLVRDRS--------DNQKHYDYYSGNNVSYPKYNV 634

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
           T N L           +   +  +     +LT  KI  LR+ A + C      + C +  
Sbjct: 635 T-NVL----------TSPAASAIASVSAHVLTAAKIKALRKKATIVCKNLTNPSSCKN-- 681

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           + CLFNI  DPCE  +L+   T ++++L   +  YKS ++   N P D    P  +NN W
Sbjct: 682 RTCLFNIYKDPCEFTDLSSKNTKVVERLSVFIDSYKSVLMKESNTPVDPAGFPCHFNNTW 741

Query: 562 VPW 564
           +PW
Sbjct: 742 MPW 744



 Score =  237 bits (605), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 132/316 (41%), Positives = 188/316 (59%), Gaps = 28/316 (8%)

Query: 95  MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY 154
           MQ   I  GEP GLPL  K+LP++L+  GY T+ IGKWHLG+    +TP++RGFD   G+
Sbjct: 1   MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60

Query: 155 WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI---YSTDLYTEAAINVIAEHNKS 211
           +     Y+D+  K +++   G DM      D    G+   Y TDL+T+ A+ +I  H+  
Sbjct: 61  YNSHVSYHDY--KYSYQNMSGYDMHCG---DAPAYGLNDKYVTDLFTDKAVKIIENHDSF 115

Query: 212 KPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           +P++L ++HLAVHA     P + P   D    +F+ I +  RR YA MVS+LD+SVG ++
Sbjct: 116 RPLYLQISHLAVHA-----PLENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIV 170

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             L   GML +S++LF+ DNGA   G   N GSN+PLRG       GG+RGV+ +WSP L
Sbjct: 171 HTLGNRGMLRDSLILFLTDNGAAPIGKFRNFGSNYPLRG-------GGVRGVSVLWSPRL 223

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
            +  +V   L HISDWLPTL AAA  +++D   +DGV+QW +L  G    R ++L NID 
Sbjct: 224 NKPARVCDNLIHISDWLPTLYAAADGDLSDLGEIDGVDQWRMLNDGHPIVRDKLLLNIDE 283

Query: 388 VDNPQ----KYYAALR 399
           +   +    K+Y  LR
Sbjct: 284 ISKTEGAIYKHYKLLR 299


>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
          Length = 560

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 158/369 (42%), Positives = 231/369 (62%), Gaps = 24/369 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PH++ IL DD G+ DV +HGS +I TP +D LA  G+ L  +YVQ LC+PSRS LMTG
Sbjct: 36  KQPHVVFILVDDQGFRDVGYHGS-EIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTG 94

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL    LP  LK+AGYATH +GKWHLGF++    PT RG
Sbjct: 95  RYQIHTGLQHSIIRATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRG 154

Query: 148 FDSHYGYWQGLQDYYDH-SCKA----TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           FD+ +G   G  D+Y H  C+A     ++ Y+G +         +  G+YST ++T+ AI
Sbjct: 155 FDTFFGSLLGSGDHYSHYKCEAPGMCGYDLYEGEEA-----AWEQDRGLYSTVMFTQKAI 209

Query: 203 NVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +++A+H+   KP+FLYLA+ AVH+     P Q P   + ++  IS+  RR YA MVS LD
Sbjct: 210 SILAKHDPHRKPLFLYLAYQAVHS-----PLQVPSRYLERYKGISNVHRRKYAAMVSCLD 264

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           E++ N+  AL+++G  +N+++++ +DNG  P  G     GSN PLRG K++ W+GG+R V
Sbjct: 265 EAIRNLTLALKRYGYYDNTVLVYSSDNGGQPLLG-----GSNWPLRGSKASYWEGGIRAV 319

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
             + SP L+        L HI+DW PTL +   G      +LDG + W+ +++G  + R 
Sbjct: 320 GFVHSPLLRNKGTKCRSLIHITDWFPTLVSLGEGTLEEHLNLDGYDVWEAISEGRPSPRH 379

Query: 380 EILHNIDNV 388
           +ILHNID +
Sbjct: 380 DILHNIDPI 388



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 35/61 (57%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++    ++K++  +LA Y  T VP      D R++P     +W PW
Sbjct: 459 LFNITADPYERVDLSQRYPQIVKKMLIRLAQYNKTAVPVRYPAKDLRSNPQYNGGVWGPW 518

Query: 565 Y 565
           Y
Sbjct: 519 Y 519


>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
          Length = 502

 Score =  293 bits (749), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 180/486 (37%), Positives = 256/486 (52%), Gaps = 35/486 (7%)

Query: 93  IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY 152
           +GMQ   I  GEP GLPL+ ++LP++L+  GY T  IGKWHLG+    +TP  RGFDS  
Sbjct: 1   MGMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFL 60

Query: 153 GYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK 212
           G++     YYD+  K +++   G DM             Y TDL+T+ A+ +I  H+ S+
Sbjct: 61  GFYNSHVSYYDY--KYSYQNMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSR 118

Query: 213 PMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           P++L ++HLAVHA     P + P   D    +F+ I +  RR YA MVS+LD+SVG ++ 
Sbjct: 119 PLYLQISHLAVHA-----PLENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLIVN 173

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
            L   GML +S++LF+ DNGA   G   N GSN PLRGMK T ++GG+RGVA +WSP L 
Sbjct: 174 TLGSRGMLRDSVILFLTDNGAAPVGKFRNYGSNFPLRGMKYTLYEGGVRGVAVLWSPRLS 233

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           +  +V + L HI+DWLPTL AAA  ++ D   +DGVNQW +L  G    R ++L NID +
Sbjct: 234 KAARVCNNLMHITDWLPTLYAAADGDLRDLGKIDGVNQWRMLNDGYPIVRDKLLLNIDEI 293

Query: 389 DNPQ----KYYAALR--VDDLKYVAGTDNNGQSD--EWYGDTDNEIDKYSPKEVLYSKAG 440
              +    K Y  LR  ++D  Y    D + + D    Y          S K +  +   
Sbjct: 294 SKTEGAIYKQYKLLRGSIEDGHY----DGHHRHDLERGYNILPPFFGNNSRKNIQENVPP 349

Query: 441 ITFNALKTKL-QIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD-NKGAHCN 498
            T   L + + Q    H   P    S           +++LR  A V C ++ +  +   
Sbjct: 350 YTETILISPVSQSITYHLGGPVTQPS----------TMVQLRNEATVSCRFNVSYYSFVT 399

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
             V  CLF+I +DPCE  N+AE    + ++L+  L  Y   +        D  ADP R N
Sbjct: 400 CNVTECLFDIENDPCETKNIAEQYPRIARELDQILEQYARNVAMQIKPSVDWLADPRRTN 459

Query: 559 NIWVPW 564
           N W PW
Sbjct: 460 NTWEPW 465


>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 519

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 169/427 (39%), Positives = 246/427 (57%), Gaps = 38/427 (8%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            FAL   LL     +      +P+II IL+DD GWND+ +H S  I +PNI+AL  +G+ 
Sbjct: 2   IFALLEILLC--FHITVIVGSQPNIIFILSDDHGWNDIGYH-SHIIRSPNINALCNDGVR 58

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +Y+Q  CTPSRS LM+G+Y IH G+QH VI   +P  LPL E  L + LKE GYATH
Sbjct: 59  LENYYIQPGCTPSRSQLMSGRYQIHTGLQHSVIRNDQPNCLPLDEVTLAEKLKEVGYATH 118

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDN 186
            +GKWHLGF+     PT RGFDS +GY  G +DYY H          G D+ RH   V  
Sbjct: 119 LVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQEDYYKHIHDG------GYDLKRHETDVSK 172

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-I 245
           +  G Y+T ++T  A N+I  H+ S P+FLY+++ +VHA       Q P+     +   I
Sbjct: 173 QYQGDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHANY----LQVPEHYSNMYNGVI 228

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
            D +RR  AGMV+ +DE +GN+   L++  +  N+I++F +DNG    G  ++ G+N PL
Sbjct: 229 DDEDRRIVAGMVTCMDEGIGNITQTLKETELWNNTIIIFSSDNG----GDPNDGGNNWPL 284

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTS-LD 362
           RG K T W+G + G+  ++S  + +  +  V++E+ H+SDW PT+   AG  +N T  LD
Sbjct: 285 RGEKGTHWEGAIHGLGFVYSSDIAKDVRGTVNTEMIHVSDWFPTIVNLAGGSLNGTKPLD 344

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYY------------AALRVDDLKYV 406
           G +QW  +++GA++ R+EIL NID +  P    + ++             A+RV D K +
Sbjct: 345 GFDQWQTISQGAESPRTEILINIDPLVPPPSDMEHFFHGWNGTYNTSMCGAIRVGDWKLI 404

Query: 407 AGTDNNG 413
            G    G
Sbjct: 405 TGYPGEG 411



 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 34/63 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI +DP E  +L+E   D + +L  K+  Y  T VP    P D  ADP+  + IW PW
Sbjct: 436 LFNIAEDPTETTDLSEDNLDKVIELLIKMDEYLLTSVPVVYPPNDPLADPSLNDGIWRPW 495

Query: 565 YDE 567
             E
Sbjct: 496 SPE 498


>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 562

 Score =  292 bits (748), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 165/410 (40%), Positives = 234/410 (57%), Gaps = 35/410 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHI+ ++ DD G+ND+ +HGS +I TP +D LA  G+ L  +YVQ +C+PSRS LMTG+Y
Sbjct: 30  PHIVFVMVDDQGYNDIGYHGS-EIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRY 88

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH +I   +P  LP     LP+ L+EAGY+TH +GKWHLGF      PT RGF 
Sbjct: 89  QIHTGLQHSIIRARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQ 148

Query: 150 SHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           S  G   G  D++   SC  T     G D+    +   +  G YST LYTE   +++  H
Sbjct: 149 SFLGSLTGSGDHFSFQSCDGT--EACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRH 206

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           +  KP+FLY+A  AVH      P QAP   + ++  + +  RR YA MVS +DESVG ++
Sbjct: 207 DHRKPLFLYVALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIV 261

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
           + LR+ G  +NS++++ +DNG        + G N PLRG K + W+GG+R V  + SP L
Sbjct: 262 SELRERGYYDNSVLIYSSDNGGQPL----SGGCNWPLRGGKGSYWEGGVRAVGFVHSPLL 317

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDN 387
           K+   VS  L HISDW PTL + AG   +D++ +DG + W+ ++ G    R+EIL NID 
Sbjct: 318 KRKGVVSHALIHISDWYPTLLSLAGYRESDSNHVDGQDVWETISSGLPCPRTEILFNIDP 377

Query: 388 VD------NPQ------------KYYAALRVDDLKYVAGTDNNGQSDEWY 419
           V       NP+               AA+R  D K + G   +G   +W+
Sbjct: 378 VSRRHGDINPKLLTLNGFGIWDTGVRAAIRAGDWKLLTGNIGDG---DWF 424



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/63 (42%), Positives = 38/63 (60%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +LAE++ +++K L  +L+ Y  T V P N P D  ADP     +W PW
Sbjct: 452 LFNITADPYERADLAEARPEVVKVLLTRLSEYNRTAVSPRNPPDDPMADPQLHGGVWTPW 511

Query: 565 YDE 567
             +
Sbjct: 512 LGQ 514


>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
           purpuratus]
          Length = 784

 Score =  292 bits (747), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 152/377 (40%), Positives = 226/377 (59%), Gaps = 18/377 (4%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F +   ++ N      +   KP+I+ IL DDLG+NDV +HGS +I TPNID LA  G+ 
Sbjct: 5   LFFINLLIISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVR 63

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YVQ +CTP+RS L++G+Y IH G+QH  I   +P  LP         L+EAGYATH
Sbjct: 64  LENYYVQPICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATH 123

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
           A+GKWHLGF+++   PT RGFDS++GY  G +DY+ H  K        L +RH  +V  +
Sbjct: 124 AVGKWHLGFYKKECLPTQRGFDSYFGYLTGGEDYWTHHRKRPX-----LALRHVDKVAWE 178

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G YS  ++TE    ++A+H   +P  LYL   +VH+     P Q P     ++ +I +
Sbjct: 179 YGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHS-----PLQVPSSYEERYKNIKN 233

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR YAGM++ LDE+VG ++ +L++ G+ +N++++F  DNG    G  +  G+N PLRG
Sbjct: 234 TNRRIYAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNG----GEVAAGGNNWPLRG 289

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGV 364
            K + W+GGMRGV  + SP L  + +  V+ +L H+SDW PTL    AG  +++ +LDG 
Sbjct: 290 WKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAGSSLDNITLDGF 349

Query: 365 NQWDVLTKGAKTKRSEI 381
           N W  + +  +  R+ I
Sbjct: 350 NMWKTIREELQKCRTSI 366


>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
 gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
          Length = 449

 Score =  291 bits (744), Expect = 8e-76,   Method: Compositional matrix adjust.
 Identities = 161/389 (41%), Positives = 219/389 (56%), Gaps = 34/389 (8%)

Query: 61  LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
           LA  G+ L  +Y+Q +C+PSR  LMTG+Y I  G+QH VI    P GLPL E  LPQ L+
Sbjct: 2   LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF----EPYQGL 176
           E GY ++ +GKWHLGFFR+ Y P  RGFD  YGY  G +DY+ H     +      + GL
Sbjct: 62  ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121

Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           D+R   +      G YST L+ + AI  I  H +SKPMFLYL   AVH+     P +AP 
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHS-----PLEAPT 176

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           + V K+ D++ P  + YA MV+ +DE+VGNV  AL++ G+ +N++++F  DNGA      
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA-----R 231

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            N GSN PLRG K+T W+GG+RGV  + S  L++  + S  L HISDW PTL   +    
Sbjct: 232 ENAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRISRSVS 291

Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---------------KYYAALRV 400
           N T  LDG + WD ++ G  + R EILHNID +                     YAA+R 
Sbjct: 292 NGTKPLDGFDIWDTISTGTPSPRKEILHNIDPLHRKTASSSFNLLYDDIFNISVYAAIRS 351

Query: 401 DDLKYVAGTDNNGQSDEWYGDTDNEIDKY 429
            D K + G   +G     + D+D    K+
Sbjct: 352 GDWKLLTGDQGHGG----WKDSDTSGSKF 376



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/79 (35%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 491 DNKGAHCNSTVKP-----CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN 545
           D  G+     V P      LFNI +DP E+++L++   D++++L  KLA Y  T +PP  
Sbjct: 370 DTSGSKFEYAVDPPTKHLWLFNIRNDPQERSDLSDRYPDVVQELLQKLAAYNKTALPPFW 429

Query: 546 KPFDKRADPARWNNIWVPW 564
              D RA+PA   +++ PW
Sbjct: 430 PSRDPRANPALHGDLFGPW 448


>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 451

 Score =  291 bits (744), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 183/484 (37%), Positives = 246/484 (50%), Gaps = 41/484 (8%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           I  GMQ  VI +  PWGLPL  K++PQ+ K+ GY  + IGKWHLGFF+  YTP  RGFDS
Sbjct: 1   ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60

Query: 151 HYGYWQGLQDYYDHSCKATFEP--YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
            +GY+ G  DYY+H+      P  Y     +    V  K    YST  +       +  +
Sbjct: 61  FFGYYTGENDYYNHTAGTLNRPPLYGTFKTKQTRCVLAK----YSTHTWGLLEAFPLLLN 116

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
             S+P F Y ++ AVHA     PFQAP+  V KF  I +  R  YAGM+  LDES+G V+
Sbjct: 117 LHSQPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSLDESIGRVV 176

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             L   GMLE++I+ F +DNG   +   SN+G N PLRG K T W+GG+R  A +WS   
Sbjct: 177 EILGNAGMLEDTIIAFSSDNGGLPYNAFSNRGFNWPLRGAKGTLWEGGVRVPAFVWSSKF 236

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
            +  +VS +L HISDWLPTL +AAG ++ N  SLDGVN W  LT G  + R E+L  ID 
Sbjct: 237 LKKPRVSDQLMHISDWLPTLYSAAGGDVSNLGSLDGVNMWRYLTLGLGSARKEVLLTID- 295

Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG-----DTDNEIDKYSPKEVLYSKAGIT 442
              P    A LR  + K V G   + + D+ Y         N++ +   K+   ++   T
Sbjct: 296 ---PMYNLAGLRYKNYKLVVGEGFDEELDDRYAFPGGLRPTNDVPQLR-KDSKVARVLKT 351

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVK 502
           F   + +       + +P   S D     LT+  +                     +   
Sbjct: 352 FYGKQNRNWSPLDWSKNP---SVDCRERWLTNNFV---------------------ARRP 387

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
           P LFN+  DPCE NNLA S+  +L  +  KLA Y +T VPP   P D R  P     +W 
Sbjct: 388 PYLFNLARDPCELNNLAGSKRIILHFMMHKLAAYNATAVPPRFLPEDPRGYPEYHGGVWS 447

Query: 563 PWYD 566
           PW D
Sbjct: 448 PWLD 451


>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 433

 Score =  289 bits (739), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 180/480 (37%), Positives = 252/480 (52%), Gaps = 57/480 (11%)

Query: 92  HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL--GFFREVYT-PTFRGF 148
           + GMQH V++E EP GLPL  KLLP++L + GY  +   K  L     R VY  P  R  
Sbjct: 1   YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTLLHKELRFVYAFPYLR-- 58

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
                                    +G D R  +    +  G Y T+L TE AI +I  H
Sbjct: 59  -------------------------RGFDFRRGLLGGREFDGRYYTELMTEEAIRIIETH 93

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
              KP+FLYL+HLA H+GN  +P Q P +    + DI    R  YAGMVS LDESVG V+
Sbjct: 94  PLEKPLFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVV 153

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            AL + GML +++++F +DNGA S G + N  S  P +G K TPW+GG+R  A IWSP L
Sbjct: 154 EALGRRGMLGDTVLVFSSDNGANSRGENPNYASAWPFKGQKITPWEGGVRAPAIIWSPSL 213

Query: 329 KQTQ-KVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
             TQ +  S +FHISDWLPTL   AG + +D   +DG++Q + L  G +  R E+L NID
Sbjct: 214 SGTQGRDYSNIFHISDWLPTLYQLAGGDPSDLGDIDGISQLEALRHGTEAPRKEVLINID 273

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNAL 446
            +DN      A+     K V+GT   G  D+W+    N                + ++  
Sbjct: 274 PIDN----VTAIIEGHFKLVSGTVKGGALDDWFQIPGN----------------VAWDTD 313

Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
           +T+ + +   AA    NS   + T  + +    +     V+C   +    C  T+ PCLF
Sbjct: 314 RTRHECETSVAAQVLRNSGHDV-TCGSGQGSYAI----PVKCGRRDTSKPCTPTLAPCLF 368

Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYD 566
           ++++DPCE NN+AE  ++++++L  KLA+YK+T   PGN P D RA PA  N  WV W D
Sbjct: 369 DLSEDPCEYNNVAEQHSEVVRRLLRKLAVYKATSWSPGNVPEDPRAHPALHNGAWVSWGD 428


>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 539

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 157/377 (41%), Positives = 218/377 (57%), Gaps = 14/377 (3%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           LT  +L         A ++PHI+ ILADDLG+ DV +   S + TPNID LA  G+ L +
Sbjct: 42  LTRLILLTSYLTVIQATQRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGVKLER 101

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVI-LEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
           HY Q  C PSRS LM G+Y IH G  +      G    +      +P  LKE GYATH +
Sbjct: 102 HYAQPSCMPSRSCLMMGRYQIHTGFNYKCTDGSGSQLCMHPDTITIPMKLKENGYATHMV 161

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKT 188
           GKWHLG  R    P  +GFD+ +GY    +DYY H   A  E     D+ R+   V  + 
Sbjct: 162 GKWHLGNIRWECLPNAKGFDTFFGYHGASEDYYTHFSPAGRE---CRDLWRNRDDVAQEY 218

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
            G YST ++T  A+N+I  H+ SKPMFLYL + AVH      P Q P++    +    D 
Sbjct: 219 YGQYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHG-----PLQVPEKYYNMYERPLDD 273

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            RR YA M + +DE +GNV  AL++ GM +N++ +F++DNGA         G N PLRG 
Sbjct: 274 IRRKYAAMATCMDEGIGNVTNALKQRGMWDNTVFIFVSDNGAQ----FPTGGRNWPLRGG 329

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
           K + ++GG+R V+ + SP +++ Q+ S+E+ H++DWLPTL   AG  ++D  LDGVN WD
Sbjct: 330 KGSVFEGGIRVVSFVTSPLIERPQRSSNEMIHLTDWLPTLTHLAGGTVDDVELDGVNMWD 389

Query: 369 VLTKGAKTKRSEILHNI 385
            ++KG+ T R E+L  I
Sbjct: 390 TISKGSATPRKELLVQI 406


>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 514

 Score =  285 bits (729), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 160/384 (41%), Positives = 231/384 (60%), Gaps = 18/384 (4%)

Query: 32  IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
           III+     GWNDV +H +  +PTPN++ LA  G+IL+  Y Q +CTPSR ALMTGKYP 
Sbjct: 41  IIILTISASGWNDVGWH-NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPA 99

Query: 92  HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
            +GMQH V+L   P+ LP     L + LKE GY  H +GKWHLG     YTP +RGFDSH
Sbjct: 100 KVGMQHFVVLPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSH 159

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           YG  +G+   ++ +   T+ P  G+   D+R N  +     G ++T L++E A  ++  H
Sbjct: 160 YGCHEGVTSNFE-THMLTWPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVKNH 218

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           N   P+FLY+ ++A H      P QAP +   + + + D +RRT+AGM++ +D+ VGN+ 
Sbjct: 219 NPESPLFLYVPYMAPHF-----PLQAP-QGFEEAVQLDDTDRRTFAGMMAAMDQGVGNIT 272

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            AL+  GM ++++ +F++DNG    G     GSN+P RG K+T W+GG++  A I    L
Sbjct: 273 QALKDKGMWDDTLFVFLSDNG----GDKMFAGSNYPYRGNKATMWEGGVKVPAFIRGSML 328

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           K +  V++EL+H +D   TL   AG    D  +DG NQWD   KG  +KRSEIL ++D  
Sbjct: 329 KTSGTVNNELYHFTDIFATLLKVAG-GTPDEDIDGKNQWDSFCKGKPSKRSEILLHLD-- 385

Query: 389 DNPQKYYAALRVDDLKYVAGTDNN 412
           ++P    AALR+ D KY+ G   N
Sbjct: 386 EHPVTNGAALRMGDYKYMEGVTIN 409


>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
          Length = 506

 Score =  285 bits (728), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 179/546 (32%), Positives = 277/546 (50%), Gaps = 65/546 (11%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L+    KKP+++I++ADDLGWND+S H +S + TPN+D L    L L  +YV  +CTP+R
Sbjct: 11  LSVAIAKKPNVLILIADDLGWNDISLH-NSYLSTPNVDGLIQESLHLQSYYVNPICTPTR 69

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S LM+G+Y IH G+QH VIL  +P GLPLT+ + P+  K+ GY TH +GKWHLGF+ E  
Sbjct: 70  SVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKM 129

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            P  RGF+SHYGY  G + +Y+HS     +   G+D R      N + G YS DL+ +  
Sbjct: 130 VPENRGFESHYGYLIGAEGHYNHS--QFMQGQNGVDFRDGGASTNSSWGQYSADLFAKRV 187

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            +++  H+  + +++Y+    VH      P +AP   V +F  I D +RR YA M   +D
Sbjct: 188 EDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVDQFSWIKDRDRRVYAAMTKSMD 242

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ++VG +I + +  G+ E++IV F  DNG  +       G+N PLRG+K+T W+GG+RG+ 
Sbjct: 243 DTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGGNNWPLRGLKNTLWEGGIRGIG 298

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSE 380
           +I  P +   ++   +L H+ D  PTL              DG +Q  +L    +++   
Sbjct: 299 SIKIPGVSADKR--DQLMHVVDMFPTLLDLTSCPNTRKVKFDGKSQAKMLVDNDESQSDS 356

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
            L NI               D L+   G DN      W+ D D  +           +AG
Sbjct: 357 FLINI---------------DPLRKNKGADNR----TWHSDFDVRV-----------QAG 386

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRT--ILTDEKILELREFARVRCNYDNKGAHCN 498
           + +   K              A SSD   T  I   E   + R+ +  + N++      +
Sbjct: 387 LRWQQWKLLT----------GAPSSDNYPTGHIYPPEWPQDGRKNS--QSNFE------S 428

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
           S +   LF+I +DP E   ++   T++++++ + L+ Y ST VP      D    P +  
Sbjct: 429 SNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYNSTAVPCRWPNPDPAGLPDQNG 488

Query: 559 NIWVPW 564
             W PW
Sbjct: 489 GFWKPW 494


>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
 gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
          Length = 538

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 176/471 (37%), Positives = 272/471 (57%), Gaps = 28/471 (5%)

Query: 8   FFALTCTLLFNDAFLN---TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           F ++    ++++ F++     A +KP+I+II+ADDLGWNDV +H +S+I TPN+D LA +
Sbjct: 18  FISVAIAAIYSNFFVSDGVRAAAQKPNIVIIMADDLGWNDVGYH-NSEIKTPNLDKLAES 76

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
              L++ YV + CTP+R+ALMTG++P   GM  GVI   +  GLPL EK + Q LKEAGY
Sbjct: 77  STRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVIWPWDKVGLPLEEKTIAQTLKEAGY 136

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ- 183
            T  +GKWHLG ++E Y PT RGFD HYG++ G  DY+ H   A  +   GLD   N Q 
Sbjct: 137 YTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSIDYFTHQLDAGIQ--GGLDWHRNEQP 194

Query: 184 VDNKTIGIYSTDLYTEAAINVI--AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
           V+ +    Y+TDL  + A+ +I   ++NKS P+FLY++  A HA     P QA ++++  
Sbjct: 195 VEEEG---YATDLLAQEAVKLIRDCDYNKS-PLFLYVSFNAPHA-----PLQAKEKDIKN 245

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           + +I D  RR +A  V  +DE+VG ++AAL++  + +N+++ F +DNG  S      +G 
Sbjct: 246 YANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQVWDNTLLFFTSDNGGGS-DQPWTRGD 304

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTS 360
           N PLRG K T +DGG+R    I  P   +  +V  ++F + D  PTL   AG+ E +   
Sbjct: 305 NRPLRGQKGTLYDGGVRVPTIISFPAQLKGGQVIEQVFSVVDLYPTLAKLAGVKESSQNQ 364

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           LDGV+  + +  G+ T+R E+L     + +P  + AAL  ++ K V    N+     +YG
Sbjct: 365 LDGVDILESIATGS-TRRDEVLIQYQALASPPWFKAALVKENYKLVL---NSTSPSPYYG 420

Query: 421 DTDNEIDKYSPKE---VLYSKAGITFNALKTKLQIKQKHAADPKANSSDAL 468
             +    K  P E   + YS+       L+ +++  +K A  P  N+S  +
Sbjct: 421 IAELFNLKNDPLETRNLAYSEPEKVLE-LQKRVEAHKKEAKSPILNNSTEM 470


>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 502

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 168/388 (43%), Positives = 225/388 (57%), Gaps = 31/388 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+I+ ILADDLGW DV FHGS  I TPNID LA  G  L Q+YVQ +CTP+R+A MTG+
Sbjct: 59  KPNIVYILADDLGWKDVGFHGSD-IKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGR 117

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   GMQ  VI +G  +GL L + LLP+ LKEAGYAT A GKWHLG  +  + P  RGF
Sbjct: 118 YPFRYGMQTAVIPQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGF 177

Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           DS YG   G  D++ H S     + Y+  D       DN         L+ + A+ VI E
Sbjct: 178 DSFYGALLGEIDHFTHKSANGNADWYRNNDALEEEGFDNV--------LFADEAVRVINE 229

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+++KP+FLYLA  + H      PFQAP E + +   I+D  RR YA M+S +D+ +G V
Sbjct: 230 HDQAKPLFLYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRV 284

Query: 268 IAALRKHGMLENSIVLFMADNGA---PSFGIHSN-KGS----NHPLRGMKSTPWDGGMRG 319
           + AL   GM EN++++F +DNG      F   SN KG     N P R  K T ++GG R 
Sbjct: 285 VEALEMRGMRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRV 344

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
           V  +  P  K     ++ + H+ D  PTL A AG E+  +  LDG++ W VL +G  + R
Sbjct: 345 VGLVNWPG-KIEPGTANGMMHVVDMYPTLAALAGAELGKNKPLDGMDMWPVLGEGKPSPR 403

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYV 406
           +EI++NI+ +       AA+R  D K V
Sbjct: 404 TEIVYNIEPMT------AAVRQGDWKLV 425


>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 153/364 (42%), Positives = 219/364 (60%), Gaps = 16/364 (4%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           TA  +PHII ILADDLG+ DV +   S I TPNID LA  G+ L +HY Q  C PSR+ L
Sbjct: 7   TATHRPHIIFILADDLGYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACL 66

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVYT 142
           M G+Y IH G +   + +   +   +   +  LP  +K+ GY TH IGKWHLG       
Sbjct: 67  MMGRYQIHTGYRDECMNDTR-YQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCL 125

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAA 201
           P  +GFD+++GY    +DYY H            D+ R  M V +K IG YST ++TE A
Sbjct: 126 PNAKGFDTYFGYNAAAEDYYTHMLSGR---QNCSDLWRDRMDVADKYIGQYSTRIFTEEA 182

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +N+I  H+ S+PMF+YLAH AVHA     P Q P E  + +    +  RR YA MV+ +D
Sbjct: 183 VNIIENHDISQPMFMYLAHQAVHA-----PLQVPQEYYSLYETTLNDVRRKYAAMVTCMD 237

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E +GNV  AL++ GM +N++++F++DNG    G  S  G N PLRG K + ++GG+R V+
Sbjct: 238 EGIGNVTEALKESGMWDNTVLVFVSDNG----GELSAGGRNWPLRGGKGSVFEGGIRTVS 293

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
            + S  +++ Q++S+ + HI+DW PTL   AG  I+D  +DG+N WD ++ G+   R+E+
Sbjct: 294 FVTSSLIERPQRISNAMIHITDWFPTLTHLAGGTIDDEMVDGINMWDTISTGSVEPRTEL 353

Query: 382 LHNI 385
           L  I
Sbjct: 354 LIKI 357


>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 178/542 (32%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHII+I  DDLGWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A+MTG 
Sbjct: 26  QPHIILIFVDDLGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGL 84

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G QH ++    P GLPL  KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85  YPFKTGNQHQMLFNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGF 144

Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           DSHYG W  G+ DY          P +G D R N  V  K+ G Y   +  + A +++  
Sbjct: 145 DSHYGIWTLGVGDY--DKMNGVLSPSKGYDFRDNTGVVQKSNG-YLALMLGDRAEHIVNT 201

Query: 208 HNKSKPMFLYLA------HLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           H    P+FL         HLA+           P+E   K+ DI D   R + G ++ +D
Sbjct: 202 HYPGTPLFLAFTLDIPAKHLAI-----------PEEYENKYADIEDSRTRHFYGKLTMMD 250

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           + +G V+ AL+  G+ ++S+++F+ DNGA    + S  GSN+P RG+  T ++G  R  +
Sbjct: 251 DIIGGVVDALKNRGIYDDSLIIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPS 306

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEI 381
            +    +K+T   S+EL+ I D   T+   AG +  +  LDG++ W   +KG  + R+EI
Sbjct: 307 IVSGKGIKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSKGKPSPRNEI 365

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
           ++NID  D+P    AA+R+ D K + G  +        G TDN  +   P          
Sbjct: 366 IYNID--DDPVSPGAAIRIGDYKLITGHPDLLYPYRDIGITDNYYNYGDP---------- 413

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
               L + L    + A  P     + L  ++                             
Sbjct: 414 ----LASGLPTADQAAMAPPPLDVEYLFNVIV---------------------------- 441

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
                    DP E+N+++    D+++ L D+L  ++  ++PP N+  +     A +N++W
Sbjct: 442 ---------DPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGSAANFNDVW 492

Query: 562 VP 563
            P
Sbjct: 493 SP 494


>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
 gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
          Length = 357

 Score =  281 bits (719), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 153/360 (42%), Positives = 216/360 (60%), Gaps = 18/360 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI++I+ADDLGW+DVS++ +  +  PN+  LA  G+I NQ Y Q LCTPSR+AL+TGK
Sbjct: 7   KPHILLIVADDLGWSDVSWN-NPNVVMPNLHTLATTGVIFNQTYSQRLCTPSRTALLTGK 65

Query: 89  YPIHIGMQ-HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +P  +GMQ    + E    GLPL E+LLPQ LK+ GYATH +GKWHLG  +  YTPT RG
Sbjct: 66  FPYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERG 125

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS YG+  G +DYY H  +      +GLD        +   G+YST+ +   A N+I++
Sbjct: 126 FDSFYGFHHGGEDYYTHMSE------RGLDFWDGRTSVSDRNGVYSTESFARRAENIISQ 179

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+ + P+FLYL   +VH      P Q P   +  F  I D  R++  GM + LD+++  V
Sbjct: 180 HDPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKSILGMATALDDAIKRV 234

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
             AL+  G+ +N++ +FM+DNG    G +    SN PLRG K T W+GG R  A +    
Sbjct: 235 TDALKGKGLWDNTLTIFMSDNG----GNYVQGQSNWPLRGAKGTLWEGGTRVPAFVHGNM 290

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           L++T      + H  D LPTL + AG    D  LDG N W  ++ GA + R+E +++ID+
Sbjct: 291 LERTGYTYHGMMHGVDILPTLVSVAG-GTEDAGLDGKNMWQSISTGADSPRTEFVYDIDS 349


>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
          Length = 621

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 151/336 (44%), Positives = 204/336 (60%), Gaps = 18/336 (5%)

Query: 58  IDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
           +D LA  G+ L  +YVQ +CTPSRS  +TGKY IH G+QH +I   +P  LPL    LPQ
Sbjct: 80  LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
            LKE GY+TH +GKWHLGF+R+   PT RGFD+ +G   G  DYY H  K       G D
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYD 198

Query: 178 MRHNMQVDNKTI----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
           +  N   DN       G+YST +YT+    ++A H+  KP+FLY+A+ AVH+     P Q
Sbjct: 199 LYEN---DNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQ 250

Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           AP      +  I +  RR YA M+S LDE++ NV  AL+ +G   NSI+++ +DNG    
Sbjct: 251 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNG---- 306

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           G  +  GSN PLRG K T W+GG+R V  + SP LK    V  EL HI+DW PTL + A 
Sbjct: 307 GQPTAGGSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAE 366

Query: 354 IEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
            +I+ DT LDG + W+ +++G ++ R +ILHNID +
Sbjct: 367 GQIDEDTQLDGYDVWETISEGLRSPRVDILHNIDPI 402


>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 497

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 185/546 (33%), Positives = 263/546 (48%), Gaps = 93/546 (17%)

Query: 29  KPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           +P+++ ILADD G+NDV +HG    S + TPN+D LA  G+ L ++YVQ +C+P+RS L+
Sbjct: 34  RPNVVFILADDFGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLL 93

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +G+Y IH G+Q+GVI   +P  LPL E  LPQ LKE  YATH +GKWH+GF+++  TPT 
Sbjct: 94  SGRYQIHTGLQYGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTE 153

Query: 146 RGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           RGFDS++GY  G +DYY H  S +   +  +GLD+  N     +  G YST L+T  AI+
Sbjct: 154 RGFDSYFGYLSGAEDYYSHSRSFQIGSKTLKGLDLMANKTPAFQYKGQYSTHLFTSKAID 213

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           VI  H +SKP+FLYLA+ AVH+     P Q P +    + +I+   RR YAGMVS +DE 
Sbjct: 214 VINNHERSKPLFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEG 268

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +GNV  AL   G+  N+I++F             + G +  L G   T            
Sbjct: 269 IGNVTRALVDAGLYNNTIIIF-------------STGMSSSLVGKNMTD----------- 304

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
                ++T      L      +  +  AA     DT   G++          + R E+LH
Sbjct: 305 -----RETNSCGKLLIVGLACVDIVNVAAVYPTEDTDQKGMD---------PSPRQELLH 350

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           N+D +D                    D  G + +  G    E      K     KA +  
Sbjct: 351 NLDPLD-----------------VAADFEGDA-KCSGKKTTECRMIPGKFNTSMKAALRV 392

Query: 444 NALKTKLQIKQKHAADPKAN-----SSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
              K +     K++ D + N         L TI   +K  +                   
Sbjct: 393 GDWKIQTGSGAKYSGDVQVNLWKPPPESGLHTIYPVQKPKQ------------------- 433

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
              K  LFNIT DP E+N+L+++  D++ QL  +L  Y    VPP   P D  ADPA   
Sbjct: 434 ---KVWLFNITADPLEENDLSDTYPDVVAQLLHRLQYYYKDSVPPRYPPPDVNADPALHG 490

Query: 559 NIWVPW 564
             W PW
Sbjct: 491 GAWGPW 496


>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
 gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
          Length = 353

 Score =  279 bits (714), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 156/358 (43%), Positives = 214/358 (59%), Gaps = 18/358 (5%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHI++I+ADDLGW+DVS++ +  +  PN+  LA  G+I NQ Y Q LCTPSR+AL+TGK+
Sbjct: 1   PHILLIVADDLGWSDVSWN-NPNVVMPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKF 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P  +GMQ   I   +  GLPL E+LLPQ LK+ GYATH IGKWHLG  +  YTPT RGFD
Sbjct: 60  PYRLGMQR-PIRHKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFD 118

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S YGY +G QDYY H          GLD        +   G+YST+ +   A N+I++H+
Sbjct: 119 SFYGYHRGSQDYYTHMSDG------GLDFWEGKTAISDQNGVYSTESFATRAENIISQHD 172

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
            + P+FLYL    VH      P Q P   +  F  I D  R++  GMV+ LD+++  V  
Sbjct: 173 PNTPLFLYLPLQPVHT-----PHQVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTD 227

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+  G+  N++ +FM+DNG    G + +  SN PLRG K T W+GG R  A      L+
Sbjct: 228 ALQGKGLWNNTLTIFMSDNG----GDYLDGQSNWPLRGAKGTVWEGGTRVPAFAHGNMLE 283

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           +T      + H  D LPTL + AG    D  LDG N W  ++ GA++ R+E +++ID+
Sbjct: 284 RTGYTYHGMMHGVDILPTLVSVAG-GTEDAGLDGKNMWQSISTGAESPRTEFVYDIDS 340


>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 190/548 (34%), Positives = 270/548 (49%), Gaps = 74/548 (13%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I+ +LAD+LGW+D+ +HGS ++ TP++D LA  G+ L  +YVQ +CTP+RS LM+G+
Sbjct: 22  QPNIVFVLADELGWHDIGYHGS-RVQTPHLDKLASEGIKLENYYVQPMCTPTRSQLMSGR 80

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH VI   +   LPL E  + Q LKEAGY+TH +GKWHLG + +   PT RGF
Sbjct: 81  YQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGCLPTKRGF 140

Query: 149 DSHYGYWQGLQDYYDHS----CK---ATFEPYQGLDMRHNMQ--VDNKTIGIYSTDLYTE 199
           DS +G++    DYY +     CK    T    +G D+  N +  V     G Y T +  +
Sbjct: 141 DSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHYQTHVLAK 200

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            A +VI +HN SKP+FLYLA  AVH      P + P      + D+ D  RR    M + 
Sbjct: 201 EAEDVIRKHNPSKPLFLYLAFGAVHV-----PLEVPKVYEDMYADVKDNSRRILLAMATC 255

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +D+S+ +V  AL+  G+ +N+I +F  DNGA +    +  G + PL+G K++ ++GG+R 
Sbjct: 256 MDDSIADVTGALKDTGLWDNTIFIFSTDNGAAT----AYGGCSWPLKGGKTSLFEGGIRA 311

Query: 320 VAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKT 376
           V  + SP L    + +   EL H+SDW PTL   AG E++ T  LDG NQW+ ++ G  +
Sbjct: 312 VGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGELSGTKPLDGYNQWETISLGRPS 371

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
           +R+EIL NID +   +      + D  K+        +  EW   T N    Y      Y
Sbjct: 372 QRTEILINIDPLIPCEACANHTKWDHKKFDVHVHAAIRMGEWKLITGNPHHNYWVTPAEY 431

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
                 +     K+      AADP  N  D L                            
Sbjct: 432 PNMKNIYLNEPGKMVWLYNIAADP--NEQDDL---------------------------- 461

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
             S   P + NI                    L D+LA Y  T VP      D  ADPAR
Sbjct: 462 --SDTFPGIVNI--------------------LLDRLAEYNETAVPVQFPDCDTGADPAR 499

Query: 557 WNNIWVPW 564
               W PW
Sbjct: 500 RGGSWGPW 507


>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 505

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 155/362 (42%), Positives = 218/362 (60%), Gaps = 17/362 (4%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+II ILADDLG+ DV +   S + TPNID LA  G+ L ++Y Q+ C PSRS LM G
Sbjct: 24  EQPNIIFILADDLGYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMG 83

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +Y IH G  +   L+G+   L +      LP  L++ GYATH IGKWHLG  R+   PT 
Sbjct: 84  RYQIHTGFDYRC-LDGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTH 142

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINV 204
           +GFD+ +GY    ++YY H+  A   P +  D+  NM+ V     G YST  YT+ A ++
Sbjct: 143 KGFDTFFGYHGAAENYYTHT--ALGRP-RCHDLWRNMENVAEDYDGQYSTLFYTKEAQDI 199

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE-EVAKFLDISDPERRTYAGMVSRLDES 263
           I  H+K KP F+YL++ AVH      P + P    +  +    D  R+ YA M + +DE 
Sbjct: 200 IKNHDKKKPFFMYLSYQAVHG-----PLEVPSNYRMPYYTSGVDESRQIYAAMATCMDEG 254

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +GN+   L+  GML N+I++F +DNGA     H+  G N PLRG K T W+GG+R +A +
Sbjct: 255 IGNITETLKASGMLNNTILVFSSDNGAQ---FHTG-GRNWPLRGGKGTMWEGGVRVLAFV 310

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            SP LK+++ V+ E+ H SDW PT    AG  ++D  LDGV+QWD ++ GA TKR E+L 
Sbjct: 311 NSPLLKKSRTVNHEMIHTSDWFPTFVRLAGGTVDDVELDGVDQWDTISTGAPTKRQELLI 370

Query: 384 NI 385
            I
Sbjct: 371 QI 372


>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 571

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 160/406 (39%), Positives = 215/406 (52%), Gaps = 31/406 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P+ PH I I+ DD G+ D+ +HGS  I TP +D LA  G+ L  +YVQ +C+PSRS LMT
Sbjct: 43  PRPPHFIFIMVDDQGYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMT 101

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y IH G+QH +I   +P  LP     LP+ L EAGYATH +GKWHLGF R    PT R
Sbjct: 102 GRYQIHTGLQHSIIRPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGR 161

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           GF S  G   G  D++ + SC        G D+    +   +  G YST LY +    ++
Sbjct: 162 GFQSFLGTLTGSGDHFSYQSCDGA--EACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQIL 219

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
             H+   P+FLYL+  A H      P Q PD  +  +    +  RR YA M+S LD  V 
Sbjct: 220 KRHDPHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVA 274

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            V+  L+  G+ ENS++++ +DNG        + GSN PLRG K T W+GG+R V  + S
Sbjct: 275 QVVQELKTQGLYENSVLIYSSDNGGQPL----SGGSNWPLRGGKGTYWEGGIRAVGFVHS 330

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
           P LK+   VS  L H+SDW PTL   AG + +   LDG + W  +++G    R+EIL NI
Sbjct: 331 PLLKKKGIVSRALIHVSDWYPTLLGLAGAQQSHHGLDGHDVWGSISEGLPCPRTEILFNI 390

Query: 386 DNVDNPQ------------------KYYAALRVDDLKYVAGTDNNG 413
           D V                         AALR  D K + G   +G
Sbjct: 391 DPVSRKAGEPYDKALILSGFGIWDTAVRAALRAGDWKLLTGNVGDG 436



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/60 (41%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFN++ DP E+ +LAE++ +++K L  +LA Y  T V   N P D  ADP      W PW
Sbjct: 467 LFNVSADPYERYDLAETRPEVVKHLMTRLAEYNQTAVMARNPPDDPMADPELHGGAWSPW 526


>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 473

 Score =  278 bits (712), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 156/363 (42%), Positives = 212/363 (58%), Gaps = 46/363 (12%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ ILADDLGW+DV +H  S+I TPNID LA  G+ L  +YV  LCTPSR+ LMTG+
Sbjct: 23  RPHILFILADDLGWHDVGYH-DSEIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGR 81

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH GMQHGV++   P  LP  E LLPQ LK++GY+TH +GKWHLGF +   TP  RGF
Sbjct: 82  YLIHSGMQHGVLVAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGF 141

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D+ +G++            A   P+     R                    A       H
Sbjct: 142 DTFFGWYNA----------AVISPFISCRRR-------------------SAGCEFRVSH 172

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGNV 267
             + PMF+YL   AVHA     P Q PD+    + D I D  RRTYAGM S +DE++GN+
Sbjct: 173 T-TVPMFIYLPFQAVHA-----PLQVPDKYSDMYKDTIHDESRRTYAGMTSCMDEAIGNI 226

Query: 268 IAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
              L+  G+  N++++F  DN GA +FG      SN+PLRG K++ ++GG+RG A + SP
Sbjct: 227 TRTLKDKGIWNNTVIVFSTDNGGARTFG-----ASNYPLRGQKASNFEGGIRGPAFVSSP 281

Query: 327 WLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            L    +  +++EL +I DW PT    A G  I    LDG+NQW+ +++G  +KR EI++
Sbjct: 282 LLDPLVRGTINNELMYIGDWFPTFMNLAGGHVIGSKPLDGINQWETISRGVPSKRVEIVY 341

Query: 384 NID 386
           NID
Sbjct: 342 NID 344



 Score = 46.2 bits (108), Expect = 0.047,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 34/61 (55%), Gaps = 1/61 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN-KPFDKRADPARWNNIWVP 563
           L++I +D  E+ NL   + D++  L  +L  + +T VPPG    F  RADPAR    W P
Sbjct: 412 LYHIKEDAEEKVNLVHYRPDIVFMLLQRLHYHNTTAVPPGEVDKFSVRADPARHGMAWNP 471

Query: 564 W 564
           W
Sbjct: 472 W 472


>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  277 bits (709), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 177/537 (32%), Positives = 273/537 (50%), Gaps = 70/537 (13%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PH+I+I  DD+GWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A+MTG 
Sbjct: 26  QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGL 84

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G QH ++    P G+PL  KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85  YPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGF 144

Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           DSHYG W  G+ DY          P  G D R NM V  K+ G Y   +  + A +++  
Sbjct: 145 DSHYGLWTLGVGDY--DKMNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNT 201

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H    P+FL    L + A +     + P+E   K+ DI D   R + G ++ +D+ +G V
Sbjct: 202 HYPGTPLFLAFT-LDIPAKH----LEIPEEYEEKYSDIEDDRTRQFYGKLTMMDDIIGGV 256

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           + AL+  GM ++S+++F+ DNGA    + S  GSN+P RG+  T ++G  R  + +    
Sbjct: 257 VDALKNRGMYDDSLIIFIGDNGA----LSSQSGSNYPFRGIAGTMFEGATRVPSIVSGKG 312

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           +K+T   S+EL+ I D   T+   AG +  +  LDG++ W   ++   + R+EIL+NID 
Sbjct: 313 IKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDKPSPRNEILYNID- 370

Query: 388 VDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI-DKYSPKEVLYSKAGITFNAL 446
            D+P    AA+R+ D K +      G  D  Y   D  + D Y      Y+      + L
Sbjct: 371 -DDPVSPGAAIRIGDYKLIT-----GHPDLLYPYRDISLTDNY------YNYGDAPASGL 418

Query: 447 KTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLF 506
            +  Q  Q HA  P   + + L  ++ D                                
Sbjct: 419 PSTDQ--QSHA--PAPQNVEYLFNVVVD-------------------------------- 442

Query: 507 NITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
                P E+N+++    D+++ L D+L  ++  ++PP N+  +       +N++W P
Sbjct: 443 -----PEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGAAGNFNDVWSP 494


>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
          Length = 460

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 178/486 (36%), Positives = 245/486 (50%), Gaps = 70/486 (14%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  +PL EKLLPQ LKEAGYATH +GKWHLG  ++   PT RGFD+
Sbjct: 32  IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G ++YY H   A  EP       LD R   +   K    YST ++++ A  +IA
Sbjct: 92  YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     +P Q P+E +  +  I D  RR YAGMVS +DE+VGN
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+ HG+  N++ +F  DNG  +     + G+N PLRG K T W+GG+RGV  + SP
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQTL----SGGNNWPLRGRKWTLWEGGIRGVGFVASP 262

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    S EL HISDWLPTL   AG     T  LDG + W  +++G  + R E+LHNI
Sbjct: 263 LLKQKGVKSRELIHISDWLPTLVNLAGGSTYGTKPLDGFDVWRTISEGTPSPRVELLHNI 322

Query: 386 -----DNVDNPQKYYAALRVDD--LKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
                D+   P K  A  + D   LKY A           +  + +   +Y   ++L   
Sbjct: 323 DQDFFDSSPCPGKSMAEAKNDSFPLKYSA-----------FNTSIHAGIRYKNWKLLTGH 371

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCN 498
            G  F                P  ++   +R++ +  K L                    
Sbjct: 372 PGCGF------------WFPPPSQSNISEIRSLDSSSKTL-------------------- 399

Query: 499 STVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWN 558
                 LF+I  DP E+++L+E    ++ +L  +L  Y    VP    P D R DP    
Sbjct: 400 -----WLFDINQDPEERHDLSEKHPHIVLKLLSRLQYYHEHSVPSYFPPMDPRCDPKD-T 453

Query: 559 NIWVPW 564
            +W PW
Sbjct: 454 GVWSPW 459


>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
 gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
          Length = 485

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 181/539 (33%), Positives = 269/539 (49%), Gaps = 87/539 (16%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+I+ I+ADDLGWNDV +H +  + TP +D LA  G+ILNQ YV  +CTPSR+A MTG 
Sbjct: 27  KPNILFIVADDLGWNDVGWH-NPDVKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGY 85

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +P H+G QH V    +  G+P     LP+ LK+ GYATH +GK                 
Sbjct: 86  FPYHVGSQHTVFFPQQAQGIPSKFPFLPEKLKQLGYATHMVGK----------------- 128

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
                                           +  V     G Y T  +T+  ++++ +H
Sbjct: 129 -------------------------------DDKDVVKTRSGQYGTYFFTDRVVDIVEKH 157

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
               P+F+YL +  VH     +P Q P      +  + D  RR   GMVS LDE+VGNV 
Sbjct: 158 PADTPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVT 212

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
            A++K G+ +N++V+F  DNG    G  S  G+N+PLRG K T W+GG RGVA      L
Sbjct: 213 MAMKKAGLWDNTLVIFTTDNG----GWISASGNNYPLRGGKVTLWEGGTRGVAFAHGKML 268

Query: 329 KQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           ++T   ++E+ H  DW PT+ AAAG    D ++DGV+Q D +  G  + R+E ++NID+ 
Sbjct: 269 QKTGYTNNEMIHAVDWFPTILAAAG-GTADEAIDGVSQLDTIMSGQPSPRTEFVYNIDD- 326

Query: 389 DNPQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDTDNEIDKYSPKEVLYSKAGITFNA 445
           + P K   A+RV D K + G    G+ D W      T +++D   P              
Sbjct: 327 EFPTK-QGAIRVGDYKLIEGY--AGKPDGWILPPNLTGSQLDSGDP-------------V 370

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFAR-VRCNYDNKGAHCNSTVKPC 504
           + T L    K + DP       + T L + K  + R   R +    +++    +  +   
Sbjct: 371 IGTWL-FNLKVSGDP------VIGTWLFNLKAGQRRPRHRDLALQPESQPDSGDPVIGTW 423

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+ DDP E +NLA+S  D L++++ KLA Y+ ++VP      D +++P  W   W P
Sbjct: 424 LFNLKDDPTEHHNLADSMPDKLREMQAKLAEYRKSLVPAIFPAPDPKSNPDNWGGAWSP 482


>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 191/572 (33%), Positives = 283/572 (49%), Gaps = 78/572 (13%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
            +P+I+ I+ DDLG++DV F  S QI TPNID                     ++++++G
Sbjct: 33  SQPNIVFIVIDDLGFDDVGFK-SHQIKTPNID---------------------QASILSG 70

Query: 88  KYPIHIGMQHGV---ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +Y +H    HG+   I   + +GLPL    LPQ LK  GY THAIGKWHLGF++  YTPT
Sbjct: 71  RYAMH----HGIVNWIPPKDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPT 126

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFE------PYQGLDMRHNMQVDNKTIGIYSTDLYT 198
           FRGF+S  GY+ G ++Y+ H     ++      P  G +     Q+     G YST +++
Sbjct: 127 FRGFNSFLGYYSGGENYFTHKNGPAYDMHRDPLPSCGQNCS---QIAFDLQGQYSTTIFS 183

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMV 257
           + A+ VI +H   KP+FLYLA+ AVH     EP QAP   +  + D I D +RRT+AGM+
Sbjct: 184 DEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGML 238

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S LDE +GNV  AL+++G   N++++F  DNG P  G  S    N+P+RG K   ++GG+
Sbjct: 239 SCLDEGIGNVTRALKRNGFYNNTLIVFTTDNGGPILGGDSVGARNYPMRGGKHAVYEGGV 298

Query: 318 RGVAAIWSPW--LKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGA 374
           RG+A +  P     +T      L H +DWLPTLC AAG+E +   SLDGV+ WD +T   
Sbjct: 299 RGIAFVHPPQNTFVETGAGYHNLMHAADWLPTLCDAAGVECHTQQSLDGVSHWDAMTTNT 358

Query: 375 KT--KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT------------DNNGQSDEWYG 420
            +  +   +L N  ++ +      A R  D K++ GT             + G  +  YG
Sbjct: 359 NSTLRGDVVLGNATDLCDATNCGFAYRKGDWKFIRGTGGMPFSWSTPMNQSGGNCNLAYG 418

Query: 421 DTDNEIDKYSPKEVLYSKAG-ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILE 479
                +D  S   V  + A    +    T         A+  +N+      +L       
Sbjct: 419 QC---VDGASLGRVTAATAADCCYTCTNTTKCTYSTFLANQTSNNC-----LLFASSTAS 470

Query: 480 LREFARVRCNYDNKGAHCNSTVKPC---LFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
             +   +   +D K     +T+ P    LFNI  DP E  ++A S  D+ KQ+  +L   
Sbjct: 471 TLDTNCISGAHDPKQVP-YTTLNPLGYQLFNIAQDPNEHTDVASSHADVAKQMNSELDSI 529

Query: 537 KSTMVPPGNK---PFDKRADP-ARWNNIWVPW 564
             + V   N    PF     P  R+  + VPW
Sbjct: 530 LQSYVQAQNDPNCPFPTSGWPQTRYGPVMVPW 561


>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 564

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 167/415 (40%), Positives = 237/415 (57%), Gaps = 39/415 (9%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N+++  +PHI++I+ DDLG +D+  H +S I TP+ D LA +GL L+Q+YV   C+P+R+
Sbjct: 57  NSSSFFRPHILMIIMDDLGSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRA 116

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +L++G+YP+H G  H ++ + E  GLPL E+ LPQ L+ AGY  HA+GKWH+G  R   T
Sbjct: 117 SLLSGRYPLHTGC-HTIVNDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQT 175

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKA----TFEPYQGLDMRHNMQVDNKTI----GIYST 194
           PTFRGF S +G++ G QDY  H  +      +E +   D R     D   +    G YST
Sbjct: 176 PTFRGFQSFFGFYLGAQDYNTHIKQGERGNAYEMH--WDARGKCGRDCSRLVDERGNYST 233

Query: 195 DLYTEAAINVIAEHNKS--KPMFLYLAHLAVHAGNTYEPFQAPDE-----EVAKFLDISD 247
            ++T  AI VI  H +   +P+FLYLAH AVH      P Q P+      E A + + +D
Sbjct: 234 HVFTREAIRVIENHPQRPHEPLFLYLAHQAVHW-----PDQVPETYRKFYEGATYSNWTD 288

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
            +R+TYAGM+S  DES+GNV  AL+  GM EN++V+F  DNG P+    +   SN+P RG
Sbjct: 289 -QRKTYAGMLSAADESIGNVTKALQDAGMWENTLVVFTTDNGGPTAVCAAQGSSNYPKRG 347

Query: 308 MKSTPWDGGMRGVAAIWSP-WLK----------QTQKVSSELFHISDWLPTLCAAAGIEI 356
            K T ++GG  G   +  P W K          +T ++ S++FH+ DWLPTL    G   
Sbjct: 348 GKCTVYEGGTTGDGFVSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATP 407

Query: 357 NDTSLDGVNQWDVLTK---GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           N   LDGVNQWD + +    A   R E+        N Q Y  A+R    K + G
Sbjct: 408 NGKPLDGVNQWDSMLQREPSAPPPREEVFVGYAYFGN-QWYGPAIRYKHWKLIQG 461



 Score = 40.0 bits (92), Expect = 3.7,   Method: Compositional matrix adjust.
 Identities = 19/64 (29%), Positives = 34/64 (53%), Gaps = 4/64 (6%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK----PFDKRADPARWNNI 560
           L+++  DP E  N+A S   +++ L+ KL  Y ++ VPP +     PF    + + +   
Sbjct: 490 LYDLQSDPSETQNIASSYPLIVQILQGKLIEYHASFVPPISNDPTCPFTGTTNTSTFGPT 549

Query: 561 WVPW 564
           W+PW
Sbjct: 550 WLPW 553


>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 158/404 (39%), Positives = 223/404 (55%), Gaps = 31/404 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPH+I I+ DD G+ D+ +HGS  + TP +D LA  G+ L  +YVQ +C+PSRS LMTG+
Sbjct: 43  KPHLIFIMVDDQGYADIGYHGS-DVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGR 101

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LP     LP+ L +AGY TH +GKWHLGF R    PT RGF
Sbjct: 102 YQIHTGLQHSIIRPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGF 161

Query: 149 DSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
            S +G   G  D++ + SC        G D+    +   +  G YST LY E    ++  
Sbjct: 162 QSFFGTLTGSGDHFSYQSCDGA--EACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRN 219

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+ + P+FLYL+  A H      P Q P   +  +   ++  RR YA M+S LD+ V  V
Sbjct: 220 HDPNTPLFLYLSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQV 274

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L+  G+ +NSI+++ +DNG        + GSN PLRG K + W+GG+R V  + SP 
Sbjct: 275 VKELKTTGIYDNSILIYSSDNGGQPL----SGGSNWPLRGGKGSYWEGGIRAVGFVHSPL 330

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           LK+   +S E+ H+SDW PTL   AG   +   LDG + W+ +++G+ + RSEIL NID 
Sbjct: 331 LKKKGVISKEMIHVSDWYPTLLGLAGALQSHNGLDGHDVWESISEGSPSPRSEILFNIDP 390

Query: 388 VD-NPQKYY-----------------AALRVDDLKYVAGTDNNG 413
           V   P + Y                 AALRV D   + G   +G
Sbjct: 391 VSRRPGEPYEKALVLNGFGIWDTAVRAALRVGDWNLLTGNVGDG 434



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 24/60 (40%), Positives = 35/60 (58%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFN++ DP E+++LAE   +++K L  +LA Y  T +   N P D  ADP     +W PW
Sbjct: 465 LFNVSGDPYERSDLAELHPEVVKHLLTRLAEYNQTAMIANNPPDDPMADPELHGGVWSPW 524


>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 155/382 (40%), Positives = 220/382 (57%), Gaps = 15/382 (3%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++PHI++I  DDLGWNDV +H +  I  P +  LA +G+I NQ Y    CTPSR+A+MTG
Sbjct: 25  EQPHIVLIFVDDLGWNDVEWH-NPDIKMPVLSKLAADGVIFNQSYTHPTCTPSRAAMMTG 83

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP   G QH +I    P G+PL  KLLP+ LKE GY TH +GKWHLGF +E Y PT RG
Sbjct: 84  MYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGYFTHMVGKWHLGFCKEEYLPTSRG 143

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDSHYG W     +YD        P +G D R N+ V  K+   Y T +  E A ++I  
Sbjct: 144 FDSHYGLWTLGVGHYD-KMNGVLSPSEGYDFRDNIGVVPKS-DEYLTLMLAERAEHIING 201

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H    P+FL    + + A +     + PD   A + DI D   R + G +S +D  +G V
Sbjct: 202 HYNKHPLFLQFT-MDIPAKH----LEIPDTFEALYCDIEDERTRKFYGKLSLMDHVIGRV 256

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           + AL+   M ++++++F+ DNGA    + S  GSN+P RG+  T ++G  R  + I    
Sbjct: 257 VDALKAREMWDDTVLMFIGDNGA----LASQAGSNYPFRGIAGTLFEGATRVPSLIHGSM 312

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNID 386
           L++T  V++EL+++ D   T+ A AG E      LDG+N WD  +KG K+ R E ++NID
Sbjct: 313 LRETGYVNNELWNLVDLHRTILALAGAESGSMCELDGMNMWDTFSKGKKSPREEHVYNID 372

Query: 387 NVDNPQKYYAALRVDDLKYVAG 408
             D+P    AA+R+ D K + G
Sbjct: 373 --DDPISPGAAIRIGDYKLITG 392



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 20/59 (33%), Positives = 36/59 (61%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+  DP E+NN+A+   D+++ L D+L  ++  ++PP N+  +   D A +  +W P
Sbjct: 438 LFNVVVDPSEKNNIADQHPDIVQNLRDRLDEHRKLLIPPVNQTTELAGDAANYCGVWSP 496


>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 496

 Score =  274 bits (701), Expect = 8e-71,   Method: Compositional matrix adjust.
 Identities = 163/383 (42%), Positives = 223/383 (58%), Gaps = 27/383 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+  + DDLGWNDV +H +S I TP ID LA +G+ LN +YV + C PSR+ L++G+
Sbjct: 24  QPHIVFFVVDDLGWNDVGYH-NSYIKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGR 82

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           + I IG+QHG I    P GLPL E  +   LKE GYATH IGKWH G +     P  RGF
Sbjct: 83  HVIDIGLQHGEI-GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGF 141

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQ-VDNKTIGIYSTDLYTEAAINVIA 206
           D+ +GY     D+Y H   +      GL D+R N + V  K  G YST +Y   A N+IA
Sbjct: 142 DTFFGYLGSSDDHYTHIIMS-----NGLADLRLNDECVGYKYFGDYSTIMYANEAKNIIA 196

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVG 265
           +H+++KP+FL LA  AVH     +P Q PD  +  +   I D ERRTYAGM S +DE++ 
Sbjct: 197 QHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYYKSTIHDNERRTYAGMASCVDEAIA 251

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           N+  AL   GML+NS+++   DNG  S       G+N PLRG K++ W+GG+RGVA ++S
Sbjct: 252 NITEALENKGMLKNSVIVLTTDNGGGSV------GNNWPLRGRKNSHWEGGVRGVAFVYS 305

Query: 326 PWLKQTQKVS--SELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
             L    + +  SEL HI+DW PTL     G      +L GV+QW+++     + R E+L
Sbjct: 306 DLLPMDVRGTENSELMHITDWFPTLVKLGGGNPCERKNLYGVDQWEMIRGMEPSARGEVL 365

Query: 383 HNIDNVDNPQKYYAALRVDDLKY 405
             +D      + Y   R    KY
Sbjct: 366 IALDTTRKASEEY---RTGPFKY 385


>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
 gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
          Length = 498

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 170/408 (41%), Positives = 230/408 (56%), Gaps = 36/408 (8%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
            L  TL    A   +T   KP+I+ I+ADDLGW DV FHGS  I TPNID LA  G  L 
Sbjct: 39  GLAATLASGTALAQST---KPNIVFIVADDLGWKDVGFHGSD-IKTPNIDQLAEKGGRLE 94

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
           Q Y Q +CTP+R+ALMTG+YP   GMQ  VI +G  +GL L + LLP+ LKEAGYAT A 
Sbjct: 95  QFYAQPMCTPTRAALMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAAS 154

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKT 188
           GKWHLG     + P  RGFDS YG   G  D++ H S     + Y     R+N  ++   
Sbjct: 155 GKWHLGHADTAFWPRQRGFDSFYGALLGEIDHFTHKSANGNADWY-----RNNEAIEEAG 209

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
              +   L+   A+ VI EH++SKP+FLYLA  + H      PFQAP E + +   I+D 
Sbjct: 210 ---FDNILFATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADE 261

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSN-KGS--- 301
            RR YA M+S +D+ +G V+ AL   GM EN+++LF +DNG      F   S+ KG    
Sbjct: 262 SRRAYAAMISVMDDGIGRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPA 321

Query: 302 -NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEIN-D 358
            N P R  K T ++GG R V  I   W  + +  S++ + H+ D  PTL A AG     +
Sbjct: 322 DNGPYRQGKGTTYEGGTRVVGLI--NWPGKIEPGSADGMMHVVDMYPTLAALAGATPGKN 379

Query: 359 TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
             LDG++ W +L +G  + R+EI++N++ +       AA+R  D K V
Sbjct: 380 KPLDGMDMWPMLGEGKPSPRTEIVYNVEPM------TAAVRQGDWKLV 421


>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 153/392 (39%), Positives = 228/392 (58%), Gaps = 17/392 (4%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +   LT   L  +  L  TA ++PHI+ ILADDLG+ DV +   S + TPNID LA  G+
Sbjct: 4   FVIILTLISLMTNFDLARTA-QRPHIVFILADDLGYFDVGYRNGSIVKTPNIDKLAAEGV 62

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGY 124
            L +HY Q  C PSRS LM G+Y IH G  +    +G+   L +      +P  LKE GY
Sbjct: 63  KLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRC-KDGKRSQLCMHPDTITMPMKLKENGY 121

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
           ATH IGKWHLG  R    P  +GFD+ +GY   ++DY+ H   A    +     R++ +V
Sbjct: 122 ATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIEDYFTHYTPAGANCHDFW--RNHDEV 179

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
            +   G YST L+T+ A ++I  H+ ++PMF+YL++ AVH      P Q P      + D
Sbjct: 180 ADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHG-----PLQVPPHYYDMYND 234

Query: 245 --ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
                  R+TYA M + +DE +GNV + L++ GM +N++++F++DNGA     H+  G N
Sbjct: 235 GPRVSTTRQTYAAMATCMDEGIGNVTSTLKQRGMWDNTVLIFVSDNGAQ---FHTG-GRN 290

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
            PLRG K + ++GG+R V+ + SP +++ +++S ++ H+ DW PTL   AG  ++D  LD
Sbjct: 291 WPLRGGKGSVFEGGIRVVSFVTSPLIERPRRISEDIVHLMDWFPTLTHLAGGTVDDVMLD 350

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
           GVN WD ++KG  + R E+L +I   D    Y
Sbjct: 351 GVNVWDTISKGTLSPRRELLVHISATDGVTNY 382


>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
           caulinodans ORS 571]
 gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
           caulinodans ORS 571]
          Length = 490

 Score =  273 bits (699), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 156/388 (40%), Positives = 217/388 (55%), Gaps = 24/388 (6%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F A    L    +     A  +PHI+ ILADDLG+ DV FHGS  I TPN+D LA  G  
Sbjct: 25  FMAAVAGLSALASPARAQAAPRPHILYILADDLGFADVGFHGS-DIKTPNLDKLAATGAT 83

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L Q Y Q +CTP+R+ALMTG+YP+  G+Q GVI  G  +GL   E LLPQ LK  GY+T 
Sbjct: 84  LGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVIPSGASYGLATDEFLLPQALKSVGYSTA 143

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            IGKWHLG  ++ + P  RGFD  YG   G  D+Y H      + Y     R N QV  +
Sbjct: 144 LIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEIDHYKHEAHGVVDWY-----RDNKQVVEE 198

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
               Y T+L+   A  +I  H+   P+FLYLA  A H      PFQAP   V ++ +I+D
Sbjct: 199 G---YDTELFGTDAARLIGAHDPKTPLFLYLAFTAPHT-----PFQAPQAYVDRYPNITD 250

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------- 300
           P RR YA M+S +D+ +G V+AAL + GM EN++++F +DNG     +   +G       
Sbjct: 251 PARRLYAAMISAMDDQIGKVVAALEQRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELP 310

Query: 301 -SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
            SN P R  K T ++GG R VAA+ +   + T   +  + H+ D +PTL   AG  ++ +
Sbjct: 311 ASNAPYRDGKGTLYEGGTR-VAALANWPGRITPGAADGVMHVVDMMPTLARLAGANLDRS 369

Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNID 386
             LDGV+ W  L+    + R  +++N++
Sbjct: 370 KPLDGVDVWSALSSARTSPRDGVVYNVE 397


>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
 gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
          Length = 495

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 161/418 (38%), Positives = 228/418 (54%), Gaps = 39/418 (9%)

Query: 8   FFALTCTLLFNDAFLNTT---------APKKPHIIIILADDLGWNDVSFHGSSQIPTPNI 58
           F   TC + +    LN           + + PHI+ I++DD GW DV FHGS  I TPN+
Sbjct: 17  FAVFTCCITWKLLTLNQQDLPVAVAQQSSQPPHILFIMSDDQGWKDVGFHGSD-IRTPNL 75

Query: 59  DALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQY 118
           D LA  G  L Q+Y Q +CTPSR+AL+TG+YP   G+Q  VI     +GLP  E LLPQ 
Sbjct: 76  DQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVIPSAGKYGLPTDEYLLPQA 135

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
           LKEAGY T  +GKWHLG     Y P  RGFD  YG   G  DY+ HS     + Y     
Sbjct: 136 LKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEIDYFTHSAHGKVDWY----- 190

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
           R+N  +  +    Y T L  + A+ +I +HN   P+FLYLA  A HA     P+QAP + 
Sbjct: 191 RNNQLIKEEG---YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPHA-----PYQAPQKY 242

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG---- 294
           + ++  I+DP RR YA M++ +D+ +G V+AAL K GM  N++++F +DNG P       
Sbjct: 243 LDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMRNNTLIVFQSDNGGPRSAQFTG 302

Query: 295 -IHSNKGS----NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
            + ++ G+    N P R  K++ ++GG R VA    P   Q   V +   HI D  PTL 
Sbjct: 303 EVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGKIQPGTVVNHPIHIVDMYPTLT 362

Query: 350 AAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
             A + +  +  LDG+N W  L++   + RS+++++I+       + AAL  +D K V
Sbjct: 363 GLASVSVGKNKPLDGLNIWPALSEAKPSPRSQVVYDIE------PFRAALSQEDWKLV 414


>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
          Length = 609

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 179/566 (31%), Positives = 273/566 (48%), Gaps = 112/566 (19%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PH + ILADD+G++D+ +H  ++I TP +D LA  G+ L Q+YVQ +CTP+R  LMTG
Sbjct: 96  KQPHFLFILADDMGYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTG 154

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y I  GMQHGV+   +P G+PL EKLLP+ L++ GY T  IGKWHLG F E Y P  RG
Sbjct: 155 RYQIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRG 214

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIYSTDLYTEAAI 202
           FD   G++ G QD+Y H+    F    G D R       + I       YST ++ +   
Sbjct: 215 FDHFMGFYTGSQDFYSHN--KCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELE 272

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
             +++ N S+P F YL+  AVH+     P QAP + + ++ ++ + +RR Y  MV+ +D 
Sbjct: 273 KRLSKMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDT 327

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           ++  V+ A +K    +++I++F +DNG    G      SN PLRG K + ++GG+R +  
Sbjct: 328 AIYKVVKAYKKFKFWDDTILIFSSDNG----GNLKGGASNWPLRGAKGSLFEGGVRTIGF 383

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND---TSLDGVNQWDVL---TKGAKT 376
           + SP           L  ++DW PTL   AG + +D     LDGV+Q   L   + G   
Sbjct: 384 VHSP-----------LLPVTDWFPTLLQLAGCKTSDYGGKPLDGVSQVRQLWMGSNGNDH 432

Query: 377 KRSEILHNID------NVDNPQKY------------YAALRVDDLKYVAGTDNNGQSDEW 418
            R EILH +D      N  +P+++              ALR    K + G  +  Q + W
Sbjct: 433 ARDEILHGLDPLNKVKNTVDPRQFPIIKNRTFSVDVEGALRWKQWKLITGKQSPDQ-NRW 491

Query: 419 YGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL 478
                      +P   +Y  A   F+ +            +PK +    L  ++ D    
Sbjct: 492 ----------STPPTTMYGGAKTIFDKV------------EPKKSKLVRLYNVVDD---- 525

Query: 479 ELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
                                            P E+N++++    ++  +  KLA Y  
Sbjct: 526 ---------------------------------PFERNDISDEHPQVVNVMLTKLADYYD 552

Query: 539 TMVPPGNKPFDKRADPARWNNIWVPW 564
             VPP    +D RADPA  + +W PW
Sbjct: 553 ASVPPRYPAYDLRADPALGDGVWRPW 578


>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  272 bits (696), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 182/488 (37%), Positives = 265/488 (54%), Gaps = 44/488 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHI++ILADDLGWNDV +H + ++  P ++ LA +G+I NQ YVQ  CTP+R+ALM+G
Sbjct: 29  KPPHIVLILADDLGWNDVGWH-NPEVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSG 87

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP   G QH ++L   P GLPL  K LPQ LK+ GY TH +GKWHLGF +E + PT RG
Sbjct: 88  YYPFKTGNQHQLLLNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRG 147

Query: 148 FDSHYGYWQGLQDYYDHSCKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FDS YG   GL     H  K        G D   N  V  +T   Y   +  + A+ +I 
Sbjct: 148 FDSFYG---GLTLGTSHFSKMNGILSTPGYDFYDNSGVVPQT-NDYLAFMLADRAVKIIN 203

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H +  P+F+Y + + V A +     + P E  A + +I+D   R + G +S +D++VG+
Sbjct: 204 GHYQEYPLFMYFS-MDVPAKSP----EVPPEYEALYANITDDRTRRFYGKLSVMDDAVGS 258

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V+ AL+K  M ++++++F+ DNGA    + S+ GSN PLRG+ +T ++G  R  A I   
Sbjct: 259 VVDALKKRDMWDDTLLIFIGDNGA----LASSSGSNWPLRGIAATLFEGATRVPAFIHGN 314

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
            LKQT   ++EL+HI D   T+ + AG E +   +DG++ WD  +K   + R+EI++NID
Sbjct: 315 MLKQTGYENNELWHIVDLHKTILSLAGAE-SSCEIDGMDMWDTFSKNESSPRNEIVYNID 373

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDN-------NGQSDEW--YGDTDNE-------IDKYS 430
             D+P    AA+RV D K +AG  +         QSD W  YGD            D+  
Sbjct: 374 --DDPISPGAAIRVGDYKLIAGNPDLFYPLRLINQSDGWFNYGDVPTSGLPVLPTGDEPP 431

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNY 490
           P  V Y      FN +    +  + + AD       ALR  L + +I  +    +V   +
Sbjct: 432 PPNVTY-----LFNVIDDPEE--RNNIADDHPEIVQALRQRLDEHRIYYMPAVNQV---F 481

Query: 491 DNKGAHCN 498
           D  G   N
Sbjct: 482 DEAGDPSN 489



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 38/59 (64%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+ DDP E+NN+A+   ++++ L  +L  ++   +P  N+ FD+  DP+ ++ ++ P
Sbjct: 438 LFNVIDDPEERNNIADDHPEIVQALRQRLDEHRIYYMPAVNQVFDEAGDPSNFDGVFSP 496


>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
          Length = 518

 Score =  271 bits (693), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 157/394 (39%), Positives = 230/394 (58%), Gaps = 26/394 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           TT P  P+I+++LADDLGWNDVS+H +S +  PN+  LA  G+IL   Y Q  CTPSR+A
Sbjct: 26  TTNP--PNIVMVLADDLGWNDVSWH-NSIVQMPNLQDLAERGVILEHAYAQEKCTPSRAA 82

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            +TG+YPI+ GMQ  V++  +  GLP+  KLLP YLK+ GYATH IGKWH+G+  E YTP
Sbjct: 83  FLTGRYPINTGMQEEVVVATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTP 142

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           T RGFDSHYG++     Y ++S     +   G D R ++ ++    G Y+T  +T+ A  
Sbjct: 143 TRRGFDSHYGFYNSGISYSNYSSTEGTD--VGYDYRDDLALNLAAEGKYTTTDFTDQAKT 200

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD--ISDPERRTYAGMVSRLD 261
           +I  H+++ PMFLY+A+ A H      PF+  +E      D  + D  R+TY GM+S LD
Sbjct: 201 LIDNHDQTNPMFLYMAYNAPHT-----PFEV-EESYRDIYDGNLRDGNRKTYLGMISALD 254

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGA--PSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           E VG ++  L++ GM  N++ +F +DNG   P  G     G+N PLRG K + ++GG R 
Sbjct: 255 EQVGQLVDKLKEVGMWSNTVFVFYSDNGGTQPQSG---QSGNNFPLRGKKGSLFEGGYRV 311

Query: 320 ----VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGA 374
               +A   +  L+     SS LFHISD   T  A AG +      +DG+N W  ++   
Sbjct: 312 PAFLIARTRAGNLELIASTSSTLFHISDMFATFIALAGGDAAVPADIDGINAWPAISGNT 371

Query: 375 KTKRSEILHNI---DNVDNPQKYYAALRVDDLKY 405
            + R+ +++NI   +  +N +       + DL+Y
Sbjct: 372 PSPRTTLVYNIITKNFANNARNTEPKAAIRDLQY 405


>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
          Length = 518

 Score =  271 bits (692), Expect = 8e-70,   Method: Compositional matrix adjust.
 Identities = 178/547 (32%), Positives = 277/547 (50%), Gaps = 58/547 (10%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           +++ ++ DD GW D   HGS ++ TPN+DA+A +G++L ++Y Q +C+P+RS+L+TG+YP
Sbjct: 19  NVVYLMIDDFGWGDFGVHGS-KLETPNLDAIARDGILLEKYYTQQVCSPTRSSLLTGRYP 77

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           I  GMQH VIL G+  G+P    LLPQ LK  G+ATH +GKWH G   E   P  RGFDS
Sbjct: 78  IRYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPFNRGFDS 137

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           +YGY QG +D+Y    C+A  + + G+D        N T G Y T++Y+     V+ + +
Sbjct: 138 YYGYLQGAEDHYSRIQCQA--KEWCGVDFCTENGPTNSTWGTYGTEIYSAQVAQVLDKVS 195

Query: 210 K-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD-ESVGNV 267
           K  KP FLY A   VH     +P QAP+    KF  I DP+RRTYA MVS +D E+VG+ 
Sbjct: 196 KEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIMDEETVGDF 250

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L +  + ++++++  ADNG    G   + G+N PLR  K + ++GG+   A +    
Sbjct: 251 MTMLHERNLYDDTLIIITADNG----GETRSGGNNFPLRSQKWSLYEGGVHVNAMLSGGL 306

Query: 328 LK-QTQKVSSELFHISDWLPTLCAAAGIE--INDTSLDGVNQWDVLTKGAKTKRSEILHN 384
           +K Q++     L H+SDW  T+  A   +   + + LDG++QW  + + A   R  I+HN
Sbjct: 307 IKDQSKSTFDGLVHVSDWRETILEAMECKPPADKSDLDGISQWQAILQNASPPRQSIVHN 366

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           ID +               K V GTD      +   D    I+  S   + Y +    + 
Sbjct: 367 IDPI---------------KTVKGTDERQWVQKLAADASFNINSQS--AIRYGR----WK 405

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
            L             P    +  +   L D+ + E  E                 T    
Sbjct: 406 LLTGDPAFGVPDGNIPPPEDARPVWQKLLDDSMFESSE---------------TLTRLVQ 450

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR----WNNI 560
           L++I +DP E+++++++  D++K   D L+ +  T V P     D  ++P      +   
Sbjct: 451 LYDIEEDPFEKHDISDNHPDIVKIGLDLLSRHNETAVWPNFPHTDPNSNPETRSDGFAGF 510

Query: 561 WVPWYDE 567
           W PW  E
Sbjct: 511 WWPWVQE 517


>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 469

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 154/380 (40%), Positives = 229/380 (60%), Gaps = 29/380 (7%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
             L  T  F  A        KP+II+ILADDLGWNDV FHGS +I TPN+D LA +G+ L
Sbjct: 14  LTLIITYYFRGANELKAEINKPNIIVILADDLGWNDVGFHGS-EIKTPNLDKLAASGVRL 72

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
            + YV+++CTP+R+A +TG++P   GM  GVI   +PW   GLPL EK + + LKEAGY 
Sbjct: 73  ERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI---KPWDKVGLPLEEKTIAETLKEAGYY 129

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  +GKWHLG ++E + PT RGFD HYG++ G  DY+ H+     + + G    H  ++ 
Sbjct: 130 TAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGIDYFTHN-----DDFLGALDWHRNRIH 184

Query: 186 NKTIGIYSTDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
            K  G Y+TDL  + A+ +I  HN + +P+FLY+A  A H      P  A  E++  +L 
Sbjct: 185 LKEEG-YATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHT-----PLHAKTEDIEDYLT 238

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I D +RR +A  V  +DE++G +I +L    + +N+ V F++DNG     + +N+G N P
Sbjct: 239 IDDEKRRVFAAQVQSMDEAIGKIIQSLENQQVCDNTFVFFVSDNGGSV--MRANRGDNRP 296

Query: 305 LRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLD 362
           LRG K++ ++GG+R  A + + P L   Q++ +++F I D  PTL   AG+E +    LD
Sbjct: 297 LRGGKNSLYEGGVRVPAIVSYPPKLSANQEI-NQIFSIVDLYPTLAKLAGVESVEGKQLD 355

Query: 363 GVNQWDVLTKGAKTKRSEIL 382
           G+   D+  K   ++R E+L
Sbjct: 356 GL---DIFAK--HSRRDELL 370


>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
          Length = 782

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 145/323 (44%), Positives = 200/323 (61%), Gaps = 15/323 (4%)

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           N  YV +    ++S   T +  IH G+QH +I   +P  +PL EKLLP+ LKEAGYATH 
Sbjct: 333 NYGYVPSYLNATKSP-HTKREQIHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHM 391

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQV 184
           +GKWHLG +R+   PT RGFDS++GY  G +DYY H               LD R   +V
Sbjct: 392 VGKWHLGMYRKECLPTRRGFDSYFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEV 451

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
                 +YST+++ + A+++IA H   KP+FLYLA  +VH     EP Q P+E V  +  
Sbjct: 452 AVGYKNMYSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSF 506

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I + +RR YAGMVS +DE+VGNV AAL++HG+ +N++++F  DNG  +       G+N P
Sbjct: 507 IQNKKRRNYAGMVSIMDEAVGNVTAALKRHGLWDNTVLIFSTDNGGQTLA----GGNNWP 562

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDG 363
           LRG K T W+GG+RG+  + SP LKQ    + EL HISDWLPTL   AG  +N T  LDG
Sbjct: 563 LRGRKWTLWEGGVRGIGFVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVNGTKPLDG 622

Query: 364 VNQWDVLTKGAKTKRSEILHNID 386
            + W  +++G+ + R E+LHNID
Sbjct: 623 FDVWKTISEGSPSPRVELLHNID 645


>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
 gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
          Length = 501

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 152/369 (41%), Positives = 213/369 (57%), Gaps = 24/369 (6%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P +P+I+ I+ADD+G+ D  F+GS  IPTPN+D LA  G  L Q Y   +CTP+R+ALMT
Sbjct: 57  PARPNIVYIVADDMGFGDAGFNGS-DIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMT 115

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP+  G+Q GVI     + LP+ E LLPQ LK+ GY T  +GKWHLG  +  + P  R
Sbjct: 116 GRYPLRYGLQVGVIPAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQR 175

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GFD  YG   G  D++ HS     + Y     R+N  ++      +   L+   A+ V+ 
Sbjct: 176 GFDYFYGALVGEIDHFKHSSHGVKDWY-----RNNKPLNETG---FDNTLFGNEAVRVVE 227

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H    P+FLYLA  A H      PFQAP + + +F DI+D  RR YA M+S +D+ VG 
Sbjct: 228 RHEGKSPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGK 282

Query: 267 VIAALRKHGMLENSIVLFMADNGAPS---FGIHSNKG-----SNHPLRGMKSTPWDGGMR 318
           V+AAL K GM +N+I++F +DNG      F   S  G     SN P R  K T ++GG R
Sbjct: 283 VVAALEKKGMRDNTIIVFHSDNGGVKNSLFAGDSKVGGELPASNGPYRDGKGTLYEGGTR 342

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTK 377
            VAA+ +   K    V   + H+ D  PTL A A  ++  +  LDG+N WD L++G  + 
Sbjct: 343 -VAALANWPGKIKPGVVDSMVHVVDMYPTLAAVADAKLEKNKPLDGMNVWDALSEGKPSP 401

Query: 378 RSEILHNID 386
           RSE+++N+D
Sbjct: 402 RSEVVYNVD 410


>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 533

 Score =  269 bits (688), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 173/522 (33%), Positives = 262/522 (50%), Gaps = 76/522 (14%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           +I+ ++ADDLG+NDV FHGS QIPTP +DALA +G+ L  ++   +C+PSR+++++G++ 
Sbjct: 57  NILFLVADDLGFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHA 116

Query: 91  IHIGMQHGVIL---EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           IH    HG+ +   +G  + L L  +LLP+ L+  GY THA+GKWHLG       PT RG
Sbjct: 117 IH----HGIYMPFAQGTAYHLSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRG 172

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS  GYW G +DY+ H C   +      D  +N        G+YS   +T+ A++V+A 
Sbjct: 173 FDSFLGYWSGAEDYFAHDCAGAY------DFANNETTAWAYDGVYSAYSFTDRAVDVVA- 225

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
            + S P FLY+A   VH      P + P   +  F  +    RR  +GMVS LDE+VGNV
Sbjct: 226 -SASTPYFLYVAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNV 279

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
             AL+     ++++V+F++DNG P+ G      +N PLRG K+T ++GG R V  + +P 
Sbjct: 280 TGALKARRAYDDAVVVFVSDNGGPTNGDEGTASNNWPLRGGKNTLYEGGTRVVGLLKAPG 339

Query: 328 LKQTQKVSSELFHISDWLPTL--CAAAGIEINDTSL---DGVNQWDVLTKGAKTKRSEIL 382
           + ++    ++L H  DW PTL   A  G  + +  L   DGV+ +DVL +GA  +   +L
Sbjct: 340 VAKSSASRAKL-HAVDWFPTLYRFAGGGPLVLNPPLAPGDGVDSYDVLARGAPGRDELLL 398

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVA-GTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
                  N   +  AL V D K++  GT        W   T     +  P+ V+      
Sbjct: 399 EAHAPGSNDTVHGEALIVGDWKFLRIGTVRPDSEAGW---TRPPGQEAHPRSVV------ 449

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTV 501
                           A P+   ++A R                                
Sbjct: 450 -------------DCGAAPEPADANATRR---------------------------QCVA 469

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP 543
            PCLF++  DPCE+ + A    D+L  L+ +L  Y++T VPP
Sbjct: 470 APCLFDVAADPCERVDRAVEFPDVLAALDARLDAYRATAVPP 511


>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
          Length = 659

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 159/365 (43%), Positives = 203/365 (55%), Gaps = 20/365 (5%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++PHI+ +LADDLGWNDV +H  S I TP++DALA  G+ L  +Y Q LCTPSRS L+TG
Sbjct: 172 RQPHIVFVLADDLGWNDVGYH-DSNIFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTG 230

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL EKLLP+ L+EAGY TH +GKWHLG FR+   PT RG
Sbjct: 231 RYQIHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRG 290

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD+ +GY  G +DYY H      +        LD R    V       YST+++TE AI+
Sbjct: 291 FDTFFGYLLGSEDYYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAID 350

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +IA+H   K        L V   +   P     EE   FL        T   ++      
Sbjct: 351 LIAKHPAQKGT----EDLIVILDSRGVPTLVSSEEQVSFLCKDQSSDHTSLQII-----H 401

Query: 264 VGNVIAALRKHGMLENSIVLF-MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            G++          ENS   F   DNG  +       G+N PLRG K T W+GG+RGV  
Sbjct: 402 FGSIKNVQTPRASSENSSSAFPFPDNGGQTLA----GGNNWPLRGRKWTLWEGGIRGVGF 457

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEI 381
           + SP LKQT   S EL HISDWLPTL   AG   N T  LDG + W  +++G  + R E+
Sbjct: 458 VASPLLKQTGVQSRELIHISDWLPTLVNLAGGHTNSTKPLDGFDVWKTISEGNPSPRMEL 517

Query: 382 LHNID 386
           LHNID
Sbjct: 518 LHNID 522



 Score = 38.9 bits (89), Expect = 7.7,   Method: Compositional matrix adjust.
 Identities = 20/60 (33%), Positives = 32/60 (53%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+ +L+     ++K+L D+L  Y+ + VP      D   DP +   +W PW
Sbjct: 600 LFDIQQDPEERYDLSSKHPSIVKKLLDRLQYYQKSSVPVFYPDEDPHCDP-KATGVWGPW 658


>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
 gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
          Length = 455

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 194/552 (35%), Positives = 265/552 (48%), Gaps = 115/552 (20%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+I+ ILADD GWND+ +H +S I TPN+D LA  G+ L  +YVQ +C+PSR  LMTG+Y
Sbjct: 1   PNIVFILADDYGWNDIGYH-NSFIRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            IH G+QH VI+   P GLPL E  LPQ LKE GY+T+ +GKWHLGFFR+ Y P  RGF+
Sbjct: 60  QIHYGLQHSVIMCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFE 119

Query: 150 SHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
             +GY  G +DY+ H   + F      + GLD+R   +      G YST L+   AI +I
Sbjct: 120 RFFGYLTGGEDYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEMI 179

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
             H++SKPMFLYL   AVH      P +AP+E    + DI +   RTYA MV+ +DE+VG
Sbjct: 180 LNHDQSKPMFLYLPFQAVHG-----PLEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAVG 234

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG----GMRGVA 321
           NV  AL+  GM  N+I++F   + A +                    +DG    G+R V 
Sbjct: 235 NVTGALKHRGMWRNTILIFSTASSAFT---------------QSQIKYDGTISYGIRLVK 279

Query: 322 -AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
             I +  L   Q     LF   D L  +                  + + + GA + R E
Sbjct: 280 FCILALILSDVQPHYINLFPYLDMLFFMS-----------------FKICSTGAASPRKE 322

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAG 440
           ILHNID    P K  + L+               S + YGD  N                
Sbjct: 323 ILHNID----PLKPSSPLQ--------------SSHKLYGDIFN---------------- 348

Query: 441 ITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNST 500
                                 +   ALR+   D KIL     ++ R  + ++G   N  
Sbjct: 349 ---------------------TSVYAALRS--GDWKILT---GSQDRGGWKHEGPSHNLF 382

Query: 501 VKP--------CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
            KP         LFNI  DP E+ +L++S   +++ +  KLA Y  T V P   P D RA
Sbjct: 383 DKPNDPPEKSIWLFNIRSDPEEKKDLSDSHPGIVQDMLIKLAEYNKTAVSPFWPPRDLRA 442

Query: 553 DPARWNNIWVPW 564
           +PA   ++W PW
Sbjct: 443 NPALHGDLWGPW 454


>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
           impatiens]
          Length = 413

 Score =  268 bits (685), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 138/325 (42%), Positives = 210/325 (64%), Gaps = 20/325 (6%)

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           +VS+LD+SVG+V+ ALR  GMLENSIV+FM+DNGAP+ GI SN+GSN+PLRG+K++PW+G
Sbjct: 1   VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPTNGILSNQGSNYPLRGIKNSPWEG 60

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGA 374
           G RGVAAIWSP ++++++VS+++  +SDWLPTL +AAG++ I   ++DG + W  L  G 
Sbjct: 61  GTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQLGNIDGFDLWPSLVSGK 120

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID----KYS 430
            + RSEI+ NID++ +    YAA+R  D KY+ G    G +  W G + +  +    +Y 
Sbjct: 121 ISPRSEIVINIDDLAD----YAAIRRGDFKYIIGRTETGIA--WLGASGDPSEGVSPRYD 174

Query: 431 PKEVLYSKAGITFNAL---KTKLQIKQKHAADPK-----ANSSDALRTILTDEKILELRE 482
           P +VLYSK G+  + +   K  +++ +K   + +     ++ ++    ILT E I+E+R+
Sbjct: 175 PHKVLYSKTGVAISGIITAKQAMELNKKKKENIRIVYDTSSKTNFQEKILTSEDIIEMRK 234

Query: 483 FARVRCNYDNKGA-HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
            A+++CN   K    C   + PCLFNI  DPCE  NLAE    +L   E  L  ++ T++
Sbjct: 235 KAQIKCNVTEKDKIPCEPMIAPCLFNIEQDPCEMVNLAERNPVILTIFETILIRHRLTVI 294

Query: 542 PPGNKPFDKRADPARWNNIWVPWYD 566
           PP N   D R++P+ WNN W  W +
Sbjct: 295 PPSNLDGDPRSNPSLWNNTWTSWAE 319


>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
          Length = 571

 Score =  267 bits (683), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 190/563 (33%), Positives = 286/563 (50%), Gaps = 67/563 (11%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           F+     + P+ + I+ADDLG+NDVS+H + Q+ TPN+  +A NG+IL + Y QA CTPS
Sbjct: 23  FVAAQNDRPPNFLYIMADDLGYNDVSWH-NPQVLTPNLGKMAKNGVILTESYSQAACTPS 81

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R++ MTG YP  IG+Q+ V+ EG    +PL    LP+ LKEAGY +H +GKWHLG  R  
Sbjct: 82  RASYMTGYYPFRIGVQNSVVREGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRD 141

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKAT-----FEPYQG---LDMRHNMQVDNKTIGIY 192
            TP  RGFD+  G   G  DYY    +A      F+P       D   N  +       Y
Sbjct: 142 VTPPGRGFDTFLGLLNGYNDYYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDY 201

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-PDEEVAKFLDISDPERR 251
           +TD++T  AI +I + +K  P FL L + A H      P Q  P      + D+ D  RR
Sbjct: 202 TTDIFTNRAIELI-QQSKDTPFFLALHYTAPHW-----PLQKHPGFSDKDYPDVKDLNRR 255

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   ++ +D  +G V+ AL++H +L+N++++F +DNG    G      +N PLR  K+T
Sbjct: 256 IYLSSITAMDTGIGKVVDALKQHNLLDNTLIVFQSDNG----GDTHFSANNGPLRDRKTT 311

Query: 312 PWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            ++GG++  +  + P  L  T + S +LFHISDW  T+ +AAG+E  +  +DG++QW++L
Sbjct: 312 LYEGGVKVPSMAYGPGLLTNTPRTSDDLFHISDWFTTILSAAGLEAGN--VDGIDQWNLL 369

Query: 371 TKGAKTKRSEILHNI--DNV-----------DNPQKYYAALR---VD----DLKYVAGTD 410
             G+K+ R   ++NI  DN             NP+ + A  R   VD    DL       
Sbjct: 370 RYGSKSPRQTFIYNIHGDNAAIRAGRYKLIEGNPKAFQARRRRRSVDVLDQDLAVHGSAL 429

Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
              + DE + D       Y   E    KA  +   LK   QI+      P  +       
Sbjct: 430 TRHKRDEEFVDRR----PYESDEDKVMKAFSSLTGLKKLEQIR------PTGD------- 472

Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
             T + IL + E  ++R +     A C+   K  L+N+T DP E+ +L + +  ++  + 
Sbjct: 473 -WTMDSILSVLE--KIRPDQVKVQADCH---KCQLYNLTADPSEKKDLVKEKPAVVLFMH 526

Query: 531 DKLAIYKSTMVPPG-NKPFDKRA 552
             L +YK+  V P    PF K A
Sbjct: 527 TLLELYKAEEVEPVLAYPFKKDA 549


>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score =  267 bits (682), Expect = 1e-68,   Method: Compositional matrix adjust.
 Identities = 159/390 (40%), Positives = 219/390 (56%), Gaps = 27/390 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I+ ILADDLG +DV + GS +I TP++D LA  G  L+Q YVQ +C+P+R+AL+TG+
Sbjct: 52  RPNIVFILADDLGSHDVGWRGS-EIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGR 110

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G Q GV+     +GLPL E+ LPQ LKEAGY T   GKWHLG F+  Y PT RGF
Sbjct: 111 YPFRYGFQTGVVRPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGF 170

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D  YG++ G+ DYY H        + G D   N Q +N   G YST+L  + A   + E 
Sbjct: 171 DHQYGHYNGMLDYYTHIR------HGGFDWHRNDQ-ENHDEG-YSTELVGKEAARRVRER 222

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           +KS+P+FLY+    VH+     P Q P E    +      +R+TYAGMV+ LD +VG ++
Sbjct: 223 DKSRPLFLYVPFNGVHS-----PHQVPVEYETPYTQFEG-KRKTYAGMVTALDAAVGRIV 276

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-AIWSPW 327
            A+R   ++EN++ +F +DNG P+        SN PLR  K T ++GG+R  A A W   
Sbjct: 277 QAVRDEKLVENTLFIFSSDNGGPN---PKQLTSNGPLRAGKGTVYEGGVRVCAFATWPGK 333

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNID 386
           +K    V + + HI+DW PTL   AG  +     LDG + W  + +GA +    IL N  
Sbjct: 334 IKPGTSVQAPI-HIADWFPTLLGLAGASLKQPLPLDGRDVWATVAEGAASPHDTILINTT 392

Query: 387 NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
                     A+R  D K V G  N    D
Sbjct: 393 PSG------GAIRSGDWKLVIGGANRDDED 416


>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
          Length = 466

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 170/490 (34%), Positives = 245/490 (50%), Gaps = 67/490 (13%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT RGF
Sbjct: 1   YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 60

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
           D+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    +
Sbjct: 61  DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQI 116

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A HN  KP+FLY+A+ AVH+     P QAP +    +  I +  RR YA M+S LDE++
Sbjct: 117 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 171

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
            NV  AL+++G  +NSI+++ +DNG          GSN PLRG K T W+GG+R V  + 
Sbjct: 172 NNVTLALKRYGFYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 227

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           SP LK    +  EL HI+DW PTL   A G    D  LDG + W+ +++G ++ R +ILH
Sbjct: 228 SPLLKNKGTICKELVHITDWYPTLITLAEGWIDEDIQLDGYDIWETISEGLRSPRVDILH 287

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
           NID +          +  +  + AG                               GI  
Sbjct: 288 NIDPI--------YTKAKNGSWAAG------------------------------FGIWN 309

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
            A+++ +++K           SD +            + F+    N + N+    ++   
Sbjct: 310 TAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNAGPNRWHNERISWSAGKS 359

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
             LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P     +W 
Sbjct: 360 VWLFNITADPYERVDLSSKYPGIVKKLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVWG 419

Query: 563 PWYDELDKQK 572
           PWY E DK +
Sbjct: 420 PWYKEDDKTR 429


>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
          Length = 431

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 140/301 (46%), Positives = 189/301 (62%), Gaps = 14/301 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL EKLLP+ L+EAGY TH +GKWHLG +++   PT RGFD+
Sbjct: 1   IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60

Query: 151 HYGYWQGLQDYYDHS----CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      KA       LD R   +V      +YST+L+TE AI++IA
Sbjct: 61  YFGYLLGSEDYYSHDRCVLIKAKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIA 120

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP + P+E +  +  I D +RR YAGMV+ +DE+VGN
Sbjct: 121 HHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAVGN 175

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+++G+  N++++F  DNG  +       G+N PLRG K T W+GG+RGV  + SP
Sbjct: 176 LTDALKRYGLWNNTVLVFSTDNGGQTLA----GGNNWPLRGRKWTLWEGGVRGVGFVASP 231

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    S EL HISDWLPTL   AG   N T  LDG + W  +++G  + R E+LHNI
Sbjct: 232 LLKQKGVESHELIHISDWLPTLVHLAGGHTNGTKPLDGFDVWKAISEGRPSPRVELLHNI 291

Query: 386 D 386
           D
Sbjct: 292 D 292


>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 461

 Score =  266 bits (680), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 163/440 (37%), Positives = 234/440 (53%), Gaps = 35/440 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           +   A ++PHI++I+ADDLGW DVSFHGS QIPTPNID LA +GL L+ +YVQ+    +R
Sbjct: 1   MTAQASRRPHIVVIVADDLGWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATR 60

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           SALMTG YPIH G++H   +      +PL+  ++  ++         +   H+GF  E  
Sbjct: 61  SALMTGLYPIHTGIKHASHICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHI 120

Query: 142 TPTFRGFDSHY---GYWQG-------LQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           +      ++ +   GYW         LQ + +      F  +   +MR            
Sbjct: 121 SKGAHAVNASWWTNGYWSKRFIHAFFLQLFMNAEIFFFFNQWPCREMR------------ 168

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
                       +   HN S P+ LYLAH AVH GN Y P +AP   V KF  I D  RR
Sbjct: 169 -------SPKWKIAPLHNCSVPLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRR 221

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            +AGMV+ LD+SVG+V  AL++ G+LE++IVL  +D+G  + G+    GSN PLRG K T
Sbjct: 222 IFAGMVAALDQSVGDVFRALQRKGILEDTIVLLTSDSGGAAGGVEQGVGSNWPLRGTKKT 281

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
            W+GG+R V  +WS  L +  +V+S+L H S + P   A +  +I D  S+DG + W VL
Sbjct: 282 LWEGGVRAVGLLWSAKLARG-RVASQLIHRSAYFPLTNADSCGDITDLGSIDGKDLWTVL 340

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
                + R+E+L+NID     +   AA+RV   K V G  ++ Q D+WY   +++     
Sbjct: 341 VDDRPSPRTELLYNIDY----RWKLAAIRVGPYKLVLGASSSDQFDDWYEPIESDATNPY 396

Query: 431 PKEVLYSKAGITFNALKTKL 450
            +     KA  T  AL  KL
Sbjct: 397 LESAALRKALFTDCALLAKL 416


>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  265 bits (677), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 159/380 (41%), Positives = 218/380 (57%), Gaps = 27/380 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP++++ILADD+GW DV +H  S+I TP++D LA +G  L Q YVQ +C+P+R+ALMTG+
Sbjct: 36  KPNVVLILADDMGWGDVGWH-DSEIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGR 94

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP+  G+Q GV+     +GLPL E+ LPQ LKE GY T   GKWHLG F+  Y PT RGF
Sbjct: 95  YPMRHGLQVGVVRPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGF 154

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D  YG++ G  DY+ H          G D   + +V N+  G YST L    A  +I  H
Sbjct: 155 DHQYGHYNGALDYFTHIRDG------GFDWHRDDRV-NRDEG-YSTHLIGREATRIIGHH 206

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           + SKP+FLY+   AVHA     P Q P++    +  + +P RRTYAGM++ +DE+VG + 
Sbjct: 207 DTSKPLFLYVPFNAVHA-----PHQVPEKYKEPYSKLKEP-RRTYAGMLAAMDEAVGQIA 260

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPW 327
           +A+   GM  N+I LF +DNG P+ G      SN PLRG K T ++GG+R  A I W   
Sbjct: 261 SAIDGKGMRSNTIFLFSSDNGGPAPG---QVTSNGPLRGQKGTLYEGGVRVPAFISWEGH 317

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNID 386
           L+    V++ L HI D  PTL   AG  +  +   DG + W  +     +    IL N  
Sbjct: 318 LRPGTVVNAPL-HIVDLFPTLLTLAGAPLEQSLPTDGRDAWTAIAHHGPSPHDAILLNT- 375

Query: 387 NVDNPQKYYAALRVDDLKYV 406
               P     A+RV D K +
Sbjct: 376 ---TPDS--GAIRVGDWKLI 390


>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
          Length = 466

 Score =  265 bits (677), Expect = 5e-68,   Method: Compositional matrix adjust.
 Identities = 168/487 (34%), Positives = 247/487 (50%), Gaps = 69/487 (14%)

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIYSTDLYTEAAIN 203
           FD+ +G   G  DYY H  K       G D+  N       DN   G+YST +YT+    
Sbjct: 61  FDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDYDN---GVYSTQMYTQRVQQ 116

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           ++A H+  KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S LDE+
Sbjct: 117 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 171

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           + NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+R +  
Sbjct: 172 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAIGF 226

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
           + SP LK    V  EL HI+DW PTL + A  +I++   LDG + W+ +++G ++ R +I
Sbjct: 227 VHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDENIQLDGYDVWETISEGLRSPRVDI 286

Query: 382 LHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGI 441
           LHNID +          +  +  + AG                               GI
Sbjct: 287 LHNIDPI--------YTKAKNGSWAAG------------------------------YGI 308

Query: 442 TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNST 500
              A+++ +++K           SD +            + F+ +  N + N+    ++ 
Sbjct: 309 WNTAIQSAIRVKHWKLLTGNPGYSDWVPP----------QSFSNLGPNRWHNERITLSTG 358

Query: 501 VKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNI 560
               LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P     +
Sbjct: 359 KSVWLFNITADPYERVDLSNKYPGIVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGV 418

Query: 561 WVPWYDE 567
           W PWY E
Sbjct: 419 WGPWYKE 425


>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 504

 Score =  265 bits (676), Expect = 6e-68,   Method: Compositional matrix adjust.
 Identities = 151/383 (39%), Positives = 224/383 (58%), Gaps = 34/383 (8%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F+ +C L+      +  +  + HII I+ DDLGWNDV +H +  I TP ID LA NG+ L
Sbjct: 13  FSSSCLLI------DGLSSDQAHIIFIVVDDLGWNDVGYH-NLYIKTPTIDRLANNGVKL 65

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           N +Y   LC PSR+ LM+G++       HGVI+ G P GLPL E  +   LKEAGY+TH 
Sbjct: 66  NNYYAANLCVPSRNMLMSGRH------VHGVIM-GYPRGLPLNETTIANKLKEAGYSTHL 118

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQVDNK 187
           +GKW+ GF+ + + P  RGFD+ +G+    +D+Y H        +   D+R N + V +K
Sbjct: 119 VGKWNCGFYSKEFLPHNRGFDTFFGFVDSKEDHYTHMV------HDISDLRRNDLCVADK 172

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DIS 246
             G YST +Y      +I  H+ +KP+FL+++  AVH     EP Q P     +++  I 
Sbjct: 173 YYGNYSTIMYGNEGTTIIDNHDTNKPLFLFMSFSAVH-----EPLQVPSVYEKEYIPTID 227

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           D +RR YAGMVS +DE++ N+I  L    ML+NSI++F +DNG          G+N PLR
Sbjct: 228 DTDRRIYAGMVSCVDEAIANIIETLENKDMLKNSIIVFTSDNGGDPL----YSGNNWPLR 283

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDG 363
           G K++ W+GG+R +  ++S  L    + +  +EL HI+DW PTL   +G  I+ +  L G
Sbjct: 284 GWKASNWEGGVRALGFVYSEILPTNARGTDNNELMHITDWFPTLVDISGGNIHTEELLYG 343

Query: 364 VNQWDVLTKGAKTKRSEILHNID 386
           V+QW V+   AK+ R ++L  +D
Sbjct: 344 VSQWGVIRGLAKSPREDVLIALD 366


>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
 gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
          Length = 491

 Score =  264 bits (675), Expect = 9e-68,   Method: Compositional matrix adjust.
 Identities = 151/389 (38%), Positives = 216/389 (55%), Gaps = 27/389 (6%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F A    L    +        +PHI+ ILADDLG+ DV FHGS  I TPN+D LA  G  
Sbjct: 27  FLAAVAGLSLLSSGARAADAPRPHIVYILADDLGFADVGFHGS-DIKTPNLDHLAAQGAR 85

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L Q Y Q  CTP+R+A +TG+YP+H G+Q G I  G  +GL   E LLPQ LK+ GY T 
Sbjct: 86  LGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAIPSGAKYGLATDEFLLPQALKDVGYRTA 145

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLG   + + P  RGFDS YG   G  D++ H      + Y       N QV  +
Sbjct: 146 LVGKWHLGHADQKFWPRQRGFDSFYGPLVGEIDHFKHEAHGVTDWYH-----DNTQVKEE 200

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
               Y T+L+ + A+ +IA H+   P+FLYLA  A H      PFQAP   + ++  I+ 
Sbjct: 201 G---YDTELFGKEAVRLIAAHDPKTPLFLYLAFTAPHT-----PFQAPQSYLDQYAHIAA 252

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------- 300
           P+RR YA M++ +D+ +G+V+AAL   GM EN++++F +DNG     + + +G       
Sbjct: 253 PQRRAYAAMITAMDDQIGHVVAALTSRGMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLP 312

Query: 301 -SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEIND 358
            SN P R  K + ++GG R VA   + W  +    ++E + H+ D LPTL   AG  +  
Sbjct: 313 ASNAPYRDGKGSLYEGGTRVVA--LANWPGRIAPGAAEGVMHVVDMLPTLAKLAGASLAK 370

Query: 359 TS-LDGVNQWDVLTKGAKTKRSEILHNID 386
           +  LDGV+ W  L  G +  R+ I++N++
Sbjct: 371 SKPLDGVDVWPALAAG-QAGRAGIVYNVE 398


>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
 gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
          Length = 455

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 162/408 (39%), Positives = 226/408 (55%), Gaps = 28/408 (6%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
           M   R     +  TL        T AP+ P+I+ +LADDLG  DVS+ GS  I TP +DA
Sbjct: 1   MPAIRSLILTIALTLASVATTFATDAPR-PNIVFLLADDLGGADVSWRGSP-IKTPQLDA 58

Query: 61  LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
           LA +G  L Q YVQ +C+P+RSAL+TG+YP+  G+Q GV+     +GLPL E+ L + L+
Sbjct: 59  LANSGAKLEQFYVQPVCSPTRSALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQ 118

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
           +AGY T  +GKWHLG     Y P  RGFD  YG++ G  DY+ H          G D   
Sbjct: 119 DAGYETAIVGKWHLGHVSPAYLPMARGFDHQYGHYNGALDYFTHDRDG------GHDWHK 172

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +  V N+  G Y+T L  + A+ VI + +K KP+FLY+   AVH+     P Q P+   A
Sbjct: 173 DDHV-NRDEG-YATHLIAQEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAA 225

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
            + D+    R+ YAGMV+ LDE+VG ++  +++  ML+N++ +F +DNG P  G  ++ G
Sbjct: 226 PYGDMKK-RRQAYAGMVAALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGPEPGKLTDNG 284

Query: 301 SNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
              PLRG K T ++GG+R  A A W   +    KV + L HI DW PTL   AG  +   
Sbjct: 285 ---PLRGGKHTLYEGGVRVCAFASWKGRIAPGSKVEAPL-HIVDWYPTLIELAGGSLQQA 340

Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
             LDG N W  +T G  +    I+ NI   +       A+RV D K V
Sbjct: 341 KPLDGRNIWPSITTGEPSPHDVIVCNITPTE------GAIRVGDWKLV 382


>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 513

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 159/430 (36%), Positives = 232/430 (53%), Gaps = 40/430 (9%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F   C L+     +     K+P+I+ ILADD GW+D+ +HGS  + +P +D LA  G+ 
Sbjct: 6   LFRAVCFLIL--IRITEGGDKRPNIVFILADDFGWHDIGYHGSI-VRSPYMDFLASEGVK 62

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YVQ +C+P+R+ LM+G+Y I  G+QH VI   +   LP  E  + Q +KEAGYATH
Sbjct: 63  LENYYVQPMCSPTRAQLMSGRYQIRYGLQHLVIQPDQRACLPPDEVTIAQKMKEAGYATH 122

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD--------MR 179
            +GKWHLGF+R+   P  RGFD+ +G+   L  +Y +       P  G           R
Sbjct: 123 MVGKWHLGFYRKECLPINRGFDTFFGFLNCLIYHYTYDFGYWHTPESGNKTIMFGWDLFR 182

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           ++  V  +  G YST L+ E A  VI  H++  PMFLYL   AVH      P   P E  
Sbjct: 183 NHDCVAKEHKGEYSTILFAEEAQRVIWNHDQETPMFLYLPFAAVH-----NPLDVPREYE 237

Query: 240 AKFLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
           A +   I D +RR  A MV+ +DE++ N+  AL   GM +N++++F  DNG    G  S 
Sbjct: 238 AMYEGIIEDEQRRKKAAMVTCMDEAIWNITKALMDTGMWDNTVLIFSTDNG----GATSV 293

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI 356
            G+N PLRG K + ++GG+RGV  + SP L +  + +  ++L H++DW PT    A   +
Sbjct: 294 GGNNWPLRGGKRSMFEGGIRGVGFVTSPLLDEAVRGTENNQLIHVTDWFPTFVHLAEGNL 353

Query: 357 NDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVD-----------NPQKYY-----AALR 399
           + T  LDG NQWD +++G  + R+E L NID ++            P  Y+       +R
Sbjct: 354 SGTKPLDGFNQWDTISQGMFSPRNEFLINIDPMEICANPLATNQFRPNPYFDVCVRGGIR 413

Query: 400 VDDLKYVAGT 409
           V   K V G 
Sbjct: 414 VGKWKLVTGV 423


>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 461

 Score =  263 bits (671), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 173/526 (32%), Positives = 264/526 (50%), Gaps = 70/526 (13%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
           +GWNDV +H +S I  PN+  LA +G+I NQ Y    CTPSR+A+MTG YP   G QH +
Sbjct: 1   MGWNDVHWH-NSDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
           +    P G+PL  KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGFDSHYG W  G+
Sbjct: 60  VFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
            DY          P  G D R NM V  K+   Y   +  + A +++  H    P+FL  
Sbjct: 120 SDY--DKMNGVLSPSAGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLFLAF 176

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
             L + A +     + P+E   K+ +I D   R + G ++ +D+ +G V+ AL+  GM +
Sbjct: 177 T-LDIPAKH----LEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           +S+++F+ DNGA    + S  GSN+P RG+  T ++G  R  + +    +K+T   S+EL
Sbjct: 232 DSLIIFIGDNGA----LSSQSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287

Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAAL 398
           + I D   T+   AG E  +  LDG++ W   +KG  + R+EIL+NID  D+P    AA+
Sbjct: 288 YSIVDIHRTILDVAGAEA-EPELDGLSMWPTWSKGKPSPRNEILYNID--DDPVSPGAAI 344

Query: 399 RVDDLKYVAGTDNNGQSDEWYGDTDNEI-DKYSPKEVLYSKAGITFNALKTKLQIKQKHA 457
           R+ D K +      G  D  Y   D  + D Y      Y+      + L +  Q  Q HA
Sbjct: 345 RIGDYKLIT-----GHPDLLYPYRDISLTDNY------YNYGDAPASGLPSTDQ--QSHA 391

Query: 458 ADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNN 517
             P   + + L  I+ D                                     P E+N+
Sbjct: 392 --PAPQNVEYLFNIVVD-------------------------------------PEERND 412

Query: 518 LAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           ++    D+++ L  +L  ++  ++PP N+  +       +N++W P
Sbjct: 413 ISADHPDIVQDLRARLDEHRKLLIPPVNQTTELAGAAGNFNDVWSP 458


>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
          Length = 431

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 138/303 (45%), Positives = 185/303 (61%), Gaps = 14/303 (4%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           + IH G+QH +I   +P  +PL EKLLPQ LKEAGYATH +GKWHLG +R+   PT RGF
Sbjct: 1   FKIHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 60

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           D+++GY  G +DYY H      +        LD R   +V      +YST+++TE A  +
Sbjct: 61  DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTL 120

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  RR YAGM S +DE+V
Sbjct: 121 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAV 175

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           GNV AAL + G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + 
Sbjct: 176 GNVTAALERRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVA 231

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
           SP LK+    + EL HISDWLPTL   AG   N T  LDG + W+ +++G+ + R E+LH
Sbjct: 232 SPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTKPLDGFDVWNTISEGSPSPRMELLH 291

Query: 384 NID 386
           NID
Sbjct: 292 NID 294


>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 486

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 168/477 (35%), Positives = 239/477 (50%), Gaps = 55/477 (11%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
           WA     A T       A  +  A ++P+I+ I+ADDLGW DV F+G ++I TPNIDALA
Sbjct: 9   WAAVLLVAFT-------AVASQAADRQPNIVHIVADDLGWKDVGFNGCTEIKTPNIDALA 61

Query: 63  YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
             G   +Q YVQ +CTP+R+ LMTG++P   G+Q  VI     +GL  +E L+PQ L +A
Sbjct: 62  KGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVIPTAAGYGLDTSEYLMPQCLGDA 121

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           GY T  IGKWHLG   + Y P  RGFD  YG   G  DY+ H      + +     R N 
Sbjct: 122 GYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGELDYFTHDEHGVLDWF-----RDNK 176

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
            V  +    Y+T L  + A+  I   +  KP +LYL   A H      P+QAP E + K+
Sbjct: 177 PVHEQG---YTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKY 228

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS- 301
           L+I++P RRTYA MV  LDE++G V+AAL + G+ EN+++ F +DNG     + + + + 
Sbjct: 229 LNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLRENTLIFFHSDNGGTKDKMFAGQMAD 288

Query: 302 ---------NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
                    N P R  K + ++GG R  A    P   + Q V   + H  D  PT  A A
Sbjct: 289 MSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPGKIKAQTVDG-MIHAVDLYPTFAALA 347

Query: 353 GIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
           G  I     LDG N WD + +G  + R+E  ++I      + + A LR  D K +     
Sbjct: 348 GASIAKCKPLDGTNVWDTIAEGKPSPRTEFFYSI------EPFRAGLRQGDWKLI----- 396

Query: 412 NGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT------FNALKTKLQIKQKHAADPKA 462
                 W     + +D Y+  E  Y K  I          ++ +++   K AA P A
Sbjct: 397 ------WRTMLPSSVDLYNLAEDPYEKNNIAAAHPDKVATMQARIETASKDAAKPLA 447


>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
          Length = 547

 Score =  262 bits (669), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 177/550 (32%), Positives = 261/550 (47%), Gaps = 115/550 (20%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 50  HIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 108

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +       +                         RGFD+
Sbjct: 109 IHTGLQHSIIRPRQPNCLPLDQSXXXXXXR-------------------------RGFDT 143

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V     G YST LY +   +++A H+
Sbjct: 144 FLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHS 201

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 202 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITW 256

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
           AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP LK
Sbjct: 257 ALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLK 312

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
           + ++ S  L HI+DW PTL   AG   +    LDG + W  +++G  + R+EILHNID +
Sbjct: 313 RKRRTSRALVHITDWYPTLVGLAGGTASAADGLDGYDVWPAISEGRASPRTEILHNIDPL 372

Query: 389 DNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
            N  ++               AA+RV + K + G       D  YGD       + P + 
Sbjct: 373 YNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIPPQT 418

Query: 435 LYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKG 494
           L +  G  +N                              E++   R+            
Sbjct: 419 LAAFPGSWWNL-----------------------------ERMASARQAV---------- 439

Query: 495 AHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADP 554
                     LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P
Sbjct: 440 ---------WLFNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENPRAHP 490

Query: 555 ARWNNIWVPW 564
                 W PW
Sbjct: 491 DFNGGAWGPW 500


>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
 gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
          Length = 500

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 146/409 (35%), Positives = 234/409 (57%), Gaps = 39/409 (9%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRS 82
           T  +KP+++ I+ADDLG+ND+ +  S+ + TPN+  LA+  N  +L   YV  LCTP+RS
Sbjct: 26  TLTQKPNVLFIMADDLGFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLCTPTRS 84

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           A MTG YP  +G Q+GV L  EP G+P     L + +++  Y+T+ +GKWHLG+ ++ + 
Sbjct: 85  AFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFL 144

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQVDNKTI------GIY 192
           PT RGFD  YG++     Y++HS     +      +GLD+   +  + K++      G+Y
Sbjct: 145 PTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVG-NGKSVPDFSQNGVY 203

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVH------AGNTYEPFQAPDEEVAKFLD-- 244
           STDL+T+ A++V+  HN +KP F++L++ AVH      +     P  A   EV +  D  
Sbjct: 204 STDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSFDKY 263

Query: 245 ---ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKG 300
              + D  RR Y GM++ +D ++G ++  L+   + EN++++F +DNG  S FG      
Sbjct: 264 CSRVKDRYRRIYCGMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTSNFG-----A 318

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
           SN PLRG K T W+GG +    + SP   +       +FH+ DW  T+ +  G+EI D+ 
Sbjct: 319 SNAPLRGEKDTIWEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITGLEI-DSY 377

Query: 361 LDGVNQWDVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
            DG+NQW+ +  G  K +R + ++NIDN      + +A+R  D K + G
Sbjct: 378 GDGINQWEYIKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 420


>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  261 bits (668), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 190/585 (32%), Positives = 278/585 (47%), Gaps = 76/585 (12%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
            ++   LF   F    A +  H+++I+ADD GW DVSFH +  I TPNID L   GL L 
Sbjct: 3   GISILYLFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELT 62

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT-HA 128
            +Y Q +CTP+RSAL+TG+YPIH G+Q  VI   +  GL   E LLP+YL+       H 
Sbjct: 63  NYYTQHICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHM 122

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVD-- 185
           +GKWH+G      TP  RGF++  GY  G +D+Y          + G+D   H+  +   
Sbjct: 123 VGKWHVGHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGP 182

Query: 186 -NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
            N + G YS DLY +    ++   + +K  F+Y A   VH      P QA +  + K+  
Sbjct: 183 TNSSWGKYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSW 237

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I D  RR YA MV+ LDE VG+VI    K+ +L+N+I  F+ADNG    G   + G+N P
Sbjct: 238 IKDTNRRNYAAMVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNG----GETRDGGNNWP 293

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
           L G K T  +GG++    ++   +   +   + LFH+SD LPT+  A    I    +DG 
Sbjct: 294 LSGQKWTLLEGGVKATGFMFGGLVPLGKY--NGLFHVSDILPTVMEAFNCPIK-KKIDGY 350

Query: 365 NQWDVLTKGAKTKRSEILHNID--------------NVDNPQKYYAALRVDDLKYVAGTD 410
            QW+ +       RS+ILHNID              N D   +  AALR    K + G  
Sbjct: 351 AQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVRVQAALRKGPWKLITGDP 410

Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
                    G T +E+ ++      + + G      KT+  I Q         SSD    
Sbjct: 411 ---------GFTSDEVPEFPVSPPEWHQQG----PYKTRGLISQI-----TGKSSDYY-- 450

Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
                              +DN   H N  V+  L+ I +DP E  ++++    +++++ 
Sbjct: 451 -------------------FDN---HRNKLVR--LYKIDEDPTESRDISDQNPSIVEEML 486

Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWY-DELDKQKAI 574
           + L  Y STMVPP     D  ++P      W PW  DE + +K +
Sbjct: 487 NILIKYNSTMVPPQLPDLDPFSNPKLHEGFWFPWISDEYNSKKQM 531


>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  261 bits (667), Expect = 8e-67,   Method: Compositional matrix adjust.
 Identities = 145/381 (38%), Positives = 218/381 (57%), Gaps = 17/381 (4%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PH+I+I  DD+GWNDV +H +  I  PN+  LA +G+I  Q Y    CTPSR+A+MTG 
Sbjct: 26  QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGL 84

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G QH ++    P G+PL  KLLP+ LKE GYATH +GKWHLGF ++ Y PT RGF
Sbjct: 85  YPFKTGNQHQMVFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGF 144

Query: 149 DSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           DSHYG W  G+ DY          P  G D R N++V  K+ G Y   +  + A  ++  
Sbjct: 145 DSHYGLWTLGVGDY--DKLNGVLSPSAGYDFRDNLEVVPKSDG-YLALMLGDRAEEIVNN 201

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+   P+FL    L + A +     + P+E    + DI D   R + G ++ +D+ +G V
Sbjct: 202 HSPETPLFLVFT-LDIPAKH----LEIPEEYEELYADIEDDRTRQFYGKLTMMDDIIGGV 256

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           + AL+   M ++S+ +F+ DNGA    + S  GSN+P RG+  T ++G  R  + +    
Sbjct: 257 VDALKDRDMYDDSLFIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPSIVSGKG 312

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           +++T   S+EL+ I D   T+   AG +  +  LDG++ W   ++   + R+EI++NID 
Sbjct: 313 IEKTGYKSNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDEPSPRNEIIYNID- 370

Query: 388 VDNPQKYYAALRVDDLKYVAG 408
            D+P    AA+R+ D K + G
Sbjct: 371 -DDPVSPGAAIRIGDYKLITG 390



 Score = 46.2 bits (108), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 19/59 (32%), Positives = 36/59 (61%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFNI DDP E+++++    D+++ L D+L  ++  + PP N+  +     A +N++W P
Sbjct: 436 LFNIVDDPEERHDISADNQDIVQDLRDRLDEHRKLLHPPVNQTTELAGSAANFNDVWSP 494


>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
          Length = 514

 Score =  260 bits (664), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 145/372 (38%), Positives = 214/372 (57%), Gaps = 18/372 (4%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GWNDVS+H + ++ +P ++ LA  G+IL+QHY    CTP+R+ALMTG+YP  +G+Q   I
Sbjct: 36  GWNDVSWH-NERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
              EP GLPL    L + LK  GY THA GKWHLG+  +  TPT RGFD+  G++ G QD
Sbjct: 95  RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154

Query: 161 YYDHSCKA--TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
           Y+ H+     T    +G D R+  +VD    G+Y+T L     ++ I E    KPMFLY+
Sbjct: 155 YFSHTLSGGKTSATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLYV 214

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           A  AVHA     P Q P +         +P+R+    M++ +D +VG ++  L+   M  
Sbjct: 215 AFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMWR 269

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           N++++F++DNG    G     G+N PLRG K+T ++GG R  A +  P ++   + SS +
Sbjct: 270 NTLLVFISDNG----GQILYGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGGRNSSSI 325

Query: 339 FHISDWLPTLCAAAGIEINDTSL---DGVNQWDVLTKGA-KTKRSEILHNIDNVDNPQKY 394
            H+ DW PTL +AAG E  ++ L   DGV+QW  +  G     R+E ++N+D      K 
Sbjct: 326 IHVVDWFPTLLSAAGGESAESQLKDMDGVDQWAAIRDGTLDGTRTEFVYNLDCKKG--KL 383

Query: 395 YAALRVDDLKYV 406
             A+R  DLK++
Sbjct: 384 VGAIRDGDLKFI 395


>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 739

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 154/363 (42%), Positives = 211/363 (58%), Gaps = 29/363 (7%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QHGVI   +P  LPL E  LPQ LKE+GYAT+ +GKWH+GF+ +  TPT RGFDS
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377

Query: 151 HYGYWQGLQDYYDHS--CK------ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
            +G+  G +D+Y HS  C+       + +   G D+  N        G YST L+T   I
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I  H+K+KP+F+YLA+ AVH      P Q PD  +  + +I+D  RRTYAGMVS +DE
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHL-----PLQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            +GNV  AL+  G+ +N+I++F  DNG P   I S   +N PLRG K   W+GG+ GV  
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGP---IQSG-ANNWPLRGSKGNMWEGGIHGVGF 548

Query: 323 IWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIEINDTS-LDGVNQWDVLTKGAKTKR 378
           + SP L  + K  V+ +L H+SDWLPT+ A  AG  +N T  LDG + W+ ++  A + R
Sbjct: 549 VHSPLLPTSVKGTVNHDLMHVSDWLPTIVAGIAGGALNGTKPLDGFDVWNSISGTASSPR 608

Query: 379 SEILHNID-------NVDNPQKYYAALRVDDLKYVAGTDNNGQS-DEWYGDTDNEIDKYS 430
           +EILHNID       N        AALRV D K   G D +  + + W    ++ +D   
Sbjct: 609 TEILHNIDLKKKQPANEKFDTSIRAALRVGDWKIKTGADYDVTTHNVWMPPPESGLDTIY 668

Query: 431 PKE 433
           P E
Sbjct: 669 PTE 671



 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 31/60 (51%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP EQ +L+E   D++ +L  KL  Y    VP      D  ADPA     W PW
Sbjct: 679 LFNITADPLEQTDLSEKYPDVVNELVQKLEEYYEGSVPMRYPDEDLEADPALRGGSWGPW 738


>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
          Length = 519

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/301 (45%), Positives = 183/301 (60%), Gaps = 14/301 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RGFD+
Sbjct: 91  IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      +        LD R   +V      +YST+++TE A  +I 
Sbjct: 151 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIT 210

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+VGN
Sbjct: 211 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAVGN 265

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP
Sbjct: 266 VTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVASP 321

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    + EL HISDWLPTL   A    N +  LDG + W+ +++G+ + R E+LHNI
Sbjct: 322 LLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPLDGFDVWETISEGSPSPRMELLHNI 381

Query: 386 D 386
           D
Sbjct: 382 D 382


>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
          Length = 339

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 138/324 (42%), Positives = 200/324 (61%), Gaps = 17/324 (5%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+I+ I+ADDLGWNDVS HGS QIPTP+IDA+A++G+ L  ++VQ +CTP+RS  ++G
Sbjct: 31  KRPNIVFIVADDLGWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSG 90

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++ IH G+ +    +G    L L+  LLP YLK+ GY T A+GKWHLG   E   PT RG
Sbjct: 91  RHVIHTGI-YMPFAQGTALRLNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRG 149

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD + GYW G +DYY H      + + G D +   +   K    YST ++ E A+N I E
Sbjct: 150 FDEYLGYWSGAEDYYTH------DTHGGYDFQDGTECAIKYNNTYSTYIFAERAVNTILE 203

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
            +  +P+FLY A   VH      P +AP E VA+F  I + ER+  A M S LD++VGN+
Sbjct: 204 ADPEQPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNI 258

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
             AL++  + +N+I++F +DNG P   +H  + + +N+PLRG K+T W+GG + V  I  
Sbjct: 259 TDALKRSRIADNTILIFTSDNGGP---VHDENTESNNYPLRGGKNTLWNGGTQVVGMIAG 315

Query: 326 PWLKQTQKVSSELFHISDWLPTLC 349
             ++        + H SDWLP+L 
Sbjct: 316 KGIENPGTDCHGMMHASDWLPSLV 339


>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 186/574 (32%), Positives = 272/574 (47%), Gaps = 75/574 (13%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
            ++   LF   F    A +  H+++I+ADD GW DVSFH +  I TPNID L   GL L 
Sbjct: 3   GISILYLFWYLFSFVYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELT 62

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT-HA 128
            +Y Q +CTP+RSAL+TG+YPIH G+Q  VI   +  GL   E LLP+YL+       H 
Sbjct: 63  NYYTQHICTPTRSALLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHM 122

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVD-- 185
           +GKWH+G      TP  RGF++  GY  G +D+Y          + G+D   H+  +   
Sbjct: 123 VGKWHVGHGHSWMTPWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGP 182

Query: 186 -NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
            N + G YS DLY +    ++   + +K  F+Y A   VH      P QA +  + K+  
Sbjct: 183 TNSSWGKYSGDLYLQKMSEIVKSIDPTKDSFIYFAPQHVHY-----PLQALERHLVKYSW 237

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I D  RR YA MV+ LDE VG+V+    K+ +L+N+I  F+ADNG    G   + G+N P
Sbjct: 238 IKDTNRRNYAAMVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNG----GETRDGGNNWP 293

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
           L G K T  +GG++    ++   +   +   + LFH+SD LPT+  A    I    +DG 
Sbjct: 294 LSGQKWTLLEGGVKATGFMFGGLVPLGK--YNGLFHVSDILPTVMEAFNCPIK-KKIDGY 350

Query: 365 NQWDVLTKGAKTKRSEILHNID--------------NVDNPQKYYAALRVDDLKYVAGTD 410
            QW+ +       RS+ILHNID              N D   +  AALR    K + G  
Sbjct: 351 AQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVRVQAALRKGPWKLITGDP 410

Query: 411 NNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT 470
                    G T +E+ ++      + + G      KT+  I Q         SSD    
Sbjct: 411 ---------GFTSDEVPEFPVSPPEWHQQG----PYKTRGLISQI-----TGKSSDYY-- 450

Query: 471 ILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLE 530
                              +DN   H N  V+  L+ I +DP E  ++++    +++++ 
Sbjct: 451 -------------------FDN---HRNKLVR--LYKIDEDPTESRDISDQNPSIVEEML 486

Query: 531 DKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           + L  Y STMVPP     D  ++P      W PW
Sbjct: 487 NILIKYNSTMVPPQLPDLDPFSNPKLHEGFWFPW 520


>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 530

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 148/407 (36%), Positives = 226/407 (55%), Gaps = 32/407 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSA 83
           +AP+ PHI+ +L DD+GWND+ +  +     TP +  LA NG+ L Q+Y Q+ CT SR+A
Sbjct: 22  SAPETPHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQSTCTVSRAA 81

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++G  P+H G+ HG I+   P GLPL  KLLPQYL+E+GY T+ +GKW +G F E Y P
Sbjct: 82  LLSGVLPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIGHFNEEYLP 141

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATF--EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
             RGFD  +G++     Y+ H     +   P    D+R+  +        Y+TDL+ E A
Sbjct: 142 HNRGFDHFFGFYGADITYFSHISSRGYCANPNCFPDLRNEDETMANASMRYTTDLFRERA 201

Query: 202 INVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDE----EVAKFLDISDPERRTYA 254
           +  +  H   + + P+FLYL+  A H      P  AP E    E       ++ ERR +A
Sbjct: 202 VGFVEGHAANHATDPLFLYLSFNAPH-----YPTSAPQEFMRNEAELLAPFTNRERRVFA 256

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH-------SNKGSNHPLRG 307
            +V+  D SVG V+ AL+  G   +S++LF +DNGA     +       SN GSN PLRG
Sbjct: 257 AVVNYADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESNTGSNWPLRG 316

Query: 308 MKSTPWDGGMRGVAAIWSPWL---KQTQKVSSELFHISDWLPTLCAAAGIEINDTSL--- 361
           MK+T W+GG R  A + +P     + +  +   + HI+DW+PT+    G+  + ++L   
Sbjct: 317 MKATYWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI--VRGVLRSSSTLLGA 374

Query: 362 --DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
              G + WD + K A   R+E+L+ +D+     +   A+RVD+ K++
Sbjct: 375 EVSGADHWDSIRKLATPPRTELLYGVDHCVAGGQLTGAIRVDNWKFI 421


>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
          Length = 431

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 137/303 (45%), Positives = 184/303 (60%), Gaps = 14/303 (4%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           + IH G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RGF
Sbjct: 1   FKIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 60

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           D+++GY  G +DYY H      +        LD R   +V      +YST+++TE A  +
Sbjct: 61  DTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 120

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+V
Sbjct: 121 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAV 175

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           GNV AAL+ HG+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + 
Sbjct: 176 GNVTAALKSHGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVA 231

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILH 383
           SP LKQ    + EL HISDWLPTL   A    N +  LDG + W+ +++G+ + R E+LH
Sbjct: 232 SPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPLDGFDVWETISEGSPSPRMELLH 291

Query: 384 NID 386
           NID
Sbjct: 292 NID 294


>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
 gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
          Length = 498

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 150/368 (40%), Positives = 210/368 (57%), Gaps = 26/368 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ IL DDLG+ DV +HGS  + TPN+D LA  G  L Q Y Q LCTP+R+ALMTG 
Sbjct: 52  RPHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGC 110

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP+  G+Q GVI  G  +GL   E LLPQ LKEAGY T  +GKWHLG   + Y P  RGF
Sbjct: 111 YPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGF 170

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D  YG   G  D++ H      + Y     R N  V       Y T+L+   AI +I EH
Sbjct: 171 DYFYGPLVGEIDHFKHEAHGITDWY-----RDNEMVKEPG---YDTELFGADAIRLIEEH 222

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           + + P+++YL+  A H      P+QAPD+    + DI+D  R  YA M+S +D+ VG V+
Sbjct: 223 DSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVL 277

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--------NHPLRGMKSTPWDGGMRGV 320
            AL + GM E+++V+F +DNG     + + +G+        N PLR  K T ++GG R V
Sbjct: 278 QALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVV 337

Query: 321 A-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
           A A W   +   +  +  + H+ D LPTL   A  EI +   LDG++ W  ++ G  + R
Sbjct: 338 ALANWPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDVWQAISAGKASPR 395

Query: 379 SEILHNID 386
            E+++NI+
Sbjct: 396 EEVVYNIE 403


>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
           sphaeroides KD131]
 gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
           KD131]
          Length = 509

 Score =  259 bits (663), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 149/368 (40%), Positives = 211/368 (57%), Gaps = 26/368 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ IL DDLG+ DV +HGS  + TPN+D LA  G  L Q Y Q LCTP+R+ALMTG+
Sbjct: 63  RPHILYILVDDLGYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGR 121

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP+  G+Q GVI  G  +GL   E LLPQ LKEAGY T  +GKWHLG   + Y P  RG 
Sbjct: 122 YPMRYGLQTGVIPSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGV 181

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D  YG   G  D++ H      + Y     R N  V       Y T+L+   AI +I EH
Sbjct: 182 DYFYGPLVGEIDHFKHEAHGITDWY-----RDNEMVKEPG---YDTELFGADAIRLIEEH 233

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           + + P+++YL+  A H      P+QAPD+    + DI+D  R+ YA M+S +D+ VG V+
Sbjct: 234 DSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVL 288

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--------NHPLRGMKSTPWDGGMRGV 320
            AL + GM E+++V+F +DNG     + + +G+        N PLR  K T ++GG R V
Sbjct: 289 QALERRGMREDTLVIFHSDNGGTRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVV 348

Query: 321 A-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
           A A W   +   +  +  + H+ D LPTL   A  EI +   LDG++ W  ++ G  + R
Sbjct: 349 ALANWPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAHAGQLDGMDVWQAISAGKASPR 406

Query: 379 SEILHNID 386
            E+++NI+
Sbjct: 407 EEVVYNIE 414


>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 480

 Score =  259 bits (661), Expect = 3e-66,   Method: Compositional matrix adjust.
 Identities = 150/386 (38%), Positives = 220/386 (56%), Gaps = 17/386 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           T    +PHII+I  DD+GWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A
Sbjct: 4   TEKSDQPHIILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAA 62

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           +MTG YP   G QH ++    P G+PL  KLLP+  KE GY+TH +GKWHLGF ++ Y P
Sbjct: 63  MMTGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLP 122

Query: 144 TFRGFDSHYGYWQ-GLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           T RGFDSHYG W  G+ DY          P  G D R NM V  K+ G Y   +  + A 
Sbjct: 123 TNRGFDSHYGIWTLGVGDY--DKMNGVLSPSAGYDFRDNMGVVPKSNG-YLALMLGDRAE 179

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           +++  H    P+FL    L + A +     + P+E    + DI D   R + G ++ +D+
Sbjct: 180 HIVNNHYPGTPLFLAFT-LDIPAKH----LEIPEEYEEPYADIEDDRTRQFYGKLTMMDD 234

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            +G V+ AL+  GM ++S+++F+ DNGA    + S  GSN+P RG+  T ++G  R  + 
Sbjct: 235 IIGGVVDALKNRGMYDDSLIIFIGDNGA----LSSQAGSNYPFRGIAGTLFEGATRVPSI 290

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +    +K+T   S+EL+ I D   T+   AG +  +  LDG++ W   ++   + R+EIL
Sbjct: 291 VSGKGIKKTGYESNELYSIVDIHRTILDVAGAKA-EPELDGLSMWPTWSEDKPSPRNEIL 349

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAG 408
           +NID  D+P    AA+R+ D K + G
Sbjct: 350 YNID--DDPVSPGAAIRIGDYKLITG 373



 Score = 46.6 bits (109), Expect = 0.038,   Method: Compositional matrix adjust.
 Identities = 17/59 (28%), Positives = 38/59 (64%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFN+  DP E+N+++  + D++++L D+L  +++ ++PP N+  +       +N++W P
Sbjct: 419 LFNVVVDPEERNDISADRPDIVQELRDRLDEHRNLLIPPVNQTTELAGAAGNFNDVWSP 477


>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 480

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 188/554 (33%), Positives = 253/554 (45%), Gaps = 106/554 (19%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           GW DVSFHGS+QIPTPNID LA +G+IL+ +Y   LCTPSR+ALMTG YPIH GM  GVI
Sbjct: 5   GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
            +  PWGL L  K++PQ+ ++ GY  + IGK H                 H G       
Sbjct: 65  QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSH-----------------HDG------- 100

Query: 161 YYDHSCKATFEPYQGLDMRH------------NMQVDNKTIGIYSTDLYTEAAINVIAE- 207
                C        G+D+ H            N  +  KT   Y T         VI   
Sbjct: 101 -----CHNFDSTKSGIDLLHTPLISSIPGQHDNTWMYGKTYNFYRTRAICRLQKRVIQGV 155

Query: 208 ---------HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
                       S+P F YL+H AVH+    +PFQAP   + KF  I +  R  YAG   
Sbjct: 156 VVYLIRFLLDLHSQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETK 215

Query: 259 RLDES-VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           +     +G  + +L K     + +++  +DNG   FG+ SN+G N PLRG K+  W+GG+
Sbjct: 216 KTHLPWLGLALVSLVKTKNYVDILLMGFSDNGGSPFGLFSNRGFNWPLRGAKANLWEGGV 275

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
           R  A +WS    +  +VS +L HISDWLPTL +AAG ++ N  S+DG++ W  LT G  +
Sbjct: 276 RVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNLGSMDGIDMWRYLTLGFGS 335

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
            R E+L  ID    P+   AALR  + K V G   N            E+D        Y
Sbjct: 336 PRKEVLLTID----PKNNLAALRYKNYKLVVGEGFN-----------EELDGR------Y 374

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAH 496
           S  G     L+    + Q                +  D K+      ARV  ++  K   
Sbjct: 375 SFPG----GLRPTNDVPQ----------------LRKDSKV------ARVLKSFYGKQNR 408

Query: 497 CNSTVKPCLFNITDDPCEQ----NNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
             S +      + D  C +    NN    Q    + +  KLA Y +T VPP  +P D RA
Sbjct: 409 NWSPLDWSKNPVVD--CRERWLTNNFVARQPPYFRFMMSKLAAYNATAVPPLFQPQDPRA 466

Query: 553 DPARWNNIWVPWYD 566
            P     +W PW D
Sbjct: 467 YPEYHGGVWSPWLD 480


>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 486

 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 156/393 (39%), Positives = 217/393 (55%), Gaps = 33/393 (8%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           AP  P+II I+ADDLGW DV F+G   I TP++DALA  G    Q YVQ +CTP+R+ALM
Sbjct: 26  APSAPNIIHIVADDLGWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALM 85

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+YP   G+Q  VI     +GL   E LLPQ L++AGY T  IGKWHLG   + + P  
Sbjct: 86  TGRYPFRYGLQTAVIPSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQ 145

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGF+  YG   G  DYY HS     + +     R N  V  +    Y+T+L    A+  +
Sbjct: 146 RGFEYQYGAMIGELDYYTHSEHGVLDWF-----RDNEPVHEEG---YTTNLLGADAVKYL 197

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            +    +P +LYLA  A H      P+QAP E + ++  I+DP RRTYAGMV+ LD+++G
Sbjct: 198 EKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTRRTYAGMVACLDDNIG 252

Query: 266 NVIAALRKHGMLENSIVLFMADNGA---PSF-GIHSNKGS------NHPLRGMKSTPWDG 315
            V+AAL K G+ EN+++LF +DNG    P F G+ ++         N P R  K + ++G
Sbjct: 253 RVVAALDKKGLRENTLILFHSDNGGTHNPMFAGVMADVSKIKIPCDNSPYRDGKGSLFEG 312

Query: 316 GMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKG 373
           G+R  A A W   +K        L H +D   T    AG   +    LDGV+ W  + +G
Sbjct: 313 GVRVCALANWPGKIKPA--TVDGLIHATDIFTTFTKLAGASTSRCKPLDGVDAWPTMAEG 370

Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
             + R E+++N+      + Y AALR  D K +
Sbjct: 371 RPSLREEVVYNV------EPYRAALRQGDWKLI 397


>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
          Length = 365

 Score =  258 bits (659), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 151/387 (39%), Positives = 208/387 (53%), Gaps = 35/387 (9%)

Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
           + L+HLAVH GN   P QAP+EEVAKF  I DP+RRTYAGM+S LD+SV   I AL+ +G
Sbjct: 1   MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKDNG 60

Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
           ML NSI+L  +DNGAP+ GIHSN GSN+P RG K +PW+GG+R   A+WSP LK+   VS
Sbjct: 61  MLNNSIILLYSDNGAPTIGIHSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKERGYVS 120

Query: 336 SELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
           ++  H  DWLPTL  AAG+ +  D  LDG+N W +L+   + K   ++H +D V      
Sbjct: 121 NQAIHAVDWLPTLAGAAGVSLPQDLPLDGINLWPMLSGNEEPKPRTMIHVLDEVFG---- 176

Query: 395 YAALRVDDLKYVAGTDNNGQSDEWYGDTD-NEIDKYSPKEVLYSKAGITFNALKTKLQIK 453
           Y++   D LKYV G+   G+ D+W G+ + NE D                       +  
Sbjct: 177 YSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG--------------------ESY 216

Query: 454 QKHAADPKANSSDALRTILTDEKILELREFARVRC------NYDNKGAHCNSTVKPCLFN 507
           ++H       S    R  LT ++I ++R  A   C      N       C     PC F+
Sbjct: 217 EQHVLASDVQSLLGNRG-LTKDRIRQMRSEATETCPPIEGQNPLESHFKCEPLKAPCFFD 275

Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPF-DKRADPARWNNIWVPWYD 566
           +  DPCE+ NLA+     L+QL D+L   + T +P    P  D RA+P   N  W  W++
Sbjct: 276 LAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHNGNW-EWWN 334

Query: 567 ELDKQKAIETMRLHKPLLSPTTITLIY 593
             D Q    T  ++  L   + I L+Y
Sbjct: 335 NTDTQSGSGTFSINLWLAMGSIIGLVY 361


>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
          Length = 551

 Score =  258 bits (659), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 182/301 (60%), Gaps = 14/301 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           I  G+QH +I   +P  +PL EKLLPQ+LKEAGY TH +GKWHLG +R+   PT RGFD+
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      +        LD R   +V      +YST+++T+ A  +I 
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAATLIT 242

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+VGN
Sbjct: 243 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 297

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP
Sbjct: 298 VTAALKSRGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 353

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    + EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E+LHNI
Sbjct: 354 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRVELLHNI 413

Query: 386 D 386
           D
Sbjct: 414 D 414


>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
          Length = 476

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 165/493 (33%), Positives = 244/493 (49%), Gaps = 89/493 (18%)

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+R+   PT RG
Sbjct: 8   RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V     G YST LY +   +++A
Sbjct: 68  FDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILA 125

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 126 SHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRN 180

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+++G   NS+++F +DNG  +F    + GSN PLRG K T W+GG+RG+  + SP
Sbjct: 181 ITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRGRKGTYWEGGVRGLGFVHSP 236

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            LK+ ++ S  L HI+DW PTL   AG   + T  LDG + W  +++G  + R+EILHNI
Sbjct: 237 LLKRKRRTSRALVHITDWYPTLVGLAGGTASATDGLDGYDVWPAISEGRASPRTEILHNI 296

Query: 386 DNVDNPQKY--------------YAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
           D + N  ++               AA+RV + K + G       D  YGD       + P
Sbjct: 297 DPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYGD-------WIP 342

Query: 432 KEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
            + L +  G  +N                              E++   R+         
Sbjct: 343 PQTLAAFPGSWWNL-----------------------------ERMASARQAV------- 366

Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
                        LFNI+ DP E+ +LA  + D+++ L  +L  Y  T +P      + R
Sbjct: 367 ------------WLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENPR 414

Query: 552 ADPARWNNIWVPW 564
           A P      W PW
Sbjct: 415 AHPDFNGGAWGPW 427


>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
          Length = 551

 Score =  257 bits (657), Expect = 1e-65,   Method: Compositional matrix adjust.
 Identities = 175/517 (33%), Positives = 269/517 (52%), Gaps = 47/517 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II I  DD G+NDV F   + I +PN+DALA +G+IL   Y    C+PSR + M+G+
Sbjct: 25  QPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGR 83

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y     MQHGVIL+ +P  L L   +LP YLKE GY THA GKWHLG+ R+  TPT RGF
Sbjct: 84  YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 143

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DS  G + G  ++Y+H+  AT   Y   D     +VD   IG +S DL        + E+
Sbjct: 144 DSFSGGFSGEGEFYEHTT-ATGGYY---DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEY 199

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMVSRLDESVGN 266
           ++S P+F+Y+A   VH+     P     E +A +  LD++D +R+ Y G+VS +D  +G 
Sbjct: 200 DQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLVSGMDYIIGG 253

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++  +++ G+ EN+ +LF +DNG         +G N P RG KST W+GG +  +   SP
Sbjct: 254 IVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSP 308

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            L  T   ++ L H++DWLPT+   AG  +  +  LDGV Q D++     + R+ +++NI
Sbjct: 309 LLGATGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGVVQTDMVINNGDSARTTMIYNI 368

Query: 386 D-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           D   D  Q  +  +   +L+Y       GQ D +  D      KY      Y +A   + 
Sbjct: 369 DRESDIGQPKFGEMACRNLQYKLIWGFPGQQDGYGADA-----KYIYDIPFYQEAAQAY- 422

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
                  I  + + D K         +LT ++   L  +       D + A  + T    
Sbjct: 423 -------IDAEGSTDKKRGGP----YVLTADQTSVLNNYMSSVHVSDQELA--DGTGSML 469

Query: 505 LFNITDDPCEQNNLAES--------QTDLLKQLEDKL 533
           LF++ +DP E  +L+ES        + +L+K ++D +
Sbjct: 470 LFDLINDPNETTDLSESTDPDVVAAKAELIKMIQDAV 506


>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 476

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 152/384 (39%), Positives = 217/384 (56%), Gaps = 30/384 (7%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
             LT T  F  A        KP+II+ILADDLGWNDV FHGS +I T N+D LA +G+ L
Sbjct: 14  LTLTITYYFRGANNLKAEINKPNIIVILADDLGWNDVGFHGS-EIKTTNLDKLAVSGVRL 72

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
            + YV+A+CTP+R+A +TG++P   GM     +   PW   GLPL EK + + LKEAGY 
Sbjct: 73  ERFYVKAMCTPTRAAFLTGRHPFRYGMS---AINVTPWSETGLPLEEKTIAETLKEAGYY 129

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  +GKWHLG ++E Y PT RGFD HYG++    DY+ H          GLD   N    
Sbjct: 130 TAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGIDYFTHKSG------DGLDWHRN---- 179

Query: 186 NKTIGI--YSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           N  + I  YSTDL  + A+ +I  H+  K P+FLY+A  A H      P QA  E++  +
Sbjct: 180 NNPVYIEGYSTDLIAQDAVQLINNHDYHKNPLFLYIAFNAPHI-----PLQAKAEDLEDY 234

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG---IHSNK 299
           L I D +RR +A  V  +DE++G++I +L+   +  N++++F +DNG          + +
Sbjct: 235 LTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQVWNNTLLVFTSDNGGEIVANDIRFTGR 294

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-D 358
           G N PLRG K   ++GG+R    I  P    + K   ++F I D  PT    AG++IN +
Sbjct: 295 GDNRPLRGGKRNLYEGGVRVPTIISYPGHLSSGKTVEQMFSIVDLYPTFAKLAGLKINQE 354

Query: 359 TSLDGVNQWDVLTKGAKTKRSEIL 382
             +DGV+  + + +   T R E+L
Sbjct: 355 QQIDGVDILESIAE-KNTNRDELL 377


>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
          Length = 475

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 140/335 (41%), Positives = 190/335 (56%), Gaps = 30/335 (8%)

Query: 56  PNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLL 115
           PN+ +L YN                +S      + IH G+QH +I   +P  +PL EKLL
Sbjct: 30  PNLGSLVYN-------------VEEQSCPFFRNFQIHTGLQHQIIWPCQPSCIPLDEKLL 76

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ- 174
           PQ LKEAGYATH +GKWHLG +R+   PT RGFD+++GY  G +DYY H      +    
Sbjct: 77  PQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTVIDALNV 136

Query: 175 ---GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
               LD R   +V      +YST+++TE A  +I  H   KP+FLYLA  +VH     EP
Sbjct: 137 TRCALDFRDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EP 191

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
            Q P+E +  +  I D  RR YAGM S +DE+VGNV AAL   G+  N++ +F  DNG  
Sbjct: 192 LQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQ 251

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
           +       G+N PLRG K + W+GG+RGV  + SP LK+    + EL HISDWLP L   
Sbjct: 252 TLA----GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLPPLXGT 307

Query: 352 AGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
            G +     LDG + W+ +++G+ + R E+LHNID
Sbjct: 308 DGTK----PLDGFDVWNTISEGSPSPRMELLHNID 338


>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 489

 Score =  256 bits (654), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 161/426 (37%), Positives = 234/426 (54%), Gaps = 32/426 (7%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
             AL C ++           K+PHI++ILA+DLGWNDV +H ++ I  P ++ LA +G+I
Sbjct: 8   LVALVCVIVVESG-------KQPHIVLILAEDLGWNDVGWH-NADIKMPILNQLAADGVI 59

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
            NQ YVQ  CTPSR+AL+TG YP  I  QH ++L  E  GL L  K LP+ LK+ GY TH
Sbjct: 60  FNQSYVQPACTPSRAALLTGYYPFKIQRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTH 119

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLGF +E Y P  RGFDS YG W  L     +S +    P  G D R N  V  +
Sbjct: 120 LVGKWHLGFCKEEYLPNKRGFDSFYG-WLTLGTTL-YSKENIIAP--GYDFRDNTGVVQE 175

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
           +   Y   +  E A++++  H K  P+FL  + +A+      +  + P E    + DI D
Sbjct: 176 S-DTYLPFMLAERAVDIVMGHYKEYPLFLEFS-MALSG----KFLEVPQEYEDLYSDIED 229

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             +R + G +S +D S+G ++ AL+  GM E+++ +F +DNGA    + S  GSN PLRG
Sbjct: 230 DRQRKFYGKLSVMDNSIGLLVDALKTRGMWEDTLFIFTSDNGA----LASESGSNWPLRG 285

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
             +T ++G  R         LK T  V++EL HI D   T+   AG +  +  +DG++ W
Sbjct: 286 NIATLFEGATRVPTFFHGSMLKNTGYVNNELMHIVDLHKTIIELAGAK-TECDIDGMDLW 344

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN-------NGQSDEWYG 420
             ++KG  + R+E ++NID  DN     AA+RV D K + G  +         +SD+WY 
Sbjct: 345 KTISKGKPSPRTEFVYNID--DNELSPGAAIRVGDYKLITGHPDLLYPFRMLNRSDDWYN 402

Query: 421 DTDNEI 426
             D  I
Sbjct: 403 YGDEPI 408



 Score = 59.3 bits (142), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 26/59 (44%), Positives = 37/59 (62%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFNI DDP E+NNLAE   + +++L  +L  Y+  +VPP N+ FD   DP  +  +W P
Sbjct: 428 LFNIIDDPEERNNLAEEYPNKVEELRHRLDEYREHLVPPVNRVFDPAGDPVNFGGVWSP 486


>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
          Length = 529

 Score =  256 bits (654), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 177/526 (33%), Positives = 272/526 (51%), Gaps = 49/526 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II I  DD G+NDV F   + I +PN+DALA +G+IL   Y    C+PSR + ++G+
Sbjct: 3   QPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGR 61

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y     MQHGVIL+ +P  L L   +LP YLKE GY THA GKWHLG+ R+  TPT RGF
Sbjct: 62  YSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGF 121

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DS  G + G  ++Y+H+  AT   Y   D     +VD   IG +S DL        + E+
Sbjct: 122 DSFSGGFSGEGEFYEHTT-ATGGYY---DWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEY 177

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMVSRLDESVGN 266
           ++S P+F+Y+A   VH+     P     E +A +  LD++D +R+ Y G+VS +D  +G 
Sbjct: 178 DQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLVSGMDYIIGG 231

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++  +++ G+ EN+ +LF +DNG         +G N P RG KST W+GG +  +   SP
Sbjct: 232 IVDKIKEIGIYENTYILFSSDNGG-----DVGEGDNSPHRGGKSTLWEGGCKANSWTHSP 286

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            L  T   ++ L H++DWLPT+   AG  +  +  LDGV Q D++     + R+ +++NI
Sbjct: 287 LLGATGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGVVQTDMVINNGDSARTTMIYNI 346

Query: 386 D-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           D   D  Q  +  +   +L+Y       GQ D +  D      KY      Y +A   + 
Sbjct: 347 DRESDIGQPKFGEMACRNLQYKLIWGFPGQQDGYGADA-----KYIYDIPFYQEAAQAY- 400

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
                  I  + + D K         +LT ++   L  +       D + A  + T    
Sbjct: 401 -------IDAEGSTDKKRGGP----YVLTADQTSVLNNYMSSVHVSDQELA--DGTGSML 447

Query: 505 LFNITDDPCEQNNLAES--------QTDLLKQLEDKLAIYKSTMVP 542
           LF++ +DP E  +L+ES        + +L+K ++D  A+     VP
Sbjct: 448 LFDLINDPNETTDLSESTDPDIVAAKAELIKMIQD--AVNDGGYVP 491


>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
          Length = 501

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 159/422 (37%), Positives = 233/422 (55%), Gaps = 27/422 (6%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           F      K+P+ + I ADDLG++D+    +  + TP +D LA  G+    +YV   C+PS
Sbjct: 13  FQGIHTSKQPNFVWIFADDLGYHDIGLR-NPDLHTPTLDKLATKGVQFKNNYVMHACSPS 71

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R + M+G+YP    MQ  VI    P   PL  K LP+YLKE GY THA+GKWHLG+ RE 
Sbjct: 72  RHSFMSGRYPFTSQMQKDVIFPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCRED 131

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
             PT RGFDS YG   G  DY  H     +      D   N  VD    G +S DL+T A
Sbjct: 132 CMPTSRGFDSFYGTLDGEGDYLTHMSAGFY------DWHTNGTVDRSKSGQHSQDLHTAA 185

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--DISDPERRTYAGMVS 258
             ++I    + KP FLY A     A N + P QA  +++ K+   ++SD  ++ Y G+++
Sbjct: 186 LADIIERQTEEKPFFLYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTLKK-YYGLIT 239

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD  V  +   L K  +L N+ ++F+ADNGA   GI    GSN P RG KST W+GG++
Sbjct: 240 GLDGMVKTIKDTLEKKNLLNNTYIIFLADNGA-DVGI----GSNEPFRGTKSTLWEGGVK 294

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
            ++ I+SP LK+T   +  L HI+DW+PTL + AG  + ++  +DG+NQ  ++T   K++
Sbjct: 295 SLSFIYSPLLKKTGYENDGLIHITDWMPTLVSLAGGSVPESEGIDGINQVAMVTDARKSE 354

Query: 378 RSEILHNID---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEV 434
           R ++++N+D       P     A+R +  K + G    GQ+D  YG T   + K    +V
Sbjct: 355 RKDMVYNLDMESQFGLPSSGQIAVRDERYKLLWGFP--GQND-GYGLTTTYLAKVEEFKV 411

Query: 435 LY 436
           +Y
Sbjct: 412 IY 413


>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
          Length = 488

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 166/483 (34%), Positives = 237/483 (49%), Gaps = 67/483 (13%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R    PT RGF
Sbjct: 23  YQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 82

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAAINV 204
           DS +G   G  DYY H  K       G D+  N       DN   GIYST +YT+    +
Sbjct: 83  DSFFGSLLGSGDYYTH-FKCDSPGICGYDLYENNNAAWDHDN---GIYSTQMYTQKVQQI 138

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A HN  KP+FLY+A+ AVH+     P QAP      +  I++  RR YA M++ LDE++
Sbjct: 139 LASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAI 193

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
            NV  ALRK+G  ENSI+++ +DNG          GSN PLRG K T W+GG+R V  + 
Sbjct: 194 NNVTLALRKYGYYENSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFVH 249

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILH 383
           SP LK    V  EL HI+DW PTL   A  +I+ D  LDG + W+ +++G ++       
Sbjct: 250 SPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDVWETISEGRRSP------ 303

Query: 384 NIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITF 443
                          RVD L  +       ++  W                  +  GI  
Sbjct: 304 ---------------RVDILHNIDPIYTKAKNGSWA-----------------AGYGIWN 331

Query: 444 NALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTVK 502
            A+++ +++            SD           +  + F+    N + N+    ++   
Sbjct: 332 TAIQSAIRVNHWKLLTGNPGYSD----------WVPPQAFSNAGPNRWHNERVSWSAGKT 381

Query: 503 PCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWV 562
             LFNIT DP E+ +L+    +++KQL  +L+ +  T VP    P D R++P     +W 
Sbjct: 382 VWLFNITADPYERVDLSARHPEVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVWG 441

Query: 563 PWY 565
           PW+
Sbjct: 442 PWF 444


>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
          Length = 467

 Score =  255 bits (652), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 165/484 (34%), Positives = 239/484 (49%), Gaps = 67/484 (13%)

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R    PT RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAIN 203
           FD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    
Sbjct: 61  FDTFFGSLLGSGDYYTH-FKCDSPGICGYDLYEN---DNAAWDHDNGIYSTQMYTQKVQQ 116

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           ++A HN SKP+FLY+A+ AVH+     P QAP +    +  I++  RR YA M++ LDE+
Sbjct: 117 ILASHNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEA 171

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           + NV  AL+K+G  +NSI+++ +DNG          GSN PLRG K T W+GG+R V  +
Sbjct: 172 INNVTLALKKYGYYDNSIIIYSSDNGGQPMA----GGSNWPLRGSKGTYWEGGIRAVGFV 227

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEIL 382
            SP LK    V  EL HI+DW PTL   A  +I+ D  LDG + W+ +++G ++      
Sbjct: 228 HSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDEDIQLDGYDIWETISEGRRSP----- 282

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
                           RVD L  +       ++  W                  +  GI 
Sbjct: 283 ----------------RVDILHNIDPIYTKAKNGSWA-----------------AGYGIW 309

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
             A+++ +++            SD           +  + F+    N + N+     +  
Sbjct: 310 NTAIQSAIRVNHWKLLTGNPGYSD----------WVPPQAFSNAGPNRWHNERVSWTAGK 359

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
              LFNIT DP E+ +L+    D++KQL  +L+ +  T VP    P D R++P     +W
Sbjct: 360 TVWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPKLNGGVW 419

Query: 562 VPWY 565
            PW+
Sbjct: 420 GPWF 423


>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
          Length = 496

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 146/341 (42%), Positives = 193/341 (56%), Gaps = 26/341 (7%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH VI   +P  LPL    LPQ L+EAGYATH +GKWHLGF+R    PT RGF
Sbjct: 31  YQIHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGF 90

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
           D+ +G   G  DYY H  K       G D+  N +   +   G+YST +YT     ++A 
Sbjct: 91  DTFFGSLLGSGDYYTH-YKCDGPGACGYDLYENDRAAWDHDDGVYSTHMYTRRVQQILAA 149

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H+  KP+FLY+A+ AVH+     P QAP      +  I++  RR YA M+S LDE+VGNV
Sbjct: 150 HDPGKPLFLYIAYQAVHS-----PLQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAVGNV 204

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
             ALR HG+  NSI+++ +DNG    G  S  GSN PLRG K T W+GG+R V  + SP 
Sbjct: 205 TLALRAHGLYNNSILIYSSDNG----GQPSAGGSNWPLRGSKGTYWEGGIRAVGFVHSPL 260

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           L+Q   V  EL HI+DW PTL A A   I  D  LDG + W+ +++G ++ R +ILHNID
Sbjct: 261 LRQRGTVCRELVHITDWYPTLVALAEGHIEEDAHLDGYDVWETISEGLRSPRVDILHNID 320

Query: 387 NVDNPQKY--------------YAALRVDDLKYVAGTDNNG 413
            +    K+               +A+RV   K + G    G
Sbjct: 321 PIYTKAKHGSWAAGYGIWNTAVQSAIRVQHWKLLTGNPGYG 361



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/63 (38%), Positives = 36/63 (57%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L+     +++QL  +L+ +  T VP    P D R++P     +W PW
Sbjct: 392 LFNITADPYERVDLSGRYPGVVRQLLRRLSQFNRTAVPVRYPPGDPRSNPRLNGGVWGPW 451

Query: 565 YDE 567
           Y E
Sbjct: 452 YIE 454


>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
          Length = 430

 Score =  255 bits (651), Expect = 5e-65,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RGFD+
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      +        LD R   +V      +YST+++T+ A  +I 
Sbjct: 62  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+VGN
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 232

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    + EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E+LHNI
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRVELLHNI 292

Query: 386 D 386
           D
Sbjct: 293 D 293


>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 500

 Score =  255 bits (651), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 178/548 (32%), Positives = 273/548 (49%), Gaps = 80/548 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHII I+ DDLGWNDV ++ +    TP ID LA +G+ L  +YV + C PSR+ LMTG
Sbjct: 21  KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTG 79

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++ I +G+QH       P  LPL E  + + LK  GY+TH +GKWH GF+ +   P  RG
Sbjct: 80  RHAIQLGIQHDD-YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRG 138

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAI 202
           FD+ +G+     D+Y HS           +  HN++     +  K IG YST ++     
Sbjct: 139 FDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGK 190

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLD 261
           ++I  H+++KP+FLYL+  AVHA     P + P   + ++   I D +RRTYA M S +D
Sbjct: 191 DIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCVD 245

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E++ N+  AL +  ML+NSI++F +DNG     + ++ G+N PLRG K T W+GG+R + 
Sbjct: 246 EAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAIG 302

Query: 322 AIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
            ++   L +  + +  + L HI+DW PTL   AG   +    L GV+QW+++T   K+ R
Sbjct: 303 FVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMITGKDKSGR 362

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            ++L  +      Q+ Y A       Y  G   N   D                  + + 
Sbjct: 363 DDVLITL------QEDYKA----GAHYRTGIYKNNAFD------------------ITTH 394

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALR--TILTDEKILELREFARVRCNYDNKGAH 496
           A I           K K    P    SD ++   I +D +I+E  +F   R         
Sbjct: 395 AAIKMG--------KWKLLTGPIGGPSDWVKPQEINSDHRIIEDPDFDATRI-------- 438

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPAR 556
               V+  L+NI DDP E  +L+E    ++ ++  KL  +    +P  ++ +  RAD   
Sbjct: 439 ----VR--LYNIEDDPTETTDLSEINQHIVIEMLAKLHNFSKQSLPAQSRIY-PRADFDG 491

Query: 557 WNNIWVPW 564
                 PW
Sbjct: 492 MGECISPW 499


>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
          Length = 316

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 127/263 (48%), Positives = 171/263 (65%), Gaps = 10/263 (3%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + PH++ +LADDLGWNDV FHGS +I TP++DALA  G++L+ +Y Q LCTPSRS L+
Sbjct: 42  ASRPPHLVFVLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLL 100

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT 
Sbjct: 101 TGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTR 160

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           RGFD+++GY  G +DYY H      +        LD R   +V      +YST+++T+ A
Sbjct: 161 RGFDTYFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
             +I  H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +D
Sbjct: 221 TALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMD 275

Query: 262 ESVGNVIAALRKHGMLENSIVLF 284
           E+VGNV AAL+  G+  N++ +F
Sbjct: 276 EAVGNVTAALKSSGLWNNTVFIF 298


>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
          Length = 471

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 167/490 (34%), Positives = 247/490 (50%), Gaps = 69/490 (14%)

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L++ +Y IH G+QH VI   +P  LPL    LPQ LKEAGY+TH +GKWHLGF+R+   
Sbjct: 2   VLVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECM 61

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYT 198
           PT RGFD+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT
Sbjct: 62  PTQRGFDTFFGSLLGSGDYYTHY-KCDSPRMCGYDLYEN---DNAAWDHDNGIYSTQMYT 117

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           +    ++A HN  KP+FLY+A+ AVH+     P QAP     ++  I++  RR YA M++
Sbjct: 118 QKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLA 172

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            LDE++ NV  AL+++G   NS++++ +DNG  P+ G     G+N PLRG K T W+GG+
Sbjct: 173 CLDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG-----GNNWPLRGSKGTYWEGGI 227

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKT 376
           R V  + SP LK    V  EL HI+DW PTL   A  +I+ DT LDG + W+ +++   +
Sbjct: 228 RAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTLLDGYDVWETISE---S 284

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLY 436
           +RS                   R+D L  +       ++  W                  
Sbjct: 285 RRSP------------------RMDILHNIDPIYTKAKNGSWA----------------- 309

Query: 437 SKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR-CNYDNKGA 495
           +  GI   A+++ +++            SD           +  + F+ V    + N+  
Sbjct: 310 AGFGIWNTAIQSAIRVNHWKLLTGNPGYSD----------WVPPQSFSNVGPSRWHNERV 359

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
              +     LFNIT DP E+ +L+    D++KQL  +L+ +  T VP    P D R++P 
Sbjct: 360 SWTAGKTVWLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPN 419

Query: 556 RWNNIWVPWY 565
               +W PW+
Sbjct: 420 FNGGVWGPWF 429


>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
          Length = 430

 Score =  254 bits (650), Expect = 6e-65,   Method: Compositional matrix adjust.
 Identities = 136/301 (45%), Positives = 181/301 (60%), Gaps = 14/301 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           I  G+QH +I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG +R+   PT RGFD+
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           ++GY  G +DYY H      +        LD R   +V      +YST+++T+ A  +I 
Sbjct: 62  YFGYLLGSEDYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALIT 121

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H   KP+FLYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+VGN
Sbjct: 122 NHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGN 176

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V AAL+  G+  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP
Sbjct: 177 VTAALKSSGLWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGVRGVGFVASP 232

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            LKQ    + EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E+LHNI
Sbjct: 233 LLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTKPLDGFDVWKTISEGSPSPRMELLHNI 292

Query: 386 D 386
           D
Sbjct: 293 D 293


>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  254 bits (650), Expect = 7e-65,   Method: Compositional matrix adjust.
 Identities = 154/370 (41%), Positives = 213/370 (57%), Gaps = 39/370 (10%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QHGVI   +P  LP+ E  LPQ LKE GY T+ +GKWH+GF+ +  TPT RGF
Sbjct: 122 YWIHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGF 181

Query: 149 DSHYGYWQGLQDYYDH--SCKA------TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           DS +G+  G +D+Y    SC+A      + +   G D+  N        G YST L+T  
Sbjct: 182 DSFFGFLIGAEDHYTRMLSCEAYNANDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNK 241

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
            ++VI  H+K+KP+F+YLA+ AVHA     P Q PD  +  + +I+D  RRTYAGMVS +
Sbjct: 242 TVDVIERHDKTKPLFIYLAYQAVHA-----PLQVPDSYMDPYKNIADKTRRTYAGMVSCM 296

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH-PLRGMKSTPWDGGMRG 319
           DE +GNV  AL+  G+ +N+I++F  DNG      H ++G N+ PLRG K + W+GG+ G
Sbjct: 297 DEGIGNVTQALKDAGLYDNTIIIFSTDNGG-----HIDRGGNNWPLRGSKGSMWEGGIHG 351

Query: 320 VAAIWSPWLKQTQK--VSSELFHISDWLPTLCAA-AGIEINDTS-LDGVNQWDVLTKGAK 375
           V  + SP L    K  V+ EL H+SDWLPT+ A  AG  +N T  LDG N W  +   A 
Sbjct: 352 VGFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAGIAGGALNGTKPLDGYNVWSSIRGTAS 411

Query: 376 TKRSEILHNIDNVDNPQK-----------YYAALRVDDLKYVAGTDNNGQS-DEWYGDTD 423
           + ++E+LHNI    NP+K             AALRV D K   G + +    +EW    +
Sbjct: 412 SPQTELLHNI----NPKKKRPENEKFDTSMKAALRVGDWKIKTGANYDVTVFNEWMPPPE 467

Query: 424 NEIDKYSPKE 433
           + +D   P E
Sbjct: 468 SGLDTIYPTE 477



 Score = 79.0 bits (193), Expect = 6e-12,   Method: Compositional matrix adjust.
 Identities = 35/62 (56%), Positives = 44/62 (70%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QHGVI   +P  LP+ E  LPQ LKE GY T+ +GKWH+GF+ +  TPT RGFDS
Sbjct: 38  IHTGLQHGVIRHAQPNCLPMDEVTLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDS 97

Query: 151 HY 152
            +
Sbjct: 98  FF 99



 Score = 45.8 bits (107), Expect = 0.063,   Method: Compositional matrix adjust.
 Identities = 26/60 (43%), Positives = 30/60 (50%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP EQ +LAE    ++ +L  KL  Y    VP      D  ADPA     W PW
Sbjct: 485 LFNITADPLEQTDLAEKYPGVVDELVHKLEEYYKGSVPMRYPDEDLEADPALHGGSWGPW 544


>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
           obscuriglobus UQM 2246]
          Length = 459

 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 151/406 (37%), Positives = 216/406 (53%), Gaps = 26/406 (6%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F +   LL   +     AP +P ++ +LADDLG  D  F G  +I TPNID +A  G  L
Sbjct: 4   FTVLLFLLAPASVARAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATL 63

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           +  Y Q +C+P+R+A MTG+YP+  G+Q GV+     +GLPL E+ + Q LK+AGY T  
Sbjct: 64  DAFYAQPVCSPTRAAFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAV 123

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           IGKWHLG F   Y PT RGFD  YG++ G  DY+ H          G D   + +V++  
Sbjct: 124 IGKWHLGHFAPEYLPTKRGFDHQYGHYNGALDYFTHIRDG------GFDWHRDDKVNSDE 177

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
              YST L  + A+  +  H   KP FLY+   AVHA     P Q P+E +  +  +   
Sbjct: 178 --GYSTHLVAKDAVQFVQTHAGKKPFFLYVPFNAVHA-----PHQVPEEYLKPYGHLKG- 229

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           ER+ YAGM++ +DE+VG ++AA+ K G+ +N++ +F++DNG P  G+ +   SN   R  
Sbjct: 230 ERQKYAGMLAAMDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGVVT---SNGKFRAG 286

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG-IEINDTSLDGVNQW 367
           K T ++GG+R  A        +     +E  H+ DW PTL    G        +DG++ W
Sbjct: 287 KGTLYEGGVRVAACAAFDGHIRPGTTITEPLHVVDWYPTLLKLGGPTGEQQLPVDGLDIW 346

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA--GTDN 411
            VLT G  +    IL N      P     A+R  D K V   GTD+
Sbjct: 347 PVLTGGKPSPHGAILLNT----TPNT--GAVRAGDWKLVVKNGTDD 386


>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
          Length = 539

 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 167/491 (34%), Positives = 247/491 (50%), Gaps = 69/491 (14%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+R+   PT RGF
Sbjct: 75  YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 134

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDLYTEAAINV 204
           D+ +G   G  DYY H  K       G D+  N       DN   GIYST +YT+    +
Sbjct: 135 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENENAAWDYDN---GIYSTQMYTQRVQQI 190

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A H+  +P+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S LDE++
Sbjct: 191 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 245

Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
            NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+R V  +
Sbjct: 246 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFV 300

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEIL 382
            SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W+ +++G ++ R    
Sbjct: 301 HSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLDGYDVWETISEGLRSPR---- 356

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
             +D + N    Y   +  +  + AG                               GI 
Sbjct: 357 --VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIW 382

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
             A+++ ++++           SD           +  + F+ +  N + N+    ++  
Sbjct: 383 NTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQSFSNLGPNRWHNERITLSTGK 432

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
              LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D R++P     +W
Sbjct: 433 SVWLFNITADPYERVDLSNRYPGVVKQLLRRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 492

Query: 562 VPWYDELDKQK 572
            PWY E +K++
Sbjct: 493 GPWYKEENKKQ 503


>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 512

 Score =  254 bits (648), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 151/373 (40%), Positives = 207/373 (55%), Gaps = 21/373 (5%)

Query: 26  APKKPHIIIILADDLGWNDVSFHG---SSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           A KKPH+++ILADDLG+NDV + G    S   TP +D+LA NG+ L  +Y  ++C+P+R 
Sbjct: 25  ATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLAENGVRLENYYTHSVCSPTRG 84

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTG+  I IG+ HG+I   +  GLPL   LLP+ L   GY T  IGKWHLGF    Y 
Sbjct: 85  ALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNCGYNTQMIGKWHLGFSSSKYA 144

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEA 200
           P  RGF   YG+  G ++Y+      A      G+D   +     N+T G YS  +Y   
Sbjct: 145 PWNRGFHGFYGFLAGSENYWSKWLPMARHSNIGGVDFTDSTTGPTNETWGQYSAHVYASR 204

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A  VI  H++SKP+FLYL     H      P  AP      F DI D +R  Y  MVS L
Sbjct: 205 ARYVIQHHDQSKPLFLYLPLQTPHT-----PLGAPSHYYEPFKDIEDDDRMKYLSMVSVL 259

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE+V NV   L+  GM E+++++F  DNG    G   + G+N P RG K+T ++GG++ V
Sbjct: 260 DETVRNVTNYLKDAGMWEDTLLIFSTDNG----GEVKHGGNNWPYRGTKNTLYEGGVKAV 315

Query: 321 AAIWSPWLKQ---TQKVSSELFHISDWLPTLCAAAGIEI----NDTSLDGVNQWDVLTKG 373
             +    L      Q+V+ EL H++DW PT+ AAA        +   L G+NQW  L   
Sbjct: 316 GFVHGKILGDLSPAQRVNRELIHVTDWYPTIMAAADCPYVSGNSRNKLYGLNQWGALRGE 375

Query: 374 AKTKRSEILHNID 386
             ++R+EI+H ID
Sbjct: 376 RGSQRTEIVHTID 388


>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
          Length = 418

 Score =  253 bits (647), Expect = 1e-64,   Method: Compositional matrix adjust.
 Identities = 136/292 (46%), Positives = 177/292 (60%), Gaps = 14/292 (4%)

Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
           I   +P  +PL EKLLPQ LKEAGY TH +GKWHLG FR+   PT RGFD+++GY  G +
Sbjct: 1   IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60

Query: 160 DYYDHSCKATFEPYQ----GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMF 215
           DYY H      +        LD R   +V      +YST+++TE A  +IA H   KP+F
Sbjct: 61  DYYSHERCTLIDALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKPLF 120

Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
           LYLA  +VH     EP Q P+E +  +  I D  R  YAGMVS +DE+VGNV AAL+ HG
Sbjct: 121 LYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKSHG 175

Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
           +  N++ +F  DNG  +       G+N PLRG K + W+GG+RGV  + SP LKQ    +
Sbjct: 176 LWNNTVFIFSTDNGGQTLA----GGNNWPLRGRKWSLWEGGIRGVGFVASPLLKQKGIKN 231

Query: 336 SELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
            EL HISDWLPTL   AG   N T  LDG + W  +++G+ + R E+LHNID
Sbjct: 232 RELIHISDWLPTLVTLAGGSTNGTKPLDGFDVWKTISEGSPSPRMELLHNID 283


>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 469

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 181/549 (32%), Positives = 263/549 (47%), Gaps = 115/549 (20%)

Query: 26  APKKPHIIIILADDLGWNDVSFHG-SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           +P  PHI+ ++ DDLG+NDV +H   S I T NIDALA  GL L  +YV  LCTPSRS  
Sbjct: 25  SPPTPHIVFVMVDDLGYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQF 84

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           ++GKY IH GMQH VI    P  LPL +  +   L +AGYATH +GKWHLGF+++   P 
Sbjct: 85  LSGKYLIHNGMQHLVIDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPL 144

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            RGF S +G                            M +   T+               
Sbjct: 145 NRGFQSFFG----------------------------MLLGQSTV--------------- 161

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDES 263
             +   +KPMF+YL++ A H      PF+ PDE    +   I++  RRTYAGMV+ LDES
Sbjct: 162 --DQGGTKPMFMYLSYQAPHL-----PFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDES 214

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +G +   L++ G+  +++ +F  DNG        N G+N PLRG+K   ++GG+RGV  +
Sbjct: 215 IGKLTDTLKEEGLWNDTVFIFSTDNGGVG---KKNAGNNWPLRGVKGNYFEGGIRGVGFV 271

Query: 324 WSPWLKQTQK--VSSELFHISDWLPTLC-AAAGIEINDT--SLDGVNQWDVLTKG-AKTK 377
             P L    +  +S++L HISDW PTL    A + +N T   LDGVN WDV+++G +   
Sbjct: 272 AGPLLSTNVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVISQGESGDP 331

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS 437
             EI++   N+D   +  A++                        D++ D YS    +  
Sbjct: 332 DREIVY---NIDPDMQRPASI------------------------DSKYDSYSTIFDISI 364

Query: 438 KAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHC 497
            A I     +  +++   + AD     +D L  + +         FA V      K    
Sbjct: 365 HASIR----RGDMKLVTGYQAD-----TDWLAPVESG--------FANVYATDAGKAI-- 405

Query: 498 NSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY--KSTMVPPGNKPFDKRADPA 555
                  L+NIT DP EQ++L+++  D +  L  +L  Y   S    P N   D  +DPA
Sbjct: 406 ------WLYNITADPTEQDDLSDTMADEVVDLLGRLQSYYTGSRFEAPANPGIDCDSDPA 459

Query: 556 RWNNIWVPW 564
              ++W  W
Sbjct: 460 LHGDLWTNW 468


>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 843

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 143/365 (39%), Positives = 209/365 (57%), Gaps = 29/365 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHII I+ DDLGWNDV ++ +    TP ID LA +G+ L  +YV + C PSR+ LMTG
Sbjct: 364 KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTG 422

Query: 88  KYPIHIGMQ-HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           ++ I +G+Q HG      P  LPL E  + Q LK+ GY+TH IGKWH GF+ +   P  R
Sbjct: 423 RHAIQLGIQRHG--FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNR 480

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAA 201
           GFD+ +G+     ++Y HS           +  HN++     +  + IG YST +Y    
Sbjct: 481 GFDTFFGFVGAGIEHYTHS--------DHFNHMHNLRKNDDCIAKQYIGKYSTTIYANEG 532

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRL 260
            N+I  H+++KP FLYL+  AVH      P + P   + ++   I D +RRTYA M S +
Sbjct: 533 KNIINAHDQNKPFFLYLSFSAVHT-----PLEVPSSYLKQYESTIYDEDRRTYAAMTSCV 587

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE++ N+  AL +  ML+NSI++F +DNG     +  + GSN PLRG K T W+GG++ V
Sbjct: 588 DEAIANITEALEEKDMLKNSIIVFTSDNGG---AVERSAGSNWPLRGGKHTIWEGGVKAV 644

Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTK 377
             ++   L    + +    L HI+DW PTL   AG   +    L GV+QW ++T   K++
Sbjct: 645 GFVYGELLPHHSRGTENTGLMHITDWFPTLVTLAGANPHVGKKLYGVDQWKMVTGEGKSR 704

Query: 378 RSEIL 382
           R ++L
Sbjct: 705 RDDVL 709


>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
          Length = 543

 Score =  252 bits (643), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 166/486 (34%), Positives = 245/486 (50%), Gaps = 69/486 (14%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF+++   PT RGF
Sbjct: 78  YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYKKECMPTKRGF 137

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINV 204
           D+ +G   G  DYY H  K       G D+  N   DN       GIYST +YT+    +
Sbjct: 138 DTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYEN---DNAAWDHDNGIYSTQMYTQRVQQI 193

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           +A HN +KP+FLY+A+ AVH+     P QAP      +  I +  RR YA M+S LDE++
Sbjct: 194 LASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 248

Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
            NV  AL+ +G   NSI+++ +DNG  P+ G     GSN PLRG K T W+GG+R V  +
Sbjct: 249 HNVTLALKAYGFYNNSIIIYSSDNGGQPTAG-----GSNWPLRGSKGTYWEGGIRAVGFV 303

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
            SP LK    V  EL HI+DW PTL + A  +I++ T LDG + W+ +++G ++ R    
Sbjct: 304 HSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEHTQLDGYDVWETISEGFRSPR---- 359

Query: 383 HNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGIT 442
             +D + N    Y   +  +  + AG                               GI 
Sbjct: 360 --VDILHNIDPIYT--KAKNGSWAAGY------------------------------GIW 385

Query: 443 FNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN-YDNKGAHCNSTV 501
             A+++ ++++           SD +            + F+ +  N + N+    ++  
Sbjct: 386 NTAIQSAIRVQHWKLLTGNPGYSDWVPP----------QAFSNLGPNRWHNERITLSTGK 435

Query: 502 KPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
              LFNIT DP E+ +L+     ++K+L  +L+ +  T VP    P D R++P     +W
Sbjct: 436 SIWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKDPRSNPRLNGGVW 495

Query: 562 VPWYDE 567
            PWY E
Sbjct: 496 GPWYKE 501


>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 726

 Score =  251 bits (642), Expect = 5e-64,   Method: Compositional matrix adjust.
 Identities = 186/587 (31%), Positives = 290/587 (49%), Gaps = 92/587 (15%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQI-PTPNIDALAYNGLILNQHYVQALCTPSR 81
           +++    P++  I+ DD+GWND+ +H +     TPN+D L+ +G+ ++Q+Y  ++CTP+R
Sbjct: 105 SSSTADNPNVFFIMIDDMGWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPAR 164

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +ALMTG+YP+  G+Q+ VI  G PWGLPLTEKLLP+Y+ EAGY +H +GKWHLG +   +
Sbjct: 165 AALMTGRYPVRYGLQYNVIQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAH 224

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSC----------------KAT--------FEPYQGLD 177
           TP  RGF++++GY    + Y+ H                   AT        F+P  G D
Sbjct: 225 TPHRRGFETYFGYLNDEEMYWTHQTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDD 284

Query: 178 ----MRHNMQVDNKTI---GIYSTDLYTEAAINVIAEH--NKSKPMFLYLAHLAVHAGNT 228
                     V + ++   G YST+++T+ A+ ++++   +   P+FLYL+H AVH    
Sbjct: 285 DLVSTGPTSSVYSSSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLG 344

Query: 229 YEPFQA-PDEEVAKFLDISDPE-----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
             P  A   EE A    + D       R  +A ++  LD+S+G ++  L   G LENSI+
Sbjct: 345 VPPADAFSPEEWAVLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSII 404

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE---LF 339
           +  +DNG    G  S  GSN+PLRG+K + W+GG++  A ++SP     +   +E   L 
Sbjct: 405 VVASDNG----GCPSTGGSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLM 460

Query: 340 HISDWLPTLCAAAGIEI--NDTSLDGVNQWDVLT------KGAKTKRSEILHNIDNVDNP 391
           H++DWLPTL  AA  E+  +   LDGV+ W  +           ++  E    +++    
Sbjct: 461 HVTDWLPTLADAANFELLGSHGDLDGVSHWKHIVPAPGDDTATVSESPEAAAAVEDNRLK 520

Query: 392 QKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYS------KAGIT--- 442
           +   A   VD      G    G S   YG    E+   + +EVLY+       A IT   
Sbjct: 521 RARTATAAVDGGGGGGGGGGGGSS---YG--LGEVSYGARREVLYNFDPYVLGANITDDI 575

Query: 443 -----------FNALKTKLQIKQKHAADPKANSSDALRTILTDE--KILELREFARVRCN 489
                      F   K K       +     ++   L   LTDE     EL E ++    
Sbjct: 576 GEADFTEVQGAFRQGKWKFMFNSWCSGYYAFDNKTLLADDLTDEDNTCQELGECSQC--- 632

Query: 490 YDNKGAHCNS---TVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKL 533
               GA+C S        LF++ DDP E NNLA    ++++ L  ++
Sbjct: 633 ----GANCGSLSIDYTDWLFDLEDDPRELNNLANEHPEVVESLRRRV 675


>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
          Length = 504

 Score =  251 bits (642), Expect = 6e-64,   Method: Compositional matrix adjust.
 Identities = 167/503 (33%), Positives = 251/503 (49%), Gaps = 69/503 (13%)

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
           C  +  +    KY IH G+QH +I   +P  LPL    LPQ LKE GY+TH +GKWHLGF
Sbjct: 24  CPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGF 83

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQVDNKTIGIY 192
           +R+   PT RGFD+ +G   G  DYY H  K       G D+  N     + DN   GIY
Sbjct: 84  YRKECMPTQRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDDAAWEYDN---GIY 139

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           ST +YT+    ++A H+  +P+FLY+A+ AVH+     P QAP      +  I +  RR 
Sbjct: 140 STQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRR 194

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKST 311
           YA M+S LDE++ NV  AL+++G  +NSI+++ +DNG  P+ G     GSN PLRG K T
Sbjct: 195 YAAMLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG-----GSNWPLRGSKGT 249

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVL 370
            W+GG+R V  + SP LK    V  EL HI+DW PTL   A  ++++   LDG + W+ +
Sbjct: 250 YWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLAEGQLDEGVRLDGYDVWETI 309

Query: 371 TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYS 430
              ++ +RS                   RVD L  +       ++  W            
Sbjct: 310 ---SEGRRSP------------------RVDILHNIDPIYTKAKNGSWA----------- 337

Query: 431 PKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCN- 489
                 +  GI   A+++ ++++           S+           +  + F+ +  N 
Sbjct: 338 ------AGFGIWNTAIQSAIRVRHWKLLTGNPGFSE----------WVPPQAFSNLGPNR 381

Query: 490 YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFD 549
           + N+    ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    P D
Sbjct: 382 WHNERISWSAGKSVWLFNITADPYERVDLSARYPGVVKQLLRRLSQFNKTAVPVRYPPRD 441

Query: 550 KRADPARWNNIWVPWYDELDKQK 572
            R++P     +W PWY E +K+K
Sbjct: 442 PRSNPRLNGGVWGPWYREEEKKK 464


>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
          Length = 274

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 123/226 (54%), Positives = 156/226 (69%), Gaps = 2/226 (0%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II ILADDLGW+DVSFHGS QIPTPN+DALA +G+ILNQHY QALCTPSR+ALMTG+Y
Sbjct: 48  PNIIFILADDLGWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRY 107

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PI+ GMQH VI  GEPWGLPL  +L+P++  + GY TH +GKWHLG F++ + P  RGFD
Sbjct: 108 PIYTGMQHFVIQPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFD 167

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S YG++   QDYY+ +   T   + G D   N  +       YST  YTE A+++I  HN
Sbjct: 168 SFYGFYNADQDYYNKTL--TEGEHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSHN 225

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
            S+P+FLYL++     G      +AP+E V KFL I +  R  YAG
Sbjct: 226 PSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271


>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
          Length = 515

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 153/375 (40%), Positives = 212/375 (56%), Gaps = 41/375 (10%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           F   +   +P++IIIL DDLGW DVS+H +  IPTPNID+   NG+ LN+ Y    C+P+
Sbjct: 9   FFTLSLCSQPNVIIILTDDLGWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPT 67

Query: 81  RSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           R++L+TG   +HI   HGVI        E  GLP   K++P+Y KEAGY T   GKWHLG
Sbjct: 68  RASLLTG---LHI-FNHGVIRPFMNPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLG 123

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-GLDMRHNMQVDNKTIGIYST 194
             +E Y PT RGFDS YG+  G   YYDH      + ++ G+ +  +          YST
Sbjct: 124 MHKEEYLPTNRGFDSSYGHMLGGIGYYDHVHTNRMDWHRDGVSLNEDG---------YST 174

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           +L  + AIN+I   +  +P+FLY+A  A H      P +AP+E+V  FL I D   R YA
Sbjct: 175 ELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNFLYIEDELDRNYA 229

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHSNKGSNH 303
             +S+LD  +G +I +++  G+LE +I++F++DNG P F           G+   KGS  
Sbjct: 230 ANISKLDIEIGRIINSIKDLGILEETIIIFLSDNG-PVFDINPIVKTIAPGLTKAKGSTA 288

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF-HISDWLPTLCAAAGIEINDTSL- 361
            LRG K++  DGG+R  AAIW  W    +K  S+ F  I D LPTL  A GI  ++  L 
Sbjct: 289 GLRGSKTSALDGGIRVPAAIW--WKGVLEKSKSDQFIFIQDLLPTLLTATGIGYDENLLI 346

Query: 362 DGVNQWDVLTKGAKT 376
           DGV++W  L     T
Sbjct: 347 DGVDRWSNLITNKIT 361


>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
          Length = 318

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 141/333 (42%), Positives = 196/333 (58%), Gaps = 20/333 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP+ + I+ADDLG++D+       I TPN+DALA  G+IL  +YVQALC+PSR ALMTG
Sbjct: 1   RKPNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTG 59

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YP    MQ  V+L  E     L  KLLPQYLKE GY  H IGKWHLG+ RE   PT RG
Sbjct: 60  RYPYKSAMQSFVVLPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRG 119

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FDS YG   G  DY++H+    +      D   N +V ++  G +S DL  +    + AE
Sbjct: 120 FDSFYGLLDGAGDYWEHTTSGVY------DWHLNDEVFHEAYGNHSQDLELDRLDKLFAE 173

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--DISDPERRTYAGMVSRLDESVG 265
           H+   P+FLY A       N + P Q  +E ++ +   D SD  RR Y  + S LD  VG
Sbjct: 174 HDNKDPLFLYFA-----PQNPHVPSQPTEEYLSMYSADDFSDVRRR-YLALTSALDSMVG 227

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            ++  L+ + M++N+ ++FM+DNGA        +G N P RG K++ ++GG +  + I+S
Sbjct: 228 KIVQHLKDNCMMDNTYLIFMSDNGA-----DPGEGQNTPFRGGKTSLFEGGTKSNSFIYS 282

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
             LK+T+  +  L HI+DWLPTL   AG +I D
Sbjct: 283 QLLKKTEYENDGLMHITDWLPTLVKLAGGDIED 315


>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
          Length = 370

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 150/372 (40%), Positives = 208/372 (55%), Gaps = 25/372 (6%)

Query: 21  FLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPS 80
           FL     + P+II ILADDLG+NDV F  +  I TPNID LA  G+++   Y   +CTPS
Sbjct: 2   FLFPGKNRPPNIIFILADDLGYNDVGFR-NPDIITPNIDKLARKGVVMTNSYSTHVCTPS 60

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R ALMTG+YP   GMQ+ VI    P    L  K LPQYLK  GY THA+GKWHLG  R+ 
Sbjct: 61  RHALMTGRYPYKTGMQNFVIPGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDE 120

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
             PT RGFDS YG   G   Y++H    T+  +   D  +N  +D    G +S DL  + 
Sbjct: 121 CLPTERGFDSFYGLLLGGGGYWNH----TYTLFGAYDWFNNKDLDLSANGTHSQDLMVDR 176

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----DISDPERRTYAG 255
              V A HN+ +PMFLY A    H          P E    FL      +    RR Y G
Sbjct: 177 LSAVFASHNREEPMFLYFAPQNPH---------TPSEATENFLRHYPSSMYTEIRRQYLG 227

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           +VS LD  VG ++  +  + ML+N+ ++F +DNG  +      +  N P RG K + ++G
Sbjct: 228 LVSGLDWMVGELLELVIANNMLDNTYIIFQSDNGGDAV-----EALNGPYRGGKGSLFEG 282

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGA 374
           G + V  I+SP LK+    ++   HI+DW+PT+ + AG E+     +DG+NQ  ++ KGA
Sbjct: 283 GSKVVNFIYSPLLKKVGYENNGWMHITDWMPTIISLAGGEVPLVEGIDGINQMSMVLKGA 342

Query: 375 KTKRSEILHNID 386
           ++KR+ +++NID
Sbjct: 343 ESKRTSMVYNID 354


>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
 gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
          Length = 488

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 143/404 (35%), Positives = 228/404 (56%), Gaps = 41/404 (10%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRS 82
           TA +KP+++ I+ADDLG++DV +  S+ + TPN+  LA+  N  +L+  YV  LCTP+RS
Sbjct: 26  TATQKPNVLFIMADDLGFSDVDWKDST-LHTPNLRHLAFHKNTALLSNSYVNQLCTPTRS 84

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           A MTG YP  +G Q+GV L  EP G+P     L + +++  Y+T+ +GKWHLG+ ++ + 
Sbjct: 85  AFMTGYYPFRVGTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFL 144

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQV-----DNKTIGIYS 193
           PT RGFD  YG++     Y++HS     +      +GLD+   +       D    G+YS
Sbjct: 145 PTNRGFDYFYGFYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYS 204

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE-------EVAKFLDIS 246
           TDL+T+ A++V+  HN SKP F++L++ AVH      P Q   +       + A F+  S
Sbjct: 205 TDLFTDVAMSVLDNHNNSKPFFMFLSYQAVHP-----PLQVSQQSKTIGQGKEATFILRS 259

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPL 305
                    M++ +D ++G ++  L+   + EN++++F +DNG  + FG      SN PL
Sbjct: 260 HAHS---TRMLTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTANFG-----ASNAPL 311

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           RG K T W+GG +    + SP   +       +FH+ DW  T+ +  G+EI D+  DG+N
Sbjct: 312 RGEKDTIWEGGTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLEI-DSYGDGIN 370

Query: 366 QWDVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           QW+ L  G  K +R + ++NIDN      + +A+R  D K + G
Sbjct: 371 QWEYLKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 408


>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
 gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
          Length = 497

 Score =  249 bits (635), Expect = 4e-63,   Method: Compositional matrix adjust.
 Identities = 155/388 (39%), Positives = 210/388 (54%), Gaps = 33/388 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+I+ I++DD GW DV +HGS  I TPNID LA  G  L Q YVQ +CTPSR+A MTG+Y
Sbjct: 55  PNIVYIISDDSGWKDVGYHGSD-IRTPNIDRLAAEGARLEQFYVQPMCTPSRAAFMTGRY 113

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P   G+Q  VI +   +GL L E  LPQ LK+AGY T   GKWHLG  +  Y P  RGFD
Sbjct: 114 PFRYGLQTAVIPQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPRQRGFD 173

Query: 150 SHYGYWQGLQDYYDHSCKATFEP--YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           S YG   G  D++ H   A   P  Y+          DN  IG          A+ VI +
Sbjct: 174 SFYGALLGEIDHFTHKA-ANGNPDWYRNNKALKEEGYDNILIG--------AEAVRVINK 224

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
           H++ KP+FLYLA  A H      P+QAP E + +   I+D  RR YA M+S +D+ +G V
Sbjct: 225 HDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVMDDEIGKV 279

Query: 268 IAALRKHGMLENSIVLFMADNG----APSFGIHSNKGS----NHPLRGMKSTPWDGGMRG 319
           +AAL K GM +N++++F +DNG    A   G    KG     N P R  K T ++GG R 
Sbjct: 280 VAALEKRGMRDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTTYEGGTRV 339

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
              +  P   +   +   + H+ D  PTL    G  +  +  LDG++ W  L +G  + R
Sbjct: 340 AGLVNWPGKIKPGTIDGPM-HVVDMYPTLAGLVGATLGKNKPLDGMDMWPTLAEGKPSPR 398

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYV 406
           +EI++NI+ +       AA+R  D K V
Sbjct: 399 TEIVYNIEPMT------AAVRQGDWKLV 420


>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
          Length = 486

 Score =  248 bits (633), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 146/361 (40%), Positives = 201/361 (55%), Gaps = 36/361 (9%)

Query: 47  FHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW 106
           +  S  I TPNIDA++  G+ L  +YVQ +CTPSRS L++G+Y IH G+QH +I  G P 
Sbjct: 5   YKSSDIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPS 64

Query: 107 GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD--- 163
            LPL  +LLP+ ++  GY T A GKWHLG+ +   TP  RGF +  GY  G +DYY    
Sbjct: 65  ALPLDTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTR 124

Query: 164 ----HSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAINVIAEHNKSKPM 214
               H C        G+D   + +     V N     YS   Y   A N I   +K KP 
Sbjct: 125 CIDHHKC--------GIDQNTDGEIFGERVYNADASEYSAFKYIRQAKNYIDGRDKDKPF 176

Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGNVIAALRK 273
           FLYL   +VHA     P +AP E +AK+   I D  R+ YAGMVS +DE +GN+ + L++
Sbjct: 177 FLYLPMQSVHA-----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKE 231

Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
            G+  N+I +F  DNG  ++      G+N PLRG K+  W+GG+ G   +   + +  +K
Sbjct: 232 SGLWNNTIFIFSTDNGGQAY----VGGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRK 287

Query: 334 ----VSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
               V++EL HISDW PT+  A   E   N   LDG++QW +L    K+ R EILHNID 
Sbjct: 288 GSELVNNELIHISDWYPTIMEATSCEKAQNAPDLDGMSQWKMLLDLEKSPRDEILHNIDE 347

Query: 388 V 388
           +
Sbjct: 348 M 348



 Score = 43.5 bits (101), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF++  DP E N +++   D+++ L D+L+ + ST VP      D ++ P      W PW
Sbjct: 425 LFDVIKDPTESNEISDYHEDIVEFLLDRLSYWNSTQVPINFPAVDPKSFPEYHQGFWKPW 484


>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
          Length = 602

 Score =  248 bits (632), Expect = 9e-63,   Method: Compositional matrix adjust.
 Identities = 148/395 (37%), Positives = 217/395 (54%), Gaps = 37/395 (9%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
            F      KKPH+I +L DDLG++D+ +     I +PNIDALA + L L +HYVQ  CTP
Sbjct: 35  TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 93

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SR+A +TG+Y I +GMQ GVI   EP G+PL E LL +  K+ GY T   GKWHLGF+  
Sbjct: 94  SRAAFLTGRYNIRMGMQSGVIRAPEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 153

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-------GIY 192
            + P  RGFD  YG++ G QD+Y H      E Y G     N  V+N TI       G Y
Sbjct: 154 KHCPQNRGFDRFYGFYLGSQDFYFHDS-GRLEAYPG-----NGDVENDTILDDFHTNGTY 207

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP---E 249
           ST L+ +  IN +A+H+ + P+F Y++   VH      P Q  ++    + + +D    E
Sbjct: 208 STKLFVDDFINDLAKHDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHE 262

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R   +  ++ LD  +G ++ AL++     N++++F +DNG    G  +   SN PLRG K
Sbjct: 263 RILISTKITTLDHHIGRMVTALKEKNYWNNTVLVFTSDNG----GQPNEGASNWPLRGSK 318

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI---NDTSLDGV 364
            T +DGG++  A + SP L+   K    + LFH+SDW PTL   +G ++       LDG 
Sbjct: 319 GTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLDGK 378

Query: 365 NQWDVLTKGAKTKRSEILHNID-----NVDNPQKY 394
              ++     K  R+E+LH +D     + D+P+ +
Sbjct: 379 IH-ELFVYKPKKARNEMLHYLDPLSQFDYDDPRHF 412


>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
 gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
          Length = 333

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 143/341 (41%), Positives = 195/341 (57%), Gaps = 25/341 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP+ + ILADDLG++D+       I TPN+DALA  G+IL  +YVQALCTPSR ALMTG
Sbjct: 1   RKPNFVFILADDLGYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTG 59

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YP    MQ  VIL        L  KLLPQYLK+ GY  H +GKWHLG+ R+   PT RG
Sbjct: 60  RYPSASAMQTSVILPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRG 119

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-----RHNMQVDNKTIGIYSTDLYTEAAI 202
           FD+ YG + G  DY+ H+    ++ +   D+       + QV +  +     DL  E   
Sbjct: 120 FDTFYGLYAGTGDYWSHTFFGKYDWHTNADIDFEANSTHSQVRSSYMNFVLQDLEMERLD 179

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLDISDPERRTYAGMV 257
            V  EH+   P+FLY A       N + P Q  DE +      KF DI    RR Y G+ 
Sbjct: 180 KVFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKFNDI----RRKYLGLT 230

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S LD  VG ++  L ++GM+ N+ V+F++DNGA        +G N P RG KS+ ++GG 
Sbjct: 231 SGLDAMVGKIVEKLIENGMMNNTYVIFVSDNGA-----DPPEGLNTPFRGGKSSLFEGGT 285

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           +  + I+SP LK+T+  +  L HI+DWLPTL   AG ++ D
Sbjct: 286 KSNSFIYSPLLKKTEYENDGLMHITDWLPTLVRLAGGQVMD 326


>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 517

 Score =  247 bits (631), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 152/379 (40%), Positives = 215/379 (56%), Gaps = 39/379 (10%)

Query: 14  TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
            L+F   F++ +   KP++IIIL DDLGW DVS+HG   IPTPNIDALA NG+ LN+ Y 
Sbjct: 3   VLIFLILFVSASIYSKPNVIIILTDDLGWGDVSYHGG-HIPTPNIDALAKNGVELNRFYA 61

Query: 74  QALCTPSRSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHA 128
             +C+P+R++L+TG   +HI   HG+I        E +GLP+  K++PQ+ KEAGY T  
Sbjct: 62  SPVCSPTRASLLTG---LHI-FNHGIIRPLANPTAEQYGLPVDLKIMPQFFKEAGYQTAL 117

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
            GKWHLG   E Y PT RGFD  YG+  G   Y+DH   +       LD   N +   + 
Sbjct: 118 SGKWHLGMHLEEYWPTNRGFDQSYGHMLGGIGYFDHVHSSR------LDWHRNEEPLFED 171

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
              YST+L    A+ +I   + ++P+FLY+A  A H      P QAPD+ +  F  I DP
Sbjct: 172 --GYSTELIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDP 224

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-----------GIHS 297
             R YA  V+ LD  +G +I A+   G+LE +I++F +DNG P F           G+  
Sbjct: 225 LDRAYAANVNALDAEIGKIIYAIEDQGILEETIIVFFSDNG-PVFDINPIIAVLAPGLVE 283

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            +GS   L G K++  +GG+R  A+I W   L  ++  S +   + D LPTL +AA IE+
Sbjct: 284 ARGSTAGLPGSKTSAREGGIRVPASIMWKGVLDNSK--SDQYIFVQDLLPTLLSAAEIEV 341

Query: 357 NDTSL-DGVNQWDVLTKGA 374
           +D  + DG ++W  L  GA
Sbjct: 342 SDLKVFDGTDKWQNLVTGA 360


>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 506

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 157/441 (35%), Positives = 236/441 (53%), Gaps = 43/441 (9%)

Query: 11  LTCTLLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           L C++L     ++   + K+PHI++ILA+DLGWNDV +H +  +  P ++ LA +G+I N
Sbjct: 3   LACSILVALVCVSVVKSDKQPHIVLILAEDLGWNDVGWH-NPDLKMPILNQLAADGVIFN 61

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
           Q YVQ  CTPSRSAL TG YP  I  QH ++L  E  GL L  K LP+ LK+ GY TH +
Sbjct: 62  QSYVQPACTPSRSALFTGYYPFKIKRQHQMLLNLEADGLSLDLKTLPEMLKDVGYLTHLV 121

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GKWHLGF +E Y P  RGFDS YG+     D Y  + +    P  G D R N  V  ++ 
Sbjct: 122 GKWHLGFCKEEYLPNKRGFDSFYGWLTLGTDLY--TKENVLAP--GYDFRDNTGVVQES- 176

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
             Y   +  E A++++  H K  P+FL  + +A+      +  + P +    + DI D  
Sbjct: 177 DTYLPFMLAERAVDIVMGHYKEYPLFLEFS-MALPG----KFLEVPQDYEDLYSDIDDDR 231

Query: 250 RRTYAGM-----------------VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            R + G+                 +S +D S+G ++ AL+  GM ++++ +F +DNGA  
Sbjct: 232 TRKFYGICNLNIVVSKNTKLPNRKLSAMDNSIGLLVEALKTRGMWDDTLFIFTSDNGA-- 289

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
               +  GSN PLRG  +T ++GG R     +   LK+   V++EL HI D   T+   A
Sbjct: 290 --FATESGSNWPLRGNIATLFEGGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELA 347

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN- 411
           G    ++ +DG++ W  ++KG  + R+E ++NID  DN     AA+RV D K + G  + 
Sbjct: 348 G-ATTESDIDGMDMWKTISKGEPSPRTEFVYNID--DNELSPGAAIRVGDYKLITGHPDL 404

Query: 412 ------NGQSDEWYGDTDNEI 426
                   +SD+WY   D  I
Sbjct: 405 MYPFRLLNRSDDWYNYGDEPI 425



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/59 (38%), Positives = 36/59 (61%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
           LFNI DDP E+NNLA+   + +++L  +L  Y+  +VPP ++ FD   D   +  +W P
Sbjct: 445 LFNIIDDPEERNNLADVYPNKVEELRHRLDEYREHLVPPESRIFDPSGDAVNFGGVWSP 503


>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 490

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 159/404 (39%), Positives = 220/404 (54%), Gaps = 42/404 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K+F  +  T L   +F   +A ++P+I++I+ADDLG  D+ + GS +I TP+IDALA  G
Sbjct: 5   KWFLPIFVTTL---SFGAASAAERPNIVVIVADDLGNADLGYRGS-KIKTPHIDALAKGG 60

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           + L  +Y   LCTP+R+ALMTG+YP+  G+Q  VI     +GLP  EK LPQ LKE GY 
Sbjct: 61  VRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVIFPSHRYGLPTDEKTLPQALKEVGYH 120

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  +GKWHLG   + + P  RGFD  YG   G  DY+      T E    +D + N +  
Sbjct: 121 TAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVDYF------TRERGGVVDWQRNGEFL 174

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
            +    Y  DL    A+ +IA H+K+KP+FLY A LA HA     P+QAP  ++  + DI
Sbjct: 175 RED--GYYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYNDI 227

Query: 246 SDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG---- 300
            D E  RTYAGM+S LD  VG V+AAL  +GM EN+I+ F  DNG  +  + +       
Sbjct: 228 FDNEMHRTYAGMISNLDRQVGRVVAALETNGMRENTIIFFTTDNGGATSALFATGARAPE 287

Query: 301 ----------------SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISD 343
                           SN P  G K +  DGG+   A + W   LK +  V +E  H  D
Sbjct: 288 EREASGGVALGSKPPCSNLPFSGGKGSLKDGGVHLPAIVNWPGRLKPS--VVNEPLHHVD 345

Query: 344 WLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNID 386
            +PTL A AG + + +   DG + W  L +G  +   EIL N++
Sbjct: 346 LMPTLLALAGGKGDPSKPFDGKDAWGTLAEGKPSPHEEILINVE 389


>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 495

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 173/526 (32%), Positives = 266/526 (50%), Gaps = 79/526 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHII I+ DDLGWNDV ++ +    TP ID LA NG+ L  +YV + C PSR+ LMTG
Sbjct: 21  KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTG 79

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++ I +G+ +       P  LPL E  + + LK AGY+ H +GKWH G++ +   P  RG
Sbjct: 80  RHAIQLGIPNDG-FGYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRG 138

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAAI 202
           FD+ +G+     D+Y HS           +  HN++     +  K IG YST ++     
Sbjct: 139 FDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEGK 190

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLD 261
           ++I  H+++KP+FLYL+  AVHA     P + P   + ++   I D +RRTYA M S +D
Sbjct: 191 DIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCVD 245

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           E++ N+  AL +  ML+NSI++F +DNG     + ++ G+N PLRG K T W+GG+R V 
Sbjct: 246 EAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAVG 302

Query: 322 AIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKR 378
            ++   L +  + +  + L HI+DW PTL   AG   +    L GV+QW+++T   K+ R
Sbjct: 303 FVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMITGEDKSGR 362

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSK 438
            ++L  ++  DN  K  A  R    K                  +N  D       + + 
Sbjct: 363 DDVLITLEE-DN--KAGAPYRTGIYK------------------NNAFD-------ITTH 394

Query: 439 AGITFNALKTKLQIKQKHAADPKANSSDALR--TILTDEKILELREFARVRCNYDNKGAH 496
           A I           K K    P    SD ++   I +D +I+E R+F             
Sbjct: 395 AAIKMG--------KWKLLTGPIGGPSDWVKPQEINSDHRIVEDRDF------------- 433

Query: 497 CNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
            ++T    L+NI DD  E  +L+E    ++ ++  KL  +    VP
Sbjct: 434 -DATRMVRLYNIEDDQTETTDLSEINQHIVIEMLAKLHNFSKQSVP 478


>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
          Length = 345

 Score =  246 bits (628), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 142/352 (40%), Positives = 194/352 (55%), Gaps = 18/352 (5%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F  T  L+ N   L     +KP+ + I+ADDLG++D+       I TPN+DALA  G+I
Sbjct: 4   LFLFTGVLVTNAYLLQKDKVRKPNFVFIMADDLGYHDIGLRNPDVI-TPNLDALASKGVI 62

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YVQALCTPSR ALM+G+YP    MQ  VI   E     L  K LPQYLKE GY  H
Sbjct: 63  LTNNYVQALCTPSRHALMSGRYPSASAMQSMVIQPMEARCAGLEYKFLPQYLKELGYKNH 122

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            +GKWHLG+ R+   PT RGFD+ YG + G  DY+ H      E +   D   N  +  +
Sbjct: 123 MVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGDYWSH------EIFGKYDWHINGDIHFE 176

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDIS 246
             G +S DL  E    V  EH+   P+FLY A       N + P Q  DE +  +  D  
Sbjct: 177 ANGTHSQDLEMEGLDKVFDEHDSKDPLFLYFA-----PQNPHTPSQPTDEFLNLYPEDKF 231

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           D  RR Y G+ S LD  VG ++  L  +G+++N+ V+F++DNGA        +G N P R
Sbjct: 232 DDIRRKYLGLTSGLDAMVGKIVEKLENNGLMDNTYVIFVSDNGA-----DPPEGLNTPFR 286

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           G K + ++GG +  + ++SP L +T   +  L HI+DWLPTL   AG ++ D
Sbjct: 287 GGKGSLFEGGTKSNSFVYSPLLNKTGYENDGLMHITDWLPTLVKLAGGKVMD 338


>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
          Length = 581

 Score =  246 bits (627), Expect = 3e-62,   Method: Compositional matrix adjust.
 Identities = 145/382 (37%), Positives = 210/382 (54%), Gaps = 32/382 (8%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
            F      KKPH+I +L DDLG++D+ +     I +PNIDALA + L L +HYVQ  CTP
Sbjct: 14  TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 72

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SR+A +TG+Y I +GMQ GVI   EP G+PL E LL +  K+ GY T   GKWHLGF+  
Sbjct: 73  SRAAFLTGRYNIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 132

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-------GIY 192
            + P  RGFD  YG++ G QD+Y H      + Y G     N  V+N TI       G Y
Sbjct: 133 KHCPQIRGFDRFYGFYLGSQDFYFHDS-GRLKAYPG-----NGDVENDTILDDLHTNGTY 186

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP---E 249
           ST L+ +  IN +A+H+ + P+F Y++   VH      P Q  ++    + + +D    E
Sbjct: 187 STKLFVDDFINDLAKHDPAVPLFNYVSFQDVHG-----PLQTVNKFKKHYENKTDTWTHE 241

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R   +  ++ LD  +G ++ AL++     N++++F +DNG    G      SN PLRG K
Sbjct: 242 RILISTKITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNG----GQPKEGASNWPLRGSK 297

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEI---NDTSLDGV 364
            T +DGG++  A I SP L+   K    + LFH+SDW PTL   +G ++       LDG 
Sbjct: 298 GTIYDGGLKSRAFIASPLLQNRMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLDGK 357

Query: 365 NQWDVLTKGAKTKRSEILHNID 386
              ++     K  R+++LH +D
Sbjct: 358 IH-ELFVYKPKKARNKMLHYLD 378


>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
          Length = 511

 Score =  246 bits (627), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 145/392 (36%), Positives = 217/392 (55%), Gaps = 35/392 (8%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           Y F L+          N  A ++P+I++I+ADDLG +DVS HG+  I TPN+D LA +G+
Sbjct: 6   YIFGLSA---------NIFAEQQPNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGV 55

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           +LN +YVQ +C+P+R +LMTG+YP  +G+QH  ++   P GLPL E ++PQY+KE GY T
Sbjct: 56  LLNNYYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKT 115

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQ 183
           + +GKW LGFF++ Y P  RGFD  +G   G QDYY          Y    G D+R    
Sbjct: 116 YMVGKWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQG 175

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
               T   Y   LY + A      HNK+ P+++Y+A  +VH      P QAP + V+ + 
Sbjct: 176 PVRDTSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYN 230

Query: 244 D---ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
           D   +S+ + R +AGMV+ LD +VG ++  L+  G+  N+++ F  DNG           
Sbjct: 231 DNVQMSNGQ-RVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG-------REDD 282

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIE-IN 357
           S+  L+G K+  W GG+ G      P +  K   K  ++L H+SD LPT+ +AA    + 
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342

Query: 358 DTSLDGVNQWDVL---TKGAKTKRSEILHNID 386
              +DG + W  L   + G   KR EI+  ++
Sbjct: 343 PKVIDGHDLWSRLQGDSLGQDEKRKEIIMQMN 374


>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score =  245 bits (626), Expect = 4e-62,   Method: Compositional matrix adjust.
 Identities = 142/376 (37%), Positives = 213/376 (56%), Gaps = 26/376 (6%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N  A ++P+I++I+ADDLG +DVS HG+  I TPN+D LA +G++LN +YVQ +C+P+R 
Sbjct: 13  NIFAEQQPNIVVIVADDLGKHDVSMHGAD-IYTPNLDMLARDGVLLNNYYVQPVCSPTRG 71

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +LMTG+YP  +G+QH  ++   P GLPL E ++PQY+KE GY T+ +GKW LGFF++ Y 
Sbjct: 72  SLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVGKWQLGFFKDNYL 131

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ---GLDMRHNMQVDNKTIGIYSTDLYTE 199
           P  RGFD  +G   G QDYY          Y    G D+R        T   Y   LY +
Sbjct: 132 PWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRDTSMKYQPFLYAD 191

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---ISDPERRTYAGM 256
            A      HNK+ P+++Y+A  +VH      P QAP + V+ + D   +S+ + R +AGM
Sbjct: 192 KAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQMSNGQ-RVFAGM 245

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V+ LD +VG ++  L+  G+  N+++ F  DNG           S+  L+G K+  W GG
Sbjct: 246 VTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG-------REDDSDFVLKGGKNDFWQGG 298

Query: 317 MRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL--- 370
           + G      P +  K   K  ++L H+SD LPT+ +AA    +    +DG + W  L   
Sbjct: 299 ITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLPPKVIDGHDLWSRLQGD 358

Query: 371 TKGAKTKRSEILHNID 386
           + G   KR EI+  ++
Sbjct: 359 SLGQDEKRKEIIMQMN 374


>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
 gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
          Length = 526

 Score =  243 bits (621), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 153/411 (37%), Positives = 224/411 (54%), Gaps = 30/411 (7%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N  + ++P+I+IILADDLGW DV +HGS  I TP+ID LA  G  LN+ Y    C+P+R+
Sbjct: 44  NNKSDERPNIVIILADDLGWGDVGYHGS-DIQTPHIDRLAKEGAKLNRFYATPFCSPTRA 102

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTG+ P+ +G+ + V++  E  G+ L E  LPQ  + AGY T  +GKWHLG   E +T
Sbjct: 103 ALMTGRDPLKLGVAYSVLMPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHT 162

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAA 201
           P  RGFD  YG+      Y+DH          G D + N + VD+   G Y+TD++   A
Sbjct: 163 PNARGFDLFYGHMHTQVSYFDHQIA------NGHDFQENGKPVDHN--GEYATDVHGAQA 214

Query: 202 INVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF---LDISDPERRTYAGMV 257
              I +  +K+KP  LY+  LA H+     P +AP+    K+   LD+    R+TYA MV
Sbjct: 215 ARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEALKDKYSSRLDLPGSTRKTYAAMV 269

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             +D+++G V+ AL + G+ +N+IV F +DNG    G  +    N P RG K   ++GG+
Sbjct: 270 DSMDQAIGKVLTALDEEGVADNTIVFFFSDNG----GFENYGSDNGPYRGGKLEVYEGGI 325

Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAK 375
           R  A + W   L    +V  E+  + D LPTL  AAG+E   +  +DGVN+W  +T    
Sbjct: 326 RVTAVMRWPEKLAAGSEV-DEIVSVMDLLPTLTHAAGVENGTEKKIDGVNRWSTITGEKV 384

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
           +KR   L+   N+    K+   +     K V   D+   S     D DN++
Sbjct: 385 SKRKGALYFASNIPVYNKFQLGVIDGKWKLVQNIDHKMTST----DFDNQL 431


>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
          Length = 330

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 144/341 (42%), Positives = 195/341 (57%), Gaps = 27/341 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKP+ + ILADDLG+ND+       I TPN+DALA  G+IL  +YVQA+C+PSR ALMTG
Sbjct: 1   KKPNFVFILADDLGYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTG 59

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YP    MQ  VI   E     L  K LPQYLK+ GY  H IGKWHLG+ RE   PT RG
Sbjct: 60  RYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRG 119

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQ--GLDMRH---NMQVDNKTIGIYSTDLYTEAAI 202
           FD+ YG +    DY++H     ++ +   G+D      + QV ++   IY  DL  E   
Sbjct: 120 FDTFYGLYASSGDYWEHGIMGMYDWHTEAGVDFEARGTHAQVGSRYWHIY--DLEMERLD 177

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLDISDPERRTYAGMV 257
            V  EH+   P+FLY A       N++ P Q  DE +      KF DI    RR Y G+ 
Sbjct: 178 KVFDEHDNKDPLFLYFA-----PQNSHTPHQPTDEFINLYSEDKFSDI----RRKYLGLT 228

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S LD  VG ++  L ++ M+ N+ V+F++DNGA        +G N P RG K + ++GG 
Sbjct: 229 SGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAPFRGGKGSLFEGGT 283

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           +  + I+SP LK+T+  +  L HI+DWLPTL   AG ++ D
Sbjct: 284 KSSSFIYSPLLKKTEYENDGLMHITDWLPTLVKLAGGQVKD 324


>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
          Length = 548

 Score =  242 bits (618), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 146/396 (36%), Positives = 218/396 (55%), Gaps = 26/396 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+II   ADD G++D  +  +S I TPNID L  +G+     Y    C+PSRS+ ++G+
Sbjct: 28  KPNIIWFFADDYGYHDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGR 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y    GMQHGVI +     + L    L  YLKE  Y THA GKWHLG+  +  TPT+RGF
Sbjct: 87  YAYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGF 146

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D+  G + G   YY+H+       +QG+ D  +  ++D    GI+S DL     +N + E
Sbjct: 147 DTFSGGYSGEGKYYEHTT------FQGMYDWHNGTKLDRSASGIHSQDLIEADLLNQLDE 200

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGN 266
            N   P+F Y    A H  NT+ P Q   E  A + D+  P  R+ Y G+VS +D  +G 
Sbjct: 201 -NDGTPLFYY---AAFH--NTHSPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGK 254

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V+A L++  M +N+ + F +DNG     +H  +G N P RG KS+ +DGG    +   SP
Sbjct: 255 VVAKLKEKDMFDNTYIFFSSDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSP 309

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEILHNI 385
            LK+    +  L HI+DWLPT+   AG E+ ++  +DG+ Q D++T    +KR+++++NI
Sbjct: 310 LLKKKGVEADGLLHITDWLPTIVKLAGGEVPESDGIDGIEQVDLITNAGPSKRTDMIYNI 369

Query: 386 D---NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW 418
           D    +  P   + A+   DL+Y       G+SD +
Sbjct: 370 DMELEIGTPD--FGAMAARDLRYKLIWGFEGRSDGY 403


>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 658

 Score =  242 bits (618), Expect = 4e-61,   Method: Compositional matrix adjust.
 Identities = 152/409 (37%), Positives = 223/409 (54%), Gaps = 43/409 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+P+I+ +LADDLGW+DV +HGS  I TPNID LA  G+ L  +YV + C PSR  L+TG
Sbjct: 202 KRPNIVFVLADDLGWHDVGYHGSI-IDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTG 260

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF--REVYTPTF 145
           +Y I  G+ +G + +     + + E  L   L EAGY T  +GKWHL  F  RE Y P  
Sbjct: 261 RYRIRTGL-YGDVCD----FMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECY-PAH 314

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGF +  G+  G Q+Y+ H    +   Y      ++  +  +  G YST +Y E A  +I
Sbjct: 315 RGFQTFLGFHGGSQNYFTHRRGGSNSEYDF--WANDTSIGREYDGRYSTMVYAEEAQRII 372

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDESV 264
             H   +P+FLYL+  AVH+     P   P     K+   I D +RR YA M S +DE++
Sbjct: 373 RHHRTEQPLFLYLSFQAVHS-----PLLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAI 427

Query: 265 GNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           GNV   L++ G+   ++ +F +DNG  P+FG     GSN PLRG K   ++GG+R V  +
Sbjct: 428 GNVTDTLKETGLWNKTVFIFSSDNGGTPTFG-----GSNWPLRGRKGLWYEGGIRAVGFV 482

Query: 324 WSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            S  L    T   + EL HI+DW PTL   A  +++    LDG++QW+ +++G KT R E
Sbjct: 483 NSYLLSNHVTGSTNHELIHITDWFPTLLRLAKTDLHRLKPLDGIDQWETISQGKKTNRKE 542

Query: 381 ILHNI------DNVDNPQKY-----------YAALRVDDLKYVAGTDNN 412
           +L N        N++  ++            +AA+RV D K + G  ++
Sbjct: 543 VLLNFLPGNISGNINENREGEATADYFDTMPFAAIRVGDWKLLTGNKDS 591


>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 355

 Score =  241 bits (615), Expect = 8e-61,   Method: Compositional matrix adjust.
 Identities = 139/354 (39%), Positives = 207/354 (58%), Gaps = 29/354 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K PHII I+ DDLGWNDV ++ +    TP ID LA NG+ L  +YV + C PSR+ LMTG
Sbjct: 21  KPPHIIFIVVDDLGWNDVGYN-NPVFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTG 79

Query: 88  KYPIHIGM-QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           ++ I +G+ Q G      P  LPL E  + + LK AGY+TH +GKWH G++ +   P  R
Sbjct: 80  RHAIQLGIPQDG--FGYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNR 137

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-----VDNKTIGIYSTDLYTEAA 201
           GFD+ +G+     D+Y HS           +  HN++     +  K IG YST ++    
Sbjct: 138 GFDTFFGFVGAGIDHYTHS--------DHFNHMHNLRKNDDCIAKKYIGKYSTTIFANEG 189

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRL 260
            ++I  H+++KP+FLYL+  AVHA     P + P   + ++   I D +RRTYA M S +
Sbjct: 190 KDIINAHDQNKPLFLYLSFSAVHA-----PLEVPSSYLKQYESTIHDEDRRTYAAMTSCV 244

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DE++ N+  AL +  ML+NSI++F +DNG     + ++ G+N PLRG K T W+GG+R V
Sbjct: 245 DEAIANITEALEEKNMLKNSIIVFTSDNGG---ALVNSAGNNWPLRGGKHTSWEGGVRAV 301

Query: 321 AAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
             ++   L +  + +  + L HI+DW PTL   AG   +    L GV+QW+++T
Sbjct: 302 GFVYGDLLPRHSRGTENTGLMHITDWFPTLITLAGGNPHVGKKLYGVDQWNMIT 355


>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 418

 Score =  241 bits (615), Expect = 9e-61,   Method: Compositional matrix adjust.
 Identities = 164/480 (34%), Positives = 230/480 (47%), Gaps = 72/480 (15%)

Query: 94  GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
           G+QH  I EGE WGLPL   L+PQ+ K+ GYATH +GK  +G F    T   R    H  
Sbjct: 1   GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRH-- 56

Query: 154 YWQGLQDYYDHSCK---ATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
                Q      CK      EP    + GLD+  N        G Y TD+ T+ A+++IA
Sbjct: 57  ----TQKSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAVSIIA 112

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+ +KP+FLYL+H + H+  T  P QAP++ +AKF  I +  R  YAGMV  +D++ G 
Sbjct: 113 SHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDDAFGR 172

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++ AL   G+L N+IV+F +DNGA       N+G N PLRG K T W+G +R  A IWSP
Sbjct: 173 LVEALSGAGVLNNTIVVFSSDNGAVPIIEFPNRGFNWPLRGAKKTLWEGAVRVPAFIWSP 232

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNI 385
            L+ + +VS ++ HI DWLPT  + AG  ++     DG + W  L++G K+ R E+L NI
Sbjct: 233 LLESSGRVSDQMMHIVDWLPTFYSVAGGNLSHLGEQDGFDMWKALSEGTKSPREEMLLNI 292

Query: 386 DNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNA 445
           D V N     ++LR  + K V GT +NG  D  +  T           V   +  +  + 
Sbjct: 293 DPVLN----TSSLRYKNHKVVLGTYHNGSFDHRFKTTGRRPGWRRRATVRCGECNLAKDN 348

Query: 446 LKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCL 505
           +              K +S   L  ++ D                            PC 
Sbjct: 349 V--------------KPSSPPYLFDVVAD----------------------------PCE 366

Query: 506 FNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK-PFDKRADPARWNNIWVPW 564
            N         NLA    ++L+QL  +L  Y +T V P N    D R+ P R N  W PW
Sbjct: 367 LN---------NLAPEHPEILRQLLKRLEDYATTAVDPENVYETDARSFPERHNGTWAPW 417


>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
          Length = 558

 Score =  241 bits (614), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 143/402 (35%), Positives = 216/402 (53%), Gaps = 25/402 (6%)

Query: 29  KPHIIIILADDL-----GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           KP++I   ADD      G+ D  +  S+ I TPNID L  +G+     Y    C+PSRS+
Sbjct: 65  KPNVIWFFADDYDLALPGFQDAGYRNSA-IHTPNIDKLVGDGISFTNAYSSQQCSPSRSS 123

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            ++G+YP   GMQHGVI +  P  + L  K L  YLK+  Y THA+GKWHLG+  +  TP
Sbjct: 124 FLSGRYPYKSGMQHGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTP 183

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           T+RGFD+  G + G  DYY+H+    +      D  +  + D    G++S DL  EA + 
Sbjct: 184 TYRGFDTFSGGYSGEGDYYEHTTHGGW-----YDWHNGTETDLTAAGVHSQDL-VEADMM 237

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDE 262
              + N   P F Y    A H  NT+ P Q   E  A + D+ + P R+ Y G+VS +D 
Sbjct: 238 KNLDENDGTPFFYY---TAFH--NTHSPLQPKPEHKALYNDMDATPSRKKYLGLVSGMDA 292

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            +GNV+A L+  GM +N+ + F +DNG        N+G N P RG KS+ ++GG    + 
Sbjct: 293 VIGNVVAKLKAKGMFDNTYIFFSSDNGGD-----VNEGDNSPWRGAKSSLFEGGCHAHSW 347

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
             SP +++    +  L H++DWLPT+   AG  + D  +DG+ Q D++T    +KR+ ++
Sbjct: 348 ASSPLIRKKGVEADGLLHMTDWLPTILTLAGGTVPD-GIDGIEQVDLITNAGLSKRTNMI 406

Query: 383 HNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           +NID  ++     +  +   DL+Y       G+SD +  D D
Sbjct: 407 YNIDMELETGPPNFGCMAARDLRYKLIWGFEGRSDGYGVDAD 448


>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
          Length = 1683

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 158/507 (31%), Positives = 247/507 (48%), Gaps = 64/507 (12%)

Query: 61   LAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK 120
            L    L L  +YV  +CTP+RS LM+G+Y IH G+QH VIL  +P GLPLT+ + P+  K
Sbjct: 1226 LIQESLHLQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFK 1285

Query: 121  EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            + GY TH +GKWHLGF+ E   P  RGF+SHYGY  G + +Y+HS     +   G+D R 
Sbjct: 1286 DCGYRTHMVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHS--QFMQGQHGVDFRD 1343

Query: 181  NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
                 N + G YS DL+ +   +++  H+  + +++Y+    VH      P +AP   V 
Sbjct: 1344 GGASTNSSWGQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHY-----PLEAPQHYVD 1398

Query: 241  KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
            +F  I D +RR YA M   +D++VG +I + +  G+ E++IV F  DNG  +       G
Sbjct: 1399 QFSWIKDRDRRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTL----YGG 1454

Query: 301  SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDT 359
            +N PLRG+K+T W+GG+RG+ +I  P +   ++   +L H+ D  PTL            
Sbjct: 1455 NNWPLRGLKNTLWEGGIRGIGSIKIPGVSADKR--DQLMHVVDMFPTLLDLTSCPNTRKV 1512

Query: 360  SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWY 419
              DG +Q  +L    +++    L N               VD L+   G DN      W+
Sbjct: 1513 KFDGKSQAKMLVDNDESQSDSFLIN---------------VDPLRKNKGADNR----TWH 1553

Query: 420  GDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRT--ILTDEKI 477
             + D  +           +AG+ +   K              A SSD   T  I   E  
Sbjct: 1554 SEFDVRV-----------QAGLRWQQWKLLT----------GAPSSDNYPTGHIYPPEWP 1592

Query: 478  LELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYK 537
             + R+ +  + N++      +S +   LF+I +DP E   ++   T++++++ + L+ Y 
Sbjct: 1593 QDGRKNS--QSNFE------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYN 1644

Query: 538  STMVPPGNKPFDKRADPARWNNIWVPW 564
            ST VP      D    P +    W PW
Sbjct: 1645 STAVPCRWPNPDPAGLPDQNGGFWKPW 1671


>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 635

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 144/383 (37%), Positives = 213/383 (55%), Gaps = 35/383 (9%)

Query: 30  PHIIIILADDLGWNDVSF----HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           PH+++ILADD+G+ND+ +    H ++Q+ TP +D LA  G+ L ++Y Q  C+PSR AL+
Sbjct: 118 PHLVLILADDMGYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALL 177

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG YP   G+ HGVI+    WGLPL   L+PQ+L    Y +HAIGKW +G +   + PT 
Sbjct: 178 TGLYPASTGLYHGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTG 236

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVDNKTIG-----IY 192
           RGF S+ G++    DYY H   A    Y           D   ++  D    G      Y
Sbjct: 237 RGFHSYVGFYGTDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNRY 296

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDISDP 248
           STD++T+ A+ ++   +   P+FLY+A  AVH      P +A D  +    A FL  +  
Sbjct: 297 STDIFTDRAVELLRTESARNPLFLYVAFNAVHG-----PLEADDATIDRFNASFLYEART 351

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           ER T+A     +D +V +++  +   G  +NSIVLF +DNGA      +  GSN PLRG 
Sbjct: 352 ERATFAAAALLMDRAVESIVKTMHAVGAYDNSIVLFASDNGATL--AQTGGGSNWPLRGS 409

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
           K TP++GG+R  A + SP L   ++    + LFH++D LPTL  AAG     + +DGVNQ
Sbjct: 410 KFTPYEGGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGS-PPSDIDGVNQ 468

Query: 367 WDVLTKG---AKTKRSEILHNID 386
           +  L      A + R+++L ++D
Sbjct: 469 YHALFGSDATASSPRADVLVHVD 491


>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 614

 Score =  240 bits (613), Expect = 1e-60,   Method: Compositional matrix adjust.
 Identities = 151/438 (34%), Positives = 230/438 (52%), Gaps = 45/438 (10%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCT 78
           A L   A  +PH++ ++ DD+G+NDV +  S     TP +D L  +G+ +++ Y Q +CT
Sbjct: 53  AKLAAAAETRPHVVFVMLDDVGFNDVGYASSDLGEMTPFLDGLMADGVRVDRLYGQQVCT 112

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           PSR+A++TGK PIH+ +QH  +   EPWGLP  E  L QYLK  GY+TH +GKWHLG + 
Sbjct: 113 PSRAAMLTGKLPIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYN 172

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
              TP  RGFDS YG++ G  DY  H     +      D+  + +      G + T L  
Sbjct: 173 NASTPLNRGFDSFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVTDAHGQHQTSLMN 232

Query: 199 EAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDISDPERR 251
           E AI V+  H    KS+P+F Y+++      N + P Q P E +    A  LDI + +R+
Sbjct: 233 ERAIAVLERHAVEKKSEPVFAYVSY-----PNAHLPLQPPTELLERRNATLLDIPNHDRK 287

Query: 252 TYAGMVSRLDESVGNVI-AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
            +A ++   DES+ N+    L+  G+ + ++++  +DNG     +    GSN+PLRG K 
Sbjct: 288 NFAALMMNADESLANLTNRGLKATGLYDTAVIVIASDNGGQVSAM--GGGSNYPLRGEKK 345

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAA---------------G 353
             ++GG+R  A I SP L +  + SS  +L H+SDWLPTL   A               G
Sbjct: 346 YLFEGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGALGASADELAGLKLPRG 405

Query: 354 IEINDTSLDGVNQWDVLTKGAKTK----RSEILHNIDNVDNPQKYY----AALRVDDLKY 405
           +  +  ++DGV+ W  +           R E+L NID +D    Y     AAL + D K+
Sbjct: 406 VVNDYGAIDGVDHWAHVVGAVDAAAAAPRDELLLNIDYLDGSGDYLGYFRAALIIGDWKF 465

Query: 406 VAGTDNNGQSDEWYGDTD 423
           +     N ++  W+  +D
Sbjct: 466 I----YNEKNISWFAPSD 479


>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
          Length = 382

 Score =  238 bits (606), Expect = 1e-59,   Method: Compositional matrix adjust.
 Identities = 141/388 (36%), Positives = 214/388 (55%), Gaps = 27/388 (6%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
           ++R  FF     +   D+         P+II I  DD G+NDV F   + I +PN+DALA
Sbjct: 2   FSRSNFFPAAANIKRGDS--------PPNIIWIFGDDYGFNDVGFRNPNVI-SPNMDALA 52

Query: 63  YNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
            +G+IL   Y    C+PSR + ++G+Y     MQHGVIL+ +P  L L   +LP YLKE 
Sbjct: 53  QSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKEL 112

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           GY THA GKWHLG+ R+  TPT RGFDS  G + G  ++Y+H+  AT   Y   D     
Sbjct: 113 GYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEGEFYEHT-TATGGYY---DWHLGT 168

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           +VD   IG +S DL        + E+++S P+F+Y+A   VH+     P     E +A +
Sbjct: 169 EVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALY 223

Query: 243 LDI---SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            D+   +D     ++G+VS +D  +G ++  +++ G+ EN+ +LF +DNG         +
Sbjct: 224 DDLDASTDYSGFFFSGLVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGG-----DVGE 278

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
           G N P RG KST W+GG +  +   SP L  +   ++ L H++DWLPT+   AG  +  +
Sbjct: 279 GDNSPHRGGKSTLWEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPAS 338

Query: 360 -SLDGVNQWDVLTKGAKTKRSEILHNID 386
             LDG  Q D++     + R+ +++NID
Sbjct: 339 DGLDGFVQTDMVINNGDSARTTMIYNID 366


>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
 gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
          Length = 452

 Score =  237 bits (604), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 137/394 (34%), Positives = 218/394 (55%), Gaps = 43/394 (10%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALCTPSRSALMTGKYPIHI 93
           +ADDLG++DV +  S+ + TPN+  LA+  N  +L+  YV  LCTP+RSA MTG YP  +
Sbjct: 1   MADDLGFSDVDWKDST-LHTPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRV 59

Query: 94  GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
           G Q+GV L  EP G+P     L + +++  Y+T+ +GKWHLG+ ++ + PT RGFD  YG
Sbjct: 60  GTQNGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYG 119

Query: 154 YWQGLQDYYDHSC----KATFEPYQGLDMRHNMQV-----DNKTIGIYSTDLYTEAAINV 204
           ++     Y++HS     +      +GLD+   +       D    G+YSTDL+T+ A++V
Sbjct: 120 FYGPQTGYFNHSADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSV 179

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY--------AGM 256
           +  HN SKP F++L++ AVH      P Q   +       I   +  T+          M
Sbjct: 180 LDNHNNSKPFFMFLSYQAVHP-----PLQVSQQSKT----IGQGKEATFILRSHAHSTRM 230

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTPWDG 315
           ++ +D ++G ++  L+   + EN++++F +DNG  + FG      SN PLRG K T W+G
Sbjct: 231 LTAMDFAIGRLVEYLKASNLYENTVIVFTSDNGGTANFG-----ASNAPLRGEKDTIWEG 285

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA- 374
           G +    + SP   +       +FH+ DW  T+ +  G+EI D+  DG+NQW+ L  G  
Sbjct: 286 GTKTTTFVHSPMYIEEGGTRDMMFHVVDWHATILSITGLEI-DSYGDGINQWEYLKTGRP 344

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           K +R + ++NIDN      + +A+R  D K + G
Sbjct: 345 KFRRFQFVYNIDN------HGSAIRDGDYKLIVG 372


>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 143/378 (37%), Positives = 208/378 (55%), Gaps = 23/378 (6%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
            +F + F +    KKPHII ++ DDLG++D+ +     I +PNID LA N L +  +Y Q
Sbjct: 9   FVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVNDDVI-SPNIDFLAKNALHIENYYNQ 67

Query: 75  ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
             CTPSR+A MTG+Y I  GMQ GVI   EP  +PL+E LLPQ LK+ GY T   GKWHL
Sbjct: 68  PSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHL 127

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
           GF+ E + P  RGFD  +G++ G QDY+ H        Y+    R  +++D    G YST
Sbjct: 128 GFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPC-YREKVRLDLN--GTYST 184

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
               E  I  + E++   P+F +L+   VH      P Q   E    F    + +   Y 
Sbjct: 185 KAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLSEYSTLF---KEKDWTHYR 236

Query: 255 GMVSR----LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
            M+SR    +D  +G V+  L++ G  +++I++  +DNG    G      SN PLRG K 
Sbjct: 237 KMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNG----GQTREGASNWPLRGRKG 292

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
             ++GG+R  A I SP L  + K SS   +FH++DW PTL   +G +  D++LDG  Q D
Sbjct: 293 DVFEGGIRSRAFIHSPKLPNSLKGSSFPHVFHVTDWFPTLLRFSGCQQPDSNLDGEAQ-D 351

Query: 369 VLTKGAKTKRSEILHNID 386
           +  +  + KR+ +L+ +D
Sbjct: 352 IFHQKPEVKRTHLLNFLD 369


>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
          Length = 316

 Score =  236 bits (603), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 127/303 (41%), Positives = 181/303 (59%), Gaps = 14/303 (4%)

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LP  +  LPQ L+E GY+TH +GKWHLGF+++   PT RGFD+
Sbjct: 2   IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           ++G   G  +YY + SC        G D+     V     G YST LYT+    ++A H+
Sbjct: 62  YFGSLTGSVNYYTYDSCDG--PGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHD 119

Query: 210 -KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
            +S+P+F++L+  AVH      P Q P E +  +  + +  RR YA MVS +DE+V N+ 
Sbjct: 120 PRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNIT 174

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             LRK+G  ENSI++F  DNG        + GSN PLRG K T W+GG+RG+  I SP L
Sbjct: 175 YGLRKYGYYENSIMIFSTDNGGQPL----SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLL 230

Query: 329 KQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
           ++ ++VS  L HI+DW PTL   A G E +   +DG + W+ ++ G ++ R EILHNID 
Sbjct: 231 RKKKRVSKALVHITDWYPTLVGLAGGKESHIEGVDGYDVWEAISDGRESPRLEILHNIDP 290

Query: 388 VDN 390
           + N
Sbjct: 291 LYN 293


>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
 gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
          Length = 496

 Score =  236 bits (602), Expect = 2e-59,   Method: Compositional matrix adjust.
 Identities = 148/412 (35%), Positives = 219/412 (53%), Gaps = 28/412 (6%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
             LLF        A +KP+I+IILADDLG+ D+ F GS  I TPN+D LA  G++LN+ Y
Sbjct: 6   LILLFVAFISGAKASEKPNIVIILADDLGYADLGFRGS-DIQTPNLDRLAAEGIVLNRFY 64

Query: 73  VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
              +CTP+RSALMT + PI +G  +  +   E  G+   E  +P+  K+AGY T  IGKW
Sbjct: 65  SLPICTPTRSALMTARDPIKLGTAYAGLQPWENGGVSPDEHFMPESFKKAGYQTAMIGKW 124

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--G 190
           H+G   E   P  RGFD  +G+     DYY H+              H++Q + K++   
Sbjct: 125 HIGRQYESLVPHHRGFDHFFGHLNTQVDYYTHASAGG----------HDLQENGKSLKRD 174

Query: 191 IYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF---LDIS 246
            Y+TD++ + ++  + E  + SKP  LY+  LA H+     P QAP+   AK+   L+  
Sbjct: 175 AYATDIHGDESVRYLKEIRDPSKPFLLYVPFLAPHS-----PMQAPEALEAKYKSRLNFP 229

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
            P +RTYA MV  +D+++G ++  L + G+ +N+I+LF +DNG    G     G N P R
Sbjct: 230 VP-KRTYAAMVDSMDQAIGRIMDELDRQGVSDNTIILFFSDNG----GFAGFGGVNKPYR 284

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
           G K   ++GG+R  A +  P        +++   + D  PTL AAA IE+ N+  +DG+N
Sbjct: 285 GGKLETFEGGIRVNAIMRWPGAIPAGSKTAQAISVMDVFPTLAAAADIEMHNEKKIDGIN 344

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
               LTK    KR + L    N      +  A+   D K V   D+N +S E
Sbjct: 345 ILKALTKAKNIKRPKPLFFASNSPEYNSFRLAMIDGDHKLVQIIDHNNRSTE 396


>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
          Length = 370

 Score =  236 bits (602), Expect = 3e-59,   Method: Compositional matrix adjust.
 Identities = 133/353 (37%), Positives = 207/353 (58%), Gaps = 26/353 (7%)

Query: 41  GWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVI 100
           G++D+ F     I TPNIDALA  G+I   +YVQ++CTPSR AL+TG+YP    MQ+ VI
Sbjct: 10  GYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRSAMQNLVI 68

Query: 101 LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD 160
           +  +     L  K LP+YLK+ GY+THA+GKWH+G+ RE   PT RGFDS +G + G   
Sbjct: 69  MSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFGLYDGDGY 128

Query: 161 YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
           Y++H+  +T  P    D  ++  V  +  GI+S DL  E    ++   N  +P FLY + 
Sbjct: 129 YWNHT--STVIP-GAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNAKEPFFLYFSP 185

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPE------RRTYAGMVSRLDESVGNVIAALRKH 274
              H          P +  A+FL++  PE      RR Y G+VS LD  VG ++  L+++
Sbjct: 186 QNPH---------TPSQPQAEFLNLY-PENRYPDIRRKYLGLVSGLDRMVGGIVDGLKRN 235

Query: 275 GMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
           G++ N+ ++F++DNGA        +G N P RG K + ++GG +  + I+SP L +    
Sbjct: 236 GLMNNTYIIFVSDNGA-----DPEEGLNDPFRGGKGSLFEGGTKSASFIYSPLLNKAGYE 290

Query: 335 SSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNID 386
           ++ L HI+DW+PT+   AG  I +   LDG++Q D++  G  ++R +++++ID
Sbjct: 291 NNGLMHITDWMPTILKLAGGSIPEGEQLDGIDQSDMVKNGKASQRKKMVYSID 343


>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
          Length = 560

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 125/285 (43%), Positives = 177/285 (62%), Gaps = 14/285 (4%)

Query: 23  NTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           ++ AP +P HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSR
Sbjct: 94  SSAAPSQPPHIIFILTDDQGYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSR 152

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGFFR+  
Sbjct: 153 SQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKEC 212

Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            P+ RGFD+  G   G  DYY + +C        G D+     V     G YST LY + 
Sbjct: 213 LPSCRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGETVAWGLSGQYSTMLYAQR 270

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A +++A H+   P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +
Sbjct: 271 ASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 325

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
           DE+V N+  AL+++G   NS+++F +DNG  +F    + GSN PL
Sbjct: 326 DEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPL 366



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 52/219 (23%), Positives = 78/219 (35%), Gaps = 76/219 (34%)

Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKY 405
           +  G + W +  +G  + R+EILHNID + N  ++               AA+RV + K 
Sbjct: 357 TFSGGSNWPLXXEGRASPRTEILHNIDPLYNHARHGSLEGGFGIWNTAVQAAIRVGEWKL 416

Query: 406 VAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSS 465
           + G       D  YGD       + P + L S  G  +N L     I+Q           
Sbjct: 417 LTG-------DPGYGD-------WIPPQTLASFPGSWWN-LARMASIRQAV--------- 452

Query: 466 DALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDL 525
                                                  LFNI+ DP E+ +LA  + D+
Sbjct: 453 --------------------------------------WLFNISADPYEREDLAGQRPDV 474

Query: 526 LKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           ++ L  +LA Y  T +P      + RA P      W PW
Sbjct: 475 VRTLLARLADYNRTAIPVRYPAANPRAHPDFNGGAWGPW 513


>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  235 bits (599), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 141/378 (37%), Positives = 206/378 (54%), Gaps = 23/378 (6%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
            +F + F +    KKPHII ++ DDLG++D+ +     I +PNID LA N L +  +Y Q
Sbjct: 9   FVFRECFGHGRFSKKPHIIFVMIDDLGFDDLGYVNDDVI-SPNIDFLAKNALHIENYYNQ 67

Query: 75  ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
             CTPSR+A MTG+Y I  GMQ GVI   EP  +PL+E LLPQ  K+ GY T   GKWHL
Sbjct: 68  PSCTPSRAAFMTGRYNIRYGMQSGVIKPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHL 127

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT---FEPYQGLDMRHNMQVDNKTIGI 191
           GF+ E + P  RGFD  +G++ G QDY+ H        +EP     +R ++       G 
Sbjct: 128 GFYTEKHCPQNRGFDRFFGFYLGSQDYFYHDSGNNCYLYEPNGTEKVRLDLN------GT 181

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ER 250
           YST    E  I  + E++   P+F +L+   VH      P Q    EV K     +  ER
Sbjct: 182 YSTKAIAEDFIAKLDEYDPETPLFEFLSFQEVHG-----PLQWLPAEVLKMAGGWEGWER 236

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           +  +  +  +D  +G V+  L++ G  +++I++  +DNG    G      SN PLRG K 
Sbjct: 237 KMLSRKIVIVDHFIGEVVKKLKEKGFWKDTILVITSDNG----GQTREGASNWPLRGRKG 292

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWD 368
             ++GG+R  A I SP L  + K SS   +FH++DW PTL   +G +  D++LDG  Q D
Sbjct: 293 DVFEGGIRSRAFIHSPKLPDSLKGSSFQHVFHVTDWFPTLLRFSGCQQPDSNLDGKAQ-D 351

Query: 369 VLTKGAKTKRSEILHNID 386
           +     + KR+ +L+ +D
Sbjct: 352 IFHNKPEVKRTHLLNFLD 369


>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
 gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
          Length = 224

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 117/226 (51%), Positives = 149/226 (65%), Gaps = 3/226 (1%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II +LADDLGW DVSF G  QIPTPN+D LA  G+ILN +YV  LCTPSR ALM+G Y
Sbjct: 2   PNIIFLLADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLY 61

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           PIH G+QH V L  EP GLPL   ++P++LK  GY TH IGKW+LG+++E YTPT RGFD
Sbjct: 62  PIHTGLQHYVQLPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFD 121

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           S YG+  G +DYYDH+         GLD              YSTDL+T+ A+ +I +H+
Sbjct: 122 SFYGFLNGGEDYYDHTILFV---STGLDFWDGTTPVRNQSHHYSTDLFTKKALALIKDHD 178

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           ++KPMFLY +H AVH+G+     +AP   +A F  I +  R   AG
Sbjct: 179 QAKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224


>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 536

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 140/371 (37%), Positives = 202/371 (54%), Gaps = 19/371 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           +PH+ +I+ADDLG+ DVS++G    ++ + TP +D LA +G+ L + Y Q  CTP+R+AL
Sbjct: 32  RPHVFLIVADDLGYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAAL 91

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG+YP + GMQH V+     W LP    LLP  L E GY  HAIGKW +G  R   TPT
Sbjct: 92  LTGRYPSNTGMQHEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPT 150

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            RGFDSH GY+     Y +H+   +       DM H+      T   YST L+ + A+ +
Sbjct: 151 ARGFDSHLGYYGAEITYDEHAALRSCSNGTIRDMNHDGATLAATEDRYSTHLFADHAMAL 210

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK----FLDISDPERRTYAGMVSRL 260
           +        +FLYL   AVH     +P  A    V +    FLD SD  R T+A +   L
Sbjct: 211 VDREADEYKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALEL 265

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D +V   +   +  G  ++++  FM+DNGA      S  GSN PLRG K T ++GG+R  
Sbjct: 266 DRAVERFVEFTKARGAYDDAVFFFMSDNGATL--AQSGGGSNWPLRGSKFTAYEGGVRVP 323

Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTK 377
           A + S  +   ++ ++   LFH+ D LPTL A A   +     +DGV+Q D +  GA   
Sbjct: 324 AFLHSARVPPARRGAAHAGLFHVVDVLPTLAAFAESADRVPAGVDGVDQRDAVLHGAAAP 383

Query: 378 RSEILHNIDNV 388
           R+E L + D++
Sbjct: 384 RTEALLHADDL 394


>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
 gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
          Length = 519

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 141/376 (37%), Positives = 209/376 (55%), Gaps = 39/376 (10%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
             L+F+ + L++   ++P+IIII+ DDLGW DVS++G   I TPNID LA +GL +N+ Y
Sbjct: 6   LILVFSISLLSS---ERPNIIIIITDDLGWGDVSYNGGP-INTPNIDKLADDGLQMNRFY 61

Query: 73  VQALCTPSRSALMTGKYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATH 127
               C+P+R+AL TG       +++G+I        E +GLPL  K+LP+YLKE GY T 
Sbjct: 62  SAPTCSPTRAALFTGIN----SLKNGIIRPLNNPTAERYGLPLKHKILPEYLKEIGYQTA 117

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
             GKWHLG F + Y P  RGF+S YG+  G   Y+DH+          LD   N ++  +
Sbjct: 118 LSGKWHLGMFSDEYLPRNRGFESTYGHLGGGIGYFDHALSGR------LDWHRNGEILYE 171

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
               YST L  + AI +I   N   P+FLY+A  A H      P QA ++ +    DISD
Sbjct: 172 D--GYSTTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISD 224

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH----------- 296
            + R YA  +  LD  +G +I A+   G+L  +I++F +DNG P F I            
Sbjct: 225 KKERVYAANIITLDREIGKIIDAIESKGILNETIIIFFSDNG-PVFDIDPIAATIAPDIL 283

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            +KG+   L+G K + ++GG+R  A I+   + + + +S++ F + D LPTL  A  I  
Sbjct: 284 DSKGNTLGLKGSKGSAYEGGIRVPAVIYYKGILE-KSLSNQFFFVDDILPTLFGALNINP 342

Query: 357 NDTSLDGVNQWDVLTK 372
           N  ++ GVN+W+ L K
Sbjct: 343 NQNNITGVNRWNDLIK 358


>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
          Length = 328

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 135/336 (40%), Positives = 189/336 (56%), Gaps = 18/336 (5%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+ + ILADDLG+ND+ F  +  + TPN+D LA  G+IL  +YVQA+CTPSR ALMTG+Y
Sbjct: 1   PNFVFILADDLGYNDIGFR-NPDVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRY 59

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P    MQ+ VI   +     L  K LPQYLKE GY  H IGKW+LG+ RE   PT RGFD
Sbjct: 60  PYRSAMQNFVINPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFD 119

Query: 150 SHYGYWQGLQDYYDHSCKATFEPY-QGLDMRHNMQVDNKTIGIYS-----TDLYTEAAIN 203
           S +G   G  DY++H+    ++   Q L     +    + I I        DL  +    
Sbjct: 120 SFFGLLDGAGDYWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVICFQDLELDRLDK 179

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDE 262
           +  EH+  +P+FLY A       N + P Q  DE + K+  D  +  RR Y  + S LD 
Sbjct: 180 IFTEHDNKQPLFLYFA-----PQNPHTPSQPTDEFLNKYSADDFEDVRRRYLALTSGLDS 234

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            VG V+  L  + M++N+ ++F++DNGA        +G N P RG K + ++GG +  + 
Sbjct: 235 MVGKVVELLNDNHMMDNTYLIFLSDNGA-----DPAEGLNAPFRGGKGSLFEGGTKSSSF 289

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           I+SP LK+T+  +  L HI+DWLPTL   AG  + D
Sbjct: 290 IYSPLLKKTEYENDGLMHITDWLPTLVKLAGGHVED 325


>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
          Length = 873

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 124/303 (40%), Positives = 180/303 (59%), Gaps = 13/303 (4%)

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y IH G+QH VI   +P  LPL    LPQ L++AGY+TH +GKWHLGF++    PT RGF
Sbjct: 421 YQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCLPTQRGF 480

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIAE 207
           DS +G   G  DYY H  K       G D+    +    +  G+YST ++T  A++++A 
Sbjct: 481 DSFFGSLLGSGDYYSHY-KCEVPGMCGYDLYEGEEAAWEQDRGLYSTVMFTRKAVSILAH 539

Query: 208 HN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           HN ++ P+FLYLA+ AVH+     P Q P   + ++  IS+  RR YA MVS LDE++ N
Sbjct: 540 HNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMVSCLDEAIHN 594

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           +  AL+++G  +N+++++ +DNG          GSN PLRG K+T W+GG+R V  + SP
Sbjct: 595 LTLALKRYGYYDNTVIVYSSDNGGQPLA----GGSNWPLRGSKATYWEGGIRAVGFVHSP 650

Query: 327 WLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            L +       L HI+DW PTL     G    D +LDG + W+ +++G  + R +ILHNI
Sbjct: 651 LLMKKGTKCRSLIHITDWFPTLVTLGEGTLDEDLNLDGYDVWEAISEGRPSPRLDILHNI 710

Query: 386 DNV 388
           D +
Sbjct: 711 DPI 713



 Score = 45.8 bits (107), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 41/77 (53%), Gaps = 4/77 (5%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++    ++K +  +LA Y  T VP      D R++P     +W PW
Sbjct: 784 LFNITADPYERVDLSQHYPHIVKTMLLRLAQYNRTAVPVRYPAKDLRSNPQYNGGVWGPW 843

Query: 565 YD----ELDKQKAIETM 577
           Y     E D+++   T+
Sbjct: 844 YKDERGEPDEEEQYNTL 860


>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 706

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 139/418 (33%), Positives = 227/418 (54%), Gaps = 64/418 (15%)

Query: 29  KPHIIIILADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KP++  I+ DD+GWND+ +     Q  TP++D LA  G+ +  +Y  ++CTP+R++LMTG
Sbjct: 130 KPNVFFIMVDDMGWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTG 189

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y +  G+Q+ VI  G PWGLPLTEK+ P+Y+K+AGY TH IGKWH+G +   + P+ RG
Sbjct: 190 RYVMRYGLQYSVIQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRG 249

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEP--------------YQGLDMRHNMQVDNKTI--- 189
           FD++ GY    + Y+ H S  A +                Y  +D     Q+ + TI   
Sbjct: 250 FDTYLGYLNDEEMYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGTISDG 309

Query: 190 ---------------------GIYSTDLYTEAAINVIAEHN--KSKPMFLYLAHLAVHAG 226
                                G YST ++   A+ +I E      +P+++YLAH AVH  
Sbjct: 310 DDDETMLSATSSYSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQAVHDP 369

Query: 227 NTYEPFQ--APDE-EVAKFLDISDPE-----RRTYAGMVSRLDESVGNVIAALRKHGMLE 278
               P +   PD+ ++ + ++    E     R  +A ++  LD ++G ++  L  +G +E
Sbjct: 370 LGLPPAEHFTPDQLDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLETNGWME 429

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE- 337
           NSI++  +DNG    G  S  GSN+PLRG+K + W+GG +  + ++S      ++  +E 
Sbjct: 430 NSIIIVASDNG----GCPSCGGSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEERWGTEY 485

Query: 338 --LFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVLTKGAK-----TKRSEILHNID 386
             L H++DWLPT+  AA I+++  +   DGV+QW ++T  A      + R E+++N D
Sbjct: 486 DGLMHVTDWLPTIAGAAEIDLDGGAGPFDGVDQWPLITSVAAEAEFGSARDEMMYNFD 543


>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 534

 Score =  231 bits (589), Expect = 8e-58,   Method: Compositional matrix adjust.
 Identities = 142/373 (38%), Positives = 204/373 (54%), Gaps = 33/373 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+I+ +LADDLGW+DV +H  S I TPNID LA  G+ L  +YV + C PSR  LMTG+
Sbjct: 68  RPNIVFVLADDLGWHDVGYH-DSVIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGR 126

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL-GFFREVYTPTFRG 147
           Y I  G+ +G + +     + + E  L   L EAGY T  +GKWHL GF      PT RG
Sbjct: 127 YRIRTGL-YGDVCD----FMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRG 181

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           F +  GY  G Q+Y+ H       PY       ++ V  K  G YST ++ + A  +I +
Sbjct: 182 FQTFLGYHGGSQNYFTHRRGGPHAPYDFWANDTSIAV--KYEGQYSTMIFADEAQRIIRQ 239

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVGN 266
           HN  +P+FLYL+  AVH      P   P     ++   I D +RR YA M S +DE++GN
Sbjct: 240 HNTKQPLFLYLSFQAVHV-----PLLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIGN 294

Query: 267 VIAALRKHGMLENSIVLFMA-----------DNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           V A L++ G+  N++ +F +           DNGA      ++ GSN PLRG K   ++G
Sbjct: 295 VTATLKEKGLWNNTVFIFSSARLNFKHDFCLDNGATP----TDGGSNWPLRGRKGLWYEG 350

Query: 316 GMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
           G+R V  + S  L    T   + EL H++DW PTL   A  +++    LDG++QW+ +++
Sbjct: 351 GIRAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLAKADLHGLKPLDGIDQWETISR 410

Query: 373 GAKTKRSEILHNI 385
           G  T R E+L N 
Sbjct: 411 GNVTNRKEVLLNF 423


>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 263

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 114/248 (45%), Positives = 162/248 (65%), Gaps = 6/248 (2%)

Query: 174 QGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
            G D+R N+ V     G Y+T+++T  A  +I +HNK  P+FLYLAHLAVH+GN   P +
Sbjct: 1   MGYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLE 60

Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           AP++ +++F  I D  R+T+AGMVS LDESVGN+   L+  GM  N+I++F  DNG P+ 
Sbjct: 61  APEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAA 120

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           G ++N  SN PLRG+K+T W+GG+RG   + SP L++  +VS+++ H+ DWLPT+   AG
Sbjct: 121 GFNNNHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIYHIAG 180

Query: 354 IEIND--TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
             ++   T+LDG N WD L++G+ + R+EILHNID    P    AALRV   K + G   
Sbjct: 181 GNVSSLPTNLDGFNVWDTLSRGSVSPRTEILHNID----PILQIAALRVGKYKIIMGECY 236

Query: 412 NGQSDEWY 419
            G  D WY
Sbjct: 237 RGNWDGWY 244


>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
 gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
          Length = 260

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 122/253 (48%), Positives = 160/253 (63%), Gaps = 12/253 (4%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           T+A K PHI+ IL DDLGW+DV +H  S  + TPNID LA  G+ L  +Y Q +CTPSR 
Sbjct: 19  TSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRG 78

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTGKYPIH+GMQH VI    PWG+P     +PQ L+  GY T  IGKWHLGFF   YT
Sbjct: 79  ALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYT 138

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P  RGFDS  G++ G QD++ HS K  F     LD R + +  N+  G +STD++T+ AI
Sbjct: 139 PLRRGFDSFLGFFAGEQDHWRHS-KMGF-----LDFRRDEEPANEYGGQHSTDVFTQEAI 192

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           N+   HN S+P+FL L++ AVH      P QA   +V K   +SD +R+ Y GM+   D 
Sbjct: 193 NIAMRHNASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADW 247

Query: 263 SVGNVIAALRKHG 275
           S+G +I   +++G
Sbjct: 248 SIGRLIDVYKRNG 260


>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 588

 Score =  230 bits (586), Expect = 2e-57,   Method: Compositional matrix adjust.
 Identities = 182/625 (29%), Positives = 286/625 (45%), Gaps = 144/625 (23%)

Query: 36  LADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIG 94
           + DD+GW+D+ +     +  TP +D LA  G+ +  +Y    CTP+R++LMTG+Y +  G
Sbjct: 1   MIDDMGWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYG 60

Query: 95  MQHGVILE-GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG 153
           MQ+ V +  GEPWG+PL+EK+LP+Y KEAGY TH +GKWHLG     + P+ RGFD++ G
Sbjct: 61  MQYNVAINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMG 120

Query: 154 YWQGLQDYYDHS-----------CKATF---------------EPYQGLDMRHNMQV--- 184
           Y  G + Y+ H            C   F               +   G D R + +    
Sbjct: 121 YVGGFEAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEE 180

Query: 185 ---DNKTIGIYS-------------------------TDLYTEAAINVIAEHN--KSKPM 214
              D    G++S                         T ++ E AI ++   +    +P+
Sbjct: 181 NRDDGDGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPI 240

Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-----ISDPE----RRTYAGMVSRLDESVG 265
           +++LAH AVH+     P  +  E+    LD     I D +    RR    ++  LD+SVG
Sbjct: 241 YMHLAHQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQTRRHLTEVLMFLDKSVG 300

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           ++I  L   G +ENSIV+  +DNG  S    +  GSN+PLRG K++ W+GG +  A ++S
Sbjct: 301 HLIHYLEDEGWMENSIVVVASDNGGDS----NEGGSNYPLRGKKASYWEGGSKVPAFVYS 356

Query: 326 PWLKQTQKVSSE---LFHISDWLPTLCAAAGIEINDTS--LDGVNQWDVLTKGAKT---- 376
                  +  SE   L H++DWLPTL A AG+E++ ++  LDGVNQW+ +   A      
Sbjct: 357 TSHIPEARRGSEYDGLMHVTDWLPTLAAGAGLEMSGSAGPLDGVNQWEHIVGSAAAVDGG 416

Query: 377 -KRSEILHNID------NVDNPQKYYAAL---RVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
             R+E+L+N D      N ++    + +L   RV D  Y  G   +G             
Sbjct: 417 RPRTELLYNYDPYRLLPNSNHCYCCWCSLCHYRVYDTGYAQGAFRSG------------- 463

Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
            KY     +      TF+                          +L ++ + +L +    
Sbjct: 464 -KYKMMFNVSCHGWYTFDG------------------------DMLEEDHLTDLSKVCGG 498

Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK-----LAIYKSTMV 541
            CN  + G   +      LF++  DP E NNL +   D+ +++ D+        Y  T+ 
Sbjct: 499 TCN--DGGTCSDEATSDYLFDLDQDPRESNNLIDELPDVAQEMRDRWYEVTFREYTDTVF 556

Query: 542 PPGN-KPFDKRADPARWNNIW-VPW 564
            P + K +D  A+    NN W +PW
Sbjct: 557 EPLDLKAYDVWAE----NNWWMIPW 577


>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1818

 Score =  229 bits (583), Expect = 5e-57,   Method: Compositional matrix adjust.
 Identities = 145/390 (37%), Positives = 211/390 (54%), Gaps = 41/390 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           K PH++  LADD+G+ DV ++G    ++++ TP IDALA  G+ L+++Y Q  CTPSR+A
Sbjct: 42  KPPHVVFFLADDVGFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAA 101

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           L++GKYP   G  HGV+     WGLPL   LLP+ L  A Y +HA+GKW +G       P
Sbjct: 102 LLSGKYPATTGTYHGVLNPQSTWGLPLEHALLPEALPGA-YRSHAVGKWDVGHSSAKRLP 160

Query: 144 TFRGFDSHYG----YWQGLQDYYDHSC---KATFEPYQGLDMRHNMQVDNKTI------- 189
             RGFDS  G    ++  + DY  H                     QV   T+       
Sbjct: 161 EARGFDSFLGFYLCFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLG 220

Query: 190 -------GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK- 241
                  G+Y+TD++ + A+++I       P+FLY+A  AVH      P +A + +VA  
Sbjct: 221 GQRRDYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAF 275

Query: 242 ---FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
              FL  SD +R T+A +   +D ++G V+ AL+  G  EN++V+F +DNGA        
Sbjct: 276 EATFLPESDSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGA--IPGQMG 333

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI 356
            GSN PLRG K + W+GG+R  A + SP L    +  V   LFH++DWLPT+ AAAG  +
Sbjct: 334 GGSNWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTIAAAAGAAV 393

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           +  S DGV+ +  L       RSE+L ++D
Sbjct: 394 D--SGDGVSHYRALFADGPAPRSELLVHVD 421



 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 136/396 (34%), Positives = 218/396 (55%), Gaps = 42/396 (10%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQALCT 78
           +  A   PH+++ LADD G++DV ++     ++Q+ TP +D+LA  G+ L ++Y Q  CT
Sbjct: 612 DDAASNPPHLLLFLADDAGFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCT 671

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           PSR+AL++G YP   GM H +I     WGL L  +L+PQ L  A Y +HA+GKW +G + 
Sbjct: 672 PSRAALLSGMYPASSGMYHKMITAQSNWGLDLDLELIPQRLP-AAYRSHAVGKWDVGHYT 730

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH--------------NMQV 184
             + P FRGF S  G++  + DYY H    T +  + +++                +   
Sbjct: 731 WSHVPQFRGFRSFLGFYSPIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNF 790

Query: 185 DNKTI----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV- 239
           D   +    G YSTD++   A ++I +     P+FLY+A  AVHA     P +A DE+V 
Sbjct: 791 DGDPLPLADGTYSTDVFAARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEA-DEDVI 844

Query: 240 ----AKFLDISDPERRTYAGMVSRLDESVGNVIAALRK-HGMLENSIVLFMADNGAPSFG 294
               + FL+ +   R ++A +   +D ++  V+  L+   G  ENS+++F +DNG     
Sbjct: 845 ESFASSFLEEARDARASFAAVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGGLPGQ 904

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAA 352
           I    GSN PLRG K +P++GG+R  A + SP L   ++  +   LFH++DWLPTL   A
Sbjct: 905 IAG--GSNWPLRGQKFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLA 962

Query: 353 GIEINDTSLDGVNQWDVL--TKGAKTKRSEILHNID 386
           G      ++DGV+Q++ +     A+  R+E+L ++D
Sbjct: 963 G-GAPPRNVDGVDQYEAIFGEHPAEHPRTELLVHVD 997


>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
          Length = 598

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 115/264 (43%), Positives = 164/264 (62%), Gaps = 9/264 (3%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYP 90
           HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSRS L+TG+Y 
Sbjct: 215 HIIFILTDDQGYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQ 273

Query: 91  IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           IH G+QH +I   +P  LPL +  LPQ L+E GY+TH +GKWHLGF+++   PT RGFD+
Sbjct: 274 IHTGLQHSIIRPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDT 333

Query: 151 HYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             G   G  DYY + +C        G D+     V  +  G YST LY + A  ++A HN
Sbjct: 334 FLGSLTGNVDYYTYDNCDGP--GVCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHN 391

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
             +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N+  
Sbjct: 392 PRQPLFLYVAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITW 446

Query: 270 ALRKHGMLENSIVLFMADNGAPSF 293
           AL+++G  +NS+++F  DN   +F
Sbjct: 447 ALKRYGYYDNSVIIFSTDNXLANF 470



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P      W PW
Sbjct: 490 LFNISADPYEREDLAAQRPDIVRSLLARLAYYNRTAIPVRYPAENPRAHPDFNGGAWGPW 549


>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
          Length = 252

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 121/257 (47%), Positives = 156/257 (60%), Gaps = 14/257 (5%)

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LD 177
           AGYATH +GKWHLG +R+   PT RGFD+++GY  G +DYY H   A  E   G    LD
Sbjct: 1   AGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIESLNGTRCALD 60

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +R   +   +   IYST+++T+ A  VIA H   KP+FLYLA  +VH     +P Q P+E
Sbjct: 61  LRDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEE 115

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
            +  +  I D  RR YAGMVS +DE+VGNV  AL+ HG+  N++ +F  DNG    G   
Sbjct: 116 YMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNG----GQTR 171

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
           + G+N PLRG K T W+GG+RG   + SP LKQ    S EL HISDWLPTL   AG   N
Sbjct: 172 SGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTN 231

Query: 358 DTS-LDGVNQWDVLTKG 373
            T  LDG N W  +++G
Sbjct: 232 GTKPLDGFNMWKTISEG 248


>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
          Length = 523

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 121/268 (45%), Positives = 164/268 (61%), Gaps = 14/268 (5%)

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMR 179
           YATH +GKWHLG +R+   PT RGFD+++GY  G +DYY H   A  E   G    LD+R
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTHEACAPIECLNGTRCALDLR 187

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
              +   +   IYST+++T+ A  +IA H   KP+FLYLA  +VH     +P Q P+E +
Sbjct: 188 DGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYM 242

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
             +  I D  RR YAGMVS LDE+VGNV  AL+  G+  N++++F  DNG    G   + 
Sbjct: 243 EPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNG----GQTRSG 298

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT 359
           G+N PLRG K T W+GG+RG   + SP LKQ    S EL HI+DWLPTL   AG   + T
Sbjct: 299 GNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGT 358

Query: 360 S-LDGVNQWDVLTKGAKTKRSEILHNID 386
             LDG + W+ +++G+ + R E+L NID
Sbjct: 359 KPLDGFDVWETISEGSPSPRVELLLNID 386



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 19/60 (31%), Positives = 31/60 (51%), Gaps = 1/60 (1%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LF+I  DP E+++++     +++ L  +L  Y    VP    P D R DP +   +W PW
Sbjct: 464 LFDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVPSYFPPLDPRCDP-KGTGVWSPW 522


>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
          Length = 484

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 118/268 (44%), Positives = 168/268 (62%), Gaps = 10/268 (3%)

Query: 23  NTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           ++ AP +P HII IL DD G++DV +HGS  I TP +D LA  G+ L  +Y+Q +CTPSR
Sbjct: 41  SSAAPSQPPHIIFILTDDQGYHDVGYHGSD-IETPTLDRLAAEGVKLENYYIQPICTPSR 99

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           S L+TG+Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGFFR+  
Sbjct: 100 SQLLTGRYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKEC 159

Query: 142 TPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
            P+ RGFD+  G   G  DYY + +C        G D+     V     G YST LY + 
Sbjct: 160 LPSCRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGETVAWGLSGQYSTMLYAQR 217

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A +++A H+   P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +
Sbjct: 218 ASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCM 272

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADN 288
           DE+V N+  AL+++G   NS+++F +DN
Sbjct: 273 DEAVRNITWALKRYGFYNNSVIIFSSDN 300



 Score = 43.5 bits (101), Expect = 0.33,   Method: Compositional matrix adjust.
 Identities = 22/60 (36%), Positives = 32/60 (53%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNI+ DP E+ +LA  + D+++ L  +LA Y  T +P      + RA P      W PW
Sbjct: 378 LFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAANPRAHPDFNGGAWGPW 437


>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 781

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 140/429 (32%), Positives = 220/429 (51%), Gaps = 72/429 (16%)

Query: 18  NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGS--SQIPTPNIDALAYNGLILNQHYVQA 75
            D    + +  KP++   L DD+G+ D+ +  +  S+I TPN+DALA  G+ L+ +Y   
Sbjct: 205 GDGGPGSVSTSKPNVFFFLIDDMGFGDMGYQSTDLSEI-TPNLDALAAGGVKLSNYYTMT 263

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           LCTP+R+++MTG+YP+  GMQ+ VI+ G PWGLP +EK+LP+Y+ EAGY +H +GKWHLG
Sbjct: 264 LCTPARASIMTGRYPVRYGMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLG 323

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFE----------------------- 171
            +R+   P+ RGF +  GY  G++ YY H + +A+ +                       
Sbjct: 324 SYRDESLPSQRGFKTFLGYLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNH 383

Query: 172 ------------PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN--KSKPMFLY 217
                       P  G  M +    D    G YSTD +   A  ++         P+F+Y
Sbjct: 384 DENVGGPCTDGGPRWGDVMENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMY 443

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFL----DISDPERRT-YAGMVSRLDESVGNVIAALR 272
           +AH +VH+     P++    E    L    D ++ E+RT +AG++  LD+ +G  +  L 
Sbjct: 444 IAHQSVHSPTGPAPYEEFSPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLE 503

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           + G LENS+++  +DNGA       + GSN+PLRG K + ++GG R         L + Q
Sbjct: 504 QEGWLENSVIVVSSDNGA----CPDDGGSNYPLRGSKQSVFEGGSRTGYQPSPALLGKRQ 559

Query: 333 ----KVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL----TKGA------KTKR 378
                 SS  +        LC    +  +  +LDGV+QW+ L     +GA      K  R
Sbjct: 560 PAGLNTSSPRY--------LCNPCNLPDSAGALDGVDQWECLKSNGVRGAGSGNTSKPPR 611

Query: 379 SEILHNIDN 387
           +E+L+N D+
Sbjct: 612 TELLYNWDS 620


>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1517

 Score =  225 bits (574), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 138/365 (37%), Positives = 195/365 (53%), Gaps = 44/365 (12%)

Query: 18  NDAFLNTTAPKKPHIIIILADDLGWNDVSF-HGSSQIPTPNIDALAYNGLILNQHYVQAL 76
           +D  L + A   PH++  L DDLG++DV + +    + TP+IDALA  G+ L+++Y    
Sbjct: 59  DDRDLGSDAASYPHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFS 118

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
           CTP+R AL+TG  P  +G+QHG +   +PWGLP    +LPQ+L + GY +H +GKWHLG 
Sbjct: 119 CTPARGALLTGLSPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGH 178

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH----SCKATFEP-YQG---------------- 175
           F     PT RGFDS +G   G Q Y  H     CK   +  Y+G                
Sbjct: 179 FSAERLPTARGFDSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEH 238

Query: 176 ---LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-----SKPMFLYLAHLAVHAGN 227
               D+R N        G YST L+   A  +I  H+K      KP+FL L+  AVHA  
Sbjct: 239 GCYFDLRENNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-- 296

Query: 228 TYEPFQAPD---EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
              P  AP+   E     L++++  RR +A  +  +D++VGNV+AAL +  M  NSI++F
Sbjct: 297 ---PVWAPEDTYETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIF 353

Query: 285 MADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
            +DNGA P  G     GSN PLRG K   ++GG+R  A + +P            FH++D
Sbjct: 354 ASDNGANPEHG-----GSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTD 408

Query: 344 WLPTL 348
           W+PTL
Sbjct: 409 WVPTL 413


>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
          Length = 898

 Score =  224 bits (572), Expect = 8e-56,   Method: Compositional matrix adjust.
 Identities = 117/298 (39%), Positives = 179/298 (60%), Gaps = 25/298 (8%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +GKWH+GF+++ Y PT RGFDS++GY  G +DYY H     +    G+D+R N    N +
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTH--HRCYGKMCGMDLRENAVPANYS 583

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
            GIYST L+ + A +++ +H+  +P+FLYL   +VH+     P Q P++ +  +L I D 
Sbjct: 584 -GIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHS-----PLQVPEKYIQPYLHIKDK 637

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           +RR YAGMV+ +DE++GN+    ++ G+  N++++F  DNG    G     G+N+PLRG 
Sbjct: 638 KRRKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNG----GQILEGGNNYPLRGW 693

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQW 367
           K++ W+GGM GV  +    LK+   VS  L H++DW PTL + AG  +N T  LDGV+QW
Sbjct: 694 KASLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGGNLNGTKPLDGVDQW 753

Query: 368 DVLTKGAKT-KRSEILHNIDNVDNP-----------QKYYAALRVDDLKYVAGTDNNG 413
             +++G+   +RS ILHNID +  P            +  AA+R  D K + G+  NG
Sbjct: 754 KTISEGSTVNQRSSILHNIDPLFQPVGHRLPKSPFDNRIRAAVRFGDWKLITGSPGNG 811



 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 36/77 (46%), Positives = 55/77 (71%), Gaps = 1/77 (1%)

Query: 15 LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
          +LF + F       +PHI+ +LADD G+ND+ +HGS +I TPN+D LA +G+ L  +YVQ
Sbjct: 12 VLFFNTFPVQGKSSQPHILFVLADDFGYNDIGYHGS-EIKTPNLDKLAGDGIRLENYYVQ 70

Query: 75 ALCTPSRSALMTGKYPI 91
           +CTP+RS L++G+Y +
Sbjct: 71 PICTPTRSQLLSGRYQV 87



 Score = 46.2 bits (108), Expect = 0.049,   Method: Compositional matrix adjust.
 Identities = 23/60 (38%), Positives = 31/60 (51%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E  ++A     ++ Q+ D LA Y  T VP      D++A+P      W PW
Sbjct: 838 LFNITADPYEHKDVAHINPKVVMQMLDLLAQYNQTAVPCRYPKMDEKANPKYLGGYWGPW 897


>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
          Length = 482

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 141/387 (36%), Positives = 199/387 (51%), Gaps = 32/387 (8%)

Query: 17  FNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA 75
           F  AFL    +  KP++I+ILADDLG+ND+ ++  + I  PN+ +LA NG IL+  YVQ 
Sbjct: 69  FFAAFLVAKGSSSKPNVILILADDLGFNDMPWNNPAII-APNLHSLAKNGTILSNFYVQP 127

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +CTPSRSALMT +YPI +G+Q  VI   +P  LPL E  +    + AGY TH +GKWHLG
Sbjct: 128 VCTPSRSALMTSRYPIRLGLQTDVITAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLG 187

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ-----GLDMRHNMQVDNKTIG 190
            +     P  RGFD+  GY  G +DYY  +      P Q     G D   N     K  G
Sbjct: 188 HYCPQCLPNNRGFDTFRGYLTGAEDYYKKTFCIPLVPNQRPAACGFDFYDNENRMPKANG 247

Query: 191 IYSTDL-------------YTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPD 236
            YST               + +A+  VI  H  SK P FLYL   +VH      P   P 
Sbjct: 248 TYSTYQVLIYLFIHTIILKFADASREVIKSHEGSKTPFFLYLPFQSVHY-----PVMVPK 302

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
                + +  +  R+ Y GMV+ +DE+VG+++  L+  G  EN++  +  DNG    G+ 
Sbjct: 303 NYSDLYPNEKNHTRKEYMGMVTAMDEAVGHIVDELKAAGAYENTLFFYATDNG----GLV 358

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE- 355
              G N P RG K+T W GG+R  A I  P +       + + HI D   T+  A G + 
Sbjct: 359 GAGGRNAPFRGQKATLWQGGLRVPAFISGPGVPSASNYKN-MMHIVDVQMTVLDAIGHQR 417

Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEIL 382
             +  +DGV+ W  +T G + +  ++L
Sbjct: 418 TGEKHVDGVSHWSAITAGDEKETMDLL 444


>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
          Length = 530

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 137/369 (37%), Positives = 194/369 (52%), Gaps = 24/369 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPT--PNIDALAYNGLILNQHYVQALCTPSRSALM 85
           K+PHI  +L DDLG+ND     +S +    P++ ALA   + +  +Y   LCTP+R A M
Sbjct: 18  KQPHIAFMLVDDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTRGAFM 77

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+ P  +G+ HGVI   + +GLP  E  +   LK AGYAT  +GKWHLG F +   PT 
Sbjct: 78  TGRLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDASEPTR 137

Query: 146 RGFDSHYGYWQGLQDYYDHSCKAT--FEPYQGLDMRHNMQVDN--KTIGIYSTDLYTEAA 201
           RGFD+ YGY  G +D++    +    F+     D      VD+  K  G +++ L+ + A
Sbjct: 138 RGFDASYGYQNGEEDHFTRILQGLGDFQASSLADGASYYTVDDLLKINGTHNSFLFVDRA 197

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP----DEEVAKFLDIS----DPERRTY 253
           + V+  H+ S P+FLYLA    HA     P QAP    D+     +  S    D  RR +
Sbjct: 198 VAVVGAHDASIPLFLYLALQDTHA-----PIQAPAGYADDAACASIGSSRHYPDGARREF 252

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
            GMV   DES+ NV AA+  +    NS+++   DNG    GI    G+N PLRG K  PW
Sbjct: 253 CGMVRAADESIANVTAAIDANFGKRNSVIVIAGDNG----GIVKGGGNNWPLRGQKGQPW 308

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
           +GG+R  A I +P           + H+ D   TL A AG   +   +DG++ W  L  G
Sbjct: 309 EGGVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAG-GASAYEIDGLDVWAALRDG 367

Query: 374 AKTKRSEIL 382
           + + RSE+L
Sbjct: 368 SPSPRSEVL 376


>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 484

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 136/378 (35%), Positives = 208/378 (55%), Gaps = 33/378 (8%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           ++  +LF+      +  ++P+I++IL DDLG+NDV F+GS+ I TPN+D LA +G I   
Sbjct: 12  VSAVVLFSSCINAQSQTEQPNILLILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTIFTS 71

Query: 71  HYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG--LPLTEKLLPQYLKEAGYATH 127
            YV    C PSR+AL+TG+YP  +G Q  +   G   G  + + EK +   +++AGY T 
Sbjct: 72  AYVAHPFCGPSRAALLTGRYPHTLGSQFNLPANGASTGKGISVEEKFMGVPMQKAGYYTG 131

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LD 177
           AIGKWHLG   E Y P  RGF+  YG+  G   Y+    K  ++  +           L 
Sbjct: 132 AIGKWHLGETAE-YHPNKRGFNDFYGFLGGGHKYFPEEYKLQYKHQKEMGTKNINDYVLP 190

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           + HN  +  +    Y TD+ +   I    E H+K KP FLYLA+ A H      P +A +
Sbjct: 191 LEHNGAIVEEND--YLTDVLSREGIRFTKEAHDKKKPFFLYLAYNAPHV-----PLEAKE 243

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           +++ KF DI D +RRTYA MV  +D  VG ++A+L+K G  +N++++F++DNG      H
Sbjct: 244 KDLEKFKDIEDIDRRTYAAMVYAVDRGVGEIVASLKKTGQFDNTLIIFLSDNGG-----H 298

Query: 297 SNKGS-NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAG 353
           +  G+ N+PL G K   W+GG R    ++  W K+ +K     + +S  D  PT+     
Sbjct: 299 TGHGANNYPLTGRKGDTWEGGFR--VPMFFHWPKKIKKGQKFDYPVSALDLYPTIAHIGQ 356

Query: 354 IEI-NDTSLDGVNQWDVL 370
            +I +D  LDG N W+ +
Sbjct: 357 AKIGSDKILDGKNIWEAI 374


>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 402

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 152/392 (38%), Positives = 204/392 (52%), Gaps = 25/392 (6%)

Query: 192 YSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNT----YEPFQAPDEEVAKFLDIS 246
           Y+TD++TE A  +I +   ++  MFL ++HLAVH+GN+     E       + A F  I 
Sbjct: 22  YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEA-FGYIE 80

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS---FGIHSNKGSNH 303
           +  RR YAGMV+ LD+SVG +I +L +H MLENSIV+FM+DNGAP+      H N GSN 
Sbjct: 81  NENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFMSDNGAPTDDPLWGHDNFGSNW 140

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LD 362
           PLRG K++  +GG+RGVAAIWSPWLK+   VS+ LFHI+DWLPTL  AAG    D   +D
Sbjct: 141 PLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHITDWLPTLYTAAGGNFEDLGKID 200

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG--TDNNGQSDEWYG 420
           GVNQW  LT  +K  RS+IL NID     +    AL  +  K V    T N   + ++ G
Sbjct: 201 GVNQWKSLTDTSKQVRSKILINIDETRGEE----ALIFNQWKVVKSNRTSNMVFNLKYAG 256

Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
           D  N   KY+   V+ S  G   + +    Q       D   N+S A         + ++
Sbjct: 257 DPGNAGPKYNMSGVVASTVGKRLSKINCLTQ-------DSCMNASTARDVFYVSRSLAKV 309

Query: 481 R-EFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST 539
             +    R +  N           CLF+I  DPCE  N+A+     L      L  +K  
Sbjct: 310 DGQTCPERVSNPNPDQQYECFDGYCLFDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKE 369

Query: 540 MVPPGNKP-FDKRADPARWNNIWVPWYDELDK 570
           M      P  D  ADP R+   W PW +  DK
Sbjct: 370 MTMQAPDPIIDPDADPRRFAGYWEPWLEPFDK 401


>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
          Length = 467

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 143/356 (40%), Positives = 192/356 (53%), Gaps = 31/356 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I+ I+ADDLGW DV +HGS  IPTPNIDALA  G+ +N+ Y   + +P+R+ LMTG
Sbjct: 30  ERPNILFIVADDLGWGDVGYHGSV-IPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTG 88

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI-GKWHLGFFREVYTPTFR 146
           +YP   G++  VI     +GL   E+ L   L   GYA  AI GKWHLG  R  Y P  R
Sbjct: 89  RYPSRFGIRKTVIPPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNR 148

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GF   YG   G  DY+ H  +   +        HN     +  G YSTDL  + A+  I 
Sbjct: 149 GFTHFYGCLNGALDYFTHEREGELD-------WHNDWESCRDEG-YSTDLIADEAVRCIG 200

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA------KFLDISDPER--RTYAGMVS 258
            +    P FLY+A  A H      PFQAP++E+A      KF  ++  E+   TY  MV+
Sbjct: 201 GYASEGPFFLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVT 255

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG--SNHPLRGMKSTPWDGG 316
           R+D+ VG ++  L + G++EN++VLFM+DNG    GI +  G  S  PLRG K   WDGG
Sbjct: 256 RMDKGVGRILEKLERSGLVENTLVLFMSDNG----GIPNLPGGSSCAPLRGHKGREWDGG 311

Query: 317 MRGVAAIWSPWLKQTQKVSSE-LFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
           +R V A+     +  + V SE L    D  PT+ A  G        DGVN    LT
Sbjct: 312 LR-VPAVLCRKGRFAEGVRSEQLAAFVDMAPTIAALVGAGEPPRPYDGVNLLPALT 366


>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 144/413 (34%), Positives = 221/413 (53%), Gaps = 32/413 (7%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L C L+           K P+I+ +L DDLG+ DV F   + I +PN D LA  GL+LN+
Sbjct: 4   LQCLLVCVLCSCFVATAKLPNIVFVLIDDLGFADVGFRNPA-ISSPNFDQLAKTGLVLNR 62

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           HYV   C+PSR++L+TG++P H   Q   +++    G  +   +LP  LK A Y+TH +G
Sbjct: 63  HYVFKYCSPSRASLLTGRWPHH-AHQWNPLMDST-IGTNINMTMLPAKLKAANYSTHMVG 120

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLGFF   Y P  RGFD+  G++   +D+ +     +      +D   N   D +  G
Sbjct: 121 KWHLGFFDPRYLPINRGFDTSTGFFGCCEDHMNEKSGCS------IDYWKNNAPDPRN-G 173

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            Y    Y +   +V++ +N   P+FLYL    VHA     PFQAP E +  +   S  E+
Sbjct: 174 TYDAYNYRDDLTDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANSTCEK 228

Query: 251 R-TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGM 308
           R T   M+S  D   G+++  L+K G+ +N+I++  ADN GAP        GSN+PL+G 
Sbjct: 229 RHTLQAMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPC------AGSNYPLKGC 282

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----L 361
           K+T ++GG+R VA +    L ++++  S     HI+DW PT C  AG++ +D+      +
Sbjct: 283 KTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAGVDPDDSGTGKFPV 342

Query: 362 DGVNQWDVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
           DG++ W ++T +  KT+  EI+    N D  +    A+ V + K V G    G
Sbjct: 343 DGLDVWPIITGENEKTQHEEIVLGY-NFDFRRPGQGAIIVGNYKLVVGLQEIG 394


>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
          Length = 426

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 146/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
           +GKWHLGF+R+   PT RGFD+  G   G  DYY + +C        G D+     V   
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G YST LY + A +++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +
Sbjct: 60  LSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR YA MV+ +DE+V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
            K T W+GG+RG+  + SP LK+ Q+ S  L HI+DW PTL   AG   +    LDG + 
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
           W  +++G  + R+EILHNID + N  ++               AA+RV + K + G    
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286

Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
              D  YGD       + P + L +  G  +N                            
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309

Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
             E++  +R+                      LFNI+ DP E+ +LA  + D+++ L  +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348

Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LA Y  T +P      + RA P      W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380


>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
          Length = 426

 Score =  219 bits (557), Expect = 4e-54,   Method: Compositional matrix adjust.
 Identities = 146/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
           +GKWHLGF+R+   PT RGFD+  G   G  DYY + +C        G D+     V   
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G YST LY + A +++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +
Sbjct: 60  LSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR YA MV+ +DE+V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
            K T W+GG+RG+  + SP LK+ Q+ S  L HI+DW PTL   AG   +    LDG + 
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
           W  +++G  + R+EILHNID + N  ++               AA+RV + K + G    
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286

Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
              D  YGD       + P + L +  G  +N                            
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309

Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
             E++  +R+                      LFNI+ DP E+ +LA  + D+++ L  +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348

Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LA Y  T +P      + RA P      W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380


>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1605

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 165/567 (29%), Positives = 254/567 (44%), Gaps = 74/567 (13%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCT 78
             L TT   KPH++ +L DD+G NDV +     +  TP ID LA  G+ L ++Y   +CT
Sbjct: 82  GLLATTDSAKPHVLFVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCT 141

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           P+R+ALM+G YP+ IGMQ   I    PWG+P     + + LK  GY TH +GKW LG  +
Sbjct: 142 PARAALMSGHYPMRIGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQ 201

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---------------YQGLDMRHNMQ 183
             + P  RGFD+ YGY     DYY H   A FE                +      +++Q
Sbjct: 202 HGFLPVNRGFDTWYGYLSDEIDYYSHEYPAPFETVEDGATVMASFTDYVFMERSRPYDLQ 261

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQA-PDEEVAK 241
                 G +S++LYT+    ++   N S+ P+F+Y A    H      P  A  D E A+
Sbjct: 262 YMPDLNGTHSSELYTQRVQQIVKSANASREPLFVYYASQMTHGPLDVPPRSAFSDAEWAR 321

Query: 242 F-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
              DI    RR +A ++  LD+SVG V+  +++  +  +++ +F +DNG    G  +  G
Sbjct: 322 IGADIKGTVRRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNG----GCWAQGG 377

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT--QKVSSELFHISDWLPTLC--------- 349
           +N+PLRG K   W+GG++  A ++S  +  +   +    LFH++DW PTL          
Sbjct: 378 TNYPLRGGKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSNSV 437

Query: 350 ----------AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
                         +E  D  LDG  QW+ L   A   R E+  N+      Q Y     
Sbjct: 438 LASTNDDDDETTTALEKVD-DLDGFFQWEALMGLADPPRVEVPINV------QSYAFTCT 490

Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKT---KLQIKQKH 456
             +   +   D+ G             D   P  +L  +   T  AL+    KL + +  
Sbjct: 491 GPNGTLLQDADSLGAVS----------DCADPWHILVRQGNATRAALRYGDWKLLVNE-- 538

Query: 457 AADP----KANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDP 512
            A+P     A  +DAL   L D       +        DN G    +     LF++ +DP
Sbjct: 539 FAEPWYGVPAEGADALGLDLADGSA----DLGDDEGPMDNCGTAVGTEAHTYLFDLKNDP 594

Query: 513 CEQNNLAESQTDLLKQLEDKLAIYKST 539
            E  +L+  + ++   L+ +L    +T
Sbjct: 595 SETMDLSGEKPEVRDALKARLTELMAT 621


>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 467

 Score =  218 bits (556), Expect = 5e-54,   Method: Compositional matrix adjust.
 Identities = 142/386 (36%), Positives = 213/386 (55%), Gaps = 28/386 (7%)

Query: 1   MTWARKYF-FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           MT+ +K+   +L   L++      TT   KP+II+IL+DD G+ D   +G+ +IPTPNI+
Sbjct: 1   MTYLKKWLSISLLVVLIYTTQAQTTT---KPNIIVILSDDAGYADFECYGNKEIPTPNIN 57

Query: 60  ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQH---GVILEG---EPWGLPLTE 112
            LA  G + ++ YV A +C PSR+ L+TG+Y    G ++   G   EG   E  GL L+E
Sbjct: 58  RLAKEGTLFSKAYVSASVCAPSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSE 117

Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
           K +   +KE GY T A+GKWHLG   + + P  RGFD  YG+ +G +D++    K   E 
Sbjct: 118 KTIGDRMKEEGYRTLAVGKWHLGNDAKFF-PLKRGFDEFYGFQEGHRDFFSFKKKRA-EK 175

Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEP 231
           Y   D   N ++  +    Y TD++T+ A+  I E+ +K +P F+YLA+ AVH      P
Sbjct: 176 YALWD---NDKIIPEEEITYLTDMFTDKALKFIDENADKKQPFFIYLAYNAVHT-----P 227

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
            QA   ++ KF  +    R+TY  M+S +D ++G V+  L+  G+ +N++V+F+ DNG  
Sbjct: 228 LQAKKNDLDKFAQVGSEGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNG-- 285

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
             G  +N   N  LRGMK + W+GG+R G    W+  +     V  +     D LPT  A
Sbjct: 286 --GATTNYSDNGQLRGMKGSVWEGGVRVGYIMRWNGKI-PANVVYDKAVSSLDILPTSLA 342

Query: 351 AAGIEINDTSLDGVNQWDVLTKGAKT 376
           AAG    D  LDGVN    + K + T
Sbjct: 343 AAGNTRKDKHLDGVNLLPYIQKNSGT 368


>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 500

 Score =  218 bits (554), Expect = 9e-54,   Method: Compositional matrix adjust.
 Identities = 133/370 (35%), Positives = 197/370 (53%), Gaps = 27/370 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           +KP+I+ +LADDLG+NDV F+GS+ I TPN+D LA NG+  +  YV    C PSR+A+MT
Sbjct: 38  EKPNILFVLADDLGYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMT 97

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP  IG Q  +  +    G+   E  + Q +K AGY T A+GKWHLG   E Y P   
Sbjct: 98  GRYPHKIGAQFNLPEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGEASE-YHPNKH 156

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPY--QGLD--------MRHNMQVDNKTIGIYSTDL 196
           GFD  YG+  G  +Y+    +A +     QG+         + HN +   +T   Y TD 
Sbjct: 157 GFDEFYGFLGGGHNYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRETE--YITDG 214

Query: 197 YTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
            +  A+N + +   K KP FLYLA+ A H      P QA +E++A F  I D +RRTYAG
Sbjct: 215 LSREAVNFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKKRRTYAG 269

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV  +D  VG ++  L+K+G  +N++++F +DNG    G      +N+PL+  K +  +G
Sbjct: 270 MVYAVDRGVGRIVEQLKKNGQFDNTVIVFTSDNG----GKLGQGANNYPLKEGKGSVQEG 325

Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKG 373
           G R    + W   +K   + S  +  + D  PT     G  +  D  LDG + W  +   
Sbjct: 326 GFRTPMLVHWPKHMKAGSRFSHPVLAL-DLYPTFAGLGGAVLPEDKKLDGKDIWADIQAN 384

Query: 374 AKTKRSEILH 383
               + E ++
Sbjct: 385 TAPHKDEFIY 394


>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
          Length = 426

 Score =  217 bits (553), Expect = 1e-53,   Method: Compositional matrix adjust.
 Identities = 145/452 (32%), Positives = 218/452 (48%), Gaps = 89/452 (19%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNK 187
           +GKWHLGF+R+   PT RGFD+  G + G  DYY + +C        G D+     V   
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCDG--PGVCGFDLHEGENVAWG 59

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G YST  Y + A +++A H+  +P+FLY+A  AVH      P Q+P E + ++  + +
Sbjct: 60  LSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGN 114

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             RR YA MV+ +DE+V N+  AL+++G   NS+++F +DNG  +F    + GSN PLRG
Sbjct: 115 VARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTF----SGGSNWPLRG 170

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQ 366
            K T W+GG+RG+  + SP LK+ Q+ S  L HI+DW PTL   AG   +    LDG + 
Sbjct: 171 RKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAADGLDGYDV 230

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKY--------------YAALRVDDLKYVAGTDNN 412
           W  +++G  + R+EILHNID + N  ++               AA+RV + K + G    
Sbjct: 231 WPAISEGRASPRTEILHNIDPLYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG---- 286

Query: 413 GQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTIL 472
              D  YGD       + P + L +  G  +N                            
Sbjct: 287 ---DPGYGD-------WIPPQTLATFPGSWWNL--------------------------- 309

Query: 473 TDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDK 532
             E++  +R+                      LFNI+ DP E+ +LA  + D+++ L  +
Sbjct: 310 --ERMASVRQAV-------------------WLFNISADPYEREDLAGQRPDVVRTLLAR 348

Query: 533 LAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LA Y  T +P      + RA P      W PW
Sbjct: 349 LAEYNRTAIPVRYPAENPRAHPDFNGGAWGPW 380


>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
          Length = 511

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 129/356 (36%), Positives = 189/356 (53%), Gaps = 60/356 (16%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PHI+ ++ DD GWND  F G+ ++ TP ID L   G+ LNQHYVQ +C+P+R+ALMTG+
Sbjct: 13  RPHILFVVVDDHGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGR 71

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G+Q      G    L   E LLPQY+K AGY T A+GKWHLGF    +TPTFRGF
Sbjct: 72  YPHRYGLQF-PFCGGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGF 130

Query: 149 DSHYGYWQGLQDYYDH-----------------------SC-----KATFEPYQGLDMRH 180
           DS YG++   +DY+ H                       SC     KA FE   G D +H
Sbjct: 131 DSFYGFYSCAEDYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAV-GTDWQH 189

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
                      YST L+    ++++  H+ S+P+FLY A     + +T+ P + P     
Sbjct: 190 -----------YSTTLFAGRIVDIVDGHDPSQPLFLYFA-----SQDTHNPGEVPQCYKD 233

Query: 241 KF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSN 298
           ++   + D  RR  A  ++ +D ++ N+  A +++GM +N ++++ ADNG P +     N
Sbjct: 234 RYNTTVPDQVRRKLAAKLTTVDSALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPN 293

Query: 299 KG--------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL---FHISD 343
           KG        SN+PLRG K   ++GG+R  A +  P LK   ++  +     HI D
Sbjct: 294 KGKQEDSIGSSNYPLRGGKHNAYEGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349


>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
          Length = 507

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 134/392 (34%), Positives = 203/392 (51%), Gaps = 39/392 (9%)

Query: 8   FFALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAY 63
            FA+T  LL     +       KPHI+++L DD GW +V +H    + ++ TPNID L  
Sbjct: 1   MFAVTLALLSMIGLVAGQPVTLKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVK 60

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLP 116
            GL LNQHY    C+PSRS+L++G+ PIH+  Q+       P         +P     + 
Sbjct: 61  QGLELNQHYAYRCCSPSRSSLISGRLPIHVSDQNIAPTNYNPNDPISGFSAIPRNMTGIA 120

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH---SCKATFEPY 173
           + +KEAGYATH +GKW  G     +TP  RGFD+ +GY+    DYY     SC  T    
Sbjct: 121 EKMKEAGYATHQVGKWDAGMATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGV-- 178

Query: 174 QGLDMRHNMQVDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
             +D+ +  Q  +   G     Y   L+ E  ++++++H+ S P+FLY A   VH     
Sbjct: 179 --VDLWNTDQPAHGINGTGPDKYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT---- 232

Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
            P Q PDE + KF  I D +R  Y  MV+ LD+ VG++++AL+K GM +N + +  +DNG
Sbjct: 233 -PLQVPDEYLNKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNG 291

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPT 347
            P +       +N+PL+G K + W GG+R  A +    L  K   + +    H++DW  T
Sbjct: 292 GPVYPF--GGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGT 349

Query: 348 LCAAAGIEINDTS--------LDGVNQWDVLT 371
            CA AG++  D          +D  N W +++
Sbjct: 350 FCAIAGVDPTDEKAAKAKLPPVDSYNMWPLIS 381



 Score = 45.1 bits (105), Expect = 0.097,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 35/66 (53%), Gaps = 3/66 (4%)

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN---KPFDKRADPARWNNI 560
           CL+NI DDP E+ NLA S  D LK++++K   Y +T   P      P   +A   ++   
Sbjct: 442 CLYNIKDDPEERKNLASSMPDKLKEMQEKSKKYLATYFNPDRGHASPLACQAAVNKYGGF 501

Query: 561 WVPWYD 566
           W P+ D
Sbjct: 502 WGPFVD 507


>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 539

 Score =  216 bits (550), Expect = 3e-53,   Method: Compositional matrix adjust.
 Identities = 137/409 (33%), Positives = 204/409 (49%), Gaps = 56/409 (13%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F L+  +    +  +  A  KP+II+++ADDLGW DV FHG+  I TP++D +A  G  L
Sbjct: 7   FCLSVLVFLTSS--SVFAGDKPNIIVMVADDLGWADVGFHGNQIIETPSLDRIAAEGTQL 64

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
           N+ Y   +C+P+R+ALMTG+ PI +G+ +  I+   PW   G+   E  LP+    AGY 
Sbjct: 65  NRFYTTPICSPTRAALMTGRDPIRLGVAYSTIM---PWHNNGIHPEETFLPELFAGAGYQ 121

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----MRHN 181
           T  +GKWHLG  ++ Y P  RGF+  YG+         H+    F P+  L      R+ 
Sbjct: 122 TAMVGKWHLGHAQQTYHPNARGFEHFYGHL--------HTEVGFFPPFASLGGKDFQRNG 173

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
           + +D++    Y + L  +     I E + +KP F+Y+  +A H      P  AP++  AK
Sbjct: 174 VSIDDQG---YESYLLADEVSRYIRERDAAKPFFIYMPFIAPHT-----PLDAPEDLKAK 225

Query: 242 FLDISDPE-----------------------RRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           + D+ D                         R  YA +V  +D+S+G V+  L   G+ +
Sbjct: 226 YADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYAAVVDGMDQSIGRVLDTLDAEGLTD 285

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSE 337
           N+IVLF +DNG   + I      N PLRG K   ++GG+R VA + W   +    KV S 
Sbjct: 286 NTIVLFFSDNGGAVYAI--GGADNAPLRGGKGDTFEGGIRVVATMRWPEKIAPGGKVDS- 342

Query: 338 LFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
           +  + D  PTL AAAGIE      LDG N    +  G    R ++L  I
Sbjct: 343 IMSVMDVFPTLLAAAGIEPATRHRLDGRNLLPAIVDGEDVAREDLLFFI 391


>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
          Length = 176

 Score =  215 bits (548), Expect = 4e-53,   Method: Compositional matrix adjust.
 Identities = 94/140 (67%), Positives = 117/140 (83%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I++I+ DDLGWNDVSFHGSSQIPTPNIDALAY G+ILN+HY   LCTPSR++LM+G
Sbjct: 28  ERPNIVVIVTDDLGWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSG 87

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           K+PI++GMQH VI   EPWGL L +KL+P+Y +EAGY T  +GKWHLGFFR+ YTPT RG
Sbjct: 88  KHPINVGMQHHVIESNEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTRRG 147

Query: 148 FDSHYGYWQGLQDYYDHSCK 167
           FDSH+GY     DY+DHS +
Sbjct: 148 FDSHFGYIGPYIDYWDHSLQ 167


>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
          Length = 1489

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 175/550 (31%), Positives = 252/550 (45%), Gaps = 129/550 (23%)

Query: 26  APKKPHIIIILADDLGWNDVSF--HGSSQIP--TPNIDALAYNGLILNQHYVQALCTPSR 81
           A + PHI + L DD G  D+ +   G  Q+   TP +D LA +G+ L+++Y Q LCTPSR
Sbjct: 103 ASEYPHIFLSLIDDQGKADMGYGNDGLDQLAEATPFMDLLADDGIKLDKYYSQQLCTPSR 162

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           SAL++G YPIH GMQH VI     +G+P   KL+P +LK                     
Sbjct: 163 SALLSGYYPIHTGMQHDVIQPESMFGMPAGHKLMPSFLK--------------------- 201

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
                  D    Y+        H+C  +   Y   DM+ N  +   +IG+YST L+ +A 
Sbjct: 202 -------DDQLWYYN---HKSPHACDGSNCYY---DMQRNGAIAEDSIGVYSTFLFVDAF 248

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP------DEEVAKFLDISDPERRTYAG 255
            +V+   + S P+F+YL+   VHA     P   P        E++    I DP RRT+A 
Sbjct: 249 TDVLENEDASTPLFMYLSWQNVHA-----PLDPPPASFYTKTELSLVNSIEDPHRRTFAS 303

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           +   LD ++  V+ A++K G  +NSI++  +DNG    G   + G N+P RG K   ++G
Sbjct: 304 ITIVLDNAMKKVVGAMKKSGFYDNSILVVASDNG----GCSGSGGYNYPYRGGKQYLYEG 359

Query: 316 GMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEINDT----SLDGVNQW-D 368
           G+   A I SP + ++ + +S   LFH+SDWLPTL   A ++   T     LDGV+QW  
Sbjct: 360 GIHVNAFIHSPLIPKSARGTSYQGLFHVSDWLPTLVTGA-MKTEPTMLPADLDGVDQWHA 418

Query: 369 VLTKG----AKTKRSEILHNID--------NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
           +L  G    A   R+EILHNID        NV   +    A+RV D+K V G D +    
Sbjct: 419 ILGSGRGAPAAYPRTEILHNIDLWSLEIYGNVTKLRTPIQAIRVGDMKLVMGQDASTH-- 476

Query: 417 EWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEK 476
                       YSP E    + GI +                P   S++       D+K
Sbjct: 477 ------------YSPTETTCPE-GIGYCM--------------PSYESTEDCSYEYGDQK 509

Query: 477 ILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIY 536
                                       L+NIT DP E  +L+     L+  LE KLA Y
Sbjct: 510 Y---------------------------LYNITADPSELVDLSTVHPQLVSTLEAKLAGY 542

Query: 537 KSTMVPPGNK 546
           ++TMV P  +
Sbjct: 543 EATMVSPAYR 552


>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
          Length = 427

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 147/451 (32%), Positives = 220/451 (48%), Gaps = 69/451 (15%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN----MQV 184
           +GKWHLGF+R+   PT RGFD+ +G   G  DYY H  K       G D+  N       
Sbjct: 2   VGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHY-KCDSPGMCGYDLYENDNAAWDY 60

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
           DN   GIYST +YT+    ++A HN +KP+FLY+A+ AVH+     P QAP      +  
Sbjct: 61  DN---GIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRS 112

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNH 303
           I +  RR YA M+S LDE++ NV  AL+ +G   NSI+++ +DNG  P+ G     GSN 
Sbjct: 113 IININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG-----GSNW 167

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
           PLRG K T W+GG+R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LD
Sbjct: 168 PLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEDIQLD 227

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDT 422
           G + W+ +++G ++                      RVD L  +       ++  W    
Sbjct: 228 GYDIWETISEGLRSP---------------------RVDILHNIDPIYTKAKNGSWA--- 263

Query: 423 DNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
                         +  GI   A+++ ++++           SD           +  + 
Sbjct: 264 --------------AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQS 299

Query: 483 FARVRCN-YDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMV 541
           F+ +  N + N+    ++     LFNIT DP E+ +L+     ++K+L  +L+ +  T V
Sbjct: 300 FSNLGPNRWHNERITLSTGKSVWLFNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAV 359

Query: 542 PPGNKPFDKRADPARWNNIWVPWYDELDKQK 572
           P    P D R++P     +W PWY E  K+K
Sbjct: 360 PVRYPPKDPRSNPRLNGGVWGPWYKEETKKK 390


>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
 gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
          Length = 507

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 146/430 (33%), Positives = 226/430 (52%), Gaps = 61/430 (14%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+++IILADD+GWNDV +HGS  I TP+ID LA  GL L++ Y Q  C+P+R+AL++G+
Sbjct: 40  RPNVVIILADDMGWNDVGYHGS-DIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQ 98

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
               +G+ +  + +  P GL L +K++P Y ++AGY T  +GKWHLGF+   Y P  RGF
Sbjct: 99  SSQSLGI-YSPLSKLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGF 157

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D  YG   G   Y++H          GLD + N +   +    YST L +     +I + 
Sbjct: 158 DHFYGNLTGGVGYWNHVHGG------GLDWQRNGKTLRQE--GYSTHLQSAEITRLIQQR 209

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVI 268
           +  KP+FLY A  A H      P +AP + +A++  I +P RR +A MV+ LD ++G ++
Sbjct: 210 DPEKPLFLYAAFNAPHL-----PNEAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLM 264

Query: 269 AALRKHGMLENSIVLFMADNGA------PS------------FG----------IHSNK- 299
             L   GMLEN+++ FM+DNG       PS            FG          I +N  
Sbjct: 265 ETLSTEGMLENTLIWFMSDNGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNAL 324

Query: 300 --GS-NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
             GS N P R  K + ++GG R  + ++       +++ +++  + D LPTL +A  I+ 
Sbjct: 325 DGGSDNSPHRKGKQSIYEGGARVPSFVYWKGRLSPERI-TQMVTVKDVLPTLLSATDIDA 383

Query: 357 ND------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY-----YAALRVDDLKY 405
           N       T   GV+QW  LT+G   +  + L  I+ +D    Y       AL+  DL+ 
Sbjct: 384 NGLPTATTTESAGVDQWPGLTRGEFIQAPDYL--INGMDGEALYRFPWKLLALKSGDLEL 441

Query: 406 VAGTDNNGQS 415
               ++ G+S
Sbjct: 442 YNVDEDPGES 451


>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 328

 Score =  215 bits (547), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 136/357 (38%), Positives = 185/357 (51%), Gaps = 36/357 (10%)

Query: 214 MFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRK 273
           MFLY +H AVH G+     +AP   VA F  I    R  +AG V  LD+SVG V+ AL K
Sbjct: 1   MFLYFSHQAVHCGDYKVKLEAPASAVAHFPYIKSQNRSIHAGAVYELDKSVGLVVEALNK 60

Query: 274 HGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK 333
            GML NSIV+F  DNG   +GI  N G N PLRG K+T W+GG+RG A +WSP L ++ +
Sbjct: 61  RGMLSNSIVVFSTDNGGLPWGIDPNSGYNWPLRGAKATLWEGGVRGAAFVWSPLLFKSGR 120

Query: 334 VSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ 392
           +S++L HI+DWLPTL +AAG  ++     DG + W+ L++  ++ R EIL NID    P 
Sbjct: 121 LSNQLMHITDWLPTLYSAAGGNLSTLGDTDGKDMWEALSEDLESPRQEILINID----PT 176

Query: 393 KYYAALRVDDLKYVAGTDNNGQSDEWY-----GDTDNEIDKYSPKEVLYSKAGITFNALK 447
              +AL V   K V GT   GQ D+         T + +D+     +L S+ G       
Sbjct: 177 GNTSALIVGRRKVVLGTFQGGQHDQRMKAPGGSRTVDGLDQM----MLSSRTGKVLKDFY 232

Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
              Q++              +R    +E +        VRC+      +  +   P  F+
Sbjct: 233 NVRQLR--------------VRPNWRNETV--------VRCDQYAPRDNFVAASPPYYFD 270

Query: 508 ITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           +  DPCE NNLA S    LK+L  K+  Y   MVPPGNKP D R  P   + +W PW
Sbjct: 271 LEHDPCELNNLAASNVTDLKELIKKIEEYGKGMVPPGNKPLDPRGLPQNNHGLWGPW 327


>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 458

 Score =  215 bits (547), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 163/504 (32%), Positives = 228/504 (45%), Gaps = 85/504 (16%)

Query: 94  GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK--WHLGFFREVYTPTFRGFDSH 151
           GMQH +I   EP GL L  KLLPQ+L + GY TH IGK   HL                H
Sbjct: 1   GMQHSIIFNEEPRGLSLRFKLLPQWLADRGYTTHMIGKVAMHLSL-------------QH 47

Query: 152 YGYWQGLQDYYDHS----CKATFEPYQ-----------GLDMRHNMQVDNKTIGIYSTDL 196
           Y Y Q    +  H      +  F P             GLD R ++ +     G Y T L
Sbjct: 48  YSYEQIKHLHCRHLNSKLAEVFFRPQTLVLLLQPGGDTGLDFRRDLSISPNESGTYITQL 107

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYA- 254
            T  A  VIA H   K     + H           + A D E + F LD      ++ A 
Sbjct: 108 LTREAEAVIANHPIDK-----VCH-----------YLASDRETSVFALDTVVICTKSIAI 151

Query: 255 --------GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
                   GMVS LDESVG+V  +L   GML+N++ +F +DNGA +   ++N  S+ P +
Sbjct: 152 SFHKCLLPGMVSALDESVGSVFESLNHRGMLDNTVFVFSSDNGADTDSANANTASSWPFK 211

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSELFHISDWLPTLCAAAG--IEINDTSLDG 363
           G K TPW+GG+R  A IWSP        V + +FHISDWLPTL   AG   E     +DG
Sbjct: 212 GQKYTPWEGGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAGGLAEGMADDIDG 271

Query: 364 VNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           ++    LT G +  R E+L NID V+N    Y+A+     K V G  +    D+WY    
Sbjct: 272 LSHLHSLTGGTEPPRHELLINIDPVEN----YSAIIEGHFKLVKGRASGSAFDKWYSIPG 327

Query: 424 NEI-DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELRE 482
           N   D   P++   S   +         ++ +     P   +      +  D    + RE
Sbjct: 328 NVTWDSDMPRKQCESSVVV---------RVLRNIGLSPVCGAGRTRYAVPVDCGQRDPRE 378

Query: 483 FARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
            +            C      CLF+++ DPCE N+++    +++++L  KLA YK+T + 
Sbjct: 379 KS------------CAPERSACLFDLSKDPCEYNDVSGQHPEVVERLLKKLAGYKATALK 426

Query: 543 PGNKPFDKRADPARWNNIWVPWYD 566
           P N P D R++P   NN+WV W D
Sbjct: 427 PANLPDDPRSNPGLHNNVWVAWDD 450


>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
 gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
          Length = 458

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 132/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  S G
Sbjct: 224 LYCDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--SAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLCWKKAEKQYKNLSSQVTGFVDLVPTLKELVG 339


>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 480

 Score =  214 bits (546), Expect = 7e-53,   Method: Compositional matrix adjust.
 Identities = 143/394 (36%), Positives = 205/394 (52%), Gaps = 35/394 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           +KP+II+I+ADDLG+ DV F+GS  I TPNID LA +G   +  YV    C PSR+ALMT
Sbjct: 28  QKPNIILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMT 87

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP  IG Q  +   G   G+P   K + + L E  Y T A+GKWH+G   + Y P  R
Sbjct: 88  GRYPHKIGSQFNLPTRGSNVGVPTDAKFISKLLNENNYFTGALGKWHMGDAPQ-YHPNKR 146

Query: 147 GFDSHYGYWQGLQDYYD------------HSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
           GFD +YG+  G  +Y+                K  FE    L+  HN +   +T   Y T
Sbjct: 147 GFDEYYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLE--HNGKEVKETQ--YIT 202

Query: 195 DLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
           D  +  A+N + +  NK  P FLYLA+ A H      P QA DE++A F +I + +R+TY
Sbjct: 203 DALSREAVNFVDKAVNKKNPFFLYLAYNAPHV-----PLQAKDEDMAMFPNIKNKDRKTY 257

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
           AGMV  +D  VG ++  L+K+   +N++++FM+DNG    G  S   +N PL+  K +  
Sbjct: 258 AGMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNG----GKLSKGANNFPLKAGKGSTQ 313

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
           +GG R       P      K  S      D  PT  A AG ++ +   LDG N W    K
Sbjct: 314 EGGFRVPMLFHWPKHVPAGKRFSHPVSALDLYPTFAALAGAKVEENQHLDGTNMWPAFIK 373

Query: 373 GAKTKRSEILHNIDNVDNPQKYY--AALRVDDLK 404
                + E ++ + +    +K Y  AA+R++  K
Sbjct: 374 NENPHKDEPIYALRH----RKGYSDAAIRMNQWK 403


>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 495

 Score =  214 bits (546), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 140/399 (35%), Positives = 212/399 (53%), Gaps = 30/399 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K P+I+ +L DD G+ DV F   + I +PN D LA  GL+LN+HYV   C PSR++L+TG
Sbjct: 22  KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTG 80

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++P H+  Q  +  +    G  L   + P  LK A Y+TH +GKWH GFF   Y P  RG
Sbjct: 81  RWPHHV-YQWNLATDATA-GTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRG 138

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD+  G+  G +D+       T      +D   N   D +  G Y   +Y +   ++I  
Sbjct: 139 FDTSSGFLCGSEDH------MTQNAICAIDYWKNNAPDPRN-GTYDAYIYRDDLTDIINS 191

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGN 266
           HN  +P+FLYL    VH      P QAP E +  +   S  ++R T   MVS  D   G+
Sbjct: 192 HNTDEPLFLYLPLHNVHL-----PLQAPKEWLDIYPANSTCDKRHTLQAMVSVADNVTGH 246

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++  L+K G+ +N+I++  ADNG    G H + GSN+PLRG K T ++GG+R +A +   
Sbjct: 247 LVELLKKKGLWDNTIMVISADNG----GAHCS-GSNYPLRGCKETLFEGGVRSLAFVNGG 301

Query: 327 WLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDT-----SLDGVNQWDVLT-KGAKTKR 378
            L ++++  S     HI+DW  T C  AG++ +D+     S+DG++ W ++T +  KT+ 
Sbjct: 302 LLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFSVDGLDVWPIITGENEKTQH 361

Query: 379 SEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
            EI+    N D       A+ V + K + G  N  + D 
Sbjct: 362 EEIVLGY-NFDFNHPGQGAIIVGNFKLLVGLQNAFECDS 399


>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 307

 Score =  214 bits (546), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 121/315 (38%), Positives = 177/315 (56%), Gaps = 14/315 (4%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
           +GWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A+MTG YP   G QH +
Sbjct: 1   MGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
                P G+PL  KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGFDSHYG W  G+
Sbjct: 60  AFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
            DY          P  G D R NM V  K+   Y   +  + A +++  H    P+FL  
Sbjct: 120 GDY--DKMDGVLSPSAGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLFLAF 176

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
             L + A +     + P+E   K+ +I D   R + G ++ +D+ +G V+ A++  GM +
Sbjct: 177 T-LDIPAKH----LEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNRGMYD 231

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           +S+++F+ DN     G+ S  GSN+P RG+  T ++G  R  + +    +K+T   S+EL
Sbjct: 232 DSLIIFIGDNE----GLSSQSGSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTGYESNEL 287

Query: 339 FHISDWLPTLCAAAG 353
           + I D   T+   AG
Sbjct: 288 YSIVDIHRTILDVAG 302


>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 223

 Score =  214 bits (546), Expect = 8e-53,   Method: Compositional matrix adjust.
 Identities = 108/225 (48%), Positives = 146/225 (64%), Gaps = 7/225 (3%)

Query: 95  MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY 154
           MQ   I     WG+ L EK LPQYLK  GY THAIGKWHLGFF + YTPT+RGFDS YGY
Sbjct: 1   MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60

Query: 155 WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYSTDLYTEAAINVIAEHNK 210
           + G  DY+DHS  +    + GLD+ ++    +K I    G YST +Y+  AI+ I  HN 
Sbjct: 61  YGGQADYWDHSLAS--NGWWGLDLHYDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNS 118

Query: 211 SKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           ++PMFLYLA+ AVH+ N  E P QAP E V KF  I    R+ YA M+  +D  +G + +
Sbjct: 119 TQPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITS 178

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            L K  ML++SI++F +DNG P+ G+++N  +N+PLRG K+T ++
Sbjct: 179 ELLKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGTKTTVYE 223


>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
          Length = 330

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 130/328 (39%), Positives = 174/328 (53%), Gaps = 44/328 (13%)

Query: 28  KKPHIIIILADDLG------------------WNDVSFHGSSQIPTPNIDALAYNGLILN 69
           KKP+ + ILADDLG                  +ND+       I TPN+DALA  G+IL 
Sbjct: 19  KKPNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNMDALASKGVILT 77

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
            +YVQA+C+PSR ALMTG+YP    MQ  VI   E     L  K LPQYLK+ GY  H I
Sbjct: 78  NNYVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQYLKDLGYKNHMI 137

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GKWHLG+ RE   PT RGFD+ YG +    DY++H     +      D      VD +  
Sbjct: 138 GKWHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMY------DWHTEAGVDFEAR 191

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA-----KFLD 244
           G ++ DL  E    V  EH+   P+FLY A       N++ P Q  DE +      KF D
Sbjct: 192 GTHAQDLEIERLDKVFDEHDNKDPLFLYFA-----PQNSHTPLQPTDEFINLYPEDKFSD 246

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           I    RR Y G+ S LD  VG ++  L ++ M+ N+ V+F++DNGA        +G N P
Sbjct: 247 I----RRKYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI-----EGRNAP 297

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
            RG K + ++GG +  + I+SP LK+T+
Sbjct: 298 FRGGKGSLFEGGTKSSSFIYSPLLKKTE 325


>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 494

 Score =  214 bits (544), Expect = 1e-52,   Method: Compositional matrix adjust.
 Identities = 139/397 (35%), Positives = 211/397 (53%), Gaps = 30/397 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II +L DD G+ DV F   + I +PN D LA  GL+LN+HYV   C+PSR++L+TG++
Sbjct: 23  PNIIFVLVDDWGFADVGFKNPA-ISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRW 81

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P H    +  +      G  L   ++P  LK A YATH +GKWH GFF   Y P  RGFD
Sbjct: 82  PHHTHQWN--LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFD 139

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
           +  G+  G +D+       T      +D   N   D +  G Y   +Y +   ++I  HN
Sbjct: 140 TSSGFLCGSEDH------MTQNAICAIDYWKNNAPDPRN-GTYDAYIYRDDLTDIINSHN 192

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGNVI 268
            ++P+FLYL    VH      P QAP E +  +   S  E+R T   M+S  D   G+++
Sbjct: 193 TNEPLFLYLPLHNVHL-----PLQAPKEWLDIYPANSTCEKRHTLQAMISVADNVTGHLV 247

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             L+K G+ +N+I++  ADNG    G H + GSN+PLRG K T ++GG+R +A +    L
Sbjct: 248 ELLKKKGIWDNTIMVISADNG----GAHCS-GSNYPLRGCKETLFEGGVRSLAFVNGGLL 302

Query: 329 KQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTKRSE 380
            ++++  S     HI+DW  T C  AG++ +D+      +DG++ W ++T +  KT+  E
Sbjct: 303 PESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGENEKTQHEE 362

Query: 381 ILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDE 417
           I+    N D       A+ V + K + G  N  + D 
Sbjct: 363 IVLGY-NFDFNHPGQGAIIVGNFKLLVGLQNAFECDS 398


>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 458

 Score =  214 bits (544), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
 gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
          Length = 458

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
          Length = 458

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 130/355 (36%), Positives = 195/355 (54%), Gaps = 33/355 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTL 348
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTL 334


>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 487

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 133/361 (36%), Positives = 190/361 (52%), Gaps = 32/361 (8%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A  +P+I++I+ DD+G+ DV FHG   IPTPN+DALA +G+     YV    C+P+R+ L
Sbjct: 33  AADRPNILLIVGDDMGYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGL 92

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG+Y    G  H     G   GLPLTE  +   LK+ GY T  +GKWHLG  +    P 
Sbjct: 93  LTGRYQQRFG--HEFNPSGANTGLPLTEVTIADRLKQVGYTTGLVGKWHLGS-QPAMHPQ 149

Query: 145 FRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGF+   G+  G   ++D        EP + +D              Y+TDL+   A++
Sbjct: 150 ERGFEEFIGFLGGAHSFFDAQGILRGHEPVKTID--------------YTTDLFGREAVS 195

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I +H + KP FLYL+  AVH      P  A ++ +AK   ISD ERRTYA M+  +DE+
Sbjct: 196 FIEKH-RDKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQERRTYAAMMLAMDEA 249

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           +G V+  L   G  + ++V+F++DNG P+  G+  N   N PLRG K T  +GG+R    
Sbjct: 250 IGKVLTQLETTGQKQKTLVMFISDNGGPTMPGVTINGSINTPLRGSKRTTLEGGIRVPFV 309

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +  P         S +  + D   T  A AG+E  D   DGVN    L    + K+SE+ 
Sbjct: 310 VSWPGKIAPAVFDSPVIQL-DLTATALAVAGVE-KDVKSDGVNLLPYL----QGKQSEVP 363

Query: 383 H 383
           H
Sbjct: 364 H 364


>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 458

 Score =  213 bits (543), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 479

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 140/373 (37%), Positives = 199/373 (53%), Gaps = 31/373 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           K+P+I+IIL DDLG+NDV F+GS  I TPN+D LA NG+I+   YV    C PSR+++MT
Sbjct: 26  KRPNIVIILCDDLGYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMT 85

Query: 87  GKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           GKY   +G Q  +    EG  +G+PL  K + + L+EAGY T A GKWHLG     + P 
Sbjct: 86  GKYAHTMGAQFNIPSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLG-ADTPFHPN 144

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----------MRHN-MQVDNKTIGIYS 193
            RGFD  YG+  G  DY     K  +E  +             + HN  +VD K    Y 
Sbjct: 145 KRGFDEFYGFLGGGHDYIPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEKE---YI 201

Query: 194 TDLYTEAAINVIAEHN-KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           TD  +  A+N + + + K +P F+YLA+ A H      P QA  E++A F  I D +RRT
Sbjct: 202 TDGLSREAVNFVYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEKRRT 256

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           YA MV  +D  VG ++ AL+ +  LEN++++F +DNG    G      +N PL   K + 
Sbjct: 257 YAAMVYAVDRGVGKLVEALKANQQLENTLLVFFSDNG----GKLGKGANNFPLTEGKGSA 312

Query: 313 WDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
           ++GG R      W   +K  QK  + +  + D  PT  + A  +  N  +L G N    L
Sbjct: 313 YEGGHRVPMFFHWPNRIKAGQKFHNPVSAL-DLYPTFASLAKTKPSNAKNLAGTNILPHL 371

Query: 371 TKGAKTKRSEILH 383
            KG    + E+L+
Sbjct: 372 QKGTNPHKDEMLY 384


>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 139/408 (34%), Positives = 211/408 (51%), Gaps = 44/408 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           +KPHI+++L DD GW +V +H    + ++ TPNID L   GL L+QHY    C+PSRS L
Sbjct: 21  QKPHIVLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAYKFCSPSRSCL 80

Query: 85  MTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           M+G+ PIH+   +       P         +P     + + +KEAGYATH +GKW  G  
Sbjct: 81  MSGRLPIHVNDLNLAPTNYNPNDPVSGFSAIPRNMTGIAEKMKEAGYATHQVGKWDAGMA 140

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC--KATFEPYQGL-DMRHNMQVDNKT-IGIYS 193
              +TP  RGFD+ +GY+    DYY      + +  P   L D  H     N T    Y 
Sbjct: 141 TPDHTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIVDLWDTDHPAHGINGTGPDKYE 200

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             L+ E  ++V+++H+ + P+FLY A    H      P Q PD+ + KF  I D +R+ Y
Sbjct: 201 EGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDRKYY 255

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPLRGMKSTP 312
             MV+ LD+ VG+++AAL++ GM +N + +  +DNG P   I++  G+ N+PL+G K T 
Sbjct: 256 HAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGP---IYAGGGANNYPLKGGKLTD 312

Query: 313 WDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEINDTS--------LD 362
           W GG+R  A +   +L  K   + +    H++DW  T CA AG++  D          +D
Sbjct: 313 WQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAGVDPTDEKAAKAKLPPVD 372

Query: 363 GVNQWDVLTKGAKTK-RSEILHNIDNVDNPQKYYAALRVDDLKYVAGT 409
             N W  ++    T  R++I  +I+          AL   D K + GT
Sbjct: 373 SYNMWPFISGETDTSPRTDIPASIN----------ALISGDYKILIGT 410



 Score = 43.5 bits (101), Expect = 0.28,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 3/64 (4%)

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGN---KPFDKRADPARWNNI 560
           CL+NI DDP E++NLA S  + LK +++KL  Y +T   P      P   +A   ++   
Sbjct: 442 CLYNIKDDPEERHNLASSMPEQLKTMQEKLKKYLATYFNPDRGHVSPLACQAAVNKYGGF 501

Query: 561 WVPW 564
           W P+
Sbjct: 502 WGPF 505


>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
 gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
          Length = 458

 Score =  213 bits (542), Expect = 2e-52,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 197/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETVADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI+ I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIHCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 129/346 (37%), Positives = 188/346 (54%), Gaps = 28/346 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P++++++ADDLG+ D+ F G+  IPTP++DALA  G+     YV    C+P+R+ L+TG
Sbjct: 36  RPNVVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTG 95

Query: 88  KYPIHIGMQHG-----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +Y    G +            +  GLP+TE  L   LK AGYAT  +GKWHLG     + 
Sbjct: 96  RYQQRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGS-EAKFH 154

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P  RGFD  +G+  G   Y+      + + Y+G ++             Y TD ++  A+
Sbjct: 155 PQKRGFDEFFGFLGGQHTYF---ASKSGDVYRGTEVVKEEA--------YLTDAFSREAL 203

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I  H K  P FL L+  AVH      P  A ++ VA+F  I DP+RRTYA M+S LDE
Sbjct: 204 SFIDRH-KDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSALDE 257

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           + GNVI  LR  G+ E ++++F +DNG P+  G  +N   N PLRG K T  +GG+R   
Sbjct: 258 AAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTTNGSKNDPLRGSKRTTLEGGVRVPF 317

Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVN 365
            + W   L   +     +  + D LPT  AAAG++   +  LDGVN
Sbjct: 318 VLSWKGTLPAGKVYDQPIIQL-DILPTALAAAGVDAEPEWKLDGVN 362


>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
 gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
          Length = 473

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 135/361 (37%), Positives = 197/361 (54%), Gaps = 23/361 (6%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N  A   P+I+ IL DDLG+ DV + GS QI TPNID LA  G+ L   Y   +C+P+R+
Sbjct: 36  NNQATGAPNILFILLDDLGYGDVGYLGS-QIQTPNIDNLASQGVTLKHGYAYPICSPTRA 94

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           ALMTGK P++ G+     +E +   LP     +P+  +EAGY T  +GKWHLG  +    
Sbjct: 95  ALMTGKNPLNFGIDGP--MENDAM-LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAM 151

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P  RGFD  YG+  G  DYY H        + GLD ++N     +  G + T+L T  AI
Sbjct: 152 PHNRGFDDFYGFLGGFVDYYTHVY------FGGLDWQNN-DTSLREEG-FVTELLTAKAI 203

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I      KP F+YL++ A H      P Q     +  +      +R+ +A M S +D+
Sbjct: 204 DKITHFKGDKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQ 258

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
            +G V+AAL K G+ EN+IV+FM+DNG  +    +N GS   L+G KS+ ++GG+R    
Sbjct: 259 DIGEVLAALDKRGIAENTIVVFMSDNGG-NLEAGANNGS---LQGGKSSVYEGGVRVPVL 314

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG-AKTKRSEI 381
           I  P   Q++ ++  +F   DW PTL  AA I+      DGV+Q+  L +   KT + E+
Sbjct: 315 IRWPAQLQSKTMTQPMF-TQDWAPTLLEAANIDYQPNDFDGVSQFVNLKEAQEKTFKREV 373

Query: 382 L 382
           +
Sbjct: 374 V 374


>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
 gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
          Length = 498

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           ++ FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP++DAL   
Sbjct: 42  KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 99

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 100 GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 156

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 157 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 216

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 217 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 263

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 264 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 321

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG    A + W    KQ + +SS++    D +PTL    G
Sbjct: 322 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 379


>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
 gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 511

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 132/377 (35%), Positives = 204/377 (54%), Gaps = 34/377 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           +P+I+++L DDLG+ DV F+GS+ I TP +D LA NG I    YV    C PSRSA++TG
Sbjct: 47  RPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSRSAILTG 106

Query: 88  KYPIHIGMQHGVIL-----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +YP   G  + +       + +  G+P+ E  + + L+ AGY T AIGKWHLG   + + 
Sbjct: 107 RYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGAAPK-FH 165

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LDMRHNMQVDNKTIGIY 192
           P  RGFD  YG+  G  DY+    + T++  +             M HN +  N+T   Y
Sbjct: 166 PNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPANETE--Y 223

Query: 193 STDLYTEAAI-NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
            TD ++  AI N+     K +P F+YLA+ A H      P QA  E+VAKF  I D +RR
Sbjct: 224 ITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHV-----PLQAKAEDVAKFAHIKDKDRR 278

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
           TYA MV  +D  VG ++  L++    +N++++F++DNG    G  ++  +N+PL+G K  
Sbjct: 279 TYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNG----GNFNHGANNYPLKGTKGD 334

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEI-NDTSLDGVNQWD 368
            W+GG R    ++  W K+ +K     F +S  D  PT    A  ++     LDG N  D
Sbjct: 335 TWEGGYR--VPMFFHWPKKIKKDQRFDFPVSSLDLYPTFTGLAEAKLPKGKQLDGKNIMD 392

Query: 369 VLTKGAKTKRSEILHNI 385
            + K  +  + E+++++
Sbjct: 393 DVLKNTEPYKDEMIYSL 409


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 124/379 (32%), Positives = 201/379 (53%), Gaps = 41/379 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P++I+++ADDLGWNDV +HG + I TP++D LA  G+ LN+ Y   +C+P+R+ALMTG+
Sbjct: 35  RPNVIVMVADDLGWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGR 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
            P+ +G+ +GVIL  +  G+   E  +PQ  + AGY T  +GKWHLG  +  Y P  RGF
Sbjct: 94  DPMRLGIAYGVILPWDNIGVNPAEHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGF 153

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQVDNKTIGIYSTDLYTEAAINVIAE 207
           +  YG+      +Y       F    G D + N + +D++    Y T L  +     I +
Sbjct: 154 EHFYGHLHTEVGFY-----PPFANVGGKDFQENGVSIDDEG---YETYLLADEVSRYIRD 205

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-----------SDPERR----- 251
            ++ KP F+Y+  +A H      P  AP E   K+ DI           +D  RR     
Sbjct: 206 RDEEKPFFIYMPFIAPHT-----PLDAPQELQEKYKDIETDLAPARSNQTDSTRRMAKLM 260

Query: 252 -------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
                   YA +V  +D+++G V+  L + G+ +N+IVLF +DNG  ++        N P
Sbjct: 261 MQPSARPMYAAVVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSY--GGADNAP 318

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDG 363
           LRG K   ++GG+R V+ +  P + +  +   ++  + D  PTL  A G+  +N+   DG
Sbjct: 319 LRGGKGETFEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTLAEATGVTPLNNFPFDG 378

Query: 364 VNQWDVLTKGAKTKRSEIL 382
            + W  + +G   +R E++
Sbjct: 379 SSLWSSIKEGEIHERDELV 397


>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
          Length = 617

 Score =  212 bits (540), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 139/419 (33%), Positives = 212/419 (50%), Gaps = 62/419 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I+ I+ADDLG+ND+ ++ S +  TPNI+ LA NG+IL+ HY Q +C+PSRS  +TG
Sbjct: 156 EQPNIVFIMADDLGYNDIGYN-SIEAFTPNINYLAKNGIILDSHYSQPVCSPSRSQFLTG 214

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y    GMQH  IL  +P G+PLTEK+LP+  KE GY+T   GKWH G F E Y PT RG
Sbjct: 215 RYSFRYGMQHRNILPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRG 274

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEP-------YQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           FD   G + G   +  H  K    P       + G DMR    V+ +     S  L  E 
Sbjct: 275 FDKFVGSYSGSSQHSTHE-KCFNSPQRDSPFSFCGYDMREAEVVEARQNLTGSDFLLREI 333

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----DISDPERRTYAG 255
            I +                  VH   T +P    DE++ + L     + +  +R     
Sbjct: 334 CIRI-----------------TVHGPLTTDP---RDEKIYEELYERHRNETIYKRTKMLA 373

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV+ LD ++G +I + +K+G  +N+I  F +DNG    G  + + +N P RG K+   +G
Sbjct: 374 MVTGLDRAIGRIIESFKKYGKYKNTIFAFASDNG----GSLTYRANNMPFRGGKNDLLEG 429

Query: 316 GMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCAAAGIEIND--TSLDGVNQWD-VL 370
           G+R ++ + S  +   +  S  S LF I+DW PTL  A      +    LDG + ++   
Sbjct: 430 GVRTLSFLHSSLIPNDKIGSRYSNLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFF 489

Query: 371 TKGAKTKRSEILHNID-------NVDNPQKYY------------AALRVDDLKYVAGTD 410
           + G  + R EI+H++D        V++P+ +              A+RV   K + G +
Sbjct: 490 SAGNASVRDEIVHHLDPLKKADAEVEDPRPFVKVYGFKYDIRMKTAIRVGKYKLITGAE 548


>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
 gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
          Length = 458

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 195/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL     ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLYAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
 gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
          Length = 458

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP +DAL   
Sbjct: 2   KKEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITKEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KF------LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            +      L   + ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTNNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
 gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
          Length = 458

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 130/360 (36%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           ++ FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP++DAL   
Sbjct: 2   KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG    A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
           [Strongylocentrotus purpuratus]
          Length = 401

 Score =  212 bits (539), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 122/285 (42%), Positives = 168/285 (58%), Gaps = 27/285 (9%)

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF------EP 172
           L++AGYATHA+GKWH+G ++    PT RGFDS +G   G   +Y H            + 
Sbjct: 5   LRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLVDSKN 64

Query: 173 YQGLDMRHNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
           + G D+R N+ QV     G+YST L+T+ + N+I  HN+SKP+FLY +  AVH      P
Sbjct: 65  WSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHY-----P 119

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
            + P   +  F  I+D  RRTYAGMV  +DE+VGN+   L+K G+  N+I++F +DNGA 
Sbjct: 120 LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSDNGAN 179

Query: 292 -SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTL 348
            ++G     GSN PLRGMK + W+GG++ V  I SP L +  +  V++ LFH++DW PTL
Sbjct: 180 FNYG-----GSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFHVTDWFPTL 234

Query: 349 C-AAAGIEINDTSLDGVNQWDVLTKGAKTK------RSEILHNID 386
               A   +  T LDG N W  L    KT+      R EILHNID
Sbjct: 235 VRGVARGSLKGTHLDGHNLWKHLLNKKKTRGKDSWPRKEILHNID 279


>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 286

 Score =  211 bits (538), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 119/299 (39%), Positives = 170/299 (56%), Gaps = 14/299 (4%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGV 99
           +GWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A+MTG YP   G QH +
Sbjct: 1   MGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQM 59

Query: 100 ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQ-GL 158
           +    P G+PL  KLLP+ LKE GYATH +GKWHLGF ++ Y PT RGFDSHYG W  G+
Sbjct: 60  VFNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGV 119

Query: 159 QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
            DY          P  G D R NM V  K+ G Y   +  + A +++  H    P+FL  
Sbjct: 120 GDY--DKLNGVLSPSAGYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLTF 176

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
             L + A    +  + P+E    + DI D   R + G ++ +D+ +G V+ AL+  GM +
Sbjct: 177 T-LDIPA----KHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMYD 231

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
           +S+++F+ DNGA S    S  GSN+P RG+  T ++G     + +    +K+T   S+E
Sbjct: 232 DSLIIFIGDNGALS----SQSGSNYPFRGIAGTLFEGATCVPSIVSGKGIKKTGYESNE 286


>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
          Length = 458

 Score =  211 bits (538), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 196/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           ++ FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP++DAL   
Sbjct: 2   KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RGF   YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG+   A + W    KQ + +SS++    D +P L    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGVHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPMLKDLVG 339


>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 336

 Score =  211 bits (538), Expect = 7e-52,   Method: Compositional matrix adjust.
 Identities = 138/379 (36%), Positives = 196/379 (51%), Gaps = 62/379 (16%)

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDP 248
           G YST +Y+  AI+ I  HN ++PMFLYLA+ AVH+ N  E P QAP E V KF  I   
Sbjct: 13  GNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHK 72

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            R+ YA M+  +D  +G + + L K  ML++SI++F +DNG P+ G+++N  +N+PLRG 
Sbjct: 73  GRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLNNNWATNYPLRGT 132

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQW 367
           K+T ++GG+RG A IWS  L +  +VS++L HI+DWLPTL +AAG++     +LDG+N W
Sbjct: 133 KTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSAAGVDTTLLPNLDGLNLW 192

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYY-AALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
           D L     + R+E+L NID    P  Y  AALRV D K V     N   D W        
Sbjct: 193 DTLINQVPSPRTEVLINID----PLLYMNAALRVGDWKIVK---QNNFYDGW-------- 237

Query: 427 DKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARV 486
             Y P E+              +++    H + P       L     D+K  +L      
Sbjct: 238 --YPPPEI------------SNEIEEYDVHCSPP------PLLPRSCDKKDGKL------ 271

Query: 487 RCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
                            CLFNI  DPCE  +L+ S   + + + ++L  Y  +MV     
Sbjct: 272 -----------------CLFNIKYDPCEYIDLSASNPHIYQFMLERLNFYNKSMVESRRN 314

Query: 547 PF-DKRADPARWNNIWVPW 564
            F D  ADP  +N +W  W
Sbjct: 315 TFRDPLADPKLYNGVWKSW 333


>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
          Length = 314

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 125/325 (38%), Positives = 183/325 (56%), Gaps = 37/325 (11%)

Query: 40  LGWNDVSFHGSS--------QIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPI 91
           LG+ DVS+   +         IPTP+ID LA +G+ L  +YV  LC+P+R++L++G+Y  
Sbjct: 7   LGFTDVSYKNQTYPDVAFGPMIPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGRYAY 66

Query: 92  HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
            +GM   VI++G P  LPLT K + ++L+E G+AT A GKW  G      TPT RGFD+ 
Sbjct: 67  TLGMNAEVIVDGHPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGFDTF 126

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---VDNKTIGIYSTDLYT---EAAINVI 205
            G++    DY+ H           LD+RH+      D+   G+Y T+L T   +  I   
Sbjct: 127 SGFYNAFNDYFTHRVG------NYLDLRHDFAPDLADDNHTGVYMTELLTSRVQQFITTA 180

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE----RRTYAGMVSRLD 261
           AE  +S   F Y+AH AVHA     P Q P   +  +   + PE    RR   GM+  +D
Sbjct: 181 AETEQST--FAYVAHQAVHA-----PNQVPMSYLEGYCLDTIPEDRPTRRILCGMMRAVD 233

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           ESV N+ A  ++ G+  +++++F  DNG    G     GSN+PLRG K+T ++GGMRGV 
Sbjct: 234 ESVRNITATYKQLGLWNDTVLIFTTDNG----GNPETGGSNYPLRGQKATTFEGGMRGVG 289

Query: 322 AIWSPWLKQTQK--VSSELFHISDW 344
            + SP L ++Q+  +S EL H+SDW
Sbjct: 290 FVNSPLLNESQRGFISDELIHVSDW 314


>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 508

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 141/433 (32%), Positives = 211/433 (48%), Gaps = 50/433 (11%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFH---GSSQIPTPNIDALA 62
           K  F L CT     A L    P  P+I+++L DD GW +V +H    + ++ TPNID+L 
Sbjct: 5   KAAFILLCTT----ACLAVPPPPGPNIVMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLV 60

Query: 63  YNGLILNQHYVQALCTPSRSALMTGKYPIHI---GMQHGVILEGEPW----GLPLTEKLL 115
             GL L+QHYV  +C+PSRS+LM+G+ PIH+    ++       +P      +P     +
Sbjct: 61  RQGLELDQHYVFNVCSPSRSSLMSGRLPIHVNDLNIEPDYYNPDDPVSGFSAIPRNMTGI 120

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
            Q +K  GY TH +GKW  G     +TP  RGFDS +GY+    D+Y    +   +P   
Sbjct: 121 AQKMKLGGYDTHQVGKWDAGMATHTHTPKGRGFDSSFGYFHHANDFY---TEIDGKPCNK 177

Query: 176 LDMRHNMQVDNKTIGI-------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
             +      D    G+       Y   L+ E  + V+ EH+  KP+FLY A   VHA   
Sbjct: 178 TKIVDIWVTDKPGYGLNGTGPDNYEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA--- 234

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
             P Q P     KF  I D +R+ Y  MV+ LD+ VG ++ AL+K G+ EN + +  +DN
Sbjct: 235 --PLQVPQRYQDKFSFIDDHDRQIYHAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDN 292

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLP 346
           G P +       +N+PL+G K++ W GG+R  A +   +L  K   K      H++DW  
Sbjct: 293 GGPVY--PGGGANNYPLKGGKASDWQGGIRSNAFVSGGYLPEKMRGKKLDGYIHLADWYA 350

Query: 347 TLCAAAGIEINDTS--------LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAAL 398
           T CA A ++  D +        +D +N W +++    T           VD P   Y  L
Sbjct: 351 TFCALANVDPTDEAAAKADLPPIDSLNMWPLISGETMTSP--------RVDIPIS-YKTL 401

Query: 399 RVDDLKYVAGTDN 411
              + K + GT+N
Sbjct: 402 ISGEYKILTGTNN 414



 Score = 41.6 bits (96), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 3/64 (4%)

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKST-MVPPGNKPFDKRADPA--RWNNI 560
           CL+NI  DP E+ +LA    D LK+++ KLA Y+ T   P    P+    D A   +N  
Sbjct: 443 CLYNIMTDPEERVDLAAKMPDKLKEMQQKLAQYQKTHFAPDRGSPWPGACDAALNTYNGF 502

Query: 561 WVPW 564
           W P+
Sbjct: 503 WGPF 506


>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
            queenslandica]
          Length = 2318

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 135/369 (36%), Positives = 198/369 (53%), Gaps = 33/369 (8%)

Query: 29   KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
            KPHI+ +L DD G+ D SF   + I TPN   L  NGLILN+HYV   C+PSR++ +TG+
Sbjct: 1140 KPHIVFVLVDDWGFADASFRNPA-IKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGR 1198

Query: 89   YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
            +P H+   +   L     G  +   +LP  LK AGYATH +GKWH G +   Y P  RGF
Sbjct: 1199 FPHHVHQWNPTPLG--MVGTNINMTMLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGF 1256

Query: 149  DSHYGYWQGLQDYYDHS--CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
            D+  G+ Q  + +++ +  C   F      D R N   D+    IY+ DL T     V  
Sbjct: 1257 DTSSGFLQAEEGHFNQTIGCAVDFWKNDAPDTR-NGTCDSY---IYNKDLTT-----VFN 1307

Query: 207  EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVG 265
            EH+ SKP+FLYL    VHA     P +AP E +  +   S  + RR    MVS  D   G
Sbjct: 1308 EHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTG 1362

Query: 266  NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            +V+  LR++ M +N++++  ADNG         KGSN+PL+G K T ++GG+R +A    
Sbjct: 1363 HVVNLLRENDMWDNTLMVVSADNGGADC-----KGSNYPLKGSKYTFFEGGVRVIAFASG 1417

Query: 326  PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
              +   ++  S     HI+DW  T C  AG++ ND+      +DG++ W ++T +  +T+
Sbjct: 1418 GLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHNDSGKGKFPVDGLDVWPIITGENEETQ 1477

Query: 378  RSEILHNID 386
              EI+   D
Sbjct: 1478 HEEIVLGYD 1486


>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 474

 Score =  210 bits (535), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 136/386 (35%), Positives = 211/386 (54%), Gaps = 22/386 (5%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +KY+  LT  L F    L     K+P+I+IIL+DD GW DV F+G++ IPTPN++ALA +
Sbjct: 2   KKYYGYLTLLLAFCCTTLQAQKSKQPNILIILSDDQGWGDVGFNGATDIPTPNLNALAKD 61

Query: 65  GLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEP----WGLPLTEKLLPQYL 119
           G+I +Q Y     C+PSR+ L+TG+Y    G ++    E +      GLPL E ++ + L
Sbjct: 62  GVIFSQGYSSHPYCSPSRAGLLTGRYQQKFGHENNPENEKQNEDTVIGLPLNELMISEVL 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
           ++  Y T AIGKWHLG   + + P  RGF   +G+  G  +Y+  +     E   G+ M+
Sbjct: 122 QQNNYHTCAIGKWHLGNAHK-FLPNQRGFKDWFGFSGGGFNYWGKTTPKNKE--LGV-MK 177

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEE 238
           +   V   T+  Y TD ++  AIN I +++K+ +P F+YLA+ A HA     P QA  E 
Sbjct: 178 NGKPVPENTL-TYLTDDFSNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEY 231

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
                 I + ER  YA MV+ +D  +G VI  L++ G+  N++++F +DNG      HS+
Sbjct: 232 TNLVTHIENGERAAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG-----HSH 286

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-N 357
             S+ P RG K   ++GG+R    I  P   +  +   E     D  PT+ AA   ++  
Sbjct: 287 GASSAPYRGHKGMLFEGGIREPFVISWPNKIKGNRTYKEPIIALDIFPTILAATNTQVPA 346

Query: 358 DTSLDGVNQWDVLTKGAKTKRSEILH 383
             ++DGVN   +LTK +KT +   ++
Sbjct: 347 SKNIDGVNLLPILTKKSKTLKERPMY 372


>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 161/580 (27%), Positives = 260/580 (44%), Gaps = 116/580 (20%)

Query: 28  KKPHIIIILADDLGWNDVSFHGS----SQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + PHI+++L DDLGW DV +H S    S I TP ID L   G+ L +HYV  +C+P+R++
Sbjct: 51  QPPHILMVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRAS 110

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
             +G+ P+H G+   V+L     G+P     + Q+L +AGYA+H +GKW +G     +TP
Sbjct: 111 FQSGRLPVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTP 169

Query: 144 TFRGFDSHYGY------------WQGLQDYYDHS--CKA--TFEPYQGLDMRHNMQVDNK 187
             RG+++   Y            WQG Q+   H   CKA   F+ +   D        N 
Sbjct: 170 HGRGYNTSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDFWDTDR----PAHNL 225

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
           +  +Y   L+ +   ++I  H+ S+P+FL  A    H      P QAP E   +F +I  
Sbjct: 226 SGTLYEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEP 280

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI-------HSNKG 300
           P RR Y  MV  LD+++  +   ++K GM  N++++F +DNG P           H ++G
Sbjct: 281 PSRRVYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRG 340

Query: 301 ----------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTL 348
                     +N+PLRG K + ++GG+R  A +    +   Q+ S+   + HI+DW  T 
Sbjct: 341 YMCYNGEAGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWYATF 400

Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           CA AG++  D      N   + +           HN+                   Y+AG
Sbjct: 401 CALAGVDPTDHRAAAANLPPIDS-----------HNM-----------------WPYLAG 432

Query: 409 TDNNGQSDEWYGDTDNEIDKYSPKE-VLYSKAGITFNALKTKLQIKQKHAADPKANSSDA 467
                              ++SP++ VL  +  + F   K  L      A++P AN S  
Sbjct: 433 Q-----------------ARFSPRDTVLVDRDVLVFGDYKLLLG-----ASNPMANWSGP 470

Query: 468 LRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLK 527
                T          +R   N  + G          LFN+ +DP E+++LAE + D+LK
Sbjct: 471 AYPNATTAT----SPISRYTLNCSDPGC---------LFNVVEDPEERHDLAEEKPDILK 517

Query: 528 QLEDKLAIYKSTMVPPGNKPFDK---RADPARWNNIWVPW 564
            ++  L     ++ P    P D    +    +++  + PW
Sbjct: 518 LMKFTLTELAKSIFPYNPGPVDPQCMKTAETKYDGFYGPW 557


>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 458

 Score =  210 bits (534), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 129/360 (35%), Positives = 195/360 (54%), Gaps = 33/360 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           ++ FF L C      AFL+    ++P+I+I+LADDLGW DV FHGS +I TP++DAL   
Sbjct: 2   KEEFFGLLCGCTLLPAFLHAQT-ERPNIVIVLADDLGWGDVGFHGS-EIKTPSLDALVGE 59

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKE 121
           G+ L + Y   + TP+R+ LMTG+YP   G++  VI    PW   GL   E+ +   L  
Sbjct: 60  GVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI---PPWREDGLDENEETMADMLAR 116

Query: 122 AGYATHA-IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            GY   A IGKWHLG  ++V+ P  RG    YG+  G  DY+D + +   + +   +  H
Sbjct: 117 NGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAIDYFDLTREGELDWHNDWETCH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +          YST+L T+ AI  I  + K  P  LY+A+ A H      P QA ++++ 
Sbjct: 177 DKG--------YSTELITQEAIRCIDAYEKEGPFMLYVAYNAPHT-----PLQAQEKDIK 223

Query: 241 KFLDISDP------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + D  D       ++ TY+ MVS +D  +G ++ AL+K G+++N+  +F +DNG  + G
Sbjct: 224 LYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIVDALKKKGIMDNTFFIFFSDNG--TAG 281

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +  +  S+ PLRG K   WDGG    A + W    KQ + +SS++    D +PTL    G
Sbjct: 282 VPGS--SSGPLRGHKFDEWDGGGHAPAVLYWKKAEKQYKNLSSQVTGFVDLVPTLKDLVG 339


>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 474

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 137/381 (35%), Positives = 204/381 (53%), Gaps = 43/381 (11%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
           ++T A K+P+I+ ILADD G+ D  F GSS+  TP++D LA   +  +Q YV  A+C PS
Sbjct: 28  VSTLAQKQPNIVFILADDAGYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPS 87

Query: 81  RSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKW 132
           R+ ++TGKY    G +   +          G+  GLPL +KLLP+YLKE GY T   GKW
Sbjct: 88  RAGILTGKYQQRFGYEENNVPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKW 147

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG-- 190
           H+G   + + PT RGFD+ YG+  G + YY+ +           + ++N Q D    G  
Sbjct: 148 HMG-NADKFHPTKRGFDTFYGFRGGARSYYEFNE----------NNKNNRQEDRLERGFG 196

Query: 191 ------IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
                 +Y TD   EA  + I E N+ +P F+YL+  AVH      P +A  +++ +F +
Sbjct: 197 NFEESKLYLTDALAEATTDFI-EKNQKQPFFVYLSFNAVHT-----PMEARPDDLKQFPN 250

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
           +   +R+T A M   LD + GNV+A L + G+ EN++++F  DNG PS    +N  SN P
Sbjct: 251 LKG-KRKTLAAMTLGLDRACGNVLAKLDELGVRENTLIVFTNDNGGPS---GTNASSNKP 306

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA-AAGIEINDTSL 361
           L G K+   +GG+R        W KQ ++ +   + IS  D LPT    A G E N   +
Sbjct: 307 LGGTKANHLEGGIR--VPFLMSWPKQLKENTVYDYPISTLDLLPTFYQIAGGNEKNINDI 364

Query: 362 DGVNQWDVLTKGAKTKRSEIL 382
           DGVN    ++   + +  E L
Sbjct: 365 DGVNLLPFISGKNEARPHETL 385


>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
          Length = 492

 Score =  209 bits (533), Expect = 2e-51,   Method: Compositional matrix adjust.
 Identities = 141/406 (34%), Positives = 203/406 (50%), Gaps = 68/406 (16%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
            F      KKPH+I +L DDLG++D+ +     I +PNIDALA + L L +HYVQ  CTP
Sbjct: 14  TFSQKVLEKKPHVIFVLVDDLGFDDLGYVNRDVI-SPNIDALAKDALHLKKHYVQPSCTP 72

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SR+A +TG+Y I +GMQ GVI   EP G+PL E LL +  K+ GY T   GKWHLGF+  
Sbjct: 73  SRAAFLTGRYNIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTY 132

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH------SCKATFE--------------PYQ---GL 176
            + P  RGFD  YG++ G QD+Y H      S K ++E              P+    G 
Sbjct: 133 KHCPQNRGFDRFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGY 192

Query: 177 DMRH----NMQVDNKTI-------GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
           D R     N  V+N TI       G YST L+    IN +A+H+ +  +F Y++   VH 
Sbjct: 193 DFREAYPANGDVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVHG 252

Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
                P Q  ++    + + +D        + +++                   S+++F 
Sbjct: 253 -----PLQTVNKFKKHYKNKTDTWTHERILISTKI------------------TSLLVFT 289

Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISD 343
           +DNG    G      SN PLRG K T +DGG++  A + SP L+   K    + LFH+SD
Sbjct: 290 SDNG----GRPKEGASNWPLRGSKGTIYDGGLKSRAFVASPLLQNRMKGQNYNYLFHVSD 345

Query: 344 WLPTLCAAAGIEI---NDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
           W PTL   +G ++       LDG    ++     K  R+++LH +D
Sbjct: 346 WFPTLLTLSGCDVPRKEGRPLDGKIH-ELFVYKPKKARNKMLHYLD 390


>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 441

 Score =  209 bits (533), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 141/395 (35%), Positives = 214/395 (54%), Gaps = 56/395 (14%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F+L CT L         A + P+IIIILADD G +D S +GS Q+ TP+ID++A+NG+ 
Sbjct: 7   LFSLLCTSLL--------ANEPPNIIIILADDAGSSDFSCYGSKQLLTPHIDSIAHNGIK 58

Query: 68  LNQHYV-QALCTPSRSALMTGKYPIHIG----MQHGVILEGEP--WGLPLTEKLLPQYLK 120
             Q Y   ++C+PSR+ L+TG+Y    G    + H      +P   GLP+TE  L   LK
Sbjct: 59  FTQAYTASSVCSPSRAGLLTGRYQQTFGHLANIPHSKHSANDPELLGLPVTEITLADSLK 118

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
           E GY+TH IGKWHLG   + + P  RGFD+ YG+  G + Y           + G ++R 
Sbjct: 119 ELGYSTHCIGKWHLG-EADHFHPNARGFDNFYGFLSGARTY-----------FLGGELRG 166

Query: 181 NMQ--VDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
           +M   + NK        Y+T+++T+ AI +I E  + KP F+YL+H AVH      P  A
Sbjct: 167 DMDRIMRNKEFAEPSSGYTTEVFTQEAIRIIQEE-QDKPFFIYLSHNAVHG-----PMDA 220

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            DE++  + D  +P R+ Y+G++  LD+  G ++ AL+     EN+++ FM+DNG P+  
Sbjct: 221 KDEDIMSY-DFKNPLRKKYSGLMKNLDDQTGLLLQALKDSKQYENTLIFFMSDNGGPT-- 277

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSEL-----FHISDWLPTL 348
              N  SN PLRG K + ++GG R      +P+L Q  +K+S+ L         D   T 
Sbjct: 278 -THNGSSNWPLRGFKGSEFEGGNR------TPFLLQWPEKISAGLSSDKPIIAYDVFATC 330

Query: 349 CAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
             AAG E + D +  G++   V+ K  +T   ++ 
Sbjct: 331 IQAAGGELVTDRTYHGIDLLPVINKPQETNARKLF 365


>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 441

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 142/380 (37%), Positives = 197/380 (51%), Gaps = 27/380 (7%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +FFA+    L + A     A K P+I++I+ADDLG+ DV FHG   IPTP+ID+LA +G 
Sbjct: 10  FFFAILWMGLASAA----DAAKLPNILVIVADDLGYADVGFHGGKDIPTPHIDSLAASGT 65

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
             +  YV    C+P+R+ L+TG+Y    G  H     G   GLPLTE  +   L+ AGYA
Sbjct: 66  RFSSGYVSGPYCSPTRAGLLTGRYQQRFG--HEFNPGGANKGLPLTETTIADRLQAAGYA 123

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  +GKWHLG     + P  RGF   +G+  G   Y+D          Q  D++      
Sbjct: 124 TGLVGKWHLGT-DPKFHPLKRGFGEFFGFLAGHHTYFDK---------QEADIQRG--TT 171

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
             T   Y TD +   A++ I  H ++ P FLYLA  AVH      P +A D+ +  F  I
Sbjct: 172 KVTEPGYLTDAFGREAVSFIERH-QNHPFFLYLAFNAVHT-----PLEADDDRLNAFSSI 225

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GIHSNKGSNHP 304
               RRTYA M+S +D+++G V+A+LR+    EN+++ F++DNG P+  G   N   N P
Sbjct: 226 DATPRRTYAAMLSAMDDAIGKVLASLRETAQDENTLIFFISDNGGPTMPGTTINSSINAP 285

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDG 363
           LRG K T  +GG+R    +  P       V +      D  PT  AAAG+ I  D  LDG
Sbjct: 286 LRGSKRTTLEGGIRVPFVVRWPGHVPAGAVYAHPVIQLDIQPTALAAAGVTIQPDWKLDG 345

Query: 364 VNQWDVLTKGAKTKRSEILH 383
           VN    +T   K    E L+
Sbjct: 346 VNLLPFVTGQQKNIPHETLY 365


>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 470

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 131/370 (35%), Positives = 199/370 (53%), Gaps = 28/370 (7%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           + +   ++C  +   A       KKP++I+I++DD G+ D   +G  +IPTPNIDALA N
Sbjct: 7   KAFLLCISCVTVVAQA-----QQKKPNVIVIVSDDAGYADFGCYGGKEIPTPNIDALAKN 61

Query: 65  GLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI-LEGEPW-----GLPLTEKLLPQ 117
           G +    YV A +C PSR+ ++TG Y    G +H +  L  +P+     G+    K +  
Sbjct: 62  GTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNISELPVKPYTLNDVGMDPKIKTIGD 121

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
            +K  GY T AIGKWH G   + Y P  RGFD  YG+  G + ++ +       P   L 
Sbjct: 122 QMKHNGYRTIAIGKWHQGDLPQ-YFPLKRGFDEFYGFVGGHRSFFGYPGGKA--PSHELA 178

Query: 178 MRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           +  N + V   TIG Y TD++T+ AI+ + E NKSKP F+YLA+ AVH      P  A  
Sbjct: 179 LFDNDKIVPENTIG-YLTDMFTDKAISFVKE-NKSKPFFMYLAYNAVHV-----PMNAKK 231

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           E + +F +I+DP R+ YA M+  LD+ VG ++  L++  + +N++++F+ DNGA +    
Sbjct: 232 ELMDRFPNITDPGRKAYAAMMVSLDDGVGELVKTLKEQNLYDNTLIVFVNDNGAAT---- 287

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIE 355
            N   N  LRG+K + W+GG+R    +  P      KV S++    D +PT   A  G  
Sbjct: 288 GNYADNGKLRGLKGSKWEGGIRVAYIMQYPDQIPAGKVYSKMVSGLDIMPTAVGAGKGEL 347

Query: 356 INDTSLDGVN 365
           I+    DG N
Sbjct: 348 ISGQVTDGKN 357


>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  209 bits (532), Expect = 3e-51,   Method: Compositional matrix adjust.
 Identities = 138/396 (34%), Positives = 209/396 (52%), Gaps = 32/396 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K P+I+ +L DD G+ DV F   + I +PN D LA  GL+LN+HYV   C+PSR++ +TG
Sbjct: 21  KVPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCSPSRASFLTG 79

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++P H   Q   +++    G  L   +LP  LK A YATH +GKWHLGFF   Y P  RG
Sbjct: 80  RWPHH-AHQWNPLMDNM-IGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRYLPINRG 137

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD+  G+  G +D+ +     +      +D   N   D +  G Y    Y +   +V+  
Sbjct: 138 FDTSTGFLGGGEDHMNEKSGCS------IDYWKNNAPDPRN-GTYDAYNYRDDLTDVMNN 190

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR-TYAGMVSRLDESVGN 266
           HN   P+F YL    VH      P QAP E +  +   S  E+R     MVS  D   G+
Sbjct: 191 HNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQAMVSVADNVTGH 245

Query: 267 VIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           ++  L+K GM +N+I++  ADN GAP        GSN+PL+G KST ++GG+R +A +  
Sbjct: 246 LVELLKKKGMWDNTIMVISADNGGAPC------SGSNYPLKGCKSTFFEGGIRSLAFVNG 299

Query: 326 PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
             L ++++  S     HI+DW  T C  AG++ +D+      +DG++ W ++T +  KT+
Sbjct: 300 GLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGENEKTQ 359

Query: 378 RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
             EI+    N ++      AL + + K + G    G
Sbjct: 360 HEEIVLAY-NFNHSHPEQGALIMGNYKLIVGPQGYG 394


>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 472

 Score =  209 bits (532), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 139/373 (37%), Positives = 193/373 (51%), Gaps = 34/373 (9%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F LT   +    F +    KKP+II+ +ADD+G+ D  F G++ I TPN+D LA NG  
Sbjct: 7   LFTLTIIGI---TFGHAQKNKKPNIIVFMADDMGYADTGFTGATDIQTPNLDNLAKNGAF 63

Query: 68  LNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEP--------WGLPLTEKLLPQY 118
             Q Y   A C PSR+AL++G+Y      QH    E  P         G+ + EKL P+ 
Sbjct: 64  FKQGYANHAYCGPSRAALLSGRY------QHRFGFETNPAYDPANPHMGIDVGEKLFPKR 117

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLD 177
           L+EAGY T AIGKWHLG   E + P  RGFD  YG+  G  DY+     K  +E Y    
Sbjct: 118 LQEAGYKTGAIGKWHLGAAAE-FHPLNRGFDYFYGFLGGGHDYFRIDGTKKVWEAYLQPL 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +R N + DN   G  +T L  +AA  V  + NK  P FLY+A+ A H      P QAP E
Sbjct: 177 VR-NKRADNFE-GYLTTALSNDAAQFV--KDNKENPFFLYVAYNAPHM-----PLQAPKE 227

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
           ++A++  I D +RR YA MV  +D  +G VI +L+  G+ EN+++ F++DNG P     S
Sbjct: 228 DIARYAHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDSGIYENTLIFFLSDNGGPQSKGGS 287

Query: 298 NKG----SNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
            KG     N P RG K   +DGG+     A W   +K    V+  +  +      +  A 
Sbjct: 288 TKGWNGSDNQPFRGGKGNLYDGGVHVPFFASWPNKIKPGTVVNYPVNSLDIGRTAVALAG 347

Query: 353 GIEINDTSLDGVN 365
           G      +++GVN
Sbjct: 348 GDATAAPAMEGVN 360


>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 500

 Score =  209 bits (532), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 126/375 (33%), Positives = 201/375 (53%), Gaps = 41/375 (10%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P KPHI+ +L DD G+ D SF   + I TP+   L  NGLILN+HYV   C+PSR++ +T
Sbjct: 25  PDKPHIVFVLVDDWGFADASFRNPA-IKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLT 83

Query: 87  GKYPIHIGMQH----GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           G++P H+   +    G++      G  +   +LP  LK AGY+TH +GKWH G +   Y 
Sbjct: 84  GRFPHHVHQWNPTPPGMV------GTNINMTMLPAKLKTAGYSTHMVGKWHQGLYDPAYL 137

Query: 143 PTFRGFDSHYGYWQGLQDYYDHS--CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           P  RGFD+  G+ Q  + +++ +  C   F      D R+         G Y + +Y + 
Sbjct: 138 PVNRGFDTSSGFLQAGEGHFNQTIGCAVDFWKNHAPDTRN---------GTYDSYIYNKD 188

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSR 259
              + ++H+ SKP+FLYL    VHA     P +AP E +  + ++ +   RR    MVS 
Sbjct: 189 LTTIFSKHDASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPVNSTCATRRVLQAMVSV 243

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
            D   G+V+  L+++ M +N+I++  ADNG         KGSN+PL+G K T ++GG+R 
Sbjct: 244 ADNVTGHVVNLLKENDMWDNTIMVISADNGGADC-----KGSNYPLKGSKHTFFEGGVRV 298

Query: 320 VAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT- 371
           +A      +   ++  S     H++DW  T C  AG++ ND+      +DG++ W ++T 
Sbjct: 299 IAFASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHNDSGKGKFPVDGLDVWPIITG 358

Query: 372 KGAKTKRSEILHNID 386
           +  +T+  EI+   D
Sbjct: 359 ENEETQHEEIVLGYD 373


>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  209 bits (531), Expect = 5e-51,   Method: Compositional matrix adjust.
 Identities = 134/364 (36%), Positives = 194/364 (53%), Gaps = 26/364 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRS 82
           T   +KP++II  ADDLGW +    G+ QIPTP+ID++A NG+   Q +V A  C+PSR+
Sbjct: 35  TQTSRKPNVIIFYADDLGWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRA 94

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L+TG+YP   G +   I      GL L E  L   L   GY T  +GKWHLG   E Y 
Sbjct: 95  GLLTGRYPTRFGHEFNRIANVS--GLDLQETTLADRLHGLGYKTACVGKWHLGDGPE-YR 151

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR-HNMQVDNKTIGIYSTDLYTEAA 201
           PT RGFD  +G           +    F P + +D R  N   +      Y+TD Y + +
Sbjct: 152 PTKRGFDEFFG---------TLANTPFFHPTKFVDSRVSNDVAEVSDENFYTTDEYAKRS 202

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +  I +  +S P FLYL   A HA     P QAP + + +F  I+DP+R+ +A M+S +D
Sbjct: 203 VEWIGQQQQS-PWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMD 256

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +++G V+  +R+ G  EN++V F++DNG P+ G  S  G   PLRG K T ++GG R   
Sbjct: 257 DAIGQVLGKVRELGQEENTLVFFISDNGGPTQGTTSQNG---PLRGFKMTTFEGGTRVPF 313

Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRS 379
            + W   L   +   + + ++ D LPT+  AAG +I+    LDGV+     T     K  
Sbjct: 314 LVQWKGKLPAGKTYDNPVINL-DVLPTVLTAAGSKIDPAWKLDGVDLVPYFTSSIANKPH 372

Query: 380 EILH 383
           E L+
Sbjct: 373 ETLY 376


>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 137/404 (33%), Positives = 207/404 (51%), Gaps = 29/404 (7%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ 74
           ++F    LN     +P++I+I+ADDLG+NDV F+G ++IPTP ID++A NG+     Y  
Sbjct: 6   IIFILISLNAICASRPNLIVIMADDLGYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTS 65

Query: 75  -ALCTPSRSALMTGKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
            ++C PSR+  +TG+Y    G +      L      LP +E  + + L + GY    IGK
Sbjct: 66  YSVCGPSRAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGK 125

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDY--------YDHSCKATFEPYQGLDMRHNMQ 183
           WHLG       P  RGFD  +G+  G   +        +    K   + Y+    R++  
Sbjct: 126 WHLGA-EPSLRPNKRGFDEFFGHLGGGHRFMPEDLVIQHTEEVKNELDSYRSWITRNDTP 184

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           V       Y T+ +++ A++ I + N  KP FL+L++ A H      P QA ++ +A+F 
Sbjct: 185 VKTTK---YLTEEFSDEAVSFI-KRNHQKPFFLFLSYNAPHL-----PLQATEKYLARFP 235

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
            I DP+R+TYA MVS +D+ V  V+ +L++  + +N+IV F++DNG PS   H NK  N 
Sbjct: 236 HIKDPKRKTYAAMVSAVDDGVSQVMQSLKETNIADNTIVFFLSDNGGPS---HKNKSDNF 292

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
           PL+G KS  W+GG R   A+  P   Q ++V        D   T+ + A    + D  LD
Sbjct: 293 PLKGQKSDVWEGGFRVPFAMQYPAAIQAKQVYDHPVSSLDIFATIASLAQSPTHADKPLD 352

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
           GVN    +T G KT+       I   D  Q  Y  +R  D K V
Sbjct: 353 GVNLIPFIT-GEKTQAPHAQIFIRKFD--QSRY-VVRQGDFKLV 392


>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 494

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 134/375 (35%), Positives = 198/375 (52%), Gaps = 31/375 (8%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
           T  K+P+I+++L DDLG++DV F+GS+ I TP +D LA  G I    YV    C PSR+A
Sbjct: 37  TETKQPNILLVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAA 96

Query: 84  LMTGKYPIHIGMQHGVILEGEPWG--LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TGKYP  IG Q  +   GE  G  +   E+ + + L+E+GY T AIGKWHLG   E +
Sbjct: 97  LLTGKYPHTIGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGATEEFH 156

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----------LDMRHNMQVDNKTIGI 191
            P  RGF   YG+  G  +Y+    +A ++  +           L + HN +   +T   
Sbjct: 157 -PNQRGFTDFYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRDYILPLEHNGKEVKETE-- 213

Query: 192 YSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
           Y TD ++  A   + E  NK KP FLYLA+ A H      P +A +E++ KF  I D +R
Sbjct: 214 YLTDAFSREASRFVKEASNKKKPFFLYLAYNAPHV-----PLEAKEEDLEKFSVIKDKDR 268

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           RTYA MV  +D  +  ++  L++   LEN++++F++DNG    G   +  +N+PL G K 
Sbjct: 269 RTYAAMVYAVDRGIKQIVNTLKETNQLENTLIIFLSDNG----GNTDHGATNNPLHGRKG 324

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEINDTS-LDGVNQW 367
             W+GG R    ++ P  K   K  +  F I+  D  PTL      ++ D   L G N W
Sbjct: 325 DTWEGGYRVPMFMYCP--KNIAKGHTNNFPITSLDLYPTLAHIGKAKLPDNKVLHGKNVW 382

Query: 368 DVLTKGAKTKRSEIL 382
           + +       + E L
Sbjct: 383 ESILNNKNIHKKETL 397


>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
 gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
          Length = 622

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 130/410 (31%), Positives = 212/410 (51%), Gaps = 33/410 (8%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           FA+    L  ++F   T   +P+I+ IL DDLGW DV ++      TP +D L  NG   
Sbjct: 8   FAIVAVFLKRESFAQETEDLRPNIVYILVDDLGWADVGWNNKGLESTPFMDKLVKNGTQF 67

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
            Q Y    C+PSR+  +TG+Y    GM    I    P+G+   +K LP+YLKE GY THA
Sbjct: 68  TQMYSSHRCSPSRAMALTGRYAFRSGMGSFPIAREVPFGMNTQDKTLPEYLKEVGYDTHA 127

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL--DMRHNMQVDN 186
           +GKWHLG     Y PT RGFD+ YG++ G  DY  H  K +   Y     +     ++D 
Sbjct: 128 VGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVDYRGHFIKRSKNFYHDFFDNTIEQHKLDL 187

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
           ++ G ++TDL+ +  I+++ E  +SK P ++YLA  A H     EP +AP + +A+ L+ 
Sbjct: 188 ESDGQWTTDLFRDRTIDILKEAKRSKTPAYVYLAFNAPH-----EPTRAPADLIARILE- 241

Query: 246 SDPE----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--SFGIHSN- 298
             P     R  +   V+++D +V  +    +K    E ++++FM+DNG     FG   + 
Sbjct: 242 KHPNLPYTRAEHLAAVNQIDTAVRQIYQQAKKDVKRE-TLIVFMSDNGGAVKKFGGGRSP 300

Query: 299 --------------KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
                         +  N+P +G K++ ++GG+   + + S   K ++   +  FHI DW
Sbjct: 301 DNTGHEYDDEESLPRACNYPFKGFKNSLFEGGILSPSFLISTARKFSRTRFNFPFHIIDW 360

Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVL-TKGAKTKRSEILHNIDNVDNPQK 393
            PT    AGI + D   DG+++   L ++  K++R+  ++ + +  N +K
Sbjct: 361 APTFLDFAGI-VPDADFDGISRKSELESRWIKSQRTNFVYGVLDFYNEKK 409


>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
          Length = 504

 Score =  206 bits (524), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 133/402 (33%), Positives = 212/402 (52%), Gaps = 44/402 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY------------VQA 75
           K P+I+ +L DD G+ DV F   + I +PN D LA  GL+LN+HY            V  
Sbjct: 22  KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYLAKTGLVLNRHYVFK 80

Query: 76  LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
            C+PSR++ +TG++P H    +  + + +  G  L   ++P  LK A Y+TH +GKWH G
Sbjct: 81  YCSPSRASFLTGRWPHHAHQWN--LPQSDMAGTNLNMTMIPAKLKAAKYSTHMVGKWHQG 138

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
           FF   Y P  RGFD+  G+  G +D+       T E    +D   N   D +  G Y   
Sbjct: 139 FFDPRYLPINRGFDTSSGFLGGGEDH------MTQEAECAIDYWKNDAPDPRN-GTYDAY 191

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTYA 254
            Y +   +++  HN + P+F+YL    VHA     P QAPDE +  + ++ +   RRTY 
Sbjct: 192 NYGDDLTDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNSTCENRRTYQ 246

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            MVS  D   G ++  L+K GM +N+I++  ADNG  + G     GSN+PL+G K + ++
Sbjct: 247 AMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG-----GSNYPLKGCKGSFFE 301

Query: 315 GGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQW 367
           GG+R +A +    L ++++  S     HI+DW  T C  AG++ +D+      +DG++ W
Sbjct: 302 GGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAGVDPDDSGTGKFPVDGLDVW 361

Query: 368 DVLT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
            ++T +  KT+  +I+   D      ++  A+ + D K + G
Sbjct: 362 PIITGEDEKTQHEDIILGYDFF---SRHTGAIIMGDYKLLVG 400


>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 453

 Score =  206 bits (524), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 132/364 (36%), Positives = 199/364 (54%), Gaps = 30/364 (8%)

Query: 11  LTCTLLFNDAFL---NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           L   LLF+   +      A  KP+I++I+ADDLG+ D+ F G   IPTP ID LA +G+ 
Sbjct: 11  LAAALLFSMTAVVGWAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVT 70

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAG 123
            +  YV    C+P+R+ L+TG+Y    G +          E +GL L E+ LPQ LK+AG
Sbjct: 71  CSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAG 130

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           YAT  +GKWHLGF  + + P  RGFD  +G+  G   Y+  +   + +P     +R   +
Sbjct: 131 YATGIVGKWHLGFAPQ-FQPLERGFDEFFGFLGGAHPYFPDAN--SNDP-----IRRGRE 182

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
              ++   Y TD +   ++  I + NK++P FLYLA  AVH      P  A  E + +F 
Sbjct: 183 AVVESE--YLTDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFK 234

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
            I+D  RRTYAGM++ +D++VG V+  L    + + ++V+F++DNG P     +N  +N 
Sbjct: 235 SIADERRRTYAGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPP----ANGSTNT 290

Query: 304 PLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSL 361
           PL G K+T W+GG+R    + W   L   +K    +  + D LPT L AA  +  +   L
Sbjct: 291 PLNGQKATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQL-DLLPTILTAAGVVADDAAKL 349

Query: 362 DGVN 365
           DGV+
Sbjct: 350 DGVD 353


>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
          Length = 459

 Score =  206 bits (523), Expect = 3e-50,   Method: Compositional matrix adjust.
 Identities = 143/387 (36%), Positives = 204/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           +++GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFAGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 133/356 (37%), Positives = 193/356 (54%), Gaps = 30/356 (8%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           +  TLLF+ + L  T   KP+I+IILADDLG+ DV +HG  +IPTPNID +A  G+  + 
Sbjct: 3   IFITLLFSCSLLWAT--DKPNILIILADDLGYADVGYHGLEEIPTPNIDRIANEGVQFSA 60

Query: 71  HYVQ-ALCTPSRSALMTGKYPIHIGMQHGV-----ILEGEPWGLPLTEKLLPQYLKEAGY 124
            Y   ++C P+R+ALM+G Y   IG + G+     + E    G+P   K L QY +EAGY
Sbjct: 61  GYSNGSICGPTRAALMSGVYQQRIGCE-GICGGRKLNEHVVVGMPREVKTLAQYFQEAGY 119

Query: 125 ATHAIGKWHLG---FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
           AT   GKWHLG    F +   PT RGFD  +G  +G   Y D   +     Y    +R +
Sbjct: 120 ATGLFGKWHLGGERLFDKTLMPTSRGFDEFFGILEGASLYDDTVNRE--RKY----IRQD 173

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
             +D +  G Y TD     A++ I      KP FLYL   AVHA     P QA ++ + +
Sbjct: 174 TVIDYE--GEYFTDAIGREAVSFITRKG-DKPFFLYLPFTAVHA-----PMQASEKYMQR 225

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           F  I+DP RR +A M+S +D+++G V  AL   G+L+N++++F +DNG       +N   
Sbjct: 226 FAHIADPNRRVFAAMLSAMDDNIGRVFDALEHQGILDNTLIVFWSDNGGKP---DNNYSL 282

Query: 302 NHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
           NHPL+G K+  ++GG+R  A + W        K   +   + D  P+   AA I +
Sbjct: 283 NHPLKGQKTQFYEGGIRVPACVRWPKGQIPAGKTLDQPVFLMDIFPSALEAAQITV 338


>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 138/400 (34%), Positives = 204/400 (51%), Gaps = 34/400 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+ + +L DD G+ DVSF   + I +PN + LA  GLIL++HYV   C+PSR++ +TG+
Sbjct: 21  KPNFVFVLVDDWGYADVSFRNPA-IHSPNFEKLAKEGLILDRHYVFKYCSPSRASFLTGR 79

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +P H    +    E    G  L   ++P  LK A Y TH IGKWH G ++E Y P  RGF
Sbjct: 80  WPHHAHQWNPP--EDALVGANLKMTMIPAKLKLARYKTHMIGKWHEGLYKEAYLPINRGF 137

Query: 149 DSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           D+  G+  G +++ +    C   F    G D R+         G Y    Y +   ++I 
Sbjct: 138 DTMSGFLGGGENHMNQQVGCATDFWKNDGPDSRN---------GSYDAYTYRDDLTDIIT 188

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVG 265
            HN S P FLYL    VH      P QAPDE +  +   S  + RR Y  MVS  D   G
Sbjct: 189 NHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAMVSVADNVTG 243

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
           +++  L+ +GM +N++++  ADNG    G     GSN+PL+G K T ++GG+R +A +  
Sbjct: 244 HLVKLLQDNGMWDNTLLVISADNGGAPCG-----GSNYPLKGSKGTFYEGGVRSLAFVNG 298

Query: 326 PWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-KGAKTK 377
             L   +K  S     HI+DW  T C  AGI+ +D+      +DG+N W ++  +  KT 
Sbjct: 299 GLLPTNRKGKSTQGFIHIADWYTTFCQLAGIDHDDSGSGKFPVDGMNVWPIINGENEKTL 358

Query: 378 RSEILHNID-NVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             +I    D +  N +    AL V + K + G  +    D
Sbjct: 359 HEQIPLGYDFDYGNLRPNQGALIVGNNKLIVGPQDTKSCD 398


>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 512

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 157/554 (28%), Positives = 251/554 (45%), Gaps = 80/554 (14%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPH+I +LADD GWN+  ++    + TP+ DAL   G+ L  HYV   C+P+RS+ ++G+
Sbjct: 20  KPHLIFLLADDEGWNNGFYN--EDVLTPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGR 77

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
            PIH+  ++    E    G+PL    + + ++ AGY T  +GKWH G     +TP  RGF
Sbjct: 78  LPIHVNQENSAT-EQAGAGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGF 136

Query: 149 DSHYGYWQGLQDYYDH--SCKATFEP-----YQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           DS  G++   +D+Y      +A  E        G+D+  +        G Y    +T AA
Sbjct: 137 DSSIGFFNFGEDHYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAA 196

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
           +  + E N +KP+FL+ A   +H      P + P E VA++ +  +  R T  GM + LD
Sbjct: 197 VAAV-EANDTKPLFLFAAFQNLH-----PPLEVPPEYVARYAN--ETLRTTINGMAAFLD 248

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           +SVGNV  AL+K  M ++S++++  DNG    G  +  G + P RG K + + GG+R VA
Sbjct: 249 DSVGNVTKALKKQNMYDSSLIVYTPDNG----GYLTQGGDDAPFRGGKFSDFQGGVR-VA 303

Query: 322 AIWSPWLKQT---QKVSSELFHISDWLPTLCAAAGIEINDTSL------DGVNQWDVLTK 372
           A  S  L         S+ L H+ DW+ T  A  G  I   ++      D ++ W  L++
Sbjct: 304 AFISGGLVPAGLRGTASAALVHVCDWMATFAALGGGAIPADAVAGLPAPDSLDVWPALSR 363

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
           G  + R+E+  +I   D   + +  +  D + Y  G +                      
Sbjct: 364 GRPSPRTEVPLSILPADGAARAF--VDFDGVGYFVGGE---------------------- 399

Query: 433 EVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDN 492
                  G+    +K  L  +     D   N++ A               +A  R     
Sbjct: 400 -------GLIVGDMKLLLGYQHIGPFDKVTNATCAGVA----------GPWAAGR----- 437

Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP--GNKPFDK 550
            G  C+     CL+N+T DP E ++LA ++   +  L  +L   ++T+  P  G K    
Sbjct: 438 PGVPCDCATSGCLYNVTADPTESDDLAAARPGAVAALRTRLEALRATVYAPDRGAKDDAA 497

Query: 551 RADPARWNNIWVPW 564
            A  A     W PW
Sbjct: 498 CAQIATNGGFWGPW 511


>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 389

 Score =  205 bits (522), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 132/377 (35%), Positives = 203/377 (53%), Gaps = 40/377 (10%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN-GLILNQHYVQALCT 78
           A    T   KP+++ +L DD G+ DVSF   + I +P+ ++LA   GLIL++HYV   C+
Sbjct: 13  AIAAATVNAKPNLVFVLVDDWGFADVSFRNPA-ISSPHFESLATKEGLILDRHYVFKYCS 71

Query: 79  PSRSALMTGKYPIHIGM----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
           PSR++ +TG++P H       Q G++      G  +   ++P  LK AGY TH +GKWH 
Sbjct: 72  PSRASFLTGRFPHHAHQWNPPQSGLV------GANINMTMIPAKLKTAGYKTHMVGKWHE 125

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
           GF+ + + P  RGFD+  G+  G +D+       T E    +D   N  +D +  G Y  
Sbjct: 126 GFYLKKFLPINRGFDTMSGFLGGGEDH------MTQEIICAVDYWKNDAIDTRN-GTYDD 178

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDISDPERRTY 253
             Y E    ++  H+ S P FLYL    VHA     P QAPDE +  + ++ +  +RRT 
Sbjct: 179 YTYVEDLSKILTNHDPSDPFFLYLPLHNVHA-----PLQAPDEFLNLYPVNSTCSDRRTI 233

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN-GAPSFGIHSNKGSNHPLRGMKSTP 312
             MVS  D   G V+  L+  GM +N++++  ADN GAP       +GSN+PL+G K T 
Sbjct: 234 QAMVSVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPC------EGSNYPLKGSKMTF 287

Query: 313 WDGGMRGVAAIWSPWLKQTQ--KVSSELFHISDWLPTLCAAAGIEINDTS-----LDGVN 365
           ++GG+R +A +    L   +  K +    HI+DW  T C  AG++ +D+      +DG+N
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSDDSGAGKFPVDGMN 347

Query: 366 QWDVLT-KGAKTKRSEI 381
            W +LT +  KT+  +I
Sbjct: 348 VWPILTGENEKTQHEQI 364


>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 540

 Score =  205 bits (521), Expect = 6e-50,   Method: Compositional matrix adjust.
 Identities = 137/425 (32%), Positives = 210/425 (49%), Gaps = 47/425 (11%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           F L    L     L     K+P+III L DDLGW D+S  G+  I TP ID+L   GL L
Sbjct: 3   FKLFIVSLVTFCALANAEEKRPNIIIFLVDDLGWADISLRGAP-IDTPAIDSLFSEGLTL 61

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYA 125
           ++ Y   +C+P+R+ALMTG+ P+ +G+ + V++   PW   G+   E  +P+  K AGY 
Sbjct: 62  DRFYTTPICSPTRAALMTGRDPLRLGISYSVVM---PWMNNGVHPDEHFMPESFKAAGYQ 118

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN-MQV 184
           T  +GKWHLG  +E++ P  RGFD  YG+      Y+       F    G+D + N + +
Sbjct: 119 TAMVGKWHLGHSQEIFHPNARGFDDFYGHLHTEVGYF-----LPFANQGGVDFQRNGVTI 173

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
            ++    Y T L  + A   I   +K KP FLY+  +A H+     P +APD+ V K+ +
Sbjct: 174 ADEG---YETFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYEN 225

Query: 245 ISDPE---------------------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           + D                       R  YA +V  LD+++G V+  L K G+ E +I+L
Sbjct: 226 LEDTRELTRSAAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIIL 285

Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHIS 342
           F +DNG  +         N PLRG K   ++GG+R VAA+ W   L   +   S +  + 
Sbjct: 286 FSSDNGGAA--YAGGGADNFPLRGGKGDTYEGGIRVVAAMRWKGTLDAGKSFES-IMTVM 342

Query: 343 DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD 401
           D  PTL +AAG+  +N   + G + W  +  G   K  + ++      N  +++  +  D
Sbjct: 343 DVFPTLASAAGVSMLNTKEIWGRDLWSAIQNGKNIKLDKEIYFASETPNYGEFHTTVFND 402

Query: 402 DLKYV 406
             K V
Sbjct: 403 KWKLV 407


>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 478

 Score =  205 bits (521), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 121/345 (35%), Positives = 185/345 (53%), Gaps = 17/345 (4%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           +P+II+I+ADD+G+ D  F G++ I TPN+D LA +G+I NQ YV  A C PSR+  + G
Sbjct: 21  RPNIIVIMADDMGYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAG 80

Query: 88  KYPIHIGMQHGVILE-GEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +Y    G +H    +   P  G+ + E L P  L++ GY T  IGKWHLG     Y P  
Sbjct: 81  RYQHRFGFEHNTPYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGASSPFY-PLN 139

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           RGFD  YG+  G  DY++       +      +  N +V N     Y T   +  A+  +
Sbjct: 140 RGFDYFYGFLTGGHDYFEIDVTQPVKSAYLQGLFRNKRVAN--FEGYLTTALSRDAVQFV 197

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            + NK  P FL+L++ A H     +P QAP E++A++  I D +RR YA MV  +D  +G
Sbjct: 198 ND-NKENPFFLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRRVYAAMVDVMDRGIG 251

Query: 266 NVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKG---SNHPLRGMKSTPWDGGMR-GV 320
            V+ AL ++G+ EN+++ F++DNG P S   H  KG   SN P RG K+  ++GG+    
Sbjct: 252 EVVTALDRNGLRENTLIFFLSDNGGPVSSKKHPGKGNGSSNGPFRGGKTDFYEGGVHVPF 311

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
            A W   ++  Q     +  +      +  A G  ++  +++GVN
Sbjct: 312 IASWPAKIQPGQVYDKPVISLDIGRTAVELAGGDPLSGNAMEGVN 356


>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
 gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
 gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
          Length = 459

 Score =  204 bits (520), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           ++ GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPSTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 468

 Score =  204 bits (520), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 129/348 (37%), Positives = 194/348 (55%), Gaps = 25/348 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P+I++I+ DD+G++D+  HG   IPTP++DALA +G+     YV    C+P+R+ L+TG
Sbjct: 29  RPNIVVIVGDDMGYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTG 88

Query: 88  KYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +Y    G +   G    GE  GLPL+E  L   LK+ GY T  +GKWHLG   E   P  
Sbjct: 89  RYQQRFGHEFNPGPTPTGE-IGLPLSETTLADRLKKVGYKTGMVGKWHLGN-DEKRHPLS 146

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM-RHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFD  +G+  G + Y+     A+     G  + R    VD K    Y TD +   A+  
Sbjct: 147 RGFDEFFGFLGGARTYFATPGNAS----AGTKLLRGREVVDEKE---YLTDAFAREAVAY 199

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I + +K+ P FLYL   AVH      P +A  + + +F  +SDP+R+ Y  M+S +D++V
Sbjct: 200 I-DRSKASPFFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKRQKYCAMMSAMDDAV 253

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI- 323
           G V+A L +  +LEN+++ F++DNG P+    +N G N PLRG K+T W+GG+R    + 
Sbjct: 254 GQVVAKLEREKLLENTLIFFVSDNGGPT---AANTGDNTPLRGFKATTWEGGIRVPYFVS 310

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
           W   +   +     +  I D++PT  AAAG    + + DGVN    LT
Sbjct: 311 WKGKIPAGKTYDQPVIQI-DFVPTALAAAGAPAAEKT-DGVNLLPYLT 356


>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 459

 Score =  204 bits (520), Expect = 8e-50,   Method: Compositional matrix adjust.
 Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           ++ GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPSTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 459

 Score =  204 bits (520), Expect = 9e-50,   Method: Compositional matrix adjust.
 Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           +++GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I  H+K +P FLY AH   H          P  
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKNHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 136/401 (33%), Positives = 210/401 (52%), Gaps = 50/401 (12%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K P+I+ +L DD G+ DV F   + I +PN D LA  GL+LN+HYV   C PSR++L+TG
Sbjct: 22  KLPNIVFVLVDDWGFADVGFRNPA-ISSPNFDQLAKTGLVLNRHYVFKYCAPSRASLLTG 80

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           ++P H+  Q  +  +    G  L   + P  LK A Y+TH +GKWH GFF   Y P  RG
Sbjct: 81  RWPHHV-YQWNLATDATA-GTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYLPINRG 138

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQG--LDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           FD+  GY  G  D+++          QG  +D   N   D +  G Y +  Y +   N++
Sbjct: 139 FDTSSGYLCGWVDHFNQK--------QGCAVDCWKNNAPDPRN-GTYDSYYYRDDLTNIV 189

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-----SDPERRTYAGMVSRL 260
             H+ + P+F+YL    VH      P +AP E    +LDI     +   R T   MVS  
Sbjct: 190 NNHDANNPLFIYLPLHNVHT-----PLEAPKE----WLDIYPANSTCKTRHTLQAMVSVA 240

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D   G+++  L++  M +N+I++  ADNG           SN+PLRG K T ++GG+R +
Sbjct: 241 DNVTGHLVELLKEKEMWDNTIMVISADNGGDQC-----VASNYPLRGCKYTFFEGGVRSL 295

Query: 321 AAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDTS-----LDGVNQWDVLT-K 372
           A +    L ++++  S     HISDW  T C  AG++ +D+      +DG++ W ++T +
Sbjct: 296 AFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAGVDPDDSGTGKFPVDGLDVWPIITGE 355

Query: 373 GAKTKRSEIL--HNIDNVDNPQKYYAALRVDDLKYVAGTDN 411
             KT+  EI+  +N  +       + A+ VD+ K + G+ N
Sbjct: 356 NEKTQHQEIVLGYNFSS-------HGAIIVDNYKLIVGSQN 389


>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
 gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
          Length = 459

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 143/387 (36%), Positives = 203/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAVNKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           ++ GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 IVDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTKHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|256841082|ref|ZP_05546589.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|256736925|gb|EEU50252.1| arylsulfatase A [Parabacteroides sp. D13]
          Length = 459

 Score =  204 bits (518), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 142/387 (36%), Positives = 204/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           +++GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQSGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVQNKGAHARNFPPTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    +++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 ILDGKEIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W P  K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 128/384 (33%), Positives = 197/384 (51%), Gaps = 48/384 (12%)

Query: 30  PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           PHI+++L DD GW +V +H    + ++ TPNID L   GL LNQHY    C+PSRS+L++
Sbjct: 25  PHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVKEGLELNQHYAFRFCSPSRSSLIS 84

Query: 87  GKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           G+ PIH+   +       P         +P     + + ++ AGYATH +GKW  G    
Sbjct: 85  GRLPIHVNDLNLAPTNYNPNDPVSGFSAIPRNMTGIAEKMRGAGYATHQVGKWDAGMATS 144

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH---SCK-------ATFEPYQGLDMRHNMQVDNKTI 189
            +TP  RGF + +GY+    DYY      CK        T +P  G++       DN   
Sbjct: 145 DHTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIVDLWETDKPAHGIN---GTGPDN--- 198

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
             Y   L+ E  ++V++ H+ S P+FLY A   VHA     P++ PD  + KF  I +  
Sbjct: 199 --YEEGLFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PYEVPDRYLNKFSFIDNHL 251

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS-NHPLRGM 308
           R+ Y  MV+ LD+ VG +  AL+  GM +N + +  +DNG P   +H   G+ N+PL+G 
Sbjct: 252 RQYYHAMVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGP---VHEGTGANNYPLKGG 308

Query: 309 KSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEINDTS------ 360
           K+T W GG+R  A +   +L  K   + +    H++DW  T CA AG++  D        
Sbjct: 309 KTTDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAGVDPTDEKAAKAKL 368

Query: 361 --LDGVNQWDVLTKGAKTK-RSEI 381
             +D  N W +++    T  R+++
Sbjct: 369 PPVDSYNMWPLISGETDTSPRTDV 392



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 23/57 (40%), Positives = 34/57 (59%), Gaps = 2/57 (3%)

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
           HC      CL+NI DDP E+NN+A  + ++LK+++DKL  Y++T   P   P    A
Sbjct: 436 HCGD--DGCLYNIKDDPEERNNIATKEPEMLKKMQDKLKNYQATYFNPDRGPVSPDA 490


>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
 gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
          Length = 458

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 186/337 (55%), Gaps = 32/337 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I+I+LADDLGW DV FHGS +I TP ++ALA  G++L++ Y   + TP+R+ LMTG
Sbjct: 24  ERPNIVIVLADDLGWGDVGFHGS-EIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTG 82

Query: 88  KYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYATHA-IGKWHLGFFREVYTP 143
           +YP   G++  VI    PW   GL   E+ L   L   GY+  A IGKWHLG  R+V+ P
Sbjct: 83  RYPNRFGIRTTVI---PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYP 139

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             RGF   YG+  G  DY+DH  +   + +   +  ++          YST+L T+ A+ 
Sbjct: 140 INRGFSHFYGHLNGAIDYFDHMREGELDWHNDWETCYDKG--------YSTELITQEAVR 191

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD------ISDPERRTYAGMV 257
            I  + K  P  LY+A+ A H      P QA ++++  + D        + +R TY  MV
Sbjct: 192 CINTYEKEGPFLLYVAYNAPHT-----PLQAQEKDIELYCDDFGSLTPKEQKRVTYQAMV 246

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S +D  +G ++ AL+K G+++N+ ++F +DNG P+    S+ G    LRG K   WDGG+
Sbjct: 247 SCMDRGIGTIVDALKKKGIMDNTFLIFFSDNG-PAGVPGSSSGK---LRGRKFDEWDGGV 302

Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           R  A   W       + +SS++    D +PTL    G
Sbjct: 303 RVPAVFYWKRAESNYKNLSSQVTGFVDIVPTLKELVG 339


>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 125/402 (31%), Positives = 207/402 (51%), Gaps = 47/402 (11%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI+ I+ DD G+++      ++  TPN+D LA +GL+L+Q Y    CTPSRS+ ++G+
Sbjct: 40  KPHIVFIMIDDWGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGR 99

Query: 89  YPIHIGMQHGVILEGEPW----------GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            P+H+   H   +    W          G+P     +P ++++AGY TH +GKW  G   
Sbjct: 100 LPLHV--FHSNRVSSASWDSQHPDTAGVGIPRNMTTIPAFMRKAGYKTHMVGKWDAGIAT 157

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHS------------CKATFEPYQGLDMRHNMQVDN 186
             ++P  RGFDS   Y+    +YY ++            C+        +D+ ++ +  +
Sbjct: 158 PQHSPLGRGFDSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPAD 217

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
             IG Y   ++ + A++VI++H+ S P+FLY A    HA     P Q P   + +F  I 
Sbjct: 218 APIGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHA-----PLQVPQAYLDRFQHIP 272

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF----GIHSN---K 299
           DP RRTY  MV   D+ VGN+ AAL+  GM EN++++  +DNG   +    G+ ++    
Sbjct: 273 DPIRRTYQAMVLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGG 332

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEIN 357
            SN PLRG K +P+ GG+R  A +   +L   ++  V + L  ++DW+ T C  AG+   
Sbjct: 333 ASNWPLRGGKLSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAGVACE 392

Query: 358 D--------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP 391
           D         ++D  + W +L+ GA +    + + I    +P
Sbjct: 393 DPIAKKANLPAIDSQSLWSLLS-GANSTSPRVEYPITFDQSP 433


>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
           siliculosus]
          Length = 597

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 144/387 (37%), Positives = 206/387 (53%), Gaps = 54/387 (13%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSS--QIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           ++  +P++I IL DD+G NDV +  +   Q+ TP ++ LA  G+IL+ +Y   +CTPSR+
Sbjct: 69  SSSGQPNVIFILVDDVGTNDVGYESTDLWQL-TPFMNTLAAEGVILDDYYSNEICTPSRA 127

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +LMTG+     GMQ GV+ +   WGLP+ E  L +  + AGY+TH  GKWHLG +     
Sbjct: 128 SLMTGRDSFRTGMQFGVVEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANY 187

Query: 143 PTFRGFDSHYGYWQGLQDYYDH-------------SCKATFEPYQGLDMRHNMQVDN--- 186
           P  RGFD+  GY  G + YY H             +C   F  Y   D   N   +    
Sbjct: 188 PFSRGFDTFLGYTGGGEGYYTHRECVTPEFEGGQYTCYQDFG-YGDKDGYINFTTNTTRK 246

Query: 187 --KTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAP------ 235
             +  G YST + T+ AI V  EH   + S P+FLY+AH AVH     EP  +P      
Sbjct: 247 GPEMTGKYSTTVITDRAIEVAREHVEKSPSDPLFLYVAHQAVH-----EPLDSPPPGSFT 301

Query: 236 DEEVA--KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PS 292
           DEEV   + LD S  ERR  A +V  LD+ V  +  AL   G+L+N++++F +DNGA PS
Sbjct: 302 DEEVELLEGLDYSHRERRALASIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGACPS 361

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS------PWLKQTQKVSSELFHISDWLP 346
            G     GSN+PLRG K T + GG+R    ++S      P   +  + S  + H  DW P
Sbjct: 362 KG-----GSNYPLRGFKHTTFQGGVRVPGFVYSKSADLIPEEARGTRYSG-MMHSMDWTP 415

Query: 347 TLCAAA-GIEINDTS--LDGVNQWDVL 370
           TL +   G+ +++    L G +QWD +
Sbjct: 416 TLGSVVPGLPLDEMGKELSGFDQWDAI 442



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 35/72 (48%), Gaps = 2/72 (2%)

Query: 496 HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA 555
            C+  V   LF+I  DP E  NLAES  +   ++ ++  +     V PG KP D     A
Sbjct: 526 QCDGVVFDFLFHIETDPLETENLAESMPEKYAEMVERFDVATKDEVVPGFKPQDPSGMEA 585

Query: 556 --RWNNIWVPWY 565
             ++NN  V W+
Sbjct: 586 WTKFNNWVVAWH 597


>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 181

 Score =  203 bits (516), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 92/157 (58%), Positives = 119/157 (75%), Gaps = 6/157 (3%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           FF L C L      + T  P  P++I ++ADDLGWNDVSFHGSSQIPTP+IDALA  G+I
Sbjct: 13  FFLLGCML----ECVYTAQP--PNVIFVVADDLGWNDVSFHGSSQIPTPHIDALAQEGVI 66

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L  +YV  +CTP+RSA+MTGK+PIH G+Q+ VI+  EP+GL   E ++PQYL+  GY TH
Sbjct: 67  LTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVIIADEPYGLGTNETIMPQYLRSLGYRTH 126

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH 164
            +GKWHLGFF+E  TP+ RGF+S+YGY+ G+QDY+ H
Sbjct: 127 MVGKWHLGFFKESLTPSHRGFESYYGYYGGMQDYFTH 163


>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 160

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 93/154 (60%), Positives = 117/154 (75%), Gaps = 3/154 (1%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KPHII+I+ADDLGWND+ FHGS +I TPNID LA NG++L+ +YV  +CTPSRSA+MTG
Sbjct: 8   EKPHIILIVADDLGWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTG 67

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +YPIH GMQ   I    P+G+ L EK LPQYLK+ GY T+ +GKWHLGFF + YTPT+RG
Sbjct: 68  RYPIHTGMQQDTIYGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRG 127

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
           FDS+YG + G  DY++HS   TF    GLD+  N
Sbjct: 128 FDSYYGSYLGKGDYWNHSNTETF---TGLDLHDN 158


>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 466

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 129/380 (33%), Positives = 207/380 (54%), Gaps = 26/380 (6%)

Query: 14  TLLFNDAFLNT---TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
             LF    L T   +A ++P+II+IL+DD G+ D   +GS  IPTP+IDALA  G+    
Sbjct: 8   VFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGIRFTN 67

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEAG 123
            YV A +C PSR+ L+ G+Y    G +H V        + +  GL  T + +   ++  G
Sbjct: 68  SYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQMQSNG 127

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y T AIGKWH G     + P  +GF+  +G+  G + ++    +   +  + + +    +
Sbjct: 128 YETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFF--PIRTAIKQEEKI-LNDYTE 183

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           VD K +  Y TD++T+ AI+ + +  + KP F+YL++ AVH      P +A  +++A+F 
Sbjct: 184 VDEKDV-YYLTDMFTDKAISYMRQ-KRDKPYFIYLSYNAVHT-----PVEATPQKLAQFA 236

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
            + +  RR+YAG++S +D+ +G ++  L+K G LEN++++F+ DNGA +    +N   N 
Sbjct: 237 RLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT----NNGADNG 292

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLD 362
           PLRG+K + W+GG+R    +  P   Q  K    L    D +PT  AAA G   +D  LD
Sbjct: 293 PLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVAAAQGTVRSDMQLD 352

Query: 363 GVNQWDVLTKGAKTKRSEIL 382
           GVN    LT   K +  + L
Sbjct: 353 GVNILPFLTGKNKQEPHDFL 372


>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
          Length = 418

 Score =  202 bits (515), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 148/432 (34%), Positives = 206/432 (47%), Gaps = 32/432 (7%)

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           Y PT RGFD+ YGY+   + Y+    K  F    G D   N  +     G Y T L T+ 
Sbjct: 1   YLPTRRGFDTFYGYYNTAKYYFTQ--KLNFYNICGRDYWDNETLVTDKDGTYDTHLITDR 58

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A+++I+ H+  KP+F+Y+A LA H   T     AP   +AKF  I D  R   AG V  L
Sbjct: 59  AVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAVDAL 118

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           DESVG ++AAL +  ML NS+++F +DNG   +G  SN GSN PLRG K T W+GG+R  
Sbjct: 119 DESVGRIVAALHQRRMLANSVLVFTSDNGGIPWGDFSNTGSNWPLRGTKGTMWEGGVRLP 178

Query: 321 AAIWSPWLKQTQK-VSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK- 377
           A +WS  L    + V     H+ DWLPTL +AAG  I D  ++DG++ W VL+    +  
Sbjct: 179 AVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAGGHIGDLGNIDGIDLWPVLSSAKTSSK 238

Query: 378 -----RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEW-YGDTDNEIDKYSP 431
                R+E L N D         +A R  D K VA T    +   + +GD   + +  +P
Sbjct: 239 STSWPRTEFLINYDT----DTGLSAYRDGDYKLVAVTTPKSKDARYPFGDPGLQRNVPTP 294

Query: 432 KEV---LYSKAGI----TFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFA 484
             +      K  I    T  A     +  Q    D +   +++L          E R  A
Sbjct: 295 GGIPPDRCRKGAIKRLNTLMASSLAWRTLQGFYGDGEIGRAESLAP--------EWRVQA 346

Query: 485 RVRCNYDNKGAHCNSTVKPC--LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVP 542
            VRCN   +       ++    LF+I  DPCE NNLA S   ++ +L  KL  Y +  + 
Sbjct: 347 SVRCNPRKETVPGLEQLREGNYLFDIKRDPCELNNLAHSLPSVVAKLSKKLQSYVAITMA 406

Query: 543 PGNKPFDKRADP 554
           P N   D    P
Sbjct: 407 PANITMDPAGLP 418


>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 466

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 131/383 (34%), Positives = 209/383 (54%), Gaps = 30/383 (7%)

Query: 13  CTLLFNDAFLNT---TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
              LF    L T   +A ++P+II+IL+DD G+ D   +GS  IPTP+IDALA  G+   
Sbjct: 7   IVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGGVRFT 66

Query: 70  QHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEA 122
             YV A +C PSR+ L+ G+Y    G +H V        + +  GL  T + +   ++  
Sbjct: 67  NSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQMQSN 126

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           GY T AIGKWH G     + P  +GF+  +G+  G + ++    +   +  + + +    
Sbjct: 127 GYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFF--PIRTAIKQEEKI-LNDYT 182

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           +VD K +  Y TD++T+ AI+ + +  + KP F+YL++ AVH      P +A  +++A+F
Sbjct: 183 EVDEKDV-YYLTDMFTDKAISYMRQ-KRDKPYFIYLSYNAVHT-----PVEATPQKLAQF 235

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
             + D +RR+YAG++S +D+ +G ++  L+K G LEN++++F+ DNGA +    +N   N
Sbjct: 236 AHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT----NNGADN 291

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAA-GIEINDT 359
            PLRG+K + W+GG+R    +   W KQ    K    L    D +PT  AAA G   +D 
Sbjct: 292 GPLRGLKGSKWEGGIR--VPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVAAAQGTVRSDM 349

Query: 360 SLDGVNQWDVLTKGAKTKRSEIL 382
            LDGVN    LT   K +  + L
Sbjct: 350 QLDGVNILPFLTGKNKQEPHDFL 372


>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 465

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 133/376 (35%), Positives = 194/376 (51%), Gaps = 46/376 (12%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
            +T  + F    L   A  KP+I+++ ADD G+ D  F GS  + TPN+D LA +G+   
Sbjct: 3   VITFLICFFGIHLVVNAQNKPNIVLLFADDAGYMDFGFQGSKVMKTPNLDKLAKSGVTFT 62

Query: 70  QHYVQ-ALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKLLPQYL 119
           Q YV    C PSR+ +MTGKY    G         M     L+G+  GLPL +KL+  YL
Sbjct: 63  QGYVSDPTCGPSRAGMMTGKYQARFGYEEINVPGYMSSHSALKGDEMGLPLDQKLMSNYL 122

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY-------DHSCKATFEP 172
           K+ GY T   GKWHLG   + + P  RGFD  YG+  G + Y+       D   +  F  
Sbjct: 123 KDLGYKTAVYGKWHLG-NADRFHPLNRGFDEFYGFRGGARSYFAYKNPTGDRKMETNFGQ 181

Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
           Y+  +  H           Y+TD++ E A++ I E NK  P F+YL+  AVH      P 
Sbjct: 182 YE--EPNH-----------YATDVFAEKAVHFI-ERNKEHPFFIYLSFNAVHT-----PM 222

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
           +A + ++A+F +++  +R+  A M   LD + G V+  L++ G+  N+IV+F  DNG PS
Sbjct: 223 EATEADLAQFPNLTG-KRQQLAAMTLALDRACGKVLNKLKELGIDRNTIVVFSNDNGGPS 281

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA 350
                N  +N PL G KS   +GG+R        W K+ +K S   + +   D LPT  A
Sbjct: 282 ---DKNASNNKPLSGTKSNHLEGGIR--VPFLMSWPKEIKKKSRFNYPVVTFDLLPTFYA 336

Query: 351 AAGIEIND-TSLDGVN 365
           AAG ++ D   ++GVN
Sbjct: 337 AAGGDLKDLKDINGVN 352


>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
          Length = 540

 Score =  202 bits (514), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 210/390 (53%), Gaps = 43/390 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PHII  L DDLGWN  SFH +++I TP +  L  NG+ L  HY    C+P+R++ +TG
Sbjct: 28  KQPHIIFFLTDDLGWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTG 86

Query: 88  KYPIHIGMQHGVILE-GEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           ++P    M +  +L      GL L+   L   LK+A Y+TH IGKWHLG +++ YTP +R
Sbjct: 87  RFPYKHEMTNTNLLPPTRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYR 146

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GFD+ +G+  G +++Y    +  F     +D+           G ++  ++T+AA+++I 
Sbjct: 147 GFDTTFGFLTGGENHY---TQRAFVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIR 203

Query: 207 EHNK-------SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---RRTYAGM 256
            + +       + P+F+Y A   VHA     P Q+P   +A+  + +  +   + T+ GM
Sbjct: 204 NNAQLRNATGDAPPLFIYFALHDVHA-----PTQSP-VRLAQLYNFTGDDAGLKSTFYGM 257

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---------------GS 301
           +S +DE+V NV  AL+  GM  ++++++ +DNG+P+ G  +                 GS
Sbjct: 258 ISGVDEAVRNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGS 317

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWL--KQTQKVSSELFHISDWLPTLCAAAGIEIND- 358
           N+PLRG K T ++GG+R    +  P L      K    L H+ DW  TLC+ AG++  D 
Sbjct: 318 NYPLRGSKHTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAGVDPADP 377

Query: 359 ---TSLDGVNQWDVLTKG-AKTKRSEILHN 384
                LD ++ W  ++   +++ R  I+H+
Sbjct: 378 GGIAPLDAIDMWPYISGAVSESPRDFIVHD 407


>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
          Length = 534

 Score =  201 bits (512), Expect = 6e-49,   Method: Compositional matrix adjust.
 Identities = 135/399 (33%), Positives = 187/399 (46%), Gaps = 56/399 (14%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSR 81
           L   A  KPH++ +L DD+G+ DV +  S  + +P +D LA  GL L +HY    C PSR
Sbjct: 19  LARAADTKPHLLFVLGDDVGFGDVGYS-SPDVISPTLDRLAAEGLKLGRHYAYMWCAPSR 77

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKE-AGYATHAIGKWHLGFFREV 140
           +AL+TG YP   G+       G    LPL   LLP  L++ AGY T A+GKWHLGF  E 
Sbjct: 78  AALLTGYYPSTTGVYS---TSGAQNALPLEFALLPGLLRDRAGYRTAAVGKWHLGFMSEA 134

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDH------SCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
             P  RGFD  +G+  G +D+Y         C   F+ +     R     D    G YS 
Sbjct: 135 DLPERRGFDGFFGFLDGGEDHYSRVGAGAPGCDRVFDLWDS-RRRGPATDDPSAFGRYSA 193

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHA----------------------------- 225
           +LY EAA + I  H+ ++P+FLY A    H+                             
Sbjct: 194 ELYGEAAADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKGFAC 253

Query: 226 --GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
                  P++AP+ +          +R   A MV+ LD +V N+  AL    M  + +++
Sbjct: 254 ARPGADAPYRAPNHDCGC-------DRLVVAAMVTALDAAVANMTTALASKNM-ADFVLV 305

Query: 284 FMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
           +  DNG P    H N G    LRG K T ++GG+R  A + SP L +    +  L H+ D
Sbjct: 306 YSGDNGGPEIEGHWNGG----LRGGKWTWFEGGVRPAAFVHSPLLTRRGWHNGTL-HLVD 360

Query: 344 WLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           WLPT  A AG   +D + DG +QW  L  GA   R   L
Sbjct: 361 WLPTFLALAGAARDDDAFDGADQWASLAFGAAPAREATL 399


>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
          Length = 913

 Score =  201 bits (512), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 129/389 (33%), Positives = 205/389 (52%), Gaps = 38/389 (9%)

Query: 18  NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
           N A L + A  KPHI+  L DDLGW+DV +H  S + TP + ALA +G++L++ Y    C
Sbjct: 399 NAAPLPSDAAPKPHIVYFLVDDLGWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFC 457

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +PSRS+L++G+YP+H+  Q+ +  +    G+ +    + + LK AGYATH +GKWH G  
Sbjct: 458 SPSRSSLLSGRYPMHVN-QYNMAGDALGGGVHVNMTTIAKKLKGAGYATHQLGKWHAGQS 516

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI----GIYS 193
                P  RGFD+  GY  G +D++  +      P  G+    ++   +       G Y 
Sbjct: 517 SADLVPAARGFDTSLGYLNGAEDHWTQA-----RPACGVGNFVDLYATDGPAFGKNGTYG 571

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---- 249
             +Y +AA+++IA+H+ S P+F+Y A    HA     P Q PD   A+     DP+    
Sbjct: 572 AQIYHDAALDIIADHDASVPLFVYFAFQINHA-----PMQVPD-AYARVYPCDDPKTCGV 625

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R TY  M +  D+ V N  AAL+  GM ++++++  +DNG PS G  ++  +N PLRG K
Sbjct: 626 RSTYQAMTAMADDVVRNATAALKAKGMWDDTLIVLTSDNGGPS-GTDADSSNNFPLRGGK 684

Query: 310 STPWDGGMRGVAAIWSPWL-----KQTQKVSSELFHISDWLPTLCAAAGIEINDT----- 359
            +  +GG+R    +   +L      +T   +    HI DW  T    AG++  D      
Sbjct: 685 YSDLEGGVRAAGLVAGGFLPDAARGRTLDGAGAYVHICDWYATFARLAGVDAADDASDAD 744

Query: 360 -----SLDGVNQWDVLTKGA-KTKRSEIL 382
                ++D ++ WD L+  A  + R+EI+
Sbjct: 745 GRPLPAVDSLDVWDFLSGAADASPRTEIV 773


>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 489

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 136/409 (33%), Positives = 206/409 (50%), Gaps = 53/409 (12%)

Query: 7   YFFALTCTLLFNDAFLNTTAP---KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           Y FA+    L   A ++       + PHI+ I+ADDLGWNDV FHGS  I TPNID LA 
Sbjct: 20  YAFAILGCFLGAGAKVSAETDVEMQPPHIVHIIADDLGWNDVGFHGS-DIRTPNIDRLAR 78

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
             ++L++ YV  +C+P+R+ ++TG YP   G+  GV+   +  GLP   +  P++L + G
Sbjct: 79  ESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPSELETTPEHLAKLG 138

Query: 124 YATHAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           Y   A+ GKWHLG    ++ P   G    YG++ G  DY+        + ++  D  H  
Sbjct: 139 YDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAIDYFSRERFGQLDWHRNHDSVHEE 198

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-------- 234
                    YST+L   A ++ I  H   +P++ Y+A  A H+     P QA        
Sbjct: 199 G--------YSTELVGNAVVDFIDRHAGQQPLYAYVAFNAPHS-----PLQALRSDLDDY 245

Query: 235 -----------PDEEVAKFLDI--------SDPERRTYAGMVSRLDESVGNVIAALRKHG 275
                       D ++AK  +          +  R+T+A M + +D  +G ++ A+ ++G
Sbjct: 246 GFDPTGKLAPNTDRKIAKRENALEYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNG 305

Query: 276 MLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
           M EN++V+F +DNGA P  G     GSN PLRG K T W+GG+R VA +  P        
Sbjct: 306 MRENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVATMRWPREFPAGVT 360

Query: 335 SSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT-KGAKTKRSEIL 382
              +    D LPT+ AAAG +  + + DGVN    L+ K A  +R+ +L
Sbjct: 361 YESVASYVDLLPTMVAAAGGQPPERT-DGVNLLPFLSGKAAPPERTMLL 408


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 133/372 (35%), Positives = 203/372 (54%), Gaps = 34/372 (9%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           YFF L    L +      +   +P+I++I+ADDLG+ D+S +G+  I TPN+D LA  G+
Sbjct: 5   YFFLLMSAWLLSCGGEKESLSMRPNIVLIMADDLGYGDLSCYGNEYINTPNLDLLASEGV 64

Query: 67  IL-NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEG-EPWGLPLTEKLLPQYLKEAGY 124
           +  + H   ++C+P+R+ALMTGKY    G++  V  +     GL L E  L + LK+ GY
Sbjct: 65  LFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVVTAKSHRDVGLALAEVTLAEELKQLGY 124

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
            T   GKWHLG + + + PT +GFD   G+  G  DY+ H  +  +     LD    +++
Sbjct: 125 NTGMFGKWHLG-YDKAFNPTLQGFDEFVGFVSGNVDYHGHIDQEGY-----LDWWDGVKI 178

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHN---KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
            N+    Y+TDL +E  +  I EHN   K  P FLYL H A H+     P+Q   ++V +
Sbjct: 179 KNEK--GYTTDLISEYGVKFIQEHNPEVKRAPFFLYLPHEAPHS-----PYQRRIDKVLR 231

Query: 242 FLD-------ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            +        I D     Y  MV  +DE VG ++ +L++ G  EN+IV+F++DNGA  + 
Sbjct: 232 EIGTAGTQEVIQDSISSIYKEMVEVMDEGVGRIMQSLKETGQYENTIVIFISDNGANHY- 290

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAG 353
                G N  LRG K+ P++GG R V AI+S P   +   V+++     D LPTL    G
Sbjct: 291 -----GDNGGLRGFKAGPYEGGSR-VPAIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIG 344

Query: 354 IEINDTSLDGVN 365
            + + TS+DG++
Sbjct: 345 RKPS-TSVDGIS 355


>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
 gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
          Length = 545

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 122/387 (31%), Positives = 197/387 (50%), Gaps = 42/387 (10%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           + C +L   A   +   +KP+I+I++ADDLGW DV +HG   I TP++D LA  G+ LN+
Sbjct: 17  MMCVILGVPA---SAQSQKPNILIMVADDLGWADVGYHGGD-IDTPSLDRLAQQGVRLNR 72

Query: 71  HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
            Y   +C+P+R+ALMTG+ PI +G+ +GVI   +  G+   E  +P+  + AGY T  IG
Sbjct: 73  FYTTPICSPTRAALMTGRDPIRLGVTYGVIFPWDNIGVHPDEHFMPETFQAAGYQTAIIG 132

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWHLG  +  Y P  RGF+  YG+      +Y        + +Q    R+ + +D++   
Sbjct: 133 KWHLGHAQMTYHPNNRGFEHFYGHLHTEVGFYPPFSNQGGKDFQ----RNGVSIDDQG-- 186

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD--- 247
            Y T L  +     I E ++ +P  +Y+  +A H      P  AP E   K+ DI     
Sbjct: 187 -YETYLLADEVSRYIRERDRDRPFLVYMPFIAPHT-----PLDAPVELQDKYKDIETDLP 240

Query: 248 --------------------PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMAD 287
                                 R  YA +V  +D+++G V+  L + G+ +N+IVLF +D
Sbjct: 241 MARSRQTDDTRLISRVMLQPSARPMYAAVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSD 300

Query: 288 NGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
           NG  ++       +N PLRG K   ++GG+R  + +  P + +  ++  ++  + D  PT
Sbjct: 301 NGGAAYSY--GGANNAPLRGGKGETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPT 358

Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKG 373
           L  AA +   N+ +LDG + W  L  G
Sbjct: 359 LVDAADVRPGNNFALDGRSMWTALKSG 385


>gi|301311962|ref|ZP_07217884.1| sulfatase family protein [Bacteroides sp. 20_3]
 gi|423339482|ref|ZP_17317223.1| hypothetical protein HMPREF1059_03148 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830064|gb|EFK60712.1| sulfatase family protein [Bacteroides sp. 20_3]
 gi|409230863|gb|EKN23724.1| hypothetical protein HMPREF1059_03148 [Parabacteroides distasonis
           CL09T03C24]
          Length = 459

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 142/387 (36%), Positives = 202/387 (52%), Gaps = 25/387 (6%)

Query: 6   KYFFALTCTLLFNDAFLN--TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           K    LT  +L N   LN  + A  KP+ +II  DD+G+ D+S +G+  I TPNID +A 
Sbjct: 5   KSLSVLTAAVLSNSLSLNAASDAANKPNFVIIFCDDMGYGDLSCYGNPTIRTPNIDRMAC 64

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G+ L Q YV A + TPSR+ALMTG+ P+  G+   +  V+      GL   E  + + L
Sbjct: 65  EGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDRVAVLFPNSKAGLGQDEVTIAKVL 124

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD 177
           ++ GYAT  +GKWHLG F   Y PT  GFD+++G  Y   +    +    A   P   L 
Sbjct: 125 QQNGYATGCVGKWHLGAFSP-YLPTDHGFDTYFGIPYSNDMSPVRNKGAHARNFPPTPL- 182

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +    Q++++      T  YTE A++ I +H+K +P FLY AH   H          P  
Sbjct: 183 ILDGKQIESEPDQGELTRRYTEKAVSFIKDHSK-EPFFLYFAHTFPH---------IPLY 232

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
             A+F   S  +R  Y  +V  +D SVG V+ ALR++G+ EN+ V+F +DNG P    H 
Sbjct: 233 TNARFEGTS--KRGLYGDVVEEIDWSVGEVLKALRENGLDENTFVIFTSDNG-PWLTEHE 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
           N GS  PL+  K T W+GG R  A  W    K    ++ E+    D  PT  + AGIE  
Sbjct: 290 NGGSAGPLKDGKGTWWEGGFRVPAICWM-LGKINPAINDEIMTSMDLYPTFLSMAGIEQP 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D  LDGVNQ  +L +   + R E+ +
Sbjct: 349 KDLVLDGVNQTGLLFEEKHSARDEVYY 375


>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 465

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 144/433 (33%), Positives = 217/433 (50%), Gaps = 41/433 (9%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           + C L    A         P++I+ILADDLG+ D+   GS QI TP++D LA  G+  ++
Sbjct: 10  ILCGLFCGTAVAQPRLSSPPNMIVILADDLGYGDLGCTGSKQIKTPSLDRLAREGVFCSR 69

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGM------QHGVILEGEPWGLPLTEKLLPQYLKEAG 123
            YV A +C+PSR  L+TG++P   G+      Q   + E   +GLP TEKL+P+YL   G
Sbjct: 70  AYVTAPMCSPSRMGLLTGRFPKRYGITTNPNIQMDYLPESH-YGLPQTEKLIPEYLAPCG 128

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y +   GKWHLG  +  YTP  RGF   +G+  G + Y+    +A     +GL+   +M 
Sbjct: 129 YRSAVFGKWHLGHTKG-YTPPERGFTHWWGFLGGSRHYFPVKKEA-----EGLN--PSMI 180

Query: 184 VDN---KTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           V N   KT   Y TD  T+ A+  + E  K  KP F+++++ A H  N  +P     E++
Sbjct: 181 VSNFTDKTDITYLTDDITDRAVEFLQEAGKDKKPFFMFVSYNAPHWPNEAKP-----EDI 235

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
           AKF ++ + ERR Y  MV  +D  +G ++ AL+  G+ +++IV+F++DNG    G     
Sbjct: 236 AKFRNVQNGERRVYCAMVYAMDRGIGRILDALKADGLEKDTIVVFLSDNG----GAPEAS 291

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEIN 357
             N P RG K   ++GG+R    I  P  K+     V  +     D LP L  A G  I 
Sbjct: 292 SCNAPFRGAKRQHFEGGVRVPFIIRYPADKRLVPGSVCRQPVSSVDLLPALLKANGRHI- 350

Query: 358 DTSLDGVNQWDVL-TKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
              LDG++  +++  KGA   R+              Y +A+   D+KY+   D   Q  
Sbjct: 351 PRKLDGMDILELVGNKGAPVPRTFFWCT--------DYTSAVLTGDMKYLLVPDRAPQFY 402

Query: 417 EWYGDTDNEIDKY 429
               D   + D Y
Sbjct: 403 NVADDPQEQRDLY 415


>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 472

 Score =  200 bits (509), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 136/412 (33%), Positives = 205/412 (49%), Gaps = 52/412 (12%)

Query: 9   FALTCTLLFNDAFLNTTA---PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           FA+     F  A ++  A    + PHI+ I+ADDLGWNDV FHGS +I TPNID LA   
Sbjct: 5   FAIVACFAFVGANVSAEANGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLANES 63

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           + L++ YV  +C+P+R+ ++TG YP   G   GV+   +  GLP   +  P++L + GY 
Sbjct: 64  VTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVVSPTKKHGLPPQLETTPEHLSKLGYD 123

Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
             A+ GKWHLG    ++ P   G    YG++ G  DY+        + ++  D  H    
Sbjct: 124 HRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAIDYFSRERFGQLDWHRNFDSVHEEG- 182

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
                  YST+L   A ++ I  +  + P++ Y+A  A H+     P QA          
Sbjct: 183 -------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGF 230

Query: 235 -PDEEVAKFLD--ISDPE--------------RRTYAGMVSRLDESVGNVIAALRKHGML 277
            P+ ++A   D  I+  E              R+T+A M + +D  +G ++ A+ ++GM 
Sbjct: 231 DPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 290

Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           EN++V+F +DNGA P  G     GSN PLRG K T W+GG+R VA +  P          
Sbjct: 291 ENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 345

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
            +    D LPT+ AAAG    + + DG+N    L+  A      IL + + V
Sbjct: 346 SVTSYVDLLPTMVAAAGGPPPEET-DGINLLPFLSGKASPSERTILLDAETV 396


>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 610

 Score =  200 bits (508), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 138/390 (35%), Positives = 206/390 (52%), Gaps = 59/390 (15%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSA 83
           +A  +PH+I IL DD+G ND+ +  +     TP +D+LA +G++L+ +Y   LCTPSR++
Sbjct: 65  SASGQPHVIFILIDDVGTNDMGYRSTDLWELTPFLDSLASSGVVLDNYYTNQLCTPSRAS 124

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           LMTG+     GMQHG++    PWGLP  E  L    K AGY+TH  GKWHLG F +   P
Sbjct: 125 LMTGRDSFRTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYP 184

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEP-YQG-----------------LDMRHNMQVD 185
             RGFD+  GY  G + YY+HS    F P ++G                 +D   N   +
Sbjct: 185 FSRGFDTFLGYTGGGEGYYNHS--TCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKE 242

Query: 186 NKT-IGIYSTDLYTEAAINVIAEHNKSK----PMFLYLAHLAVHAGNTYEPFQAP----- 235
               +  YST + T+ AI+V  EH  +     P+FLY+A+ A H      P  AP     
Sbjct: 243 GPAMVDNYSTTIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTF 297

Query: 236 ----DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
               +E +A+ +   +P+R  +A ++  LD+ V  +  AL   G+L+N++++  +DNGA 
Sbjct: 298 TEEEEETLARVVS-DEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGAC 356

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--------LFHISD 343
           S    S+ GSNHPLRG K+T + GG+R  A ++S   K T+ +  E        + H +D
Sbjct: 357 S----SSGGSNHPLRGFKNTTFQGGVRVPAFVYS---KSTELIPEEARGTTYSGMMHSTD 409

Query: 344 WLPTLC-AAAGIEINDTS--LDGVNQWDVL 370
           W  TL     G+ +      L G +QWD +
Sbjct: 410 WTQTLGRVVPGLPLAGMGKELSGFDQWDAI 439


>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
 gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
          Length = 377

 Score =  199 bits (507), Expect = 2e-48,   Method: Compositional matrix adjust.
 Identities = 105/224 (46%), Positives = 141/224 (62%), Gaps = 13/224 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KPHI++I+ADDLGW+DVS++ +  + TPN+  LA  G+I NQ Y Q  C+PSR+AL+TGK
Sbjct: 24  KPHILLIVADDLGWSDVSWN-NPYVVTPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGK 82

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +P  +GMQ  V+   +P GLPL E+LLPQ LK+ GYATH +GKWHLG  +  YTPT RGF
Sbjct: 83  FPFRLGMQR-VMDSKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGF 141

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           DS YGY  G QDYY H         +GLD        +   G+YST+ +   A N+I++H
Sbjct: 142 DSFYGYHHGSQDYYTHKSA------RGLDFWDGKTSISDQNGVYSTESFATRAENIISQH 195

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           + + P+FLYL   +VH      P Q P   +  F  I D  R++
Sbjct: 196 DPNTPLFLYLPFQSVHT-----PHQVPSSYLQTFSTIQDDNRKS 234



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 56/210 (26%), Positives = 84/210 (40%), Gaps = 72/210 (34%)

Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
           I D  LDG N W  ++ GA + R+E +++ID+  N  +Y  A+R+ D K V G  N    
Sbjct: 235 ILDPGLDGKNMWQSISTGADSPRNEFVYDIDSGAN--RY--AIRIGDYKLVTG--NRAPK 288

Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDE 475
           D+W          Y+P E+   + GI                                  
Sbjct: 289 DDW----------YAPAEM--QQMGI---------------------------------- 302

Query: 476 KILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNL--AESQTDLLKQLEDKL 533
               L E A      D+ G        P LFNI +DP E++NL  A     + + +  +L
Sbjct: 303 ----LTEIA------DDTG--------PFLFNIKEDPLEKSNLYNAAEYASIQRAMRRRL 344

Query: 534 AIYKSTMVPPGNKPFDKRADPARWNNIWVP 563
              K   VP  +   D  ADP+ + N+W+P
Sbjct: 345 NELKPFEVPFPDLADDPAADPSNYGNVWMP 374


>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 327

 Score =  199 bits (507), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 134/374 (35%), Positives = 196/374 (52%), Gaps = 63/374 (16%)

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT-YEPFQAPDEEVAKFLDISDPERRTYA 254
           +YT  AI+ I  HN S+P+FLYLA  AVH+ N+  EP QAP E + KF  I   +RR YA
Sbjct: 1   MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            M++ +D  +G V  AL +  ML+NS+++F  DNG PS G ++N  +N PLRG+K+T ++
Sbjct: 61  AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWANNFPLRGVKATLFE 120

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKG 373
           GG+RGV  ++S  ++++ +VS +L H++DWLPTL + AG  ++D S LDG +QW  L   
Sbjct: 121 GGVRGVGFVYSKLIEKS-RVSHDLIHVTDWLPTLVSLAGGNVSDISFLDGFDQWATLQNK 179

Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
             ++R+EIL NID      +   ALRV   K +     N   D WY              
Sbjct: 180 QSSQRNEILLNIDEHVWKNE---ALRVGSWKLIKEGLKNW--DGWY-------------- 220

Query: 434 VLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNK 493
                                     P  N S        DE +      + ++C +D  
Sbjct: 221 ------------------------PPPSYNES-------MDESL----NSSILKCGHDFP 245

Query: 494 GA--HCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPP-GNKPFDK 550
               HC+     CLF++ +DPCE  +L++   D+L  +  +L IYK +MVPP  N   D 
Sbjct: 246 VVINHCDDY---CLFHVVEDPCEIVDLSKKFPDVLATMMYRLNIYKQSMVPPRNNMTIDS 302

Query: 551 RADPARWNNIWVPW 564
           R+DP   N +W PW
Sbjct: 303 RSDPKLHNGVWQPW 316


>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 653

 Score =  199 bits (506), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 165/590 (27%), Positives = 271/590 (45%), Gaps = 85/590 (14%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTP 79
            F ++   + P++++++ DDLGW DV FH ++   TPN+DA+   G+ L+  Y    CTP
Sbjct: 27  GFGSSATTENPNVLVVVVDDLGWKDVGFHDTT-FSTPNLDAMVAEGVELSTFYTAPTCTP 85

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SR+ LMTG+Y   IGMQ  V+   EP G+PLTE  + + L+ AGY+T A+GKWHLG    
Sbjct: 86  SRAQLMTGRYSYRIGMQDSVLHTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMP 145

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEP---------YQGLDMRHNMQV--DNK 187
            + P  RGFD  YG   G   +Y H S    F P         + G ++    +V  DN+
Sbjct: 146 QFLPVERGFDDFYGILTGGGGHYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQ 205

Query: 188 TIGIYSTDLYTEAAINVIAEHN--KSKPMFLYLAHLAVH----------AGNTYEPFQAP 235
               +ST+LYT+ A   +   +  + +P FLYL++ A+H           G + E   A 
Sbjct: 206 DTRTHSTELYTKKAAEYVEAMSAKEDQPWFLYLSYQAIHDPIETDDSWINGRSCESISAE 265

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
           +  + +  +     RR   GMV+++D  +G +   L   G  +N++V+F +DNG    G+
Sbjct: 266 NSSIGETDETDYDNRRIACGMVAQMDNGLGALRGLLENLGEWDNTVVIFFSDNG----GL 321

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS-------SELFHISDWLPTL 348
            S+   N P RG K   W+GG+   A +   ++    + +         L H++D   T+
Sbjct: 322 PSHGSVNRPFRGGKGDYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATI 381

Query: 349 CAAAGIEIND------TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDD 402
              AG+   D        LDG++ WD L +     R  ++ N+++ +       A+R  +
Sbjct: 382 LGLAGVPGADDGGNGEAELDGIDLWDSLVETKAPARQAVVINLNSPNFASS--GAVRWSN 439

Query: 403 LKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE-VLYSKAGITFNALKTKLQIKQKHAADPK 461
            K +   +                    PKE V+YS+       +++KL   ++     +
Sbjct: 440 YKLIRNPE--------------------PKETVIYSR-------VQSKL--VEEEGVVSE 470

Query: 462 ANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQ-NNLAE 520
           +  + A+ T+     + E + +      +D +     S V+ C  +    P E   NL  
Sbjct: 471 STLASAISTVQAQIHLSEPKMYM-----FDLEANPSESVVEGCGTDRALGPVESCANLYS 525

Query: 521 --SQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR--ADPARWNNIWVPWYD 566
             +  D  K+LE  +       V P  +  D    ADP+ W   WVPW D
Sbjct: 526 IPAFRDARKKLEGIIRKADRESVAPTLRWEDDGPLADPSNWGG-WVPWRD 574


>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 472

 Score =  199 bits (506), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 135/414 (32%), Positives = 205/414 (49%), Gaps = 52/414 (12%)

Query: 7   YFFALTCTLLFNDAFLNTT---APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           + FA+     F  A ++       + PHI+ I+ADDLGWNDV FHGS +I TPNID LA 
Sbjct: 3   FPFAIMTCFAFVGANVSAETNGGSRPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLAS 61

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
             + L++ YV  +C+P+R+ ++TG YP   G+  GV+   +  GLP   +  P++L + G
Sbjct: 62  ESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPPQLETTPEHLSKLG 121

Query: 124 YATHAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           Y   A+ GKWHLG    ++ P   G    YG++ G  DY+        + ++  D  H  
Sbjct: 122 YDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEE 181

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-------- 234
                    YST+L   A ++ I  +  + P++ Y+A  A H+     P QA        
Sbjct: 182 G--------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEY 228

Query: 235 -----------PDEEVA---KFLDI-----SDPERRTYAGMVSRLDESVGNVIAALRKHG 275
                       D ++A   K LD       +  R+T+A M + +D  +G ++ A+ ++G
Sbjct: 229 GFDPTGKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNG 288

Query: 276 MLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKV 334
           M EN++V+F +DNGA P  G     GSN PLRG K T W+GG+R VA +  P        
Sbjct: 289 MRENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIA 343

Query: 335 SSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
              +    D LPT+ AAAG    + + DG+N    L+  A      IL + + V
Sbjct: 344 YDSVTSYVDLLPTMVAAAGGPPPEET-DGINLLPFLSGKASPPERTILLDAETV 396


>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
          Length = 311

 Score =  199 bits (505), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 116/310 (37%), Positives = 176/310 (56%), Gaps = 18/310 (5%)

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           SR + ++G+Y     MQHGVIL+ +P  L L   +LP YLKE GY THA GKWHLG+ R+
Sbjct: 1   SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
             TPT RGFDS  G + G  ++Y+H+  AT   Y   D     +VD   IG +S DL   
Sbjct: 61  ECTPTHRGFDSFSGGFSGEGEFYEHT-TATGGYY---DWHLGTEVDYDAIGKHSEDLIGY 116

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRTYAGMV 257
                + E+++S P+F+Y+A   VH+     P     E +A +  LD++D +R+ Y G+V
Sbjct: 117 YVNKTLDEYDQSSPLFMYVAFHNVHS-----PLDPKPEFLALYDDLDVTD-DRKKYLGLV 170

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
           S +D  +G ++  +++ G+ EN+ +LF +DNG         +G N P RG KST W+GG 
Sbjct: 171 SGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGD-----VGEGDNSPHRGGKSTLWEGGC 225

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
           +  +   SP L  +   ++ L H++DWLPT+   AG  +     LDG  Q D++     +
Sbjct: 226 KANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPASDGLDGFVQTDMVINNGDS 285

Query: 377 KRSEILHNID 386
            R+ +++NID
Sbjct: 286 ARTTMIYNID 295


>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 470

 Score =  198 bits (504), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 131/372 (35%), Positives = 194/372 (52%), Gaps = 30/372 (8%)

Query: 8   FFALTCTLLFNDAFLNTTAPKK------PHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
           F  L   L+F+ +    + P        P+ ++I+ADDLG+ D+SF GS+Q+ TP+ID L
Sbjct: 7   FLFLQSLLIFSLSAQIISTPSTSGRNSHPNFVLIVADDLGYGDLSFTGSTQVKTPHIDEL 66

Query: 62  AYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEP------WGLPLTEKL 114
           A +G+   + YV  A+C+PSR+ L+TG+  +  G  +  +   +P       GLP+  K 
Sbjct: 67  AASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNN-LANSQPGFDPAFLGLPVNVKT 125

Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ 174
           +  +LK+ GY T  +GKWHLG+  + ++P  RGFD  +GY  G  DY++ S     E  +
Sbjct: 126 VGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGYLGGGHDYFEAS-----EAKR 179

Query: 175 GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
           G   +            Y TD   +  IN I  H K +P FLY +  A H      P QA
Sbjct: 180 GYKAKIKCNYKTPQEITYITDDKGDECINFIQRH-KDEPFFLYASFNAPHT-----PMQA 233

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
             E++A +  I D +RRTYA MV RLD +VG ++ A+ +  +LEN+I +F++DNG P   
Sbjct: 234 TAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAVEEADLLENTIFVFISDNGGP--- 290

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
           + +N   N P  G K T  +GG+R    I  P   +  KV        D  PT  A AG 
Sbjct: 291 VMTNGSINAPYNGKKGTLLEGGIRVPFFISWPGHLEPGKVFEHPVSSLDLTPTFVALAGG 350

Query: 355 EINDT-SLDGVN 365
           E  +    DG+N
Sbjct: 351 ESKEEDQFDGLN 362


>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 467

 Score =  198 bits (504), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 153/418 (36%), Positives = 196/418 (46%), Gaps = 51/418 (12%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F L  + L  D     TAP +P+ I ILADDLGW DV FH    +PTPN+D LA  GL 
Sbjct: 24  LFGLVVSSLAAD-----TAPLRPNFIFILADDLGWGDVGFH-HGNVPTPNLDHLAGEGLE 77

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           L QHYV  +C+P+R A ++G+Y      +  V     P         L + LK  GY T 
Sbjct: 78  LMQHYVYPVCSPTRCAFLSGRY----ASRFSVTTPQNPRAFRWDTVTLARALKSVGYDTA 133

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
             GKWHLG   E + P   GFD  YG   G    +DH        Y+  +       D K
Sbjct: 134 LCGKWHLGSKPE-WGPQKFGFDHSYGSLAGGVGPWDHH-------YKIGEFTQTWHRDGK 185

Query: 188 TI--GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LD 244
            I    + TDL T+ A+  + E    KP FLY+   AVH      P + PDE + +    
Sbjct: 186 LIEEQGHVTDLITKEAVEWL-ESRTDKPFFLYVPFTAVHI-----PIREPDEILQRVPAS 239

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK----- 299
           I+ P  R Y   V  LD+SVG ++ AL K G   N++V+F +DNGA   G+ +N      
Sbjct: 240 ITKPSLRHYGANVMHLDDSVGKILVALEKTGKAGNTLVIFGSDNGAIP-GVENNDPLYPP 298

Query: 300 ---------GSNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLC 349
                    GSN PL GMK   ++GG+   A A W   LK  + +   L HI+DW+PT C
Sbjct: 299 DHYPPGPAGGSNEPLHGMKGEVYEGGIHTAAVARWPGQLKPGKFLG--LAHITDWMPTFC 356

Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
           A AG +   D   DG N W  LT     K   I      V  P     ALR  D K V
Sbjct: 357 ALAGYKPEKDLKWDGQNIWPQLTGAEPVKPRTIY-----VAGPGFRSKALRDGDWKLV 409


>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
           anophagefferens]
          Length = 304

 Score =  198 bits (503), Expect = 7e-48,   Method: Compositional matrix adjust.
 Identities = 116/314 (36%), Positives = 175/314 (55%), Gaps = 31/314 (9%)

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
           CTPSR +L++G+YP+ +G+Q GV+     WGL    +LLP+ L+ AGY T  +GKWHLG 
Sbjct: 1   CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH------SCKATFEPYQG---LDMRHNMQVDNK 187
           + E  TP  RGF   YG+  G  D++ H      +C +      G   LD + N     +
Sbjct: 61  YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALDWQVNGLPRPR 120

Query: 188 TIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDE----EVAKF 242
             G+Y+     +AA  ++A H  K +P+FLY A    H     EP   PD+    ++   
Sbjct: 121 ATGVYAAYACRDAARRLVASHEGKEEPLFLYYALPTPH-----EPITPPDDVETYDLLPK 175

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
           LD++   RR YA +V+  D ++G++ AAL+KH +  N++++ ++DNGA + G     GSN
Sbjct: 176 LDVAG-RRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG-----GSN 229

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEINDT- 359
            PLRG K   ++GG+R  A +    L Q  + SS   L H++DW PTL  AAG+  +   
Sbjct: 230 APLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAGLRPDLVA 289

Query: 360 ---SLDGVNQWDVL 370
              ++DGV+QWD L
Sbjct: 290 GFDAMDGVDQWDAL 303


>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 498

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 130/368 (35%), Positives = 192/368 (52%), Gaps = 39/368 (10%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           A   F  L    +F            P+I++ILADD+G+ D+   GS  + TPN+D LA 
Sbjct: 8   AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67

Query: 64  NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
           +G++ +Q YV  A+C+PSR+ L+TG+ P   G +  +    E +       GLP +EK L
Sbjct: 68  SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 127

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
             +L  AGYAT  IGKWHLG   E++ P  RGFD   G   G   Y+  + K   E    
Sbjct: 128 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPTTMKHVIE---- 182

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
              R+  +VD  +   Y TD +T+  +  I +H   N  +P F++ ++ A H      P 
Sbjct: 183 ---RNGKRVDGFS-SEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PM 233

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A + ++A+F +I + +RRTYA M+  LD  VG +   L + G  EN++++F +DNG   
Sbjct: 234 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 290

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
            G  +N   N PLRG+K +  +GG+R V  IW+     P   +   V S L    D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSL----DLLPT 344

Query: 348 LCAAAGIE 355
            C+AAG E
Sbjct: 345 FCSAAGAE 352


>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 472

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 193/363 (53%), Gaps = 23/363 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP++I+I++DD G+ D   +G  QIPTPNIDA+A  G      YV A +C PSR+ ++TG
Sbjct: 28  KPNVIVIVSDDAGYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTG 87

Query: 88  KYPIHIGMQH---GVILEG---EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +Y    G +H    V+  G      G+  +E+ +   ++  GY T AIGKWH G     +
Sbjct: 88  RYQQRFGFEHNTSNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQG-DEPKH 146

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            P  RGF+  YG+  G +D++ +  K T E      + +N ++  +    Y TD++T+ A
Sbjct: 147 FPLNRGFNEFYGFTGGHRDFFAYKGKRTNEH----ALYNNKEIVPENEITYLTDMFTDKA 202

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            + I   NK KP F+YL++ AVH      P  A  + + ++  I+D  RR YA M++ LD
Sbjct: 203 TSFITA-NKDKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMTSLD 256

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           + +G V+A L+ + + +N++++F+ DNG    G   N   N PLRGMK + W+GG+R   
Sbjct: 257 DGIGKVMATLKANQLDKNTLIIFINDNG----GATVNSSDNGPLRGMKGSKWEGGIRVAM 312

Query: 322 AIWSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
            +  P      K  S      D LPT + A  G +     LDGVN    L+ G K    E
Sbjct: 313 MMKWPGHIAANKTDSRPVSSLDILPTAIGAGKGKQKGTKKLDGVNLLPYLSAGNKKTPHE 372

Query: 381 ILH 383
            L+
Sbjct: 373 ALY 375


>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
 gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
          Length = 579

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 133/412 (32%), Positives = 204/412 (49%), Gaps = 51/412 (12%)

Query: 8   FFALTCTLLF--NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           F  +TC  L   N +       + PHI+ I+ADDLGWNDV FHGS +I TPNID LA   
Sbjct: 112 FAIMTCFALVGANVSAETNGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLASES 170

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           + L++ YV  +C+P+R+ ++TG YP   G+  GV+   +  GLP   +  P++L + GY 
Sbjct: 171 VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPSKKHGLPPQLETAPEHLSKLGYD 230

Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
             A+ GKWHLG    ++ P   G    YG++ G  DY+        + ++  D  H    
Sbjct: 231 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEEG- 289

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
                  YST+L   A ++ I  +  + P++ Y+A  A H+     P QA          
Sbjct: 290 -------YSTELVGNAVVDFIDRNANAGPVYAYVAFNAPHS-----PLQALRSDLDEYGF 337

Query: 235 -PDEEVAKFLD--ISDPE--------------RRTYAGMVSRLDESVGNVIAALRKHGML 277
            P+ ++A   D  I+  E              R+T+A M + +D  +G ++ A+ ++GM 
Sbjct: 338 DPNNKLAPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 397

Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           EN++V+F +DNGA P  G     G+N PLRG K T W+GG+R VA +  P          
Sbjct: 398 ENTLVVFHSDNGADPKHG-----GNNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 452

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
            +    D LP++  AAG    + + DG+N    L+  A      IL + + V
Sbjct: 453 SVTSYVDLLPSMVGAAGSPPPEET-DGINLLPFLSGKASPPERTILLDAETV 503


>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
 gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 462

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 134/395 (33%), Positives = 200/395 (50%), Gaps = 42/395 (10%)

Query: 1   MTWARKYF-FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           M W+R+ F F  + T+LF+       A  KP+I+ IL+DD G+ D  FHGS +  TP +D
Sbjct: 1   MKWSRRVFLFFASFTMLFS-----GWAQDKPNIVFILSDDAGYADFGFHGSKEFKTPELD 55

Query: 60  ALAYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPL 110
             A N +  +Q YV  A+C PSR+ L+TGKY    G +   +        L G+  GLPL
Sbjct: 56  KFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENNVPGLMSKNGLTGDDMGLPL 115

Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF 170
            +K +  YLKE GY T   GKWH G   + + PT RGFD  YG+  G + Y         
Sbjct: 116 DQKTIADYLKEQGYRTALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPFGADNEL 174

Query: 171 EPYQGLDMRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
                   R+  +++    G      Y T+   + AI  I E N+  P F+YLA  AVH 
Sbjct: 175 -------TRNEDRLERGFGGFQEHEGYLTEELADEAIAFI-ERNQKNPFFVYLAFNAVHT 226

Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
                P +A +E++ K   +S  +R+T A M   +D + G V+  ++  G+ +N++++F 
Sbjct: 227 -----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDRACGKVLDRIKALGLDKNTLIVFT 280

Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWL 345
            DNG PS    +N+ SN PL G K+   +GG+R    +  P   +   V        D L
Sbjct: 281 NDNGGPS---DANESSNLPLSGTKANHLEGGIRVPFLMSWPGKLKPNTVYENPISTLDLL 337

Query: 346 PTLCAAAGIEINDTSLDGVNQWDVL--TKGAKTKR 378
           PT   AAG ++N  ++DG++  D+L   +G K  R
Sbjct: 338 PTFVTAAGGDVN--AIDGLDGVDLLPYLQGKKQGR 370


>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 461

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 206/381 (54%), Gaps = 35/381 (9%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
           + WA   F  L  +     +     A + P+I+ ILADDLG+ D+S  GS+ + TP+ID+
Sbjct: 11  LLWAVTSFLPLPKS----SSASQPAANRPPNIVFILADDLGYGDLSCFGSTDLKTPHIDS 66

Query: 61  LAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILE--GEPWG-LPLTEKLLP 116
           L   G+     Y   ++C+PSR+AL+TG+YP  +G+  GVI +   + WG L  +  LLP
Sbjct: 67  LIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVP-GVIRDEVQDSWGYLASSATLLP 125

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
            YL++ GY +  IGKWHLG       P  RGF   YG  +G+ D Y    +       G 
Sbjct: 126 TYLRKQGYHSANIGKWHLGL-ESPNLPNERGFQEFYGLLEGMMDDYVVKLR------HGQ 178

Query: 177 D-MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQA 234
           + +RHN QV +     ++TD++T+AA+  + +    K P FLYLA+ A H     +P Q 
Sbjct: 179 NFLRHNGQVIDPPG--HATDVFTDAAVRYLNDRKAKKDPFFLYLAYTAPH-----DPLQP 231

Query: 235 PD---EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
           P      V K     DP+R     ++  LD+++G V+A LR +G L N++V+F +DNG  
Sbjct: 232 PVAYLNRVQKRQPGIDPQRAKLVALIEHLDDNIGRVLATLRANGQLANTLVVFTSDNG-- 289

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
             G    K +N   RG+K   ++GG+R    A+WS  ++   + S  L  + DW PTL  
Sbjct: 290 --GWGPGKANNGQFRGVKGQFYEGGIRIPTGAMWSGHIEAGSE-SGALLQLMDWFPTLLQ 346

Query: 351 AAGIEINDTSLDGVNQWDVLT 371
           AAG  + D  ++G++  +++T
Sbjct: 347 AAGASVPD-GINGLSFLNLMT 366


>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 498

 Score =  196 bits (498), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 129/368 (35%), Positives = 199/368 (54%), Gaps = 41/368 (11%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           +R  F +LT       +    +AP  P+I++ILADD+G+ D+   GS  + TPN+D LA 
Sbjct: 10  SRLVFLSLTVFFATKASRAAESAP--PNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67

Query: 64  NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
           +G++ +Q YV  A+C+PSR+ L+TG+ P   G +  +    E +       GLP++EK L
Sbjct: 68  SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDERYATRPELLGLPVSEKTL 127

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
             +L  AGYAT  IGKWHLG   E++ P  RGFD   G   G   Y+  +     E    
Sbjct: 128 GDHLGAAGYATALIGKWHLG-MGEMHHPNRRGFDHFCGMLTGSHHYFPTTMNHVIE---- 182

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK---PMFLYLAHLAVHAGNTYEPF 232
              R+  +V++ +   Y TD +T+  +  I +H  +K   P F++ ++ A H      P 
Sbjct: 183 ---RNGQRVEDFS-NEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNAPHT-----PM 233

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A + ++A+F +I + +RRTYA M+  LD  VG +   L + G  +N++++F +DNG   
Sbjct: 234 HATEADLARFANIQNKKRRTYAAMMFALDRGVGRIRKHLEETGQWKNTLLVFFSDNG--- 290

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
            G  +N   N PLRG+K +  +GG+R V  IW+     P  K+   V S L    D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGKRYDGVISSL----DLLPT 344

Query: 348 LCAAAGIE 355
            C+AAG +
Sbjct: 345 FCSAAGAQ 352


>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
           anophagefferens]
          Length = 351

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 131/349 (37%), Positives = 186/349 (53%), Gaps = 29/349 (8%)

Query: 28  KKPHIIIILADDLGWNDVSF----HGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSA 83
           + P++I+ LADDLG NDV +     G  +I +P IDALA   L+L++ Y Q +CTP+R+A
Sbjct: 9   RPPNLIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAA 68

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           LMTG++    G+ + V+L  +  GLP  E  + + L++AGY+TH IGKWHLGF ++   P
Sbjct: 69  LMTGRHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLP 128

Query: 144 TFRGFDSHYGYWQGLQDYYDHSC-----------KAT-FEPYQGLDMRHN-------MQV 184
           T RGFD  +GY  G  DY+ H             +AT  EP     M H+       +  
Sbjct: 129 TSRGFDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPR 188

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
             KT  ++S DL+   A    A   + KP+FLY A  A HA     P +      A    
Sbjct: 189 TPKTENVHSADLFAARATETFATAPRDKPLFLYYASQAPHAHFVGAPPRHVAAAAAMGFA 248

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
             DP R   A +V+ LDE VGNV+  +R     EN+++ FM DNG P  G  +   S+ P
Sbjct: 249 AGDP-RADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNG-PEPGTGA---SSWP 303

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           LRG K T ++GG+R  + +++P  +     SS   H+ D  PTL   AG
Sbjct: 304 LRGAKRTLFEGGVREPSFVYAPG-RVAPGASSAPVHVIDVAPTLLGLAG 351


>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 498

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 129/368 (35%), Positives = 192/368 (52%), Gaps = 39/368 (10%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           A   F  L    +F            P+I++ILADD+G+ D+   GS  + TPN+D LA 
Sbjct: 8   AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67

Query: 64  NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
           +G++ +Q YV  A+C+PSR+ L+TG+ P   G +  +    E +       GLP +EK L
Sbjct: 68  SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTL 127

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
             +L  AGYAT  IGKWHLG   E++ P  RGFD   G   G   Y+  +     E    
Sbjct: 128 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPTTMNHVIE---- 182

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
              R+  +V+N +   Y TD +T+  +  I +H   N  +P F++ ++ A H      P 
Sbjct: 183 ---RNGKRVENFS-SEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PM 233

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A + ++A+F +I + +RRTYA M+  LD  VG +   L + G  EN++++F +DNG   
Sbjct: 234 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 290

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
            G  +N   N PLRG+K +  +GG+R V  IW+     P   +   V S L    D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVISSL----DLLPT 344

Query: 348 LCAAAGIE 355
            C+AAG E
Sbjct: 345 FCSAAGAE 352


>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
 gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
          Length = 488

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 125/350 (35%), Positives = 189/350 (54%), Gaps = 21/350 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
           T A  +P+II+ILADDLG+ D+ F GS +I TPNIDALA+ G++ +  YV    C PSR+
Sbjct: 29  TKADSRPNIIVILADDLGYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYCGPSRA 88

Query: 83  ALMTGKYPIHIGMQ-HGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
            L+TG+Y    GM+ +      +P+ GLP+ E    + +++AGY T  IGKWH+G     
Sbjct: 89  GLLTGRYQARFGMEINAAHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMGSHPN- 147

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTE 199
           + P  RGFD  YG+  G  DY+  S K + E Y     R+      N+ +   +T +  E
Sbjct: 148 FHPNNRGFDYFYGFLGGGHDYFPESVKVSNEEYSIPLSRNGKPAQLNEYL---TTAISKE 204

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
           AA   +     S+P  +Y+A+ A H     +P +A  +++AK+  I D  RRTYA MV  
Sbjct: 205 AAEFAM---TTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDINRRTYAAMVDS 256

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGG 316
           +D+ +G ++  L++ G  EN+++ F++DNG     S+  +SN  SN P R  K +  +GG
Sbjct: 257 MDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNSNWASNKPFRRGKVSLTEGG 316

Query: 317 MR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           +     A W        +    L    D   T    AG +++D  LDGVN
Sbjct: 317 VHVPFIAHWPAGFGNKGQFDG-LVSSLDIAATSLGLAGADVHDAKLDGVN 365


>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
 gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 463

 Score =  195 bits (495), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 138/383 (36%), Positives = 205/383 (53%), Gaps = 45/383 (11%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K+ +AL   L+   AF +    KKP+II++L DD GW DV F+G++ IPTPN+D LA  G
Sbjct: 4   KFLYALALALM--TAFSSLAQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEG 61

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGE---PWGLPLTEKLLPQYLKE 121
           ++ +  YV    C+PSR+ L+TG+Y    G    +  + E     G PL+EK++P+ LKE
Sbjct: 62  IVFDNAYVSHPYCSPSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKE 121

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY---DHSCKATF---EPYQG 175
            GY T AIGKWHLG    ++ P  +GFD  +G+  G  +Y+   D   K      EP   
Sbjct: 122 HGYRTSAIGKWHLGDHPSLH-PIHQGFDHWFGFAGGGMNYWGIPDGPIKTIVRNGEPVPQ 180

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
            ++R            Y TD +T+ AI+ I + +  KP F+YLA+ A HA     P QA 
Sbjct: 181 NELR------------YLTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHA-----PDQAT 222

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
            + + +   I    R  YA MV+ +D +VG + + L  +GM EN+I++F++DNG    G 
Sbjct: 223 KQYLEQTKHIEYGGRSIYAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGR--GE 280

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQ---KVSSELFHISDWLPTLCAA 351
           H++   N P RG K   ++GG++    I W   LK  Q   K+ S L    D  PT   A
Sbjct: 281 HAD---NRPYRGHKGMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSL----DLFPTFLEA 333

Query: 352 AGIEI-NDTSLDGVNQWDVLTKG 373
           AG ++  ++ LDG      + +G
Sbjct: 334 AGGKVKKESQLDGTTLLPYILEG 356


>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 489

 Score =  195 bits (495), Expect = 7e-47,   Method: Compositional matrix adjust.
 Identities = 134/412 (32%), Positives = 202/412 (49%), Gaps = 51/412 (12%)

Query: 8   FFALTCTLLFNDAFLNTT--APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           F  +TC  L  +     T    + PHI+ I+ADDLGWNDV FHGS +I TPNID LA   
Sbjct: 22  FAIMTCFSLVGENVSAETNGGSQPPHIVHIIADDLGWNDVGFHGS-EIRTPNIDRLASES 80

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           + L++ YV  +C+P+R+ ++TG YP   G+  GV+   +  GLP   +  P++L + GY 
Sbjct: 81  VTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVVSPTKKHGLPTLLETTPEHLSKLGYD 140

Query: 126 THAI-GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
             A+ GKWHLG    ++ P   G    YG++ G  DY+        + ++  D  H    
Sbjct: 141 HRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAIDYFSRERFGQLDWHRDFDSVHEEG- 199

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA---------- 234
                  YST+L   A ++ I  +  +  ++ Y+A  A H+     P QA          
Sbjct: 200 -------YSTELVGNAVVDFIDRNANAGAVYAYVAFNAPHS-----PLQALRSDLDEYGF 247

Query: 235 ---------PDEEVA---KFLDI-----SDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
                     D ++A   K LD       +  R+T+A M + +D  +G ++ A+ ++GM 
Sbjct: 248 DPNNKLSPNTDRKIAKREKALDYGKRGKGNSIRQTFAAMTTAMDRQIGRILDAIDRNGMR 307

Query: 278 ENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           EN++V+F +DNGA P  G     GSN PLRG K T W+GG+R VA +  P          
Sbjct: 308 ENTLVVFHSDNGADPKHG-----GSNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYD 362

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
            +    D LP + AAAG    + + DG+N    L+  A      IL + + V
Sbjct: 363 SVTSYVDLLPIMVAAAGGPPPEET-DGINLLPFLSGKASPPERTILLDAETV 413


>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
 gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
          Length = 520

 Score =  194 bits (494), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 126/364 (34%), Positives = 190/364 (52%), Gaps = 31/364 (8%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           A   F  L    +F            P+I++ILADD+G+ D+   GS  + TPN+D LA 
Sbjct: 30  AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 89

Query: 64  NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
           +G++ +Q YV  A+C+PSR+ L+T + P   G +  +    E +       GLP +EK L
Sbjct: 90  SGVLCSQAYVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 149

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
             +L  AGYAT  IGKWHLG   E++ P  RGFD   G   G   Y+  + K   E    
Sbjct: 150 ADHLGAAGYATALIGKWHLGM-GEMHHPNRRGFDHFCGMLTGSHHYFPATMKHVIE---- 204

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
              R+  +VD+ +   Y TD +T+  +  I +H   N  +P F++ ++ A H      P 
Sbjct: 205 ---RNGKRVDDFS-SEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PM 255

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A + ++A+F +I + +RRTYA M+  LD  VG +   L + G  EN++++F +DNG   
Sbjct: 256 HATEADLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNG--- 312

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAA 351
            G  +N   N PLRG+K +  +GG+R V  IW+ P       +   +    D LPT C+A
Sbjct: 313 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSA 370

Query: 352 AGIE 355
           AG E
Sbjct: 371 AGAE 374


>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
          Length = 637

 Score =  194 bits (494), Expect = 9e-47,   Method: Compositional matrix adjust.
 Identities = 122/320 (38%), Positives = 165/320 (51%), Gaps = 29/320 (9%)

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH VI   +P  LPL    LPQ L+EAGYATH +GKWHLGF+R    PT RG
Sbjct: 109 QYQIHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMVGKWHLGFYRTECLPTRRG 168

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEAAINVIA 206
           FDS +G   G  DYY H  K       G D+  N +    +  G YST +YT+    ++A
Sbjct: 169 FDSFFGSLLGSGDYYTHY-KCDSPGVCGYDLHENDRAAWGRGGGAYSTQMYTQRVQRILA 227

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  KP+FLY+A+ AVH+     P QAP   +  +  I++  RR YA M+S LDE+V N
Sbjct: 228 SHDPGKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSIANARRRRYAAMLSCLDEAVAN 282

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V  AL+ HG  +NS++++ +DNG    G  +  GSN PLRG K T W+GG+R V  + SP
Sbjct: 283 VTLALKAHGFYDNSVIVYSSDNG----GQPAAGGSNWPLRGRKGTYWEGGVRAVGFVHSP 338

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ------------------WD 368
            L+Q                                                      W+
Sbjct: 339 LLRQRXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXXLWE 398

Query: 369 VLTKGAKTKRSEILHNIDNV 388
           VL++G ++ R +ILHNID V
Sbjct: 399 VLSEGQRSPRVDILHNIDPV 418



 Score = 51.2 bits (121), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 36/63 (57%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPW 564
           LFNIT DP E+ +L++    +++QL  +L+ +  T VP    P D R +P     +W PW
Sbjct: 488 LFNITADPYERVDLSDRYPGVVRQLLRRLSRFNRTAVPVRYPPRDPRGNPQLHGGVWGPW 547

Query: 565 YDE 567
           Y +
Sbjct: 548 YHQ 550


>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
          Length = 515

 Score =  194 bits (492), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 122/372 (32%), Positives = 194/372 (52%), Gaps = 29/372 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGS--------SQIPTPNIDALAYNGLILNQHYVQALCTP 79
            +P+I++I+ DD+GW +   H +        S + TP +  L   G+ L +HY    C+P
Sbjct: 26  SQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYKFCSP 85

Query: 80  SRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           +RSALM+G+ PIH+   +   ++G   G+ L   +LP+ L   GYATH +GKWHLG    
Sbjct: 86  TRSALMSGRLPIHVNEVNPTYIKGG--GVSLNFTMLPEKLSRVGYATHHVGKWHLGMSSH 143

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
              PT RGFDS  GY  G +D++  +   T     G+D+  N          Y   L+  
Sbjct: 144 SRVPTGRGFDSSLGYLLGAEDHWKQT--VTQGNATGVDLWLNEGPAYLRNNEYGDLLFGR 201

Query: 200 AAINVIAEHNKS---KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
            A+ VI +H K+    P+FLY+     H+     P + P E   ++  I   +R    GM
Sbjct: 202 RAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGIQ-VDRAIVNGM 255

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
            S +D+++G +  AL +  M E+++V++++DNG  S GI++N G+N+PLRG K T ++GG
Sbjct: 256 SSVMDDAIGKITTALWETTMWEDTLVVYISDNGGAS-GINANNGNNYPLRGGKHTDFEGG 314

Query: 317 MRGVAAIWSPWLKQTQKVSSE--LFHISDWLPTLCAAAGIEIND-----TSLDGVNQWDV 369
           +R  A +   +L   Q+ S+     H  DW  TL   AG    D       +DG++ W +
Sbjct: 315 VRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADDVEGLPGVDGMDMWPM 374

Query: 370 LTKGAKTKRSEI 381
           +    K+ R E+
Sbjct: 375 IANNEKSPRHEV 386



 Score = 40.8 bits (94), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 22/66 (33%), Positives = 33/66 (50%), Gaps = 5/66 (7%)

Query: 504 CLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPA-----RWN 558
           CLFNI +DP E +++     DL  +L+D+L+   ST    G        DPA     R++
Sbjct: 443 CLFNIKEDPTEHHDIKFDNLDLFWKLKDRLSKLASTAYQTGEDSEGDCEDPAHVMETRYH 502

Query: 559 NIWVPW 564
             W P+
Sbjct: 503 GFWGPF 508


>gi|341889947|gb|EGT45882.1| CBN-SUL-3 protein [Caenorhabditis brenneri]
          Length = 432

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 132/402 (32%), Positives = 202/402 (50%), Gaps = 87/402 (21%)

Query: 22  LNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY--NGLILNQHYVQALC 77
           +N  +P  +KP+++ I+ADDLG+ND+ +  S+ + TPN+  LA+  N  +L   YV  LC
Sbjct: 23  VNGQSPPTQKPNVLFIMADDLGFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLC 81

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           TP+RSA MTG YP  +G Q GV L  EP G+P     L + +++  Y+T+ +GKWHLG+ 
Sbjct: 82  TPTRSAFMTGYYPFRVGTQAGVFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYC 141

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC----KATFEPYQGLDMRHNMQVDNKTI---- 189
           ++ + PT RGFD  YG++     Y++HS     +      +GLD+   +  + K++    
Sbjct: 142 KKEFLPTNRGFDYFYGFYGPQTGYFNHSADQYHRELRRVVKGLDLFEEVG-NGKSVPDFS 200

Query: 190 --GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G+YSTDL+T+ A++VI  H                  NT +PF         F+ +S 
Sbjct: 201 QNGVYSTDLFTDVAMSVIDNH------------------NTTKPF---------FMFLS- 232

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
                Y  +                 H  L+N         G  +FG      SN PLRG
Sbjct: 233 -----YQAV-----------------HPPLQNG--------GTSNFG-----ASNAPLRG 257

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
            K T W+GG +    I SP   +       +FH+ DW  T+ A  G+EI D+  DG+NQW
Sbjct: 258 EKDTIWEGGTKTTTFIHSPMYVEEGGEREMMFHVVDWHATVLAITGLEI-DSYGDGINQW 316

Query: 368 DVLTKGA-KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           + +  G  K +R + ++NIDN      + +A+R  D K + G
Sbjct: 317 EYIKTGRPKFRRFQFVYNIDN------HGSAIRDGDYKLIVG 352


>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
          Length = 697

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 184/364 (50%), Gaps = 40/364 (10%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           PHI+  L DDLG+ +        + TP+ID LA NG +L + Y    C+P+RS+L++G+ 
Sbjct: 75  PHIVYFLVDDLGYANTGVTNDEPL-TPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGRL 133

Query: 90  PIHIGMQHGVILEGEP-----WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           PIH+        E  P      G+      L + L+ AGYATH +GKWH G       P 
Sbjct: 134 PIHVNQ------ENRPPPVPGGGIDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPV 187

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQG-LDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGF+S +GY  G + +Y        +  QG +D+  + Q      G Y T +Y   A  
Sbjct: 188 HRGFNSSFGYLSGAEHHYS-------QIRQGYVDLWRDTQPAYGENGTYGTFMYAREAER 240

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           +I  H+ + P+FLYLA   VH+     P + PD  + K+ D+    R+    MVS +DE+
Sbjct: 241 IIHAHDPTTPLFLYLAFQNVHS-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEA 295

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           + NV  ALR+ G+ ++++++F +DNG P      +  +N+P RG K+  ++GG+R VA +
Sbjct: 296 INNVTLALRRAGLFDDTLIIFSSDNGGP-----HDHANNYPFRGAKTADFEGGVRAVAFV 350

Query: 324 WSPWLKQTQKVSS--ELFHISDWLPTLCAAAGIEIND--------TSLDGVNQWDVLTKG 373
               + +    ++   L H+ DW  TL   AG +  D          +DG + W +LT  
Sbjct: 351 SGGVVPKNMVGTNVHGLMHVCDWYSTLARLAGEDPTDHRAARSNLPPIDGFDMWPMLTGA 410

Query: 374 AKTK 377
             T 
Sbjct: 411 NSTS 414


>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
          Length = 339

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 114/304 (37%), Positives = 167/304 (54%), Gaps = 20/304 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+II   ADD G+ D  +  +S I TPNID L  +G+     Y    C+PSRSA ++G+
Sbjct: 31  KPNIIWFFADDYGYQDAGYR-NSDIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGR 89

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y    GMQHGVI + +   + L    +  YLKE  Y THA GKWHLG+  +  TPT+RGF
Sbjct: 90  YAYTSGMQHGVIGDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGF 149

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D+  G + G  +YY+H+       +QG+ D  +  ++D    GI+S D+     +N + +
Sbjct: 150 DTFSGGYSGEGEYYEHT------TFQGMYDWHNGTELDRSASGIHSQDMIEADLLNQL-D 202

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDESVGN 266
            N   P+F Y    A H  NT+ P     E  A + D+ + P RR Y G+VS +D  +G 
Sbjct: 203 KNDGTPLFYY---AAFH--NTHSPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGK 257

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V+  L++  M +N+ + F +DNG     +H  +G N P RG KS+ +DGG    +   SP
Sbjct: 258 VVTKLKEKDMFDNTYIFFSSDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSP 312

Query: 327 WLKQ 330
            LK+
Sbjct: 313 LLKK 316


>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 131/365 (35%), Positives = 190/365 (52%), Gaps = 40/365 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+++++LADDLGW DV F+G ++  TPN+D LA  G    + Y  A +C PSR+ALMT
Sbjct: 38  KRPNVLLLLADDLGWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMT 97

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE---VYT- 142
           G+Y IH G+            LP  E  L +  K  GY T   GKWH G  R+    Y  
Sbjct: 98  GRYTIHDGVSRNND------DLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVH 151

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P  +GFD  +G+            K  +E Y    + H  ++  K +  YS D++ + AI
Sbjct: 152 PMDQGFDEFFGF---------TDAKHAWEKYP-EQLWHGREL--KPVSGYSDDMFADHAI 199

Query: 203 NVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA----KF--LDISDPERRTYAG 255
           + +  H  K  P FLY+  +     NT+   +AP EEVA    KF  +D S P R TYA 
Sbjct: 200 DFLKRHKEKPTPFFLYVPFI-----NTHFNIEAPAEEVALHQGKFTEVDPSKPIRATYAA 254

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNKGSNHPLRGMKST 311
           MV++LD++VG ++ AL   G+  +++V+F++D+GA     + G      SN P RG K T
Sbjct: 255 MVTQLDKNVGRIMTALEGLGLAADTLVVFVSDHGATFESGNLGASDYHDSNRPFRGQKRT 314

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVL 370
            W+GG+R       P       +S+E+ H++D  PTL AA G        +DG + W   
Sbjct: 315 LWEGGIRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATGAHPEAAWHVDGADLWPTW 374

Query: 371 TKGAK 375
           T  AK
Sbjct: 375 TGKAK 379


>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 459

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 130/376 (34%), Positives = 206/376 (54%), Gaps = 35/376 (9%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F A++C L+   A     A + P+I++I+ADDLG+ D++ +G+ Q+ TP+ID LA + L 
Sbjct: 15  FIAVSCLLIRFSA--AEAAQQPPNIVLIMADDLGYGDLACYGNKQVKTPHIDRLAASALK 72

Query: 68  LNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEKLLPQYLKEAG 123
               H   A+CTP+R+A++TG+Y    G Q    L G+     GLP     + + LK+ G
Sbjct: 73  FTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFESALSGKSNHDIGLPHQAVTMAELLKQQG 132

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           YAT   GKWHLG ++  + PT +GFD   G   G  D++ H  ++  E     D  HN +
Sbjct: 133 YATACFGKWHLG-YQPPWLPTNQGFDLFRGLTSGDGDHHTHVDRSGNE-----DWWHNNE 186

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-------- 235
           +  +    Y+ DL ++ ++  + E N+++P FLY+ HLA+H      P+Q P        
Sbjct: 187 ISMEK--GYTADLLSKYSVAFM-EANRTRPFFLYVPHLAIHF-----PWQGPQDPPHRKA 238

Query: 236 --DEEVAKFLDISDPERRT--YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA- 290
             D    K+  I DP   +     M+  LD+SVG +++AL++  + +N++V+F +DNG  
Sbjct: 239 GQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGKILSALKRLDLEQNTLVIFTSDNGGY 298

Query: 291 PSFGIH-SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
            ++G +  N  SN PLRG K+T ++GG R    I  P +  T  V+ +  H  D LPTL 
Sbjct: 299 LTYGKNFQNISSNGPLRGQKATLYEGGHRVPCLISWPGVI-TAGVTDQTAHSVDLLPTLA 357

Query: 350 AAAGIEINDTSLDGVN 365
            AAGI   +   DG++
Sbjct: 358 QAAGISATNFQTDGLD 373


>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 460

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 127/397 (31%), Positives = 208/397 (52%), Gaps = 29/397 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           A ++P+I+II  DD G NDV  +GS +IPTP+ID LA  GL+  Q+Y   A+CTPSR  +
Sbjct: 24  AAERPNILIIFTDDQGINDVGCYGS-EIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGI 82

Query: 85  MTGKYPIH-----IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           +TG+ P       +G    +    +  G+   E  +   L++ GY T  +GKWHLG   E
Sbjct: 83  LTGRNPTRSQDQLLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTE 142

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            + PT  GFD   G+  G  DY+  +     + Y   + RH  +        Y+TDL TE
Sbjct: 143 SFLPTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYH--NQRHVSENG------YATDLITE 194

Query: 200 AAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEP--------FQAPDEEVAKFLDISDPER 250
            A + + +   + KP FL+L++ A H G  + P         QA  +++ +   I D  R
Sbjct: 195 EAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTIKDKVR 254

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           R +A M   LD+ +G V+++L+ +G+ +N++V+FM D+G    G +   G+N P RG K+
Sbjct: 255 REFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHG----GDYVYGGNNQPFRGAKA 310

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           T ++GG+R    I  P   +    ++E+    D  PT+C  A ++ +  +LDG +   +L
Sbjct: 311 TLFEGGIRVPCIIRWPGKIKAGTETNEVAWALDLFPTICHFANVDTDGLTLDGKDISGLL 370

Query: 371 TKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLKYV 406
           T+       E+   +  + +  +  ++ALR  D KY+
Sbjct: 371 TRQTPVGTRELYWQLGPHAELKRGRWSALRQGDWKYI 407


>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
          Length = 470

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 128/381 (33%), Positives = 193/381 (50%), Gaps = 19/381 (4%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F AL C      A     A  KP++I+IL DD G  D +  G+  + TP+ID LA  G+ 
Sbjct: 25  FLALFCFTASGLASAADQATGKPNVIVILTDDQGTIDANCFGAKDLETPHIDGLAARGVR 84

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
           L Q Y  A +C+PSR+ L+TG+YP+  G+   V       G+P T+  + +  K AGYAT
Sbjct: 85  LTQFYAAAPVCSPSRAGLLTGRYPVRGGVPGNVSSTAGVAGMPTTQVTMAETFKAAGYAT 144

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
             IGKWHLG + E   P  +GF + +G+  G  D + H     F  +QG + RH++  D 
Sbjct: 145 AHIGKWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNFSH-----FFYWQGPN-RHDLHKDG 197

Query: 187 KTI---GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           + +   G +  DL  E A   I   N+ +P F+Y A    H      P+Q     + ++ 
Sbjct: 198 REVQRPGRFFPDLMVEEASQFIGT-NRERPFFVYFALNGPH-----YPYQGDPRWLERYK 251

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG-APSFGIHSNKGSN 302
           D+  P R  YA  VS +DE +G ++  + + G+ E +IV+F +D+G +     H   GS 
Sbjct: 252 DLPYP-RNLYAAFVSSIDERIGGLLKRVDELGLREKTIVVFQSDHGHSTEERAHLGGGSA 310

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
              RG K + ++GG+R  A I  P      +  S+L H  DWLPTL A A + + +  +D
Sbjct: 311 GIYRGAKFSLFEGGIRVPAIISWPGRLPQGEARSQLAHGCDWLPTLAALAAVPLVEPDID 370

Query: 363 GVNQWDVLTKGAKTKRSEILH 383
           GVN   +L          ++H
Sbjct: 371 GVNLVPILNSADAATPHGLVH 391


>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 498

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 128/368 (34%), Positives = 190/368 (51%), Gaps = 39/368 (10%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           A   F  L    +F            P+I++ILADD+G+ D+   GS  + TPN+D LA 
Sbjct: 8   AVSRFVFLPLIFVFATEASRAAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAE 67

Query: 64  NGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-------GLPLTEKLL 115
           +G++ +Q YV  A+C+PSR+ L+TG+ P   G +  +    E +       GLP +EK L
Sbjct: 68  SGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTL 127

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG 175
             +L  AGYAT  IGKWHLG   E++ P  RGFD   G   G   Y+  +     E    
Sbjct: 128 ADHLGAAGYATALIGKWHLG-MGEMHHPNRRGFDHFCGMLTGGHHYFPTTMNHVIE---- 182

Query: 176 LDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPF 232
              R+  +V+N +   Y TD +T+  +  I +H   N  +P F++ ++ A H      P 
Sbjct: 183 ---RNGKRVENFS-SEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PM 233

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A + ++A+F +I   +RRTYA M+  LD  VG +   L + G   N++++F +DNG   
Sbjct: 234 HATEADLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNG--- 290

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS-----PWLKQTQKVSSELFHISDWLPT 347
            G  +N   N PLRG+K +  +GG+R V  IW+     P   +   V S L    D LPT
Sbjct: 291 -GATNNGSWNGPLRGVKGSMREGGIR-VPMIWTWPAKFPAGVRYDGVVSSL----DLLPT 344

Query: 348 LCAAAGIE 355
            C+AAG E
Sbjct: 345 FCSAAGAE 352


>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 468

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 132/382 (34%), Positives = 200/382 (52%), Gaps = 34/382 (8%)

Query: 13  CTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           C LL    F  L+T+A KK +I+ + ADD G+ D  F GS   PTPN+D LA  G++  Q
Sbjct: 7   CLLLVALVFFSLSTSAQKKKNIVFLFADDAGYFDFGFQGSKTFPTPNLDQLAKEGMVFKQ 66

Query: 71  HY-VQALCTPSRSALMTGKYPIHIGMQHGVI---------LEGEPWGLPLTEKLLPQYLK 120
            Y   A+C PSR+ L+TG+Y    G +   +         L G+  GLPL EK +  YL 
Sbjct: 67  AYTTAAVCGPSRAGLLTGRYQQRFGFEENNVPGYMSKSSKLLGDDMGLPLDEKTMADYLG 126

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMR 179
           + GY +  +GKWH+G   + Y P  RGF   YG+  G + +Y  + K A  +P   L++ 
Sbjct: 127 KLGYQSIVLGKWHMG-NADRYHPLKRGFTEFYGFRGGARSFYPLTQKQAADKPEDRLEIG 185

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           +    + K    Y T    +AA + I + NK KP F+Y++  AVH+     P QA  E++
Sbjct: 186 YKKYQEPKK---YLTYDLADAACDFI-DRNKKKPFFMYVSFNAVHS-----PMQATQEDL 236

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            K   ++  +R+  A M   LD++ G ++  L++ G+ ++++++F  DNG P       K
Sbjct: 237 KKIKGLTG-KRKVLAAMTLALDKACGQILEKLKEEGLEDDTLIVFSNDNGGPD----GTK 291

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEIN 357
             N+PL G KS   +GG+R    +  P  K  Q  SS  + +S  D LPT    AG + +
Sbjct: 292 TCNYPLSGCKSNNLEGGIRVPFVMKLP--KVIQPGSSYEYPVSTLDLLPTFVNIAGGKAD 349

Query: 358 DTS-LDGVNQWDVLTKGAKTKR 378
             + LDGV+    +T G  TKR
Sbjct: 350 KIAQLDGVDLLPYIT-GKDTKR 370


>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 134/359 (37%), Positives = 191/359 (53%), Gaps = 42/359 (11%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ--IPTPNIDALAY 63
           K F AL C            A  KP+II+IL+DD G+ DVS++      I TP+ DALA 
Sbjct: 7   KLFLALLCVNFV------ALADSKPNIIVILSDDQGYADVSYNPEHDDYISTPHTDALAK 60

Query: 64  NGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-GLPLTEKLLPQYLKE 121
           +G+I ++ Y   ++C+ +RS LMTG+Y      ++G+   GE   G  L  K +P YLKE
Sbjct: 61  SGVIFHRGYTSGSVCSTTRSGLMTGRYQ----QRYGIYTAGEGGTGTDLNAKFIPNYLKE 116

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW-QGLQDYY----DHSCKATFEPYQGL 176
           AGY + A GKWHLG   + Y P  RGFD  YG+  +G  D++    ++  K     Y+GL
Sbjct: 117 AGYKSMAFGKWHLGHEMK-YHPLHRGFDDFYGFMGRGAHDFFRLEKEYDGKFGGPIYRGL 175

Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           +      +D+K    Y T   TE  +  I E NK KP F Y+A+ AVH      P QAP 
Sbjct: 176 E-----PIDDKG---YLTTRITEETVKFI-EENKDKPFFAYVAYNAVHT-----PAQAPA 221

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           E++       D  R     M+  LD  VG ++  L+KH + EN+I+++++DNG    G  
Sbjct: 222 EDIKAV--SGDETRDILVAMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNG----GAK 275

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
           S   +N PLRG+K   +DGG+R    + W   +K  Q   S +  + D LPTL  AAG+
Sbjct: 276 SMVANNKPLRGVKHDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISL-DILPTLLDAAGL 333


>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
 gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
          Length = 463

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 132/363 (36%), Positives = 191/363 (52%), Gaps = 34/363 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K P+I++IL DD G+ND  F GS ++ TPNIDAL   G++    +V A + +PSR+ L+T
Sbjct: 30  KNPNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLIT 89

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +    L     GLPL E+ + +  K  GY T AIGKWHLG  R+   P  R
Sbjct: 90  GRYGHRFGYE--CNLSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDEQHPNNR 146

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAINV 204
           GFD  YG   G +DY+ ++ K +  P        N+ ++++ +    Y TD ++E A+  
Sbjct: 147 GFDLFYGMKAGGRDYF-YNEKKSDRPGD----ERNLLLNDRQVKFEKYLTDAFSEKAVEF 201

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I E   S+P  +YLA+ AVH      P QA DE++AKF       R+  A M   LD  V
Sbjct: 202 INE--SSQPFMMYLAYNAVHT-----PMQATDEDMAKF---EGHPRQKLAAMTYALDRGV 251

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAI 323
           G VI  L+  G  +N+++ F++DNG    G  +N+ SN+PL+G K   ++GG R     +
Sbjct: 252 GTVIRGLKDSGKFDNTLIFFLSDNG----GATTNQSSNYPLKGFKGNKFEGGHRVPYFVV 307

Query: 324 WS---PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
           W    P  K    + S L    D   T   AA I      LDGV+    L KG K +  +
Sbjct: 308 WKNGIPAGKPYDGLVSSL----DIFATAIDAASISKTRNRLDGVSLLPYL-KGKKGEPHK 362

Query: 381 ILH 383
           +L+
Sbjct: 363 VLY 365


>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
 gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
          Length = 484

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 134/397 (33%), Positives = 198/397 (49%), Gaps = 34/397 (8%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSF-HGSSQIPTPNIDALAYNG 65
           +   ++C+ L  D        K+P+I++ILADDLG+NDV F +G ++I TP +DALA  G
Sbjct: 13  FMVVISCSALGAD--------KRPNIVVILADDLGYNDVGFTNGQTEIKTPRLDALANEG 64

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWGLPLTEKLLPQYLKEA 122
           ++    YV    C PSR+ L+TG+Y    GM++ V    + +  GLPLTEK  P  L+E 
Sbjct: 65  VVFENGYVTHPYCGPSRAGLITGRYQARFGMENNVTYSPDDKYMGLPLTEKTFPARLQEV 124

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           GY T   GKWHLG     + P  RGFD  YG+  G  +Y           Y    MR   
Sbjct: 125 GYKTAIFGKWHLGGAPH-FQPNERGFDYFYGFLDGGHNYMPGEVHLGAGGYLLPIMR--- 180

Query: 183 QVDNKTIGIY----STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
              NK +  +    +T L  +AA  +     +  P F+Y+++ A HA     P QAP   
Sbjct: 181 ---NKGVAEFDEYLTTALSRDAARYIERTSKEQAPFFIYMSYNAPHA-----PLQAPQNY 232

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP---SFGI 295
           + K+  I D +RRTYA MV  +DE +G +I AL K G L+N+++ F++DNG     S+  
Sbjct: 233 LEKYAHIKDEKRRTYAAMVDAMDEGIGMLIDALDKSGELDNTLIFFLSDNGGLYPYSWWP 292

Query: 296 HSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
                 N P R  K    +GG+     A W   + + ++++  L    D   T  A AG 
Sbjct: 293 DYTWADNSPFRFGKVALTEGGVHVPFIAHWPKKITKGKRLNG-LVSALDIAATSLALAGA 351

Query: 355 EINDTSLDGVNQWDVLT-KGAKTKRSEILHNIDNVDN 390
              D  LDGV+    +T K  ++    +   ++  DN
Sbjct: 352 NTKDAKLDGVDLTAYVTGKSKQSPHKALFWRMEEADN 388


>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 374

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 113/321 (35%), Positives = 170/321 (52%), Gaps = 28/321 (8%)

Query: 30  PHIIIILADDLGWNDVSFH---GSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           PHI+++L DD GW +V +H    + ++ TPNID L   GL LNQHYV   C+PSRS+L++
Sbjct: 25  PHIVMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEGLELNQHYVYKFCSPSRSSLIS 84

Query: 87  GKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           G+ PIH+  Q+ +     P         +P     + + ++ AGYATH +GKW  G    
Sbjct: 85  GRLPIHVNDQNLLPTNYNPDDPVSGFSAIPRNMTGIAEKMRGAGYATHQVGKWDAGMATS 144

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-------Y 192
            +TP  RGF + +GY+    DYY  +  A     Q +D+    + D    GI       Y
Sbjct: 145 DHTPKGRGFQTSFGYFHHANDYYKETASACNRT-QIVDL---WETDKPAHGINGTGPDNY 200

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
              L+ E  ++V++ H+ S P+FLY A   VH      P Q PD  +  F  I+D +R+ 
Sbjct: 201 EEGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQVPDRYLKLFSFINDHDRQY 255

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y  MV+ LD+ VG +  AL+  GM +N + +  +DNG P +       +N+PLRG K T 
Sbjct: 256 YHAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPVY--PGGGANNYPLRGGKVTD 313

Query: 313 WDGGMRGVAAIWSPWLKQTQK 333
           W GG+R  A +   +L +  +
Sbjct: 314 WQGGVRVNAFVSGGYLPEKMR 334


>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
 gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 463

 Score =  192 bits (487), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 132/369 (35%), Positives = 193/369 (52%), Gaps = 34/369 (9%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
           + +   K P+I++IL DD G+ND  F GS ++ TPNIDAL   G++    +V A + +PS
Sbjct: 24  VESKTDKNPNILVILIDDAGYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPS 83

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R+ L+TG+Y    G +    L     GLPL E+ + +  K  GY T AIGKWHLG  R+ 
Sbjct: 84  RACLITGRYGHRFGYE--CNLSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLG-SRDE 140

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYT 198
             P  RGFD  YG   G +DY+ ++ K +  P        N+ ++++ +    Y TD ++
Sbjct: 141 QHPNNRGFDLFYGMKAGGRDYF-YNEKKSDRPGD----ERNLLLNDRQVKFEKYLTDAFS 195

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           E A+  I E   S+P  +YLA+ AVH      P QA DE++AKF       R+  A M  
Sbjct: 196 EKAVEFINE--SSQPFMMYLAYNAVHT-----PMQATDEDMAKF---EGHPRQKLAAMTY 245

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD  VG VI  L+  G  +N+++ F++DNG    G  +N+ SN+PL+G K   ++GG R
Sbjct: 246 ALDRGVGTVIRGLKDSGKFDNTLIFFLSDNG----GATTNQSSNYPLKGFKGNKFEGGHR 301

Query: 319 -GVAAIWS---PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
                +W    P  K    + S L    D   T   AA I      LDGV+    L KG 
Sbjct: 302 VPYFVVWKNGIPAGKPYDGLVSSL----DIFATAIDAASISKTRNRLDGVSLLPYL-KGK 356

Query: 375 KTKRSEILH 383
           K +  ++L+
Sbjct: 357 KGEPHKVLY 365


>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 583

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 147/432 (34%), Positives = 218/432 (50%), Gaps = 67/432 (15%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQ----IPTPNIDALAYNGLILNQHYV-QALCT 78
           T   + P++I+I+ADDLG+NDV+ +G       +PTPNID++A +G+  +  Y   A+CT
Sbjct: 91  TLKRRPPNVIVIMADDLGFNDVTTYGGGYADLGVPTPNIDSIAKDGVRFDSGYSGSAVCT 150

Query: 79  PSRSALMTGKYPIHIGMQH----------GVILEGEP---------------------WG 107
            SR+ALMTG+YP   G++           G  L  +P                      G
Sbjct: 151 VSRAALMTGRYPSRFGVEFTPTPGALARVGADLYADPNRLYPVVIDKEKAEKSKSFNELG 210

Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHY----GYWQGLQDYYD 163
           +P +E  + + LKE GY +  IGKWHLG   E+  P  +GFD       G +  + D   
Sbjct: 211 MPGSEITIAEVLKERGYHSVHIGKWHLGSTEEM-RPNKQGFDESLFMESGLYLPVDDPNV 269

Query: 164 HSCKATFEPYQGL---DMRHNMQVDNKT---IGIYSTDLYTEAAINVIAEHNKSKPMFLY 217
           ++ K  F+P       +MR  +  +         Y TD +T+ A+ VI + NK +P FL+
Sbjct: 270 YNAKQDFDPIDRFLWPNMRFGVSYNGGKWFEPSRYLTDYFTDEAVKVI-KTNKHRPFFLF 328

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
           LAH AVH      P QA  E+      I D  +R Y  M+  LD SVG ++A+L++ G+ 
Sbjct: 329 LAHWAVHT-----PLQASKEDYDALSHIKDHRKRVYLSMIRSLDRSVGKILASLKEEGLD 383

Query: 278 ENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVS 335
           EN+IV+F +DNGAP++ G+      N P RG K T + GG+R    A W   +K   K  
Sbjct: 384 ENTIVIFTSDNGAPNYIGLPD---VNSPYRGWKLTLFQGGIRVPYVAKWPGHIKPGTKYQ 440

Query: 336 SELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY 394
           + + +I D LPT+  AAGI++  D  +DGVN    L KG   ++   L   D       Y
Sbjct: 441 NAITNI-DILPTVANAAGIKLPLDREIDGVNLLPYL-KGQAIQKPRPLFWSDG------Y 492

Query: 395 YAALRVDDLKYV 406
           Y  ++ D  K +
Sbjct: 493 YQTVQADGWKLI 504


>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 460

 Score =  191 bits (486), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 190/351 (54%), Gaps = 33/351 (9%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
           TA K+P+II++L DD GW DV F+G++ IPTPN+D +A  G+I +  YV    C+PSR+ 
Sbjct: 21  TAQKQPNIIVVLTDDQGWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAG 80

Query: 84  LMTGKYPIHIGMQHGVILEGE---PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           L+TG+Y    G    +  EGE     G PL+EKL+ + LKE GY T AIGKWH+G    +
Sbjct: 81  LLTGRYQARFGHDCNMPYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIGDHPNL 140

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           + P  +GFD  +G+  G  +Y+  +       Y     R+   V  + +  Y TD +T  
Sbjct: 141 HPPA-QGFDHWFGFPGGSMNYWGKATSKIQTIY-----RNTKPVAEEEL-TYLTDDFTNE 193

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD----ISDPERRTYAGM 256
           AIN I + +K+ P F+YLA+ A H         APD    K+L+    I    R  YA M
Sbjct: 194 AINFINKKDKN-PFFIYLAYNAPH---------APDHATKKYLEQTKHIEYAGRSVYAAM 243

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V+ +D ++G + + L  +G+ +N+I++F++DNG            N P RG K   ++GG
Sbjct: 244 VNAVDANIGKIDSTLVANGIKDNTILVFLSDNGG-----RIQHADNRPYRGHKGMLFEGG 298

Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVN 365
           ++    I W   +K+  K +  +  + D  PTL  AA G    +  LDGV+
Sbjct: 299 IKVPFFISWPSKIKEDSKYTKPISSL-DLFPTLLNAAGGNAAIEKQLDGVD 348


>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 490

 Score =  191 bits (486), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 141/394 (35%), Positives = 198/394 (50%), Gaps = 38/394 (9%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTTAP-KKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           M+  RK F ++    L + +      P K+P+II+I++DD G+ D SF GS  I TPN+D
Sbjct: 9   MSLLRKAFTSVAALSLASSSVRADDTPTKRPNIIVIVSDDQGYADASFQGSKDILTPNLD 68

Query: 60  ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-PWG-LPLTEKLLP 116
           ALA +G+   + YV A +C+PSR+ LMTG+Y    G  + ++ E   P   LP  E LLP
Sbjct: 69  ALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNIVAEAALPIAHLPSNETLLP 128

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY----------DHSC 166
           Q L +AGY T  +GKWHLG  ++   P  RGFD  +G   G  DY+          D S 
Sbjct: 129 QVLAKAGYYTAMVGKWHLG-LQDGCRPYERGFDEFFGIITGGHDYFVNHPEERAVGDQSY 187

Query: 167 KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNK--SKPMFLYLAHLAV 223
           KA  E         N  V     G Y TD +   A+ +I E H K   +P+FLYLA  A 
Sbjct: 188 KARIE--------RNGPVGEAVPG-YLTDAFGADAVRIIRESHTKRPDQPLFLYLAFNAP 238

Query: 224 HAGNTYEPFQAPDEEVAKF-LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
           H      P QAP + V      +   +RRTYA  ++ +D SVG V AAL+++GM +++ +
Sbjct: 239 HT-----PTQAPKDLVDTMPATLESKDRRTYAAQITSMDASVGKVRAALKENGMEKDTFI 293

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
           +F +DNG    G +     N PLR  K + ++GG+R       P       V        
Sbjct: 294 VFFSDNG----GANHPYYDNTPLRDHKGSLYEGGIRVPFFAVYPGHIPAGSVCELPVTSL 349

Query: 343 DWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAK 375
           D   T CA AG +   +  LD V+   VL   A+
Sbjct: 350 DVFATACALAGTKPETSHPLDSVDMLPVLEGNAR 383


>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 473

 Score =  191 bits (486), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 189/355 (53%), Gaps = 33/355 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           +KP+I++I ADD G+ D  F GS+Q+ TPN+D LA +G+   Q YV  + C PSR+ LMT
Sbjct: 27  EKPNIVLIFADDAGFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMT 86

Query: 87  GKYPIHIG---------MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           GKY    G         M     L+G   GLPL +K +  YLKE GY T   GKWHLG  
Sbjct: 87  GKYQQRFGYEEINVPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLG-D 145

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQG----LDMRHNMQVDN-KTIGIY 192
            + + P  RGFD+  G+  G + Y+++S +   E   G     D +      N +    Y
Sbjct: 146 ADRFHPLKRGFDTFLGFRGGDRSYFNYSEQ---EMKNGNKHFFDKKLERDFGNYEEPKEY 202

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
            TD+  + A   I E NK +P F+YLA  AVH      P ++  +++AKF +++  +R+ 
Sbjct: 203 LTDVLGKEAAKYI-EQNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG-KRKE 255

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
            A M   LD + G V+  L++ G+ +N+IV+F  DNG PS     N  +N PL G KS  
Sbjct: 256 LAAMTLGLDRASGYVLDKLKELGLDDNTIVVFSNDNGGPS---DKNASNNAPLAGTKSNQ 312

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEINDTSLDGVN 365
            +GG+R        W K  +  S+  + +S  D LPT  +AA  +   + +DGV+
Sbjct: 313 LEGGIR--VPFLISWPKHIKPGSTYDYPVSTLDLLPTFYSAAKGKALGSDIDGVD 365


>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  191 bits (486), Expect = 7e-46,   Method: Compositional matrix adjust.
 Identities = 135/383 (35%), Positives = 199/383 (51%), Gaps = 29/383 (7%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L  +L+F        A +KP+II+ILADDLG+ D+ F G+  I TP+IDALA +G+   Q
Sbjct: 3   LIFSLIFFTYSTLALAAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNFTQ 62

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE------PWGLPLTEKLLPQYLKEAG 123
            Y  A +C PSR+ L+TG+Y    G        GE        G+PL E+++   LK A 
Sbjct: 63  GYQSASVCGPSRAGLLTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAA 122

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y T  IGKWH+G   E   PT R  D +YG+  G   Y +        P      R+N  
Sbjct: 123 YTTGVIGKWHMGLSHE-QRPTQRSVDYYYGFLNGAHSYREAKMDMKGAPMTWPIFRNNEP 181

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           V       Y+T+++ +  +N I + NK KP FLY+++ +VH      P++A  +++ +  
Sbjct: 182 V---PFSGYTTEVFNDEGVNFI-KRNKDKPFFLYMSYNSVHG-----PWEAQPKDLQRSD 232

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---- 299
            I    RR Y+ M+  +D+ VG +I  L+  G+ EN++V+FM+DNGAP+  +H  +    
Sbjct: 233 HIKKKWRRIYSAMLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPN-NLHEAERAGD 291

Query: 300 --GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIE 355
              SN  LRG K   ++GG+R V  I S W +   K S+    +S  D +PTL   +   
Sbjct: 292 YLASNGSLRGRKGDTYEGGIR-VPYIMS-WPQVIPKQSTYQHPVSGLDIVPTLIHISQAA 349

Query: 356 INDTSLDGVNQWDVLTKGAKTKR 378
                L GVN    +T G KT R
Sbjct: 350 PAKKELSGVNLMPYIT-GEKTSR 371


>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 196

 Score =  190 bits (483), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 101/225 (44%), Positives = 134/225 (59%), Gaps = 44/225 (19%)

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +G WHLG+++E YTPT RGF+S YGY+   +DYY+H+          LD+          
Sbjct: 11  LGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYYNHT----------LDL---------- 50

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
                                   P+FLYL+HLAVHAG  Y P QAP+E V KF  I   
Sbjct: 51  ------------------------PLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVK 86

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            R  YAG V  LD+SVG V  AL K  ML+N++++F++DNGA  +G  SN GSN PLRG 
Sbjct: 87  NRSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGAIPWGEWSNAGSNWPLRGA 146

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           K T W+GG+R  A +WSP L++ Q+VS++L H+SDW+PTL + AG
Sbjct: 147 KFTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191


>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 484

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 137/377 (36%), Positives = 188/377 (49%), Gaps = 40/377 (10%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           R   F L  T+      +N  A +KP+II + +DD G+ D  F GS  + TPN+D LA +
Sbjct: 3   RNLIFILLLTVA-----INLNAQEKPNIIFLFSDDAGYADFGFQGSKIMKTPNLDKLAKS 57

Query: 65  GLILNQHYV-QALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKL 114
           G    Q YV  A C PSR+ L+TGKY    G         M        +  GLPL +  
Sbjct: 58  GAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSANSKFLADDMGLPLDQLT 117

Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKATFEP 172
           +  +LK+ GY T   GKWHLG   + Y PT RGFD  YG+  G ++Y  Y+   KA    
Sbjct: 118 IADHLKKLGYKTAMYGKWHLG-DADRYHPTKRGFDEFYGFRGGARNYFGYNDVSKAN--- 173

Query: 173 YQGLDMRHNMQVDN-KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
              LD R      N +    Y TD   + A++ I E NK  P F+YLA  AVH      P
Sbjct: 174 ---LDNRMERGFGNYQEPTEYVTDALAKEAVSFI-EKNKGNPFFIYLAFNAVHT-----P 224

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
            QA  E++ KF +++  +R+  A M   LD + G V+  L++ G+ +N+IV+F  DNG P
Sbjct: 225 MQATKEDLNKFPNLTG-KRKELAAMTLALDRACGTVLNKLKELGLDKNTIVVFSNDNGGP 283

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLC 349
           +     N   N PL G KS   +GG+R        W KQ +  +   F +S  D LPT  
Sbjct: 284 T---DKNASLNLPLSGTKSNHLEGGIR--VPFLISWPKQIKSKTVYNFPVSTLDLLPTFY 338

Query: 350 AAAGIEIND-TSLDGVN 365
           AA G  + D   +DGV+
Sbjct: 339 AAGGGNVADLKDIDGVD 355


>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
 gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
          Length = 472

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 129/356 (36%), Positives = 191/356 (53%), Gaps = 32/356 (8%)

Query: 26  APKKPH-IIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSA 83
           +PKK + +++I+ADDLG+ D+ F GS+QI TP++D LA NG+   Q YV  A+C+PSR+ 
Sbjct: 30  SPKKHYNLVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAG 89

Query: 84  LMTGKYPIHIGMQHGVILEG-EPW------GLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
            +TG   +  G  H   L G EP       G+PL++K +  +L + GY    IGKWHLG 
Sbjct: 90  FITGINQVEFG--HDNNLAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLGK 147

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYST 194
               + P  RGFD  +GY  G  DY++   + K   EP     +  N +  +     Y T
Sbjct: 148 -EPQFHPLKRGFDEFWGYTGGGHDYFESLPNGKGYKEP-----LESNFKTPDPIT--YIT 199

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           D     +++ I  H K +P FL+ A  A H      P QA +E++A +  I D +RRTYA
Sbjct: 200 DDVGNESVDFIERH-KDEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYA 253

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            MV RLD +VG ++ +L + G+ EN++V+F +DNG P+    SN   N P RG K    +
Sbjct: 254 AMVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPT---DSNASLNAPYRGQKGILLE 310

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           GG+     +  P L     +  E     D +PT  A AG    +TS+D  +  D++
Sbjct: 311 GGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPTFLALAG--DTETSMDMFSGVDLI 364


>gi|196230145|ref|ZP_03129008.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225742|gb|EDY20249.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 487

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 129/355 (36%), Positives = 190/355 (53%), Gaps = 28/355 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSA 83
            A + P++I+I+ADDLG+ DV  +G+    TPN D LA+ G      H  QA+C+ SR+A
Sbjct: 28  AADRPPNVILIVADDLGYADVGVYGAKGFETPNFDRLAHEGRRFTDFHVAQAVCSASRAA 87

Query: 84  LMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           +MTG YP  IG++  +    EPW   G+   E  +PQ  K  GYAT  +GKWHLG   E 
Sbjct: 88  IMTGCYPNRIGIEGAM----EPWYKFGISDQELTMPQMFKRKGYATGMVGKWHLGTPTE- 142

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEP---YQGLDMRHNMQVDNKTIGIYSTDL 196
           + PT RGFD  +G       +  H  K   F P   Y+G D   N  ++++ +   +T  
Sbjct: 143 FLPTHRGFDEWFGLPYSNDQWPLHPEKPGKFPPLPLYEG-DKVINPGINHRDMEQLTTQ- 200

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           YTE A+N I + N  KP FLY+A    H      P    D    KF   +  +R  Y   
Sbjct: 201 YTERAVNFI-DRNHDKPFFLYVAQTMPHV-----PLAVSD----KFRGAT--KRGLYGDA 248

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  +D S G ++AAL+K+G+ +++I++F++DNG P     ++ GS +PLR  K+T WDGG
Sbjct: 249 VEEIDWSTGEILAALKKYGLEKDTIIMFLSDNG-PWLIFGNHAGSAYPLREGKTTTWDGG 307

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
            R    I  P       VS+E+    D +PTL    G ++ +  +DG++ W +L 
Sbjct: 308 TRVPFIISWPGHIPAGTVSTEMACAIDLMPTLAKLIGAKMPEHKIDGLDIWPLLA 362


>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
 gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
          Length = 461

 Score =  189 bits (479), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 120/365 (32%), Positives = 189/365 (51%), Gaps = 27/365 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K P+ ++I+ADD GWND SFHGS +I TPN+D LA  GL L++ Y    C+P+R++L+TG
Sbjct: 37  KTPNFLVIIADDAGWNDFSFHGS-EIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTG 95

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +    +G+   +    E   LP +   LPQ L +  Y T  +GKWHLG   E   P   G
Sbjct: 96  RPASRMGIVAPISGRSE-LNLPDSITTLPQALSKLNYKTALMGKWHLGLKPES-GPEVYG 153

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD  YG+  G  D Y H+ K     +     R+   +  K    + TDL T++A++ I  
Sbjct: 154 FDFSYGFLHGQLDQYAHTYKNGDSTW----YRNGKFISEKG---HVTDLLTQSAVHYIDT 206

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
               +  +L +A+ A H      P Q P E + K+  I D  RR YA  ++ +D  +G +
Sbjct: 207 LQTDQNFYLQVAYSAPHI-----PLQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEI 261

Query: 268 IAALRKHGMLENSIVLFMADNGAPS-----------FGIHSNKGSNHPLRGMKSTPWDGG 316
           +  L+   + +N++VLF++DNGA             +G + + GSN PLR  K++ ++G 
Sbjct: 262 LQKLKDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNYSLGSNLPLRDFKTSNYEGA 321

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKT 376
           +R V AI S         S+   +++DW+PT    A  E   ++++GVN   +L   +  
Sbjct: 322 LR-VPAIISWPENLNSGTSTNYINVTDWMPTFLNWANAEELPSTVEGVNAQHLLKTASVA 380

Query: 377 KRSEI 381
           +   I
Sbjct: 381 REKPI 385


>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 458

 Score =  189 bits (479), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 130/373 (34%), Positives = 200/373 (53%), Gaps = 20/373 (5%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
           +  T +F    L  +    P++IIIL DD G+ DV F+G   IPTP++D++A NG+  ++
Sbjct: 1   MNKTFIFLYLILMNSIIAGPNLIIILTDDQGYQDVGFNGCKDIPTPHLDSIAQNGVNCID 60

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWGLPLTEKLLPQYLKEAGYATH 127
            H    +C PSR+ L+TG+Y    G      +  E    GLPL EK + + LKE GY++ 
Sbjct: 61  AHVSYPVCGPSRAGLLTGRYQDRFGFTTNPTVNPENPIAGLPLEEKNIAEVLKEVGYSSS 120

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYY--DHSCKATFEPYQGLD-MRHNMQV 184
            IGKWH+G    ++ P  RGFD  +G+  G  DY+   ++ K   E  +  D  R ++  
Sbjct: 121 IIGKWHMGT-HPIHHPLNRGFDHFFGFLSGGHDYFPAKYNLKDLSEVKRIWDWYRTHLIR 179

Query: 185 DNKTIGI---YSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           D + I +   Y TD+ T+AA++ I  + ++ KP  LYL++ A H      P QA ++ + 
Sbjct: 180 DRERIQVSEGYLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHT-----PLQASEKYLK 234

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
           +F  I D +RRTYA MVS +D+ VG V+A L+KH + E+++V F++DNG P  G      
Sbjct: 235 RFTHIKDSKRRTYAAMVSAVDDGVGRVMARLKKHKIEEDTLVFFLSDNGGPLKGPMPFT- 293

Query: 301 SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-D 358
            N P    K +  +GG R   A+ W   +   Q     +  + D L T+ A   ++ +  
Sbjct: 294 DNGPFSKGKGSLHEGGTRVPFAVQWKGTIPSGQIYKKPMSSL-DILATITALNKLKPHPK 352

Query: 359 TSLDGVNQWDVLT 371
             LDGVN    LT
Sbjct: 353 RPLDGVNIIPYLT 365


>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 460

 Score =  189 bits (479), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 128/384 (33%), Positives = 188/384 (48%), Gaps = 30/384 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           + P++I ILADD+G+ D          TPN+D L   G  ++  Y  + +C P+R+AL+T
Sbjct: 6   QHPNVIYILADDMGYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLT 65

Query: 87  GKYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           G+YP     + G +   E  G   + L E  L    +  GY T  IGKWHLG   + Y P
Sbjct: 66  GRYP----HRTGAVDTYEAIGGDRMALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHP 121

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             RGFD+  G+  G  DYY +           LD    ++  +   G Y TD+ TE +I 
Sbjct: 122 CRRGFDTFIGFRGGWSDYYQYK----------LDRNGILEASD---GTYMTDVITEESIR 168

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I E N+ +P FL+ A+ A H      PFQ P+E  A F    +P   T  GM++ +DE 
Sbjct: 169 FIRE-NREQPFFLHAAYNAPHF-----PFQCPEEYAAPFRKRFNPTLATLYGMIACMDEG 222

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           +G ++  L + G+ +N+IV+F +DNG   +G  +N+  N  L G K   ++GG+R  A I
Sbjct: 223 IGKLLDVLEETGLSDNTIVIFASDNGPQLYG-DTNR-YNCYLNGQKGEAFEGGIRVPAVI 280

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEIL 382
             P            FH  DW PTL +A  +E+ D  SLDG N+ D +  G+ +      
Sbjct: 281 RWPGHIAPDSRCHAFFHGVDWFPTLLSACRLELPDGVSLDGKNRLDEILLGSSSPSEPYY 340

Query: 383 HNIDNVDNPQKYYAALRVDDLKYV 406
              +      K  AA+R    K V
Sbjct: 341 WQWNRFTPVSKCNAAVRCGKWKLV 364


>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
 gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
          Length = 477

 Score =  189 bits (479), Expect = 5e-45,   Method: Compositional matrix adjust.
 Identities = 134/402 (33%), Positives = 194/402 (48%), Gaps = 43/402 (10%)

Query: 6   KYFFALTCTLLFNDAF-LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           K+  AL   +L      ++     KP+ +II ADD+G+ D+S +G   I TPN+D +A  
Sbjct: 7   KFKLALFIAILLGGTITISAQIEDKPNFVIIFADDMGYGDLSSYGHPTIKTPNLDKMAME 66

Query: 65  GLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW---GLPLTEKLLPQYLK 120
           G      YV  ++CTPSR+A+MTG+ P+  GM   V    +PW   GLP +E  + + LK
Sbjct: 67  GQKWTNFYVGSSVCTPSRAAIMTGRLPVRNGMTSKVHRVLQPWSTNGLPQSEITIAEQLK 126

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL---------------QDYYDHS 165
           + GY T A GKWHLG  +E Y P   GFD    YW G+               QDY++  
Sbjct: 127 KGGYGTAAFGKWHLG-DKEEYLPVNNGFD----YWYGVPYSNDMDITLEMNEPQDYWNFW 181

Query: 166 CKATFEPYQGLD---MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLA 222
                +     +   MR+   ++        T  YT  AI  I EH K  P F+YLA+  
Sbjct: 182 KSEERKEISSFNIPLMRNTEIIERPANQHTITKRYTNEAIEYIKEH-KENPFFIYLAYNL 240

Query: 223 VHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
            H      P  A +E   K       +R  Y  +V  +D +VG V+ AL++ G+ EN+IV
Sbjct: 241 PHV-----PLFASEEFEGK------SKRGLYGDVVEEIDHNVGRVMEALKEQGIAENTIV 289

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
           +F +DNG P   +    GS   LR  K + W+GGMR     WSP       + +EL    
Sbjct: 290 VFTSDNG-PWLVMDEEGGSAGLLRDGKGSTWEGGMREPGIFWSPG-NLKPGIVTELGSTM 347

Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           D   T    AG+E+ +D  LDG +    L +G ++ R  + +
Sbjct: 348 DLFTTFSKLAGVELPDDRILDGEDLSSTLFEGKESPRETMFY 389


>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  188 bits (478), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 131/360 (36%), Positives = 192/360 (53%), Gaps = 23/360 (6%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGS-SQIPTPNIDALAYNGLILNQHYV 73
           L F  A L+ +A K P+II+ILADDLG+ DVS+HG+  +  TP+ID++A +G      Y 
Sbjct: 9   LCFVLASLSASAAK-PNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYS 67

Query: 74  QA-LCTPSRSALMTGKYPIHIGMQHGV----ILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
            A +C PSR+ L++G+Y    G    +    + +    GLPL++KL+P+ L + GYAT  
Sbjct: 68  AAPVCGPSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGM 127

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQG-LQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
           +GKWH G  +  + P  RGF   YG+  G + ++       T + + G   R N +V+N 
Sbjct: 128 VGKWHDGD-QHKFWPYNRGFQEFYGFNNGAINNWVLKGENHTVDEW-GAVHRENKRVENS 185

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
             G Y T+ +   A+  I  H K++P FLYL+  AVH      P QAP     +F  I  
Sbjct: 186 --GEYMTEAFGREAVEFIDRH-KTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIKP 237

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             R     M+  +D+++G V+  LRK G+ EN+I+ F +DNG    G +S  G     RG
Sbjct: 238 ENRALCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLKGNYSFNGK---YRG 294

Query: 308 MKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
            K+T +DGG+    A+ W   +    K      H  D   T+ AAAG+EI D   LDG N
Sbjct: 295 EKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAGVEIKDEYKLDGRN 354


>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 471

 Score =  188 bits (477), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 128/376 (34%), Positives = 191/376 (50%), Gaps = 46/376 (12%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F ++ C       F  ++A +KP+II + +DD G+ D  F GS+++ TPN+D LA +G+ 
Sbjct: 7   FLSICC-------FSISSAQEKPNIIFLFSDDAGFADFGFQGSTEMKTPNLDKLANSGVK 59

Query: 68  LNQHYV-QALCTPSRSALMTGKYPIHIGMQH----GVILEGEPW-----GLPLTEKLLPQ 117
             Q YV  A C PSR+ L+TGKY    G +     G + E   +     GLPL +  +  
Sbjct: 60  FTQGYVTDATCGPSRAGLITGKYQQRFGYEEINVPGYMSENSKFLADDMGLPLDQLTIGD 119

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
           YLK+ GY T   GKWHLG   + + P  RGFD  YG+  G + Y+ +   ++        
Sbjct: 120 YLKKLGYNTAMYGKWHLG-NADRFHPMNRGFDEFYGFRGGARSYFGYDVASS-------- 170

Query: 178 MRHNMQVDNKTIGI-----YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
             H+ +++           Y TD   + AI+ I E NK  P F+YLA  AVH      P 
Sbjct: 171 AHHDTKMERGFGNFEEPQEYVTDALADEAISFI-EKNKKNPFFIYLAFNAVHT-----PM 224

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
           +A  E++ +F +++  +R+  A M   LD + G V+  L++ G+ +N+I++F  DNG P+
Sbjct: 225 EATKEDLNQFPNLTG-KRKELAAMTLALDRACGRVLDKLKQLGLDKNTIIVFSNDNGGPT 283

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCA 350
                N   N PL G KS   +GG+R        W K  Q  ++  F IS  D LPT  A
Sbjct: 284 ---DKNASLNLPLSGTKSNHLEGGIR--VPFLMSWPKHIQPKTTYTFPISTMDLLPTFYA 338

Query: 351 AAGIEIND-TSLDGVN 365
           A G    D   +DG+N
Sbjct: 339 AGGGNTKDLQEIDGIN 354


>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 434

 Score =  188 bits (477), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 126/372 (33%), Positives = 195/372 (52%), Gaps = 29/372 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+II+   DDLG+ D+  +GS  + TP++D LA  G+     Y  + +C+PSR++L+T
Sbjct: 2   KRPNIIVFYCDDLGYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLT 61

Query: 87  GKYPIHIGMQHGVILEGEPW--GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           GKYP   G+    IL G+    GL L +  L   LKE GY T   GKWHLG   E Y P 
Sbjct: 62  GKYPAKAGVTS--ILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGASAE-YGPN 118

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAA 201
             GFD  YG+  G  DYY H     +    G++  H++  +   +   G Y T+  T  A
Sbjct: 119 AHGFDQFYGFRAGCIDYYSHIF--YWGQGGGVNPVHDLWRNETEVWENGEYMTEAITREA 176

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
            + I      +P F+Y+A+ A H      P  AP   + +F D+  P+RR  A M++ +D
Sbjct: 177 TSYIDAAPDDEPYFMYVAYNAPHY-----PMHAPKAYLDRFPDLP-PDRRIMAAMIAAVD 230

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS---------NKGSNHPLRGMKSTP 312
           + VG ++ AL++ G  E++I+ F +DNG PS    +           GS    RG K++ 
Sbjct: 231 DGVGEIVKALKQKGAYEDTIIFFSSDNG-PSTESRNWLDGTEDLYYGGSAGRFRGHKASL 289

Query: 313 WDGGMRGVAAIWSP--WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           ++GG+R  A +  P    +Q  ++S E+F + D  PT+   +GI     SLDG + +D L
Sbjct: 290 FEGGIREPAILSYPAGLAEQQGQISDEMFAMMDIFPTMLELSGIGTEGYSLDGHSVFDAL 349

Query: 371 TKGAKTKRSEIL 382
           +  A + R ++ 
Sbjct: 350 SGNALSPRKQLF 361


>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 469

 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 125/354 (35%), Positives = 184/354 (51%), Gaps = 20/354 (5%)

Query: 19  DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALC 77
           ++F    + +KP+I++ILADDLG+ D+ F GS +I TPNIDALA NG      YV    C
Sbjct: 6   NSFNVNASEQKPNIVVILADDLGYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYC 65

Query: 78  TPSRSALMTGKYPIHIGMQHGV--ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
            PSR  L+TG+Y   +GM++ V  + + +  GLPL+E      L++ GY T   GKWHLG
Sbjct: 66  GPSRVGLLTGRYQARLGMENNVSYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLG 125

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
                + P  RGFD  YG+  G  +Y         + Y  L +  N QV       Y T 
Sbjct: 126 GAPH-FQPNKRGFDYFYGFLDGGHNYMPDQVTVGGDGYS-LPLMRNTQV--TEFDEYLTT 181

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
             +  A+  I    +S P F+YL++ A H      P QAP E + K+  I D +RR YA 
Sbjct: 182 ALSRDAVKYIHRQQES-PFFMYLSYNAPHT-----PLQAPAEYIEKYKHIEDEDRRVYAA 235

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---PSFGIHSNKGSNHPLRGMKSTP 312
           M+  +DE +G V+ AL+    L+N++++F++DNG     S+   S+   N P R  K+  
Sbjct: 236 MIDVMDEGIGMVVDALKDINQLDNTLIIFLSDNGGVFPESWQPTSDWADNSPFRRGKAAL 295

Query: 313 WDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
            +GG+     A W   + + ++ +  L    D   T  A AG +  +  LDGVN
Sbjct: 296 LEGGIHVPFIAHWPGVIPKGEEFTG-LVSSLDIAATSVALAGAD--NKKLDGVN 346


>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 889

 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 170/600 (28%), Positives = 257/600 (42%), Gaps = 150/600 (25%)

Query: 58  IDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
           ++ LA  G+ L   Y    CT SR+AL TGKYP+H+G+Q  VI   EP G+PL E  L Q
Sbjct: 1   MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE------ 171
            LK+AGY T  +GKWHLGF +  YTP+ RGFD  +G   G  D+  H+   +F       
Sbjct: 61  KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFAMRGPDY 120

Query: 172 -----PYQGLDMRHNMQ-VDNKTIGI---YSTDLYTEAAINVIAEHNKSK--PMFLYLAH 220
                   G ++ HN + V     G+   +ST LYT  +   +A++ +    P+FL +++
Sbjct: 121 KSKITTMTGSNLWHNGRPVGEDEEGVRDTHSTHLYTAKSRAYVAKYAEMADVPLFLTVSY 180

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPE------------RRTYAGMVSRLDESVGNVI 268
            AVHA     P Q PDE V      +  E            RR   GMVS +DES+ ++ 
Sbjct: 181 QAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESLASIH 235

Query: 269 AALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWL 328
             L+   M EN+++ F++DNG    GI  +  SN  LRG K + ++GG+R  A +   +L
Sbjct: 236 DDLKAEAMWENTLLFFLSDNG----GILRHGSSNLDLRGEKGSYYEGGIRVPAFVSGGYL 291

Query: 329 KQ---TQKVSSELFHISDWLPTLCAAAGIE------------------------------ 355
            +     +V   L HI+D   T    AG+                               
Sbjct: 292 AKLLSAGRVFEGLVHITDLHATALRVAGVSSKEDKRAAPAVAAAKVGHLRRLEPGQQPQS 351

Query: 356 -----INDT--------SLDGVNQWDVLTKGAK---TKRSEILHNIDNVDNPQKYYA--A 397
                ++D         SLDGV+QWD    GA    ++R+ + HNI    N + + A  A
Sbjct: 352 YAETALDDVLSHEHLTDSLDGVDQWDAFVSGASAAASQRTHVAHNI----NSELFGAAGA 407

Query: 398 LRVDDLKYVAGTDNNGQSDEWYGD---TDNEIDKYSPKEVLYSK----AGIT--FNALKT 448
           LRV + K +            YG     D+  D+    +V++ K     G T  F+ +  
Sbjct: 408 LRVGEYKLIVEARVTESEIYEYGQHMLQDDNWDQSELSQVIHQKLLRTPGTTSLFDVVAN 467

Query: 449 KLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFNI 508
             ++ ++   D ++              + +L EFA V+                     
Sbjct: 468 PNELDEEDCDDARSC-----------RNLYDLDEFADVKA-------------------- 496

Query: 509 TDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIWVPWYDEL 568
                         T L K  E + +  +ST +   + P    ADPA +  +W PW D+L
Sbjct: 497 --------------TILEKWAELRDSTPESTEIWLDDGPL---ADPALFGGVWTPWRDDL 539


>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 446

 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 122/366 (33%), Positives = 188/366 (51%), Gaps = 21/366 (5%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A  KP+I+++ ADD+GW DV++HG     TP IDA+A  G+   Q Y  A +C PSR+ +
Sbjct: 16  AADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGI 75

Query: 85  MTGKYPIHIGMQHGVILEGEP-WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           +TG+Y        GV+  G+   G+P ++K + + LK AGY + A GKWHLG  +  + P
Sbjct: 76  LTGRYQ----QLFGVVTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQF-P 130

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             RGFD+ YG+  G  DYY    K   +      +  N  + +   G Y T+  T+ A+ 
Sbjct: 131 NDRGFDTFYGFHFGAHDYYRADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHAVE 190

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I E NK +P F+Y+A+ +VH+     P+Q PDE +A+  +     RR +  MV  +D+ 
Sbjct: 191 FI-EENKDQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMDDG 244

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS-NKGS-----NHPLRGMKSTPWDGGM 317
           VG + A L++  + EN+I +F  DNG+P  G    N+G      +   RG K   ++GG+
Sbjct: 245 VGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQGFRGYKGDTYEGGI 304

Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKT 376
           R      W   +K   K  + +    D  PT  +AA +E +     G +    L    K 
Sbjct: 305 RVPFCMSWPKKIKSGNKFEAPVI-AYDLAPTFLSAASLEYSTKQFSGKDLLPYLEDEQKG 363

Query: 377 KRSEIL 382
           +  E L
Sbjct: 364 RPHETL 369


>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
          Length = 614

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 165/576 (28%), Positives = 264/576 (45%), Gaps = 97/576 (16%)

Query: 19  DAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALC 77
           D F +T+A   +P++++ + DD G  DVS++ ++ + TP +  ++ +G +L+  Y    C
Sbjct: 32  DEFSSTSADAPRPNMLLTVVDDWGSGDVSYNDAA-LHTPELQRMSEHGFVLDNFYAAPTC 90

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           TPSR+ LMTG+Y I  GMQ  VI   EP G+PL E+ L Q L +AGY T AIGKWHLG  
Sbjct: 91  TPSRAMLMTGRYNIRNGMQDSVIHSTEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMH 150

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
           R+ YTP  RGFD  YG   G   +  H  K               +  +        D  
Sbjct: 151 RDAYTPLKRGFDLFYGILTGGGSHTGHFSKGD------------DKKGDDKKDAKKDDAK 198

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-VAKFLDISDPERRTYAGM 256
              A   +A   K+ P F+YL++ AVH     +P    D+  V++   IS   R T  GM
Sbjct: 199 KGGAHRRLAASGKA-PWFVYLSYQAVH-----DPVTVGDDRYVSETAAISTTNRPTLCGM 252

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           ++ +D+ +  +  +L        +I + ++DNG    G+ ++  +N+PLRG K+  ++GG
Sbjct: 253 IAEIDDGLKTLRLSLENAQAWAQTIFVVLSDNG----GVRAHGSANYPLRGEKAQYYEGG 308

Query: 317 MRGVAAIWSPWLKQTQK--------VSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQ 366
           ++  A +   ++++  +         S+ L HI+D   T  + AG E   +D  LDGV+ 
Sbjct: 309 VKVPAVLAGGYVEEALRRAGGAERLRSASLAHITDVHATFLSLAGYEGGDDDKPLDGVSL 368

Query: 367 WDVLTK------GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYG 420
           +  L +      GA   RS++L NI++         A+RV D K +   + N  S     
Sbjct: 369 FSHLVQSASGVAGAAPPRSDVLININSALFAGS--GAIRVGDHKLMVNPEPNESS----- 421

Query: 421 DTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILEL 480
                         +YSK             +++  AA   A  +D  ++I+        
Sbjct: 422 --------------IYSK-------------VRKALAAQKGAVPTDTFQSIV-------- 446

Query: 481 REFARVRCNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTM 540
                 + + D  GA   S     LF++  +P E+ + A S       L D +A Y    
Sbjct: 447 -----AQAHKDALGAPSIS-----LFDLKKNPYERTDCATSDPAESCNLYD-VAAYADQE 495

Query: 541 VPPGNKPFDKR---ADPARWNNIWVPWYDELDKQKA 573
             P    ++     ADPA + ++W PW D  D  KA
Sbjct: 496 AQPSTFAWEDDGPLADPALFGSVWAPWRDGDDTPKA 531


>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
          Length = 2048

 Score =  187 bits (476), Expect = 1e-44,   Method: Composition-based stats.
 Identities = 133/397 (33%), Positives = 199/397 (50%), Gaps = 61/397 (15%)

Query: 16  LFNDAFLN---TTAPKKPHIIIILADDLGWNDVSFHGSS-QIPTPNIDALAYNGLILNQH 71
           L + A+L+   +   ++ +I++ILADD+G+NDV +  +     TP +D+LA  G+ L  +
Sbjct: 180 LLDTAYLDELPSATTQQQNILLILADDMGYNDVGYQSTDLGAATPTLDSLASRGVELASY 239

Query: 72  YVQALCTPSRSALMTGKYPIHIGMQH---GVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           Y    CTP+RS+L T  Y  + GM +   G  +    +GLPL   L+ + LK  GYATH 
Sbjct: 240 YTLPSCTPARSSLFTANYAANNGMGYDGPGTYVIDSSYGLPLGATLMSEVLKGRGYATH- 298

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVD 185
                                        ++ Y+ H    T +P  G    D+  +  VD
Sbjct: 299 -----------------------------IETYFSHEAYGT-DPNNGTSYCDLMRSNSVD 328

Query: 186 NKTIG-----IYSTDLYTEAAINVIAEH--NKSKPMFLYLAHLAVHAGNTYEP---FQAP 235
              +G      YSTDLY    ++V+ EH      P+FLY A  A HA     P     AP
Sbjct: 329 GVKVGECYLDEYSTDLYARRLVDVVGEHATTSDSPLFLYFAAQAPHAPLEEPPPMNATAP 388

Query: 236 DEE-VAKFLDISDP--ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            E  +A  +   D   +RRT+A MV  LD +V  V+ AL    +L+N++V F +DNG   
Sbjct: 389 QEAMLAALVGAGDSWGKRRTFAKMVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNG--- 445

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQ--TQKVSSELFHISDWLPTLCA 350
            G   + GSN PLRG K + ++GG+R  A +WS  L++     V   LFH++DW PTL A
Sbjct: 446 -GCPRDGGSNWPLRGTKFSQYEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVA 504

Query: 351 AAGIEINDTSLDGVNQWDVLTKGAKTK-RSEILHNID 386
           +AG+      +DGVNQ+D L    +   R E+L +++
Sbjct: 505 SAGVRATVGKIDGVNQYDALFDATQNAPRDEVLIHLN 541


>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
           ATCC 8503]
          Length = 452

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 143/420 (34%), Positives = 218/420 (51%), Gaps = 39/420 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
           K+ GY    +GKW+LG      TPT RGFD + G+   + D YY H        +    M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174

Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           R N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P 
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226

Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           E  E  K  + S PE+R     ++  LD +VG V  AL+K+G LEN+I++F +DNG    
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNG---- 282

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
           G      +N P RG K   ++GG+R    I+  W  Q +  V      +SD  PTLC   
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
            + ++   +DG++   +L    +     ++H +    N +   K Y A R  D K +  T
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399


>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 826

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 132/358 (36%), Positives = 192/358 (53%), Gaps = 33/358 (9%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  +P+I++I+ADDLG++DV F+G  +IPTP++D LA +G++    Y     C+PSR+ L
Sbjct: 41  AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100

Query: 85  MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +TG++    G +         HG   E  P G+PL+E  L   LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLSETTLADALKEAGYVTGAIGKWHLG 156

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
             +  + P  RGFD  +G+  G   Y+    K   +P  G+  R +  VD KT+   + D
Sbjct: 157 DAKPFW-PNHRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVDPKTLTHLTDD 212

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
             TE A+  I  H +S+P FLYLA+ A HA     P  A    + K   I    R  Y  
Sbjct: 213 FSTE-AVKFIQRH-ESEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGA 265

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV+ +DE +G V+  +R+ G+ EN++V+F +DNG            N P RG K   ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMVIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320

Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
           G+R    + W   ++   K  S +  + D  PT  AAAG++ + +  LDG N   VLT
Sbjct: 321 GIRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377


>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
 gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
          Length = 452

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 141/419 (33%), Positives = 217/419 (51%), Gaps = 37/419 (8%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
           K+ GY    +GKW+LG      TPT RGFD + G+   + D Y   C+     +    MR
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTHCR-----FGNNYMR 175

Query: 180 HNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
            N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P E
Sbjct: 176 ENLKEIDPQG---HATEVFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQE 227

Query: 238 --EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
             E  K  + S PE+R     ++  LD +VG V  AL ++G LEN+I++F +DNG    G
Sbjct: 228 WLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG----G 283

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAG 353
                 +N P RG K   ++GG+R    I+  W  Q +  V      +SD  PTLC    
Sbjct: 284 QDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLTA 341

Query: 354 IEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
           + ++   +DG++   +L    +     ++H +    N +   K Y A R  D K +  T
Sbjct: 342 VPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399


>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
 gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
          Length = 474

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 138/395 (34%), Positives = 194/395 (49%), Gaps = 35/395 (8%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           KY   L    L   +  +  A   P++I+I ADDLG+ D+   G   I TP +D +A  G
Sbjct: 13  KYASPLIALFLLLTSVADLNAAPGPNVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEG 72

Query: 66  LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
           +   Q Y  A +CTPSR+AL+TG+YPI  GM   +  V+      G+P +E  L + LK 
Sbjct: 73  MKFTQFYSAAPVCTPSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKA 132

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYD--HSCKATFEP---YQ 174
           AGY T  +GKWHLG   + + PT  GFDS++G  Y   +    D  H      +P   + 
Sbjct: 133 AGYKTACVGKWHLGHLPQ-FLPTNNGFDSYFGIPYSNDMDRVADRKHGRSIFLKPEVKFW 191

Query: 175 GLDMRHNMQV-----DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
            + +  N +V     D  TI    T  YTE AI +I + NK +P F+YLAH   H     
Sbjct: 192 NVPLMRNTEVVELPADQTTI----TKRYTEEAIKLI-QQNKQQPFFIYLAHNMPHV---- 242

Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
             F++P+     F D S   R  Y  ++  +D SVG V+  L+  G+ +N+IV F +DNG
Sbjct: 243 PLFRSPE-----FQDKS--RRGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNG 295

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
            P    +   GS   LR  K + W+GGMR     W P    +  VS EL    D   T  
Sbjct: 296 -PWLIFNEQGGSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSI 354

Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
             AG  +  D  +DGV+   VL K   + R E+ +
Sbjct: 355 KLAGGSVPTDQVVDGVDLTPVLLKKENSPREELAY 389


>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
 gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
          Length = 703

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 140/401 (34%), Positives = 210/401 (52%), Gaps = 42/401 (10%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIP----TPNIDALAYNGLILNQHYVQALCT 78
           N    K PH+++I+ DD G  DV ++ S + P    TP +D+LA   + L ++YV  +CT
Sbjct: 55  NPQIAKPPHVLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCT 114

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK-EAGYATHAIGKWHLGFF 137
           P+R+AL+TG+Y +++GM   +I +    GL  +   LP+ LK EA Y+TH +GKWHLG  
Sbjct: 115 PTRAALLTGRYAVNVGMPFPLIGDAIS-GLDGSIPTLPEMLKSEANYSTHLVGKWHLGAA 173

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
           +    P  RGFDS YG      D+Y        + ++        +VD K    ++T L+
Sbjct: 174 KAKNRPLARGFDSFYGLLGASFDHYTKKMGEVRDLWKNEAEVPAKEVDEKE---HATTLF 230

Query: 198 TEAAINVIAEHNKS----------KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-DIS 246
           +  A+ VI EH+             P+FLYLA+ A HA     P QA DE+  K   D+ 
Sbjct: 231 SREAVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQA-DEKFMKLCSDVP 284

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           +  RRT+  M+++LDE +GNV A+L+ HGM E+++V+F +DNG    G     G N+  R
Sbjct: 285 NRHRRTFCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNG----GYFLVGGFNYNYR 340

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE---LFHISDWLPTLCAAA-----GIEIND 358
           G KS  W+GG+R  A I +P  K+      +   + H++DW+PTL   A     G E   
Sbjct: 341 GGKSGGWEGGVRVPAFIRAP--KRYNFAPRDFRGMAHVADWMPTLMGMATGREEGRERMG 398

Query: 359 TSLDGVNQWDVLTKGAKTK--RSEILHNIDNVDNPQKYYAA 397
              D    +    +G K K  R+E L  +D   N   Y + 
Sbjct: 399 DGYDLRKAFRGDEEGGKEKSPRNEALLQLDIFQNTSAYRSG 439


>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
          Length = 500

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/415 (31%), Positives = 199/415 (47%), Gaps = 102/415 (24%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A +KP+II I  DD+G ND+ ++ + ++ TPN+D LA NG+IL  +YV  +C+PSR+A M
Sbjct: 26  ANQKPNIIFIAVDDMGNNDIGYN-NPEVDTPNLDNLANNGVILESNYVYPVCSPSRAAFM 84

Query: 86  TGKYPIHIGMQHGVILEGEPWGLP-----LTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           TG+Y   IG Q G +   +P  +      + EKL   Y    GYA H IGKWHLG+ ++ 
Sbjct: 85  TGRYAHKIGFQRGPVEHKQPAYIESNYKTVAEKLTTNY----GYAAHMIGKWHLGYCKDA 140

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV--------DNKTIGIY 192
            TPT RGFDS YG++ G ++YY      T+   +  D R N+            + +  Y
Sbjct: 141 VTPTNRGFDSFYGFYGGQENYY------TYTSARYKDFRDNLTAVTPQNPNYPREDVDGY 194

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL--------- 243
           ST  Y + AI ++  H+KS P+FLYLA  A H+     P QAP E   K+          
Sbjct: 195 STFEYKKRAIEIVGNHDKSVPLFLYLAFQAPHS-----PLQAPQELKDKYPGDWTYTDWD 249

Query: 244 -DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
                  R T+  MV+ +D++VG V   +   G+L+N+I++ +          H+     
Sbjct: 250 GSTKTTSRATFYAMVTAVDQAVGEVYTEMETQGLLDNAIIVLL----------HA----- 294

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG------IEI 356
                                 S WL                 PT  + AG       +I
Sbjct: 295 ----------------------SDWL-----------------PTFISLAGGSADLTTDI 315

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRV---DDLKYVAG 408
           + TS+DGV+Q  +L    ++ RS+I+ NI +++ P   Y  ++    +  KYV G
Sbjct: 316 DLTSIDGVDQTQMLLNNGESARSKIILNIYDLERPSTIYGIVKQYGPEVWKYVQG 370


>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 491

 Score =  186 bits (473), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 138/402 (34%), Positives = 201/402 (50%), Gaps = 36/402 (8%)

Query: 1   MTWARKYFFA-LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           MTW R    A   C L    A L  +A ++P+I+IILADDLG++D+ + GS++I +P ID
Sbjct: 1   MTWFRIQLLATFACALT---AVL--SADERPNILIILADDLGYSDLGYTGSTEIESPVID 55

Query: 60  ALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILE--GEPWGLPLTEKLLP 116
            LA NG+I    YV    C PSR+ L+TG++    GM+         +  GLP+ E    
Sbjct: 56  KLANNGVIFANGYVTHPYCGPSRAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFA 115

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE----- 171
           + L+ AGY T  IGKWHLG   + + P  RGFD  YG+  G  DY+  S     E     
Sbjct: 116 KRLQPAGYRTGIIGKWHLGAAPQFH-PNNRGFDYFYGFLSGGHDYFPESVNTHLELVLPN 174

Query: 172 --PYQGLDMRHNMQV-DNKTIGIY----STDLYTEAAINVIAEHNKSKPMFLYLAHLAVH 224
             P  G +    + +  NK    +    +T L  +AA  V +     +P  LYLA+ A H
Sbjct: 175 GKPNYGANEGTLLPLLRNKNAAEFDDYLTTALSKDAARFVTSSE---QPFCLYLAYNAPH 231

Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
                 P QAP E +AK+  I DP+RR YA M+  +D  +G V+ AL + G L+N+++ F
Sbjct: 232 T-----PLQAPKETIAKYSHIKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFF 286

Query: 285 MADNGA---PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFH 340
           ++DNG     ++  +S+   N P +  K++ ++GG      A W   LK   +    +  
Sbjct: 287 LSDNGGCVPQAWDSYSDFADNGPFKDGKTSFYEGGTHVPFIAHWPAKLK--PQTYDNIIS 344

Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
             D   T  A AG + +   LDGVN    LT     +  E L
Sbjct: 345 SLDLAATAVALAGGDDSGKPLDGVNLIPFLTGETTGQPHEAL 386


>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
          Length = 502

 Score =  186 bits (473), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 181/363 (49%), Gaps = 26/363 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGK 88
           P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+
Sbjct: 10  PNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGR 69

Query: 89  YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
            PI  G        ++    +    G+P +E+LLPQ LKEAGYAT  +GKWHLG  R  +
Sbjct: 70  LPIRNGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLGH-RPQF 128

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
            P   GFD  +G        YD+  +     Y+  DM         ++ KT     T +Y
Sbjct: 129 HPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRNWDMVGRFYEEFPINVKTGESNLTQIY 188

Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
            + A++ I +   ++ P FLY A  A HA         P      FL IS  +R  Y   
Sbjct: 189 LQEALDFIRQQQAAQHPFFLYWAVDATHA---------PVYASRPFLGIS--QRGRYGDA 237

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  +D+SVG +++ LR   + ENS V F +DNGA      S  GSN P    K T ++GG
Sbjct: 238 VREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALISAPSQGGSNGPFLCGKQTTFEGG 297

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
           MR  A  W P      +VS +L  I D   T  A AG++  +D  +DG+N    + +G  
Sbjct: 298 MREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAGLKPPSDRVIDGLNLLPTMLQGQL 357

Query: 376 TKR 378
             R
Sbjct: 358 MDR 360


>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
 gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 525

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 195/386 (50%), Gaps = 43/386 (11%)

Query: 11  LTCTLLFNDAFLNTTAP----------KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDA 60
           ++CTL+   A L  +            ++P+I++++ADD+GW DV + G+  + TPNIDA
Sbjct: 3   VSCTLVSVAALLPFSGSNAGNVQRDKSQRPNIVLVIADDMGWGDVGYQGAVDVSTPNIDA 62

Query: 61  LAYNGLILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKLLPQY 118
           LA  G+  +Q YV  ++  PSR+ ++TG Y    G  + +     PW  +P  +  L + 
Sbjct: 63  LARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFYNNL----HPWAKIPEGQSTLGEM 118

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
           +++ GYAT  +GKWH+    E  +P  RGFD  YG+W    DYY  + K   E Y    +
Sbjct: 119 VRDCGYATGFVGKWHMADSPE-QSPNRRGFDQFYGFWSDTHDYYRSTDKPGVELYDFCPL 177

Query: 179 RHNMQVDN--KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
             N ++       G Y TD +T  A+  I +H  S P  L L++ AVH+     P+Q P+
Sbjct: 178 YRNGEIQPPLHESGEYITDCFTREAVEFIDKH-ASSPFLLCLSYNAVHS-----PWQVPE 231

Query: 237 EEVAKF--LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SF 293
             V +         +R+ +A MV  LD+ +G V+ +LRK+G+ EN++ + ++DNG+P   
Sbjct: 232 HYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVMESLRKNGLEENTLFILISDNGSPRGQ 291

Query: 294 GIHSNKG------------SNHPLRGMKSTPWDGGMRGVAAI--WSPWLKQTQKVSSELF 339
           GI  + G            S  P RG K+  ++GG+R V  I  W   L Q     + + 
Sbjct: 292 GIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGIR-VPYIMSWPSELPQGMVYDNPVI 350

Query: 340 HISDWLPTLCAAAGIEINDTSLDGVN 365
            +  +   + A  G      SLDGV+
Sbjct: 351 SLDIFPTVMQAVGGTSRQKYSLDGVS 376


>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 234

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 97/225 (43%), Positives = 138/225 (61%), Gaps = 4/225 (1%)

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV--DNKT 188
           +WHLG++   Y P  RGFD+  G++    DYY  +   TF+ + G D  +  ++   +K 
Sbjct: 12  QWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNF--TFDNHTGHDFWNGDKIYWVDKE 69

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
              YST  YT   + +I  HNKS P+FL L+H A H        Q P + +  F  I + 
Sbjct: 70  KKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQEE 129

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            R  +AGMV  LD+SVG+V+ AL+K  ML ++I++F  DNGA  +G  SN+GSN PLRG 
Sbjct: 130 NRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKSNRGSNWPLRGT 189

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           K T ++GG+R  A IWSP +  +++VSS+L HI+DWLPTL +AAG
Sbjct: 190 KFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234


>gi|305665421|ref|YP_003861708.1| arylsulfatase A [Maribacter sp. HTCC2170]
 gi|88710176|gb|EAR02408.1| arylsulfatase A [Maribacter sp. HTCC2170]
          Length = 469

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 130/381 (34%), Positives = 190/381 (49%), Gaps = 31/381 (8%)

Query: 19  DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LC 77
           D F      + P+ III ADDLG+ D+S +G+  I TPN+D +A+ G      YV A +C
Sbjct: 20  DIFGQKANQQTPNFIIIFADDLGYGDLSTYGNPTIHTPNLDRMAHEGQKWTNFYVGASVC 79

Query: 78  TPSRSALMTGKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
           TPSR+AL+TG+ P+  GM      V+      GLP  E  L + LK AGY+T AIGKWHL
Sbjct: 80  TPSRAALLTGRLPVRSGMASSKTRVLFPNSKNGLPANEITLAEQLKTAGYSTAAIGKWHL 139

Query: 135 GFFREVYTPTFRGFDSHYGY-------WQGLQDY----YDHSCKATFEPYQGLDMRHNMQ 183
           G  +E Y PT  GFD ++G        + G  +Y     ++      E Y    +++   
Sbjct: 140 G-HKEQYLPTNNGFDYYFGIPYSNDMDYIGKANYKNLAQNNPSDIKIENYNVPLLQNTKI 198

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           ++  T     T  YT+  I  I E NK +P F+YLAH   H       F + + E     
Sbjct: 199 IERPTNQNTITKRYTQETIKFIKE-NKREPFFVYLAHSLPHI----PLFTSKEFE----- 248

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
                +   Y  +V  +D  VG ++ AL + G+ +N+IV+F +DNG P     ++ GS  
Sbjct: 249 --GHSKNGIYGDVVEEIDHGVGEILNALEEEGLDQNTIVVFTSDNG-PWLPFKTHGGSAG 305

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
            LR  K + W+GGMR     W P L + + + +++    D   T    AG+EI ND  +D
Sbjct: 306 LLRAGKGSTWEGGMREPGIFWGPGLVK-KGLVTDMGSTMDLFTTFSTMAGVEIPNDRIID 364

Query: 363 GVNQWDVLTKGAKTKRSEILH 383
           G +    L  G  + R  IL+
Sbjct: 365 GHDLSKTLRNGDASPRQHILY 385


>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 826

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 131/359 (36%), Positives = 189/359 (52%), Gaps = 35/359 (9%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  +P+I++I+ADDLG++DV F+G  +IPTP++D LA +G++    Y     C+PSR+ L
Sbjct: 41  AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100

Query: 85  MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +TG++    G +         HG   E  P G+PLTE  L   LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLTETTLADALKEAGYVTGAIGKWHLG 156

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
             +  + P  RGFD  +G+  G   Y+    K   +P  G+  R +  VD KT+   + D
Sbjct: 157 DAKPFW-PNRRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVDPKTLTHLTDD 212

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
             TE A+  I  H +++P FLYLA+ A HA     P  A    + K   I    R  Y  
Sbjct: 213 FSTE-AVKFIQRH-ETEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGA 265

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV+ +DE +G V+  +R+ G+ EN++++F +DNG            N P RG K   ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
           G+R        W    +    E   I+  D  PT  AAAG++ + +  LDG N   VLT
Sbjct: 321 GIR--VPFLVSWPGTVRSGMKEELPITALDLFPTALAAAGMDPSRNDKLDGQNLLPVLT 377


>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
 gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 142/420 (33%), Positives = 217/420 (51%), Gaps = 39/420 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
           K+ GY    +GKW+LG      TPT RGFD + G+   + D YY H        +    M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174

Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           R N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P 
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226

Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           E  E  K  + S PE+R     ++  LD +VG V  AL ++G LEN+I++F +DNG    
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
           G      +N P RG K   ++GG+R    I+  W  Q +  V      +SD  PTLC   
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
            + ++   +DG++   +L    +     ++H +    N +   K Y A R  D K +  T
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGKAYYASRYKDYKILQNT 399


>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 826

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 128/358 (35%), Positives = 192/358 (53%), Gaps = 33/358 (9%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  +P+I++I+ADDLG++DV F+G  +IPTP++D LA +G++    Y     C+PSR+ L
Sbjct: 41  AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100

Query: 85  MTGKYPIHIGMQ---------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           +TG++    G +         HG   E  P G+PL+E  L   LKEAGY T AIGKWHLG
Sbjct: 101 LTGRHQQRFGHESNPEPDTQWHG---EDTP-GMPLSETTLADALKEAGYVTGAIGKWHLG 156

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
             +  + P  RGFD  +G+  G   Y+    K   +P  G+  R +  V+ KT+   + D
Sbjct: 157 DAKPFW-PNHRGFDEWFGFSGGGFSYWGDLGKK--DPLLGV-HRGDEPVEPKTLTHLTDD 212

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
             TEA   +  + N+++P FLYLA+ A HA     P  A    + K   I    R  Y  
Sbjct: 213 FSTEAVKFI--QRNETEPFFLYLAYNAPHA-----PDHATRAHLQKTTHIEYGGRAVYGA 265

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV+ +DE +G V+  +R+ G+ EN++++F +DNG            N P RG K   ++G
Sbjct: 266 MVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEG 320

Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
           G+R    + W   ++   K  S +  + D  PT  AAAG++ + +  LDG N   VLT
Sbjct: 321 GIRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377


>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
          Length = 266

 Score =  185 bits (469), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 98/239 (41%), Positives = 140/239 (58%), Gaps = 35/239 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           ++P+I+I++ADD G+ D+ +HG+ +  TPN+D LA  G+ L  +YVQ +C+P+RS LMTG
Sbjct: 26  QQPNIVIVVADDYGYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P+GLPL    +   LK  GY+THAIGKWHLG +++ YTP +RG
Sbjct: 85  RYQIHTGLQHDIIWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRG 144

Query: 148 FDSHYGYWQGLQDYY---------------DHSCKATFEP-------------YQGLDMR 179
           FDS+YGY +G +DYY               D S   ++ P             + G D+R
Sbjct: 145 FDSYYGYLEGGEDYYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDLR 204

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
              +      G YST LYT+ AI++I       KP  LYLA+ AVH+     P + P E
Sbjct: 205 DMNEPVTDMNGTYSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHS-----PMEVPAE 258


>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 453

 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 128/359 (35%), Positives = 200/359 (55%), Gaps = 43/359 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           KKP+I++ILADDLG+ D+S +G+  I TP+ID+L   G+  +  Y   ++C+PSR+AL++
Sbjct: 27  KKPNILLILADDLGYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLS 86

Query: 87  GKYPIHIGMQHGVI--LEGEPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           G+YP  +G+  GVI  +  + WG L  +  LLP  LK+ GY T  +GKWHLG       P
Sbjct: 87  GRYPEQVGVP-GVIRTMPDDNWGYLSPSAVLLPSILKKNGYYTALVGKWHLG-LEPPNLP 144

Query: 144 TFRGFDSHYGYWQG--LQDYYDHSCKATFEPYQGLDMRHN---MQVDNKTIGI--YSTDL 196
             RGFD  +G+ +G  + DYY H             +RH+   M+++ +TI    ++TDL
Sbjct: 145 NDRGFDLFHGF-EGDMMDDYYTH-------------LRHDRNYMRLNRQTINPQGHATDL 190

Query: 197 YTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL----DISDPERR 251
           +T+ A + + +   +S P FLYLA+ A H     +P Q P + +AK       IS+ +R 
Sbjct: 191 FTQWATDYLEQRAGQSNPFFLYLAYNAPH-----DPIQPPADWLAKVKARQPGISE-KRA 244

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
              G++  +D+ +G VI  LR  G+ EN++++F++DNG   F    +  +N PLR  K  
Sbjct: 245 KLVGLIEHMDDGIGKVIQTLRAKGLYENTLIVFVSDNGGKLF----DGATNGPLRSGKGH 300

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            ++GG+R  A +  P     Q  S +   + D  PTL  A G  IN   +DG +   +L
Sbjct: 301 MYEGGIRIPACVVWPGKVAAQSQSQQPLLLMDIFPTLAEATGTVIN-YPIDGRSFLSIL 358


>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 465

 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 137/392 (34%), Positives = 198/392 (50%), Gaps = 30/392 (7%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
            R   F  T  L+ N +   +   +KP+ III  DD G+ DV   G   I TPN+D +A 
Sbjct: 6   VRSTLFLFTFLLIANYSLAQS---QKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAM 62

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYL 119
            G      YV A +CTPSRSA+MTG+ P+  GM      V+      GLP TE  + + L
Sbjct: 63  EGQKWTNFYVAANVCTPSRSAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLL 122

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATF-----EP 172
           K +GY+T AIGKWHLG   E Y PT  GFD+++G  Y   +    D + +  F     E 
Sbjct: 123 KTSGYSTAAIGKWHLGHLPE-YLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEY 181

Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
           +    MR    ++        T  YTE A++ I + NK +P F+YLAH   H      P 
Sbjct: 182 FNVPLMRDKEIIERPADQTTITKRYTEEAVSYI-KANKDQPFFIYLAHSLPHI-----PL 235

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
            A ++ + K       ER  Y  ++  +D SVG +++ L+  G+ +N+ V+F +DNG P 
Sbjct: 236 FASEDFLTK------SERGLYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNG-PW 288

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
              +   GS+  L G K T ++GG+R    IW P   + Q V S++    D LPT+ + +
Sbjct: 289 LVYNEQGGSSGGLFGGKGTSYEGGVRVPTIIWGPGNIKPQ-VVSKIGSTLDLLPTISSLS 347

Query: 353 GIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
           G EI ND   DG +   VL    K+ R E+ +
Sbjct: 348 GTEIPNDRIYDGYDLSPVLRGENKSPREEMFY 379


>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
 gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
          Length = 510

 Score =  184 bits (468), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 120/352 (34%), Positives = 190/352 (53%), Gaps = 27/352 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
            A ++P+I++ILADDLG+ D+ F GS +I TPNIDALA NG++    YV    C PSR+ 
Sbjct: 52  VAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYCGPSRAG 111

Query: 84  LMTGKYPIHIGMQ-HGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TG+Y    GM+ +      +P+ GLP+ E    + +++AGY T  +GKWH+G     +
Sbjct: 112 LLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMGSHPN-F 170

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-NKTIGIYSTDLYTEA 200
            P  RGFD  +G+  G  DY+  S K +   Y     R+      N+ +   +T +  EA
Sbjct: 171 HPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALSRNGKPAQLNEYL---TTAISKEA 227

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A  V A     +P  +Y+A+ A H+     P QA ++++AK+  I+D +RRTYA M+  +
Sbjct: 228 ARFVSA---TEQPFMMYVAYNAPHS-----PLQATEQDLAKYQHIADLDRRTYAAMIDSM 279

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH-------SNKGSNHPLRGMKSTPW 313
           D+ VG ++AAL++   L N+++ F++DNG    G++       S+  +N P R  K +  
Sbjct: 280 DQGVGRIVAALKQSEKLHNTLIFFLSDNG----GVYPEEWMPDSDWANNAPFRRGKVSLT 335

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           +GG+        P     +   + L    D   T  A A  +++   LDGVN
Sbjct: 336 EGGVHVPFIAHWPAGFSLKSEYAGLVSSLDIAATSLALARADVSQAKLDGVN 387


>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 810

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 132/381 (34%), Positives = 196/381 (51%), Gaps = 37/381 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           R  F +     L   A  +  A ++P+I++I+ADDLG++DV F+G  +IPTP +D LA  
Sbjct: 4   RTKFLSSMLIGLLVLAASSVDAKERPNIVLIVADDLGYSDVGFNGCKEIPTPRLDELAGE 63

Query: 65  GLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEP-----W------GLPLTE 112
           G++    Y     C+PSR+ L+TG+Y    G       EG P     W      G+PL+E
Sbjct: 64  GVVFTNGYASHPYCSPSRAGLLTGRYQQRFGH------EGNPEPDPQWHGDDTPGMPLSE 117

Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
             L   LKEAGY T AIGKWHLG  +  + P  RGFD  +G+  G   Y+    +   +P
Sbjct: 118 TTLADALKEAGYVTGAIGKWHLGDAKPFW-PNRRGFDEWFGFSGGGLSYWGDLGR--MDP 174

Query: 173 YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
             G+  R +  VD K++  Y TD ++  A+  I  H ++ P FLYLA+ A HA     P 
Sbjct: 175 LLGV-HRGDEPVDRKSL-TYLTDDFSTEAVKFIQRH-ETDPFFLYLAYNAPHA-----PD 226

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
           QA    + K   I    R  Y  MV+ +DE +G V+  +R+ G+ +N++++F +DNG   
Sbjct: 227 QATRAHLEKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQIRESGLDDNTLIVFYSDNGG-- 284

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAA 351
                    N P RG K   ++GG+R    + W   L    +  S +  + D  PT+ AA
Sbjct: 285 ---RREHAMNLPYRGHKGMLFEGGIRVPFLVSWPGTLPGGVREQSPVTAL-DVFPTVLAA 340

Query: 352 AGIEIN-DTSLDGVNQWDVLT 371
           AG++ + +  LDG N    LT
Sbjct: 341 AGLDPSKNEKLDGQNLLAGLT 361


>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 543

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 130/357 (36%), Positives = 194/357 (54%), Gaps = 31/357 (8%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  +P+I++I+ADDLG++DV F+G  +IPTP++D LA +G++    Y     C+PSR+ L
Sbjct: 41  AKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGL 100

Query: 85  MTGKYPIHIGMQHGVILEGEP-W------GLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG++    G  HG   E +  W      G+PL+E  L   LKEAGY T AIGKWHLG  
Sbjct: 101 LTGRHQQRFG--HGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGDA 158

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYY-DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           +  + P  RGFD  +G+  G   Y+ D   K   +P  G+  R +  VD KT+  + TD 
Sbjct: 159 KPFW-PNRRGFDEWFGFSGGGFSYWGDLGMK---DPLLGV-HRGDEPVDPKTL-THLTDD 212

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           ++  A+  I  H +++P FLYLA+ A HA     P  A    + K   I    R  Y  M
Sbjct: 213 FSTEAVKFIQRH-ETEPFFLYLAYNAPHA-----PDHATRAHLQKTAHIEYGGRAVYGAM 266

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V+ +DE +G V+  +R+ G+ EN++++F +DNG            N P RG K   ++GG
Sbjct: 267 VAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG-----RREHAVNFPYRGHKGMLFEGG 321

Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
           +R    + W   ++   K  S +  + D  PT  AAAG++ + +  LDG N   VLT
Sbjct: 322 IRVPFLVSWPGTVRSGMKEESPITAL-DLFPTALAAAGMDPSQNDKLDGQNLLPVLT 377


>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 434

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 123/337 (36%), Positives = 179/337 (53%), Gaps = 23/337 (6%)

Query: 34  IILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIH 92
           +IL DD GW DV F+G++ IPTPN+D LA  G+I +  YV    C+PSR+ L+TG+Y   
Sbjct: 1   MILTDDQGWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQAR 60

Query: 93  IGMQHGVILEGE---PWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
            G    +  +G+     G PL+EK++ + LKE GY T AIGKWHLG   ++Y P  +GFD
Sbjct: 61  FGHDCNMPYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFD 119

Query: 150 SHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHN 209
             +G+  G  +Y+  S       Y+      N +V  +    Y TD +T  AI  I + +
Sbjct: 120 HWFGFPGGGMNYWGESKNEIQTIYR------NRKVVPEEELTYLTDDFTTEAIRFITQKD 173

Query: 210 KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIA 269
           + KP F+YLA+ A HA     P QA  E + K   I    R  YA MV+ +D +VG + +
Sbjct: 174 E-KPFFMYLAYNAPHA-----PDQATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDS 227

Query: 270 ALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLK 329
            L  +G+ EN+I++F++DNG            N P RG K   ++GG++    I  P   
Sbjct: 228 TLIANGLKENTILVFLSDNGG-----RIEHADNRPYRGHKGMLFEGGIKVPFFITWPTKI 282

Query: 330 QTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVN 365
           + +   +E     D  PT L AA G    +  LDGV+
Sbjct: 283 KEKYAYNEPISSLDLFPTFLNAAGGKAKKERQLDGVD 319


>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 193

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 88/152 (57%), Positives = 109/152 (71%), Gaps = 5/152 (3%)

Query: 39  DLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKYPIHIGM--- 95
           +LG+NDVSFHGS QIPTPNID +A  G+ILN +YV  +CTPSRSA+MTG+YPIH G+   
Sbjct: 44  NLGFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYH 103

Query: 96  QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW 155
            +  I     WG+ L EK LPQYLK  GY THAIGKWHLGFF + YTPT+RGFDS YGY+
Sbjct: 104 TYDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYY 163

Query: 156 QGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
            G  DY+DHS  +    + GLD+ ++    +K
Sbjct: 164 GGQADYWDHSLAS--NGWWGLDLHYDTPSSSK 193


>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
          Length = 452

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 142/421 (33%), Positives = 218/421 (51%), Gaps = 41/421 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
           K+ GY    +GKW+LG      TPT RGFD + G+   + D YY H        +    M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174

Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           R N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P 
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226

Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           E  E  K  + S PE+R     ++  LD +VG V  AL ++G LEN+I++F +DNG    
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
           G      +N P RG K   ++GG+R    I+  W  Q +  V      +SD  PTLC   
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
            + ++   +DG++   +L    +     ++H +    N     Q YYA+ +  D K +  
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGQAYYAS-QYRDFKILQN 398

Query: 409 T 409
           T
Sbjct: 399 T 399


>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 287

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 111/312 (35%), Positives = 163/312 (52%), Gaps = 31/312 (9%)

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  +D+S+G VI AL K  ML NSIV+F  DNG    G+ SN G N PLRG K+T W+GG
Sbjct: 2   VYEMDQSIGLVIEALHKRNMLGNSIVIFSTDNGGLPTGLGSNYGFNWPLRGFKATLWEGG 61

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAK 375
           +RG A IWSP L+++ ++SS++ HI+DWLPTL +AAG +++    +DG++ W+ L +   
Sbjct: 62  VRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAGGDVSRLGVIDGIDMWEALCEDLP 121

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVL 435
           + R EIL NID+  N     +AL V + K                     +    P E+ 
Sbjct: 122 SPRHEILLNIDSAANS----SALIVGNRKVF-------------------LTSSFPDELR 158

Query: 436 YSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKIL---ELREFARVRCNYDN 492
           Y ++ +          ++       K+ +   LR     EK+      R+   V C+  N
Sbjct: 159 YHRSEVP----GGTRPVEGLDEMMLKSRTGTVLRDFYKVEKLSIRPNWRKEVVVNCSQYN 214

Query: 493 KGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRA 552
            G++  +   P  F+I  DPCE  NLA      + +L +KLA Y +TM+PP NKP D R 
Sbjct: 215 PGSNFVANSSPYYFDIQRDPCELENLAAVNATEVDELMEKLAAYAATMIPPANKPMDPRG 274

Query: 553 DPARWNNIWVPW 564
            P   + +W PW
Sbjct: 275 LPKYNHGLWAPW 286


>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
 gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
          Length = 553

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 138/400 (34%), Positives = 195/400 (48%), Gaps = 56/400 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           + P+I++ILADDLG+ND+S  G   +PTPNID++A  G      Y   A C PSR+ +MT
Sbjct: 69  RPPNIVVILADDLGFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMT 128

Query: 87  GKYPIHIGMQ----------------------HGVILEGE------PW---GLPLTEKLL 115
           G+Y    G +                      H ++++ E      P+   GLP +E  L
Sbjct: 129 GRYGTRTGFEFTPTPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPFREQGLPGSEITL 188

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL---QDYYD-HSCKATFE 171
            + LK  GY    IGKWHLG   E + P  +GFD       GL   +D  D  + K  F+
Sbjct: 189 AEALKPKGYHNIHIGKWHLGNAPE-FLPNAQGFDESVMLESGLFLPEDSPDVVNAKLPFD 247

Query: 172 PYQGLDMRHNMQVDNKTIGI-------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVH 224
           P     +   MQ      G        Y TD YT+ AI  I E N+++P FLYLAH  VH
Sbjct: 248 PIDQF-LWARMQYATSYNGSAWFEPKGYLTDFYTDEAIKAI-EANRNRPFFLYLAHWGVH 305

Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
                 P QA   +      I D   R YA M+  LD SVG V+ +L+++G+ EN++V+F
Sbjct: 306 T-----PLQASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIF 360

Query: 285 MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISD 343
            +DNGAP  G       N P RG K T ++GG+R    A W   +    + ++ + H+ D
Sbjct: 361 SSDNGAP--GYIGLPDVNKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL-D 417

Query: 344 WLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
             PT+ AAAG E+  D  +DG++      +G K     I 
Sbjct: 418 MFPTIVAAAGGELPADRVIDGIDLLPYAARGEKPAPRPIF 457


>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
 gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
 gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
          Length = 452

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 142/421 (33%), Positives = 218/421 (51%), Gaps = 41/421 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
           K+ GY    +GKW+LG      TPT RGFD + G+   + D YY H        +    M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174

Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           R N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P 
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226

Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           E  E  K  + S PE+R     ++  LD +VG V  AL ++G LEN+I++F +DNG    
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNG---- 282

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAA 352
           G      +N P RG K   ++GG+R    I+  W  Q +  V      +SD  PTLC   
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNQIRPAVRDNFVMLSDMFPTLCDLT 340

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
            + ++   +DG++   +L    +     ++H +    N     Q YYA+ +  D K +  
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVHWVRREGNSRYGGQAYYAS-QYRDFKILQN 398

Query: 409 T 409
           T
Sbjct: 399 T 399


>gi|325107240|ref|YP_004268308.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324967508|gb|ADY58286.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 469

 Score =  184 bits (466), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 139/389 (35%), Positives = 198/389 (50%), Gaps = 38/389 (9%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L C L  +      + P  P+ ++I  DDLG+ D+   G   I TP +D +A  G  
Sbjct: 10  FGLLFCNLPGDGQAAEKSTP--PNFVVIFCDDLGYGDLGCFGHPSISTPQLDQMADEGQR 67

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAG 123
             Q YV A +CTPSR+AL+TG+ PI  GM   +  V+      GLP  E  + + LK+ G
Sbjct: 68  WTQFYVGASVCTPSRAALLTGRLPIRNGMMSAKRRVLFPDSKGGLPAQELTIAELLKKNG 127

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGY--------WQGLQDYYDHSCK-----ATF 170
           YAT AIGKWHLG  R  + PT  GFDS++G          +G  +Y   + +     +  
Sbjct: 128 YATAAIGKWHLG-HRSEFLPTEHGFDSYWGIPYSNDMDREKGHPNYRQKAAEDPHYFSPI 186

Query: 171 EPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
           E YQ + +  N +V  +    ++ T  YTE AI  I + N+ +P FLYLAH   H     
Sbjct: 187 EQYQ-VPIIDNKEVVERPANQHTITKRYTEKAIEFI-KGNREQPFFLYLAHNLPHI---- 240

Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
            P  A D     FL+ S   R  Y  +V+ +D  VG ++  L++ G+ EN++V+F +DNG
Sbjct: 241 -PLYASD----SFLETSP--RGIYGDVVTEIDHGVGQILDTLKETGLAENTLVVFTSDNG 293

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
            P     ++ GS   LR  K   ++GGMR    +W P   +  KV +EL    D LPT C
Sbjct: 294 -PWLLFDTHGGSAGLLREGKGATFEGGMRVPTVMWWPGTLEP-KVQTELGTTMDLLPTFC 351

Query: 350 AAAGIEI-NDTSLDGVNQWDVLTKGAKTK 377
           + AG+ I  D  LDG +    L KG  T+
Sbjct: 352 SLAGVGIPEDRVLDG-HDLSSLLKGETTE 379


>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
 gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
          Length = 157

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 79/139 (56%), Positives = 104/139 (74%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A K PHI+ I  DDLGW+DV +H  + + TPNID LA  G++L  +YVQ +CTP+R   +
Sbjct: 18  AAKPPHIVFIFVDDLGWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFL 77

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           +G+YPIH G+QH  I E EP+GLPL   LLPQ LK+AGY+THA+GKWHLGFF + YTP +
Sbjct: 78  SGRYPIHTGLQHSNIHETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLY 137

Query: 146 RGFDSHYGYWQGLQDYYDH 164
           RGFD+ +GY+ G +D+Y H
Sbjct: 138 RGFDTFFGYYSGSEDHYTH 156


>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 452

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 142/421 (33%), Positives = 220/421 (52%), Gaps = 41/421 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTA-PKKPHIIIILADDLGWNDVSFHGSSQ-IPTPNIDALAY 63
           KY     C L F+ A   T A P  P+I++IL DDLG+ D+S  G+++ I TP+ID L  
Sbjct: 3   KYILTAACGLAFSAASAQTKALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLN 62

Query: 64  NGL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYL 119
            G+   N H    + +PSR++L+TG+YP  +G+  GVI   + + WG L     LLPQ L
Sbjct: 63  EGVRFTNFHANCPVSSPSRASLLTGRYPDMVGVP-GVIRTHKEDSWGYLSEDAVLLPQML 121

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDM 178
           K+ GY    +GKW+LG      TPT RGFD + G+   + D YY H        +    M
Sbjct: 122 KKRGYHNAMVGKWNLGL-ESPNTPTERGFDFYRGFLGDMMDDYYTH------RRFGNNYM 174

Query: 179 RHNM-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           R N+ ++D +    ++T+++++ AI  +++   K +P FLYLA+ A H      P Q P 
Sbjct: 175 RENLKEIDPQG---HATEIFSDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQ 226

Query: 237 E--EVAKFLDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
           E  E  K  + S PE+R     ++  LD +VG V  AL+K+G LEN+I++F +DNG    
Sbjct: 227 EWLEKVKKREPSLPEKRAKIVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNG---- 282

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF-HISDWLPTLCAAA 352
           G      +N P RG K   ++GG+R    I+  W  + +    E F  +SD  PTLC   
Sbjct: 283 GQDDAGANNGPFRGAKQDMYEGGIRVAGGIY--WKNRIRPAVRENFVMLSDMFPTLCDLT 340

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNP----QKYYAALRVDDLKYVAG 408
            + ++   +DG++   +L    +     +++ +    N     Q YYA+ +  D K +  
Sbjct: 341 AVPVSH-EIDGISILPLLRGEEQDTGDRMVYWVRREGNSRYGGQAYYAS-QYRDFKILQN 398

Query: 409 T 409
           T
Sbjct: 399 T 399


>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
          Length = 264

 Score =  183 bits (464), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 97/230 (42%), Positives = 139/230 (60%), Gaps = 12/230 (5%)

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y IH G+QH +I   +P  LPL +  LPQ L+EAGY+TH +GKWHLGF+++   PT RG
Sbjct: 6   RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 65

Query: 148 FDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           FD+  G   G  DYY + +C        G D+     V  +  G YST LY + A  ++A
Sbjct: 66  FDTFLGSLTGNVDYYTYDNCDG--PGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKILA 123

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
            H+  +P+FLY+A  AVH      P Q+P E + ++  + +  RR YA MV+ +DE+V N
Sbjct: 124 SHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRN 178

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           +  AL+K+G  +NS+++F  DNG  +F    + GSN PLRG K T W+GG
Sbjct: 179 ITRALKKYGYYDNSVIVFSTDNGGQTF----SGGSNWPLRGRKGTYWEGG 224


>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 467

 Score =  183 bits (464), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 134/387 (34%), Positives = 193/387 (49%), Gaps = 43/387 (11%)

Query: 16  LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ- 74
            F  +FL  TA ++P+I+ IL+DD G+ D  F GS +  TP +D LA   +  +Q YV  
Sbjct: 15  FFKVSFL--TAQQQPNIVFILSDDAGYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSA 72

Query: 75  ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYAT 126
           A+C PSR+ ++TGKY    G +   +        L G+  GLPL +  +  YL++ GY T
Sbjct: 73  AVCGPSRAGILTGKYQQKFGFEENNVPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKT 132

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKAT-----FEPYQGLDMR 179
              GKWH G   + + PT RGFD  YG+  G + Y  YD S   +      E   G  + 
Sbjct: 133 ALFGKWHQG-NADRFHPTKRGFDEFYGFRGGARSYMPYDDSNPLSKNEDRLERGFGNFLE 191

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           H           Y TD     AI+ I   NK  P F+YL+  AVH      P +A  E++
Sbjct: 192 HEG---------YLTDELAHEAISFI-NRNKKHPFFIYLSFNAVHT-----PMEATAEDL 236

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            +F  +   +R+T A M   +D ++G V+  L   G+  N++++F  DNG PS    +N 
Sbjct: 237 EQFPHLKG-KRKTLAAMTLAMDRAIGTVLKELETQGLSNNTLIVFTNDNGGPS---DAND 292

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPT-LCAAAGIEI 356
             N PL G K+   +GG+R V  + S W KQ QK  +  + IS  D  PT L AA+G   
Sbjct: 293 SDNSPLSGTKANHLEGGIR-VPFLMS-WPKQLQKNKNYNYPISTLDLFPTFLSAASGNSS 350

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILH 383
               LDGVN    L +    +  E+L+
Sbjct: 351 AMEELDGVNLLPYLKEENTERPHELLY 377


>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 443

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 118/346 (34%), Positives = 173/346 (50%), Gaps = 21/346 (6%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  + + ++ILADDLG+ DV F GSSQI TP+ID LA +G+I +Q YV  ++C PSR+ L
Sbjct: 5   ASNRLNFVLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGL 64

Query: 85  MTGKYPIHIGMQHGVI-----LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           MTGK  +  G  + +       + E  GLP++EK L   L E GY    +GKWHLG  +E
Sbjct: 65  MTGKNQVRFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLG-DKE 123

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            Y P  RGF   +GY  G   Y+           +G D              Y TD   +
Sbjct: 124 QYHPLKRGFHEFWGYLGGGHHYFRSKPNG-----KGYDCPIECNYKTPQPISYITDDKGD 178

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
             ++ I  H K +P FL+ +  A HA     P  A +E++  +  I   +RR Y  M   
Sbjct: 179 ECVDFIRRH-KDEPFFLFASFNAPHA-----PMHAKEEDLKLYSHIEGEKRRAYCAMTHN 232

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD +VG ++A L+   + +N++V+F++DNG P+    +N   N PLRG K    +GG+R 
Sbjct: 233 LDLNVGKIVAELKAQNIYDNTVVVFLSDNGGPT---PNNASLNAPLRGGKGILLEGGIRV 289

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
              +  P   Q     ++     D   +    AG +     +DGVN
Sbjct: 290 PFTMTWPAKIQAGSSFNKAVSSIDLAASFTELAGAKFRAEEMDGVN 335


>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
 gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
          Length = 476

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 134/376 (35%), Positives = 190/376 (50%), Gaps = 29/376 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P+IIII  DD G+ D+S  G + + TP ID +A  G  L   YV A +CTPSR+ALMTG
Sbjct: 31  QPNIIIIFTDDQGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTG 90

Query: 88  KYPIHIGMQHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            YP  I M  G    V+L G+  GL   E  + + LK  GY T   GKWHLG   E + P
Sbjct: 91  TYPKRIDMARGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLGDQPE-FLP 149

Query: 144 TFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
           T +GFD  +G  Y   +  Y+       F     LD    +++D      Y T  +TE A
Sbjct: 150 TRQGFDEFFGLPYSHDIHPYHPQQSHFKFPSLPLLDGEEVIEMDPD--ADYLTKRFTERA 207

Query: 202 INVIAEHNKSKPMFLYL----AHLAVHAGNTYEPFQAPDEEVAKFLDISD-PERRTYAGM 256
           +  I E NK +P FLY+     H  +HA   Y     P+E  AK    ++  + +T   +
Sbjct: 208 VQFI-EKNKDQPFFLYMPHPIPHTPLHASPAYMK-DLPEEIKAKLSKENNYIDYKTRTQL 265

Query: 257 VSR----LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           +SR    +D SVG ++ AL+K+G+ +N+IV+F +DNG P+ G  +      PLRG K+  
Sbjct: 266 LSRAIGEVDWSVGQILDALKKNGIDDNTIVIFSSDNG-PAIGKAT------PLRGRKNQT 318

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT 371
            +GGMR    I  P        + E+    D LPT    AG EI +D  +DG + W V+ 
Sbjct: 319 LEGGMRVPTVIHWPGKIPAGIANDEIMTTMDLLPTFAKLAGAEIPSDRVIDGKDIWAVIR 378

Query: 372 KGAKTKRSEILHNIDN 387
              K+      ++  N
Sbjct: 379 GEEKSPHQAFFYHKGN 394


>gi|359400556|ref|ZP_09193534.1| sulfatase [Novosphingobium pentaromativorans US6-1]
 gi|357597898|gb|EHJ59638.1| sulfatase [Novosphingobium pentaromativorans US6-1]
          Length = 493

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 130/379 (34%), Positives = 187/379 (49%), Gaps = 41/379 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           +P++++IL DDL W DVS +G+S I TPNIDA+   G   +  YV A +C+ SR+ L++G
Sbjct: 44  RPNVVVILVDDLAWTDVSANGASTIQTPNIDAIGAAGATFSSGYVTASVCSVSRAGLLSG 103

Query: 88  KYPIHIGMQHGVILEG---EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           + P   G  + +   G   +  GLP+    L + LK  GY T   GKWH G   E Y PT
Sbjct: 104 RNPTTFGFMYNITERGAVDKGAGLPVDVPTLAERLKPLGYDTAVFGKWHQGSTPEFY-PT 162

Query: 145 FRGFDSHYGYWQGLQDYYDH------SCKATFEPYQ------------GLDMRHNMQVDN 186
            RGFD  +G+  G   Y         +   +F+ YQ            G DM    Q   
Sbjct: 163 RRGFDEFFGFLAGESIYATPDTPGIVTTPTSFDKYQPKEREGAQRLFQGPDMEPVAQPHK 222

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
                Y T+  TE A++ I      K P FLY+A+ A H      P Q P     +  +I
Sbjct: 223 -----YLTEEITEHAVDFIDRKADGKDPFFLYVAYNAPHW-----PLQVPRTYYDRHSEI 272

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKGSNHP 304
           +DP RRTY  M+  LD  VG ++ AL   G+ E+++V+F++DNG P  FG  +     HP
Sbjct: 273 ADPARRTYVAMIDALDAGVGQILKALDDKGLREDTLVVFLSDNGCPVQFGYCTQ---THP 329

Query: 305 LRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLD 362
             G K T  +GGMR    A W   +   Q++++ +  + D +PT+  A    +     LD
Sbjct: 330 WGGGKFTYLEGGMRVPFMASWPGHIAAGQRIATPVSSL-DIVPTVLEAVSPGKTLPEELD 388

Query: 363 GVNQWDVLTKGAKTKRSEI 381
           GV+  D L   A  +RS I
Sbjct: 389 GVDLLDTLAGTAPKERSFI 407


>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 452

 Score =  182 bits (462), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 135/370 (36%), Positives = 199/370 (53%), Gaps = 39/370 (10%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +  ALTC+    DA        KP+II+ILADDLG  + S  G+  I TPNID L + GL
Sbjct: 12  FAVALTCSAFAVDA-------SKPNIIVILADDLGRGEYSDFGTKDIQTPNIDRLCHEGL 64

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WG-LPLTEKLLPQYLKE 121
             +  Y  + +C+PSR+ALMTG +P  +G+  GVI E +    WG L    +L PQ LK 
Sbjct: 65  TFDNFYANSCVCSPSRAALMTGCFPNRVGVP-GVIREEDQDNNWGWLAQGVRLFPQLLKP 123

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRH 180
           AGY +  +GKWHLG      TPT RGFD   G+   +  DY+ H         Q   MRH
Sbjct: 124 AGYQSAIVGKWHLG-INSPNTPTERGFDRFDGFLGDMMDDYWTH-----LRHGQNF-MRH 176

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           N +V +     ++TD++T+ A+  + E +KSK P FLYLA+ A H     +P Q   + +
Sbjct: 177 NQEVVSPE--GHATDVFTDWALRYLDERSKSKEPFFLYLAYNAPH-----DPIQPKPDWL 229

Query: 240 AKFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
           AK +   +P    +R     ++  LD+ +G V+A L + G+ +N+IVLF +DNG    G+
Sbjct: 230 AK-VKAREPGLSEKRAKLVALIEHLDDGIGKVLAKLDETGLAKNTIVLFSSDNG----GV 284

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
            SN+ +N P R  K+  ++GG+R    +  P        S+ +  + D LPT     G +
Sbjct: 285 LSNEANNGPWRSGKTHMYEGGLRVPCMVRWPGHVAPGTHSNRVALLMDILPTALEIVG-Q 343

Query: 356 INDTSLDGVN 365
               ++DGV+
Sbjct: 344 TPPPNIDGVS 353


>gi|402824519|ref|ZP_10873876.1| sulfatase family protein [Sphingomonas sp. LH128]
 gi|402261924|gb|EJU11930.1| sulfatase family protein [Sphingomonas sp. LH128]
          Length = 455

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 127/389 (32%), Positives = 186/389 (47%), Gaps = 35/389 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           +KP+ I+IL DDLG+ D+   G + IPTPN++ +A  G++L  +Y  Q +CTPSR+ L+T
Sbjct: 30  RKPNFIVILCDDLGYGDIEPTGGTAIPTPNLNRMAREGMVLTDYYAPQNICTPSRAGLLT 89

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YPI  G+  G IL+ +   LPL E  +P+ L+ AGY +   GKWHLG     + PT  
Sbjct: 90  GRYPIRTGLAWGAILQPDKRILPLDEVTIPKALQPAGYRSGLFGKWHLGHLGPDWQPTRH 149

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GFD  YG        Y H  K   + +           +   +       Y E A+N I 
Sbjct: 150 GFDRFYGI------PYSHDMKP-LQIFTDTSPDRRASEEEPVMAELQQRFYEE-AVNFI- 200

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           E NK +P F+ LA  A H  N   P    D+  +            Y  +V  +D  VG 
Sbjct: 201 EANKDQPFFVELALSAPHLANYPNPRHRDDKMAS-----------AYGEVVEEIDAIVGA 249

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK-STPWDGGMRGVAAIWS 325
           ++  LR  G+  +++VLF +DNG P +     +GS+ PLR  K    +DGG R     W 
Sbjct: 250 LMVKLRTLGLDRDTLVLFTSDNG-PWY-----EGSSGPLRDRKGGAGYDGGYRVPCIAWQ 303

Query: 326 PWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHN 384
           P           +    D+LPT CA AG  +     +DG +   +L +GA++    +L  
Sbjct: 304 PGTVPAGLRCDSIAMGIDFLPTFCAMAGTPVPAGPVIDGKDISQILLRGAESPHESLLLF 363

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNG 413
            D      +    +R    KYVA T  +G
Sbjct: 364 DD------EEVTGIRTQRWKYVAQTYYHG 386


>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 481

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 129/386 (33%), Positives = 191/386 (49%), Gaps = 29/386 (7%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           ++C   F          +KP+II+I ADDLG+ D+   G   I TPN+D +A+ G     
Sbjct: 14  ISCIGFFVTTAEARQGGEKPNIIVIFADDLGYGDLGVFGHPSIKTPNLDKMAFEGQKWTN 73

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGMQ---HGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
            YV A +CTPSR+ L+TG+ PI  GM      V+      GLP +E  + + LK  GY T
Sbjct: 74  FYVAAPVCTPSRAGLLTGRLPIRSGMSSDDRRVLFPDSDGGLPQSEITIAKALKGNGYQT 133

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQ-----DYYDHSCKATFEPYQGLDMR 179
            AIGKWHLG  +  Y PT  GFDS++G  Y   +      D++  +    F+ Y    M+
Sbjct: 134 AAIGKWHLG-HKSPYLPTDHGFDSYFGIPYSNDMDKIDKTDHFTLTDNEKFDAYNVPLMK 192

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
                +  T     T  YTE A++ I    K +P F+YLAH   H          P    
Sbjct: 193 DAEIAERPTDQRTLTKRYTEEAVSKIKAF-KDEPFFIYLAHNLPH---------IPLFRS 242

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
           A F D S      Y  ++  +D SVG +++ L++ G+ EN++V+F +DNG P     ++ 
Sbjct: 243 AAFKDQS--LGGIYGDVIEEIDWSVGQILSTLKEEGIAENTLVVFTSDNG-PWHVFKTHG 299

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ-KVSSELFHISDWLPTLCAAAGIEIND 358
           G+   LRG K   ++GGMR     W  W  Q +  V  ++    D LPT CA +G E+ D
Sbjct: 300 GTAGLLRGAKGGTFEGGMREPTVFW--WPAQIKPGVVMDMGTTMDLLPTFCAISGTELPD 357

Query: 359 TSL-DGVNQWDVLTKGAKTKRSEILH 383
             + DG +   +L    K++R  + +
Sbjct: 358 DRIYDGYDISGLLKGTGKSERETVFY 383


>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 460

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 130/368 (35%), Positives = 184/368 (50%), Gaps = 37/368 (10%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSR 81
           +  A + P+I+II ADDLG+ D+  +GS  I TP++D +A  GL     YV + +CTPSR
Sbjct: 23  SQAADRPPNIVIIFADDLGYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSR 82

Query: 82  SALMTGKYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           +AL+TG+YP+  GM  +  V+      GLP  E  LP+ LK  GYAT  +GKWHLG   E
Sbjct: 83  AALLTGRYPVRSGMYGKRRVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLG-IHE 141

Query: 140 VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD------MRHNMQVDNKTIGI 191
              P  +GFD  +G  Y   +    D    +T  P   +D      +R+   V+     +
Sbjct: 142 GSRPLDQGFDQSFGLPYSNDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQV 201

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           + T  YTE A+  I +  KS+P FLY+AH   H      P  A +    +          
Sbjct: 202 HLTGHYTEEAVKFI-QQKKSQPFFLYMAHSFPHV-----PLFASENFKGR------SRAG 249

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  LD SVG V+  LRK G+ EN++V F +DNG P   + +  GS  PL+  K +
Sbjct: 250 IYGDTVEELDWSVGQVLQTLRKEGLSENTLVFFTSDNG-PWLIMGAQGGSAGPLKDGKGS 308

Query: 312 PWDGGMRGVAAIW-----SPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
            W+GGMR     W      P + +T   S +LF  S       A AG E+  D  +DG +
Sbjct: 309 HWEGGMRVPGIAWWPGHIKPGVSRTPANSMDLFVTS------VALAGAEVPKDRVIDGTD 362

Query: 366 QWDVLTKG 373
              +L +G
Sbjct: 363 IRPLLLEG 370


>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
 gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
          Length = 474

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 116/347 (33%), Positives = 176/347 (50%), Gaps = 23/347 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I+ IL+DD G+ D  F GS++IPTPN+D LA  G++  Q YV A +C PSR+ L+TG
Sbjct: 35  KPNIVFILSDDAGYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTG 94

Query: 88  KYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE 139
           KYP   G +   +          G+  G+ L +  +  YL E GY T  IGKWH G   +
Sbjct: 95  KYPQRFGFEENNVPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQG-NED 153

Query: 140 VYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            + P  RGFD  +G+  G + Y+  +        +    R+          +Y TD    
Sbjct: 154 RFHPLKRGFDEFFGFRGGARSYFPFTQAHPSSRREDFLERNFNNYGESP--LYLTDALAN 211

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
             I  I + NK +P F +L+  A H      P +A   ++ +F  ++  ER+ YA M+  
Sbjct: 212 ETIEFI-KRNKHQPFFTFLSLSAPHY-----PMEAEKSDLDQFPHLTG-ERKIYAAMMLN 264

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           +D ++G V+  L++ G+ EN++V+F  DNG PS     N   N PL G K+   +GG+R 
Sbjct: 265 MDRAIGRVLQTLQQEGLSENTLVIFTNDNGGPS---DHNGSINLPLSGTKANLLEGGIRV 321

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVN 365
              +  P + +       +    D LPT  AAAG ++   T  DGV+
Sbjct: 322 PMIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAGGDVRQVTDWDGVD 368


>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 484

 Score =  181 bits (460), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 129/399 (32%), Positives = 192/399 (48%), Gaps = 61/399 (15%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
           P+I++I++DD G+ND+    S  I TP++D LA  G  L   YV    CTPSR++L+TG+
Sbjct: 28  PNILLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGR 86

Query: 89  YPIHIGMQHGVILEGEPWG-----------------LPLTEKLLPQYLKEAGYATHAIGK 131
           YP   G+   +  E   +G                 +   EKLLP+YLK+ GY +   GK
Sbjct: 87  YPQRNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGK 146

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           W LG  +  + PT RGFD  YG+     DY+ H      E Y    M     +  +  G 
Sbjct: 147 WDLGSLKR-FLPTNRGFDEFYGFVNTGIDYFTH------ERYGVPSMFRQTSLTEEDRGE 199

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVAKFLDIS 246
           Y+T+L+   A+  +     S+P  LYL   A H  ++ +P      QAP+E  A +  + 
Sbjct: 200 YATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRIRSTVQAPEEYKAMYPAV- 258

Query: 247 DPERRT---------------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
           DPE RT                     Y   V+ +D ++G V+  L   G+L+N+IV+F 
Sbjct: 259 DPEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVLGKLEDKGVLDNTIVIFF 318

Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS-DW 344
           +DNG       S    N PLRG K+  W+GG+R    +  P        +++ F  S + 
Sbjct: 319 SDNGG------SGGADNSPLRGHKAQVWEGGIRVPCLVRWPDGGVPAGATNDSFLSSLEL 372

Query: 345 LPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEIL 382
           +P+L  AAG  I  D  +DG N W  LT+ A + R+E+ 
Sbjct: 373 VPSLVKAAGGTIPEDIVMDGRNWWPALTEAASSPRNEMF 411


>gi|32477899|ref|NP_870893.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
           1]
 gi|32448456|emb|CAD77971.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
           1]
          Length = 1012

 Score =  181 bits (460), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 140/419 (33%), Positives = 202/419 (48%), Gaps = 33/419 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T    KP+ I+IL DD G+ D+S  G+  + TP ID +A  G  L   YV A +CTPSR+
Sbjct: 565 TAETTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 624

Query: 83  ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            LMTG YP  I    G   GV+L G+P GL   E  + + LK AGY T   GKWHLG   
Sbjct: 625 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 684

Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           E + PT +GFD  +G  Y   +  ++       F P   L     +++D      + T  
Sbjct: 685 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDA--DFLTKR 741

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
            TE A++ I E NK +P FLYL H   HA  +   PF +   ++V   ++  D       
Sbjct: 742 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 800

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
               +   ++ +D SVG ++ ALR +G+ E ++VLF +DNG P   ++++ G    LRG 
Sbjct: 801 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 857

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
           K T ++GGMR    +  P        + EL    D LPT    AG  I  D  +DG + W
Sbjct: 858 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIW 917

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
             L    +T      ++  N        AA+R    K      NNG + + Y D +N++
Sbjct: 918 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 966



 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 117/400 (29%), Positives = 173/400 (43%), Gaps = 71/400 (17%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
           +  A + P++++I  DDLG+ D+  +G++++ TPNID LA  G    + H   A+CTPSR
Sbjct: 33  SVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 92

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWG-LPLTE--------KLLPQYLKEAGYATHAIGKW 132
             L+TG+YP+      G+      WG LP T         K + +  K  GYAT  +GKW
Sbjct: 93  YGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKW 146

Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
           HLGF  E           P   GFD ++G         +  + D     YD S    +  
Sbjct: 147 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 206

Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
               P        +++  N+  G      IY  +    L TE A+  I E  K++P FLY
Sbjct: 207 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKNEPFFLY 265

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
            A   +H   T  P         +F   S  +   Y   V  LD  VG ++ +L  +G+ 
Sbjct: 266 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 314

Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
           +N++VLF +DNG     A    I +    N  L G K   W+GG R  + A W   +K  
Sbjct: 315 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 374

Query: 332 QKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDVL 370
            + S +L    D   T  A    E+  +   D +N    L
Sbjct: 375 TQ-SDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPAL 413


>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 512

 Score =  181 bits (460), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 139/401 (34%), Positives = 202/401 (50%), Gaps = 56/401 (13%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T A K+P+II+I ADD+G++DV +HG+ +I TPNID++A  G+  +Q YV A +C PSR+
Sbjct: 15  TFADKQPNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRA 74

Query: 83  ALMTGKYPIHIGMQHGVILEGEP-------WGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
            L+TG Y    G        G P        GLP ++ ++ + LK  GY    IGKWH+G
Sbjct: 75  GLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMG 134

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGI--- 191
           F   +  P  RG+D  YG+  G  DY + + + A  +    +     M+  NK   I   
Sbjct: 135 FDMSL-RPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVF 193

Query: 192 -----------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
                      Y TDL+T+ A+N I + N  KP FLYLA+ AVH      P+Q     + 
Sbjct: 194 KEKGVKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALD 247

Query: 241 KFLDISDPER-RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF-GI-HS 297
           K   + D +    +A MV  +DE +G V+  L++  + +N+I++F++DNG+P   GI HS
Sbjct: 248 KTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHS 307

Query: 298 NKGSNHP-----------LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DW 344
            K  N              RG K   ++GG+R    I   W +Q QK +     IS  D 
Sbjct: 308 PKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCI--KWPQQIQKGTKYDMPISALDL 365

Query: 345 LPTLCAAAGIEINDT------SLDGVNQWDVLTKGAKTKRS 379
            PTL  AAG   ND       + DGV+    L +  + KRS
Sbjct: 366 QPTLVKAAG--GNDKKPQKGFAYDGVDILPYLKEDKEIKRS 404


>gi|421614584|ref|ZP_16055638.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408494671|gb|EKJ99275.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 1012

 Score =  181 bits (460), Expect = 7e-43,   Method: Compositional matrix adjust.
 Identities = 139/419 (33%), Positives = 202/419 (48%), Gaps = 33/419 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T    KP+ I+IL DD G+ D+S  G+  + TP ID +A  G  L   YV A +CTPSR+
Sbjct: 565 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 624

Query: 83  ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            LMTG YP  I    G   GV+L G+P GL   E  + + LK AGY T   GKWHLG   
Sbjct: 625 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 684

Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           E + PT +GFD  +G  Y   +  ++       F P   L     +++D      + T  
Sbjct: 685 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPPLPLLQNDTVIEMDPDA--DFLTKR 741

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
            TE A++ I E NK +P FLYL H   HA  +   PF +   ++V   ++  D       
Sbjct: 742 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 800

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
               +   ++ +D SVG ++ ALR +G+ E ++VLF +DNG P   ++++ G    LRG 
Sbjct: 801 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 857

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
           K T ++GGMR    +  P     +  + EL    D LPT    AG     D  +DG + W
Sbjct: 858 KGTTFEGGMREPTVVRWPGQIPAEHQNDELMTAMDLLPTFAKLAGAATPTDRVIDGKDIW 917

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
             L    +T      ++  N        AA+R    K      NNG + + Y D +N++
Sbjct: 918 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 966



 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 117/420 (27%), Positives = 175/420 (41%), Gaps = 82/420 (19%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-L 66
            +A+   +L      +  A + P++++I  DDLG+ D+S +G++++ TPNID LA  G  
Sbjct: 19  LYAVALVMLLG-CGTSVAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDRLAAKGRR 77

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQ 117
             + H   A+CTPSR  L+TG+YP+      G+      WG LP T  L        + +
Sbjct: 78  FTDAHSASAVCTPSRYGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTQTIGK 131

Query: 118 YLKEAGYATHAIGKWHLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD- 160
             K  GYAT  +GKWHLGF  E           P   GFD ++G         +  + D 
Sbjct: 132 VFKNKGYATACLGKWHLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDD 191

Query: 161 -----------------------YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
                                  + + +   +   + G    H +  D KT       L 
Sbjct: 192 SIFGYDPNDPLVYGGKPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKT-----GTLL 246

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           TE A+  I E  K +P FLY A   +H   T  P         +F   S  +   Y   V
Sbjct: 247 TERAVKWITE-KKDEPFFLYFATPNIHHPFTPAP---------RFKGTS--QCGLYGDFV 294

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTP 312
             LD  VG ++ +L  +G+ +N++VLF +DNG     A    I +    N  L G K   
Sbjct: 295 HELDWMVGEIVQSLEDNGLTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGV 354

Query: 313 WDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDVL 370
           W+GG R  + A W   +K   + S +L    D   T  A    E+  +   D +N    L
Sbjct: 355 WEGGHRVPLIAKWPGKIKAGTQ-SDQLISQVDLFATFSALTEQEMPSSEQKDSINMLPAL 413


>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
          Length = 503

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 122/367 (33%), Positives = 180/367 (49%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           APK P+++++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 8   APKPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAAL 67

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P  E+LLP+ LK AGYA+  +GKWHLG  
Sbjct: 68  LTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLGH- 126

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P  RGFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 127 RPQFHPLKRGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL 186

Query: 194 TDLYTEAAINVIA-EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T LYT+ A++ +  +H   +P FLY A  A HA         P      FL  S   R  
Sbjct: 187 TQLYTQEALDFVQRQHAARRPFFLYWAIDATHA---------PVYASRPFLGTS--RRGR 235

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   V  +D+SVG +++ LR   + E++ V F +DNGA         GSN P    K T 
Sbjct: 236 YGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTT 295

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG+   +D  +DG++    + 
Sbjct: 296 FEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAML 355

Query: 372 KGAKTKR 378
            G  T R
Sbjct: 356 WGQLTDR 362


>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 476

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 128/376 (34%), Positives = 203/376 (53%), Gaps = 37/376 (9%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
           +A+  FFA+       +AF  +    +P++++ILADDLG+ D+S +G++ + TPNIDAL 
Sbjct: 4   FAQWIFFAVVVLAGVPEAFGASPDAGRPNVLLILADDLGYGDLSSYGAADLKTPNIDALV 63

Query: 63  YNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQY 118
            +G+  ++ Y  + +C+P+R+AL+TG YP  +G+  GVI     + WG L     LLPQ 
Sbjct: 64  ASGVRFDRFYANSPVCSPTRAALLTGCYPDLVGVP-GVIRTHPDDSWGVLSPQAVLLPQV 122

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD 177
           LK AGY T  +GKWHLG       P+ RGFD  +G+   +  DY++H          G++
Sbjct: 123 LKGAGYHTALVGKWHLG-LSGASLPSRRGFDLFHGFLGDMMDDYHNHR-------RHGIN 174

Query: 178 --MRHNMQVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQA 234
              R + ++D K    ++TDL+++ AI+ + E   + +P FL LA+   H      P Q 
Sbjct: 175 YMRRDDREIDPKG---HATDLFSQWAIDFLNESKGQDRPFFLELAYNVPHT-----PVQP 226

Query: 235 PDEEVAKFLDIS---DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
           P E + K    +   DP+R     ++  LD  +G V+AALR +G  E ++++F +DNG  
Sbjct: 227 PQEWLEKVRRRAPNLDPKRARLVALIEHLDHGIGQVLAALRANGQAERTLIVFTSDNGGQ 286

Query: 292 -SFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
              G H       P RG K   ++GG+R  + A+W P   +    S  +    D  PTLC
Sbjct: 287 LDAGAHCG-----PYRGGKQDMYEGGIRVPLCAVW-PGRIEPGTRSDRVAVSMDLFPTLC 340

Query: 350 AAAGIEINDTSLDGVN 365
            AAG    +  +DG++
Sbjct: 341 EAAGARF-EHPVDGIS 355


>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
 gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
          Length = 499

 Score =  181 bits (459), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 129/372 (34%), Positives = 185/372 (49%), Gaps = 23/372 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P++I+I  DD G+ DV   G   + TPN+D +A  GL+L   YV + LC+PSR+ LMT
Sbjct: 43  RQPNVIVIFTDDQGYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMT 102

Query: 87  GKYPIHIGMQHG----VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           G YP  + M  G    V+L  +  GL   E  + + LK  GYAT   GKWHLG   E + 
Sbjct: 103 GSYPRRVDMATGSNFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLGDQPE-FL 161

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI-YSTDLYTEAA 201
           PT +GFD  +G          H  +A F+ +  L +  N  V        Y T   TE A
Sbjct: 162 PTRQGFDEFFGLPYSHDIAPTHKRQAHFK-FPDLPLMENENVIELNPNPEYLTRRITERA 220

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAG--NTYEPFQAPDEEVAKFLDISDPE------RRTY 253
           I+ I  H +  P FLYL H   H     + E     D+     L+ISD +      +  Y
Sbjct: 221 IDFIERH-QDAPFFLYLPHPMPHGPLHVSQEFLNTVDKATLARLNISDADIAANSKKAMY 279

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
             ++  LD +VGN++  L++  +  N+IV+F +DNG PS G  ++ G+   L G K+   
Sbjct: 280 PLVIGELDNAVGNILDTLKRLNLDSNTIVIFTSDNG-PS-GRRNSDGTERYLSGHKTLTQ 337

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTK 372
           +GGMR    +W P       VS  L    D LPT+   AG ++  D  +DG N W +L  
Sbjct: 338 EGGMRVPTVVWGPSQIPANTVSQALTTSMDILPTVAFLAGADVPTDRVIDGKNIWPIL-- 395

Query: 373 GAKTKRSEILHN 384
            A  K +E  H+
Sbjct: 396 -AAEKDAESPHD 406


>gi|323448512|gb|EGB04410.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 572

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 134/407 (32%), Positives = 196/407 (48%), Gaps = 58/407 (14%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIP-------TPNIDALAYNGLILNQHYVQALCT 78
           A   PHII I+ADDLGW+ V +H   +         TP +DAL   G+ L +HY    C+
Sbjct: 32  AAPPPHIIHIVADDLGWSYVDWHRDPRTADPPDTPLTPRLDALRRAGVTLERHYAFQYCS 91

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEP------W-GLPLTEKLLPQYLKEAGYATHAIGK 131
           PSRSAL++G+ PI + +Q+ V     P      W G+P     L + L+  GY T A+GK
Sbjct: 92  PSRSALLSGRNPISVNVQNVVPEVSNPEDPVGGWQGIPTAMTGLAEVLRRRGYRTEAVGK 151

Query: 132 WHLGFFREVYT-PTFRGFDSHYGYWQGLQDYY---DHSC-----------KATFE-PYQG 175
           W +G     +  P  RGFD+  GYW    DY+   + +C            ATF+ P   
Sbjct: 152 WDVGMATAAHHHPRARGFDAWTGYWHHANDYWTFAEETCDGEAVQDLWHYNATFDGPATW 211

Query: 176 LDMRHNMQVDNKT-----IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
           L    +    N+T       +Y   L  + A+  + EH+ + P+FLY A   VH      
Sbjct: 212 LRNGPSCTQGNQTHTAASPCVYEEQLLADHAVRAVEEHDDTAPLFLYWALHLVHM----- 266

Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
           P Q PD  +A+F  I D  RR    MV   D+ VG ++ AL + GML N++V F +DNG 
Sbjct: 267 PLQVPDAYLARFDGIPDEPRRFMTAMVRYADDEVGRLVDALERRGMLANAVVTFHSDNGG 326

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTL 348
              G  +  G+N PL G K + ++GG+R  A +    + + ++ +  S+L  + DW  T 
Sbjct: 327 EIMGAGACGGNNWPLTGGKFSNFEGGIRVNAFVSGGAVPEARRGAALSQLSTLWDWYATY 386

Query: 349 CAAAGIEINDTS------------LDGVNQWDVLTKG-AKTKRSEIL 382
              AGI   D+             LD V+QW  +T   A   R E++
Sbjct: 387 ---AGIAGADSIDDAAAAAAGLPPLDSVDQWPYVTGANATPPRREVV 430


>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 479

 Score =  180 bits (457), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 178/345 (51%), Gaps = 14/345 (4%)

Query: 14  TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
            LL  +    T+   +P+I++I+ADDLG+ D+   G  +IPTP++D LA +G+     YV
Sbjct: 22  ALLAAEDVNQTSKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYV 81

Query: 74  QA-LCTPSRSALMTGKYPIHIGMQHGV-ILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
            A  C+PSR+  +TGKY    G +    + E    GLPL E  +   L+  GY T  IGK
Sbjct: 82  SAPYCSPSRAGFLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGK 141

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIG 190
           WH GF ++ + P  RGFD  +G+  G  +Y  H   KA F      DM +  +      G
Sbjct: 142 WHQGFSKD-HHPQSRGFDEFFGFLVGGHNYLLHKEVKARFGTAHSHDMIYRGREVEPQEG 200

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDISDPE 249
            Y+TDL+T  A+  ++    +KP FLYL++ AVH      P  Q    E  K      P 
Sbjct: 201 -YATDLFTNEALRWMSG-PPNKPWFLYLSYNAVHTPLEIAPHLQKRIPESVKL-----PA 253

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG--IHSNKGSNHPLRG 307
           RR Y  +++ LD+S+G +   L +HG+ E ++++F++DNG       +  N G NHPLRG
Sbjct: 254 RRGYLSLLAGLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAYNSGLNHPLRG 313

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
            K    +GG+R    +  P     + +  +     D LPT+C  A
Sbjct: 314 DKGQTLEGGIRVPFFVSWPGQLPARTIYEQPIISLDLLPTVCQLA 358


>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
 gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
          Length = 502

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 178/350 (50%), Gaps = 31/350 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
           P+I++IL+DD G+ D  F GSSQI TPN+D LA    +  Q YV  A+C PSR+ ++TGK
Sbjct: 60  PNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAVCGPSRAGILTGK 119

Query: 89  YPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           Y    G +   +        L G+  GLPL ++ +  YL   GY+T  IGKWH G   + 
Sbjct: 120 YQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTALIGKWHQG-NADK 178

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL----DMRHNMQVDNKTIGIYSTDL 196
           + PT RGF+  YG+  G + Y++      F P   +    D         K    Y T  
Sbjct: 179 FHPTKRGFEHFYGFRGGARSYFE------FGPNNPVSYPEDRLERGFAHYKESPHYLTQA 232

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
             + AI  I + N+ +P FL+L+  AVH      P  A  E++A+F  +S  +R+  A M
Sbjct: 233 LADEAIKFI-KQNQREPFFLFLSFNAVHT-----PMDANKEDLAQFPQLSG-KRQRVAAM 285

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
              +D  +G V+A+L + G+ EN++++F  DNG PS    +N   N PL G K+   +GG
Sbjct: 286 TLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPS---DTNASDNTPLSGTKANHLEGG 342

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVN 365
           +R    +  P          E     D LPT  AAAG + +  + LDG N
Sbjct: 343 IRVPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKSAVADLDGTN 392


>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 471

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 123/380 (32%), Positives = 185/380 (48%), Gaps = 25/380 (6%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +  F  +   L  + A  + +  K+P+I+ + +DD G+ D  F GS  + TPN+D LA  
Sbjct: 2   KSVFHRVVAALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASE 61

Query: 65  GLILNQHYVQ-ALCTPSRSALMTGKYPIHIG---------MQHGVILEGEPWGLPLTEKL 114
           G+   Q YV  + C PSR+ +MTG+Y    G         M     ++G   G+PL E  
Sbjct: 62  GVRFTQGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAEMGIPLDEVT 121

Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP-- 172
           +  Y+K  GY T   GKWHLG   E++ P  RGFD  YG+  G + Y+ +   A      
Sbjct: 122 MGDYMKSLGYRTAFYGKWHLGGTDELH-PMHRGFDEFYGFRGGDRSYWAYEVNAPERKSA 180

Query: 173 -YQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
            +    + H +    +  G Y TD+  E A N   E    KP F++L+  AVH      P
Sbjct: 181 VFTDKKLEHGIDQFQEHEG-YLTDVLAEKA-NQFIEKAPDKPFFIFLSFNAVHT-----P 233

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
            +A  E++AKF  +   +R+  A M   LD + G V+  L++ G+ ++++V+F  DNG P
Sbjct: 234 MEATPEDLAKFPQLKG-KRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGP 292

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
           +     N  SN+PL G KS   +GG+R    +  P      KV  +     D LPT   A
Sbjct: 293 T---DKNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFKA 349

Query: 352 AGIEINDTSLDGVNQWDVLT 371
            G E   + LDGV+    +T
Sbjct: 350 GGGEEVMSELDGVDLMPYIT 369


>gi|194919165|ref|XP_001983032.1| GG19813 [Drosophila erecta]
 gi|190647643|gb|EDV45031.1| GG19813 [Drosophila erecta]
          Length = 299

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 112/299 (37%), Positives = 164/299 (54%), Gaps = 28/299 (9%)

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           ++ L     LENSIV+F +DNGAPS G+ +N GSN PLRG K+TPW+GG+R   AIWS  
Sbjct: 1   MSTLAATDQLENSIVIFYSDNGAPSVGMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSK 60

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKR--SEILHN 384
           L+    + S+  +++DWLPTL  AAGIE++ +  LDG++ W  L+  A       EILH 
Sbjct: 61  LQARGSIFSQPLYVADWLPTLSHAAGIELDSSLKLDGIDLWPELSGSADAPHVPREILHI 120

Query: 385 IDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFN 444
           +D++       AAL++   KYV GT   G+ D     T  E+D   P+E  Y+       
Sbjct: 121 LDDI----WRVAALQLGQWKYVNGTTAAGRYDSLL--TYRELDDLDPRESRYA------- 167

Query: 445 ALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPC 504
                  +  +++A  +A S   LR  LT ++I   R  A VRC    +   CN  ++ C
Sbjct: 168 -------VTVRNSATSRALSRYDLRR-LTQQRISITRRLAAVRCGDLQRS--CNPLLEEC 217

Query: 505 LFNITDDPCEQNNL--AESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKRADPARWNNIW 561
           L++I+ DPCEQNNL  +E  +D+L  L  ++   +++   PGN+     ADPA     W
Sbjct: 218 LYDISTDPCEQNNLVYSERHSDVLAALRRRVRELRASASRPGNRASMPEADPALHTCAW 276


>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 468

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 145/487 (29%), Positives = 220/487 (45%), Gaps = 55/487 (11%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +  AL   LL ++A     + + P I++I++DD G+ D+S  G +   TP +D LA +G 
Sbjct: 9   FVCALASALLVSNAVAAEKSKRPPSIVLIVSDDQGFADLSCIGDNGCRTPRLDQLAASGT 68

Query: 67  ILNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-----------------GL 108
            L   YV    CTPSR++LMTG+YP   G    +  E   +                 G 
Sbjct: 69  RLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMIRNEAPDYDYLYTPEEYAVTAERILGT 128

Query: 109 PLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA 168
            L E  L   LK+AGY +   GKW  G  +  Y P  RGFD +YG+     DY+ H    
Sbjct: 129 DLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YLPLQRGFDQYYGFANTGVDYFTH---- 183

Query: 169 TFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
             E Y    M  + Q   +  G Y TDL+   AI  I E N  +P FLYL   A H+ + 
Sbjct: 184 --ERYGVPSMFRDNQPTEEDKGTYLTDLFEREAIRFIDE-NHDRPFFLYLPFNAPHSASN 240

Query: 229 YEP-----FQAPDEEVAKF---LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            +       QAP E +  F       +  R+ Y   V R+DE++G V+  L++H + +N+
Sbjct: 241 LDRSIRGFAQAPQEYLDHFPGGESKQEKRRQAYLAAVERMDEAIGKVVDQLQQHQIADNT 300

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           +++F++DN             N PLRG K+  ++GG R    +  P      KVS++   
Sbjct: 301 LIIFLSDN------GGGGGADNSPLRGGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLT 354

Query: 341 ISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
             +  PT+ AA G ++ +D   DG +   VL  GA + R E+          ++   A R
Sbjct: 355 SLEVFPTVIAAIGGKLPDDVIYDGFDMLPVLN-GASSPREEMFWK-------RRGDVAAR 406

Query: 400 VDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAAD 459
           V D K+V      G  D  + D   + D       + +K    F+A   +++     AAD
Sbjct: 407 VGDWKWVDSAAGKGLFDLAH-DIGEKKDLSKEHPEMLAKLKARFDAWTAEME-----AAD 460

Query: 460 PKANSSD 466
           P+    D
Sbjct: 461 PRGPFRD 467


>gi|449135790|ref|ZP_21771228.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448885592|gb|EMB16025.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 476

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 145/414 (35%), Positives = 205/414 (49%), Gaps = 66/414 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP++I+ILADDLG+ D+S +G+  I TP+ID +A  G   N  YV  +C+P+R+ALMTG
Sbjct: 25  QKPNVIVILADDLGYGDLSCYGADDIATPSIDRMAAEGAKFNSFYVSPVCSPTRAALMTG 84

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            +   +G+  GV+      GL   E  LP+ LK+ GYAT  IGKWHLG  ++++ P   G
Sbjct: 85  SHSTRVGI-GGVMFPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NQDMFQPLKHG 142

Query: 148 FDSHYGYWQGL-----QDYYDHS------CKATFEPYQ--GLDMRHNMQV----DNKTIG 190
           FD    YW G      Q YY ++      CK   E Y   G+  R         DN  I 
Sbjct: 143 FD----YWYGTPASNNQFYYPNTKTYALDCKYR-EGYSCDGILKRETAACPLIEDNIVIE 197

Query: 191 IYS-----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
           + +     T  YT   I  I   NK KP F+YLAH   H      P  A +    KF+  
Sbjct: 198 VPADQTQFTQRYTRETIRFIT-RNKDKPFFVYLAHNMPHI-----PLHASE----KFVGS 247

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
           S  +R  Y   +  LD S G ++ +L++ G+  N++V+F +DNG P+ G     GS  PL
Sbjct: 248 S--KRGIYGDTIQELDWSTGEILRSLKELGLDNNTLVIFTSDNG-PNTG---KGGSAGPL 301

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGV 364
           +G KS+  +GG+R       P +      S E     D LPTL   A G+  +D  +DG 
Sbjct: 302 KGGKSSTLEGGVRVPFVARWPGMIPAGTESDEPITGMDLLPTLTRIAVGVVPDDRVIDGK 361

Query: 365 NQWDVLTKGAKTKRSEILHNIDNVDNPQK--YY------AALRVDDLKYVAGTD 410
           + W +LT+ +  K            +P K  YY       A+RV D KY   T+
Sbjct: 362 DIWPLLTRKSDAK------------SPHKAVYYLRGRGVQAIRVGDWKYRVATE 403


>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
           musculus]
 gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
 gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
           musculus]
          Length = 520

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 25  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 85  TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++    + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372

Query: 373 GAKTKR 378
           G    R
Sbjct: 373 GQMMDR 378


>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
          Length = 537

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 181/366 (49%), Gaps = 26/366 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 42  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 101

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 102 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 160

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 161 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 220

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 221 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 269

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 270 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 329

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++    + K
Sbjct: 330 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 389

Query: 373 GAKTKR 378
           G    R
Sbjct: 390 GQMMDR 395


>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
 gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
          Length = 466

 Score =  179 bits (454), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 125/372 (33%), Positives = 192/372 (51%), Gaps = 39/372 (10%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L  +LLF          +KP+I+ IL+DD G+ D  F GS++  TP +D LA   + 
Sbjct: 13  FVLLEVSLLF--------GQQKPNIVFILSDDAGYADFGFQGSTEFKTPELDKLAKKSVK 64

Query: 68  LNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQY 118
             Q YV  A+C PSR+ ++TG+Y    G +   +        L G+  GLP  +  +  Y
Sbjct: 65  FTQAYVSAAVCGPSRAGMLTGQYQQKFGFEENNVPGYMSSSGLLGDQMGLPTNQLTIADY 124

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDY--YDHSCKATFEPYQGL 176
           LK+ GY T   GKWH+G   +++ PT RGFD  YG+  G + Y  YD +   T +    L
Sbjct: 125 LKKQGYTTALFGKWHMG-NADLFHPTKRGFDEFYGFRGGSRSYLPYDKNNVLT-KNEDRL 182

Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           +      +++K    Y TD     AI+ I + NK  P F+YL+  AVH      P +A  
Sbjct: 183 EKGFGNYLEHKG---YLTDKLANEAISFINK-NKKTPFFIYLSFNAVHT-----PMEATK 233

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           E++  F  ++  +R+T A M   +D ++G V+  L+K G+ +N++V+F  DNG PS    
Sbjct: 234 EDLKYFSHLTG-KRKTLAAMTLAMDRAIGTVLDELKKQGLDKNTLVVFTNDNGGPS---D 289

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DWLPT-LCAAAG 353
           +N+ +N+PL G K+   +GG+R        W  + +K ++  + IS  D  PT L  A G
Sbjct: 290 ANQSNNNPLSGTKANHLEGGIR--VPFLMSWPGKLKKNTTYNYPISTLDLFPTFLNVANG 347

Query: 354 IEINDTSLDGVN 365
                  LDGV+
Sbjct: 348 NSSKLKQLDGVD 359


>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
          Length = 492

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 127/377 (33%), Positives = 185/377 (49%), Gaps = 35/377 (9%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALM 85
           P+KP+ II+ ADDLG+ D+S  G   I T N+D +A  G      YV A +CTPSR+ L+
Sbjct: 47  PEKPNFIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLL 106

Query: 86  TGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           TG+ P+  G+   + GV       G+P +E  L + LK+AGYAT  +GKWHLG  +E Y 
Sbjct: 107 TGRLPVRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLG-HKEEYL 165

Query: 143 PTFRGFDSHYGY-----------WQGLQDYYDHSCKA----TFEPYQGLDMRHNMQVDNK 187
           P   GFD ++G            +   QDY+    +       E Y    +R   +++  
Sbjct: 166 PPNHGFDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERP 225

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
                 T  Y + A+  I EH K +P F+YLAH   H      P    DE    F   S 
Sbjct: 226 VNQNTITKRYNDEAVKWIREH-KDEPFFMYLAHSLPHV-----PLFTSDE----FRGTS- 274

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG 307
             R  Y  +V  +D  VG ++  L   G+ EN+IV+F +DNG P      + GS   LR 
Sbjct: 275 -ARGLYGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNG-PWLPTGISGGSAGLLRE 332

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQ 366
            K T W+GGMR     W+P +    KV  ++    D   T  + AG+ + +D  +DGV+ 
Sbjct: 333 GKGTTWEGGMREPTIFWAPGMLPA-KVVMDMGSTLDLFNTFSSLAGVPMPDDREMDGVDL 391

Query: 367 WDVLTKGAKTKRSEILH 383
             +L   A++ R E+ +
Sbjct: 392 SPILFGDAESPRKEMFY 408


>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
          Length = 459

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 144/417 (34%), Positives = 209/417 (50%), Gaps = 39/417 (9%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            F++ C             P  P+I+ IL DDLG+ D+S  G++ + +PNIDALA NG+ 
Sbjct: 20  LFSIGCLAAATGTCYAQERPDAPNILCILVDDLGYGDLSCQGATDLQSPNIDALAANGMR 79

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP--WG-LPLTEKLLPQYLKEAG 123
               Y  + +C+PSR+AL+TG+YP  +G+  GVI +     WG L     L+P  L  AG
Sbjct: 80  FTNFYANSTVCSPSRAALLTGRYPDLVGVP-GVIRQNPENNWGNLADDAVLIPSELNPAG 138

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MRHN 181
           Y T  IGKWHLG   E  TP  RGF    G+   +  DY+DH          G++ MR N
Sbjct: 139 YHTGIIGKWHLG-LEEPDTPNDRGFTYFKGFLGDMMDDYWDHR-------RGGINWMRLN 190

Query: 182 M-QVDNKTIGIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
             ++D K    ++TDL+T+  I+ + E   + +P FLYLA+ A H      P Q P E +
Sbjct: 191 REEIDPKG---HATDLFTDWTIDFLKERQGEEQPFFLYLAYNAPHF-----PIQPPREWL 242

Query: 240 AKFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
            K  +  +P    +R      V  LD SVG V+ AL+  G+ EN++V+F++DNG    G 
Sbjct: 243 DKVRE-REPNLTEKRAKNVAFVEHLDYSVGRVMEALKTTGLEENTLVVFVSDNG----GA 297

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
                SN PLRG K   ++GG+R  A  +          S     + D  PT C  AG +
Sbjct: 298 LWYAQSNGPLRGGKQDMYEGGIRVPAIFYWKGKIAPGTTSDNTALLMDLFPTFCELAGRK 357

Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVD---NPQKYYAALRVDDLKYVAGT 409
             + ++DG++    LT  A+   +  L+ +         Q YYAA R  D K +  T
Sbjct: 358 PPE-NVDGISLVPTLTGQAQDTANRYLYWVRREGGDYGGQAYYAA-RFGDFKILQNT 412


>gi|313216787|emb|CBY38029.1| unnamed protein product [Oikopleura dioica]
          Length = 383

 Score =  178 bits (452), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 90/203 (44%), Positives = 125/203 (61%), Gaps = 8/203 (3%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           K+PH + ILADD+G++D+ +H  ++I TP +D LA  G+ L Q+YVQ +CTP+R  LMTG
Sbjct: 174 KQPHFLFILADDMGYHDIGYH-QAEILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTG 232

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y I  GMQHGV+   +P G+PL EKLLP+ L++ GY T  IGKWHLG F E Y P  RG
Sbjct: 233 RYQIRYGMQHGVVRPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRG 292

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIYSTDLYTEAAI 202
           FD   G++ G QD+Y H+    F    G D R       + I       YST ++ +   
Sbjct: 293 FDHFMGFYTGSQDFYSHN--KCFSGMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELE 350

Query: 203 NVIAEHNKSKPMFLYLAHLAVHA 225
             +++ N S+P F YL+  AVH+
Sbjct: 351 KRLSKMNPSEPSFTYLSFQAVHS 373


>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
 gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
          Length = 460

 Score =  178 bits (452), Expect = 6e-42,   Method: Compositional matrix adjust.
 Identities = 128/369 (34%), Positives = 194/369 (52%), Gaps = 33/369 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP+II+I+ADD GWNDV +HGS  I TPNID LA NG+ LN+ Y    C+PSR++L TG
Sbjct: 33  EKPNIILIIADDAGWNDVEYHGSV-IQTPNIDFLAKNGVELNRFYANPTCSPSRASLFTG 91

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +    +G+    I +   + LP +   LP+ L +  Y T  IGKWHLG  ++   P   G
Sbjct: 92  RPASRMGIV-APISDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLG-LQQSSGPKAYG 149

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD  YG+  G  D Y H  K   + +     R+   +D +    ++TDL T  AI+ ++E
Sbjct: 150 FDYSYGFLHGQIDQYTHLYKNGDKSW----YRNGEFIDEQG---HATDLITNEAIHWLSE 202

Query: 208 HNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD-ISDPERRTYAGMVSRLDESVG 265
              S K  FL +A+ A H      P Q   +    +++ I D  RR +A  +S LD S+G
Sbjct: 203 KRDSNKNFFLEVAYSAPHF-----PLQEEQKWKDPYMNSIKDSSRRDFAAAMSHLDYSIG 257

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPS--------FGIHSNK---GSNHPLRGMKSTPWD 314
            ++  L++  + +N++V+FM+DNGA           G+H +    G N PLR  K++ ++
Sbjct: 258 ILLEKLKQENLDKNTLVIFMSDNGAMENWDSRNEYNGVHPSNTVLGDNSPLRDWKTSNYE 317

Query: 315 GGMRGVAAI--WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           GG+R V  I  W   LK  +  S     I D LP++   AG +    S++G N W  + +
Sbjct: 318 GGIR-VPCIFYWKDHLKNYK--SESYISIIDLLPSILFLAGDKNLPQSIEGKNVWPAIAE 374

Query: 373 GAKTKRSEI 381
                  +I
Sbjct: 375 NKAIANQQI 383


>gi|311748006|ref|ZP_07721791.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126574986|gb|EAZ79344.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 529

 Score =  178 bits (452), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 134/407 (32%), Positives = 196/407 (48%), Gaps = 69/407 (16%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           N   P  P+I+ IL DDLG+NDVSF GS    TPNID +A  G+I    Y   A+C+PSR
Sbjct: 22  NPEPPSNPNIVFILVDDLGYNDVSFMGSEFYETPNIDKVASKGMIFTNGYANSAVCSPSR 81

Query: 82  SALMTGKYPIHIGMQHGV-ILEGEPWG--------LP------LTEKL--LPQYLKEAGY 124
           ++LMTG++    G+   +   +GE W         LP      L++++  LP+ LK+ GY
Sbjct: 82  ASLMTGRFTAVHGITDWIGAPQGEAWRSYKRYSKLLPAEYNHQLSKEITTLPEALKQEGY 141

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYG-------YWQGLQDYYDHSCKATFEPYQGLD 177
            T   GKWHLG   E   PT  GFD + G       Y  G    +++     FE  +G+ 
Sbjct: 142 KTFFAGKWHLGSAEENSLPTDHGFDINKGGYHVGGPYSGGYFSPFENPFMEDFEEEKGIS 201

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           +  +M++  +T              N+  E NK +    YL+  AVHA     P Q   E
Sbjct: 202 L--SMKLAKET--------------NLFIEQNKEEKFLAYLSFYAVHA-----PIQTSKE 240

Query: 238 EVAKFLDISD---------------PERR-----TYAGMVSRLDESVGNVIAALRKHGML 277
           + +K+ + ++               P R+      YAG++ ++DE+VG V+  L K G+ 
Sbjct: 241 KWSKYRNKAEAMGIDSTGFEMERVLPARKYQDNPVYAGLIEQMDEAVGAVVDQLEKLGLS 300

Query: 278 ENSIVLFMADNGAPSFGIHSNKGSN-HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           EN+I++F +DNG    G   N  +N  PLRG K   W+GG R    I+ PWL+   + + 
Sbjct: 301 ENTIIVFTSDNGGVVSG--DNYSTNLDPLRGGKGYQWEGGTRIPYMIYVPWLENNGQRND 358

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
                 D+ PTL   AG+      LDG +   VL  G   +RS   H
Sbjct: 359 TPVSGVDFYPTLLDFAGVNEISQPLDGQSIRPVLEGGTLEERSIYWH 405


>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
          Length = 441

 Score =  178 bits (452), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 109/314 (34%), Positives = 171/314 (54%), Gaps = 21/314 (6%)

Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK 167
           + L    +  YLKE  Y THA GKWHLG+  +  TPT+RGFD+  G + G  +YY+H+  
Sbjct: 1   MDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTT- 59

Query: 168 ATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG 226
                +QG+ D  +  ++D    GI+S D+     +N + + N   P+F Y    A H  
Sbjct: 60  -----FQGMYDWHNGTELDRSASGIHSQDMIEADLLNQL-DKNDGTPLFYY---AAFH-- 108

Query: 227 NTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
           NT+ P     E  A + D+ + P RR Y G+VS +D  +G V+A L++  M +N+ + F 
Sbjct: 109 NTHSPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVAKLKEKDMFDNTYIFFS 168

Query: 286 ADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWL 345
           +DNG     +H  +G N P RG KS+ +DGG    +   SP LK+    +  L HI+DWL
Sbjct: 169 SDNGGD---VH--EGDNSPKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGLLHITDWL 223

Query: 346 PTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNID-NVDNPQKYYAALRVDDLK 404
           PT+   AG E+ D  +DG+ Q D++T    +KR+++++NID  ++     +  +   DL+
Sbjct: 224 PTIVKLAGGEVPD-GIDGIEQVDLITNAGPSKRTDMIYNIDMELETGTPDFGTMAARDLR 282

Query: 405 YVAGTDNNGQSDEW 418
           Y       G+SD +
Sbjct: 283 YKLIWGFEGRSDGY 296


>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
          Length = 383

 Score =  178 bits (452), Expect = 7e-42,   Method: Compositional matrix adjust.
 Identities = 120/386 (31%), Positives = 188/386 (48%), Gaps = 61/386 (15%)

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
           GIYST +YT+    ++A HN +KP+FLY+A+ AVH+     P QAP      +  I +  
Sbjct: 19  GIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININ 73

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGM 308
           RR YA M++ LDE++ NV  AL+ +G  +NSI+++ +DNG  P+ G     GSN PLRG 
Sbjct: 74  RRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQPTAG-----GSNWPLRGS 128

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQW 367
           K T W+GG+R V  + SP LK    V  EL HI+DW PTL + A  +I+ D  LDG + W
Sbjct: 129 KGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWYPTLISLAEGQIDEDIQLDGYDIW 188

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEID 427
           + +++G ++                      RVD L  +       ++  W         
Sbjct: 189 ETISEGLRSP---------------------RVDILHNIDPIYTKAKNGSWA-------- 219

Query: 428 KYSPKEVLYSKAGITFNALKTKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVR 487
                    +  GI   A+++ ++++           SD           +  + F+ + 
Sbjct: 220 ---------AGYGIWNTAIQSAIRVQHWKLLTGNPGYSD----------WVPPQAFSNLG 260

Query: 488 -CNYDNKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNK 546
              + N+    ++     LFNIT DP E+ +L+     ++KQL  +L+ +  T VP    
Sbjct: 261 PTRWHNERVTLSTGKSVWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYP 320

Query: 547 PFDKRADPARWNNIWVPWYDELDKQK 572
           P D R++P     +W PWY E  K+K
Sbjct: 321 PKDPRSNPRLNGGVWGPWYKEEYKKK 346


>gi|431799329|ref|YP_007226233.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430790094|gb|AGA80223.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 454

 Score =  178 bits (451), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 186/368 (50%), Gaps = 38/368 (10%)

Query: 13  CTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY 72
           C++L     +  T  + P+II+I ADDLG+ D+  +G   I TP +D +A  G+   Q Y
Sbjct: 18  CSVL-----VAQTPKRPPNIIVIFADDLGYGDLGCYGHPTIQTPYLDRMAKEGMRFTQFY 72

Query: 73  VQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           V A +CTPSR+AL+TG+ PI  GM   + GV+      GLP TE  +   LK AGY T  
Sbjct: 73  VGADVCTPSRAALLTGRLPIRYGMAGEERGVLFPDSSKGLPHTEATMASALKAAGYRTGI 132

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNK 187
           +GKWHLG   E Y PT  GFD ++G  +        H+       Y+  ++  ++ +D +
Sbjct: 133 VGKWHLGHLPE-YLPTTHGFDFYFGIPYSNDMRPNPHNHVPPLPLYRNEEVI-DLDIDQR 190

Query: 188 TIGIYSTDLYTEAAINVIAEHNKSKPMFLY----LAHLAVHAGNTYEPFQAPDEEVAKFL 243
            +    T  YT+ AI  I E N+ +P FLY      H+ ++A + +E             
Sbjct: 191 QL----TKRYTKEAIRFI-EENQERPFFLYYPNNFPHVPLYASSDFE------------- 232

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
                +R  Y  +VS LD SVG ++  L+   + EN++V+F +DNG P        GS  
Sbjct: 233 --GKSKRGIYGDVVSELDWSVGRILEKLKALDLAENTLVIFTSDNG-PWLSQKEKGGSAG 289

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
            L   K++ ++GGMR  A  W P   +  ++S+ L    D  PT    AG+ +  D +LD
Sbjct: 290 LLFEGKASTYEGGMRVPAIAWWPGTIKPNQISTALVASMDLYPTFLTMAGLNMPQDKTLD 349

Query: 363 GVNQWDVL 370
           G N   +L
Sbjct: 350 GDNILPIL 357


>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
 gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  177 bits (450), Expect = 9e-42,   Method: Compositional matrix adjust.
 Identities = 120/366 (32%), Positives = 181/366 (49%), Gaps = 26/366 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 25  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 85  TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GF+  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 144 PQFHPLKHGFNEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++    + K
Sbjct: 313 EGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTMLK 372

Query: 373 GAKTKR 378
           G    R
Sbjct: 373 GQMMDR 378


>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
 gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
          Length = 471

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 118/330 (35%), Positives = 171/330 (51%), Gaps = 24/330 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           ++P+II ILADDLG  D+  +G + I TPN+D LA  G+   Q Y   A+C+ +R AL+T
Sbjct: 31  EQPNIIFILADDLGHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALIT 90

Query: 87  GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G+Y   +  G++  +    +  GLP T   LP  LK+ GY T  IGKWHLG+    + P 
Sbjct: 91  GRYQYRLRGGLEEPIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGYL-PTFGPL 149

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
             G+DS +G + G  DY+ H  K    P    D+ +  +V    IG Y TDL    A++ 
Sbjct: 150 KSGYDSFFGNYGGAIDYFTH--KPGVGPQVKEDL-YEGEVPVHQIGYY-TDLLGARAVDF 205

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAKFLD----ISDPERRTYAGMVSR 259
           + +    KP  L L + A H      P++ P DE +++ +D        +  TY  MV  
Sbjct: 206 VQKQQAGKPFLLSLHYTAPH-----WPWEGPGDEAISRDIDNIFHYDGGDLATYGRMVES 260

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD S+G V+ AL   G+ EN+IV+F +DNG   F          PL G K+   +GG+R 
Sbjct: 261 LDASIGRVLKALEDKGLAENTIVIFTSDNGGERF------SKTWPLTGQKTELLEGGIRV 314

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
            A +  P      +VS ++    DWLPTL 
Sbjct: 315 PAIVRWPAKIAAGQVSEQVAISMDWLPTLL 344


>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 121/367 (32%), Positives = 180/367 (49%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 24  APQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAAL 83

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  
Sbjct: 84  LTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH- 142

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  K     Y+  +M         ++ KT     
Sbjct: 143 RPQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANL 202

Query: 194 TDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T LY + A++ I  +H +  P FLY A  A HA         P     +FL  S   R  
Sbjct: 203 TQLYLQEALDFIRTQHARQGPFFLYWAIDATHA---------PVYASRQFLGTS--LRGR 251

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   V  +D+SVG ++  L+  G+ +N+ V F +DNGA      +  GSN P    K T 
Sbjct: 252 YGDAVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTT 311

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++    + 
Sbjct: 312 FEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLDLLPTML 371

Query: 372 KGAKTKR 378
           KG    R
Sbjct: 372 KGQMMDR 378


>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
          Length = 522

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 121/367 (32%), Positives = 178/367 (48%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 26  APQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAAL 85

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP  LK AGYA+  +GKWHLG  
Sbjct: 86  LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH- 144

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 145 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 204

Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T +Y + A+  I     + +P FLY A  A HA         P      FL  S  +R  
Sbjct: 205 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA---------PVYASKPFLGTS--QRGR 253

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   +  LD+SVG ++  L+  G+ EN+ V F +DNGA         GSN P    K T 
Sbjct: 254 YGDAIRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 313

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG+E   D ++DG++    + 
Sbjct: 314 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 373

Query: 372 KGAKTKR 378
           +G  T R
Sbjct: 374 QGRLTDR 380


>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
 gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
 gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
 gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_c [Homo
           sapiens]
          Length = 522

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  GL+    Y    LC+PSR+
Sbjct: 25  SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 85  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390


>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
          Length = 522

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  GL+    Y    LC+PSR+
Sbjct: 25  SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRA 84

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 85  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390


>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
          Length = 328

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 177/350 (50%), Gaps = 49/350 (14%)

Query: 31  HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN-GLILNQHYVQALCTPSRSALMTGKY 89
           H + I+ADDLG+ D+ +     I TPNID LA   G+IL+  Y    C+PSR++ +TG+ 
Sbjct: 1   HHLQIMADDLGYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRV 59

Query: 90  PIHIGMQHGVILEGEPWGLP------LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           PIH+        +G P GL       L   ++P  L+ AGY T  +GKWH G       P
Sbjct: 60  PIHV-------HQGNP-GLAAAGCTNLNYTMIPAQLRRAGYRTAMVGKWHQGASLPECLP 111

Query: 144 TFRGFDSHYGYWQGLQDYYDHS-----CKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
             RGFD+ +GY  G +D+ D +     C  T F    G  +R N        G YS+  Y
Sbjct: 112 VNRGFDTSFGYLSGEEDHMDQTTNGGQCNVTDFWLDSGPAIRRN--------GTYSSFQY 163

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL-----------DIS 246
            +A +++I +H   +P+ LY A   VH      P++ P+   AKF              S
Sbjct: 164 NDAIVDIIQQHAPEQPLMLYAALQNVHG-----PYEVPERYRAKFPADKNCTETGDNPTS 218

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG-SNHPL 305
              R     MV+ LD+ VGN+  AL+ +GM  N++++F +DNG    G   +   +NHPL
Sbjct: 219 QCTRDFMLAMVNVLDDIVGNITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMNNHPL 278

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS--ELFHISDWLPTLCAAAG 353
           RG K   ++GG+R  A + SP L ++ + +    L HISDW  T C  AG
Sbjct: 279 RGGKVAYFEGGVRTAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328


>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
 gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
          Length = 452

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 131/396 (33%), Positives = 197/396 (49%), Gaps = 34/396 (8%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           AL+C   F  A L     K+P+++II  DD G  DV+ +G+  + TPNID LA  G++ +
Sbjct: 11  ALSC---FMAAPLFAQQQKRPNVLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFS 67

Query: 70  QHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
           Q Y  A +C+PSR++L+TG+YP    + +    E    G+P ++  + +  K+ GY T  
Sbjct: 68  QFYAAAPVCSPSRASLLTGRYPQRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAH 127

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           IGKWH+G+  E   P  +GFD  +G+  G  D Y H        Y     RH++  + + 
Sbjct: 128 IGKWHIGYSPET-MPNQQGFDYSFGFMGGCIDNYSHYF------YWAGPNRHDLWRNGQE 180

Query: 189 I---GIYSTDLYTEAAINVIAEHNKS--KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
           I   G +  DL T   +N   E NK   KP FLY A    H      P Q  ++    + 
Sbjct: 181 IWEDGKFFADL-TVQEVNGFLEKNKRADKPFFLYWAINMPHY-----PLQGQEKWRQYYK 234

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNK 299
           D+  P RR YA  VS +DE +G V+  L + G+ EN+IV+F +D G      SFG     
Sbjct: 235 DLPAP-RRMYAAAVSTMDEKIGQVLQQLDRLGLAENTIVVFQSDQGHSTEDRSFG---GG 290

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           G   P RG K + ++GG+R  A I W+  L + + V  +L    DW PTL     + +  
Sbjct: 291 GFTGPYRGAKFSLFEGGIRVPAIIRWTGHLPKNE-VRDQLCVNIDWYPTLAGLCKVALPQ 349

Query: 359 TSLDGVNQWDVLTKGAKTKRSEIL--HNIDNVDNPQ 392
             +DG +   V+T    +   +I    +    +NPQ
Sbjct: 350 RKIDGKDIQQVITSSKTSSPHDIFFWQSQGTKENPQ 385


>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
 gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
          Length = 452

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/356 (34%), Positives = 182/356 (51%), Gaps = 26/356 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
           + + + P+I+ ILADDLGW D+S +G +   TP++DALA  G+   Q Y   A+C+ +R 
Sbjct: 10  SASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRF 69

Query: 83  ALMTGKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           AL+TG+Y   +  G++  ++ +    GLP     LP  L++AGY T  IGKWHLG     
Sbjct: 70  ALITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSL-PT 128

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           + P   G+D  +G + G  DY+ H         + L   +  +V  + +G Y T L  + 
Sbjct: 129 FGPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDL---YEGEVPVERVGYY-TQLLADE 184

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVA----KFLDISDPERRTYAG 255
           A   I E + +KP FL L   A H      P+  P DEE++               TYA 
Sbjct: 185 ATRWIGERSAAKPFFLSLHFTAPHW-----PWVGPEDEEISLGLKDLFHYDGGNLATYAR 239

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV  LD++VG V+ AL+  G+ +N+IV+F +DNG   F          P  G K+   +G
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGGERF------SKTWPFTGQKTELLEG 293

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
           G+R    +  P  +   +VS ++    DWLPTL AAAG+  + D   DG N   VL
Sbjct: 294 GIRVPTLLRWP-ARIAPQVSDQVTATMDWLPTLLAAAGVAPHADYPPDGENILPVL 348


>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_a [Homo
           sapiens]
          Length = 708

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  GL+    Y    LC+PSR+
Sbjct: 25  SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRA 84

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 85  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T
Sbjct: 253 RYGDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L
Sbjct: 313 TFEGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390


>gi|440715400|ref|ZP_20895947.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
 gi|436439744|gb|ELP33158.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
          Length = 1004

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 200/419 (47%), Gaps = 33/419 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T    KP+ I+IL DD G+ D+S  G+  + TP ID +A  G  L   YV A +CTPSR+
Sbjct: 557 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 616

Query: 83  ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            LMTG YP  I    G   GV+L G+P GL   E  + + LK AGY T   GKWHLG   
Sbjct: 617 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFGKWHLGDQP 676

Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           E + PT +GFD  +G  Y   +  ++       F     L     +++D      + T  
Sbjct: 677 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNHYHFPSLPLLQNDTVIEMDPDA--DFLTKR 733

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPF-QAPDEEVAKFLDISD------P 248
            TE A++ I E NK +P FLYL H   HA  +   PF +   ++V   ++  D       
Sbjct: 734 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAAIEKEDGNIDYAT 792

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
               +   ++ +D SVG ++ ALR +G+ E ++VLF +DNG P   ++++ G    LRG 
Sbjct: 793 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 849

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
           K T ++GGMR    +  P        + EL    D LPT    AG     D  +DG + W
Sbjct: 850 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAATPTDRVIDGKDIW 909

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
             L    +T      ++  N        AA+R    K      NNG + + Y D +N++
Sbjct: 910 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 958



 Score =  122 bits (307), Expect = 4e-25,   Method: Compositional matrix adjust.
 Identities = 113/379 (29%), Positives = 165/379 (43%), Gaps = 70/379 (18%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
           +  A + P++++I  DDLG+ D+  +G++++ TPNID LA  G    + H   A+CTPSR
Sbjct: 25  SVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 84

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWG-LPLTE--------KLLPQYLKEAGYATHAIGKW 132
             L+TG+YP+      G+      WG LP T         K + +  K  GYAT  +GKW
Sbjct: 85  YGLLTGQYPVRAMGGQGI------WGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKW 138

Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
           HLGF  E           P   GFD ++G         +  + D     YD S    +  
Sbjct: 139 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 198

Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
               P        +++  N+  G      IY  +    L TE A+  I E  K +P FLY
Sbjct: 199 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKDEPFFLY 257

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
            A   +H   T  P         +F   S  +   Y   V  LD  VG ++ +L  +G+ 
Sbjct: 258 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 306

Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
           +N++VLF +DNG     A    I +    N  L G K   W+GG R  + A W   +K  
Sbjct: 307 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 366

Query: 332 QKVSSELFHISDWLPTLCA 350
            + S +L    D   T  A
Sbjct: 367 TQ-SDQLISQVDLFATFSA 384


>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
 gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
          Length = 460

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 142/404 (35%), Positives = 192/404 (47%), Gaps = 36/404 (8%)

Query: 1   MTWARKYFFALTCTLLFNDAFL-NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           MT   KY   L    L    F     A K P+I+IILADDLG+ D+  +G   I TPN+D
Sbjct: 1   MTRISKYL--LVVAFLIAATFAPGVFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLD 58

Query: 60  ALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQ---HGVILEGEPWGLPLTEKLL 115
            +A  G+     YV A +CTPSR+ LMTG++PI  GM    + V+      GLP  E  +
Sbjct: 59  RMAAEGIRFTDFYVAANVCTPSRAGLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITI 118

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-----WQGLQDYYDHSCKATF 170
              LK  GYAT  IGKWHLG   + + PT  GFD  YG       + ++     +  ++ 
Sbjct: 119 AAALKSKGYATACIGKWHLGHELQ-FLPTHHGFDYFYGLRFSNDMEPVRGKIPKNASSSL 177

Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDL------YTEAAINVIAEHNKSKPMFLYLAHLAVH 224
            P   L+  ++  + N  I    TDL      YTE AI  I   NK +P FLY  H   H
Sbjct: 178 HP--KLEWWNSALLQNDKILEQPTDLSTLTRRYTEEAIKFI-HQNKKQPFFLYFPHTFPH 234

Query: 225 AGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
                 P  A D     F + S   R  Y  +V  LD SVG V+  LRK G+ EN++V F
Sbjct: 235 V-----PLFASD----AFKEHS--ARGLYGDVVEELDWSVGQVLNTLRKEGLAENTLVFF 283

Query: 285 MADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
            +DNG P        GS  PL+  K + W+GGMR  A  W P  K    ++ EL    D 
Sbjct: 284 TSDNG-PWLIRDLAGGSAGPLKDGKGSTWEGGMREPAIAWWPG-KIKPAINHELVCSLDL 341

Query: 345 LPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDN 387
             T    AG+ +  D  +DG++   +L     ++R  + +   N
Sbjct: 342 FTTSLCLAGVALPPDRVIDGLDMRPMLFGTGHSQRQLMFYYKKN 385


>gi|150008689|ref|YP_001303432.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937113|gb|ABR43810.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 452

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 130/380 (34%), Positives = 184/380 (48%), Gaps = 26/380 (6%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           KY   +T  L   D     + P KP+II+I  DD+G+ D+S  GS  I TPNID +A  G
Sbjct: 5   KYILYITGVLASYDGH---SQPTKPNIIVINCDDMGYGDLSCFGSPTIKTPNIDRMAIEG 61

Query: 66  LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
              +  YV A + +PSR+ L+TG+  +  GM   Q  V+      GLP  E  + + LK+
Sbjct: 62  QKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQ 121

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA--TFEPYQGLDMR 179
           AGY T  IGKWHLG   E Y P   GFD  YGY            K   T  PY+ +   
Sbjct: 122 AGYHTACIGKWHLGHLPE-YMPLRHGFDYFYGYPYSNDMSRKEQIKLGNTKYPYEYIIYE 180

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
              +++ +      T   TEAAI  I + N++ P FLYLAH   H      P  A  +  
Sbjct: 181 QEKELEREPQQYNLTQQVTEAAIRYI-KSNENSPFFLYLAHPMPHM-----PVYASTDFQ 234

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            K        R  Y   V  LD SVG ++  L+  G+ +N++V+F +DNG P        
Sbjct: 235 GK------SARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG-PWLLCKQEG 287

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-ND 358
           GS  PL+  K++ ++GG R    +W   +K      +++    D LPT C  AGI + +D
Sbjct: 288 GSPGPLKDGKASMFEGGFRVPCIMWGAMVKPG--YITDMASTLDLLPTFCEIAGIPLPSD 345

Query: 359 TSLDGVNQWDVLTKGAKTKR 378
              DG++  +VL   +  KR
Sbjct: 346 RHYDGISLLNVLKDKSTCKR 365


>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
 gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
           norvegicus]
          Length = 524

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
             L   AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+
Sbjct: 22  GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81

Query: 79  PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           PSR+AL+TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GK
Sbjct: 82  PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
           WHLG  R  + P   GFD  +G        YD+  K     Y+  +M         ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200

Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
           T     T LY + A++ I  +H +  P FLY A  A HA         P     +FL  S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
              R  Y   V  +D+SVG +++ L+  G+ +N+ V F +DNGA         GSN P  
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
             K T ++GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369

Query: 366 QWDVLTKGAKTKR 378
               + +G    R
Sbjct: 370 LLPTMLQGHIIDR 382


>gi|449132446|ref|ZP_21768529.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448888348|gb|EMB18668.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 1001

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 131/375 (34%), Positives = 187/375 (49%), Gaps = 24/375 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T    KP+ I+IL DD G+ D+S  G   + TP ID +A  G  L   YV A +CTPSR+
Sbjct: 554 TAGTTKPNFIVILTDDQGYGDLSCFGGKHVATPRIDQMAAEGSRLTSFYVAAPVCTPSRA 613

Query: 83  ALMTGKYPIHIGM----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            LMTG YP  IGM      GV+L G+  GL   E  + + LK  GY T   GKWHLG   
Sbjct: 614 GLMTGCYPKRIGMAMGSNFGVLLAGDRKGLHPNEITIAEVLKTVGYRTGMFGKWHLGDQP 673

Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           E + PT +GFD  +G  Y   +  ++       F P   L+  +N  V+ +      T  
Sbjct: 674 E-FLPTKQGFDEFFGLPYSHDIHPFHPRQNHYQFPPLPLLE--NNTVVEMEPNADLLTKR 730

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQ---APDEEVAKFLDISDPERRT 252
            TE A++ I + NK +P FLY+ H   HA  +   PF    A D  VA   +  + +  T
Sbjct: 731 ITERAVSFI-DRNKDQPFFLYIPHPIPHAPLHVSPPFMKDVADDVVVAINKEDGNIDYAT 789

Query: 253 YAGM----VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            A +    ++ +D SVG ++ AL+ +G+ E ++VLF +DNG P   + ++ G    LRG 
Sbjct: 790 RANLFHQAIAEIDWSVGQILDALKSNGLDEKTMVLFTSDNGPPKNTLFASAGQ---LRGH 846

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
           K T ++GG+R    +  P      + ++EL    D LPT    AG  I  D  +DG + W
Sbjct: 847 KGTAFEGGVREPTVVRWPGQVPAGQENNELMSTMDLLPTFAKLAGAAIPTDRVIDGKDIW 906

Query: 368 DVLTKGAKTKRSEIL 382
             L KG      E+ 
Sbjct: 907 TTL-KGETQSPHEVF 920



 Score =  119 bits (299), Expect = 3e-24,   Method: Compositional matrix adjust.
 Identities = 113/379 (29%), Positives = 169/379 (44%), Gaps = 70/379 (18%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
           +  A + P++++I  DDLG+ D+S +G++++ TPNID LA  G    + H   A+CTPSR
Sbjct: 22  SAAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDQLAAEGRRFTDAHSASAVCTPSR 81

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQYLKEAGYATHAIGKW 132
             L+TG+YP+      GV      WG LP T  L        + +  K+ GYAT  +GKW
Sbjct: 82  YGLLTGQYPVRAMGGQGV------WGPLPTTSGLIIDTNTQTIGKVFKDKGYATACLGKW 135

Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
           HLGF  E           P   GFD ++G         +  + D     YD +    +  
Sbjct: 136 HLGFKEEPNDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPNDPLVYGG 195

Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
               P        +++  N+  G      IY  +    L T+ A+  I+E  K +  FLY
Sbjct: 196 KPVSPTPTFTAEASIKSPNRFSGALKAHEIYDDEKTGTLLTDRAVKWISE-KKDELFFLY 254

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
            A   +H   T  P         +F   S  +   Y   V  LD  VG ++ +L  +G+ 
Sbjct: 255 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVKSLEANGLT 303

Query: 278 ENSIVLFMADNGA--PSFGIHSNKGSNHP---LRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
           +N++VLF +DNGA     G  + K  + P   L G K   W+GG R  + A W   +K  
Sbjct: 304 DNTLVLFTSDNGAMLNRAGRDAVKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKPG 363

Query: 332 QKVSSELFHISDWLPTLCA 350
            + S +L    D   T  A
Sbjct: 364 TQ-SDQLISQVDLFATFSA 381


>gi|256841279|ref|ZP_05546786.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|262383570|ref|ZP_06076706.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|423330544|ref|ZP_17308328.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
           CL03T12C09]
 gi|256737122|gb|EEU50449.1| arylsulfatase A [Parabacteroides sp. D13]
 gi|262294468|gb|EEY82400.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|409232160|gb|EKN25008.1| hypothetical protein HMPREF1075_00341 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 130/380 (34%), Positives = 184/380 (48%), Gaps = 26/380 (6%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           KY   +T  L   D     + P KP+II+I  DD+G+ D+S  GS  I TPNID +A  G
Sbjct: 5   KYILYITGVLASYDGH---SQPTKPNIIVINCDDMGYGDLSCFGSPTIKTPNIDRMAIEG 61

Query: 66  LILNQHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKE 121
              +  YV A + +PSR+ L+TG+  +  GM   Q  V+      GLP  E  + + LK+
Sbjct: 62  QKWSSFYVSASVSSPSRAGLLTGRLGVRTGMYGDQRRVLFPDSKGGLPSEELTIAELLKQ 121

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKA--TFEPYQGLDMR 179
           AGY T  IGKWHLG   E Y P   GFD  YGY            K   T  PY+ +   
Sbjct: 122 AGYHTACIGKWHLGHLPE-YMPLRHGFDYFYGYPYSNDMSRKEQIKLGNTKYPYEYIIYE 180

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
              +++ +      T   TEAAI  I + N++ P FLYLAH   H      P  A  +  
Sbjct: 181 QEKELEREPQQYNLTQQVTEAAIRYI-KSNENSPFFLYLAHPMPHM-----PVYASTDFQ 234

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            K        R  Y   V  LD SVG ++  L+  G+ +N++V+F +DNG P        
Sbjct: 235 GK------SARGKYGDTVEELDWSVGQILQTLKSEGLDKNTLVIFTSDNG-PWLLCKQEG 287

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-ND 358
           GS  PL+  K++ ++GG R    +W   +K      +++    D LPT C  AGI + +D
Sbjct: 288 GSPGPLKDGKASMFEGGFRVPCIMWGAMVKPG--YITDMASTLDLLPTFCEIAGIPLPSD 345

Query: 359 TSLDGVNQWDVLTKGAKTKR 378
              DG++  +VL   +  KR
Sbjct: 346 RHYDGISLLNVLKDKSTCKR 365


>gi|229822457|ref|YP_002883983.1| Cerebroside-sulfatase [Beutenbergia cavernae DSM 12333]
 gi|229568370|gb|ACQ82221.1| Cerebroside-sulfatase [Beutenbergia cavernae DSM 12333]
          Length = 458

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 121/328 (36%), Positives = 172/328 (52%), Gaps = 28/328 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P+I++I ADDLG+ D+  +GS +  TP++D LA  G+ L   Y+ + +C+PSR  ++T
Sbjct: 3   QRPNIVLINADDLGYGDLGCYGSMRNDTPHLDRLAAEGVRLTDFYMASPVCSPSRGGMLT 62

Query: 87  GKYPIHIG----MQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           G YP  IG    +   V+  G+P GL   E+ + + L +AGYAT AIGKWH G   E + 
Sbjct: 63  GCYPPRIGFGEFVGRPVLFPGDPVGLDPAERTMARVLGDAGYATAAIGKWHCGDQPE-FL 121

Query: 143 PTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           PT  GFDS++G  +   +    +H          G  +    Q D +++    T+ YT A
Sbjct: 122 PTRHGFDSYFGIPFSNDMGRQREHEDWPPLPLMSGESVVQE-QPDQRSL----TERYTVA 176

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A   I E N  +P FLYLAH+ VH      P   P    A FL  S      Y G V+ L
Sbjct: 177 ATRFI-EENAHQPFFLYLAHMYVHV-----PLFVP----APFLAAS--RNGGYGGAVAAL 224

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D S G V+  LR+ G+ EN+IV+F +DNG+ + G     GSN PLRG K+  W+GG R  
Sbjct: 225 DWSTGVVMDTLRRLGLEENTIVVFTSDNGSRARG---EGGSNDPLRGHKAQTWEGGQRVA 281

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTL 348
             +  P       V   +    D LPT 
Sbjct: 282 CVVRWPAAIPAGGVCDAVTRSIDLLPTF 309


>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 471

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 124/380 (32%), Positives = 187/380 (49%), Gaps = 24/380 (6%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +      C L    + L      +P++I+I++DD GW+DV F+G  +IPTP++DALA +G
Sbjct: 4   RLVLVCLCVLGVLFSLLPEANAGQPNVIVIVSDDQGWSDVGFNGCKEIPTPHLDALAKSG 63

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ-----HGVILEGEPWGLPLTEKLLPQYL 119
           +  +  Y     C+PSR+ L+TG+Y    G +     HG     E  GLPL+E LL    
Sbjct: 64  VAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNPGAHGNDDAIEMEGLPLSETLLSTVF 123

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
           + AGY T AIGKWHLG   + + PT RGF+  +G+  G   Y+    K    P  G+ +R
Sbjct: 124 RNAGYRTGAIGKWHLGDEPQFW-PTERGFEEWFGFSGGGLSYWGDLGKKP--PLHGV-LR 179

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
           +   V    +   + D  TEA   V  E N+++P FLYLA+ A HA     P Q   E +
Sbjct: 180 NGDVVPKDELTYLTDDFSTEAVKFV--EENRARPFFLYLAYNAPHA-----PDQVTREHL 232

Query: 240 AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK 299
            K   I    R  Y  MV+ +DE +G V+ AL    + + +++ F +DNG       +  
Sbjct: 233 KKTSHIEYGGRAVYGAMVAAMDEGIGRVVQALDDQNLRDETVIFFYSDNGG-----RATL 287

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
             N P RG K   ++GG+R    + W   ++  ++    +  +  +   L AA       
Sbjct: 288 ARNFPYRGHKGMLFEGGIRVPFCVSWPEQIEGGRRYEPPVTALDIFPTVLAAAGVDAPAA 347

Query: 359 TSLDGVNQWDVLTKGAKTKR 378
             LDGVN    L  G+K +R
Sbjct: 348 LQLDGVNLLPYLA-GSKQRR 366


>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
          Length = 274

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 99/226 (43%), Positives = 134/226 (59%), Gaps = 18/226 (7%)

Query: 64  NGLILN-QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
           NG  L  +H++  L      A++T K  IH G+QH +I   +P  +PL EKLLPQ LKEA
Sbjct: 2   NGCTLTLEHFLMEL------AILTLK--IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEA 53

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHS----CKATFEPYQGLDM 178
           GY TH +GKWHLG +R+   PT RGFD+++GY  G +DYY H       A       LD 
Sbjct: 54  GYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLINALNVTRCALDF 113

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
           R   +V      +YST+++T+ A  +I  H   KP+FLYLA  +VH     EP Q P+E 
Sbjct: 114 RDGEEVATGYENMYSTNIFTQRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEY 168

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLF 284
           +  +  I D  R  YAGMVS +DE+VGNV AAL+ HG+  N++ +F
Sbjct: 169 LKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSHGLWNNTVFIF 214


>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
           variabilis ATCC 29413]
 gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
           ATCC 29413]
          Length = 457

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 122/380 (32%), Positives = 199/380 (52%), Gaps = 37/380 (9%)

Query: 16  LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-Q 74
           LF+ A   T    +P+++ IL DD+GW D+S +G +   TPN+D LA  G+     Y  Q
Sbjct: 31  LFSRA---TAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQ 87

Query: 75  ALCTPSRSALMTGKYP--IHIGMQHGVILEGEP----WGLPLTEKLLPQYLKEAGYATHA 128
            +CTP+R A +TG+Y   + +G++  +    +P     G+P  +  +   LK  GY T  
Sbjct: 88  TVCTPTRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETAL 147

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           +GKWH G +   + P  +GFD ++G+  G  +Y+ H+          LD+  N  V  + 
Sbjct: 148 VGKWHAG-YPPNFGPLQKGFDEYFGHLSGGIEYFTHTGTDRI-----LDLYEN-DVPVQR 200

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD- 247
            G Y TDL+T+ A+  I +   S+P +L L + A H      P+Q P+++ +    +++ 
Sbjct: 201 SG-YVTDLFTDRAVEFI-QRPHSRPFYLSLHYNAPH-----WPWQGPNDQASTAFYLTNG 253

Query: 248 ----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
                 + TYA MV  LD+ VG V+ AL   G  +N++V+F +DNG   F   SN G   
Sbjct: 254 YTVGGSQATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGGERF---SNFG--- 307

Query: 304 PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLD 362
           P RG K++ ++GG+R  A I  P + Q  +VS+++    D   T+ AA G   + +   D
Sbjct: 308 PFRGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATGTSFHPNYPPD 367

Query: 363 GVNQWDVLTKGAKTKRSEIL 382
           G N   +L +G +++ S  L
Sbjct: 368 GQNLLPLL-RGDRSEFSRTL 386


>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
 gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
          Length = 482

 Score =  176 bits (447), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 118/352 (33%), Positives = 179/352 (50%), Gaps = 27/352 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSA 83
           T+P  P+I+ IL+DD G+ D  F GS  + TPN+D LA  G +  Q YV  A+C PSR+ 
Sbjct: 35  TSP--PNILFILSDDAGYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAG 92

Query: 84  LMTGKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           ++TGKY    G +   +        L G+  GLPL +K +  YL+E GY T  IGKWH G
Sbjct: 93  ILTGKYQQRFGYEENNVPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQG 152

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE-PYQGLDMRHNMQVDNKTIGIYST 194
              + + PT RGFD  YG+  G + Y+    +     P   L+       ++K    Y T
Sbjct: 153 -NADRFHPTKRGFDEFYGFRGGARSYFGFGAQNPVSYPEDKLERGFAHFQESKR---YLT 208

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
           +      +  I + N+  P F++L+  AVH      P +A   ++A+F ++   +R+  A
Sbjct: 209 EALATETVEFI-KRNQKHPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLA 261

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            M   +D  +G V+  L + G+ +N++V+F  DNG PS    +N  +N PL G K+   +
Sbjct: 262 AMTLSMDREIGKVLNTLDELGLSDNTLVVFTNDNGGPS---DTNASNNGPLSGTKANHLE 318

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT-LCAAAGIEINDTSLDGVN 365
           GG+R    +  P  K +Q          D LPT   AA G E    ++DGV+
Sbjct: 319 GGIRVPFIMRWPAGKVSQGEYPNPISTLDLLPTFFSAAGGNEQQLAAIDGVD 370


>gi|417305609|ref|ZP_12092563.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327538042|gb|EGF24732.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 1004

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 141/419 (33%), Positives = 201/419 (47%), Gaps = 33/419 (7%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T    KP+ I+IL DD G+ D+S  G+  + TP ID +A  G  L   YV A +CTPSR+
Sbjct: 557 TAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAAPVCTPSRA 616

Query: 83  ALMTGKYPIHI----GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
            LMTG YP  I    G   GV+L G+P GL   E  + + L  AGY T   GKWHLG   
Sbjct: 617 GLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLNTAGYRTGMFGKWHLGDQP 676

Query: 139 EVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           E + PT +GFD  +G  Y   +  ++       F     L     +++D      + T  
Sbjct: 677 E-FLPTKQGFDEFFGIPYSHDIHPFHPRQNXYHFPSLPLLQNDTVIEMDPDA--DFLTKR 733

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQ---APDEEVAKFLDISDPERRT 252
            TE A++ I E NK +P FLYL H   HA  +   PF    A D   A   +  + +  T
Sbjct: 734 LTEQAVSFI-ERNKDQPFFLYLPHPIPHAPLHASPPFMEDVADDVIAAIEKEDGNIDYAT 792

Query: 253 YAGM----VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
            A +    ++ +D SVG ++ ALR +G+ E ++VLF +DNG P   ++++ G    LRG 
Sbjct: 793 RANLFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPPKNTLYASPGE---LRGH 849

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQW 367
           K T ++GGMR    +  P        + EL    D LPT    AG  I  D  +DG + W
Sbjct: 850 KGTTFEGGMREPTVVRWPGQIPAGHQNDELMTAMDLLPTFAKLAGAAIPTDRVIDGKDIW 909

Query: 368 DVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEI 426
             L    +T      ++  N        AA+R    K      NNG + + Y D +N++
Sbjct: 910 PTLKGETQTPHDAFFYHRGN------QLAAVRSGKWKLHV---NNGVAKQLY-DLENDL 958



 Score =  123 bits (308), Expect = 3e-25,   Method: Compositional matrix adjust.
 Identities = 114/379 (30%), Positives = 165/379 (43%), Gaps = 70/379 (18%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
           +  A + P++++I  DDLG+ DV  +G++++ TPNID LA  G    + H   A+CTPSR
Sbjct: 25  SVAAERPPNVVLIFVDDLGYGDVGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSR 84

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWG-LPLTEKL--------LPQYLKEAGYATHAIGKW 132
             L+TG+YP+      G+      WG LP T  L        + +  K  GYAT  +GKW
Sbjct: 85  YGLLTGQYPVRAMGGKGI------WGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKW 138

Query: 133 HLGFFREVY--------TPTFRGFDSHYGY--------WQGLQD----YYDHSCKATF-- 170
           HLGF  E           P   GFD ++G         +  + D     YD S    +  
Sbjct: 139 HLGFKEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGG 198

Query: 171 ---EPYQGLDMRHNMQVDNKTIG------IYSTD----LYTEAAINVIAEHNKSKPMFLY 217
               P        +++  N+  G      IY  +    L TE A+  I E  K +P FLY
Sbjct: 199 KPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITE-KKDEPFFLY 257

Query: 218 LAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGML 277
            A   +H   T  P         +F   S  +   Y   V  LD  VG ++ +L  +G+ 
Sbjct: 258 FATPNIHHPFTPAP---------RFKGTS--QCGLYGDFVHELDWMVGEIVQSLEDNGLT 306

Query: 278 ENSIVLFMADNG-----APSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQT 331
           +N++VLF +DNG     A    I +    N  L G K   W+GG R  + A W   +K  
Sbjct: 307 DNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIKAG 366

Query: 332 QKVSSELFHISDWLPTLCA 350
            + S +L    D   T  A
Sbjct: 367 TQ-SDQLISQVDLFATFSA 384


>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
           mulatta]
          Length = 503

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 120/378 (31%), Positives = 180/378 (47%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+
Sbjct: 6   SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 65

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 66  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 125

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 126 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 184

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 185 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 233

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+SVG ++  L    + +N+ V F +DNGA         GSN P    K T
Sbjct: 234 RYGDAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQT 293

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+    D ++DG+N    L
Sbjct: 294 TFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAGLTPPRDRAIDGLNLLPAL 353

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 354 LQGRLMDRPVFYYRGDTL 371


>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
          Length = 466

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 181/373 (48%), Gaps = 26/373 (6%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCT 78
             L   AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+
Sbjct: 22  GLLAAGAPQPPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCS 81

Query: 79  PSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGK 131
           PSR+AL+TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GK
Sbjct: 82  PSRAALLTGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGK 141

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNK 187
           WHLG  R  + P   GFD  +G        YD+  K     Y+  +M         ++ K
Sbjct: 142 WHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLK 200

Query: 188 TIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
           T     T LY + A++ I  +H +  P FLY A  A HA         P     +FL  S
Sbjct: 201 TGEANLTQLYLQEALDFIRTQHARQSPFFLYWAIDATHA---------PVYASKQFLGTS 251

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
              R  Y   V  +D+SVG +++ L+  G+ +N+ V F +DNGA         GSN P  
Sbjct: 252 --LRGRYGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFL 309

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVN 365
             K T ++GGMR  A  W P      +VS +L  I D   T  + AG++  +D  +DG++
Sbjct: 310 CGKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRVIDGLD 369

Query: 366 QWDVLTKGAKTKR 378
               + +G    R
Sbjct: 370 LLPTMLQGHIIDR 382


>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 485

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 129/424 (30%), Positives = 195/424 (45%), Gaps = 61/424 (14%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           R +   L  TL      +  +A  +P+I++I++DD G+ND+   G+  I TP +D LA  
Sbjct: 5   RNWMIHLFTTLAMLLVAIKASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLANE 63

Query: 65  GLILNQHYVQ-ALCTPSRSALMTGKYPIHIGM------------------QHGVILEGEP 105
           G  L   YV    CTPSR++L+TG+YP   G+                  Q+ V  E   
Sbjct: 64  GTRLTNFYVAWPACTPSRASLLTGRYPQRNGIYDMIRNEAPDYGHRYTPDQYAVTFE-RI 122

Query: 106 WGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHS 165
            G+   E ++P  LK AGY +   GKW LG  R +  P  RGFD  YG+     DY+ H 
Sbjct: 123 GGMDEREVIIPAVLKRAGYKSGIYGKWDLGALRRML-PNSRGFDDFYGFVNTGIDYFTH- 180

Query: 166 CKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
                E Y    M  N++   +  G Y T L+   A+  + EH   +P FLY+   A H 
Sbjct: 181 -----ERYGVPCMVRNLEPTEEDKGTYCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHN 235

Query: 226 GNTYEP-----FQAPDEEVAKF------------------LDISDPE--RRTYAGMVSRL 260
            ++ +P      QAP+   A +                    ++ P+  RR Y   V+ +
Sbjct: 236 SSSLDPKIRSSVQAPERFKAMYPPVESETRVTDRYRYGTPATVATPQARRRDYRAAVTCM 295

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D ++G ++  L    MLE++IV+F +DNG       S    N PLRG K+  W+GG+R  
Sbjct: 296 DAAIGEMLDRLEAKQMLEDTIVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVP 349

Query: 321 AAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
             + W+        V+ E     + LP+  +AAG+E   D  LDG + W  L   + + R
Sbjct: 350 CIVRWTAGKIPAGAVNDEFLTSLELLPSFASAAGLEPPADIVLDGFDWWPTLRGESASPR 409

Query: 379 SEIL 382
            E+ 
Sbjct: 410 EEMF 413


>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
          Length = 522

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 182/378 (48%), Gaps = 25/378 (6%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           + AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+
Sbjct: 25  SGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRA 84

Query: 83  ALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           AL+TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG
Sbjct: 85  ALLTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG 144

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGI 191
             R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT   
Sbjct: 145 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEA 203

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
             T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R 
Sbjct: 204 NLTQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRG 252

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y   V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T
Sbjct: 253 RYGDAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQT 312

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
            ++GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L
Sbjct: 313 TFEGGMREPALAWWPGHVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTL 372

Query: 371 TKGAKTKRSEILHNIDNV 388
            +G    R    +  D +
Sbjct: 373 LQGRLMDRPIFYYRGDTL 390


>gi|182677550|ref|YP_001831696.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
 gi|182633433|gb|ACB94207.1| sulfatase [Beijerinckia indica subsp. indica ATCC 9039]
          Length = 486

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 137/422 (32%), Positives = 204/422 (48%), Gaps = 43/422 (10%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
           T  + P+II ILADDLG+ DVS +G   + TPNID +   G  L Q Y   A+C+ +R+A
Sbjct: 32  TGDRAPNIIFILADDLGYADVSIYGRPDLSTPNIDGIGLKGARLLQAYANSAVCSATRTA 91

Query: 84  LMTGKYPI--HIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TG+Y     +G++   I      GLP     LP  LK+AGY T  IGKWHLG   + +
Sbjct: 92  LLTGRYQYRERVGLEE-PIAGNIHVGLPPQRPTLPSLLKKAGYTTTLIGKWHLGTLPD-F 149

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            P   G+D  YG+  G  DYY H  K T +     D    +         Y T+L  + A
Sbjct: 150 GPLQSGYDHFYGFRGGAVDYYSH--KGTDDQDDLWDQDTKVHQTG-----YLTELLGDRA 202

Query: 202 INVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV-------AKFLDISDPERRTY 253
           I  I A     +P F+ L     H    + P++AP +E         +  D     + TY
Sbjct: 203 IETINASAKTGQPFFISL-----HFNAPHWPWEAPGDEAESARVAGTRLFDFDGGSQATY 257

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
            GM++ +D  +G ++ AL+ +G+ EN+IV+F +DNG   F          P  G K+   
Sbjct: 258 RGMIAAMDLQIGRIVQALQANGISENTIVIFTSDNGGERF------ADTWPFTGRKTELL 311

Query: 314 DGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT 371
           +GG+R  A + W   +K  Q +      + DWLPTL AAAG E + +   DG+N    L+
Sbjct: 312 EGGLRIPALVSWPARIKADQTIDQVSISM-DWLPTLLAAAGSEPDPNFPSDGINLLPFLS 370

Query: 372 KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSP 431
           +       ++        N Q+   A+R  D KY+   DN    D  +   D+ +++ + 
Sbjct: 371 ESKAAIPRKLFWRYKA--NAQR---AVRDGDYKYLKIRDN----DFLFNVVDDPLERVNL 421

Query: 432 KE 433
           KE
Sbjct: 422 KE 423


>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 494

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 133/381 (34%), Positives = 179/381 (46%), Gaps = 41/381 (10%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           AL  T+L   A L +    KP++I+ILADDLG+ DV F GS++I TP +D LA  G+I N
Sbjct: 3   ALCTTVLGLIAGLCSIEANKPNLILILADDLGYADVGFTGSTEIQTPVLDRLAAGGVIFN 62

Query: 70  QHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGY 124
             YV  A C PSR+ L+TG+Y    G++  V     P+    GLP  EK     LK++GY
Sbjct: 63  NGYVTHAYCGPSRAGLITGRYQARFGVE--VNFPYAPFDPHSGLPTDEKTFATRLKQSGY 120

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFE------------- 171
            T  IGKWHLG     Y P  RGFD  YG+  G  DY   +   T               
Sbjct: 121 RTAMIGKWHLGAAYP-YHPNNRGFDYFYGFLGGAHDYMPENTSTTVPLTLENGKVNHMAN 179

Query: 172 --PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
              Y  L MR+N+   N     Y T   +  A   I       P  +YL++ A H     
Sbjct: 180 AGSYLPL-MRNNV---NAEFDEYLTTALSRDAARFI--EKTEGPFCVYLSYNAPHT---- 229

Query: 230 EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
            P QAP   + K+  I   +RRTYA M+  +D+ +G V+ AL K G LEN+++ F +DNG
Sbjct: 230 -PLQAPKALIEKYAHIESQKRRTYAAMIDSMDQGIGRVVDALEKSGKLENTLIFFFSDNG 288

Query: 290 A--PSFGIHSNK-GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS--DW 344
              P  G        N P RG K    +GG        + W K     S+  + +S  D 
Sbjct: 289 GPRPKPGYEGEDFADNTPYRGGKGAMLEGGTH--VPFIAYWPKGIPAGSTYDYPVSALDV 346

Query: 345 LPTLCAAAGIEINDTSLDGVN 365
             T       + +   L+GVN
Sbjct: 347 AATFVELGEGDTSGHPLEGVN 367


>gi|160891517|ref|ZP_02072520.1| hypothetical protein BACUNI_03968 [Bacteroides uniformis ATCC 8492]
 gi|317478374|ref|ZP_07937538.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858924|gb|EDO52355.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905533|gb|EFV27323.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 448

 Score =  176 bits (446), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 128/380 (33%), Positives = 195/380 (51%), Gaps = 34/380 (8%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           A T + +   +  +    ++P+ ++I  DD+G+ D+  +G+  I TPNID +A  G+   
Sbjct: 12  ATTVSQMMFSSCSSEQEKEQPNFVLIFCDDMGYGDLGCYGNPTIHTPNIDRMASEGMKFT 71

Query: 70  QHYVQA-LCTPSRSALMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           Q YV A + TPSRSALMTG+ P+  G+   +  V+      GL   E  + + L++ GYA
Sbjct: 72  QFYVGASVSTPSRSALMTGRLPVRNGLYGDKRDVLFPDSKAGLGQDEVTVAKLLQQNGYA 131

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  +GKWHLG F   Y PT  GFD  +G       Y +    A+        MR +  ++
Sbjct: 132 TACVGKWHLGHFSP-YLPTDHGFDDFFGI-----PYSNDMRPASL-------MRGDSVLE 178

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLD 244
            K      T  YTE A++ I + NK++P FLYLAH   H   +T E F+           
Sbjct: 179 IKPEQGELTRRYTEYAVDYI-KKNKNRPFFLYLAHTFPHTPLHTNERFEGRS-------- 229

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
                R  Y  +V  LD SVG V+ AL+++G+ EN++V+F +DNG P      N GS   
Sbjct: 230 ----NRGLYGDVVEELDWSVGEVLKALKENGLDENTLVIFTSDNG-PWLVQRLNGGSAGA 284

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDG 363
           LR  KST W+GG+R V A++    +        +    D LPT    AGI + +  +LDG
Sbjct: 285 LRQGKSTCWEGGLR-VPAVFRMPGRIAPCTQQGMMATMDILPTFLNMAGIPLPEGVTLDG 343

Query: 364 VNQWDVLTKGAKTKRSEILH 383
           ++Q  +L +  ++ R E+ +
Sbjct: 344 MDQTPMLWEQRESSRDEMYY 363


>gi|149199849|ref|ZP_01876878.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149137020|gb|EDM25444.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 467

 Score =  176 bits (445), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 128/403 (31%), Positives = 195/403 (48%), Gaps = 58/403 (14%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +FF+L+    F D         KP++++I ADD G+ D+S  GS  I TP ID +A  G 
Sbjct: 10  FFFSLSA---FAD---------KPNVVLIFADDQGYGDLSCFGSETIKTPYIDQMAKEGR 57

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
                 V + +CTPSR+AL+TG YP  +G+  GV+      GL   E  +  + K  GYA
Sbjct: 58  KFTSFMVASPVCTPSRAALLTGSYPKRVGLHKGVLFPQSTTGLNPAEYTIGDHFKSLGYA 117

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSCKATFEPYQGLDMRHN--- 181
           T   GKWHLG   E   P   GFD +YG  +    ++ D+  K    P  G+D+  N   
Sbjct: 118 TACYGKWHLGHHPETL-PISNGFDEYYGIPYSNDMNHPDNKNKPKGGP-GGMDLLWNDPE 175

Query: 182 ------------------MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAV 223
                             + VD +T+    T  YT+ AI+ I E NK K  F+YL H   
Sbjct: 176 STLTKWQTPLMEGAKIVEIPVDQRTV----TRRYTDKAISFI-ERNKEKSFFIYLPHSMP 230

Query: 224 HAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
           H      P   PDE       I DP+ +  Y  ++  +D+ VG +   L++ G+ +N+IV
Sbjct: 231 HI-----PLYVPDE-------IRDPDPKNAYINVIEHMDDQVGRITKKLKELGLDKNTIV 278

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
           ++  DNG P      + GS   LR  K + ++GG R    +W+P L      ++EL    
Sbjct: 279 IYTTDNG-PWLPFKHHGGSAGELRDGKGSTFEGGQRVPCVMWAPGLIPAGTETNELMTTI 337

Query: 343 DWLPTLCAAAGIEI-NDTSLDGVNQWDVLT-KGAKTKRSEILH 383
           D LP+L A +   +     +DG++Q  ++T K  ++ R+E ++
Sbjct: 338 DLLPSLAALSSSTLPGKNKIDGIDQSALITGKAERSARNEFIY 380


>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
 gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
          Length = 483

 Score =  176 bits (445), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 124/371 (33%), Positives = 187/371 (50%), Gaps = 27/371 (7%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
           T  KKP+II+ L DD G+ D+S  GS+ + TPN+D LA  G      Y   A+C+PSR+ 
Sbjct: 12  TIMKKPNIIVFLTDDQGYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRAC 71

Query: 84  LMTGKYPIHIGMQHGVILEG--EPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TG+YP   G++   IL G     GL          LK+ GY T  +GKWHLG   E  
Sbjct: 72  LLTGRYPYMTGVRS--ILGGIKTTTGLNPGIPTFASALKDLGYTTGMVGKWHLGAVPEC- 128

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN----KTIGIYSTDLY 197
            PT  GFD   G+  G+ DY+ H        + G++  H++  ++    K  G YST+L+
Sbjct: 129 RPTHMGFDYFCGFLSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELF 188

Query: 198 TEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
               +  I E   K  P  LY A  A H      P  AP + + +F  + + +R+  A M
Sbjct: 189 ARKGLEFIREQVEKDMPFALYCAFNAPHY-----PMHAPYKYLERFKHLPE-DRQIMAAM 242

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI--------HSNKGSNHPLRGM 308
           +S +D+ VG ++  L++ G+  ++I+ F +DNG              +   GS   L+G 
Sbjct: 243 LSAVDDGVGEIMNYLKRRGIFNDTIIYFQSDNGPSKESRNWLDERKDYYYGGSTGGLKGH 302

Query: 309 KSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW 367
           K + +DGG+R V AI+S P +    +V SE    +D  PT   AAG   +D  + G +  
Sbjct: 303 KFSLFDGGIR-VPAIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASDYEISGCDIL 361

Query: 368 DVLTKGAKTKR 378
            V+T GA+  +
Sbjct: 362 PVMTIGARRDK 372


>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
 gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
          Length = 522

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 123/379 (32%), Positives = 184/379 (48%), Gaps = 27/379 (7%)

Query: 15  LLFNDAFLNTT-APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
           L+ + A L  T AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y 
Sbjct: 14  LVLSAAGLGVTGAPQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYA 73

Query: 74  -QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
              LC+PSR+AL+TG+ PI  G        ++    +    G+P  E LLP+ LK AGYA
Sbjct: 74  ANPLCSPSRAALLTGRLPIRTGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYA 133

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHN 181
           +  +GKWHLG  R  + P   GFD  +G        YD+  +     Y+  +M       
Sbjct: 134 SKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEE 192

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
             ++ KT     T +Y + A++ I     +  P FLY A  A HA         P     
Sbjct: 193 FPINLKTGESNLTQIYLQEALDFIKRQQATHHPFFLYWAIDATHA---------PVYASR 243

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
            FL  S  +R  Y   V  +D+SVG ++  LR   +  N+ V F +DNGA         G
Sbjct: 244 AFLGTS--QRGRYGDAVREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALVSAPKQGG 301

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDT 359
           SN P    K T ++GGMR  A  W P      +VS +L  + D   T  + AG+E  +D 
Sbjct: 302 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPSDR 361

Query: 360 SLDGVNQWDVLTKGAKTKR 378
           ++DG++    + +G  T+R
Sbjct: 362 AIDGLDLLPAMLQGRLTER 380


>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
          Length = 526

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 121/367 (32%), Positives = 177/367 (48%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G +    Y    LC+PSR+AL
Sbjct: 30  APQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAAL 89

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP+ LKEAGY +  +GKWHLG  
Sbjct: 90  LTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHLGH- 148

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 149 RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL 208

Query: 194 TDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T +Y + A++ I      + P FLY A  A HA         P      FL  S  +R  
Sbjct: 209 TQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA---------PVYASKAFLGTS--QRGR 257

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   V  +D+SVG ++  LR   + EN+ V F +DNGA         GSN P    K T 
Sbjct: 258 YGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 317

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG+E  +D ++DG++    L 
Sbjct: 318 FEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAGLEPPSDRTIDGLDLLPTLL 377

Query: 372 KGAKTKR 378
           +G    R
Sbjct: 378 QGRLVDR 384


>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 461

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 127/359 (35%), Positives = 178/359 (49%), Gaps = 39/359 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           ++P+I++ILADDLG+ D+  +GS  I TPNID LA  G      Y    +C+PSR+ALMT
Sbjct: 26  RQPNIVVILADDLGYGDLGCYGSP-IATPNIDRLAEEGARFTSFYSASPVCSPSRAALMT 84

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP  + +   V+L     GLP +E  + Q LK AGY T  IGKWH+G     Y PT R
Sbjct: 85  GRYPTRVEVP--VVLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIG-STPGYLPTNR 141

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           GFD  +G          +S   T  P           VD  T+    T  +T+ A++ + 
Sbjct: 142 GFDEFFG--------VPYSADITPCPLMRGSSVVAPAVDCSTL----TSSFTQEALDFM- 188

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
              +  P FLYLAH A H      P  A    +  + D+           V  LD S G 
Sbjct: 189 RRAQDNPFFLYLAHTAPHLPLAASPRFAGQSGLGMYADV-----------VQELDWSTGQ 237

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           V+AAL+  G+  N++V+F +DNG P +     +GS   LRG K   ++GGMR       P
Sbjct: 238 VMAALKATGLDSNTLVMFSSDNG-PWY-----QGSQGKLRGRKGETYEGGMREPFLARYP 291

Query: 327 WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNI 385
            +  +    + L    D LPTL   AG +     LDGV+ W VLT     +R+E+  ++
Sbjct: 292 GVIPSGIGCAGLATTMDLLPTLARLAGAQTPSNPLDGVDIWPVLTG----ERAEVDRDV 346


>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
 gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
          Length = 555

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 131/389 (33%), Positives = 192/389 (49%), Gaps = 62/389 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHG----SSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           + P++I+ILADDLG ND+S +G    + Q+ TPNID LA +G   +Q Y  A  C PSR+
Sbjct: 63  RPPNVIVILADDLGINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAGTCAPSRA 122

Query: 83  ALMTGKYPIHIGMQHGVILEG--------------------------------EPWGLPL 110
            L+TG+YP   G +     +G                                +  GLP 
Sbjct: 123 MLLTGRYPTRTGFEFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSFDEQGLPS 182

Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH----SC 166
           +E  L + L++ GY T  IGKWHLG  +  ++P  +GF+       GL    DH    + 
Sbjct: 183 SEITLAEILRDRGYYTAHIGKWHLGRGKG-FSPEEQGFEQSLLMHSGLYLPVDHPDVVNA 241

Query: 167 KATFEPYQGL---DMRHNMQVDNKTIGI----YSTDLYTEAAINVIAEHNKSKPMFLYLA 219
           K  F+P        MR+    ++         Y TD +T+ +I +I + NK++P FLYLA
Sbjct: 242 KVDFDPIDQFLWAKMRYAAAFNDGGPVFEPKSYLTDYWTDESIKII-KANKNRPFFLYLA 300

Query: 220 HLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLEN 279
           H   H      P QA  E+     DI     R YA MV  +D SVG ++A L++ G+ +N
Sbjct: 301 HWGAHT-----PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILATLKEEGLSDN 355

Query: 280 SIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSE 337
           +IV+F +DNG   + GI      N P RG K T ++GG+R  +   W   + +   + + 
Sbjct: 356 TIVVFSSDNGGAGYIGIPD---INAPYRGWKLTMFEGGIRVPLFMRWPAHIARNTSIDTP 412

Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
             HI D+LPTL AAA  ++  D  +DG N
Sbjct: 413 AAHI-DFLPTLVAAAQAKLPEDRIIDGAN 440


>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 486

 Score =  175 bits (443), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 134/413 (32%), Positives = 201/413 (48%), Gaps = 40/413 (9%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           P KP+I+ ILADD+GW+D+  +G+    TPNID  A   +     Y  ++C+PSRS LMT
Sbjct: 23  PDKPNILFILADDMGWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMT 82

Query: 87  GKYPIHI-------GMQHG-----VILEGEP-WGLPLTEKLLPQYLKEAGYATHAIGKWH 133
           GK+   +       G Q G      + E E  W LP +EK +  YLK AGY T  IGKWH
Sbjct: 83  GKHAARLHFTIWAEGAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWH 142

Query: 134 LGFFREVYTPTFRGFDSHYG--YWQGLQDY-YDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           LG +   + P   GFD + G   W   Q + + +S   T  P    + R+   ++    G
Sbjct: 143 LGDWE--HYPEAHGFDINIGGTNWGAPQTFWWPYSGSGTHGP----EFRYIPHLEYGHPG 196

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV----AKFLDIS 246
            Y TD  T+ AI VI +H   +P F+YLAH AVH      P +A  +++    AK+ D  
Sbjct: 197 EYLTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHT-----PIEAKADDIQHFDAKYRDGM 250

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------ 300
           +     YA M   LDE+VG V+  L++ G+ +N++V+F +DNG    G+    G      
Sbjct: 251 NHRHTIYAAMNKELDENVGRVLEHLKERGLDKNTVVIFASDNGG-YIGVDKVSGKNMPVT 309

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
           +N PLR  K   ++GG+R    I  P +        E   ++D L T     G      +
Sbjct: 310 NNAPLRSGKGALYEGGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA 369

Query: 361 LDGVNQWDVLTK-GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
            DG++   +L    AK  R  +  +  +  +     +A+R  D K +   ++N
Sbjct: 370 TDGMDISPLLKDPSAKLNRDALFFHYPHYYHTTTPVSAIRARDWKLLEFYEDN 422


>gi|323455471|gb|EGB11339.1| hypothetical protein AURANDRAFT_2359, partial [Aureococcus
           anophagefferens]
          Length = 367

 Score =  174 bits (442), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 119/371 (32%), Positives = 189/371 (50%), Gaps = 24/371 (6%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIP-----TPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           PHI+I+LAD++GW  V +H    +P     TP +DAL  +GL L++HY    C+PSRSAL
Sbjct: 1   PHIVIMLADNVGWAVVEWHRPPALPAREISTPKLDALRASGLELDRHYTYKFCSPSRSAL 60

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF-REVYTP 143
           M+G+ P H+ + +         G P   +L+P  L++AGY TH +GKWHLG   R + TP
Sbjct: 61  MSGRLPAHVNVYNDDPAMAGA-GAPAHMRLMPAMLRDAGYRTHFVGKWHLGMASRSLQTP 119

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             RGFD+  GY+    +YYD +           D+       +     Y   ++ +  ++
Sbjct: 120 AARGFDTSLGYFHSTNNYYDGTRAEGCGDAAHADLWDGDGPSHLNGTAYEELVFRDRVVD 179

Query: 204 VIAEHNKSKPMFLYLA-HLAV-------HAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           V+  H+ ++P+F+Y A H +         AG   +  Q       KF  I D +RR    
Sbjct: 180 VVEAHDAARPLFIYYALHTSCVGYDPDGTAGGERDSLQPDRAYYDKFSFIDDDDRRANVA 239

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH-PLRGMKSTPWD 314
           MV+ +D+  G+V+ ALR  G+   +++L+ +DNG     +H   GSN  PLRG     ++
Sbjct: 240 MVALMDDVAGDVVGALRTAGLWAKTLLLWSSDNGG---AVHLGGGSNTWPLRGGYYNNFE 296

Query: 315 GGMRGVAAIWSPWLKQTQ--KVSSELFHISDWLPTLCAAAGIEINDT---SLDGVNQWDV 369
           GG+R  A +    L +    K+ +E  +I+DW  T    AG   +D    ++D ++ WD 
Sbjct: 297 GGVRAPAFLAGGALPEVARGKLLTEWIYIADWYATFARLAGASPDDGAVPAVDSLDVWDA 356

Query: 370 LTKGAKTKRSE 380
           +     + R E
Sbjct: 357 IAGDGASPRVE 367


>gi|410611985|ref|ZP_11323071.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
 gi|410168398|dbj|GAC36960.1| N-acetylgalactosamine-6-sulfatase [Glaciecola psychrophila 170]
          Length = 508

 Score =  174 bits (442), Expect = 9e-41,   Method: Compositional matrix adjust.
 Identities = 129/365 (35%), Positives = 185/365 (50%), Gaps = 42/365 (11%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           T+ P KP+I+ I ADDLG+ D+  +GS+ I TPNID +A  G   ++ Y  + +CTPSR+
Sbjct: 44  TSPPDKPNIVFIFADDLGYGDIGAYGSTTINTPNIDKMAAQGAKFDEFYAASPVCTPSRA 103

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
            L+TG+YPI  G+ H V       G+   E  + + LK AGYAT  +GKWHLG   E Y 
Sbjct: 104 GLLTGRYPIRQGI-HNVFFPESFQGMDPEEITIAEVLKGAGYATGLVGKWHLG-HHEQYM 161

Query: 143 PTFRGFDSHYGY-----WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
           P  +GFD  +G        GL  YY ++    FE           +VD +    Y T  Y
Sbjct: 162 PWNQGFDEFFGLPYSNDMGGL--YYFNNKDIDFE-----------EVDQR----YMTKTY 204

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           T+ A+  I +H + +P FLYLAH   H       + +P+ E              Y  +V
Sbjct: 205 TDQALQFIDKH-QEQPFFLYLAHNMPHV----PIYASPEFE-------GQSAGGAYGDVV 252

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD SVG V+  L +  + EN++V+F +DNG P   +    GS  PLR  K   ++GGM
Sbjct: 253 EELDWSVGQVLKRLDELNLSENTLVIFTSDNG-PWLLMGDEGGSAGPLREGKQFTFEGGM 311

Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLT-KGA 374
           R    A W   + Q  K    L  + DWLPT    A + +  D  +DG +   +LT KG 
Sbjct: 312 RVPTVAYWPGTISQGTKPQG-LATMMDWLPTFANLANVPVPADRVIDGRDITGLLTGKGT 370

Query: 375 KTKRS 379
           + +++
Sbjct: 371 RAEQT 375


>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 462

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 181/361 (50%), Gaps = 29/361 (8%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRS 82
           ++A K P+I+ I ADDLG+ND+S +G++ I TP ID+L   G+     Y    +C+PSR+
Sbjct: 29  SSAEKPPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASPVCSPSRA 88

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           AL+TG+YPI  G+  GV       G+   E  + + L+E GY T  +GKWHLG  ++ + 
Sbjct: 89  ALLTGRYPIRQGIT-GVFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLGHHQK-HL 146

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           P   GF S++G          +S       Y   +   + +VD      Y+T  YTE A+
Sbjct: 147 PLQNGFHSYFG--------IPYSNDMDMVVYMRGNDVESYEVDQH----YTTRRYTEEAV 194

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
             I E NK +P FLYLAH   H          P      F+  S  +R  Y  ++  LD 
Sbjct: 195 QFI-EQNKDQPFFLYLAHSMPH---------VPIYASENFVGTS--KRGLYGDVIQELDW 242

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           SV  ++  L KH + EN++V+F +DNG P   +    GS  PLR  K   +DGGMR    
Sbjct: 243 SVAQILDTLDKHQLSENTLVVFTSDNG-PWTALKHLGGSAAPLREGKMFTFDGGMRVPCL 301

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKTKRSEI 381
           +  P      + S  + ++ DW PT    A ++   + S+DG++  DVLT       +E 
Sbjct: 302 VRWPAQIPAGQTSHAMANMMDWFPTFSRIANLDTPKSRSIDGLDITDVLTGSGPRADNEF 361

Query: 382 L 382
            
Sbjct: 362 F 362


>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
 gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
          Length = 495

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 130/413 (31%), Positives = 188/413 (45%), Gaps = 34/413 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           ++P+I+ IL DDLGW D+S +GS  I TPN+D LA  G+     Y   A C+P+R +L T
Sbjct: 78  RRPNILFILGDDLGWADLSSYGSPHIKTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYT 137

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP          +  +  GL  T   L   LKEAGYAT  IGKWH G+  + Y+PT  
Sbjct: 138 GRYPARTRGGLAEPIADKSAGLDPTHPTLASLLKEAGYATALIGKWHAGYLPD-YSPTKS 196

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           G+D+ +G + G  +YY        ++ Y+G     +++        Y T + TE A   +
Sbjct: 197 GWDTFFGNFGGALEYYSKLGLGGEYDLYEGDAEYKDLR--------YYTRIITERASEYV 248

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER---------RTYAGM 256
           +  +  +P  L L     H     +   A   E+ + +   D            R Y  M
Sbjct: 249 SRDHGGRPWLLNLNFTTPHWPWIADGDTAESAEIVRRIKAGDARALWHQDGGSVRKYKQM 308

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD SVG V+ ALR+ G  EN++V+F +DNG   F        N PL G K++  +GG
Sbjct: 309 VEDLDRSVGEVLQALRRSGQEENTLVVFSSDNGGERFSY------NWPLSGNKASLQEGG 362

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAK 375
           +R    +  P      +VS       DW  TL   AG   +    LDGV+    L +G K
Sbjct: 363 IRVPNIVRWPARLDGGQVSRVPVFSPDWTATLLEVAGTRPHRAYPLDGVSLAGYLLRGEK 422

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDK 428
               ++   +       +   ALR  D KY  G     Q  +  GD   + DK
Sbjct: 423 PAERDLFWRV-------RGERALRRGDWKYYRGKSGRDQLFDLAGDLREQADK 468


>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
          Length = 526

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 127/396 (32%), Positives = 192/396 (48%), Gaps = 47/396 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           A + P+I+II  DD G++DV  +G+  IPTPN+DA+A +GL+L   Y  Q +C+ SR+ L
Sbjct: 70  ADRPPNIVIIFTDDQGYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGL 129

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG YP  +G+ H  ++   P GL   E+ L + L++ GY T   GKWHLG   + + PT
Sbjct: 130 LTGCYPNRVGI-HNALMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLGDHPD-FLPT 187

Query: 145 FRGFDSHYG------YW--QGLQD-YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
             GFD  +G       W    LQ   +D      +E  + +D   + ++  + I      
Sbjct: 188 RHGFDEFFGIPYSNDMWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQRLLTRQI------ 241

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
             TE +++ I  H K +P FLY+ H   H      P    D    K        R  Y  
Sbjct: 242 --TERSVDFINRH-KEEPFFLYVPHPQPHV-----PLFVSDAFRGK------SGRGLYGD 287

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           ++  +D SVG V+ AL  +G+ +++ V+F +DNG P     ++ G   PLR  K T W+G
Sbjct: 288 VIMEIDWSVGQVLGALEDNGLTDDTWVIFTSDNG-PWLAYGNHSGRAEPLREGKGTNWEG 346

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK 375
           G+R    +  P      KV  E     D LPT+ +  G       +DG N W +L+ GA+
Sbjct: 347 GVREPCIMKFPGRLPRGKVLDEPLMAIDLLPTIASVTGSPQPGREIDGKNAWGLLS-GAE 405

Query: 376 TKRSEILHNIDNVDNPQ-KYYAALRVDDLKYVAGTD 410
            +             PQ  YY   RV++L+ V   D
Sbjct: 406 AR------------GPQDAYYFYYRVNELQAVRDGD 429


>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 485

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 128/404 (31%), Positives = 190/404 (47%), Gaps = 59/404 (14%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
            +A  +P+I++I++DD G+ND+   G+  I TP +D LA  G  L   YV    CTPSR+
Sbjct: 24  ASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
           +L+TG+YP   G+   +  E   +G   T                 E +LP  L+ AGY 
Sbjct: 83  SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYK 142

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +   GKW LG  + +  PT RGFD  YG+     DY+ H      E Y    M  N++  
Sbjct: 143 SGIFGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
               G Y T L+   A+  + EH  ++P FLY+   A H  ++ +P      QAPD+  A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKA 255

Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            +                    ++ P+  RR Y   V+ +D ++G ++  L    ML+N+
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNT 315

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           IV+F +DNG       S    N PLRG K+  W+GG+R    +  P  K    V +  F 
Sbjct: 316 IVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFL 369

Query: 341 IS-DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
            S + LP+   AAG+E      LDG N W  L   A++ R E+ 
Sbjct: 370 TSLELLPSFAIAAGVEPPPGVVLDGFNWWPTLRGEAESPRKEMF 413


>gi|72110136|ref|XP_780327.1| PREDICTED: arylsulfatase A-like [Strongylocentrotus purpuratus]
          Length = 527

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 127/412 (30%), Positives = 196/412 (47%), Gaps = 38/412 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGK 88
           P+I+I+ ADD+G+ D+  +G     TPN+  LA  GL+  Q Y  A +C+PSR+AL+TG+
Sbjct: 27  PNIVILFADDVGYGDLEVYGHPTSSTPNLTKLAEGGLVFTQFYAGAPVCSPSRAALLTGR 86

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
           Y +  G+  GV++     GLP  E  + + +K  GY+T  +GKWHLG      Y PT +G
Sbjct: 87  YAVRSGIYPGVLVPTSTGGLPHNETTIAEIVKTRGYSTSMVGKWHLGVGENGTYLPTNQG 146

Query: 148 FDSHYGYWQGLQDYYDHS---CKATFEP-----------YQGLDMRHNMQVDNKTIGIYS 193
           FD + G        Y H    C   F P           Y    + +N  +  + + + +
Sbjct: 147 FDEYLGVP------YSHDMCPCYNCFYPNDPCWNTCDLNYTYCPLFNNTDILEQPVNLTT 200

Query: 194 TD-LYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
            D  Y + A +VI  + N   P FLY A    H          P     +F + +   R 
Sbjct: 201 LDATYVQRAKSVIQRNANAGTPFFLYYAFQHAH---------HPQFASKEFRNAT--ARG 249

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
           T+   ++ LD  VG V+  L   G+L+N++VLF +DNG PS    +  G+   L+  K T
Sbjct: 250 TFGDSLAELDWCVGQVMEQLESSGVLDNTLVLFTSDNG-PSLRNENRGGNAGLLKCGKGT 308

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
            ++GG R  A  + P  +     ++EL    D+LPT+    G  +   +LDGV+   +L 
Sbjct: 309 TYEGGQRVPAIAYWPQ-RIAPGRTTELAGNMDFLPTIANLVGATLPSVALDGVDMAPILF 367

Query: 372 KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTD 423
           +G K+ R    +      NP+    A+R +  K    T     SDE+  D D
Sbjct: 368 EGKKSARDTFFYYFTE-PNPKYGIYAVRYNQYKAHYYTQGQQNSDEFNHDKD 418


>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
 gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
          Length = 489

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 132/410 (32%), Positives = 201/410 (49%), Gaps = 43/410 (10%)

Query: 6   KYFFALTCTLLF-------NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNI 58
           ++ F    + LF       ND  +   +P  P++++I  DD G+ DV   GS  I TP++
Sbjct: 13  RFQFLFVISFLFILLSCKDNDQKVTKKSPP-PNVVLIFTDDQGYQDVGVFGSPNIKTPHL 71

Query: 59  DALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ 117
           D +A  G+ L  +Y  QA+C+ SR+ L+TG YP  IG+ H  +  G   G+  +E  + +
Sbjct: 72  DRMASEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGI-HNALGPGNTHGINSSETTMAE 130

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
            LK  GY T   GKWHLG   E + P   GFD  +G       +  H  +     +  L 
Sbjct: 131 MLKNNGYKTAIFGKWHLGHHPE-FMPNKHGFDEFFGIPYSNDMWPYHPQQGPIFNFPDLP 189

Query: 178 MRHNMQV-DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD 236
           +  N  V D  T     T   TE +++ I   NK  P FLY+ H   H          P 
Sbjct: 190 LYENETVIDTLTEQSNLTTQITERSVDFI-NRNKDNPFFLYVPHPQPHV---------PL 239

Query: 237 EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGI 295
               KF   SD  R  Y  ++  +D SVG +I AL+K+G+ EN+IV+F +DNG   S+G 
Sbjct: 240 FVSEKFKGKSD--RGLYGDVIMEIDWSVGQIIEALKKNGLEENTIVIFTSDNGPWLSYGN 297

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP-WLKQTQKVSSELFHISDWLPTLCAAAGI 354
           H+  GS  P R  K T W+GG R    +  P  LK  + +   +  I D LPTL +    
Sbjct: 298 HA--GSALPFREGKGTAWEGGQREPFIMKYPNKLKAGEVIDVPMMAI-DLLPTLASLTNS 354

Query: 355 EINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQK-YYAALRVDDL 403
           ++ + S+DG N W++++              +++++PQK Y+   RV++L
Sbjct: 355 KLPELSIDGKNVWNIISG-------------ESIESPQKAYFFYYRVNEL 391


>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 980

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 127/369 (34%), Positives = 187/369 (50%), Gaps = 38/369 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSS-----QIPTPNIDALAYNGLILNQHYVQA-LCTP 79
           A  +P+I+I  ADD G  D  F G          TP++DALA  G+     YV A +CTP
Sbjct: 21  AADQPNILIFFADDAGNADFGFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTP 80

Query: 80  SRSALMTGKYPIHIGMQ--HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           SR+ ++TG+Y    G++  +G I E    GLP +E  +   L++AGY T+AIGKWHLG  
Sbjct: 81  SRAGMLTGRYQHRFGVETVYGRIPEA---GLPASEITMADTLRKAGYRTYAIGKWHLGEH 137

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI-----GIY 192
              + P  RGFD  YG   G +         TF PY+G +    +Q +   +       Y
Sbjct: 138 LHEHLPNQRGFDEFYGALTGAR---------TFFPYRGNNPGSKLQRNGVFLPEPLDQPY 188

Query: 193 STDLYTEAAINVIAEH---NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
            TDL     +  I +H   + + P FLYLA  AVH     +P +  D  +    DIS P+
Sbjct: 189 FTDLLARQTVAYIDDHVANHANAPFFLYLAFTAVHTPLEADPKRLDDRRIQ---DISPPQ 245

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH---SNKGSNHPLR 306
           R+T A M   +D++VG V+A LR++ + EN+IV+F++DNG P         N   N  L+
Sbjct: 246 RKTLAAMTIAMDDAVGTVMAKLRENNLTENTIVVFLSDNGGPEDNRSLRAPNWSDNGALQ 305

Query: 307 GMKSTPWDGGMRGVAAI-WSPWLKQ--TQKVSSELFHISDWLPTLCAAAGIEI-NDTSLD 362
           G KS  ++GG+R    I W   + +  + K   ++    D  PT   AA   +  D  LD
Sbjct: 306 GNKSQNFEGGIRVPFVIHWPRGIARGLSGKSLPDVVTALDLFPTFAEAAKATLPKDRELD 365

Query: 363 GVNQWDVLT 371
           G++   +L+
Sbjct: 366 GMSLASLLS 374


>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 294

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 115/318 (36%), Positives = 155/318 (48%), Gaps = 34/318 (10%)

Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
           P+FLYLAH A H+GN +   + P+    ++ DI    R  YAGMVS LDESVG V  AL 
Sbjct: 1   PLFLYLAHQAPHSGN-FGRLEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVFEALD 59

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           K GML NSIV+F +DNG        N GS  PLRG K +  +GG+R  A +WSP L  T+
Sbjct: 60  KRGMLSNSIVVFSSDNGGAPDEETYNSGSPWPLRGQKMSLSEGGVRAPALLWSPLLSATE 119

Query: 333 -KVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKTKRSEILHNIDNVDN 390
             +   LFHI+DWLPT    AG   +   ++DGV+Q   LT+     R EIL NID    
Sbjct: 120 GSIFDGLFHITDWLPTFYELAGGSPSSLGAIDGVSQLGALTRQTTAPRKEILINID---- 175

Query: 391 PQKYYAALRVDDLKYVAGTDNNGQSDEWY---GDTDNEIDKYSPKEVLYSKAGITFNALK 447
           P++  +A+   + K V G    G  + WY   G  D E DK  P++              
Sbjct: 176 PEENCSAIIEGNFKLVLGVAEQGLYENWYPLSGHVDWESDK--PRQECEK---------S 224

Query: 448 TKLQIKQKHAADPKANSSDALRTILTDEKILELREFARVRCNYDNKGAHCNSTVKPCLFN 507
              ++ + H   P   S                     V+C   +    C  T   CLFN
Sbjct: 225 VVARVLEHHGVRPVCGSRTGAYA-------------TPVQCGARDPSRECVPTKSLCLFN 271

Query: 508 ITDDPCEQNNLAESQTDL 525
           ++ DPCE NN+A+   ++
Sbjct: 272 LSKDPCEYNNVADQHPEV 289


>gi|296124231|ref|YP_003632009.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016571|gb|ADG69810.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 467

 Score =  174 bits (441), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 139/426 (32%), Positives = 210/426 (49%), Gaps = 49/426 (11%)

Query: 9   FALTCTL-LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  T  L L   A   TT+ ++P+I++IL+DD G  + S  G  +  TP+ID++  NG+ 
Sbjct: 17  FQFTSQLALAQRATTETTSERRPNILLILSDDCGHAEFSIQGHPRYKTPHIDSIGKNGVH 76

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILE-GEPWGLPLTEKLLPQYLKEAGYA 125
             Q YV   +C+PSR+ L+ G+Y    G +  +     E  GLP +E LLPQ LKE GY 
Sbjct: 77  FRQGYVSGCVCSPSRAGLLAGRYQQRFGHEFNIPPAYSETNGLPRSETLLPQLLKEDGYR 136

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T A+GKWHLG+  + + P  RGF  +YG+ QG + Y+               +R    + 
Sbjct: 137 TIALGKWHLGYAPQFH-PMERGFTDYYGFLQGSRSYFPLKKPTRLNQM----LRDRTAIP 191

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
            +  G Y TD   + AI  I +  +S+P  +YLA  A H         +P++  A  L  
Sbjct: 192 EEQFG-YMTDHLADEAIAYIKQW-QSQPWMMYLAFNATH---------SPNDATAVDLQA 240

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
           +D   + YA M   LD +VG V+ AL++ G+ ++++V+F+ DNG    G H N GS H  
Sbjct: 241 AD-GNKIYA-MTIALDRAVGKVLDALKECGLSKDTLVIFINDNGGA--GGHDN-GSLH-- 293

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG--------IEIN 357
            G K + W+GG R    +  P    + +V  E     D  PT+   AG        I  +
Sbjct: 294 -GKKGSTWEGGTRIPFLVQYPAKIPSGQVIDEPVIALDLFPTILDVAGLGDAELKKIPFD 352

Query: 358 DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA----ALRVDDLKYVAGTDNNG 413
              LDG++    +T   KT+R         VD P  + +    A+R  +LK V+G D+ G
Sbjct: 353 PEKLDGISLIPRMT--GKTQRL--------VDRPLYWKSGKRWAIRQGNLKAVSGNDDQG 402

Query: 414 QSDEWY 419
              E +
Sbjct: 403 DQVELF 408


>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 616

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 126/382 (32%), Positives = 192/382 (50%), Gaps = 62/382 (16%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIP-TPNIDALAYNGLILNQHYVQALCTPSRSALMT 86
           ++P++I IL DD+G ND+ +  +     TP +D+L+  G+ L ++Y   LCTPSR++LMT
Sbjct: 118 QQPNVIFILIDDVGTNDIGYQSTDLWELTPFMDSLSSEGVRLTKYYTNQLCTPSRASLMT 177

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+     GMQ+ V+ +   WGLPL E  L +  K         GKWHLG + + + P  R
Sbjct: 178 GRDTFRTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFAR 229

Query: 147 GFDSHYGYWQGLQDYYDH-SCKA-TFE----------PYQGLDMRHNMQVDNKT------ 188
           GFD+  GY   ++ Y  H  C   TFE           Y   D   N  + N T      
Sbjct: 230 GFDTFLGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYIN-HITNTTRQGPSF 288

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP------DEEVAKF 242
           +G YST + T+ AI V  EH +  P FLY++H AVH+     P   P      +EE A  
Sbjct: 289 VGNYSTTIITDRAIEVAKEHGE-DPFFLYVSHQAVHS-----PVDPPPDDYFTEEEHATL 342

Query: 243 LDI--SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNK 299
             +   D  R+ +A ++  LD+ +  +   L   G L+N++++  +DNGA P+ G     
Sbjct: 343 ARVVSEDAYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTAG----- 397

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE--------LFHISDWLPTLCAA 351
           GSN+PLRG K T ++GG+R  A ++S   K T  +  E        + H +DW PT  + 
Sbjct: 398 GSNYPLRGYKHTIFEGGVRVPAFVYS---KSTDLIPEEARGTRYSGMMHSTDWTPTFGSV 454

Query: 352 A-GIEINDTS--LDGVNQWDVL 370
              + + +    L G + WD +
Sbjct: 455 VPDLPLGEMGKELSGFDHWDAI 476


>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 548

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 137/389 (35%), Positives = 190/389 (48%), Gaps = 57/389 (14%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSR 81
           ++T    P+I++ILADD+G NDVS  G   I TPNID LA  G L  N +   A C PSR
Sbjct: 56  DSTKTDLPNIVLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSR 115

Query: 82  SALMTGK---------YPIHIGMQHGVIL------EGEP-----------------WGLP 109
           +AL+TG+          PI  GM   +         G P                  GLP
Sbjct: 116 AALLTGRDATRTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLP 175

Query: 110 LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH--SCK 167
            +E L+P+ LKE+GY T  IGKWHLG   E+  P  +GFD       GL    DH  S  
Sbjct: 176 GSEILIPEILKESGYHTMHIGKWHLGRSPEMM-PNAQGFDESLMMDSGLYLPVDHPESVN 234

Query: 168 ATFEPYQGLD------MRHNMQVDNKTI---GIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
           A  E   GLD      MR+++  +   I     Y TD +TE A   I E N ++P FLYL
Sbjct: 235 APVES-SGLDRFIWATMRYSVNWNGGEIFKPNGYLTDYFTEEAEKAI-EANANRPFFLYL 292

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           AH   H      P QA   +     DI    +R YA M+  +D SV  V+A L K G+ +
Sbjct: 293 AHWGPH-----NPVQAKRADYEAVGDIQPHNKRVYAAMLRSIDRSVERVMAKLEKQGIAD 347

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSE 337
           N+IV+  +DNG   +   ++   N P RG K+T ++GG+R   +  W   + ++  +   
Sbjct: 348 NTIVILSSDNGGADYVAINDL--NKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEP 405

Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
           + HI D +PT+   A  ++  D  +DGV+
Sbjct: 406 VNHI-DLMPTIINMANADLPQDREIDGVD 433


>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
           [Oryzias latipes]
          Length = 525

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 126/372 (33%), Positives = 177/372 (47%), Gaps = 26/372 (6%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           NT     P+III+L DD+GW D+   G     TPN+DA+A  G++    Y    LC+PSR
Sbjct: 24  NTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSR 83

Query: 82  SALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
           +AL+TG+ PI  G        ++    +    G+   E LLPQ LKE GY    +GKWHL
Sbjct: 84  AALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHL 143

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIG 190
           G  R  Y P   GFD  +G        Y+++ +     Y   +M        ++D KT  
Sbjct: 144 GH-RPQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGE 202

Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
              T +Y EA ++ I+   ++K P FLY A  A H+         P      FL  S  +
Sbjct: 203 SNLTQMYLEAGLDFISRQAEAKRPFFLYWAADATHS---------PVYASKPFLGKS--Q 251

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R  Y   V  LD SVG ++  L++ G+  N+ V F +DNGA      +  GSN P    K
Sbjct: 252 RGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALMSGPNEGGSNGPFLCGK 311

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWD 368
            T ++GGMR  A  W P   +   VS +  ++ D   T  A AGI    D  LDG++   
Sbjct: 312 ETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRVLDGLDLTP 371

Query: 369 VLTKGAKTKRSE 380
           VL   + T +S 
Sbjct: 372 VLFNHSHTLQSR 383


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 128/404 (31%), Positives = 198/404 (49%), Gaps = 46/404 (11%)

Query: 4   ARKYFFALTCTLLF----NDAFLNTT------APKKPHIIIILADDLGWNDVSFHGSSQI 53
            R ++F L   + F    N A   T       +P++P+I+ I ADDLG+ D+ F GS   
Sbjct: 3   GRIFYFTLLIAVCFGSCKNTASKETVLHEEKHSPRQPNIVFIFADDLGFADLGFTGSDTH 62

Query: 54  PTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIG-MQHGVILE-GEPW---- 106
            TPN+D LA   +  ++ Y     C PSR ++MTGKYP  +G + HG +     P     
Sbjct: 63  LTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHGKLGGVAHPGPNDN 122

Query: 107 GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC 166
           GLP+TE  + + LK+ GY T  IGKWH+G  +    P  RGFD      +         C
Sbjct: 123 GLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVDIASNE-------FCC 173

Query: 167 KATFE-PYQGLDMRHNM-----QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAH 220
             ++  P++  + +  +      ++++  G + TD   E A+  I   ++ KP FL ++ 
Sbjct: 174 PGSYMYPFESNNEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIHSTDE-KPFFLNMSF 232

Query: 221 LAVHAGNTYEPFQAPDEEVAKFLDISDPERR----TYAGMVSRLDESVGNVIAALRKHGM 276
            AVH      P  A  E+V K+  +  P+ R    TYAG+V  LD++VG ++ AL + G+
Sbjct: 233 YAVHT-----PITAIPEKVEKYKRLIGPDARQKNPTYAGLVEHLDDAVGAILKALEEKGI 287

Query: 277 LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSS 336
           ++N+I++F +DNG     I      N PLR  K + ++GG R    +  P + Q   VS 
Sbjct: 288 IDNTIIVFTSDNGGE---ILHGITDNFPLRDGKGSSYEGGTRVPLLVKWPGVTQVNTVSH 344

Query: 337 ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
           E     D+ PT  + AG   +   +DG   +  L K  K K  E
Sbjct: 345 ERIIGFDYYPTFLSMAGANPDLHDIDG-RDFSPLLKNPKEKLEE 387


>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 481

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/393 (32%), Positives = 194/393 (49%), Gaps = 45/393 (11%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGK 88
           PHI++I+ DDLG+ D+  +G+  I TP ID+LA +G  L+  H    +CTP+R+AL+TG+
Sbjct: 65  PHIVLIMTDDLGYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGR 124

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +P  +G++  +       GLP+ E +L + LKE GY T  +GKWHLG +R  + P   GF
Sbjct: 125 WPQRVGLEWALSASDTEPGLPVEEPILSRPLKEVGYRTVMVGKWHLG-YRPEFGPNAHGF 183

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIA 206
           D  +G   G  D+Y H         + ++ + +   + K + +  YSTDL ++ A+  I 
Sbjct: 184 DEFFGLLSGNVDHYSH---------REINGKEDWYENTKPVRVEGYSTDLLSDRAVAAIQ 234

Query: 207 E-----HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE------RRTYAG 255
           +      ++ +P++LY+A+ AVH      PFQ P     +  DI D        R  Y  
Sbjct: 235 KTAAQPPDQRQPLWLYVAYNAVH-----WPFQPP----GRPQDIRDRSTWFNGTRADYVK 285

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           MV  +D  VG ++ AL   GM ++++V+F  DNG            N P    K T W+G
Sbjct: 286 MVESIDAGVGRILDALEAGGMADHTLVIFTNDNGG------ERLSDNGPFFHHKGTVWEG 339

Query: 316 GMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKG 373
           G R    I W   +    +       + D   T+ AAA I  N +  LDGVN   +L   
Sbjct: 340 GHRVPCLIRWPGRIAAGTEFGQPTIGM-DLTATILAAAKIPPNPERPLDGVNLVPILRGE 398

Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
                  +   ID  +  QK  AALR    KYV
Sbjct: 399 VPCPDRALFWRIDRPNRKQK--AALR-GRWKYV 428


>gi|255690672|ref|ZP_05414347.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
 gi|260623696|gb|EEX46567.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 459

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 120/377 (31%), Positives = 190/377 (50%), Gaps = 39/377 (10%)

Query: 6   KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           ++F A TC L    AF  +   A K+P+ +II+ADD+G+ DV  +G+  I TPNID +A 
Sbjct: 6   RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIKTPNIDQIAR 62

Query: 64  NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
            G++    H   ++ +P+R  L+TG+Y    G++  +++    + +  GLP  E    + 
Sbjct: 63  EGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLPSEEITFAKI 122

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLD 177
           L + GY T  IGKWHLG+ ++ +   F GF    G+  G  DY  H  +    + + GL+
Sbjct: 123 LGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNRYGDMDWWDGLE 181

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           M        K +  Y+T L T  + + I E NK KP  LY+AH A H+     P Q PDE
Sbjct: 182 M--------KDMSGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPDE 227

Query: 238 EVAKFLDISDPERRT-------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
           +  +     + ++ +       Y  MV  LD SVG ++  L+K+ + EN+ V+F +DNG 
Sbjct: 228 KAVRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFFSDNGP 287

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
               + +N GS    +G K +PW+GG R     + P   +      +     D  PT+  
Sbjct: 288 ----VINNGGSAGGYKGAKGSPWEGGHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLD 343

Query: 351 AAGIEINDT--SLDGVN 365
            A I  +D+   LDG +
Sbjct: 344 MADIHYDDSKKKLDGTS 360


>gi|334145238|ref|YP_004538448.1| sulfatase family protein [Novosphingobium sp. PP1Y]
 gi|333937122|emb|CCA90481.1| sulfatase family protein [Novosphingobium sp. PP1Y]
          Length = 425

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 131/389 (33%), Positives = 187/389 (48%), Gaps = 34/389 (8%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
           +   A +  ++I+IL DDLG+ D+S  G+  I TPNID +A  G   +Q Y  A LCTPS
Sbjct: 1   MAANARRPRNVIVILCDDLGYGDLSITGARGIKTPNIDRMAREGRTFSQFYAAANLCTPS 60

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R+ L+TG+YP+  G+   VIL  +   LP +E  +P  LK A YAT   GKWHLG     
Sbjct: 61  RAGLLTGRYPVRTGLGDKVILYNDDRVLPTSEVTIPTALKTAEYATGLFGKWHLGHRGPD 120

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           + PT  GFD     + G+   +D S         G   +   QV  +   +    ++ E 
Sbjct: 121 WLPTHHGFDR----FVGIPYSHDMSPLVLVRAEAG---KEAHQVPTEITPL--QQIFCEE 171

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A   I E N  +P F+ LA  A H      P + P    A F   S      Y   V  +
Sbjct: 172 AEQFITE-NAERPFFVELALSAPHL-----PCRPP----APFAGRSG--AGAYGDSVEEV 219

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK-STPWDGGMRG 319
           D+ VG ++A LRK G+  +++VLF +DNG P +     +GS+ PLR  K    +DG  + 
Sbjct: 220 DDKVGRILALLRKLGIERDTLVLFTSDNG-PWY-----EGSSGPLRQRKGGGGYDGASKV 273

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRS 379
               WSP           +    D+LPT CA AG+ + +  LDG +   ++  GA+    
Sbjct: 274 PMIAWSPGSVSASSHCDAIAMAIDFLPTCCALAGVPLPEVELDGKDISSLIFDGAEQSPH 333

Query: 380 EILHNIDNVDNPQKYYAALRVDDLKYVAG 408
           E L   DN D       A+R    K+V+G
Sbjct: 334 EELVLFDNEDP-----VAIRTQHWKFVSG 357


>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
 gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
          Length = 502

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/376 (31%), Positives = 180/376 (47%), Gaps = 25/376 (6%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           AP+ P+I+++L DD+GW D+  +G     TPN+D +A  GL+    Y    L +PSR+AL
Sbjct: 1   APQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAAL 60

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG  
Sbjct: 61  LTGRLPIRNGFYTTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH- 119

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 120 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL 179

Query: 194 TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
           T +Y + A++ I    +  P FLY A  A HA         P      FL  S  +R  Y
Sbjct: 180 TQIYLQEALDFIKRQARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRY 228

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+S+G ++  L+   + +N+ V F +DNGA         GSN P    K T +
Sbjct: 229 GDAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTF 288

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  A AG+   +D ++DG+N    L +
Sbjct: 289 EGGMREPALAWWPGHVTAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQ 348

Query: 373 GAKTKRSEILHNIDNV 388
           G    R    +  D +
Sbjct: 349 GRLMDRPIFYYRGDTL 364


>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
           (Silurana) tropicalis]
          Length = 520

 Score =  173 bits (438), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 121/385 (31%), Positives = 182/385 (47%), Gaps = 26/385 (6%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            +AL    L     L      +P+I+++L DD+GW D+   G     TPN+D +A  G++
Sbjct: 6   MYALALGFLNPVLVLGGMNGTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASEGML 65

Query: 68  LNQHYV-QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYL 119
               Y    LC+PSR+AL+TG+ PI  G        ++    +    G+P +E L P+ L
Sbjct: 66  FPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNSHARNAYTPQEIMGGIPDSEVLFPELL 125

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM- 178
           K+AGY    IGKWHLG  R  Y P  RGFD  +G        YD+  +     Y+  DM 
Sbjct: 126 KKAGYVNKIIGKWHLGH-RPSYHPLRRGFDEWFGSPNCHFGPYDNKQRPNIPVYRDWDMV 184

Query: 179 ---RHNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQA 234
                +  +D+KT     T +Y + A+  I  +    +P FLY +  A HA         
Sbjct: 185 GRYYEDFNIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDATHA--------- 235

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           P      FL  S   R  Y   V  +D S+G ++  L +  +++++ V F +DNGA    
Sbjct: 236 PVYASRPFLGTS--RRGLYGDAVREIDFSIGKILGLLNQKDIIKDTFVFFTSDNGAALIS 293

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
                GSN P    K T ++GGMR     W P      +VS +L  I D   T  + AG+
Sbjct: 294 APMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLAGV 353

Query: 355 EI-NDTSLDGVNQWDVLTKGAKTKR 378
           EI  D ++DG++   V+ +  +T+R
Sbjct: 354 EIPKDRAIDGIDLSPVILESKETER 378


>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
 gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
          Length = 446

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 126/360 (35%), Positives = 194/360 (53%), Gaps = 36/360 (10%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F    C LLF  +  +  + +KP+I++IL DD+GW +V  +GS    TP++D LA  G+ 
Sbjct: 7   FIRHCCFLLFFTS--SAFSQQKPNIVMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMR 64

Query: 68  LNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEG--EPWGLPLTEKL-LPQYLKEAG 123
               Y  + +C+P+R+ALMTG+YP  +G+  GVI       WG    + + LPQ LK AG
Sbjct: 65  FTNFYANSTVCSPTRAALMTGRYPDLVGVP-GVIRGNVENSWGYFSPKAVTLPQVLKTAG 123

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MRHN 181
           Y T  +GKWHLG   E + P   GF   +G+ + +  DYY H         +G + MR N
Sbjct: 124 YRTAMVGKWHLGLEPENH-PNRHGFTHFHGFLEDMMDDYYTHL-------REGKNWMRLN 175

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
            QV + +   ++TDL+T+ AI  +    K+ +P FLYLA+ A H      P Q P E + 
Sbjct: 176 GQVIDPS--GHATDLFTQWAIEYLNTQKKNPQPFFLYLAYNAPHF-----PVQPPREWLE 228

Query: 241 KFLDISDP----ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
           K +   +P    +R     ++  LDES+G V+ ALR++G  +N++++F +DNG    G+ 
Sbjct: 229 K-VKKRNPNLSEKRAKLTALIEHLDESIGKVMEALRQNGQADNTLIVFSSDNG----GLL 283

Query: 297 SNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
           S++  N   +G K T ++GG+R    A+W   +    K       + D  PT C AAG +
Sbjct: 284 SDEADNGIWKGGKQTMYEGGIRVPTIAVWKNKIIPGTKTDFRALTM-DLFPTFCEAAGTK 342


>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
          Length = 453

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 144/493 (29%), Positives = 221/493 (44%), Gaps = 73/493 (14%)

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           MTG+YP+  G+Q GVI   EP G+PL E LLPQ  +E GY+T   GKWHLG++ + + P 
Sbjct: 1   MTGRYPMRYGLQRGVIRPDEPEGIPLREILLPQGFRECGYSTLMSGKWHLGYYTKAHRPQ 60

Query: 145 FRGFDSHYGYWQGLQDYYDH-----SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            RGFD  YG++ G QD+Y H     S  A+F   +G      + + +   G YST L  +
Sbjct: 61  NRGFDRFYGFYVGSQDFYFHRTGFKSDGASFWSEEGYSTGEEI-IRHDAWGHYSTTLLVD 119

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP--DEEVAKFLDISDPERRTYAGMV 257
            AI  +   N   P+F YL+    HA     P Q P   E+  K   +   ER+ +   V
Sbjct: 120 DAIKQLRNRNPGNPLFHYLSIQDPHA-----PEQRPLRLEQSIKHFALRG-ERKIFVTKV 173

Query: 258 SRLDESVGNVIAALRKHGM--LENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
             +D  +G  +  L++  +    N+IV+F +DNG P         SN PLRG K   +DG
Sbjct: 174 KSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP----RKLGASNWPLRGGKGCIFDG 229

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI--NDTSLDGVNQWD-VLTK 372
             R  A I  P +K   K    +FH  DW+PTL      ++  +   LDGV+    +L  
Sbjct: 230 AFRSRALIHIPGMK--HKNYPHMFHAVDWMPTLMEMLSCKLPKDHLPLDGVSHMKTILGM 287

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
             +  R E+L +ID  +  QK  +  R  D  Y+                          
Sbjct: 288 TDRAPREEMLLDIDPEEVLQKTASDAR--DFSYL-------------------------- 319

Query: 433 EVLYSKAGITFNALKTKLQIKQ-KHAADPKANSSDALRTILTDEKILELREFARVRCNYD 491
                K+G   + ++  L+ K+ K    P  + +     + ++ +            NY 
Sbjct: 320 ----KKSGFD-SRMRAALRYKEWKIITGPPCSGTKC--GVQSENR------------NYY 360

Query: 492 NKGAHCNSTVKPCLFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKSTMVPPGNKPFDKR 551
               + + +    L+NIT DP E  +L++   ++ K L  KL++Y        ++  DK+
Sbjct: 361 KAEEYVSKSRSVRLYNITADPRESYDLSDKYPEVTKALLLKLSVYHEEQSLFMDRMRDKK 420

Query: 552 ADPARWNNIWVPW 564
           A P    + W PW
Sbjct: 421 ALPETSEHTWFPW 433


>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
 gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
          Length = 522

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 120/367 (32%), Positives = 176/367 (47%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           A + P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 26  ALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAAL 85

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP  LK AGYA+  +GKWHLG  
Sbjct: 86  LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH- 144

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 145 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 204

Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T +Y + A+  I     + +P FLY A  A HA         P      FL  S  +R  
Sbjct: 205 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATHA---------PIYASKPFLGTS--QRGR 253

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   +  LD+SVG ++  LR   + EN+ V F +DNGA         GSN P    K T 
Sbjct: 254 YGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALISAPRQGGSNGPFLCGKQTT 313

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG+E   D ++DG++    + 
Sbjct: 314 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 373

Query: 372 KGAKTKR 378
           +G  T R
Sbjct: 374 QGRLTDR 380


>gi|340377674|ref|XP_003387354.1| PREDICTED: arylsulfatase A-like [Amphimedon queenslandica]
          Length = 522

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 125/397 (31%), Positives = 188/397 (47%), Gaps = 42/397 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           A  KP+III+ ADDLG+ D+  +G     TPN++++A  GL+L Q Y    +C+PSR+AL
Sbjct: 21  AVNKPNIIILFADDLGYGDLQIYGHPTSSTPNLNSMASEGLVLTQFYSANPVCSPSRAAL 80

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTP 143
           MTG+YP   G+  GV+      GLP  E  + + LK AGY T  +GKWHLG      Y P
Sbjct: 81  MTGRYPPRTGIWPGVLNPNSIGGLPHNETTIAEALKTAGYKTGMVGKWHLGVGENGTYLP 140

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-------VDNKTIGIY-STD 195
           T  GFD    Y+ G+  Y +  C  T   Y        ++       V      ++ +TD
Sbjct: 141 TNHGFD----YYLGIP-YSNDECPCTICYYPNAPCNSPLKIPCNPSFVSCPVCPVFENTD 195

Query: 196 L-------------YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           +             Y+ AA+  I  +    P FLY+A   VH     +P Q  DE+    
Sbjct: 196 IIEQPAHLPTLSTKYSNAALGFIRANAGKNPFFLYMAFTHVH-----QP-QFSDEDYTNT 249

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
                  R  +   ++ LD  VG +++ L+   + +N+ V F +DNG PS  +    G+ 
Sbjct: 250 T-----LRGPFGDALAELDNEVGLIMSTLKDTQVDDNTFVFFASDNG-PSLRVQQQGGNA 303

Query: 303 HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
             LR  K + WDGGMR  A  W P  K  Q  ++EL    D  PT+ +  G +     +D
Sbjct: 304 GLLRCGKGSTWDGGMREPAIAWWPG-KIRQGKTAELAATVDLFPTIMSITGAKPGSAFID 362

Query: 363 GVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALR 399
           G++   +L     + R   ++    + +P   Y A+R
Sbjct: 363 GIDMSPILFDSKPSNRESYVY-FPAIYDPNIGYHAIR 398


>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 461

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 119/375 (31%), Positives = 194/375 (51%), Gaps = 35/375 (9%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRS 82
           N+   ++P+I++++ADD GWNDV +HGS +I TP +D+LA NG  L + YV   C+PSR+
Sbjct: 29  NSLPKEQPNILLVIADDAGWNDVGYHGS-KIKTPVLDSLANNGAKLERFYVAPTCSPSRA 87

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           AL+TG     +G+   +  + +   LP +   LP+ +K+ GY T   GKWHLG   E   
Sbjct: 88  ALLTGIPASRLGIVAPIAGKSK-IALPDSLVTLPKAMKKLGYRTALFGKWHLGLTPE-NG 145

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQ--GLDMRHNMQVDNKTIGIYSTDLYTEA 200
           P   GFD+ YG+  G  D Y H  K     +   G  ++ +  V         TDL T+A
Sbjct: 146 PQAYGFDTSYGFLHGQIDQYTHEYKNGDPSWHKNGKFLKEDGHV---------TDLLTDA 196

Query: 201 AINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI-SDPERRTYAGMVS 258
           AI    +  K++ P F+ LA+ A H      P Q  +     +  I +D  R  YA  ++
Sbjct: 197 AIAYFNQETKTETPSFVTLAYSAPHF-----PLQEEERYKRPYRTIFTDSSRVDYAAAMT 251

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGA-----------PSFGIHSNKGSNHPLRG 307
            LD + G ++ +L++ G L+N+++LFM+DNGA              G ++  G N PL+ 
Sbjct: 252 HLDIAFGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKHGPNTTLGDNTPLKD 311

Query: 308 MKSTPWDGGMRGVAAIWSPWLKQ-TQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
            K+T ++G +R  A ++  W      K   +   + D LPTL    G  +   +++G + 
Sbjct: 312 YKTTNYEGAIRVPAFVY--WKNHFNGKTLDDFIGVIDVLPTLVELTGASVKPNTIEGSSF 369

Query: 367 WDVLTKGAKTKRSEI 381
            ++L   ++TK  ++
Sbjct: 370 LNLLNGASQTKIPQL 384


>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 480

 Score =  172 bits (437), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 122/371 (32%), Positives = 184/371 (49%), Gaps = 41/371 (11%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L  TL    + ++     +P+I++ILADDLGW D+S++G  Q  TP++D LA  G   ++
Sbjct: 17  LAATLGGEASGVHARETDRPNIVLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDR 76

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGYA 125
            Y    +C PSR+AL+TGK PIH G           W     LP  E  + + LK  GY 
Sbjct: 77  FYTAGVVCAPSRAALLTGKNPIHCGT----------WRNDHDLPAEEVTIAEALKAKGYH 126

Query: 126 THAIGKWHLGF-FREVYT---PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
           T   GKWH G   RE  T   P  +GFD  +G+      +          P    + R  
Sbjct: 127 TALYGKWHQGRPTREGGTRTHPLDQGFDETFGFLSATHAWEKF-------PNHLFNGRDR 179

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           ++V     G Y+ D +T+  +  IA H +   P FLYL   + H        +AP++E+A
Sbjct: 180 VEVPQ---GRYADDWFTDKGVEFIARHKDDPTPFFLYLPLTSTHFH-----IEAPEDEIA 231

Query: 241 KFLD-ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG----I 295
           K     SDP +  YA +V+R+D  VG ++ A+R  G+ ++++++F +D+GA   G     
Sbjct: 232 KHRSTFSDPIQTAYAAIVTRMDAQVGRIVQAVRDAGLEDSTLIVFTSDHGATFEGGSKEA 291

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
            S+  SNHP RG K T ++GG+R    +  P +    ++ S      D LPT    AG  
Sbjct: 292 SSSLDSNHPFRGQKRTVYEGGIRVPGIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGGR 351

Query: 356 IN-DTSLDGVN 365
              D  +DG+N
Sbjct: 352 PEPDWKVDGLN 362


>gi|285808548|gb|ADC36070.1| sulfatase [uncultured bacterium 213]
          Length = 478

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 121/361 (33%), Positives = 186/361 (51%), Gaps = 30/361 (8%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSA 83
           TAP+ P+I+ I+ADDLG+ DVS +G   + TPN+D +A  G+   Q Y   A+C+ +R+A
Sbjct: 34  TAPRPPNIVFIMADDLGYADVSCYGRPDLNTPNVDRVALKGVRFLQAYANSAVCSATRTA 93

Query: 84  LMTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TG+Y   + IG++  + + G   GLP     LP  L++AGY T  +GKWHLG   + +
Sbjct: 94  LITGRYQYRLPIGLEEPLGI-GRDVGLPPEHPTLPSLLRKAGYRTTLLGKWHLGALPK-F 151

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            P   G+D  YG+  G  DYY H+     +         +  V  +  G Y TDL    A
Sbjct: 152 GPLQSGYDHFYGFRGGSVDYYTHAGPDQRD------DLWDDDVPLRQSG-YLTDLLGSRA 204

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER-------RTYA 254
           ++VI  ++ S   FL    +++H    + P++AP +E       S   R       +TY 
Sbjct: 205 VDVINGYSHSDRPFL----VSLHFSAPHWPWEAPGDEAESNRIRSTTLRHYDGGTQKTYQ 260

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            M+ ++D  +G V+ AL  +G  +N+IV+F +DNG   F          P  G K+   +
Sbjct: 261 RMIEQMDLQIGRVLGALDANGATDNTIVVFTSDNGGERF------ADTWPFTGRKTELLE 314

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKG 373
           GG+R  A I  P   +    + ++    DWLPTL AAAG+  +     DG+N   +LT  
Sbjct: 315 GGLRIPALISWPARIRQGATTDQVAMSMDWLPTLLAAAGVAPDRAFPPDGLNLLPLLTAN 374

Query: 374 A 374
           A
Sbjct: 375 A 375


>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 455

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 180/346 (52%), Gaps = 29/346 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           ++P++++I +DD G  D   +GS  + TP ID LA  G+   Q Y    +C+ SR+ L+T
Sbjct: 30  ERPNVLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICSASRAGLLT 89

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+ P+  G+   V  +    G+P +E  L    K+AGY T  +GKWHLG +     P  +
Sbjct: 90  GRLPVRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLG-YTPATMPLGQ 148

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           GFDS +G+  G  D Y H     F  + G + RH++  D K +   G +  DL  +    
Sbjct: 149 GFDSSFGHMGGCIDNYSH-----FFYWNGPN-RHDLWRDGKEVFHDGQFFPDLLVDEMQR 202

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           ++ E ++ +P+F+Y A    H      P+Q  D+    + D+  P RR YA  VS LDE 
Sbjct: 203 LLTEKSE-QPLFVYWAINVPHY-----PYQGTDKWRNYYADLETP-RREYAAFVSTLDEK 255

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPS-----FGIHSNKGSNHPLRGMKSTPWDGGMR 318
           +G V+A L + G  EN+IV+F +D+G  +     FG     GS  P RG K   ++GG+R
Sbjct: 256 LGEVLATLEETGQRENTIVVFQSDHGHSTEERAFFG----GGSAGPYRGAKGCLFEGGIR 311

Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
             + I W   L Q + V  ++    DWLPTL     ++++   LDG
Sbjct: 312 VPSVISWPAGLPQGE-VRDQMAVGCDWLPTLVELTKVDVDLPLLDG 356


>gi|410631133|ref|ZP_11341814.1| arylsulfatase B [Glaciecola arctica BSs20135]
 gi|410149335|dbj|GAC18681.1| arylsulfatase B [Glaciecola arctica BSs20135]
          Length = 454

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 125/367 (34%), Positives = 185/367 (50%), Gaps = 35/367 (9%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           T+   P+I+IILADD+G+ D+S +GS +I TP ID LA  G    Q Y  A +C+PSR+ 
Sbjct: 32  TSAVPPNIVIILADDMGFGDLSINGSGKIKTPAIDQLARTGFNFTQFYASANICSPSRAG 91

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           LMTG+YPI   + + VI  G+  GLP TE  L +    A Y T  IGKWHLG F E + P
Sbjct: 92  LMTGRYPIRTSLGYNVITAGDTHGLPKTEDTLAELALRADYRTMLIGKWHLGRFPE-HAP 150

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
              GF++ YG          HS   A F  Y G D      V+  T+    T  YTE ++
Sbjct: 151 MKHGFETFYG--------VPHSNDMANFALYSG-DNIVEQPVEQSTL----TLRYTEKSV 197

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I + N   P  L+++H   H      P  A ++   K       +   Y  +V  LD 
Sbjct: 198 SFI-DANAKHPFLLFVSHTMPHI-----PLYASEDFAGK------SDAGVYGDVVEELDY 245

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           S G ++AAL+++G+ +NSI++  +DNG P F     +G+   L+G K + ++G  R    
Sbjct: 246 STGKIVAALKRNGIFDNSIIIITSDNG-PFF-----EGATAGLKGGKGSTYEGAYRVPLI 299

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           I  P       V+  +    D LPT+  A   +   + +DG +   +L +GA+    E L
Sbjct: 300 ISWPAGIANPAVTDAIAMNIDILPTIAEAIESKPAASIIDGRSLLPLL-RGAEKSPHEYL 358

Query: 383 HNIDNVD 389
           +  +N D
Sbjct: 359 YYFNNED 365


>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 485

 Score =  172 bits (435), Expect = 5e-40,   Method: Compositional matrix adjust.
 Identities = 127/404 (31%), Positives = 190/404 (47%), Gaps = 59/404 (14%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
            +A  +P+I++I +DD G+ND+   G+  I TP +D LA  G  L   YV    CTPSR+
Sbjct: 24  ASAADRPNILLIASDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
           +L+TG+YP   G+   +  E   +G   T                 E +LP  L+ AGY 
Sbjct: 83  SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYK 142

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +   GKW LG  + +  PT RGFD  YG+     DY+ H      E Y    M  N++  
Sbjct: 143 SGIYGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
               G Y T L+   A+  + EH  ++P FLY+   A H  ++ +P      QAPD+  A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKA 255

Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            +                    ++ P+  RR Y   V+ +D ++G ++  L    ML+ +
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDET 315

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           +V+F +DNG       S    N PLRG K+  W+GG+R    +  P  K    V ++ F 
Sbjct: 316 MVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFL 369

Query: 341 IS-DWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
            S + LP+  AAAG+E      LDG N W  L   A++ R E+ 
Sbjct: 370 TSLELLPSFAAAAGVEPPPGVVLDGFNWWPTLRGEAESPRKEMF 413


>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 440

 Score =  172 bits (435), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 126/365 (34%), Positives = 180/365 (49%), Gaps = 36/365 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+I+ ILADDLGW D+  +G++   TP++D LA  G+  NQ Y   A+C+ +R AL+TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 89  YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           Y   +  G++  +   G   GLP     LP  LK+AGY T  IGKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           G+D  +G   G  DY+ H      +P  G     ++   N+ +   G Y+  L  EA   
Sbjct: 123 GYDYFFGNHSGAIDYFTH------KPGVGEQFSSDLWQGNEPVERTGYYTYILGDEATRY 176

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVS 258
           V     + KP FL L   A H      P++ PD+E     +           + Y  +V 
Sbjct: 177 VHERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVE 231

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD++VG V+ AL   G  +N+IV+F +DNG   F          P  G K+   +GG+R
Sbjct: 232 ALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIR 285

Query: 319 GVAAI-WSPWLK-QTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGA 374
               + WS  ++ Q Q+  +  F   DWLPTL AAAG   + D   DG N   +L    A
Sbjct: 286 VPTLLRWSARIQPQVQEQVTASF---DWLPTLLAAAGARQHPDFPSDGQNILPILEGHAA 342

Query: 375 KTKRS 379
           +T+RS
Sbjct: 343 QTERS 347


>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 466

 Score =  172 bits (435), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 127/349 (36%), Positives = 168/349 (48%), Gaps = 27/349 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I++I ADDLG+ D+  +GS  I TP++D +A  GL     Y  A +CTPSR+AL+TG
Sbjct: 35  KPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTPSRAALLTG 94

Query: 88  KYPIHIGM--QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           + PI  GM     V+      GLP  E  + + LKE GYAT  IGKWHLG       P  
Sbjct: 95  RLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLG-IHPGSRPLD 153

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD------MRHNMQVDNKTIGIYSTDLY 197
           +GFD  +G  Y   +    D    +T  P   LD      +R+   V+        T  Y
Sbjct: 154 QGFDQSFGLPYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQPANQTTLTKRY 213

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           TE AI  I E  K+ P FLY+ H   H      P  A  +   K           Y   V
Sbjct: 214 TEEAIKFITE-KKNVPFFLYMPHTFPHV-----PMFASQDFKGK------SRAGIYGDAV 261

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD SVG V+ AL + G+ EN++V F +DNG P   +    GS   L+  K + W+GGM
Sbjct: 262 EELDWSVGQVLGALCREGIAENTLVFFSSDNG-PWLIMGDQGGSAGLLKDGKGSTWEGGM 320

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
           R     W P  +    V+S+L    D  PT  A AG  +  D   DGV+
Sbjct: 321 RVPGIAWMP-SRIKPGVTSQLASAMDVFPTALALAGASLPKDVVFDGVD 368


>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
 gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
          Length = 413

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 117/344 (34%), Positives = 179/344 (52%), Gaps = 35/344 (10%)

Query: 36  LADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIG 94
           +ADDLG+ D+S +GS    TP++D LA NG+     H   A+C+P+R+ L+TG+Y    G
Sbjct: 1   MADDLGYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAG 60

Query: 95  MQHGVIL----EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDS 150
           +  GV+     +    GL   E  L Q L++AGY T   GKWHLG+ R+ Y PTFRGF  
Sbjct: 61  ID-GVVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGYQRQ-YNPTFRGFQQ 118

Query: 151 HYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK 210
             GY  G  DY+ H      +     D  HN +++ +  G Y T L  + A+  I +  +
Sbjct: 119 FVGYVSGNVDYFAH-----LDGTGVFDWWHNAELNREEQG-YVTHLINDHALEFIRQQ-Q 171

Query: 211 SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK---FLDISDPERR----TYAGMVSRLDES 263
            KP F+Y+AH AVH+     P+Q P ++  +     DI   +R+     Y  M + +D+ 
Sbjct: 172 EKPFFVYIAHEAVHS-----PYQGPHDQPMRKEGGGDIKSAKRKDIANAYREMNTEMDKG 226

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-A 322
           +G ++  L++  + E + + F++DNGA      +  GSN  LRG K + W+GG R  A A
Sbjct: 227 IGQIVDVLKEVNLTEKTFIFFLSDNGA------NKNGSNGKLRGFKGSLWEGGHRVPAIA 280

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
            W   + +       +  I D +PT+   A  +I     LDGV+
Sbjct: 281 CWPGRIPEGTVCDEPVISI-DLMPTILELANAKIPAGHKLDGVS 323


>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
 gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
          Length = 440

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 124/364 (34%), Positives = 179/364 (49%), Gaps = 34/364 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+I+ ILADDLGW D+  +G++   TP++D LA  G+  NQ Y   A+C+ +R AL+TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 89  YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           Y   +  G++  +   G   GLP     LP  LK+AGY T  IGKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQLS 122

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           G+D  +G   G  DY+ H      +P  G     ++   N+ +   G Y+  L  EA   
Sbjct: 123 GYDYFFGNHSGAIDYFTH------KPGVGEQFSSDLWQGNEPVERTGYYTYILGDEATRY 176

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVS 258
           V     + KP FL L   A H      P++ PD+E     +           + Y  +V 
Sbjct: 177 VHERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVE 231

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD++VG V+ AL   G  +N+IV+F +DNG   F          P  G K+   +GG+R
Sbjct: 232 ALDKAVGQVLQALDDSGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIR 285

Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGAK 375
               + WS  ++   +V  ++    DWLPTL AAAG   + D   DG N   +L    A+
Sbjct: 286 VPTLLRWSARIQ--PQVQEQITASFDWLPTLLAAAGARQHPDFPSDGQNILPILEGHAAQ 343

Query: 376 TKRS 379
           T+RS
Sbjct: 344 TERS 347


>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
           [Oryzias latipes]
          Length = 523

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 122/357 (34%), Positives = 171/357 (47%), Gaps = 26/357 (7%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           NT     P+III+L DD+GW D+   G     TPN+DA+A  G++    Y    LC+PSR
Sbjct: 24  NTPNTSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSR 83

Query: 82  SALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHL 134
           +AL+TG+ PI  G        ++    +    G+   E LLPQ LKE GY    +GKWHL
Sbjct: 84  AALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHL 143

Query: 135 GFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIG 190
           G  R  Y P   GFD  +G        Y+++ +     Y   +M        ++D KT  
Sbjct: 144 GH-RPQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGE 202

Query: 191 IYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
              T +Y EA ++ I+   ++K P FLY A  A H+         P      FL  S  +
Sbjct: 203 SNLTQMYLEAGLDFISRQAEAKRPFFLYWAADATHS---------PVYASKPFLGKS--Q 251

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMK 309
           R  Y   V  LD SVG ++  L++ G+  N+ V F +DNGA      +  GSN P    K
Sbjct: 252 RGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALMSGPNEGGSNGPFLCGK 311

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVN 365
            T ++GGMR  A  W P   +   VS +  ++ D   T  A AGI    D  LDG++
Sbjct: 312 ETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRVLDGLD 368


>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 590

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 123/354 (34%), Positives = 179/354 (50%), Gaps = 44/354 (12%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALM 85
           A + P++++ILADD GW D+S HG++ + TPNID+LA +G +  + YV  +C P+R+  +
Sbjct: 20  AVEPPNVVVILADDQGWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFL 79

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG+Y    G++ GV   GE   L L EK + +  K AGYAT A GKWH G  +  Y P  
Sbjct: 80  TGRYHPRGGVR-GVTSGGE--RLDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNA 135

Query: 146 RGFDSHYGYWQG-LQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           RGFD +YG+  G   +Y+D   +    P QG                + TD  T+ AI+ 
Sbjct: 136 RGFDEYYGFTSGHWGEYFDPPLEHNGRPVQG--------------NGFITDDLTDHAISF 181

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF------LDISDPER------RT 252
           I + +K +P F YL     H+     P Q PD    KF      L   D ER      R 
Sbjct: 182 I-KASKDRPFFCYLPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRA 235

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
              M   +DE+VG V+  L    + + +IVL+ +DNG  S+        N  +RG K + 
Sbjct: 236 ALAMCENIDENVGRVLKTLEDLSLDQKTIVLYFSDNGPNSWRW------NGGMRGRKGST 289

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
            +GG+R    I  P   + +   +++    D LPTL   AGI + +D  LDGV+
Sbjct: 290 DEGGVRSPLLIRWPKQIRPETRVAKISAAIDLLPTLTDLAGIPVKSDKPLDGVS 343


>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 486

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 125/362 (34%), Positives = 184/362 (50%), Gaps = 36/362 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           +KP+II+ILADDLG+ D++ +G+  I TP ID +A  G+  N  YV A+C+P+R++LMTG
Sbjct: 28  EKPNIIVILADDLGYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTG 87

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            + I +G+  GV+      GL   E  LP+ LK+ GYAT  IGKWHLG   +++ P   G
Sbjct: 88  SHSIRVGI-GGVMFPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHG 145

Query: 148 FDSHYG--------YWQGLQDY---------YDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           FD  YG        Y+  ++ Y         Y  +   T E      +R N+ ++     
Sbjct: 146 FDYWYGTPASNNQFYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQ 205

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
              T  YT   I  I E N  +P F+YLAH   H      P  A +    +F+  SD  R
Sbjct: 206 SQFTQRYTRETIRFITE-NHEQPFFIYLAHNMPHI-----PLHASE----RFVGSSD--R 253

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
             Y   +  LD S G ++ AL++ G+  N++V+F +DNG P+ G     GS  PL+G K 
Sbjct: 254 GIYGDTIQELDWSTGEILRALKELGLDSNTLVIFTSDNG-PNTG---KGGSAGPLKGGKG 309

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDV 369
           + ++GG+R       P        S E     D  PTL   AG ++ +D  +DG +   +
Sbjct: 310 STYEGGVRVPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAGGDVPDDRVIDGKDISPL 369

Query: 370 LT 371
           LT
Sbjct: 370 LT 371


>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 485

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 187/404 (46%), Gaps = 59/404 (14%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRS 82
            +A  +P+I++I++DD G+ND+   G+  I TP +D LA  G  L   YV    CTPSR+
Sbjct: 24  ASAADRPNILLIVSDDQGYNDLGQLGNGII-TPALDRLAKEGTRLTNFYVAWPACTPSRA 82

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLT-----------------EKLLPQYLKEAGYA 125
           +L+TG+YP   G+   +  E   +G   T                 E +LP  L+ AGY 
Sbjct: 83  SLLTGRYPQRNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYK 142

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           +   GKW LG  + +  PT RGFD  YG+     DY+ H      E Y    M  N++  
Sbjct: 143 SGIFGKWDLGALQRML-PTSRGFDDFYGFVNTGIDYFTH------ERYGVPCMVRNLEPT 195

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-----FQAPDEEVA 240
               G Y T L+   A+  + EH  ++P FLY+   A H  ++  P      QAPD+  A
Sbjct: 196 EADKGTYCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKA 255

Query: 241 KF------------------LDISDPE--RRTYAGMVSRLDESVGNVIAALRKHGMLENS 280
            +                    ++ P+  RR Y   V+ +D ++G ++  L    ML+ +
Sbjct: 256 MYPPVEVETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDET 315

Query: 281 IVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELF 339
           IV+F +DNG       S    N PLRG K+  W+GG+R    + W         V+ E  
Sbjct: 316 IVVFFSDNGG------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFL 369

Query: 340 HISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSEIL 382
              + LP+  AAAG+E      LDG + W  L   A++ R E+ 
Sbjct: 370 TSLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPRKEMF 413


>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 458

 Score =  171 bits (433), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 121/375 (32%), Positives = 181/375 (48%), Gaps = 32/375 (8%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPH--IIIILADDLGWNDVSFHGSSQIPTPNIDA 60
           W  + F  L  T +       T A K  H  +++I+ADDLG+ D+S +G  +  TP +D 
Sbjct: 8   WFTRCFLLLAVTTVALTVSQGTAAEKPSHRNVVVIMADDLGYGDLSCYGCERYQTPFLDQ 67

Query: 61  LAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEG-EPWGLPLTEKLLPQY 118
           +A  G+  +  H    +C+P+R+AL+TG+Y    G+   +   G    GLPL      + 
Sbjct: 68  MAAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVITAAGHRDRGLPLEAVTFAET 127

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
             + GYAT   GKWH G+ +  + P  +GFD   GY  G  D+  H  +A        D 
Sbjct: 128 FSQHGYATAIYGKWHQGY-QARFNPVRQGFDHFVGYVSGNVDFQSHIDQAG-----QADW 181

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
            HN ++  +    Y T L T+ A+  I E NK +P  LYL H   H      P+Q P++ 
Sbjct: 182 WHNDELTEEAG--YVTHLITQYAVKFI-EDNKEQPFCLYLPHETPHY-----PYQGPNDL 233

Query: 239 V-----AKFLD--ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
                  KF++    D +   YA M+  +D  +G V+  L+KHG+ E ++VLF +DNG  
Sbjct: 234 ADRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQKHGLAEQTLVLFFSDNGGT 293

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
           S       G N  LRG K + W+GG R     W+P      +   +L    D  PTL   
Sbjct: 294 SL------GDNGGLRGTKGSDWEGGHRVPMIAWNPETVPAGRDCDQLAITHDIFPTLADY 347

Query: 352 AGIEI-NDTSLDGVN 365
           A + +  D SLDG++
Sbjct: 348 ADLPLPKDQSLDGIS 362


>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
          Length = 446

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 129/397 (32%), Positives = 193/397 (48%), Gaps = 36/397 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           + P+I+ ILADDLGW D+S +G +   TP++DALA  G+   Q Y   A+C+ +R AL+T
Sbjct: 7   QPPNIVFILADDLGWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALIT 66

Query: 87  GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G+Y   +  G++  ++ +    GLP     LP  L+ AGY T  IGKWHLG     + P 
Sbjct: 67  GRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSL-PTFGPL 125

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYTEAA 201
             G++  +G + G  DY+ H      +P  G  +  ++   +V  + +G Y+  L  EA+
Sbjct: 126 KSGYNRFFGNYGGAIDYFTH------KPGVGEAVPRDLYEGEVPVERVGYYTQLLADEAS 179

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVA----KFLDISDPERRTYAGM 256
             +      SKP FL L   A H      P+  P DEEV+               TY  M
Sbjct: 180 GWLRGRSAGSKPFFLSLHFTAPHW-----PWVGPEDEEVSLGLKDLFHYDGGNLHTYGKM 234

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD +VG V+ AL++ G+ +N+IV+F +DNG   F          P  G K+   +GG
Sbjct: 235 VRSLDGAVGQVLHALQEQGLADNTIVVFTSDNGGERF------SKTWPFTGQKTELLEGG 288

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTKGAK 375
           +R    +  P  +   +VS ++    DWLPTL AAAG+  + D   DG N   VL   A+
Sbjct: 289 IRVPTLLRWP-ARIAPQVSDQVTATMDWLPTLLAAAGVAPDADYPSDGENILPVLEGHAR 347

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
                +     +     +   A+R    KY+   DN 
Sbjct: 348 VHPRTLFWRYKS-----QRQRAVREGRFKYLKINDNE 379


>gi|346225028|ref|ZP_08846170.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
 gi|346226998|ref|ZP_08848140.1| sulfatase [Anaerophaga thermohalophila DSM 12881]
          Length = 488

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 123/374 (32%), Positives = 181/374 (48%), Gaps = 34/374 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+ I+I ADDLG+ D+   G   I TPN+D +A+ G      YV A +CTPSR+ L+TG
Sbjct: 35  KPNFIVIFADDLGYGDLGIFGHPTINTPNLDRMAFEGQKWTNFYVAAPVCTPSRAGLLTG 94

Query: 88  KYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           + PI  GM   +  V+      GLP +E  + + LKE GY T AIGKWHLG  +  + PT
Sbjct: 95  RLPIRSGMCSDKRRVLFPDSNGGLPQSEITIARALKENGYQTAAIGKWHLG-HKSPFLPT 153

Query: 145 FRGFDSHYGY----------WQGLQDYYDHSCK----ATFEPYQGLDMRHNMQVDNKTIG 190
             GFDS++G             G  +Y     K      F  Y    +R++  ++     
Sbjct: 154 DHGFDSYFGIPYSNDMDRVPGTGTGNYITSQIKLAEEENFHAYNVPILRNDKIIERPADQ 213

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
              T  YTE A+  I +  K  P F+YLAH   H          P     +F D+S    
Sbjct: 214 RTITKRYTEEAVAKI-KSMKDGPFFIYLAHNLPH---------IPLFRSEEFKDVS--LA 261

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
             Y  ++  +D SVG ++  L + G+ EN++V+F +DNG P     ++ G+   LRG K 
Sbjct: 262 GFYGDVIEEIDWSVGQILKTLEEEGLDENTLVVFTSDNG-PWHIFETHGGTAGLLRGAKG 320

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL-DGVNQWDV 369
             ++GGMR     W P  K  Q V  ++    D  PT C  +G ++ D  + DG +   +
Sbjct: 321 GTFEGGMREPTIFWWPG-KLKQGVVMDMATTMDLFPTFCNLSGTKLPDDRIYDGYDISPL 379

Query: 370 LTKGAKTKRSEILH 383
           L    K+ R  + +
Sbjct: 380 LLGTGKSPRDVVFY 393


>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
 gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
          Length = 439

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 111/373 (29%), Positives = 195/373 (52%), Gaps = 27/373 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+ +I+  DDLG+ D+  +GS  + TP++D LA  G+     Y  + +C+PSR++L+T
Sbjct: 2   KQPNFVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLT 61

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKYP+  G+   +  +    GLP  E  L + LK AGY T   GKWHLG  +E  +P   
Sbjct: 62  GKYPVRAGVGEILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGLSKET-SPNAH 120

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           GFD  +G+  G  D+Y H     +    G++  H++  +   +   G Y T+L TE +++
Sbjct: 121 GFDEFFGFKAGCVDFYSHIF--YWGQGHGVNPLHDLWENETEVWENGRYMTELITERSVD 178

Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            I    + + P FL+ ++ A H      P  AP E + +F  +   +R+  A M++ +D+
Sbjct: 179 FIKRSREQEAPFFLFASYNAPHY-----PMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDD 232

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
            VG ++ AL++ G  E++++ F +DNG  S   +           GS    RG K++ ++
Sbjct: 233 GVGEIVEALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDIYYGGSAGIFRGHKASLFE 292

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-----INDTSLDGVNQWDV 369
           GG+R  A +  P   +  +V  E+  + D +PT    AG++     +   +LDG +  ++
Sbjct: 293 GGIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAGVDLAAGPLQGITLDGSSLKEM 352

Query: 370 LTKGAKTKRSEIL 382
           L +G  +   ++ 
Sbjct: 353 LFRGKPSPHKQLF 365


>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
           familiaris]
 gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
           familiaris]
          Length = 522

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 179/376 (47%), Gaps = 26/376 (6%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 27  PQPPNILLLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 86

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P  E +LP+ LKEAGY +  +GKWHLG  R
Sbjct: 87  TGRLPIRNGFYTTNRHARNAYTPQEIVGGIPDQEHVLPELLKEAGYVSKIVGKWHLGH-R 145

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  +     Y+  +M         ++ KT     T
Sbjct: 146 PQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLT 205

Query: 195 DLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            +Y + A++ I     + +P FLY A  A HA         P      FL  S  +R  Y
Sbjct: 206 QVYLQEALDFIKRQQAAQRPFFLYWAIDATHA---------PVYASRPFLGTS--QRGRY 254

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D SVG +++ L+   + EN+ V F +DNGA      +  GSN P    K T +
Sbjct: 255 GDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALISAPNQGGSNGPFLCGKQTTF 314

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTK 372
           +GGMR  A  W P      +VS +L  I D   T  + AG+   +D  +DG++    +  
Sbjct: 315 EGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRVIDGLDLLPAMLG 374

Query: 373 GAKTKRSEILHNIDNV 388
           G  T R    +  D +
Sbjct: 375 GQLTDRPIFYYRGDTL 390


>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 580

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 138/461 (29%), Positives = 206/461 (44%), Gaps = 103/461 (22%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F A + +            P KP+II+I+ DDLG++D   +GS +I TPNID LAY GL 
Sbjct: 11  FLAFSASAQKKGKTTPAPTPSKPNIILIMVDDLGYSDFGAYGS-EIQTPNIDKLAYGGLR 69

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQ-------YLK 120
           L + Y  ++C P+R++L+TG+YP   G+ +         GLP  +  L Q        L+
Sbjct: 70  LKEFYNNSICAPTRASLITGQYPHKAGLGY----FNTNLGLPAYQGWLNQESLTFGEVLQ 125

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
           + GY T+  GKWH+G    +Y P  RGF+  YG+  G  +YYD S      P   +++  
Sbjct: 126 QGGYNTYLTGKWHVG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAP--PVELVE 182

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           N Q  N   G Y TD  T  A++ I E +K KP FLYLA  A H      P QAP E++A
Sbjct: 183 NNQRINLAPGKYLTDEITNHALSYINE-SKDKPFFLYLAFNAPHW-----PLQAPAEDIA 236

Query: 241 KF---------------------LDISDPERR---------------------------T 252
           K+                     L I+DP ++                            
Sbjct: 237 KYKGKYKIGWDSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFDEKELWQRKMEV 296

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS--------NKG---- 300
           YA MV  +D+SVG V+  L+K    +N++++ ++DNGA   G++         N G    
Sbjct: 297 YAAMVDHVDQSVGRVVEELKKLKKDDNTLIVLISDNGAQG-GLNQLGQRRRQRNSGPIGS 355

Query: 301 --------------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
                         SN P R  K+T  +GG+      W P   +   ++    H+ D  P
Sbjct: 356 AGSYDYQEQNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITKGTGHLIDLAP 415

Query: 347 TLCAAAGIE-------INDTSLDGVNQWDVLTKGAKTKRSE 380
           T    AG+        +   +L GV+  ++L +G    R+E
Sbjct: 416 TFYDVAGVNYPTNYRGVQTNTLPGVSLKNLLFEGNDLVRNE 456


>gi|410907337|ref|XP_003967148.1| PREDICTED: arylsulfatase A-like [Takifugu rubripes]
          Length = 501

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 124/423 (29%), Positives = 195/423 (46%), Gaps = 47/423 (11%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
             L +     P+ +++ ADDLG+ D+  +G     TPN+D LA  GL     Y  + +C+
Sbjct: 13  VLLGSCTSSPPNFVLLFADDLGFGDLGCYGHPTSLTPNLDRLAAGGLRFTDFYCTSPVCS 72

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-F 137
           PSR++L+TG+Y    G+  GV+  G   GLPL E  + + LK  GYAT A+GKWHLG   
Sbjct: 73  PSRASLLTGRYQTRSGVYPGVLYPGSRGGLPLNETTIAEVLKPLGYATAAVGKWHLGIGA 132

Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
              + PT +GFD + G  Y   +    + +C        GL       V   T+ +   +
Sbjct: 133 NGTFLPTRQGFDQYLGIPYSHDMGPCQNLTCFPPDVKCFGL-----CDVGTVTVPLMYNE 187

Query: 196 LYTEAAINVIAEHN---------------KSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
           +  +  +N +   N               K +P FLY      H    Y  F  P     
Sbjct: 188 VIKQQPVNFLDLENAYRDFASDFISTSAKKRQPFFLYFPSHHTH----YPQFAGPGAAGM 243

Query: 241 KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG 300
                    R  +   +  LD ++G+++  L   G+L N+++LF +DNG P     S  G
Sbjct: 244 SL-------RGPFGDALLELDNTIGSLLETLEGTGVLNNTLILFTSDNG-PELMRMSRGG 295

Query: 301 SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK-VSSELFHISDWLPTLCAAAGIEINDT 359
           +  PLR  K T ++GGMR  A  +  W    Q  V+ E+    D LPT+ + AG ++   
Sbjct: 296 NAGPLRCGKGTTYEGGMREPAIAY--WRGIIQPGVTHEMASTLDILPTIASLAGAKLPQV 353

Query: 360 SLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYA--ALRVDDLK---YVAGTDNNGQ 414
            LDGV+  ++L    K+KR  ++       NP + Y   A+R++  K   Y  G  ++G 
Sbjct: 354 MLDGVDMSNILFSQGKSKRETMMF---YPTNPSETYGVFAIRLEKYKAHFYTQGASHSGT 410

Query: 415 SDE 417
           + +
Sbjct: 411 TPD 413


>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
 gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
          Length = 486

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 115/350 (32%), Positives = 183/350 (52%), Gaps = 28/350 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           K P+++II  DD G+NDV   GS  I TPN+D +A  G+  +  YV QA+C+ SR+AL+T
Sbjct: 39  KLPNVVIIFTDDQGYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLT 98

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G Y   +G+ HG +      GL   E  + + LK  GY T   GKWHLG   E + P  +
Sbjct: 99  GTYSNRLGI-HGALDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLGHHPE-FLPVQQ 156

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEP----YQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           GFD ++G       + +H     + P    Y+  ++   +  D   +    T + TE ++
Sbjct: 157 GFDEYFGIPYSNDMWPNHPETKNYYPPLPLYENDEVVDTLHHDQSML----TTVLTEKSL 212

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I E NK  P FLYLAH   H      P    D+   K       ER  Y  ++  +D 
Sbjct: 213 DFI-ERNKETPFFLYLAHPMPHV-----PLFVSDKFEGK------SERGLYGDVIMEIDW 260

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           SVG V+ +L+  G+ EN++V+F++DNG   S+ +H+  GS +PL+  K T WDGG++  +
Sbjct: 261 SVGQVLESLKSKGLEENTLVIFLSDNGPWLSYSLHA--GSAYPLKEGKGTSWDGGVKIPS 318

Query: 322 AI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            + W   + + +  ++    I D LPT+    G ++ +  +DG + W ++
Sbjct: 319 IMQWKGKIPEGKVQTNPAMSI-DILPTIAEITGAKLPELPIDGKSIWKMI 367


>gi|432862125|ref|XP_004069735.1| PREDICTED: arylsulfatase A-like [Oryzias latipes]
          Length = 500

 Score =  171 bits (432), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 124/408 (30%), Positives = 195/408 (47%), Gaps = 42/408 (10%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCT 78
           + L   +   P+ +++ ADDLG+ D+   G     TPN+D LA  GL     Y  + +C+
Sbjct: 12  SLLCVISASPPNFVLLFADDLGFGDLGCFGHPSSLTPNLDRLAAGGLRFTDFYCTSPVCS 71

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-F 137
           PSR++L+TG+Y    G+  GV+  G   GLPL E  + + LK  GYAT A+GKWHLG   
Sbjct: 72  PSRASLLTGRYQTRSGVYPGVLYPGSIGGLPLNETTIAEVLKPLGYATAAMGKWHLGLGA 131

Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEP-----------YQGLDMRHNMQV 184
              + PT +GFD + G  Y   +   ++ +C   F P              + + HN  +
Sbjct: 132 NGTFLPTRQGFDQYLGIPYSHDMGPCHNLTC---FPPDVKCFGFCDLGTVTVPLMHNEVI 188

Query: 185 DNKTIGIYSTD-LYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
             + +     +  Y++ A N IA   +K +P FLY      H    Y  +  P       
Sbjct: 189 KQQPVDFLDLERAYSDFATNFIASSASKKQPFFLYYPSHHTH----YPQYAGPGAAGHSL 244

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSN 302
                  R  +   +  +D ++GN++A L + G++ N++V F +DNG P     +  GS 
Sbjct: 245 -------RGPFGDALMEMDNTIGNLMATLEQTGVINNTLVFFTSDNG-PELMRRARGGSA 296

Query: 303 HPLRGMKSTPWDGGMR--GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
             L+  K T ++GGMR  G+ A W   +K    V+ E+    D LPT  A AG ++    
Sbjct: 297 GLLKCGKGTTYEGGMREPGI-AYWPGTIK--PGVTHEMASTLDILPTFAALAGAKLPPVM 353

Query: 361 LDGVNQWDVLTKGAKTKRSEIL-HNIDNVDNPQKYYAALRVDDLKYVA 407
           LDGV+  D+L    K+KR  ++ + +D    P + Y    +   KY A
Sbjct: 354 LDGVDMMDILFNQGKSKREAMMFYPVD----PTEQYGLFAIRFGKYKA 397


>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 481

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 138/420 (32%), Positives = 202/420 (48%), Gaps = 61/420 (14%)

Query: 5   RKYFFA---LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDAL 61
           RKY      LT + L   A      P++P+I+ ILADDLG+ DV F+G   I TPNID L
Sbjct: 2   RKYLLLIPLLTSSFLTQRADAQAPKPQRPNIVFILADDLGYGDVGFNGQKLIKTPNIDKL 61

Query: 62  AYNGLILNQHYV-QALCTPSRSALMTGKYPIH--IGMQHGVILEGEPWGLPLTEKL--LP 116
           A  G+I NQ Y   ++C PSRS+L+TG++  H  I    GV  EG+    P+ + +  L 
Sbjct: 62  AKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRGNKGVEPEGQ---QPIADSVTTLA 118

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYY------DHSCKAT 169
           + LK++GY T A GKW LG       P  +GFD  YGY  Q L   Y      D+S K  
Sbjct: 119 EVLKKSGYVTAAFGKWGLGPVGSEGDPNKQGFDRFYGYNCQSLAHRYYPEHLWDNSKKIL 178

Query: 170 FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
            E  +GL       + NK    Y+ DL  + A++ +   +  +P FL+L ++  HA    
Sbjct: 179 LEGNKGL-------IHNKE---YAPDLIQKKALSFVNAQDGKQPFFLFLPYILPHA---- 224

Query: 230 EPFQAPDEEVAKFLDISDPER-----------------------RTYAGMVSRLDESVGN 266
                PD+ + ++      E+                        T+A MV+RLD  VG 
Sbjct: 225 -ELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQDFPHATFAAMVARLDLYVGQ 283

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRG-VAAI 323
           V+ AL+K G+ +N++V+F +DNG    G    +   S    RG+K   ++GG+R   AA 
Sbjct: 284 VMNALKKKGLDKNTLVIFTSDNGPHVEGGADPRFFNSGAGFRGVKRDLYEGGIREPFAAR 343

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           W   +K   K S  +    D LPT    A       ++DG++  D L   A  K+ + L+
Sbjct: 344 WPAAIKPGSK-SDYIGAFWDILPTFAELANAPA-PRNIDGISFTDALKGKAIQKKHDYLY 401


>gi|296122901|ref|YP_003630679.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015241|gb|ADG68480.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 519

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 113/335 (33%), Positives = 172/335 (51%), Gaps = 24/335 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +P+II+++ DD G+ D+S HG+  + TP++D L    +   Q +V   C P+R+++MT +
Sbjct: 45  RPNIILMMTDDQGYGDLSLHGNPVVKTPHLDQLGRQSVRFEQFHVSPTCAPTRASIMTSR 104

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           +    G+ H  ILE E   L L   +LPQ+LK AGY T   GKWHLG   + Y P  RGF
Sbjct: 105 HEFSSGVTH-TILERE--RLSLKATILPQFLKRAGYTTGIFGKWHLG-DEDAYQPGKRGF 160

Query: 149 DS---HYGYWQGLQDYYDHSC-KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           D    H G   G+   Y  SC  A    Y    +RHN +        Y T ++ + AI  
Sbjct: 161 DEVFIHGG--GGIGQSYPGSCGDAPLNKYFNPVIRHNGKF--VATNGYCTKVFVDQAITW 216

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I+    ++P F Y+   A HA     P   P E    +L+    +   + GM++  D+ +
Sbjct: 217 ISSQPDNQPFFCYITPNAPHA-----PLDCPKEYYEPYLEHVPEDVARFYGMITHWDDQL 271

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G ++ AL    + +++IV+FM DNG+ +   H + G    +R  K TP++GG+R V A W
Sbjct: 272 GRLLKALEDRDISKDTIVIFMTDNGSATGAKHFSAG----MRANKGTPYEGGIR-VPAFW 326

Query: 325 SPWLKQTQ-KVSSELFHISDWLPTLCAAAGIEIND 358
           S W    Q +V  E+    D LPTL   A + + D
Sbjct: 327 S-WAGHWQPQVRQEVTCHYDILPTLTELANVPVAD 360


>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 466

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 122/343 (35%), Positives = 180/343 (52%), Gaps = 32/343 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+I++I AD+LG+ D+  +G+  + TP +D LA  G+ L   Y  +  CT SR+ L+T
Sbjct: 33  KRPNILLITADNLGYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLT 92

Query: 87  GKYPIHIGMQHGVIL-EGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           G+YP  IG+ H +   E    GL  +E L+P+YLK+ GY T   GKW++G F     PT 
Sbjct: 93  GRYPQRIGLNHQLSADENYGDGLRKSEVLIPEYLKQQGYRTACFGKWNVG-FSPGSRPTE 151

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI--YSTDLYTEAAIN 203
           RGFD  +G+  G  DYY H        Y G   RH++    K + +  YSTDL+ +AA  
Sbjct: 152 RGFDEFFGFAAGNIDYYHHY-------YAG---RHDLWRGLKEVFVEGYSTDLFADAACQ 201

Query: 204 VIAEHNKSKPMFLYLAHLAVH--AGNTYEP-----FQAPDEEVAKF---LDISDPERRTY 253
            I+  +  +P F+YL   A H  +    +P     +QAPD    K+       +P+ R Y
Sbjct: 202 YISAES-DQPFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQER-Y 259

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK---GSNHPLRGMKS 310
             +V+ LD ++G V+  L   G+ + +IV++ +DNGA  F +        SN PLR    
Sbjct: 260 RAVVTALDSAIGRVLKQLDTSGLRDQTIVIWYSDNGA--FMLKERGLEVASNKPLRDGGV 317

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           T W+GG+R  A I  P   +   V+       D LPTL   AG
Sbjct: 318 TLWEGGIRVPAIIRYPGHLKAGTVNQSPLISLDILPTLITLAG 360


>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
 gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
          Length = 476

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 139/405 (34%), Positives = 196/405 (48%), Gaps = 46/405 (11%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           FFA   TLL         A  KP+I++ILADDLG+ DV  +G  +I TP+IDALA  GL 
Sbjct: 7   FFASFLTLLIRLP-TRLAAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEGLR 65

Query: 68  LNQHYV-QALCTPSRSALMTGKYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGY 124
             + Y   A+C PSRS+L+TG++  H  ++   GV  EG+ W LP     +   LK AGY
Sbjct: 66  FTRFYCGTAVCAPSRSSLLTGQHTGHTPIRGNIGVKPEGQ-WPLPGNTVTIATLLKSAGY 124

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGL-QDYY-DHSCKATFEPYQGLDMRHN 181
           +T   GKW LGF      P  +GFD  YGY  Q L  DY+ DH            + R  
Sbjct: 125 STGCFGKWGLGFIGTSGDPLKQGFDRFYGYNCQTLAHDYFPDHLWNN--------EQRVE 176

Query: 182 MQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG----------NTY-- 229
           +   ++   IYS  L  E A+  I +H + +P FL LA+   HA           N Y  
Sbjct: 177 LSNTHEKQTIYSAQLIHEQAMQFIRQHQR-QPFFLLLAYTLPHAALQLPKDDTLYNYYKK 235

Query: 230 ----EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
               +P    +    K  +        YA MV +LD  VG V   L+  G+ +N+++LF 
Sbjct: 236 LFNEKPVPVAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILFA 295

Query: 286 ADNGA-------PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           +DNG        P+F  +SN G    LRG+K   ++GG+R     W P   Q    S+++
Sbjct: 296 SDNGPHREGGNDPAF-FNSNGG----LRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQV 350

Query: 339 FHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
               D LPT  A AGI +   + DG++    L    K ++ + L+
Sbjct: 351 GAFWDLLPTFAALAGIGMPANN-DGISMLPTLLARGKQQQHDHLY 394


>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
           queenslandica]
          Length = 523

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 182/368 (49%), Gaps = 27/368 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           +P  P++I +L DD+GW D+  +G     TPN+D +A  G++L   Y    LC+PSR+A+
Sbjct: 29  SPTTPNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAAM 88

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E L P+ L++ GYAT  IGKWHLG  
Sbjct: 89  LTGRLPIRNGFYTTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWHLG-Q 147

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR----HNMQVDNKTIGIYS 193
           +  Y P   GFD  +G        +D   +     Y+   M      +  +++KT     
Sbjct: 148 QTHYHPLKHGFDEFFGSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTGESNL 207

Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T  YT+ AI  I ++  + KP FLY    A H          P      FL  S  +R  
Sbjct: 208 TVEYTQEAIKFINKNAANKKPFFLYWTPDATH---------TPLFASKDFLGTS--QRGL 256

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   V  LD SVG ++  L+  G+ ++++V+F +DNG  ++   S  GSN P    K T 
Sbjct: 257 YGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGATYAKESG-GSNSPFLCGKETT 315

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLT 371
           ++GGMR     W P   Q  +VS +L  + DW  T    AGI E +D  +DG++   +  
Sbjct: 316 FEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLAGISEPSDRIIDGISLAPLFK 375

Query: 372 KGAKTKRS 379
           KG +T R+
Sbjct: 376 KGQETNRT 383


>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
           [Ailuropoda melanoleuca]
          Length = 519

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 119/378 (31%), Positives = 178/378 (47%), Gaps = 26/378 (6%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
           +L    F     PK P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y  
Sbjct: 12  VLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSA 71

Query: 74  QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
             LC+PSR+AL+TG+ PI  G        ++    +    G+P  E LLP+ LK AGYA+
Sbjct: 72  NPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYAS 131

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
             +GKWHLG  R  + P   GFD  +G        YD+  +     Y+  +M        
Sbjct: 132 KIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEF 190

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
            ++ +T     T +Y + A++ +     + +P FLY A    HA         P      
Sbjct: 191 PINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTHA---------PVYASRP 241

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           FL  S   R  Y   V  +D+SVG +++ LR   + EN+ V F +DNGA         GS
Sbjct: 242 FLGTS--RRGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGGS 299

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTS 360
           N P    K T ++GGMR  A  W P      +VS +L  + D   T  A AG+   +D  
Sbjct: 300 NGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV 359

Query: 361 LDGVNQWDVLTKGAKTKR 378
           +DG++    + +G  T R
Sbjct: 360 IDGLDLLPAMLRGQLTDR 377


>gi|223936271|ref|ZP_03628184.1| sulfatase [bacterium Ellin514]
 gi|223895133|gb|EEF61581.1| sulfatase [bacterium Ellin514]
          Length = 488

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 131/403 (32%), Positives = 202/403 (50%), Gaps = 32/403 (7%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTT---APKKPHIIIILADDLGWNDVSFHGSSQIPTPN 57
           + W+   F  L    L +DA  +T    AP++P+II+ILADDLG+ D+  +G +QI TPN
Sbjct: 13  LLWSMVSFSGLLT--LTSDAQTSTNRPPAPRRPNIILILADDLGYGDLGCYGQTQIKTPN 70

Query: 58  IDALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHG--VILEGEPWGLPLTEKL 114
           ID LA +G+     Y    +C PSR+ LMTGK   H+ ++    + L GE       E  
Sbjct: 71  IDKLAEDGMKFTSFYAGSTVCAPSRATLMTGKNTGHVNIRGNADLSLNGE-------ELT 123

Query: 115 LPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ--DYY-DHSCKATFE 171
           + + LK AGYAT  IGKW LG       P  +GFD + GY   +Q  DYY  H  ++  +
Sbjct: 124 IAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEYLGYLDQVQAHDYYPTHLFRSDSK 183

Query: 172 PYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI-----AEHNKSKPMFLYLAHLAVHAG 226
             +          D+K  G+YS D +T++A+N +     ++ NK +  FLYL +   HA 
Sbjct: 184 GEESKIALTENDADHK--GLYSNDFFTQSALNYLRINKPSKLNKHRSFFLYLPYTLPHAN 241

Query: 227 NTYEPFQAPDEEVAKFLDISDPE----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
           N          EV      ++ +     +  A M++RLD  VG ++  L+K  + EN++V
Sbjct: 242 NELGNRTGNGMEVPSTEPYTNEQWPQVEKNKAAMITRLDHYVGEIMDYLKKSKLDENTVV 301

Query: 283 LFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           +F +DNG    G  + K   S   LRG+K   ++GG+R    +  P   +   +S     
Sbjct: 302 IFASDNGPHKEGGVNPKYFNSAGGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLA 361

Query: 341 ISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
             D+LPT    A    + T++DG++    L   A+T R + L+
Sbjct: 362 FWDFLPTAAEIARTS-SPTNIDGISFLPTLLGKAQTNRHQYLY 403


>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 181

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 90/181 (49%), Positives = 116/181 (64%), Gaps = 4/181 (2%)

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLGFF + YTPT+RGFDS+YG + G  DY++HS     E Y GLD+  N        G 
Sbjct: 1   WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNT---ETYSGLDLHDNENGVFSQDGN 57

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE-PFQAPDEEVAKFLDISDPER 250
           YST++YT  AI+ I  HN S+P+FLYLA+ AVH+ NT E P QAP E + KF  I   +R
Sbjct: 58  YSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQR 117

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           R YA M+  +D  VG V  AL +  ML+NSI++F  DNG P+ G   N  +N PLRG+K+
Sbjct: 118 RKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFDYNWANNFPLRGVKA 177

Query: 311 T 311
           T
Sbjct: 178 T 178


>gi|325109525|ref|YP_004270593.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969793|gb|ADY60571.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 453

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 131/416 (31%), Positives = 206/416 (49%), Gaps = 60/416 (14%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L   +LF    L   A ++P+I+ IL DD G+ D+  +GS++I TP+ID LA  G+ 
Sbjct: 15  FETLLSLVLFAGGQL-AAADQRPNIVFILTDDQGYGDLGCYGSTEIATPHIDRLAAEGMK 73

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
               YV   C+P+R+A MTG +   +G+   VI   +  GL   E  + + L+ AGY T 
Sbjct: 74  FESFYVHNRCSPTRAAFMTGCHAQRVGLGK-VIYRKDKTGLSADEITVAELLQGAGYRTG 132

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD-N 186
            +GKWHLG + E ++P   GFD+ YG+ +     +D +     +     + R  ++ D +
Sbjct: 133 IVGKWHLGEY-EQFSPVHHGFDTFYGFME-----WDDNRTTAID-----NNRTIVERDVS 181

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
           KT G++S  L   A I  I E N+ +P FLY A    H    ++P +       +F   S
Sbjct: 182 KTDGVHSPKLLA-AGIEFITE-NQDRPFFLYYASPLPH--TRWKPHE-------RFAGSS 230

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
             E+ TY  +V  +D  VG ++  L K G+ +N++V+F +DNG P   +  + GS  PLR
Sbjct: 231 --EQGTYGDVVQEIDWQVGELMQTLDKLGLTDNTLVVFASDNG-PQLNVEGH-GSAGPLR 286

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
             K T ++GG+R    +  P +     V++E+  I D LPT C  AG+++ +D  +DG +
Sbjct: 287 DGKWTNFEGGIRVPCLMRWPSVVPAGSVNNEIVGIIDMLPTFCEIAGVDVPSDRVIDGRS 346

Query: 366 ----------------------------QWDVLTKGAK--TKRSEILHNIDNVDNP 391
                                       QW +L KGAK    +S+ +   D V  P
Sbjct: 347 ILPYMRGESVHPPIHDTFVVPGATIRHQQWKLLIKGAKPGGSKSDRVGKTDRVPAP 402


>gi|423293951|ref|ZP_17272078.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
           CL03T12C18]
 gi|392677172|gb|EIY70591.1| hypothetical protein HMPREF1070_00743 [Bacteroides ovatus
           CL03T12C18]
          Length = 461

 Score =  170 bits (430), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 188/377 (49%), Gaps = 39/377 (10%)

Query: 6   KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           ++F A TC L    AF  +   A K+P+ +II+ADD+G+ DV  +G+  I TPNID +A 
Sbjct: 6   RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIKTPNIDQMAR 62

Query: 64  NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
            G++    H   ++ +P+R  L+TG+Y    G++  +++    + +  GL   E    + 
Sbjct: 63  EGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLEKVLLVPRDDKDKEVGLQSEEITFAKI 122

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLD 177
           L + GY T  IGKWHLG+ ++ +   F GF    G+  G  DY  H  +    + + GL+
Sbjct: 123 LGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNRYGDVDWWDGLE 181

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
           M        K +  Y+T L T  + + I E NK KP  LY+AH A H+     P Q P+E
Sbjct: 182 M--------KNMPGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPNE 227

Query: 238 EVAKF-------LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
           +  +         +   P +  Y  MV  LD SVG ++  L+K+ + +N+ V+F +DNG 
Sbjct: 228 KAIRTEATLEGDKNSDRPNKEIYKDMVEELDWSVGRILETLKKYKLDKNTFVVFFSDNGP 287

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
               + +N GS    RG K  PW+GG R     + P   +   +  +     D  PT+  
Sbjct: 288 ----VINNGGSTGGYRGAKGAPWEGGHRVPGICYMPGTVKGGTMCKQTVMSFDLFPTMLD 343

Query: 351 AAGIEINDT--SLDGVN 365
            A I  +D+   LDG +
Sbjct: 344 MANIHYDDSKKKLDGAS 360


>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
          Length = 525

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 120/367 (32%), Positives = 178/367 (48%), Gaps = 27/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           A + P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 30  ALQPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAAL 89

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP  LK AGYA+  +GKWHLG  
Sbjct: 90  LTGRLPIRSGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLG-H 148

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  +     Y+  +M         ++ KT     
Sbjct: 149 RPQFHPLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL 208

Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T +Y + A+  I     + +P FLY A  A H         AP      FL  S  +R  
Sbjct: 209 TQIYLQEALEFIQRQQAAHRPFFLYWAVDATH---------APVYASKPFLGTS--QRGR 257

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   +  LD+SVG ++  LR   + EN+ V F +DNGA +      +GSN P    K T 
Sbjct: 258 YGDAIRELDDSVGRILRLLRDLSIAENTFVFFTSDNGA-ALISAPRQGSNGPFLCGKQTT 316

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GGMR  A  W P      +VS +L  I D   T  + AG+E   D ++DG++    + 
Sbjct: 317 FEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLDLLPAML 376

Query: 372 KGAKTKR 378
           +G  T R
Sbjct: 377 QGRLTDR 383


>gi|343087074|ref|YP_004776369.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342355608|gb|AEL28138.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 510

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/333 (35%), Positives = 173/333 (51%), Gaps = 25/333 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGKY 89
           P+II+++ DD G+ D+S HGS  + TPN+D L    + L    V   C P+RSA+M+G++
Sbjct: 33  PNIILVMTDDQGYGDISAHGSPDVFTPNMDKLKSQSIRLEDFQVSPTCAPTRSAIMSGRH 92

Query: 90  PIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFD 149
           P   G+ H  ILE E   L LT   LPQ LK  GY +   GKWHLG  +E Y P  RGFD
Sbjct: 93  PFKNGITH-TILERERMALGLT--TLPQVLKRGGYTSGIFGKWHLGDEQE-YQPDSRGFD 148

Query: 150 SHYGYWQ-GLQDYYDHSC-----KATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             + +   G+   Y  SC        F P     ++HN  V  KT G  +   +T+A   
Sbjct: 149 EVFIHGAGGIGQAYAGSCADVPDNGYFNPI----IKHN-GVFVKTEGFCTDVFFTQALSW 203

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDE 262
           +  + +  +P F Y+   A H      PF AP++   KF+D S PE  + + GM+  +D+
Sbjct: 204 IKQKSSAEEPFFAYITTNAPHG-----PFIAPEKYKQKFIDQSYPEDAQGFYGMIENVDD 258

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS--NHPLRGMKSTPWDGGMRGV 320
           ++G ++  L   G+ +N+I++FM DNG    G ++  G   N  +RG K + ++GG R  
Sbjct: 259 NLGVLMDKLDAWGIADNTILIFMTDNGKTYGGYNTVHGETYNAGMRGFKGSVYEGGTRVP 318

Query: 321 AAI-WSPWLKQTQKVSSELFHISDWLPTLCAAA 352
             I W    KQ Q++   L H  D LPT    A
Sbjct: 319 FFIRWPKVFKQGQEIDVMLNHY-DILPTFAEIA 350


>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
 gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
          Length = 458

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 184/373 (49%), Gaps = 27/373 (7%)

Query: 3   WARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALA 62
           ++R+ F      L    A +     + PHI+ I+ADDLG+ D+SF G ++  TP+ID++A
Sbjct: 2   FSRRAFLGTAAALPLA-ARVEAAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDSIA 60

Query: 63  YNGLILNQHYV-QALCTPSRSALMTGKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYL 119
             G   +  Y   ++C+P+R AL TG+Y   +  G+   +    EP GLP     +   L
Sbjct: 61  EQGCFFSDAYANSSVCSPTRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIASRL 120

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR 179
           +EAGY T  +GKWHLG     ++P   G+D  +G   G  DY+ HS   + +     D+ 
Sbjct: 121 REAGYRTVLVGKWHLG-SPPRFSPLLSGYDRFFGIHGGGADYFTHS--QSMQGGGSSDLY 177

Query: 180 HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEE 238
               V ++    Y TD+ TE A+  I    ++ KP+FL L + A H      P++ P++E
Sbjct: 178 DGSSVTDRH--GYLTDILTERAVAEIDLAARTRKPLFLSLHYTAPH-----WPWEGPEDE 230

Query: 239 -----VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
                +   L+       T+A MV  LD+ VG V+AAL   GM +++IV+F +DNG   F
Sbjct: 231 ALAAQIKSLLNYDGGSDSTFAKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGGERF 290

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
                     P  GMK    +GG+R    +  P        S+++    D++PT  A AG
Sbjct: 291 ------SKVWPFDGMKGDLLEGGIRVPMMMRWPGKIPAGSRSAQIAMTMDYMPTFLATAG 344

Query: 354 IEINDT-SLDGVN 365
              +     DG+N
Sbjct: 345 GSADPAFPSDGIN 357


>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 491

 Score =  169 bits (428), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 132/383 (34%), Positives = 186/383 (48%), Gaps = 33/383 (8%)

Query: 10  ALTCTLL-FNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           AL   L  F+ A   + A   P++++I ADDLG+ D+   G+    TPN+D +A  G+  
Sbjct: 13  ALAGVLFSFHTATAASGAQGSPNVVVIFADDLGYGDLGCFGAKGQATPNLDRMAAEGVKF 72

Query: 69  NQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
            + YV Q +C+ SR ALMTG YP  +G++ G +  G   G+   E  L + +K+ GYAT 
Sbjct: 73  ERFYVAQPVCSASRMALMTGCYPNRVGIK-GALGPGAKVGISKEETTLAELVKQNGYATA 131

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ---V 184
           A GKWHLG   + + P   GFD + G       +  H       P Q    R       V
Sbjct: 132 AFGKWHLGDDPQ-FLPVRHGFDEYLGLPYSNDMWPYHPELVNLTPEQRKKRRGFPALPLV 190

Query: 185 DNKTIGIYS---------TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
           D   I +           T  YT+ A+  I   NK KP  LYLAH   H      P    
Sbjct: 191 DGDRIILPEVTTVEQTRLTTWYTQRAVKFI-NTNKDKPFLLYLAHSMPHV-----PLFVS 244

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFG 294
           D+         +     Y  ++  +D SVG V+ AL+++G+ EN++V+F +DNG   S+G
Sbjct: 245 DKYKG------NSANGVYGDVIEEIDWSVGEVLKALKENGLDENTLVIFTSDNGPWLSYG 298

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
            HS  GS  PLR  K T W+GG+R       P    + KV +E     D LPT+ A  G 
Sbjct: 299 THS--GSPGPLREGKGTVWEGGVREPFVARWPGKIPSGKVVTEPAMTIDILPTVAALTGS 356

Query: 355 EINDTSLDGVNQWDVL--TKGAK 375
           ++ D  +DG N W +L  T GAK
Sbjct: 357 KLPDHKIDGRNIWPLLSATDGAK 379


>gi|443704175|gb|ELU01350.1| hypothetical protein CAPTEDRAFT_214223 [Capitella teleta]
          Length = 336

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 97/256 (37%), Positives = 140/256 (54%), Gaps = 15/256 (5%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+II   ADD G+ D  +  +S I TPNID L  +G+     Y    C+PSRS+ ++G+
Sbjct: 93  KPNIIWFFADDYGYQDAGYR-NSDIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGR 151

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y    GMQHGVI +     + L    L  YLKE  Y THA GKWHLG+  +  TPT+RGF
Sbjct: 152 YAYTSGMQHGVISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGF 211

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGL-DMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           D+  G + G   YY+H+       +QG+ D  +  ++D    GI+S DL     +N + E
Sbjct: 212 DTFSGGYSGEGKYYEHTT------FQGMYDWHNGTKLDRSASGIHSQDLIEADLLNQLDE 265

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE-RRTYAGMVSRLDESVGN 266
            N   P+F Y A       NT+ P Q   E  A + D+  P  R+ Y G+VS +D  +G 
Sbjct: 266 -NDGTPLFYYAAFQ-----NTHSPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGK 319

Query: 267 VIAALRKHGMLENSIV 282
           V+A L++  M +N+++
Sbjct: 320 VVAKLKEKDMFDNTVM 335


>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 585

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 182/365 (49%), Gaps = 48/365 (13%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKP+III L DD GW D+S +G+  I TPNID+LA++G +    YVQ +C+P+R+ L+TG
Sbjct: 20  KKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAELLTG 79

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +Y    G++      GE +   L E+ +    K+AGYAT A GKWH G  +  Y P  RG
Sbjct: 80  RYAFRSGVR-STSEGGERFN--LDEQTIADVFKKAGYATGAFGKWHSG-MQYPYHPNGRG 135

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           FD  YG+           C   +  Y    + HN ++  K  G +  D +T  A++ I E
Sbjct: 136 FDEFYGF-----------CSGHWGNYYSPMLEHNGKI-VKGQG-FCVDDFTSKAMDFI-E 181

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY-------------- 253
            NK KP  +YL +   H+     P Q PD    K+ D +   + TY              
Sbjct: 182 ENKEKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTLTMKNTYNKKWGHGEDFVKAA 236

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG--APSFGIHSNKGSNHPLRGMKST 311
             M   +D +VG V+  L +  + +N+IVL+ +DNG  +P +        N  ++G K +
Sbjct: 237 LAMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGPNSPRW--------NGEMKGKKGS 288

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
             +GG+R    +  P      +V  E+    D LPTL + AGIE  +   LDG++   +L
Sbjct: 289 VDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMAGIEFQSKKKLDGLDFSSLL 348

Query: 371 TKGAK 375
           T   K
Sbjct: 349 TGKEK 353


>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
 gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
          Length = 476

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 132/406 (32%), Positives = 188/406 (46%), Gaps = 74/406 (18%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
            +P+KP+II ILADDLG+ D+ F G   I TPNID LA  G+  +  Y  + +C PSRS+
Sbjct: 14  ASPEKPNIIFILADDLGYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSS 73

Query: 84  LMTGKYPIHIGMQHGVILEGEPWG-LPLTEKLL--PQYLKEAGYATHAIGKWHLGFFREV 140
            +TG +  H  ++     E +P G  P+ + +L   + +K+AGY T A GKW LGF    
Sbjct: 74  FLTGMHTGHTPIRGNS--EVQPEGQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGST 131

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV------DNKTIGIYST 194
             P  +GF+  YGY          +C+     Y    + HN +       D  T G Y+ 
Sbjct: 132 GEPASQGFEEFYGY----------NCQRYAHRYYPAYLWHNDEKIDLPGNDWTTKGDYAP 181

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV-----AKFLDIS--- 246
           DL  E  +  I E NK  P FL++  +  HA        APD+E+     A F D     
Sbjct: 182 DLIQEKTLEFIDE-NKENPFFLFMPIVTPHA-----ELAAPDDEIMEKYRAMFPDEKPYV 235

Query: 247 -------DPERR------------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMAD 287
                   P+ R            T+A MV R+D  VG V+  L  +G+ EN++++F +D
Sbjct: 236 AGNGADYGPDMRIPGYQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSD 295

Query: 288 NGAPSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFH 340
           NGA     H   G       SN   RG K   ++GG+R     W P      K  S   H
Sbjct: 296 NGA-----HREGGADPEFFDSNGKFRGFKRDLYEGGVRAPMIAWWP---NKIKAGSTTDH 347

Query: 341 IS---DWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           +S   D LPT  A  G +I   S+DG++    L      ++ E L+
Sbjct: 348 VSAFWDLLPTFAAIGGEKIT-KSIDGISFLPTLLGEGNQQKHEFLY 392


>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 501

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 125/427 (29%), Positives = 188/427 (44%), Gaps = 67/427 (15%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F +L        AF    A   P+II+I++DD G+ D+   GS +I TP++D LA  G  
Sbjct: 16  FISLALLFCLCSAFGLKAAETPPNIIMIVSDDQGYRDLGSFGSEEIMTPHLDRLAKEGAK 75

Query: 68  LNQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-----------------GLP 109
           L   YV    CTPSR +L+TG+YP   G+   +  E   +                 G+ 
Sbjct: 76  LTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMIRNEAPDFGHKYKPAEYEVTFERIGGMD 135

Query: 110 LTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKAT 169
           + EKLLP  LK AGY +   GKW LG  +  + P  RGFD  YG+     DY+ H     
Sbjct: 136 VREKLLPALLKPAGYVSAIYGKWDLGIHKR-FLPLARGFDDFYGFTNTGIDYFTH----- 189

Query: 170 FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTY 229
            E Y    M  N Q   +  G Y T L+   A+  I E N  KP FLYL   A H  ++ 
Sbjct: 190 -ERYGVPSMYRNNQPTEEDKGTYCTYLFQREAVRFIKE-NHQKPFFLYLPFNAPHGASSL 247

Query: 230 EP-----FQAPDEEVAKFLDISD-----------------------------PERRTYAG 255
           +P      QAP++    +  + D                               R  Y  
Sbjct: 248 DPRIRGGAQAPEKYKNMYPHLKDTLVTKKKTGRYEFRERPDGPVIHQGVSASKRRLEYVA 307

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            ++ +D+++G V+  L ++ + +N+IV+F +DNG       S    N PL+G K   ++G
Sbjct: 308 SITCMDDAIGEVLGLLDEYQIADNTIVVFFSDNGG------SGGADNSPLKGKKGMMFEG 361

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGA 374
           G+R    +  P   +   V+ EL    + +PT    A I +  +  +DG +   VL    
Sbjct: 362 GIRVPCLVRYPAKIKPGTVNDELLTSLELVPTFLKEAAIPLPENVVIDGYDMLPVLMGKT 421

Query: 375 KTKRSEI 381
            + R+E+
Sbjct: 422 TSPRNEM 428


>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 490

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 173/365 (47%), Gaps = 40/365 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  +P+ I I+ADDLG+ D+ F+GS  + TP +D LA+NG+I    YV    C PSR+ L
Sbjct: 19  ADNRPNFIFIVADDLGYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGL 78

Query: 85  MTGKYPIHIGMQHGVILEGEPW----GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           +TG+Y    GM+  + L   P+    GLPL EK   + ++ AGY T  IGKWHLG   E 
Sbjct: 79  ITGRYQARFGME--INLTYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLG-ASEP 135

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDH-----------SCKATFEPYQGLDMRHNMQVDNKTI 189
           + P  RGFD  YG+  G  DY+             + K  +   +G         +    
Sbjct: 136 FHPNSRGFDYFYGFLSGGHDYFPEMVNTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAAEF 195

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE 249
             Y T   ++ A   +++    KP  LYLA+ A H      P  AP E V K+  +   +
Sbjct: 196 NEYLTTALSKDAARFVSD--SEKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDK 248

Query: 250 RR-TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG-------S 301
           RR TY  M+  +D+ +G ++ AL + G L+N+++ FM+DNG    G+ S  G        
Sbjct: 249 RRATYLAMIDSMDQGIGLIVEALEQSGKLDNTLIFFMSDNG----GVQSKPGHEYETWAD 304

Query: 302 NHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
           N P R  K +  +GG      A W     Q       +  + D   T    +G +     
Sbjct: 305 NGPFRNGKGSMREGGSHVPFIAHWPKGFPQGITYPHPVSSL-DLTATAVELSGGDATGKP 363

Query: 361 LDGVN 365
           LDGVN
Sbjct: 364 LDGVN 368


>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
 gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 483

 Score =  169 bits (428), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 126/392 (32%), Positives = 201/392 (51%), Gaps = 64/392 (16%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           +  T+L    F  T   K+P+I+ IL DDLGW D++ +GS+   TPNID LA  G+   +
Sbjct: 13  VLLTVLGPTVFAQTD--KQPNIVFILVDDLGWMDIAANGSTFYETPNIDQLAKEGIRFTK 70

Query: 71  HYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEKLLPQYLKEAGYAT 126
            Y  + +C+P+R++++TGK P  I +   +   G P     LPL E L P+ L++AGY T
Sbjct: 71  AYAASPICSPTRASILTGKNPARIDLTQWIGGPGNPDYLRNLPLEEVLFPELLQDAGYKT 130

Query: 127 HAIGKWHLGFF--REVYTPTFRGFDSHY-GYWQG---LQDYYDHSCKATFEPYQGLDMRH 180
             +GKWHL      E + P  +GFD +  G+++G   +++ Y       F P+   ++  
Sbjct: 131 AFMGKWHLNNVAGEEKFWPDKQGFDVNVAGHFRGGLYIKNKY-------FSPWNIPNL-- 181

Query: 181 NMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
               +N   G Y TD   + AI+ I E++K+ P  LY +  +VHA     PF AP + V 
Sbjct: 182 ----ENGPEGEYMTDRLAKEAIDFIDENSKA-PFLLYFSLYSVHA-----PFDAPADRVE 231

Query: 241 KF------LDISDPER-------------------RTYAGMVSRLDESVGNVIAALRKHG 275
           K+      L ++D ER                    TYA MV  +D +VG ++  L++ G
Sbjct: 232 KYEKKKKALALTDKERFAEEKNAQKPFKYRIKQDHPTYAAMVESMDMAVGKILGKLQEKG 291

Query: 276 MLENSIVLFMADNGAPSFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           + +N++V+F +DNG    G+ +++G   +N PLR  K   ++GG+R  A I  P   +  
Sbjct: 292 IADNTVVIFFSDNG----GLSTSEGIPTANTPLRAGKGWLYEGGIREPAIIRWPGTIKPG 347

Query: 333 KVSSELFHISDWLPTLCAAAGIEIN-DTSLDG 363
            VS  +    D+ PT+    G  +  D  +DG
Sbjct: 348 TVSDAVITSMDFYPTILEMTGQALRPDLHVDG 379


>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 464

 Score =  169 bits (427), Expect = 4e-39,   Method: Compositional matrix adjust.
 Identities = 126/375 (33%), Positives = 184/375 (49%), Gaps = 39/375 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           K+P+I+ ILADDLGW D+S +GS +I TPN+D LA +G+   Q Y   A+C+P+R AL T
Sbjct: 46  KRPNILFILADDLGWADLSSYGSPEIRTPNLDRLARSGVRFTQGYSAGAVCSPTRVALYT 105

Query: 87  GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G+ P     G+   +    E  G+P T   L   LK+ GYAT  IGKWH GF    ++P 
Sbjct: 106 GRLPGRTPGGLPEPIGSVNERDGIPATHPTLASLLKKTGYATALIGKWHGGFLPN-FSPL 164

Query: 145 FRGFDSHYGYWQGLQDYYD-HSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             G+D  +G + G  DYY  +S    ++ ++G       + +    G Y TD+  + A +
Sbjct: 165 RTGWDYFFGNYSGGVDYYSKYSHTGKYDLFEG-------ETEVPDDGRYYTDVIADKAED 217

Query: 204 VIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAP-----DEEVAKFLDISDPER------- 250
            I +  K  KP  L L   + H      P++AP      EE+   +   + +        
Sbjct: 218 WITKAKKQGKPWLLNLNFTSPHW-----PWEAPGDRKISEEITAKIKAGNTQAIFHNDGG 272

Query: 251 --RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
              TY  MV  LD  +G V+ ALR+ G  EN++V+F +DNG   F        N PL G 
Sbjct: 273 SLETYTKMVENLDARIGQVLNALRRSGQEENTLVVFASDNGGERFSY------NWPLTGN 326

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQW 367
           K +  +GG+R    +  P   ++ +VS E     DW  TL   AG   + D  LDG +  
Sbjct: 327 KGSLSEGGIRVPTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPSPDHPLDGASFA 386

Query: 368 DVLTKGAKTKRSEIL 382
             L +GA     +I 
Sbjct: 387 GYLLRGADAPERDIF 401


>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 440

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 126/363 (34%), Positives = 175/363 (48%), Gaps = 32/363 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+I+ ILADDLGW D+  +G++   TP++D LA  G+  NQ Y   A+C+ +R AL+TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGR 63

Query: 89  YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           Y   +  G++  +   G   GLP     LP  LK+AGY T  IGKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPSYGPQRS 122

Query: 147 GFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           G+D  +G   G  DY+ H       F P    D+    +   +T G Y+  L  EA   V
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSP----DLWQGSEPVERT-GYYTYILGDEATRYV 177

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE-----VAKFLDISDPERRTYAGMVSR 259
                + KP FL L   A H      P++ PD+E     +           + Y  +V  
Sbjct: 178 HERKGQDKPFFLSLHFTAPHW-----PWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEA 232

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD++VG V+ AL   G  +N+IV+F +DNG   F          P  G K+   +GG+R 
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIRV 286

Query: 320 VAAIWSPWLKQTQKVSSELFHIS-DWLPTLCAAAGIEIN-DTSLDGVNQWDVLT-KGAKT 376
              +   W  + Q    E   +S DWLPTL AAAG   + D   DG N   +L    A T
Sbjct: 287 PTLL--RWSARIQPQVQEQVTVSFDWLPTLLAAAGARPHPDFPSDGQNILPILEGHAANT 344

Query: 377 KRS 379
           +RS
Sbjct: 345 ERS 347


>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
 gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
          Length = 446

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 194/375 (51%), Gaps = 31/375 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+ I I  DDLG+ D+  +GS  + TPN+D+LA  G+     Y  + +C+PSR++L+T
Sbjct: 9   KQPNFIDIYCDDLGYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLT 68

Query: 87  GKYPIHIGMQHGVILEGEPW--GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           GKYP   G+  G IL  +    GLP TE  L + LK AGY T   GKWHLG   E  +P 
Sbjct: 69  GKYPARAGV--GEILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGVSEET-SPN 125

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAA 201
             GFD  +G+  G  D+Y H     +    G++  H++  +   +   G Y T+L TE +
Sbjct: 126 AHGFDEFFGFKAGCIDFYSHIF--YWGQGHGVNPLHDLWENETEVWENGRYMTELITERS 183

Query: 202 INVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           ++ I    + + P FL++++ A H      P  AP E + +F  +   +R+  A M++ +
Sbjct: 184 VDFIKRSREQEDPFFLFVSYNAPHY-----PMHAPKEYMDRFAHLP-WDRQVMAAMIAAV 237

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTP 312
           D+ VG ++ AL+  G  EN+++ F +DNG  S   +           G+    RG K++ 
Sbjct: 238 DDGVGEIVKALKAAGCYENTVIFFSSDNGPSSESRNWLDGTEDVYYGGTAGIFRGHKASL 297

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-----INDTSLDGVNQW 367
           ++GG+R  A +  P   +  +V  E+  + D +PT    AG++     +    LDG +  
Sbjct: 298 FEGGIREPAILSWPNGLEGGQVRDEVGVMMDIVPTFLDLAGVDPATRPLEGAELDGSSLK 357

Query: 368 DVLTKGAKTKRSEIL 382
           ++L  G  +   ++ 
Sbjct: 358 EMLLMGEPSPHKQLF 372


>gi|223938010|ref|ZP_03629909.1| sulfatase [bacterium Ellin514]
 gi|223893411|gb|EEF59873.1| sulfatase [bacterium Ellin514]
          Length = 500

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 143/471 (30%), Positives = 216/471 (45%), Gaps = 83/471 (17%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSAL 84
           A  +P+ + ILADDLGW DV F+GS+   TPN+D LA  G+     Y   ++C+P+R+++
Sbjct: 35  AADRPNFVFILADDLGWKDVGFNGSTFYETPNLDRLAREGMRFTDAYAACSVCSPTRASI 94

Query: 85  MTGKYPIHIGMQHGVILEGEP-------------WGLPLTEKLLPQYLKEAGYATHAIGK 131
           MTGKYP  + +     L G P               LP  E  L + L+E GY T  IGK
Sbjct: 95  MTGKYPARLHLTD--WLPGRPDKPDQILKHPKIITELPAAEITLAKALQEGGYKTAFIGK 152

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLG     + P   GFD + G   G    +  S    F PY+      N  + +  +G 
Sbjct: 153 WHLGGLG--HWPEQAGFDINIG---GCGMGHPSSY---FSPYK------NPTLKDGPVGE 198

Query: 192 YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-------DEEVAKFLD 244
           Y  D  T+ A+  I E+ K  P  LYL+H +VH      P QA         ++V +   
Sbjct: 199 YLADRLTDEAVKFI-ENTKGTPFLLYLSHYSVHT-----PLQAKKGLIEKYQKKVMQLPP 252

Query: 245 ISDPERRT--------------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
              PE  T              YA M+  LDESVG V+  L++ G+ +N++++F +DNG 
Sbjct: 253 TKGPEFVTEGNTNARQVQNQPIYAAMMQSLDESVGRVLDKLKELGLDKNTVIIFTSDNG- 311

Query: 291 PSFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPT 347
              G+ + +G   SN PLR  K  P++GG+R    +  P + +   VS      +D+ PT
Sbjct: 312 ---GLSTAEGAPTSNMPLRAGKGWPYEGGVREPLVVKWPGVTKAASVSDHQVMSTDYYPT 368

Query: 348 LCAAAGIEIN-DTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
           L   AG+    +  LDG++    L      +R    H     +      +++R  D K +
Sbjct: 369 LLEIAGLPARPEQHLDGISFTPALRGKEMGERPLFWHYPHYSNQGGAPSSSIRKGDWKLI 428

Query: 407 AGTDNNGQSDEWYGDTDNEIDKYS------PKEVLYSKAGITFNALKTKLQ 451
                     EWY   +N I+ ++       K  L S + +    LK++LQ
Sbjct: 429 ----------EWY--EENRIELFNLRLDVGEKNDLASTSALKREELKSELQ 467


>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
 gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
          Length = 453

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 125/421 (29%), Positives = 212/421 (50%), Gaps = 37/421 (8%)

Query: 1   MTWARKYFFALTCTLLFNDAFLNTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNID 59
           M   +    AL  T+    A  + T  K  P++++IL DDLG  D+S   +  + TPNID
Sbjct: 1   MKIEKIALLALGTTMTSGAALASNTVDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNID 60

Query: 60  ALAYNGLILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLL 115
            +   G+ L+  Y   ++ +PSR+AL+TG++P  +G+  GVI     + WG    +   +
Sbjct: 61  RIFETGVRLDNFYANSSVSSPSRAALLTGRFPAMVGVP-GVIRPSIDQNWGYFDPSAVTM 119

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQ 174
           P+ LK  GY T  IGKWHLG ++    P  RGF+  +G+   +  DYY H  +       
Sbjct: 120 PEVLKNGGYRTALIGKWHLG-WKSPNLPNERGFEHFHGFLADMMDDYYTHQRQGG----- 173

Query: 175 GLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEP 231
                + M +++K I    ++T+L+T  +++ I +  K K P FLYLA+ A H+     P
Sbjct: 174 -----NYMYLNDKEITPKGHATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----P 223

Query: 232 FQAPDEEVAKF--LDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
            Q P E V K    D S P +R     ++  LD+++G V+ AL + G L N++V+F +DN
Sbjct: 224 LQPPVEWVNKVQERDKSLPVKRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDN 283

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
           G    G   +  +N P RG K   ++GG+    A+  P + +  +  +    + D +PT+
Sbjct: 284 G----GDRGSMANNGPTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTI 339

Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKY 405
           C    + +    +DG++  D + +  +      +  + N   PQ   K  +A+R ++ K 
Sbjct: 340 CDFVNVPVRH-EIDGISVLDAIKRKKQNTEDRFVFWLRNEGGPQFGGKSQSAVRYNEYKL 398

Query: 406 V 406
           +
Sbjct: 399 L 399


>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
 gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
          Length = 440

 Score =  169 bits (427), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 129/364 (35%), Positives = 180/364 (49%), Gaps = 34/364 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+I+ ILADDLGW D+  +G++   TP++D LA  G+  NQ Y   A+C+ +R AL+TG+
Sbjct: 4   PNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITGR 63

Query: 89  YP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           Y   +  G++  +   G   GLP     LP  LK+AGY T  IGKWHLG     Y P   
Sbjct: 64  YQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLG-KPPAYGPQRS 122

Query: 147 GFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
           G+D  +G   G  DY+ H       F P    D+    +   +T G Y+  L  EA   V
Sbjct: 123 GYDYFFGNHSGAIDYFTHKPGVGEQFSP----DLWQGNEPVERT-GYYTYILGDEATRYV 177

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAK----FLDISDPERRTYAGMVSR 259
               ++ KP FL L   A H      P++ P DE VA+             + Y  +V  
Sbjct: 178 HERKDQDKPFFLSLHFTAPHW-----PWEGPGDEHVARSIKDLFHYDGGSLKKYGEIVEA 232

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRG 319
           LD++VG V+ AL   G  +N+IV+F +DNG   F          P  G K+   +GG+R 
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGGERF------SKTWPFTGQKTELLEGGIRV 286

Query: 320 VAAI-WSPWLK-QTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVLTK-GAK 375
              + WS  ++ Q Q+  +  F   DWLPTL AAAG   + D   DG N   +L    A 
Sbjct: 287 PTLLRWSARIRPQVQEQVTASF---DWLPTLLAAAGARPHPDFPSDGQNILPILQGIAAN 343

Query: 376 TKRS 379
           T+RS
Sbjct: 344 TERS 347


>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 442

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 134/426 (31%), Positives = 207/426 (48%), Gaps = 57/426 (13%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K +F L   +    A +   A ++P+II+I++DD G+ D SF  +    TPNID +A  G
Sbjct: 2   KRYFTLLLFIWLPLALV--VAQERPNIIVIVSDDAGYADFSFQSTKLTTTPNIDRIAQEG 59

Query: 66  LILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQY 118
                 YV A +C+PSR+ L++G      G  +  I      ++   +G+P   K + +Y
Sbjct: 60  AKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNYIKGVEYSIQQHEYGIPHNIKTIGEY 119

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
           L+  GYAT A GKWH GF  E + P  +GFD  +G+  G   Y  H+ KA       +D+
Sbjct: 120 LQPLGYATAAFGKWHEGFAEE-FQPEQKGFDHFWGFLWGANHY--HTGKA-------IDV 169

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
             + +  + T   Y TD  T+ A++  + H +  P FLY+A+ AVH+     P +A  E 
Sbjct: 170 IDDGKPVDPTTIPYMTDAITDEALSFASAHEED-PFFLYIAYNAVHS-----PLEAKPEH 223

Query: 239 VA----KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           +A    KF D  DP+R   A M   LDE+VG +   L   G L+N+I+ F+ DNG    G
Sbjct: 224 IALYEGKFPD--DPKREILAAMTHSLDENVGKIFNRLEAMGELDNTIIFFINDNG----G 277

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
                  N PL+G K   ++GG+R   A+  P   + + V   +    D LP++  A G 
Sbjct: 278 SKHISADNSPLKGYKGQLYEGGIRVPFAVRWPGKIKGKTVCHSITSSLDILPSILDAVGY 337

Query: 355 EINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ------KYYAALRVDDLKYVAG 408
              +  L+G +   VL         E++HN  N ++ +      +Y  A+R    K +  
Sbjct: 338 TARN--LEGQS---VL---------ELIHNPQNFEHRELFWHTSQYGGAIRAGQWKLI-- 381

Query: 409 TDNNGQ 414
            D NG 
Sbjct: 382 IDQNGH 387


>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
           domestica]
          Length = 520

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 119/385 (30%), Positives = 184/385 (47%), Gaps = 27/385 (7%)

Query: 9   FALTCTLLFNDAFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
             L  +LLF  A L   A  +P +I+ +L DD+GW D+   G     TP++D +A  G++
Sbjct: 6   LVLGVSLLFGWAPLGAGATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGML 65

Query: 68  LNQHY-VQALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYL 119
               Y    LC+PSR+AL+TG+ PI  G        ++    +    G+P +E LLP+ L
Sbjct: 66  FPNFYTANPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELL 125

Query: 120 KEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM- 178
           K+AGY    +GKWHLG  R  + P   GFD  +G        YD+        Y+  +M 
Sbjct: 126 KKAGYVNKIVGKWHLGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPVYRNWEMV 184

Query: 179 ---RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQA 234
                +  +++KT     T +Y + A++ I +     +P FLY A  A HA         
Sbjct: 185 GRFYEDFPINHKTGEANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHA--------- 235

Query: 235 PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           P      FL  S  +R  Y   V  +D+S+G ++  L+  G+ EN+ V F +DNGA    
Sbjct: 236 PVYASKSFLGTS--QRGRYGDAVREIDDSIGKILKLLQDLGIKENTFVFFTSDNGAALIS 293

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGI 354
                GSN P    K T ++GG+R  A  W P      +VS +L ++ D   T  + AG+
Sbjct: 294 APKEGGSNGPFLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVMDLFTTGLSLAGL 353

Query: 355 E-INDTSLDGVNQWDVLTKGAKTKR 378
           +  +D  +DG++    + +G    R
Sbjct: 354 KPPSDREIDGIDLLPAIVQGKVIDR 378


>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 462

 Score =  168 bits (426), Expect = 6e-39,   Method: Compositional matrix adjust.
 Identities = 121/416 (29%), Positives = 201/416 (48%), Gaps = 48/416 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           +KP++++I++DD G++D+   G+  I TP +D LA  G+ L   YV  + CTPSRS L+T
Sbjct: 23  QKPNVLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLT 82

Query: 87  GKYPIHIG----MQHGVILEGEPW-------------GLPLTEKLLPQYLKEAGYATHAI 129
           G+YP   G     ++ ++  G  +             G  L E  + + LKE GY    I
Sbjct: 83  GRYPQRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYI 142

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI 189
           GKW LG  +  Y P  +G+D +YG+     DY+ H      E Y    M    Q+  +  
Sbjct: 143 GKWDLGQLKR-YLPMQQGYDKYYGFANTGIDYFTH------ERYGSPSMIDGNQLSLEDK 195

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP-FQAPDEEVAKFLDI--- 245
           G Y+T L+   A+  I   ++ KP FL LA  A H+ +  +P  +   +   K+L +   
Sbjct: 196 GRYTTALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPK 254

Query: 246 SDPER----RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           S+ +R    R Y   V+++D+S+G ++  +++ G+ EN++V+F++DNG  S         
Sbjct: 255 SNTKRGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL------AD 308

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTS 360
           N+PL+G K+  ++GG+R    +  P      + +       +  P + AAAG+E+  D  
Sbjct: 309 NYPLKGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAGLELPKDVI 368

Query: 361 LDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
            DG +   VL   +   R+ +               A RV + K+V     NG  D
Sbjct: 369 YDGFDMLPVLQGKSGVDRAALYWEFRGDQ-------AARVGNWKWVQSAKGNGLFD 417


>gi|254483658|ref|ZP_05096879.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
 gi|214036091|gb|EEB76777.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
          Length = 558

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 133/406 (32%), Positives = 195/406 (48%), Gaps = 64/406 (15%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGS--SQIPTPNIDALAYNGLILNQHYVQA-LCTP 79
           + +  + P+II+I+ADDLG+ND+S  G     + TP ID LA  G +  Q Y  A  C P
Sbjct: 59  SASGEQPPNIILIVADDLGYNDISTFGGGVGGMQTPGIDRLAAEGAVFTQSYSGASTCAP 118

Query: 80  SRSALMTGKYPIHIGMQH-----------GVIL-----------------EGEP----WG 107
           SR+ +MTG+YP   G +            GV+                  E +P     G
Sbjct: 119 SRAMMMTGRYPTRTGFEFTPTPAGMAKIVGVVAASMDSGLPVSVYDEALEESQPPYERKG 178

Query: 108 LPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRGFDSHYGYWQGLQDYYDH-- 164
           LP +E  + + LK  GY T  IGKWHLG  RE    P  +GFD       GL    D   
Sbjct: 179 LPASEVTIAETLKAKGYYTAHIGKWHLG--RENGMAPHDQGFDDSLLMQSGLYLPEDDPN 236

Query: 165 --SCKATFEPY-------QGLDMRHNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPM 214
             + K +F+P         G    +N    D      Y TD +T+ +I VI + NK++P 
Sbjct: 237 VVNAKVSFDPIDRFLWAGMGFSASYNSGDADKFKPAGYLTDYWTDESIKVI-KANKNRPF 295

Query: 215 FLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKH 274
           FLYLAH   H      P Q+  E+      I    +R YAGM+  +D SV  ++  L   
Sbjct: 296 FLYLAHWGPHT-----PLQSTREDYDALSGIEPHRKRVYAGMIRAVDRSVSRILDTLEAE 350

Query: 275 GMLENSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQ 332
           G+ ++++V+F +DNG   + GI      N P RG K T ++GG+R    + W   +    
Sbjct: 351 GIADDTVVVFTSDNGGAGYIGIQD---VNSPFRGWKITMFEGGLRVPMFVKWPKRIAAGS 407

Query: 333 KVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLT-KGAKT 376
            V + + HI D +PTL AAAG  +    ++DGV+   ++T +GA++
Sbjct: 408 SVDTPVAHI-DMMPTLAAAAGASQPQGVAIDGVSLLPLVTGEGAQS 452



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 19/34 (55%), Positives = 25/34 (73%)

Query: 505 LFNITDDPCEQNNLAESQTDLLKQLEDKLAIYKS 538
           L+N+ DDP EQNNLA S  D LK+LE  LA +++
Sbjct: 489 LYNLADDPTEQNNLAASYPDKLKELEALLAAHQA 522


>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
 gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
          Length = 482

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 130/404 (32%), Positives = 201/404 (49%), Gaps = 47/404 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A   P+I+ I+ADDLG+ DV  +G   + TP+ID +A  G+   Q Y   A+C+ +R+AL
Sbjct: 33  AAGPPNIVFIMADDLGYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTAL 92

Query: 85  MTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +TG+Y   + +G++  V       GLP     LP  L+ AGY T  +GKWHLG   + + 
Sbjct: 93  ITGRYQYRLPVGLEEPVTTRFT-GGLPAEHPTLPGLLRRAGYRTALVGKWHLGSVAK-FG 150

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYTE 199
           P  RG+D+ +G+  G  DY+ H         +G D + ++    V  + +G Y+T L   
Sbjct: 151 PLQRGYDTFWGFRGGTIDYFSH---------RGTDDKPDLWDGDVRVEEVG-YATHLIGR 200

Query: 200 AAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR------- 251
            A+  I    ++ +P FL L     H    + P++ P++E A+   I+    R       
Sbjct: 201 RAVETIGTAVRAGRPFFLSL-----HFNAPHWPWETPEDE-AESRRIAGSNLRHFDGGTQ 254

Query: 252 -TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
            TY  +V+ +D  +G V+ AL +HG+ +++IV+F +DNG   F          P  G K+
Sbjct: 255 ATYHRIVAEMDAQIGRVLDALEQHGIAQDTIVVFTSDNGGERF------SDTWPFTGRKT 308

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDV 369
              +GG+R  A +  P       VS++     DWLPTL AAAG   +     DG++   +
Sbjct: 309 ELLEGGLRIPAIVSWPRRLPAGVVSAQPMITMDWLPTLLAAAGASPDPVYPPDGIDLLPI 368

Query: 370 LT-KGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNN 412
           LT + A+  RS       N        AALR  DLKY+   DN 
Sbjct: 369 LTGEAAERPRSFYWRYRANAQ------AALREGDLKYLKIRDNT 406


>gi|386819433|ref|ZP_10106649.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386424539|gb|EIJ38369.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 489

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 119/360 (33%), Positives = 180/360 (50%), Gaps = 21/360 (5%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALMT 86
           KKP+++ IL DD G+ DV   G+  I TP++D +A  G+ +++    Q +C+ SR++ +T
Sbjct: 40  KKPNVVFILTDDQGFEDVGCFGAPAIKTPHLDQMAKEGIKLIDFSATQPVCSASRASYLT 99

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G YP  IG+ H  ++     GL   E  +   LKE GY T   GKWHLG   E + PT +
Sbjct: 100 GCYPNRIGI-HNALMPDAKIGLNPEEITIADMLKEQGYNTAIFGKWHLGDAPE-FMPTKQ 157

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTIGIYSTDLYTEAAINVI 205
           GFD ++G       +  H  +     +  L +  N +V D        T   TE A++ I
Sbjct: 158 GFDEYFGIPYSNDMWPHHPQQGPIFNFDPLPLYQNEKVIDTLNDQSLLTTQITEHAVSFI 217

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            + NK+ P FLY+AH   H      P    D+   K       ER  Y  ++  +D SVG
Sbjct: 218 -QKNKNNPFFLYVAHPQPHV-----PLFVSDKFKGK------SERGLYGDVIMEIDWSVG 265

Query: 266 NVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAI 323
            ++  L+  G+ EN++V+F +DNG   S+G HS  GS  P R  K T W+GG R    A 
Sbjct: 266 EILKTLKAEGIDENTLVIFTSDNGPWLSYGNHS--GSAKPFREGKGTSWEGGQRVPFIAR 323

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           +   L Q + +++ +  I D  PT+ A     +    +DG N WD+LT+   T   E  +
Sbjct: 324 FPGKLPQNKTITTPVMGI-DIFPTIAALTNSTLPKKKIDGKNVWDILTEKTTTTPHEAFY 382


>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
          Length = 461

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 111/314 (35%), Positives = 164/314 (52%), Gaps = 30/314 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           +KP+I+I+LADD G+ D  F GS+ I TPNID LA  G I    +V A + +PSRS ++T
Sbjct: 29  RKPNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +  +   G+  G+P  E+LLP  LK  GY T  IGKWHLG       P  +
Sbjct: 89  GRYGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SEPSQRPNAK 145

Query: 147 GFDSHYGYWQGLQDY-YDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           GFD+ YG   G + Y YD          + YQ     +N Q    +   Y TD     A 
Sbjct: 146 GFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQ-----YNGQ--KLSFDGYFTDELASKAR 198

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
             +AE    +P  LY++  A H+     P +A +E++A+F       R+ YA M+  LD 
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHS-----PNEAAEEDLARF---EGQPRQKYAAMMYALDR 248

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVA 321
            VG ++  L+  G  +++I+ F++DNG    G  +N+ SN PL+G K   ++GG R    
Sbjct: 249 GVGKIVDELKAAGKFDDTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPFF 304

Query: 322 AIWSPWLKQTQKVS 335
            +W    K+ Q+ +
Sbjct: 305 VVWGDRFKRDQRFT 318


>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 444

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 168/348 (48%), Gaps = 30/348 (8%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSR 81
           NT +P  P+II+   DD G+ D+  +G+    TP++D LA  G+     YV A +CTPSR
Sbjct: 29  NTISP--PNIIVFFTDDQGYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSR 86

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           + L+TG+YP    +   V+      GL      + + LK AGY+T  IGKWHLG  ++ Y
Sbjct: 87  AGLLTGQYPKRSNLHEAVLFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEY 145

Query: 142 TPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQV-DNKTIGIYSTDLYT 198
            P  +GFD+ YG  Y   + +YY  +      P   L    N +V +N +   Y T  YT
Sbjct: 146 MPYNQGFDTFYGVPYSNDMDNYYYKNIDFQSPP---LPFYENTKVIENGSDQRYLTKRYT 202

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           E  +  I    + KP F+YLAH   H       F +P  E          +   Y  ++ 
Sbjct: 203 EETVKRIKNRGE-KPFFIYLAHNMPHT----PLFASPAFE-------GKSKNGLYGDVIM 250

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD S G +I AL++ G+ EN+I +F +DNG         KGS  PLRG K+  W+GG R
Sbjct: 251 ELDWSAGEIINALKEEGIYENTIFVFTSDNGPA-------KGSAKPLRGKKAQTWEGGQR 303

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
               I  P       V+ E     D  PT    A  +I  +  +DG++
Sbjct: 304 VPGIITWPGSIPRGVVTDEFVSTLDLFPTFAKLASTKIPEEIKIDGMD 351


>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
 gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
          Length = 443

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 181/350 (51%), Gaps = 22/350 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+ I+I  DDLG+ D+  +GS  + TP++D LA  G+     Y  + +C+PSR++L+T
Sbjct: 6   KQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLT 65

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKYP   G+   +  +    GLP  E  L + LK AGY T   GKWHLG   E  +P   
Sbjct: 66  GKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGLSEET-SPNAH 124

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           GFD  +G+  G  D+Y H     +    G++  H++  +   +   G Y T+L TE +++
Sbjct: 125 GFDEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVD 182

Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            I    + + P FL+ ++ A H      P  AP E + +F  +   +R+  A M++ +D+
Sbjct: 183 FIQRSREQEAPFFLFASYNAPHY-----PMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDD 236

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
            VG ++ AL++ G  E++++ F +DNG  S   +           GS    RG K++ ++
Sbjct: 237 GVGEIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 296

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
           GG+R  A +  P   +  +V  E+  + D  PT    AG++     L GV
Sbjct: 297 GGIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAGVDPAAGPLQGV 346


>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 462

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 118/367 (32%), Positives = 176/367 (47%), Gaps = 26/367 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
           L   A  KP++I+I+ADDLG+ DV F+G   IPTP ID++A NG+  +  Y   ++C PS
Sbjct: 15  LIVQASSKPNLIVIMADDLGYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPS 74

Query: 81  RSALMTGKYPIHIGMQHGVI--LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           R+  +TG+Y    G +      L      LP +E  + + L++ GY    IGKWHLG   
Sbjct: 75  RAGFITGRYQQRFGFERNPQWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLG-AE 133

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYY--------DHSCKATFEPYQGLDMRHNMQVDNKTIG 190
               P  RGF+  +G+  G   Y+            K   + Y     R++  V      
Sbjct: 134 PSLRPNQRGFNEFFGHLGGGHAYFPEKLRIIKTEDVKNEMDSYASYITRNDTPVKTTK-- 191

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            Y TD +++ AI  + E N  +P FL+L++ A H      P QA  + + +F  I D  R
Sbjct: 192 -YLTDEFSDEAIRFV-EKNYEQPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNR 244

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
           RTY  MVS +D+ +  ++  L    + +++IV F++DNG P+     N   N  LRG K 
Sbjct: 245 RTYCAMVSAMDDGIVRLMKKLDALKIKDDTIVFFLSDNGGPT---TKNNSDNGDLRGKKG 301

Query: 311 TPWDGGMRGVAAIWSPW-LKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
             ++GG R   A+  P  LK  Q     +  +  +      A      D  LDGVN    
Sbjct: 302 DVFEGGFRVPFAMQYPRELKPGQTYDLPVSSLDIFATIASLAQSPTHADKPLDGVNLLPY 361

Query: 370 LTKGAKT 376
           L KG K+
Sbjct: 362 L-KGEKS 367


>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 159

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 77/157 (49%), Positives = 113/157 (71%)

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           YTE A+++I  HN S+P+FLYL++ A H G      +AP+E + KFL I +  R TYAGM
Sbjct: 1   YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD+S+G ++ AL++  ++ N+IV+F +DNGA  +G+ SN+G N PLRG K T W+GG
Sbjct: 61  VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGALPWGVKSNRGYNWPLRGGKFTLWEGG 120

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           +R    IWSP L+ +++VS+++ HI+DWLPTL +AAG
Sbjct: 121 LRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAG 157


>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
 gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
          Length = 468

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 121/348 (34%), Positives = 180/348 (51%), Gaps = 32/348 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P+II ILADDLG+ DV  +G ++I TPNID LA  G  L   Y  A +C+PSR  L+T
Sbjct: 33  QQPNIIFILADDLGYGDVGVYGQTKILTPNIDLLAREGTKLTDFYAGAPVCSPSRGVLLT 92

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEK-----LLPQ------YLKEAGYATHAIGKWHLG 135
           G    H  ++  + ++G   G   ++K     LLPQ       L+ +GY T  IGKWH+ 
Sbjct: 93  GLNTGHATIRGNMAIKGGLAGNKGSDKIRRAGLLPQDSTIGDLLQRSGYTTGLIGKWHVD 152

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP---YQGLDMRHNMQVDNKTIGIY 192
            F  + TP   GFD  YG+       Y  +  +T+ P   Y+   +    Q  N     Y
Sbjct: 153 GFDTLATPQRHGFDYFYGWLVS----YPQTYASTYWPDTWYRDGKLVAVPQNQNGQKNYY 208

Query: 193 STDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERR 251
           ++++ T+ AI  +A+H KS KP FL + H      N + P  AP   + +  D +  +++
Sbjct: 209 TSEIITDDAIRYLAKHKKSKKPFFLMVNH-----SNPHSPLDAPHNTIYEGKDWTS-DQK 262

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------SNHPL 305
           TYA MV+ LD SVG +   L + G+ +N+IV+F +DNG  S    S         S+ PL
Sbjct: 263 TYAAMVTYLDSSVGKIRKYLIESGLDKNTIVIFTSDNGPRSEPTTSLTAVSDFFHSSGPL 322

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           RG K    +GG+R    +W+    ++  VS    + +D LPTL   AG
Sbjct: 323 RGYKRDVTEGGIRVPMIVWNKNRIKSDAVSHTPGYFADILPTLAGIAG 370


>gi|149196405|ref|ZP_01873460.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
 gi|149140666|gb|EDM29064.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
          Length = 472

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 186/368 (50%), Gaps = 29/368 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A  KP+II IL DDLG+ +V ++G   I TP +D LA  G+    HY   A+C PSR++L
Sbjct: 17  AQMKPNIIYILCDDLGYGEVGYNGQKMIQTPELDKLASKGMRFTDHYCGNAVCAPSRASL 76

Query: 85  MTGKYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +TGK+P H  ++      G P G   +P   + L + +K AGYAT  IGKW LG F    
Sbjct: 77  ITGKHPGHAFIRANS--PGYPDGQTPIPADSETLGKLMKRAGYATACIGKWGLGGFHNAG 134

Query: 142 TPTFRGFDSHYGY--WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
            P  +GFD  YGY   +   +YY        E       +  +   N     YS DL T 
Sbjct: 135 NPHKQGFDHFYGYTDQRKAHNYYPEYLWRNGE-------KEMLNNKNGEENDYSHDLMTV 187

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP-ERRTYAGMVS 258
            A+  I E  K +P FLYLA+L  H       +Q PD  +A++ D   P E + +A M S
Sbjct: 188 DALKYI-EEKKDQPFFLYLAYLIPHV-----KYQVPD--LAQYKDKDWPKEMKIHAAMTS 239

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKSTPWDGG 316
           R+D  +G +   L + G+ +N++++F +DNGA     +S K   ++  L+G+K + +DGG
Sbjct: 240 RMDRDIGTIARRLEELGIADNTLIMFNSDNGAHGKS-NSEKFFNTSGDLKGLKRSMYDGG 298

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQW-DVLTKGAK 375
           +R     + P   Q   VS  +    D +PT     G E      DG++    +L K ++
Sbjct: 299 VRSPMIAYWPGTIQAGSVSDHISAFWDMMPTFSELTG-EPFKGETDGISMLPTLLGKDSE 357

Query: 376 TKRSEILH 383
            K+ + L+
Sbjct: 358 QKQHKYLY 365


>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 477

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 122/342 (35%), Positives = 175/342 (51%), Gaps = 30/342 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+II+I  DD G++DV  +G+S I TP++D LA  GL     YV QA+C+ SR+AL+TG 
Sbjct: 31  PNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAVCSASRAALLTGT 90

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP  +G+ HG +      GL   E  +   LK  GYAT   GKWHLG   E + PT +GF
Sbjct: 91  YPNRLGI-HGALDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLGHHPE-FLPTNQGF 148

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS--TDLYTEAAINVIA 206
           D ++G       + +H     +  Y  L +  N +V +      S  T  +TE +I+ I 
Sbjct: 149 DEYFGIPYSNDMWPNHPQTKDY--YPPLPIYQNDKVVDTIWNDQSMFTTWFTEKSIDFI- 205

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           E NK +P FLYLAH   H      P    D    KF   S+  R  Y  ++  +D SVG 
Sbjct: 206 ERNKDRPFFLYLAHPMPHV-----PLFVSD----KFKGKSN--RGLYGDVIMEIDWSVGQ 254

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI--- 323
           ++  L+++ + EN+I++F++DNG P      + GS +PLR  K T WDGG++  A +   
Sbjct: 255 IMQTLKQNDLDENTIIIFLSDNG-PWLSYSEHSGSAYPLREGKGTSWDGGVKVPAIMRWK 313

Query: 324 --WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
             W   L Q     +      D LP L    G E+    +DG
Sbjct: 314 GTWPSGLVQKNPAMT-----IDILPMLAHLTGAELPVKKIDG 350


>gi|114799529|ref|YP_761144.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114739703|gb|ABI77828.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 459

 Score =  168 bits (425), Expect = 9e-39,   Method: Compositional matrix adjust.
 Identities = 126/355 (35%), Positives = 181/355 (50%), Gaps = 38/355 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A K P+IIII+ADDLGW D+S +G++ I TPNID +   G+ L   Y  + +C+PSR+AL
Sbjct: 35  AAKPPNIIIIMADDLGWGDISLNGAALIETPNIDRIGQEGIQLTDFYAGSNVCSPSRAAL 94

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +TG+YPI  GMQH VI      GLP  E  + + LK AGY T  +GKWHLG  +E Y PT
Sbjct: 95  LTGRYPIRSGMQH-VIFPHSQDGLPAEEITISEMLKNAGYRTGMVGKWHLG-HQEEYWPT 152

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            +GFD  YG        Y +   A F+ Y+G ++  +   D   + +     Y +AA   
Sbjct: 153 NQGFDWFYGV------PYSNDM-APFDLYRGKEIIES-PADQSQLSLN----YAKAAKEF 200

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESV 264
           I E +  KP FLY A    H      P   P++           +   Y  +V  +D  +
Sbjct: 201 I-EDSSDKPFFLYYAETFPHI-----PLFVPEDRSGT------SDAGLYGDVVETVDAGI 248

Query: 265 GNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIW 324
           G V+  L + G+ ++++++F +DNG P F     +GS    RG K    +GG R      
Sbjct: 249 GIVLDTLDEAGVADDTLIIFTSDNG-PWF-----EGSAGEFRGRKGETHEGGFR--VPFL 300

Query: 325 SPWLKQTQK--VSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKT 376
           + W     K  VS E+    D LPT  + +G  +  D  +DG +   +LT GA T
Sbjct: 301 ARWPGHIPKGSVSHEMAMNIDLLPTAASLSGATLPADRVIDGKDLTSLLTAGAPT 355


>gi|300114943|ref|YP_003761518.1| sulfatase [Nitrosococcus watsonii C-113]
 gi|299540880|gb|ADJ29197.1| sulfatase [Nitrosococcus watsonii C-113]
          Length = 463

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 128/413 (30%), Positives = 195/413 (47%), Gaps = 62/413 (15%)

Query: 2   TWARKYFFALTCTLLFNDAFLNTTAP------------KKPHIIIILADDLGWNDVSFHG 49
           T+ R++F     TL F  A +  + P             +P+I++I+ADD+G+ DV  +G
Sbjct: 5   TYTRRHFLK---TLGFAGASMALSWPIDSLSLVAERERGRPNIVLIVADDMGYGDVGCYG 61

Query: 50  SSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW-- 106
           +  I TPN+DALA  G      H    LCTP+R+AL+TG Y   +G+Q  +I + + +  
Sbjct: 62  NQHIKTPNLDALAKRGARFTDFHSNGPLCTPTRAALLTGCYQQRVGLQ--IIPKDQRYAM 119

Query: 107 --GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH 164
              +PL E    + LK  GY+T  IGKWHLG  R  ++P+ +GFD ++G        Y H
Sbjct: 120 AKAMPLAEITFAEALKAVGYSTALIGKWHLG-DRPSFSPSRQGFDEYFGI------PYSH 172

Query: 165 SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY-TEAAINVIAEHNKSKPMFLY----LA 219
                 + +  L +  N ++      +     Y TE A+  I++H KS P  LY    + 
Sbjct: 173 DMHPWRKSFPPLPLMRNEEIIELNPDLDDMTQYCTEEAVQFISKH-KSNPFLLYMPHPMP 231

Query: 220 HLAVHAGNTY---------EPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAA 270
           H  VH    +            Q  D++  KFL         Y+  +  +D SVG VI  
Sbjct: 232 HQPVHVSKRFAQRFSKEQLAAIQGEDKKSRKFL---------YSAAIEEIDWSVGEVIKT 282

Query: 271 LRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLK 329
           ++  G+ + ++V F +DNG P+       GS  PLRG K   W+GG R    A W   + 
Sbjct: 283 IKSLGIEDKTLVAFTSDNG-PAI------GSAGPLRGKKGALWEGGHRVPFIACWRGKI- 334

Query: 330 QTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           Q   V  E+    D  PT+ A AG  + +  +DGVN   +L +G K     + 
Sbjct: 335 QPNVVIDEIAMSMDLFPTMAAMAGASLPEKKIDGVNLLPLLCEGGKLSERTVF 387


>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 483

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 181/367 (49%), Gaps = 31/367 (8%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           T  +KP+ + I  DD+G++DVS +G ++  TPNIDALA  G+     Y  + + +PSR+ 
Sbjct: 29  TKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAG 88

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            +TG+YP  +G+Q GV       G+   E  + + LK  GYAT  IGKWHLG  RE Y P
Sbjct: 89  FLTGRYPARMGIQ-GVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLP 146

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             +GFD ++G          +S   + + Y   +      +D   +    T  YTE A++
Sbjct: 147 LQQGFDEYFG--------IPYSNDMSAQVYLRGNEVEEFHIDINNV----TKKYTEEAVD 194

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I      +P FL+LAH  +H      P    DE   K           Y   V  +D S
Sbjct: 195 YI-RRKADQPFFLFLAHSMMHV-----PIYVSDEFAGK------SGAGIYGDAVLEVDWS 242

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAA 322
           VG ++  LR+ G+ +N++V+F +DNG P        G   PLR  K+T ++GG+R    A
Sbjct: 243 VGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLGGRALPLREGKTTAFEGGVRVPCIA 301

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
            W   +K    V++++  + DW PT+ A +G  + D  LDG +   VL    K    +  
Sbjct: 302 YWKGQIKPV--VNTDVVSLLDWFPTVTALSGGILPDVRLDGYDLTAVLNGTGKRASEDYA 359

Query: 383 HNIDNVD 389
           +  +N D
Sbjct: 360 YFRNNRD 366


>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 536

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 139/402 (34%), Positives = 194/402 (48%), Gaps = 52/402 (12%)

Query: 14  TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV 73
           +L  N+  L +    +P+++ ILADDLGW +V   G S+IPTPNID LA  G+ L +HY 
Sbjct: 23  SLWANELTLISHQSPRPNVVFILADDLGWGEVGCFGQSKIPTPNIDRLASRGVKLTRHYS 82

Query: 74  QA-LCTPSRSALMTGKYPIHI---GMQHGVI-----LEGEPWGLPLTEKLL--PQYLKEA 122
            A  C PSR  LMTGK+  H    G Q   +      EG+    PL++K L   +  ++A
Sbjct: 83  GAPTCAPSRCVLMTGKHLGHAEIRGNQQAKVKLPQFTEGQ---HPLSDKALTIARQFQKA 139

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYY--------DHSCKATFEPY 173
           GYAT A GKW LG       P  +GFD  +GY  Q L   Y          S     +P 
Sbjct: 140 GYATGAFGKWGLGPVGSTGEPNRQGFDEFFGYNCQALAHSYFPKALWKNAESIVNNEKPV 199

Query: 174 QGLDMRHNMQVDNKTIGI--YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG----- 226
            G   +   +V  +      Y+  L    A++ I  H++ +P FLYL     H       
Sbjct: 200 PGHKKQPEGEVTMEAYQGENYAPRLIMAEALSFIDRHHQ-QPFFLYLPFTEPHVAMQPPP 258

Query: 227 NTYEPFQAP-DEEVAK----FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
              E F    DE V +    +L    P R  YA M+  LD  VG+VI +L KHG+LE ++
Sbjct: 259 KIVEEFPVEWDERVYRGDGGYLPHPRP-RAAYAAMIRDLDNHVGDVITSLEKHGLLEKTL 317

Query: 282 VLFMADNGA------PSFGIHSNK----GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQT 331
           ++F +DNGA      P F +         S   L+G K + ++GG+R V AI S W  Q 
Sbjct: 318 IVFTSDNGATHASANPDFHVGGADPLFFNSTRELKGFKGSIYEGGLR-VPAIVS-WPGQI 375

Query: 332 QKVSS--ELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
              ++     +  DW PTLC A  + + +  LDGVN   +LT
Sbjct: 376 PPATTINTPSYFPDWFPTLCNATQLPLPE-GLDGVNLLPLLT 416


>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
 gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
          Length = 479

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 187/376 (49%), Gaps = 43/376 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           + P++III ADD+G+ D+  +G   I +PN+D +A  G+     Y   ++CTPSR+ L+T
Sbjct: 36  QSPNVIIIFADDMGYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLT 95

Query: 87  GKYPIHIGMQHG---VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           G+ P+  GM H    V+      GLP TE  + + LKE  Y T  +GKWHLG     + P
Sbjct: 96  GRLPVRSGMAHDQIRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLG-----HLP 150

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD-----------MRHNMQVDNKTIGIY 192
            F+  D  +  + G+    DH  K      Q +            M++   ++       
Sbjct: 151 GFQPLDHGFDEYFGIPYSNDHDLKKELSYIQTITHAKDGDFNVPLMQNRSIIERPANQNT 210

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
            T  YT+ A++ I + N ++P FLYLAH   H      P  A D+    F   SD  R  
Sbjct: 211 ITKRYTQEAVSFIKK-NSNQPFFLYLAHSMPHV-----PLFASDQ----FRGSSD--RGL 258

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y  ++  +D SVG V++ L + G+ EN++V+F +DNG P   + ++ GS   L+  K T 
Sbjct: 259 YGDVIEEIDWSVGQVLSTLSEQGISENTLVVFTSDNG-PWLIMGAHGGSAGLLKSGKGTS 317

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHIS----DWLPTLCAAAGIEI-NDTSLDGVNQW 367
           ++GGMR  A  W P     +K+   + H +    D  PT+ + AGI++ +D S DG +  
Sbjct: 318 YEGGMREPAIFWWP-----EKIKPAVAHNTASTLDLFPTIMSIAGIDMPSDRSYDGYDLS 372

Query: 368 DVLTKGAKTKRSEILH 383
             + +    +R  I +
Sbjct: 373 PTMFEQKSNERKNIFY 388


>gi|374619517|ref|ZP_09692051.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302744|gb|EHQ56928.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 556

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 128/370 (34%), Positives = 175/370 (47%), Gaps = 58/370 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSS----QIPTPNIDALAYNGLILNQHYVQA-LCTPSRS 82
           + P+II+I+ADDLG+ND+S  G      ++ TP+ID LA +G++  Q Y  A  C PSR+
Sbjct: 63  RPPNIILIVADDLGYNDISTFGGGLDGGRVKTPHIDQLAADGVVFTQSYSGAGTCAPSRA 122

Query: 83  ALMTGKYPIHIGMQHGVILEG--------------------------------EPWGLPL 110
            LMTG+YP   G +      G                                E  GLP 
Sbjct: 123 MLMTGRYPTRTGFEFTPTPSGMAPMLSRISAEMGRGTPSMIYDAALDESKPPYEQQGLPP 182

Query: 111 TEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL----QDYYDHSC 166
            E  + + LKE GYAT  IGKWHLG  ++   P  +GFD       GL     D    + 
Sbjct: 183 EEVTIAEILKERGYATFHIGKWHLGR-QDGMAPHEQGFDQSLLMASGLFLPEDDPNVVNA 241

Query: 167 KATFEPY-QGLDMR-------HNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
           K  F+P  Q L  R       ++   D    G Y TD +T+ +IN+I   NK++P FLYL
Sbjct: 242 KLDFDPIDQFLWARMAFANSFNSGDQDRFEPGGYLTDYWTDESINII-NANKNRPFFLYL 300

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
            H  VH      P QA  E+     DI+   +R YA M+  LD SVG +  AL + G+ +
Sbjct: 301 GHWGVHT-----PLQATREDYEAVGDITPHRKRVYAAMIRSLDRSVGRINKALEELGIAD 355

Query: 279 NSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSEL 338
           N++V+F  DNG P  G       N P RG K + ++GG+R    +  P       VS E 
Sbjct: 356 NTLVVFTNDNGGP--GYIGLPDINKPFRGWKISQFEGGIRVPLMMKWPARISPGTVSEEP 413

Query: 339 FHISDWLPTL 348
               D +PTL
Sbjct: 414 VAHIDVMPTL 423


>gi|423214667|ref|ZP_17201195.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692573|gb|EIY85810.1| hypothetical protein HMPREF1074_02727 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 459

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 118/376 (31%), Positives = 188/376 (50%), Gaps = 37/376 (9%)

Query: 6   KYFFALTCTLLFNDAF--LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           ++F A TC L    AF  +   A K+P+ +II+ADD+G+ DV  +G+  I TPNID +A 
Sbjct: 6   RHFVAATCALA---AFSPVEMMAQKQPNFVIIVADDMGYGDVGIYGNEYIRTPNIDQMAR 62

Query: 64  NGLILNQ-HYVQALCTPSRSALMTGKYPIHIGMQHGVIL----EGEPWGLPLTEKLLPQY 118
            G++    H   ++ +P+R  L+TG+Y    G++  +++    + +  GLP  E    + 
Sbjct: 63  KGMMFTDFHSNGSVSSPTRCGLLTGRYQQRAGLERVLLVPRNDKDKEVGLPSEEITFAKV 122

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
           + + GY T  IGKWHLG+ ++ +   F GF    G+  G  DY  H  +     Y  +D 
Sbjct: 123 MGDNGYRTALIGKWHLGYLQKHHPMNF-GFQKFVGFKSGNVDYQSHRNR-----YGDMDW 176

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
              ++V  K +  Y+T L T  + + I E NK KP  LY+AH A H+     P Q P E+
Sbjct: 177 WDGLEV--KDMPGYTTTLLTTLSEDYIKE-NKDKPFCLYIAHAAPHS-----PMQGPHEK 228

Query: 239 VAKFLDISDPERRT-------YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
             +     + ++ +       Y  MV  LD SVG ++  L+K+ + EN+ V+F +DNG  
Sbjct: 229 AIRTEATPEGDKNSDRSNKEIYKDMVEELDWSVGRILETLKKYKLDENTFVVFFSDNGP- 287

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
              + +N GS    +G K +PW+G  R     + P   +      +     D  PT+   
Sbjct: 288 ---VINNGGSAGGYKGAKGSPWEGAHRVPGICYMPGTIKEGTTCEQTVMSFDLFPTMLDM 344

Query: 352 AGIEINDT--SLDGVN 365
           A I  ND+   LDG +
Sbjct: 345 ADIHYNDSKKKLDGTS 360


>gi|381205411|ref|ZP_09912482.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
          Length = 448

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 126/412 (30%), Positives = 203/412 (49%), Gaps = 44/412 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           K ++I+I+ DD G+ D+S  G++   TPN+D LA  G      Y  + +C+PSR+AL+TG
Sbjct: 2   KKNVIVIMTDDQGYGDLSCMGATDFKTPNLDRLANEGARFKCWYSNSPVCSPSRAALLTG 61

Query: 88  KYPIHIGMQHGVILEG--EPWGLPLTEKLLPQYLKEA-GYATHAIGKWHLGFFREVYTPT 144
           +YP + G++   IL+G  +  GLP     LP+ L++  GY T   GKWHLG   E   P 
Sbjct: 62  RYPANAGVR--AILDGHRKATGLPKETLTLPKILRDRLGYHTAMSGKWHLGL-AEGCRPQ 118

Query: 145 FRGFDSHYGYWQGLQDYYDHSCK-ATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
             GF+  +G+  G  D++ H           G++  H++ V+NK +   G Y T++ TE 
Sbjct: 119 DHGFEQWFGFLAGCVDFFSHIFYWGMNREGPGMNPTHDLWVNNKEVWKNGEYLTEVITEY 178

Query: 201 AINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
           +I  +    K SK  FLY+ + A H      P  AP + + +F  +   +R+  A MVS 
Sbjct: 179 SIQALRRAKKDSKSFFLYIPYNAPHY-----PMHAPQKYMDRFGGLP-WDRQVMAAMVSA 232

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA-------------PSFGIHSNKGSNHPLR 306
           +D+ +G +   L +   +E++I  F +DNG              P +G     GS+   +
Sbjct: 233 VDDGIGEIFNELARLDFIEDTITFFTSDNGPSRESRNWLDGTLDPYYG-----GSSGGFK 287

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQ 366
           G K + ++GG+R    +  P +    K+  E F   D LPT+C    ++I+D  LDG++Q
Sbjct: 288 GHKFSLFEGGVRVPGIVHYPGVVPASKIVEEPFCSFDMLPTICEWLHLDISDMELDGISQ 347

Query: 367 WDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA-GTDNNGQSDE 417
              L         ++   +D+         ++R  D K V  G    GQ DE
Sbjct: 348 AGFLEGKQPALERDLFWEMDD-------QTSIRRGDWKLVLNGQIVEGQPDE 392


>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
           rubripes]
          Length = 519

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/363 (33%), Positives = 176/363 (48%), Gaps = 26/363 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG-LILNQHYVQALCTPSRSALMTGK 88
           P+III+L DD+GW D+   G     TPN+DA+A  G L+LN +    LC+PSR+AL+TG+
Sbjct: 27  PNIIIMLMDDMGWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGR 86

Query: 89  YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
            P+  G        ++    +    G+   E LLPQ LK+ GY    +GKWHLG  R  Y
Sbjct: 87  LPVRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLGH-RPQY 145

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
            P   GFD  +G        Y++S +     Y+   M        ++D KT     T +Y
Sbjct: 146 LPLEHGFDEWFGAPNCHFGPYNNSVRPNIPVYRNSWMLGRYYEEFKIDKKTGESNLTQMY 205

Query: 198 TEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
               ++ I ++    KP FLY A  A HA     P  A  + + K       +R  Y   
Sbjct: 206 LLEGLDFIQSQAEAQKPFFLYWAPDATHA-----PVYASKDFLGK------SQRGRYGDA 254

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD SVG +++ LR+ G+  N+ V F +DNGA      +  GSN P    K T ++GG
Sbjct: 255 VMELDYSVGQILSLLRRLGIDSNTFVFFTSDNGAALMSGPNESGSNGPFLCGKETTFEGG 314

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
           MR  A  W P   +   VS +L ++ D   T  A AG+   +D ++DG++   VL     
Sbjct: 315 MREPAIAWWPGHIREGTVSFQLANVMDLFTTSLALAGMSPPDDRTIDGMDLTPVLLHNTV 374

Query: 376 TKR 378
             R
Sbjct: 375 LNR 377


>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 461

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 111/311 (35%), Positives = 163/311 (52%), Gaps = 30/311 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           +KP+I+I+LADD G+ D  F GS+ I TPNID LA  G I    +V A + +PSRS ++T
Sbjct: 29  RKPNILILLADDAGYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +  +   G+  G+P  E+LLP  LK  GY T  IGKWHLG  +    P  +
Sbjct: 89  GRYGQRYGYECNLDKPGD--GIPDDEELLPALLKRYGYRTGCIGKWHLG-SKPFQRPNAK 145

Query: 147 GFDSHYGYWQGLQDY-YDHSCK---ATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           GFD+ YG   G + Y YD          + YQ     +N Q    +   Y TD     A 
Sbjct: 146 GFDTFYGLLAGHRSYFYDPETSDKDGNLQQYQ-----YNGQ--KLSFDGYFTDELASKAR 198

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
             +AE    +P  LY++  A H+     P +A +E++A+F       R+ YA M+  LD 
Sbjct: 199 QFVAE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALDR 248

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVA 321
            VG ++  L+  G  +++I+ F++DNG    G  +N+ SN PL+G K   ++GG R    
Sbjct: 249 GVGKIVDELKAVGKFDDTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPFF 304

Query: 322 AIWSPWLKQTQ 332
            +W    K+ Q
Sbjct: 305 VVWGDRFKRDQ 315


>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 525

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 181/367 (49%), Gaps = 31/367 (8%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           T  +KP+ + I  DD+G++DVS +G ++  TPNIDALA  G+     Y  + + +PSR+ 
Sbjct: 71  TKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISSPSRAG 130

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
            +TG+YP  +G+Q GV       G+   E  + + LK  GYAT  IGKWHLG  RE Y P
Sbjct: 131 FLTGRYPARMGIQ-GVFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SREKYLP 188

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
             +GFD ++G          +S   + + Y   +      +D   +    T  YTE A++
Sbjct: 189 LQQGFDEYFG--------IPYSNDMSAQVYLRGNEVEEFHIDINNV----TKKYTEEAVD 236

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
            I      +P FL+LAH  +H      P    DE   K           Y   V  +D S
Sbjct: 237 YI-RRKADQPFFLFLAHSMMHV-----PIYVSDEFAGK------SGAGIYGDAVLEVDWS 284

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAA 322
           VG ++  LR+ G+ +N++V+F +DNG P        G   PLR  K+T ++GG+R    A
Sbjct: 285 VGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLGGRALPLREGKTTAFEGGVRVPCIA 343

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
            W   +K    V++++  + DW PT+ A +G  + D  LDG +   VL    K    +  
Sbjct: 344 YWKGQIKPV--VNTDVVSLLDWFPTVTALSGGILPDVRLDGYDLTAVLNGTGKRASEDYA 401

Query: 383 HNIDNVD 389
           +  +N D
Sbjct: 402 YFRNNRD 408


>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
 gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
          Length = 596

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 187/356 (52%), Gaps = 45/356 (12%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP++++I+ DD GW D+SF+G++ + TPNIDA+A NG      YVQ +C+P+R+ L+TGK
Sbjct: 36  KPNVVLIMTDDQGWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGK 95

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y   +G+ +     GE +     E  + +  K+AGY T A GKWH G  +  Y P  RGF
Sbjct: 96  YAARLGV-YSTSTGGERFN--SKETTIAEIFKKAGYKTTAYGKWHSG-MQPPYHPNSRGF 151

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEH 208
           D +YG+  G    Y       F P     + HN ++  K  G    DL T   ++ I E 
Sbjct: 152 DDYYGFTSGHWGNY-------FSPM----LEHNGEIV-KGEGFLVDDL-TNKGLDFITE- 197

Query: 209 NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF----LDI--------SDPERRTYAGM 256
           NK+ P FLYL +   H+     P Q P+E   +F    LD+        S+   R    M
Sbjct: 198 NKNNPFFLYLPYNTPHS-----PMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAM 252

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  +D ++G +   L++ G+ EN+I+++++DNG    G   N G    +RG K +  +GG
Sbjct: 253 VENIDFNMGRLTNKLKELGLEENTIIVYLSDNGPN--GWRWNGG----MRGRKGSTDEGG 306

Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
           +R    I W   + + +K+ S++    D LPTL + AG  IN  ++  ++  D+ T
Sbjct: 307 VRSPFFIQWKNTIPKNKKI-SQIAGAIDILPTLTSLAG--INQPTIKSIDGKDLKT 359


>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
          Length = 476

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/343 (36%), Positives = 176/343 (51%), Gaps = 32/343 (9%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+IIII  DD G++DV   G+S I TP++D LA  G+     YV QA+C+ SR+AL+TG 
Sbjct: 30  PNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAVCSASRAALLTGV 89

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           Y   +G+ HG +     +GL   E  +   LK  GYAT   GKWHLG + E + PT +GF
Sbjct: 90  YSNRLGI-HGALDHMSRYGLHPEEATIADILKPLGYATAMFGKWHLGHYPE-FLPTNQGF 147

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS--TDLYTEAAINVIA 206
           D + G       + +H     +  Y  L +  N +V +      S  T L+TE +I+ I 
Sbjct: 148 DEYLGIPYSNDMWPNHPQTKDY--YPPLPLYQNDKVIDTIWNDQSMFTTLFTEKSIDFI- 204

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           E NK +P FLYLAH   H      P    D    KF   S+  R  Y  ++  +D SVG 
Sbjct: 205 ERNKDRPFFLYLAHPMPHV-----PLFVSD----KFKGKSN--RGLYGDVIMEIDWSVGQ 253

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI--- 323
           ++  L+ + + EN+I++F++DNG P      + GS +PLR  K T WDGG++ V  I   
Sbjct: 254 IMETLKHYDLDENTIIIFLSDNG-PWLSYSGHSGSAYPLREGKGTSWDGGVK-VPVIMRW 311

Query: 324 ---WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
              W+  L Q     S      D LPTL    G  + +  +DG
Sbjct: 312 KGTWASGLVQPNPAMS-----IDILPTLAYLTGASLPEKKIDG 349


>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
 gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
          Length = 482

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 131/364 (35%), Positives = 183/364 (50%), Gaps = 42/364 (11%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           KP+II ILADDLG+ D+  +G   I TPN+DA+A  G    Q Y   A+C PSRS+ +TG
Sbjct: 34  KPNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFLTG 93

Query: 88  KYPIH--IGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           ++  H  I    GV  EG+ W +P +   + + LK+AGYAT   GKW LG       P  
Sbjct: 94  QHTGHTPIRGNKGVQPEGQ-WPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGDPIK 152

Query: 146 RGFDSHYGY-WQGL-QDYY-DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAI 202
           +GFD+ YGY  Q L  DYY DH     +E    +D   N          YS DL  + A+
Sbjct: 153 QGFDAFYGYNCQTLAHDYYPDH----LWENDTRIDFAANTPEHPTD---YSADLIHQKAL 205

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPD----EEVAKFLD---ISDPE------ 249
             I +H K  P F++L++   HA       Q PD    E+  K L+   I  P+      
Sbjct: 206 QFIDQH-KGNPFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVPKWNGKNY 259

Query: 250 ------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GS 301
                 R  YA MVSRLD  VG V+  L++ G+ +N++V+F +DNG    G +      S
Sbjct: 260 APQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNGPHKEGGYDPDYFNS 319

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL 361
           N PLRG+K   ++GGMR     W P   +    S  +    D+LPT    A   +   ++
Sbjct: 320 NGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAELAK-ALKPANI 378

Query: 362 DGVN 365
           DG++
Sbjct: 379 DGIS 382


>gi|355563805|gb|EHH20367.1| hypothetical protein EGK_03210 [Macaca mulatta]
          Length = 507

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/384 (32%), Positives = 186/384 (48%), Gaps = 29/384 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL L   YV A LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+R  G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
             K+ R  +     + D  +  +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388


>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 166

 Score =  167 bits (423), Expect = 1e-38,   Method: Composition-based stats.
 Identities = 83/161 (51%), Positives = 110/161 (68%), Gaps = 1/161 (0%)

Query: 213 PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALR 272
           P+FLYL+HLAVHAG  Y P QAP E + KF  I    R  YAG V  LD+SVG V  AL 
Sbjct: 1   PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVFQALE 60

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           K  ML+N++++F++DNGA  +G  SN GSN PLRG K T W+GG+R  A +W P L++ Q
Sbjct: 61  KKDMLQNTMLVFISDNGAIPWGEWSNAGSNWPLRGAKFTLWEGGVRVPAFVWGPLLRKPQ 120

Query: 333 KVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLTK 372
           +VS++L H+SDW+PTL + AG  I D   +DG+N W  L++
Sbjct: 121 RVSNQLMHVSDWVPTLYSLAGGNIADLGVIDGLNMWQALSR 161


>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 470

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 119/394 (30%), Positives = 195/394 (49%), Gaps = 28/394 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           +KP+I++++ADD G++D  F GS  + TP ID LA   ++  Q YV  A+C PSR+ L T
Sbjct: 25  EKPNILLVIADDAGYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYT 84

Query: 87  GKYPIHIGMQHGVI--------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           GKY    G +   +          G+  GLP T+  + ++LKE GY T   GKWH G   
Sbjct: 85  GKYQQRFGFEENNVPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NH 143

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYT 198
           + Y PT RGFD+ YG+ +G + Y+ +S +   + Y    M  + +   +  G  +  L T
Sbjct: 144 DDYHPTKRGFDNFYGFREGARGYFAYSNEEQ-QAYPSQKMERDFKHYIEHEGYLTDALAT 202

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVS 258
           + +  +       +P F  L+  AVHA     P QA + ++A+F  ++  +R+  A M  
Sbjct: 203 QTSHFIGQSVVNKQPFFAVLSFSAVHA-----PMQATNADLAQFPHLTG-KRKILAAMNF 256

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            +D ++  V+  L    + +N++V+F+ DNG P+     N  +N+PL G K+   +GG+R
Sbjct: 257 AMDRAIAVVLDKLNALKISDNTLVVFINDNGGPT---DQNYANNYPLSGTKANHLEGGIR 313

Query: 319 GVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIND-TSLDGVNQWDVLTKGAKT 376
               + W   L       +      D LPT   AA  ++ +   LDGV+    L   +  
Sbjct: 314 VPMLLKWPKRLSSNNTRYTYPVSSLDLLPTFIHAANGKVTEQMQLDGVDLMPFLLSKSTQ 373

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTD 410
           +  + L+        ++  AA+R  DLK +   D
Sbjct: 374 RPHQTLY------WKKESRAAIRDKDLKLLRFPD 401


>gi|343086188|ref|YP_004775483.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354722|gb|AEL27252.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 491

 Score =  167 bits (423), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 121/351 (34%), Positives = 179/351 (50%), Gaps = 29/351 (8%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALM 85
           P  P+I+II  DD G+ DV   G+  I TPN+D +A NG+   N H   A+C+ SR++++
Sbjct: 40  PSPPNIVIIFTDDQGYEDVGVFGAKDIRTPNLDRMANNGVQFTNFHVANAVCSASRASIL 99

Query: 86  TGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
           TG Y   +G+  G + +    GL   E  + + LK  GYAT  +GKWHLG   E + PT 
Sbjct: 100 TGCYSNRLGI-FGALSDRSKHGLSPEETTIAEMLKPKGYATGIVGKWHLGHL-EPFLPTK 157

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAI 202
           +GFD  +G  Y   +  Y+        + Y  L +  N   +D  T     T  YTE A+
Sbjct: 158 QGFDEFFGLPYSNDMWPYHPERP----DDYPPLPLYQNETIIDTLTEQSMLTTWYTEKAV 213

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
           + I + N  +P FLYLAH   H      P    D+   K       E   Y  ++  +D 
Sbjct: 214 DFI-DRNSHQPFFLYLAHNMPHV-----PLFVSDKFKGK------SEAGIYGDVIMEIDW 261

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA 321
           SVG V+ AL+K+G+ EN++V+F +DNG   S+G H+  GS    +  K T W+GG+R V 
Sbjct: 262 SVGQVMEALKKNGLEENTLVIFTSDNGPWLSYGTHA--GSTGIFKEGKGTSWEGGIR-VP 318

Query: 322 AIWSPWLKQTQK--VSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
           AI   W  +  K  V  ++    D LPT+    G  +   ++DG N W ++
Sbjct: 319 AIMQ-WKGKIPKGEVQEKMAMTIDLLPTIAHITGAPLPQKTIDGKNIWPLI 368


>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 454

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 133/416 (31%), Positives = 211/416 (50%), Gaps = 36/416 (8%)

Query: 9   FALTCTLLFNDAFLNTTAP--KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
             LTC +L        T    +KP+I+II  DD G++DVS++G+  I TP+IDA+  +G+
Sbjct: 9   LGLTCMMLGLGIVHAQTIKNNEKPNILIIFTDDQGYHDVSYYGTEDIQTPHIDAIRNDGM 68

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEP---WGLPLTEK-LLPQYLKE 121
             +  Y  + +C P+R++LM+G+YP H+G+    ++   P   WG    E  LLP  LK+
Sbjct: 69  QFDYFYTNSPVCAPTRASLMSGRYPDHVGVPG--LVRSNPANNWGYLNPETILLPAMLKK 126

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHN 181
            GY T  IGKW+LG       P   GFD  +G+ + + D Y      T   +    MR N
Sbjct: 127 GGYHTAHIGKWNLGLMHP-NLPNDHGFDYFHGWLEDMMDDY-----LTHRRHGKNYMRLN 180

Query: 182 MQ-VDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEV 239
            + +D K    ++TDL+T+ ++N I E  K  +P FLYLA+ A H      P Q P   V
Sbjct: 181 EEIIDPKG---HATDLFTQWSVNYIKEREKEERPFFLYLAYNAPHF-----PVQPPKSWV 232

Query: 240 AKF--LDISDPERRT-YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
            K    +   PE+R      +  LD+ +G VI AL++ G  EN++++F +DNG    G  
Sbjct: 233 EKVNKREPGLPEKRANLIAFIEHLDDGIGQVINALKETGQYENTLIIFTSDNG----GHL 288

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
            +  +N  LR  K + ++GG+R   AI  P   +    S ++    D  PTL   AG+++
Sbjct: 289 PDLANNGLLRDGKQSMYEGGLRVPTAISWPAKIKAGSTSDQVNLSMDIFPTLTEIAGVDL 348

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQ---KYYAALRVDDLKYVAGT 409
            D +++G +    L      + +  L+ +      +   K Y ALR+ D K +  +
Sbjct: 349 -DHAINGRSFLPTLLGQNMKEEARPLYFVRREGGTRYGGKAYHALRLGDWKLLQNS 403


>gi|223936836|ref|ZP_03628745.1| sulfatase [bacterium Ellin514]
 gi|223894405|gb|EEF60857.1| sulfatase [bacterium Ellin514]
          Length = 477

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 135/392 (34%), Positives = 185/392 (47%), Gaps = 52/392 (13%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
           L   LL     L+T A  KP+I+ ILADDLG+ DV+ +GS    TPNID LA +G+   +
Sbjct: 4   LLSLLLMAVFCLSTKAANKPNIVFILADDLGYTDVACYGSKYYETPNIDKLAKDGIKFTD 63

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQ 117
            H     C P+R++LM+G+Y    G+     ++   W             LPL +  L Q
Sbjct: 64  GHTCGPNCQPTRASLMSGQYGPRTGVYTVGSIDRFAWQTRSLHPVENVTKLPLDKITLAQ 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
            LK+AGYAT   GKWHLG  +E + P  RGFD      + L     H            D
Sbjct: 124 SLKKAGYATGMFGKWHLGEDKE-HHPAQRGFD------EALVSMGVH-----------FD 165

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
              N +VD      Y  D  T+ A++ I  H K +P FLYL H AVH     +P QA  E
Sbjct: 166 FVTNPKVDYPK-DEYLADFLTDKALDFIKRH-KDEPFFLYLPHYAVH-----KPLQAKKE 218

Query: 238 EVAKF---LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
            + KF     +      TYA M++ +DESVG V+A L +  + +N++V+F +DNG    G
Sbjct: 219 LIQKFSAKQGVDGHHNPTYAAMIASVDESVGRVVALLDELKLSDNTLVIFSSDNGG--VG 276

Query: 295 IHSNKG--------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
            +  +G         N+PLRG K   ++GG R       P      KV  +     D  P
Sbjct: 277 GYQREGIKKAGDVTDNNPLRGGKGMLYEGGHRVPYIFRWPGKIPAGKVCDQPIISIDLYP 336

Query: 347 TLCAAAGIEINDT-SLDGVNQWDVLTKGAKTK 377
           TL   AG +  +   LDG +   VL  G   K
Sbjct: 337 TLLELAGAKAPEKYPLDGTSYLKVLKSGGMKK 368


>gi|355785121|gb|EHH65972.1| hypothetical protein EGM_02852 [Macaca fascicularis]
          Length = 507

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 125/384 (32%), Positives = 184/384 (47%), Gaps = 29/384 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL L   YV A LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPAPLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+  GM  GV++     GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F    +  AK        R  +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHPTH----YPQFSG--QNFAK-----RSGRGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+R  G+LE ++V+F ADNG P     S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNG-PETMRMSRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
             K+ R  +     + D  +  +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388


>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
          Length = 467

 Score =  167 bits (422), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 124/378 (32%), Positives = 193/378 (51%), Gaps = 38/378 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+I++I+ADDLG  ++  +G ++I TP+ID LA  G    + Y  + +C PSR  LMTG
Sbjct: 25  KPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCAPSRCVLMTG 84

Query: 88  KYPIHIGMQHGVILEGEPWG-LPLTEK--LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           K+  H  +++ V  E +P G  P+  +   +   LK  GYAT A+GKW LG F    +P 
Sbjct: 85  KHSGHATVRNNV--EAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLGMFDTAGSPL 142

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
             GFD  +GY    +  + H     +   + ++++ N   D KT   ++ DL+ E A+  
Sbjct: 143 KHGFDLFFGY-NCQRHAHSHYPTYIYRNDKRVELKGN---DGKTGKQFTQDLFEEEALGF 198

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI--SDPE------------- 249
           I E NK+KP FLYL     H        Q P++ + ++      DP              
Sbjct: 199 I-EANKAKPFFLYLPFTVPHVA-----VQVPEDSLNEYKGQLGDDPAYDGKKGYQPHPAP 252

Query: 250 RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI----HSNKGSNHPL 305
              YA MV+R+D SVG V+  L   G+ +N++VLF +DNG P+  +     S   S   L
Sbjct: 253 HAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNG-PTHNVGGADSSFFNSAGKL 311

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           RG+K + ++GG+R     + P   +    S    +  D LPTLCA AG +   +++DG++
Sbjct: 312 RGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTLCAFAGTKA-PSAIDGIS 370

Query: 366 QWDVLTKGAKTKRSEILH 383
              +L KG K    + L+
Sbjct: 371 FLPLL-KGEKQPTHDFLY 387


>gi|109094666|ref|XP_001113032.1| PREDICTED: arylsulfatase A isoform 2 [Macaca mulatta]
 gi|109094668|ref|XP_001113064.1| PREDICTED: arylsulfatase A isoform 3 [Macaca mulatta]
 gi|109094670|ref|XP_001113092.1| PREDICTED: arylsulfatase A isoform 4 [Macaca mulatta]
          Length = 507

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 125/384 (32%), Positives = 184/384 (47%), Gaps = 29/384 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL L   YV A LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPASLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+  GM  GV++     GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRTGMYPGVLVPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QNFAERSG--RGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+R  G+LE ++V+F ADNG P     S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNG-PETMRMSRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYA 396
             K+ R  +     + D  +  +A
Sbjct: 365 TGKSPRQSLFFYPSDPDEVRGVFA 388


>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
          Length = 457

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 186/348 (53%), Gaps = 32/348 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           +KP+I++I+ DDLG  D+S   +  + TPNID L  +G+ LN  Y   ++ +PSR+ L+T
Sbjct: 35  QKPNILLIMVDDLGLGDLSCQYARDLQTPNIDKLFSDGVRLNNFYANSSVSSPSRAGLLT 94

Query: 87  GKYPIHIGMQHGVILE-GEPWGLPLTEK-LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G YP  +G+   +  E    WG    E  LLPQ +K+A Y T  IGKWHLG   E   P 
Sbjct: 95  GCYPDMVGVPGVIRTETTASWGYLSPEAILLPQMMKKAEYNTAIIGKWHLG-LEEPNLPN 153

Query: 145 FRGFDSHYGYWQGLQD-YYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGFD  +G+   + D YY H  +          MR N +V   T   ++TDL++  A++
Sbjct: 154 SRGFDFFHGFLGDMMDNYYTHLRRGHNY------MRLNEEVIEPT--GHATDLFSSWALD 205

Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD----ISDPERRTYAGMVS 258
                 KSK P FLYLA+ A H      P Q P E + +  +    IS+ +R     ++ 
Sbjct: 206 YFDTMRKSKVPFFLYLAYNAPHV-----PLQPPTEWLERVREREPGISE-KRAKLVALIE 259

Query: 259 RLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
            LD ++G V  AL + G L+N++++F +DNG  + G  +N G   P+RG K   ++GG++
Sbjct: 260 HLDYNIGRVYKALEESGQLDNTLIIFCSDNGGDA-GAEANNG---PVRGSKGDMYEGGIK 315

Query: 319 GVAAIWSPWLKQTQKVS-SELFHISDWLPTLCAAAGIEINDTSLDGVN 365
              A++  W    +  S  +L  +SD  PTLC    I +N   +DG++
Sbjct: 316 VPCAVY--WRNHLKPRSIDDLVIMSDIFPTLCDLVSIPVNH-RIDGIS 360


>gi|440718008|ref|ZP_20898478.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436436890|gb|ELP30586.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 479

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 132/425 (31%), Positives = 203/425 (47%), Gaps = 44/425 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +   A+T   LF       ++  +P++++ILADDLG+ D+S +G+  I TPNID +A  G
Sbjct: 2   RTIVAITFVCLFVAQVCAESSEGRPNVVVILADDLGYGDLSCYGAEDISTPNIDRMATEG 61

Query: 66  LILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
              +  YV ++C+P+R++LMTG +   +G+  GV+      GL   E  LP+ LK  GYA
Sbjct: 62  AKFSSFYVSSVCSPTRASLMTGSHSTRVGI-GGVLFPRNNHGLNSDEITLPELLKNQGYA 120

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYG--------YWQGLQDYYDHSCKATFEPYQGLD 177
           T  IGKWHLG   +++ P   GFD  YG        Y+  ++ Y            +G+ 
Sbjct: 121 TAIIGKWHLG-NEDMFQPLNHGFDYWYGTPSSNSQFYYPTIKKYAADCVFREGFTREGIL 179

Query: 178 MRHNMQV----DNKTIGIYS-----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
            R         DN  I + +     T  YT   I  I + NK K  F+YL H   H    
Sbjct: 180 KRETAACPLIEDNIVIEVPADQTQFTQRYTRETIRFITQ-NKDKSFFVYLGHNMPHI--- 235

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
             P  A +    KF+  S  +R  Y   +  LD S G ++ AL++ G+ +N++V+F +DN
Sbjct: 236 --PLHASE----KFVGSS--KRGIYGDTIQELDWSTGEILRALKELGLDQNTLVIFTSDN 287

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPT 347
           G P+ G     GS   L+G K +  +GG+R    A W   +    +    +  + D LPT
Sbjct: 288 G-PNTG---KGGSAGVLKGGKGSTLEGGVRVPFVARWPGTIPAGTEFDEAITGM-DLLPT 342

Query: 348 LCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRS-EILHNIDNVDNPQKYYAALRVDDLKY 405
           L   AG E+ +D  +DG +   +L      K   E ++ +       +    +RV D KY
Sbjct: 343 LTKLAGGEVPDDRVIDGKDIGSLLAGKPDAKSPHEAIYYLRG-----RGVDGIRVGDWKY 397

Query: 406 VAGTD 410
              TD
Sbjct: 398 RMATD 402


>gi|87309449|ref|ZP_01091584.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
 gi|87287757|gb|EAQ79656.1| arylsulfatase A (precursor) [Blastopirellula marina DSM 3645]
          Length = 478

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/374 (32%), Positives = 179/374 (47%), Gaps = 28/374 (7%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           +LT +L F    +       P+I++I  DD+G+ D+   G+   PTP++D LA  G I  
Sbjct: 5   SLTLSLGFATVTVAAEKNATPNIVVIFIDDMGYADIGPFGAKDYPTPHLDQLAQEGTICT 64

Query: 70  QHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
             YV QA+C+ SR+ L+TG Y   IG+  G +      G+   E  L +  K+ GYAT  
Sbjct: 65  DFYVTQAVCSASRAGLLTGCYNNRIGIL-GALGPQSKIGISAEETTLAEICKQKGYATAC 123

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEP----YQGLDMRHNM 182
            GKWHLG   E + P   GFD + G  Y   +  Y+      T +     Y  L +    
Sbjct: 124 YGKWHLGHHEE-FLPLQHGFDDYVGLPYSNDMWPYHPELRHLTKDQQQKRYPDLPLYEKN 182

Query: 183 QVDNKTIGIYS----TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEE 238
           ++ +  +        T LYTE A+  I + N ++P FLY+ H  VH      P    D+ 
Sbjct: 183 EIIDTEVTPEDQRNLTTLYTEKAVKFI-DDNHAQPFFLYVPHSMVHV-----PLYVSDKF 236

Query: 239 VAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHS 297
             K           +  +V  +D SVG +I ALR++ + EN++V+F ADNG   S+G H+
Sbjct: 237 AGK------SGAGLFGDVVMEVDWSVGQIIEALRRNMLDENTLVIFTADNGPWLSYGDHA 290

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN 357
             GS  PLR  K T +DGG R     W P      K  S      D LPT+    G ++ 
Sbjct: 291 --GSAGPLREGKGTMFDGGCREPTIFWQPGTIPAGKTCSTPMMTIDILPTVADLIGAKLP 348

Query: 358 DTSLDGVNQWDVLT 371
           D  +DG N W ++T
Sbjct: 349 DHKIDGKNIWPIVT 362


>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
          Length = 520

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 120/380 (31%), Positives = 179/380 (47%), Gaps = 28/380 (7%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
           +L    F     PK P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y  
Sbjct: 11  VLSAAGFGAMGTPKPPNILLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSA 70

Query: 74  QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
             LC+PSR+AL+TG+ PI  G        ++    +    G+P  E LLP+ LK AGYA+
Sbjct: 71  NPLCSPSRAALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYAS 130

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
             +GKWHLG  R  + P   GFD  +G        YD+  +     Y+  +M        
Sbjct: 131 KIVGKWHLG-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEF 189

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
            ++ +T     T +Y + A++ +     + +P FLY A    H         AP      
Sbjct: 190 PINLQTGEANLTQVYLQEALDFMKRQQVAQRPFFLYWAIDGTH---------APVYASRP 240

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--SFGIHSNK 299
           FL  S   R  Y   V  +D+SVG +++ LR   + EN+ V F +DNGA   S       
Sbjct: 241 FLGTS--RRGRYGDAVREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGQC 298

Query: 300 GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-IND 358
           GSN P    K T ++GGMR  A  W P      +VS +L  + D   T  A AG+   +D
Sbjct: 299 GSNGPFLCGKQTTFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSD 358

Query: 359 TSLDGVNQWDVLTKGAKTKR 378
             +DG++    + +G  T R
Sbjct: 359 RVIDGLDLLPAMLRGQLTDR 378


>gi|300773187|ref|ZP_07083056.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300759358|gb|EFK56185.1| N-acetylgalactosamine-6-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 443

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/361 (31%), Positives = 174/361 (48%), Gaps = 25/361 (6%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +Y   L   LL   A  N++A  +P+ III  DD+G+ DV  +G+  I TPN+D +A  G
Sbjct: 5   RYLITLFICLL---AVFNSSAQTQPNFIIIYVDDMGYGDVGINGNPNIETPNLDRMAMEG 61

Query: 66  LILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           +  + +Y     CT SR AL+TGKYP   G +  V+   +  G+   E  + + LKE GY
Sbjct: 62  MRFSNYYSASPACTASRYALLTGKYPSRAGFRW-VLNPTDQIGIHQQESTIAERLKEKGY 120

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV 184
            T   GKWHLG  R+ + P   GFD + G       Y +      +     L     +++
Sbjct: 121 RTAIYGKWHLGSTRKEFLPLANGFDEYVGL-----PYSNDMIPPKYPDIALLSGYDTLEL 175

Query: 185 DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD 244
           +     +  T LYTE AI  I + N  +P F+YL +   H      P  A ++ + K   
Sbjct: 176 NPDQSKL--TRLYTEKAIAFITK-NAKQPFFIYLPYAMPHT-----PLHASEDFLGK--- 224

Query: 245 ISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHP 304
               +R  Y  +V  LD  +G ++  L+++ + + + V+F +DNG P    + N GS   
Sbjct: 225 ---SKRGLYGDVVQELDHHIGRLLTFLKENKLDQQTYVVFTSDNG-PWLIQNQNGGSAGL 280

Query: 305 LRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
            R  K + W+GGMR    +W         V +E+F   D LPT+ A AGI      +DG 
Sbjct: 281 FRDGKGSTWEGGMREPFFLWGHHTIPKGYVENEVFTALDMLPTITALAGISAGPNKIDGT 340

Query: 365 N 365
           N
Sbjct: 341 N 341


>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
           musculus]
 gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
          Length = 440

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 107/313 (34%), Positives = 157/313 (50%), Gaps = 25/313 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 25  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 84

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 85  TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 143

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 144 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 203

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 204 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 252

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 253 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 312

Query: 314 DGGMRGVAAIWSP 326
           +GGMR  A  W P
Sbjct: 313 EGGMREPAIAWWP 325


>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 443

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 116/342 (33%), Positives = 176/342 (51%), Gaps = 35/342 (10%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A  +P++++++ DDLG +D+ + G++ + TP+IDALA  GL     Y  A +C P+RSA+
Sbjct: 23  AADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNWYSNAPVCAPARSAI 82

Query: 85  MTGKYPIHIGM-QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           +TG++P   G+  +G  L     G+P     L   LK +GY T   GKWHLG   E   P
Sbjct: 83  LTGRFPASAGVPDNGPALA---HGIPT----LASVLKGSGYQTGCFGKWHLGSTDET-AP 134

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
           T  GFDS YG+  G  DYY H        Y G D  H++  +   I   G Y T+   + 
Sbjct: 135 TGHGFDSFYGFHSGCVDYYSHRF------YWG-DNYHDLWHNRTEIFEDGRYLTERIADE 187

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A   I    +++P   Y+A  A H      P  AP +  A+F +++ PER+TYA M++ +
Sbjct: 188 AAGFIG---RNRPFLGYVAFNAPH-----YPMHAPAQYKARFPNLA-PERQTYAAMIAAV 238

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPS------FGIHSNKGSNHPLRGMKSTPWD 314
           D+ +G +  AL   G  EN+++ F+ DNGA +       G  +  G N   +G K + +D
Sbjct: 239 DDGIGQIQRALETTGAAENTLMFFIGDNGATTEKRAGLNGDFATAGDNGVFKGYKFSLFD 298

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
           GGM     +  P   +    + EL    D LPT+C A G  +
Sbjct: 299 GGMHVPGFVSWPAGIRKGGWTDELAMSMDILPTICRATGAPL 340


>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
          Length = 513

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 117/371 (31%), Positives = 178/371 (47%), Gaps = 28/371 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
           L   AP  P+++++L DD+GW D+   G     TPN+D +A  G++    Y    LC+PS
Sbjct: 15  LGCAAP--PNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPS 72

Query: 81  RSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
           R+AL+TG+ P+  G        ++    +    G+  +E LLP+ LK+AGY    IGKWH
Sbjct: 73  RAALLTGRLPVRNGFYTTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWH 132

Query: 134 LGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTI 189
           LG  R  + P   GFD  +G        YD+        Y+  +M      + ++D +T 
Sbjct: 133 LGH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTG 191

Query: 190 GIYSTDLYTEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDP 248
               T +Y + A++ I++   S+ P FLY A  A HA         P      FL  S  
Sbjct: 192 EANLTQIYLQEALDFISKQQASQQPFFLYWAIDATHA---------PVYASKHFLGTS-- 240

Query: 249 ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGM 308
           +R  Y   V  +D+SVG ++  L+K G+ EN+ V F +DNGA         GSN P    
Sbjct: 241 QRGRYGDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALISAPKQGGSNGPFLCG 300

Query: 309 KSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQW 367
           K T ++GGMR  A  W P       VS +L  + D   T  +  G++  +D  +DG++  
Sbjct: 301 KQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLSLVGLQPPSDRQIDGIDLL 360

Query: 368 DVLTKGAKTKR 378
             + +G    R
Sbjct: 361 PAILQGKLIDR 371


>gi|149195951|ref|ZP_01873007.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140798|gb|EDM29195.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 464

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 119/363 (32%), Positives = 180/363 (49%), Gaps = 46/363 (12%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+++I   DD G  DV+ +GS  + TPN+D LA +G+   Q Y   +C P+R+ LMTG+
Sbjct: 31  KPNLVIFFTDDQGTLDVNCYGSKDLYTPNMDKLAEDGVRFTQAYAHQVCCPARAMLMTGR 90

Query: 89  YPIHIGMQHGVILEGEPWG-----LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           +P    + H    +G+  G     + L E  L + LK++GY T   GKWHLG   + Y P
Sbjct: 91  HPQRSNVNHWT--QGDAKGPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLGAHLD-YGP 147

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEA 200
           T +GFD  YG   G  D Y+H     F   +G    H++    K +   G Y  +L T+ 
Sbjct: 148 TKQGFDEFYGIRGGFIDNYNH----YFLHGEGF---HDLYEGTKEVFDEGKYFPNLVTDR 200

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           A+N I + NK+ P FL+LA    H      P QA  +   ++ ++  P R++YA M+S  
Sbjct: 201 ALNFI-DRNKNNPFFLFLAFNIPHY-----PEQADPKFDERYKNMKMP-RQSYAKMISTT 253

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH----------------- 303
           D+ +G +++ L++HG+ +N+I++FM+DNG      H  K  NH                 
Sbjct: 254 DDHMGQIMSKLQEHGIYDNTIIIFMSDNGHSRERNHI-KFDNHKSGLAKNTKYGALGGGG 312

Query: 304 ---PLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS 360
                RG KS  ++GG+R  A I  P       V  +     DW+PT+     IE     
Sbjct: 313 NTGKWRGNKSNFYEGGIRVPAIITFPNKLPKGAVRDQAITAMDWMPTVLELCNIEPPKIK 372

Query: 361 LDG 363
            DG
Sbjct: 373 FDG 375


>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
           musculus]
          Length = 462

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 107/313 (34%), Positives = 157/313 (50%), Gaps = 25/313 (7%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALM 85
           P+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+
Sbjct: 47  PQPPNIVLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALL 106

Query: 86  TGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           TG+ PI  G        ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R
Sbjct: 107 TGRLPIRNGFYTTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-R 165

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYST 194
             + P   GFD  +G        YD+  K     Y+  +M         ++ KT     T
Sbjct: 166 PQFHPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGRFYEEFPINRKTGEANLT 225

Query: 195 DLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
            LYT+ A++ I  +H +  P FLY A  A HA         P     +FL  S   R  Y
Sbjct: 226 QLYTQEALDFIQTQHARQSPFFLYWAIDATHA---------PVYASRQFLGTS--LRGRY 274

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              V  +D+SVG +++ L+  G+ +N+ V F +DNGA      +  GSN P    K T +
Sbjct: 275 GDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALISAPNEGGSNGPFLCGKQTTF 334

Query: 314 DGGMRGVAAIWSP 326
           +GGMR  A  W P
Sbjct: 335 EGGMREPAIAWWP 347


>gi|404486643|ref|ZP_11021833.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
           YIT 11860]
 gi|404336461|gb|EJZ62922.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
           YIT 11860]
          Length = 478

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 130/420 (30%), Positives = 194/420 (46%), Gaps = 62/420 (14%)

Query: 5   RKYFFALTC-TLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           +K F ++    +L + +    +  +K ++I ILADDLG+ D+  +G   I TPNID LA 
Sbjct: 2   KKVFLSVALLGVLISSSCTTKSKKQKMNVIYILADDLGYGDLGCYGQKIIKTPNIDRLAE 61

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQ----HGVILEGEPWGLPLTEKLLPQY 118
            G++  QHY    +  PSRS LMTG++  H  ++    HG   E EP  LP     L + 
Sbjct: 62  EGMLFTQHYSGCTVSAPSRSCLMTGQHTGHTYIRGNQSHGTEGE-EP--LPGNTYTLARM 118

Query: 119 LKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM 178
           +K+AGYAT A GKW LG+      PT  GFD  +GY          +C+     Y    +
Sbjct: 119 MKDAGYATGAFGKWGLGYPCSEGDPTNLGFDEFFGY----------NCQRQAHHYYPYHL 168

Query: 179 RHNMQV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA------- 225
            HN +       +      Y+ DL  E A+  I + N+SKP FLYL ++  HA       
Sbjct: 169 WHNQEKVMLPGNEGSKTETYAQDLIQEKALQFIVD-NQSKPFFLYLPYILPHAELVSPED 227

Query: 226 ------------GNTYEP---FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAA 270
                       G  YE     + P  +   +    +P    +A MV+R D  VG ++  
Sbjct: 228 SILAMYKGKIEEGKPYEGVDDIKNPYYKYGGYCSSENP-HADFASMVTRFDAYVGEIMQT 286

Query: 271 LRKHGMLENSIVLFMADNGAPSFGIHSNKGSN-------HPLRGMKSTPWDGGMRGVAAI 323
           L++ G+ +N+IV F +DNG      H   G++        P RG+K   ++GG+R     
Sbjct: 287 LKRLGLDKNTIVFFASDNGP-----HQEGGADPDFFNSYGPFRGIKRDMYEGGIRVPLIA 341

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           W+P   +    S  +    D +PTL   AG EI     DG++    L  G   K+ E L+
Sbjct: 342 WAPGKVEAGTCSDHISAFWDVMPTLADIAGGEIKG-ECDGISFAPTLFGGKGQKQHEYLY 400


>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
 gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
          Length = 452

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 109/350 (31%), Positives = 181/350 (51%), Gaps = 22/350 (6%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K+P+ I+I  DDLG+ D+  +GS  + TP++D LA  G+     Y  + +C+PSR++L+T
Sbjct: 15  KQPNFIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLT 74

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           GKYP   G+   +  +    GLP  E  L + LK AGY T   GKWHLG   E  +P   
Sbjct: 75  GKYPARAGVGEILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGLSEET-SPNAH 133

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLYTEAAIN 203
           GFD  +G+  G  D+Y H     +    G++  H++  +   +   G Y T+L TE +++
Sbjct: 134 GFDEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWENGRYMTELITERSVD 191

Query: 204 VIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
            I    + + P FL+ ++ A H      P  AP + + +F  +   +R+  A M++ +D+
Sbjct: 192 FIQRSREQEAPFFLFASYNAPHY-----PMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDD 245

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH--------SNKGSNHPLRGMKSTPWD 314
            VG ++ AL++ G  E++++ F +DNG  S   +           GS    RG K++ ++
Sbjct: 246 GVGKIVKALKEAGCYEDTVIFFSSDNGPSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 305

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGV 364
           GG+R  A +  P   +  +V  E+  + D  PT    AG++     L GV
Sbjct: 306 GGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAAGPLQGV 355


>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 525

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 120/360 (33%), Positives = 170/360 (47%), Gaps = 26/360 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+III+L DD+GW D+   G     TPN+DA+A  G++L   Y    LC+PSR+AL+TG+
Sbjct: 31  PNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALLTGR 90

Query: 89  YPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
            PI  G        ++    +    G+   E LLPQ LK  GY +  +GKWHLG  R  Y
Sbjct: 91  LPIRNGFYTTNAHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLGH-RPQY 149

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLY 197
            P   GFD  +G        Y+   K     Y   +M      + ++D  T     T +Y
Sbjct: 150 LPLKNGFDEWFGSPNCHFGPYNDQSKPNIPVYNNSEMLGRFYEDFKIDRNTGESNLTQIY 209

Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
               ++ I    K++ P FLY A  A HA         P     +FL  S  +R  Y   
Sbjct: 210 LMEGLDFILRQTKAQQPFFLYWAVDATHA---------PVYASKRFLGKS--QRGRYGDA 258

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD S+G ++  LR  G+  N+ V F +DNGA      +  GSN P    K T ++GG
Sbjct: 259 VMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGAAVMSGPNESGSNGPFLCGKETTFEGG 318

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAK 375
           MR  A  W P   +   V+ +L ++ D   T    AGI   +D  LDG++   VL   ++
Sbjct: 319 MREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLVLAGINPPDDRILDGLDLTPVLLNSSQ 378


>gi|423239862|ref|ZP_17220977.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
           CL03T12C01]
 gi|392644851|gb|EIY38585.1| hypothetical protein HMPREF1065_01600 [Bacteroides dorei
           CL03T12C01]
          Length = 464

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 131/402 (32%), Positives = 187/402 (46%), Gaps = 46/402 (11%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           TC +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TCKILFSAALLSSGSMMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
               + +D K    YS DL  E A+N I E N  KP FLY +    HA           Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232

Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           E      PF    +            R  YA MV+ LD+SVG +I  L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288

Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           F +DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +   
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D+LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389


>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
 gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
          Length = 478

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 172/349 (49%), Gaps = 30/349 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           K P+II ILADDLG+ DVS  G+    TP+ID +A+ G+ L + Y   A+C+ +R+AL+T
Sbjct: 43  KAPNIIFILADDLGYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALIT 102

Query: 87  GKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G+Y   I  G++  +       GLP     LP+ L+ AGY T  +GKWHLG   E + P 
Sbjct: 103 GRYQDRIACGLEEPIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLGNPPE-FGPL 161

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
             G+D  YG   G  DY+ H  K    P +    R +  + +     Y T L  + A+ +
Sbjct: 162 KSGYDHFYGISGGAADYFTHQGK----PGEDQFYRDDHLIHDHG---YLTGLLGDEAVRL 214

Query: 205 IAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEV------AKFLDISDPERRTYAGMV 257
           +    +++ P FL L     H    + P++ P++E        K  D     R+TY  M+
Sbjct: 215 VETFASETSPFFLSL-----HFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMI 269

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             +D+ +G V+A L  + + EN+IV+F +DNG   F          P  GMK    +GG+
Sbjct: 270 KAMDDQIGRVLATLDSYHLTENTIVVFTSDNGGERF------SDTWPFTGMKGELLEGGL 323

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
           R  A +  P        S +     DWLP+  AA G   + T   DG++
Sbjct: 324 RVPAVLRWPARVAAGSESDQTLITMDWLPSFLAATGATPDATFPSDGIS 372


>gi|291238687|ref|XP_002739253.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 512

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 117/395 (29%), Positives = 193/395 (48%), Gaps = 37/395 (9%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +F+ +    +F   F++     +P+I+I+ ADD+G+ D+  +G     TP++D LA NGL
Sbjct: 2   FFWRIQLFAVFVQYFIDKCKTAEPNIVILFADDVGYGDLGCYGHPTTSTPHLDKLAANGL 61

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
           +L Q Y  + +C+PSR++L+TG+ P   G+  GV    +  GLPL E  + + LK   Y 
Sbjct: 62  LLTQFYAGSPVCSPSRASLLTGRLPPRTGVYPGVFQANDSGGLPLNETTIAELLKTVNYQ 121

Query: 126 THAIGKWHLGFFR-EVYTPTFRGFDSHYG-------------YWQGLQDYYDHSCKATFE 171
           T  IGKWHLG    + Y PT  GFD ++G             +W         SC  + +
Sbjct: 122 TGMIGKWHLGVGENQKYLPTNFGFDYYFGIPYTHAQCPCVTCFWP------RDSCYGSCK 175

Query: 172 PY-QGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNT 228
           PY  G  + +N ++ ++ + + +  + Y  AA + IA++  + KP FLY A    H    
Sbjct: 176 PYFPGCPLFYNDEIVDQPVDLLTLNEQYANAARDFIAKNAQEKKPFFLYYAFQHCH---- 231

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
                 P     KF + +   R  +   ++ +D SVG VI  L++  + +N++V F +DN
Sbjct: 232 -----HPQFAGKKFRNST--SRGVFGDSLAEIDWSVGQVIKELKQQDIDKNTLVFFSSDN 284

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
           G PS    +  G    L+  K T ++GG R     + P + +  + S EL    D LPT+
Sbjct: 285 G-PSIQWKTRGGVAGLLKCGKGTTYEGGQRVPGIAYWPGMIKPGR-SMELASTLDLLPTI 342

Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
                  + + +LDGV+   +L    K  R    +
Sbjct: 343 AKMVNGILPNVTLDGVDMGPILFTQDKGLRDTFFY 377


>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
           [Ciona intestinalis]
          Length = 513

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 124/392 (31%), Positives = 174/392 (44%), Gaps = 27/392 (6%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L+C LL      +     KP+II +L DD+GW D+  +G     TPN+D +A  G +   
Sbjct: 2   LSCFLLLLMCGYSLAESSKPNIIFMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTD 61

Query: 71  HY-VQALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEA 122
            Y    LC+PSR+AL+TG+ PI  G         +G   +    G+P  E L+ + L  A
Sbjct: 62  FYSANPLCSPSRAALLTGRLPIRNGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSA 121

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM---- 178
           GY    IGKWHLG  +E Y P   GF   +G        YD         Y   +M    
Sbjct: 122 GYTNKLIGKWHLGQ-QEQYLPLKHGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRY 180

Query: 179 RHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDE 237
                +++       T  Y + A++ I    ++ KP FLY A  A H+         P  
Sbjct: 181 YEEFAIESHKYLSNMTQYYIQEALDFIERMERNEKPFFLYWAPDATHS---------PVY 231

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
               F   S   R  Y   V  LD  VG +I  L++ G+ +N++VLF +DNGA   G   
Sbjct: 232 SSPMFRGAS--RRGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMIGSAF 289

Query: 298 NKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI- 356
             GSN P    K T ++GG+R     W P   Q  +VS ++  + DW  T    A I + 
Sbjct: 290 ENGSNGPFLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFATALDLASIAVP 349

Query: 357 NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNV 388
            D  +DGV+   VL   A+  R  I H   N 
Sbjct: 350 KDRIIDGVSLLPVLNGAAEFDRP-IFHYRGNA 380


>gi|284041117|ref|YP_003391047.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820410|gb|ADB42248.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 486

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 125/397 (31%), Positives = 196/397 (49%), Gaps = 34/397 (8%)

Query: 20  AFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALC 77
           A++   AP  P ++++   DDLG+ D+S  G+    TPN+D +A  G    N    QA+C
Sbjct: 27  AWVEKPAPATPPNVVLFFMDDLGYGDLSVTGALDYTTPNLDKMAAEGTRFTNFLAAQAVC 86

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           + SR+AL+TG YP  +G+ +G +    P GL   E+ L + LKE GYAT   GKWHLG  
Sbjct: 87  SASRAALLTGCYPNRLGL-YGALGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGDN 145

Query: 138 REVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRH-NMQVDNKTIGIYST 194
           ++ + P  +GFD +YG  Y   +   +    +A + P + +D      ++ +       T
Sbjct: 146 KQ-FLPMQQGFDEYYGVPYSHDMWPLHPAQAQAKYPPLRWIDGNEPGPEIKDLNDAGKIT 204

Query: 195 DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYA 254
              TE A++ I  H K KP FLY+ H   H          P    A+F   S   R  + 
Sbjct: 205 GTITEKAVSFIRNHKK-KPFFLYVPHPLPH---------VPLATSARFKGQS--ARGIFG 252

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            +++ LD SVG ++  L++ G+ +N++V+F++DNG P      + GS+   R  K T ++
Sbjct: 253 DVLTELDWSVGQIMNELKQQGLDKNTLVIFISDNG-PWLNYGDHAGSSGGFREGKGTSFE 311

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
           GG R    +  P +    +VS++L    D LPT+    G  +    +DGV+ W  L KG 
Sbjct: 312 GGHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCGARLPKQRIDGVD-WVALLKG- 369

Query: 375 KTKRSEILHNIDNVDNPQ-KYYAALRVDDLKYVAGTD 410
                      DN   P+ K+Y   R + L+ V   D
Sbjct: 370 -----------DNSVTPRDKFYYYYRKNSLEAVRQGD 395


>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
          Length = 456

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 116/350 (33%), Positives = 192/350 (54%), Gaps = 30/350 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTGK 88
           P+I++I+ DDLG  D+S   +  + TPNID L + G+ ++  Y   ++ +PSR+ L+TG 
Sbjct: 36  PNILLIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLTGC 95

Query: 89  YPIHIGMQHGVILEGEP---WG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +P  +G+  GVI   +P   WG L     LLP+ +K+AGY T  IGKWHLG       P 
Sbjct: 96  FPDMVGVP-GVI-RTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLG-LESPNLPN 152

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            RGFD  +G+   + D Y+   +  F  Y  L+     ++D +    ++TDL+T+ A + 
Sbjct: 153 ERGFDFFHGFLGDMMDDYNTHLRRGFN-YMRLNTE---EIDPQG---HATDLFTDWASDY 205

Query: 205 IAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAP---DEEVAKFLDISDPERRTYAGMVSRL 260
           I +  K  KP FLYLA+ A H+     P Q P   +++V K        R     ++  +
Sbjct: 206 IFKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISETRGKLVALIEHM 260

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+ VG VI +L K G L+N++++F +DNG    G   ++ +N P+RG K   +DGG++  
Sbjct: 261 DDGVGRVINSLEKSGQLDNTLIIFCSDNG----GDRKSEANNGPVRGDKGDMYDGGIKVA 316

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            +++     + ++V++ L  +SD  PTLC    + +N   +DG++   +L
Sbjct: 317 CSLYWKGHLEHRRVNN-LVMMSDIFPTLCDLVQLPVNH-RIDGISVLPLL 364


>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
 gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
          Length = 405

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 180/352 (51%), Gaps = 25/352 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
           +   + +KP++III  DD G  D++ +G+  + TP++D++A  G+   Q Y  A +C+PS
Sbjct: 1   MGAISSEKPNVIIIFTDDQGSVDLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPS 60

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           R+ ++TG++P   G+   V       G+P  +  + + +++AGY T  IGKWHLG+  E 
Sbjct: 61  RAGMLTGRFPARAGVPGNVSSHHGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGYTPET 120

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI---GIYSTDLY 197
             P  +GF++ +G+  G  D Y H     F  + G + RH++  + K +   G +  DL 
Sbjct: 121 -MPHGQGFETSFGHMGGCIDNYSH-----FFYWNGPN-RHDLWENGKEVWRDGAFFPDLM 173

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
            E   + I +    KP FLY A    H      P Q  ++    +  +S P R  YA  V
Sbjct: 174 VEQCQDYIRKAG-DKPFFLYWAINVPHY-----PLQGKEKWRKTYAHLSSP-RDKYAAFV 226

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAP----SFGIHSNKGSNHPLRGMKSTPW 313
           S +D+ +G V+A L    + E +I++F +D+G      +FG     GS  P RG K + +
Sbjct: 227 STMDDCIGEVLATLDACQLREKTIIIFQSDHGHSHEERTFG---GGGSAGPYRGAKFSLF 283

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           +GG+R  A I  P      +V  +L    DWLPT+ A  G  +    LDG N
Sbjct: 284 EGGIRVPAMISWPGTIAEGEVRDQLATGCDWLPTISALTGAPLPAHHLDGKN 335


>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
 gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
          Length = 522

 Score =  165 bits (417), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 129/383 (33%), Positives = 191/383 (49%), Gaps = 52/383 (13%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSAL 84
           A KKP++I IL DDLG  D+  +G   I TPN+  +A  G++ +QHY    +C PSR+AL
Sbjct: 40  AAKKPNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAAL 99

Query: 85  MTGKYPIHIGMQHGVILEG----EPWG-LPLT--EKLLPQYLKEAGYATHAIGKWHLGFF 137
           MTG++  H  ++    L G    E +G +PL    + + + +K+AGY T  IGKW LG  
Sbjct: 100 MTGQHSGHNQIRGNFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWGLGGP 159

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSC------------KATFEPYQGLDMRHNMQVD 185
                PT +GFD  +GY    Q +  +                  +P++GL  +H+   D
Sbjct: 160 GSYGVPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEYLDPHKGLG-KHDDPND 218

Query: 186 NKTI-----GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVA 240
             +      G ++    T+ A++ I ++ K +P FLYL++ A HA       QAPDEE+A
Sbjct: 219 EASYEKFKRGDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDEEIA 272

Query: 241 KFLDISDP---------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
           K+    D                 R T A M+S +DE VG V+  L+K G+ EN++V+F 
Sbjct: 273 KYNHFEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLVIFS 332

Query: 286 ADNGAPSFGIHSNK---GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS 342
           +DNG PSF   ++     SN   RG K   +DGG+R       P      KVS  +    
Sbjct: 333 SDNG-PSFEGGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDHVSAFW 391

Query: 343 DWLPTLCAAAGIEINDTSLDGVN 365
           D LPTL     IE    ++DG++
Sbjct: 392 DVLPTLADLVEIEA-PKNIDGIS 413


>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 453

 Score =  165 bits (417), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 191/367 (52%), Gaps = 31/367 (8%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           NT     P++++IL DDLG  D+S   +  + TPNID +   G+ L+  Y   ++ +PSR
Sbjct: 24  NTVDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSR 83

Query: 82  SALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           +AL+TG++P  +G+  GVI     + WG    +   +P+ LK  GY T  IGKWHLG + 
Sbjct: 84  AALLTGRFPAMVGVP-GVIRPSVDQNWGYFDPSAVTMPEVLKNGGYRTALIGKWHLG-WE 141

Query: 139 EVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
               P  RGFD  +G+   +  DYY H  +     Y      ++ ++D K    ++T+L+
Sbjct: 142 SPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMYL-----NDKEIDPKG---HATELF 193

Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRT-Y 253
           T  +++ I +  K K P FLYLA+ A H+     P Q P E V K    D S P +R   
Sbjct: 194 TSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARL 248

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
             ++  LD ++G VI +L + G L N++V+F +DNG    G   +  +N P RG K   +
Sbjct: 249 IALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNG----GDRGSMANNGPTRGAKGDMF 304

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
           +GG+    A+  P + +  +  +    + D +PT+C    + +    +DG++  D +   
Sbjct: 305 EGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVNVPVKH-EIDGISVLDAIK-- 361

Query: 374 AKTKRSE 380
            KT+ +E
Sbjct: 362 GKTQNTE 368


>gi|336424148|ref|ZP_08604192.1| hypothetical protein HMPREF0994_00198 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336009236|gb|EGN39231.1| hypothetical protein HMPREF0994_00198 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 454

 Score =  164 bits (416), Expect = 8e-38,   Method: Compositional matrix adjust.
 Identities = 127/366 (34%), Positives = 176/366 (48%), Gaps = 30/366 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K P+II+I  DDLG+ DV  +GS    TP ID LA NG+     Y+ + +C+PSR A+MT
Sbjct: 6   KCPNIILINCDDLGYGDVGCYGSGVNRTPGIDYLAQNGMRCTSFYMASPVCSPSRGAMMT 65

Query: 87  GKYPIHIGMQ----HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           G YP  IG +      V+  G+  GL   E  + Q L+ AGY T  +GKWH G  +E + 
Sbjct: 66  GCYPPRIGFESFEGEWVLFPGQGVGLNSDEVTIAQILQRAGYRTKLVGKWHCG-DQEEFL 124

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH--NMQVDNKTIGIYSTDLYTEA 200
           PT  GFD +YG      D      K  F P   L       +Q D ++I    T+ Y E 
Sbjct: 125 PTNHGFDEYYGLPYS-NDMGRQVFKEDFPPLPLLSGTGIVELQPDQRSI----TERYVEQ 179

Query: 201 AINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
               I E NK+   FLYLAH+ VH      P  A +E V K ++        +   V  +
Sbjct: 180 CRTFIRE-NKNNKFFLYLAHMHVHL-----PLYAGEEFVKKSIN------GDFGACVEAI 227

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D SV  ++  LR  G+ E+++V+F +DNG    G   + GSN PL+G K T W+GG R  
Sbjct: 228 DWSVRVLLHELRSLGLEEDTLVIFTSDNG----GRGDHGGSNAPLKGNKGTTWEGGQRVP 283

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAA-GIEINDTSLDGVNQWDVLTKGAKTKRS 379
              + P      +   EL    D+  TL     G    D   D ++   V+ +G KT R 
Sbjct: 284 CIFYWPGHIAPGQECRELICSLDFYRTLAELGHGTMPADRKNDSLDIRRVILEGGKTPRD 343

Query: 380 EILHNI 385
              + I
Sbjct: 344 TFFYYI 349


>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
 gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
          Length = 461

 Score =  164 bits (416), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P+I+I+LADD G+ D  F G++ I TPNID LA  G I    +V A + +PSRS ++T
Sbjct: 29  QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +  +   G+  GLP  E+LLP  LK   Y T  IGKWHLG       P  +
Sbjct: 89  GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 145

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
           GFD+ YG   G + Y+       ++P +  D   N+   Q + + +    Y TD     A
Sbjct: 146 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDSYFTDELASKA 197

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
              + E    +P  LY++  A H+     P +A +E++A+F       R+ YA M+  LD
Sbjct: 198 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 247

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
             VG ++  L+  G  +N+I+ F++DNG    G  +N+ SN PL+G K   ++GG R   
Sbjct: 248 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 303

Query: 321 AAIWSPWLKQTQKVS 335
             +W    K+ Q+ +
Sbjct: 304 FVVWGDRFKRDQRFT 318


>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 499

 Score =  164 bits (416), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 125/388 (32%), Positives = 188/388 (48%), Gaps = 41/388 (10%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
           F  L  +L F   F  T   + P+ I I+ DD G+ D+  +G   I TPNID +A  G+ 
Sbjct: 3   FRTLIISLSFLLGF--TAKAEMPNFIFIMTDDQGYGDLGCYGHPIIKTPNIDKMADRGVR 60

Query: 68  LNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
               Y +  C+P+R++LMTG +   +G+   ++      GL      +P+ LKE GY T 
Sbjct: 61  FTDFYARHKCSPARASLMTGAFNFRVGVG-SIVYPNSTTGLIKEVVTIPEMLKEKGYTTA 119

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYG-----------YWQGLQDYYDHSCKATFEPY--- 173
            IGKWHLG     Y P  +GFD ++G           +   + + +  S + T E Y   
Sbjct: 120 LIGKWHLG-HTAGYLPRDQGFDYYFGVPGTNHGDAKTHKLPVAEGFKPSGEFTIEDYWAD 178

Query: 174 --QGLDMRHNMQVDNKTIGIYSTDL------YTEAAINVIAEHNKSKPMFLYLAHLAVHA 225
             +G+     + + N  +  + TD+      YT  A+  I E NK KP FLY AH     
Sbjct: 179 KGKGVHGNSTILMKNDNVIEWPTDITQLTKRYTHDAVRYIKE-NKDKPFFLYFAH----- 232

Query: 226 GNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFM 285
           G  + P+       A F   SD     Y  M+  +D SVG VI AL+++G+ + +I+ F 
Sbjct: 233 GTPHHPYTVD----AAFRGKSD--HGLYGDMIEEIDWSVGEVIKALQENGIEKKTIIAFT 286

Query: 286 ADNGAPSF--GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISD 343
           +DNGA S     H+ KGSN PL+G K +  +GG+R    +  P      K ++E+  + D
Sbjct: 287 SDNGADSKPNKEHAEKGSNLPLKGWKGSSEEGGVRVPFVLSWPGTLPEGKKTNEIASLMD 346

Query: 344 WLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
             PT  A AGIE      +DG N + ++
Sbjct: 347 IFPTYAALAGIEPEVPQKIDGNNIFPIM 374


>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
 gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
          Length = 461

 Score =  164 bits (416), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P+I+I+LADD G+ D  F G++ I TPNID LA  G I    +V A + +PSRS ++T
Sbjct: 29  QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +  +   G+  GLP  E+LLP  LK   Y T  IGKWHLG       P  +
Sbjct: 89  GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 145

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
           GFD+ YG   G + Y+       ++P +  D   N+   Q + + +    Y TD     A
Sbjct: 146 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDGYFTDELASKA 197

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
              + E    +P  LY++  A H+     P +A +E++A+F       R+ YA M+  LD
Sbjct: 198 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 247

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
             VG ++  L+  G  +N+I+ F++DNG    G  +N+ SN PL+G K   ++GG R   
Sbjct: 248 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 303

Query: 321 AAIWSPWLKQTQKVS 335
             +W    K+ Q+ +
Sbjct: 304 FVVWGDRFKRDQRFT 318


>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
           43184]
 gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
          Length = 464

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 107/315 (33%), Positives = 165/315 (52%), Gaps = 32/315 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P+I+I+LADD G+ D  F G++ I TPNID LA  G I    +V A + +PSRS ++T
Sbjct: 32  QRPNILILLADDAGYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLT 91

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y    G +  +   G+  GLP  E+LLP  LK   Y T  IGKWHLG       P  +
Sbjct: 92  GRYGQRYGYECNLDKPGD--GLPDDEELLPALLKRYDYRTGCIGKWHLG-SEPSQRPNAK 148

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGI--YSTDLYTEAA 201
           GFD+ YG   G + Y+       ++P +  D   N+   Q + + +    Y TD     A
Sbjct: 149 GFDTFYGLLAGHRSYF-------YDP-ETSDKDGNLQQYQYNGRKLSFDGYFTDELASKA 200

Query: 202 INVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLD 261
              + E    +P  LY++  A H+     P +A +E++A+F       R+ YA M+  LD
Sbjct: 201 QQFVTE--SEQPFMLYMSFTAPHS-----PNEATEEDLARF---EGQPRQKYAAMMYALD 250

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GV 320
             VG ++  L+  G  +N+I+ F++DNG    G  +N+ SN PL+G K   ++GG R   
Sbjct: 251 RGVGKIVDELKAAGKFDNTIIFFLSDNG----GSTTNQSSNLPLKGFKGNKFEGGQRVPF 306

Query: 321 AAIWSPWLKQTQKVS 335
             +W    K+ Q+ +
Sbjct: 307 FVVWGDRFKRDQRFT 321


>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
 gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
          Length = 457

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 121/382 (31%), Positives = 177/382 (46%), Gaps = 43/382 (11%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            A    LL + A  +  AP KP+I+ IL DD+G  D   +G++   TP+ID LA  G+  
Sbjct: 11  IAAILVLLASGALHSDAAPTKPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRF 70

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGE--PWG----------LPLTEKLLP 116
              Y   +C+P+R++LMTGK+P  + + + +   G   P G          LPL EK + 
Sbjct: 71  TDAYAAPVCSPTRASLMTGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIA 130

Query: 117 QYLKEAGYATHAIGKWHLG-FFREVYTPTFRGFD-----SHYGYWQGLQDYYDHSCKATF 170
           Q L   GY    IGKWHLG      Y P  RGFD      H+G +     + D       
Sbjct: 131 QELHADGYQCAMIGKWHLGEEHGPEYRPQNRGFDRVVLSEHHGIFNYFYPFVDQQKWPYA 190

Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
            P  G              G Y  D  T+ AI+ + E N+ +P FLYL+H +VH      
Sbjct: 191 GPLPG------------NPGDYLPDRLTDEAIDFVRE-NRERPFFLYLSHWSVHGR---- 233

Query: 231 PFQAPDEEVAKFLDISDPER-RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG 289
            + AP+  +AK+ +    ER   YA M+  +D SVG ++A L +  + +N++ +FM+DNG
Sbjct: 234 -YFAPESLIAKYRERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSDNG 292

Query: 290 APSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
                      S  PLRG K + ++GG+R    +  P + +     S      D  PT  
Sbjct: 293 G------ERITSMAPLRGSKGSLYEGGVRVPLIVRYPGVVKPNTTCSVPVISHDLFPTFL 346

Query: 350 AAAGIEINDTSLDGVNQWDVLT 371
             A     D  LDG +   +LT
Sbjct: 347 DFAERSYRDNKLDGHSIAGLLT 368


>gi|311747710|ref|ZP_07721495.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126575698|gb|EAZ80008.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 481

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 123/348 (35%), Positives = 179/348 (51%), Gaps = 31/348 (8%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
           T  P KP+I++I ADD+G+ D+  +G++Q  TPN+D +A +G+   Q YV  A+C+ SR+
Sbjct: 32  TEIPSKPNIVLIFADDMGYGDLGVYGATQWETPNLDKMASDGVRFTQFYVPHAVCSASRA 91

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           AL+TG Y   + +  G +      GL   E  + + LK  GYAT  +GKWHLG  +  + 
Sbjct: 92  ALLTGTYANRLEI-FGALDHSAKHGLNPEETTIAEMLKANGYATGIVGKWHLG-HQAPFL 149

Query: 143 PTFRGFDSHYGYWQGLQDYYDH--SCKATFEPYQGLDMRHNMQV----DNKTIGIYSTDL 196
           PT +GFDS+YG      D + H    K  + P   L +  N  V    D++++    T  
Sbjct: 150 PTEQGFDSYYGLPYS-NDMWPHHPEVKGYYPP---LPLYENTAVIDTLDDQSM---LTTN 202

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           YTE A+  I E++K KP FLYLAH   H      P    D+   K       E   Y  +
Sbjct: 203 YTEKALEFI-ENSKDKPFFLYLAHSMTHV-----PLYVSDKFKGK------SEHGLYGDV 250

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDG 315
           +  +D SVG V   L + G+ EN+IV+F +DNG   S+G H+  G    L+  K T WDG
Sbjct: 251 MMEVDWSVGQVRNKLDELGLAENTIVIFTSDNGPWLSYGGHA--GLTGGLKEGKGTSWDG 308

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
           G+R       P      KV ++     D LPTL    G ++ +  +DG
Sbjct: 309 GIREPGIFVWPDHFPAGKVETQAAMTIDILPTLAEITGSKLPELPIDG 356


>gi|430744284|ref|YP_007203413.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430016004|gb|AGA27718.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 501

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/360 (31%), Positives = 178/360 (49%), Gaps = 24/360 (6%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           ++ +F +   +L   + ++  +P +P+I++++ADD+G+ D S  G+  I TPN+DA A  
Sbjct: 3   QREWFVVASLVLVIGSRVHAASPPRPNIVLVMADDVGYGDFSCLGNPIIQTPNVDAFAKQ 62

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
            +     +V   C P+R+AL+TG++    G+ H  ILE E   L L    L Q LK  GY
Sbjct: 63  AVRFTDFHVSPTCAPTRAALLTGRHEFKSGVTH-TILERER--LSLKATTLAQVLKSKGY 119

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQG--LQDYYDHSCKATFEPYQGLDMRHNM 182
           AT   GKWHLG   + Y P  RGFD  + +  G   Q Y      A    Y    + HN 
Sbjct: 120 ATGVFGKWHLG-DEDAYQPGQRGFDEVFIHGAGGIGQTYAGSGGDAPGNSYFNPAILHNG 178

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
             + KT G Y TD++   A+  I +  K  +P F Y+   A H+     P   P+E   +
Sbjct: 179 TFE-KTTG-YCTDVFFRQALTWIDQERKGDEPFFAYITPNAAHS-----PLDCPEEYAKR 231

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
                      + GM+  +D++ G ++A L+  G+  +++V+FM DNG    G    +  
Sbjct: 232 HAGQVPENVAKFYGMIENIDDNFGTLVAKLKDWGIENDTLVIFMTDNG----GTAGTRTY 287

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPW---LKQTQKVSSELFHISDWLPTLCAAAGIEIND 358
           N  +RG K TP+ GG R V + W  W    K   +V +   H+ D LPTL    G  +++
Sbjct: 288 NAGMRGTKGTPYQGGTR-VPSFWR-WPSGFKGDVEVGALTAHL-DILPTLAEIVGSPLSE 344


>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
 gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
          Length = 496

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/363 (34%), Positives = 192/363 (52%), Gaps = 25/363 (6%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGS-SQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           P KP+I+IILADD G+  V+ +    +I TPNID LA +G+   Q Y    L +P+R+ L
Sbjct: 34  PDKPNIVIILADDQGYGGVNCYPHIKKIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGL 93

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           MTGKY    G  +G+       G+P  +KLL +YL E GY T  IGKWHLG +   + P 
Sbjct: 94  MTGKYQQSFGF-YGLSTP-HVGGIPQDQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PN 150

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEP-YQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
            RGF + +G+  GL DYYD     +++  Y GL    +  ++  T   YST  YT+ A++
Sbjct: 151 NRGFQTFFGFINGLHDYYDPLVGGSWDGVYNGLAFTLD-NMEPVTEMEYSTYEYTKRAVD 209

Query: 204 VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE---RRTYAGMVSRL 260
            I + N   P FLYL + A+H+     P QAP+E + + L I+  E         M   L
Sbjct: 210 FI-QKNADHPFFLYLPYNAIHS-----PLQAPEELIGE-LAINPQEIGKDDIARAMTFAL 262

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+ VG V+  L + G+ +N+I+ +++DNGA  +   S+K      RG K + ++GG+R  
Sbjct: 263 DQGVGKVVETLEQLGLRDNTIIFYLSDNGAVEY---SDKWE---FRGRKGSYYEGGIRVP 316

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSE 380
             +  P       + ++     D  PT+   AG+   D  + GVN    L+   +T+  +
Sbjct: 317 FIVSYPAKLAKGTIYNKPVMSIDIAPTVMELAGLSHAD--MHGVNLLPYLSGKDRTEPHD 374

Query: 381 ILH 383
           +L+
Sbjct: 375 VLY 377


>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 472

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 185/366 (50%), Gaps = 41/366 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           + P+I+ IL DDLGW D+  +GS    TPNID L   G+     Y    +C+PSR++++T
Sbjct: 27  RPPNIVYILIDDLGWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILT 86

Query: 87  GKYPIHIGM-------------QHGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKW 132
           GK P +IG              + G I+  + +  + L EK++P+ L ++GY + +IGKW
Sbjct: 87  GKNPANIGFTGHITAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKW 146

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF-EPYQGLDMRHNM--QVDNKTI 189
           H+G   E + PT +GF  +          Y+H    T+  P++     + +   +DN+  
Sbjct: 147 HVG-EEEKFFPTHQGFAINIA-------GYEHGSPPTYWGPFESEKSWNPVIKNLDNREE 198

Query: 190 GIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF-LDIS-- 246
           G Y T+  T+ AIN I E NK  P FLYL+H AVH      P +APD  + K+ L +   
Sbjct: 199 GQYLTNRLTDEAINFIDE-NKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYELKLEGQ 252

Query: 247 -DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
            + +   YA M+  +D +VG ++ +L   G+  N+IV+  +DNG           +N PL
Sbjct: 253 MEQKNAIYAAMIENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGG-----EGRVTNNVPL 307

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           R  K   ++GG+R    I  P   +    S       D LPT+   AG+   +  +DGV+
Sbjct: 308 REGKGYIYEGGIRVPLVIKWPGKVKPNSSSDVPVITDDMLPTIVEMAGLSNENQDIDGVS 367

Query: 366 QWDVLT 371
              +LT
Sbjct: 368 LLPLLT 373


>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 472

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 103/294 (35%), Positives = 150/294 (51%), Gaps = 26/294 (8%)

Query: 42  WNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVI 100
           W DVS +G + +PTPNID +A  G+  +  YV A +C  SR+ LMTG+ P   G  + + 
Sbjct: 39  WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98

Query: 101 LEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ 159
            +G+   GLP+ +K +   L+  GY T A GKWHLG  R+ Y PT RGFD  +G+  G  
Sbjct: 99  DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGADRQFY-PTNRGFDEFFGFLAGET 157

Query: 160 DYYDHSCKATFEPYQGLDM--------RHNM-----QVDNKTIGIYSTDLYTEAAINVIA 206
           +Y D            +D          H M              Y T+  T+ A++ I 
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217

Query: 207 EHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
               +K P F Y+A+ A H      P Q P     +F ++ DP RRTY  M++ +D+ VG
Sbjct: 218 RSADAKQPFFSYVAYNAPH-----WPLQVPQAYYDRFANVKDPVRRTYIAMIAAMDDGVG 272

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTPWDGGMR 318
            ++  L + G+ EN+IV+F++DNG P  FG  +   +NHP    K T  +GG R
Sbjct: 273 QILDTLEERGLRENTIVVFLSDNGCPERFGFCN---TNHPWGSGKFTYLEGGTR 323


>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
          Length = 511

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/385 (32%), Positives = 183/385 (47%), Gaps = 38/385 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTG 87
           KP+ I++L DD+GW D+   G     TP +D +A  G++    Y    LC+PSR+AL+TG
Sbjct: 19  KPNFILMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTG 78

Query: 88  KYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           + PI  G        ++    +    G+P  E LLP+ L++AGY +  +GKWHLG  +  
Sbjct: 79  RLPIRNGFYTTNGHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAK 137

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD---NKTIGIYSTDLY 197
           Y P   GFD  +G        YD+        Y+  +M      D    K      T LY
Sbjct: 138 YLPLKHGFDEWFGAPNCHFGPYDNVHTPNIPVYRNEEMAGRYYQDFKIEKNGESNLTQLY 197

Query: 198 TEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           T+ A+  I   HNKSKP FLY A  A H     EP  A       FL  S+  R  Y  +
Sbjct: 198 TKEAVEFITRMHNKSKPFFLYWAVDATH-----EPLYA----SKPFLGTSN--RGLYGDV 246

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           V  LD +VG ++A+LR   +  N++V F +DNG  ++    + GSN P    K T ++GG
Sbjct: 247 VRELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYA-KQHGGSNGPFLCGKETTFEGG 305

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAK 375
           MR     W P   +  +VS ++  + D   T    A + +  D  +DG+           
Sbjct: 306 MREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFAEVAMPTDRVIDGI----------- 354

Query: 376 TKRSEILHNIDNVDNPQKYYAALRV 400
           + R+ +L+N  NV  P  YY   ++
Sbjct: 355 SLRASLLNNT-NVQRPLFYYRGNQI 378


>gi|417305610|ref|ZP_12092564.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327538043|gb|EGF24733.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 455

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 178/355 (50%), Gaps = 41/355 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A + P+ I+I ADDLG+ D+S + SS + TP++D LA  G      +V A +C+PSR+AL
Sbjct: 16  ADRPPNFILIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAAL 75

Query: 85  MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +TG+YP+  GM   ++  + + + +G    E  +P+ L  AGY +  +GKWHLG   E  
Sbjct: 76  LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 135

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
            P   GFD ++G              + +EP +G +   +    QV+ K +     T  Y
Sbjct: 136 HPLDAGFDEYFG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 183

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           T+  I+ I +  K  P F+Y++H  VH  N  +P  +PD     F+  S  E+  Y   +
Sbjct: 184 TDEVIDFI-KRQKDDPFFVYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 231

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD S G ++  +R  G+ EN++V+F +DNG       +  GS+  L G K    +GG 
Sbjct: 232 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 285

Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
           R      W+  +   Q VS       D LP  C  AG+ I +D  +DG N   VL
Sbjct: 286 RVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCQLAGVPIPDDRQIDGKNILPVL 339


>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
           africana]
          Length = 513

 Score =  164 bits (414), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 120/367 (32%), Positives = 182/367 (49%), Gaps = 26/367 (7%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSAL 84
           AP+ P+I+++L DD+GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL
Sbjct: 27  APQPPNIVLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAAL 86

Query: 85  MTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           +TG+ PI  G        ++    +    G+P +E LLP+ LK+A YAT  +GKWHLG  
Sbjct: 87  LTGRLPIRNGFYTTNGHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLGH- 145

Query: 138 REVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYS 193
           R  + P   GFD  +G        YD+  K     Y+  +M         ++ KT     
Sbjct: 146 RPQFHPLKHGFDEWFGSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANL 205

Query: 194 TDLYTEAAINVIA-EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
           T +Y + A++ I  + ++ +P FLY A  A HA         P      FL  S  +R  
Sbjct: 206 TQIYLQEALDFIKRQQSQQRPFFLYWAIDATHA---------PVYASKPFLGTS--QRGR 254

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           Y   V  +D+SVG ++  LR  G+ E++ + F +DNGA         GSN P    K T 
Sbjct: 255 YGDAVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDNGAALVSAPKEAGSNGPFLCGKQTT 314

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLT 371
           ++GG+R  A  W P      +VS +L  + D   T  + AG+E  +D  +DG++   VL 
Sbjct: 315 FEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLEPPSDREIDGLDLLPVLL 374

Query: 372 KGAKTKR 378
           +G    R
Sbjct: 375 RGHLIDR 381


>gi|149199915|ref|ZP_01876943.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
 gi|149136984|gb|EDM25409.1| arylsulphatase A [Lentisphaera araneosa HTCC2155]
          Length = 489

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 131/426 (30%), Positives = 204/426 (47%), Gaps = 46/426 (10%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           R++  +L C L F    ++  A +KP+I+  L DDLG+ D+  +G+    TP ID LA  
Sbjct: 6   RQFVTSLAC-LAFFTGVVSLQAQQKPNILFYLTDDLGYGDIGCYGAEGQYTPAIDQLAKE 64

Query: 65  GLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGE-PWGLPLTEKLLPQYLKEAG 123
           G   +  YV   C+PSR+A MTG Y   +G+   +    E P GL  +E  LP+ +K AG
Sbjct: 65  GTKFSSFYVHQRCSPSRAAFMTGSYAHRVGLPQVIYKHREGPIGLNPSEITLPELMKTAG 124

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y T  +GKWHLG ++  + P   G+D  YG+ + ++     S          ++ R  + 
Sbjct: 125 YNTALVGKWHLGEWKP-FHPLNHGYDYFYGFLKVIEGSEKPSL---------IENRKELA 174

Query: 184 VD-NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
               KT G        +AAIN + +H K+ P FL  +    HA     P+  P E+    
Sbjct: 175 SKIQKTEG--QAPGMVKAAINFMTKHKKN-PFFLVYSDPMPHA-----PY-FPSEQFK-- 223

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--- 299
                 +R  Y  ++  +D    +++ AL + G+ EN+IV+F +DNG P   +   K   
Sbjct: 224 ---GTSKRGNYGEVIHEIDWQFKHLMDALDELGLKENTIVVFTSDNGPP---VERQKKYD 277

Query: 300 -GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-N 357
            G + PLR  K T ++GG+R    I  P   +    S  +  I D LPT C  AG+++ N
Sbjct: 278 VGLSGPLRDGKWTNFEGGVRVPFIIRWPGKVKVDASSDAMIGIIDMLPTFCELAGVDVPN 337

Query: 358 DTSLDGVNQWDVLTKGAKTK---RSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQ 414
           D  +DGVN    L    ++K    ++I+     + N  KYYA  +           NN +
Sbjct: 338 DRVIDGVNILPQLLGDQESKALRETQIVPGATIIHNGWKYYAKQQ--------NPYNNKK 389

Query: 415 SDEWYG 420
            ++W G
Sbjct: 390 PEDWNG 395


>gi|423288159|ref|ZP_17267010.1| hypothetical protein HMPREF1069_02053 [Bacteroides ovatus
           CL02T12C04]
 gi|392671048|gb|EIY64524.1| hypothetical protein HMPREF1069_02053 [Bacteroides ovatus
           CL02T12C04]
          Length = 481

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 139/419 (33%), Positives = 198/419 (47%), Gaps = 67/419 (15%)

Query: 8   FFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
            FAL C L             +P+++ ILADDLG+ DVS +G  +  TPNIDALA  G+ 
Sbjct: 9   LFAL-CMLASCSVSEKNKLESRPNVVFILADDLGYGDVSCYGQEKFTTPNIDALASEGMK 67

Query: 68  LNQHYVQALCT---PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEK--LLPQYLKEA 122
             Q Y  A CT   PSR +LMTG +  H  ++    L+GE   LP+ E    + +  K+ 
Sbjct: 68  FTQFY--AGCTVSAPSRCSLMTGFHTGHAQVRGNRELKGEGQ-LPMAEGTYTIARMFKDN 124

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
           GYAT A GKW LGF      P  +GFD  +GY          +C+     Y    + HN 
Sbjct: 125 GYATGAFGKWGLGFPGSEGDPNNQGFDEFFGY----------NCQRQAHRYYPSHLWHNQ 174

Query: 183 QV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA----------- 225
           +       D K   +Y+ DL  E A++ I   NK KP F ++  +  HA           
Sbjct: 175 EKVILEGNDTKNKTVYAPDLIHEKALDFI-RTNKDKPFFAFIPVIQPHAELLVPEDSIIE 233

Query: 226 ---GNTYE-PFQAPDEEVAKFLD--------ISDPE-RRTYAGMVSRLDESVGNVIAALR 272
              G   E PF A D+E A++ D         S PE   T+A MVSR+D+ VG+V   L+
Sbjct: 234 KYRGKYPETPFVA-DKEGAEYGDPDFDVKAYCSQPEPHATFAAMVSRVDKYVGDVTGLLK 292

Query: 273 KHGMLENSIVLFMADNGAPSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAI-W 324
           + G+ +N+IV+F +DNG      H   G       SN    G K +  DGG+R    I W
Sbjct: 293 ELGIDDNTIVIFSSDNGP-----HLEGGADPDFWNSNGDFSGYKRSMTDGGIRVPMIIKW 347

Query: 325 SPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
              +K    V   +    D++PT     G++++ T  DG++   V+T   + K  E L+
Sbjct: 348 GDRIK-AGSVEQHIGAFYDFMPTFADLLGVDVDKT--DGISLLPVITGSGEQKAHEFLY 403


>gi|291223783|ref|XP_002731888.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 516

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 124/422 (29%), Positives = 202/422 (47%), Gaps = 41/422 (9%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           ++  KP+I+++ ADD+G+ D++ +G     TPN+D +A NG++    Y  A LC+PSR+A
Sbjct: 29  SSGSKPNIVVLFADDVGYGDLACYGHPASYTPNLDGMAKNGMLFTSFYSAANLCSPSRAA 88

Query: 84  LMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYT 142
           L+TG+Y I  G+   V+      GLP  E  +   LK+ GY T  IGKW LG  ++ V+ 
Sbjct: 89  LLTGRYHIRSGIYPFVLGPDSTGGLPRNETTIANILKQQGYHTAMIGKWDLGIGKDMVHF 148

Query: 143 PTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLD-----------------MRHNMQ 183
           P   GFD +YG  Y  G        C+  F P++G                   +  NM+
Sbjct: 149 PMHYGFDKYYGVPYVHGAC-----PCETCFYPHEGCQQSVIGGDKCGFDKTYCPLFQNMR 203

Query: 184 VDNKTIGIYS-TDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
           +  + + + +  + Y + A + I +  N  +  FLYLA+   H          P     +
Sbjct: 204 IIQQPVDLLTLAEKYVDFAKDFITDSANAGESFFLYLAYNHCH---------FPQFAGKR 254

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           F + +  +R  Y   ++ +D SVG ++  LR   +  N++V F +DNGA +     N GS
Sbjct: 255 FHNFT--QRGIYGDSLAEIDWSVGEILQQLRHSQIENNTLVFFTSDNGA-ALRDWYNGGS 311

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSL 361
           +  L+  K+T ++GG R  A  + P        S +     D LPT+   +G  +   ++
Sbjct: 312 SGMLKCGKATTYEGGHRVPAIAYWPGQITAGVSSRDFLTTLDLLPTITKLSGSVLPKVTI 371

Query: 362 DGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGD 421
           DG++   VL   AK+ R  +L+     D     Y A+R    K    T+ + QS  +  D
Sbjct: 372 DGMDMSSVLFGQAKSARDNLLYYPAYPDKKIGAY-AIRYKHYKAHFYTEGSPQSGMYNAD 430

Query: 422 TD 423
           TD
Sbjct: 431 TD 432


>gi|402884716|ref|XP_003905821.1| PREDICTED: arylsulfatase A [Papio anubis]
          Length = 507

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 128/404 (31%), Positives = 192/404 (47%), Gaps = 30/404 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL L   YV  +LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRLTDFYVPVSLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV+      GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRMGMYPGVLEPSSRGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGSEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y       ++ +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCWNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPRLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 199 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QNFAERSG--RGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+R  G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             K+ R  +       D  +  + A+R    K    T  +  SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRSGKYKAHFFTQGSAHSD 407


>gi|340617020|ref|YP_004735473.1| sulfatase [Zobellia galactanivorans]
 gi|339731817|emb|CAZ95082.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 452

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/383 (33%), Positives = 188/383 (49%), Gaps = 42/383 (10%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            AL C+    D+        KP+II ILADDLG+ D+  +G   I TP+ID +A  GL  
Sbjct: 1   MALFCSCENTDSAYGKVKGSKPNIIFILADDLGYGDLGCYGQETISTPHIDRMASEGLRF 60

Query: 69  NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
            QHY  A +C PSR++LMTG+   H+G      +E       LT   + + LK+AGY T 
Sbjct: 61  TQHYAGATVCAPSRNSLMTGQ---HMGNTTIKSMEKPIKDSDLT---VAEVLKDAGYRTG 114

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ--DYYDHSCKATFEPYQ-GLDMRHNMQV 184
            IGKW LG          +GFD  +GY+  ++  +YY        E +    ++ +    
Sbjct: 115 VIGKWGLGNVGTSGYANAQGFDYSFGYYDQIRAHNYYPDYLMENGEKFPLKNEVVYVSDS 174

Query: 185 DNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
            N  +GI         YS DL+TE A++ I    K +P FLYLA+   HA N        
Sbjct: 175 TNYAVGIGNAAIKKKEYSNDLFTEKALDFIKTDAK-EPFFLYLAYTIPHANNESFLINEH 233

Query: 236 DEEVAKFLDISD-----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
             EV  +  I D      E++  A M++RLD  VG ++  L+K+ +  N++V+F +DNG 
Sbjct: 234 GMEVPDY-GIYDKEPWPSEKKAGAAMITRLDSYVGQILDLLKKNNLDGNTLVIFTSDNGP 292

Query: 291 PSFGIHSNKG-------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK-VSSELFHIS 342
                H   G       SN PLRGMK   ++GG+R      + W  + +K  + ++    
Sbjct: 293 -----HQEGGWQVDYFDSNGPLRGMKRDLYEGGIR--VPFIAKWPGKIEKGETDQVATFW 345

Query: 343 DWLPTLCAAAGIEINDTSLDGVN 365
           D+LPT C  +G +    S DG++
Sbjct: 346 DFLPTACELSG-QDKPISTDGIS 367


>gi|149178470|ref|ZP_01857059.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
 gi|148842683|gb|EDL57057.1| arylsulfatase A (precursor) [Planctomyces maris DSM 8797]
          Length = 491

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 127/383 (33%), Positives = 190/383 (49%), Gaps = 36/383 (9%)

Query: 9   FALTCTLLFNDAFLNTT----APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           F + C L+F     N      A + P+ +II  DD G+ DV   GS  I TPN+D +A  
Sbjct: 22  FLVIC-LVFLSLLSNVADARPARQHPNFVIIFTDDQGYQDVGVFGSPNIKTPNLDQMAKE 80

Query: 65  GLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
           G+     Y  QA+C+ SR AL+TG YP  +G++ G +      G+   E  + + +K  G
Sbjct: 81  GIRFTDFYAAQAVCSASRVALLTGCYPNRVGIR-GALGPQSKIGINAEETTIAEVVKPQG 139

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGLDMRHN 181
           YAT   GKWHLG   E + PT  GFD ++G  Y   +  ++  + K     +  L +  N
Sbjct: 140 YATAIYGKWHLGHLPE-FLPTRHGFDEYFGLPYSNDMWPFHPTAGKR----FPDLPLIEN 194

Query: 182 MQVDN-KTIGIYSTDL---YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE 237
             V N K  G     L   YTE A++ I + N  KP FLY+ H   H      P    D+
Sbjct: 195 ETVINPKVTGKEQAQLSTWYTERAVSFINK-NHDKPFFLYVPHSMPHV-----PLFVSDK 248

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIH 296
              K       E+  Y  ++  +D SVG +  AL+++G+ EN++V+F +DNG   S+G H
Sbjct: 249 FKGK------SEQGLYGDVIMEIDWSVGQIRQALKENGIEENTLVIFTSDNGPWLSYGDH 302

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI 356
           +  GS  PLR  K T WDGG R    +  P       V  ++    D LPT+   +G ++
Sbjct: 303 A--GSALPLREGKGTAWDGGQREPCIMAWPGQIPAGSVCHQMAMTIDILPTIAYLSGGKV 360

Query: 357 -NDTSLDGVNQWDVLT--KGAKT 376
             D  +DG N W +++  +GA++
Sbjct: 361 PQDRIIDGKNIWPLMSGEQGAQS 383


>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 469

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 119/349 (34%), Positives = 179/349 (51%), Gaps = 30/349 (8%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           +P++I+I+ADDLG+ D+ + GS  IPTPNID LA  G+     YV    C PSR+  +TG
Sbjct: 31  QPNVIVIMADDLGYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTG 90

Query: 88  KYPIHIGMQ-------HGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           +Y    G +       H  I      G+P +E+L  + L+  GY T  +GKWH+G    +
Sbjct: 91  RYQQRFGFETNPPYDRHNTIA-----GVPASERLFAERLQAVGYKTGIVGKWHIGS-HSI 144

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATF-EPYQGLDMRHNMQVDNKTIGIYSTDLYTE 199
           + P  RGFD  +G+  G  D++    +    E Y    MR+   VD   +  Y T   T+
Sbjct: 145 HHPNNRGFDFFFGFLGGGHDFFRVDTREPMDEGYLDPMMRNGSSVD---VEGYLTTQLTD 201

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            AI  I E N+  P FL+L++ A HA     P QAP+E +AKF  +   ERR Y+ MV  
Sbjct: 202 EAIGFI-ERNEKDPFFLFLSYNAPHA-----PLQAPEESIAKFSHVEGKERRVYSAMVYE 255

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG--SNHPLRGMKSTPWDGGM 317
           +D  +G +++ L    + E +I+ F++DNG P     S     SN   RG K   +DGG+
Sbjct: 256 MDLEIGRILSMLEDRDLAEETIIFFLSDNGGPPHWDKSQDSYTSNGAFRGYKGDTYDGGV 315

Query: 318 R-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
                A W   L   ++ +S +  + D   T  A  G +  ++ ++GVN
Sbjct: 316 HVPFLAYWPGTLPAGKQFTSPVVSL-DIARTAVALGGGD--ESEMEGVN 361


>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
          Length = 491

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 113/359 (31%), Positives = 173/359 (48%), Gaps = 26/359 (7%)

Query: 34  IILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIH 92
           I+ + ++GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+ PI 
Sbjct: 3   ILKSVNMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIR 62

Query: 93  IGM-------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTF 145
            G        ++    +    G+P +E+LLP+ LKEAGY +  +GKWHLG  R  + P  
Sbjct: 63  NGFYTTSGHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLGH-RPQFHPLK 121

Query: 146 RGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAA 201
            GFD  +G        YD+  +     Y+  +M         ++ KT     T +Y + A
Sbjct: 122 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEA 181

Query: 202 INVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRL 260
           ++ I     ++ P FLY A  A HA         P      FL  S  +R  Y   V  +
Sbjct: 182 LDFIRRQQAARRPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREI 230

Query: 261 DESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGV 320
           D+SVG ++ +L+  G+ E++ V F +DNGA         GSN P    K T ++GGMR  
Sbjct: 231 DDSVGKILHSLQDLGIAESTFVFFTSDNGAALISAPKQGGSNGPFLCGKQTTFEGGMREP 290

Query: 321 AAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
           A  W P      +VS +L  I D   T  + AG+E   D ++DG++    + +G    R
Sbjct: 291 AVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAGLEPPRDRAIDGLSLLPAMLQGQLMDR 349


>gi|430741911|ref|YP_007201040.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013631|gb|AGA25345.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 469

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 127/378 (33%), Positives = 177/378 (46%), Gaps = 58/378 (15%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRSAL 84
           A +KP+I+ ILADDLG+ DV+ +GS    TP+ID LA  GL     Y     C P+R+AL
Sbjct: 28  ADRKPNIVFILADDLGFTDVACYGSRYYETPHIDKLAAQGLKFTDGYTCGPNCQPTRAAL 87

Query: 85  MTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQYLKEAGYATHAIGKW 132
           M+G+Y    G+     ++   W            GLPL +  + Q L+ AGYAT   GKW
Sbjct: 88  MSGQYGPRTGIYTVGSIDRFDWKSRPLRPVDNAQGLPLGKVTIAQTLQAAGYATGMFGKW 147

Query: 133 HLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
           HLG   + Y P  RGFD                 +A     Q  D     +V     G Y
Sbjct: 148 HLGQQGD-YHPRRRGFD-----------------EAIVSMGQHFDFTTAPEV-TYPAGTY 188

Query: 193 STDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS------ 246
             D  T+ A++ I  H K +P FLYL H  VH+     P+QA  + V KF + +      
Sbjct: 189 LADFLTDKAVDFITRH-KDEPFFLYLPHFGVHS-----PYQAKKDLVKKFAEKAAAGGHH 242

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------ 300
           DP   TYA M++ +DESVG V+  L + G+ ++++V+F +DNG    G +  +G      
Sbjct: 243 DP---TYAAMLASVDESVGRVVKTLDELGLSDDTLVIFSSDNGG--VGGYEREGIGKAGA 297

Query: 301 --SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEIN 357
              N PLRG K   ++GG+R       P       V     +  D  PTL   A G    
Sbjct: 298 VTDNAPLRGGKGMLYEGGIRAPYLFRWPGKIPAGTVCDRAINSVDLYPTLVELAEGKAPE 357

Query: 358 DTSLDGVNQWDVLTKGAK 375
              LDG +   +LT G +
Sbjct: 358 SDPLDGTSYLRLLTSGGQ 375


>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
          Length = 544

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 124/407 (30%), Positives = 192/407 (47%), Gaps = 71/407 (17%)

Query: 29  KPHIIIILADDLGWNDVSFH----GSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSA 83
           +P+II++LADD+G+ND+S H        + T NIDALA +G++  + Y   A C PSR++
Sbjct: 59  RPNIILVLADDMGYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAANATCAPSRAS 118

Query: 84  LMTGKYPIHIG----------------------------MQHGVILEGEPW---GLPLTE 112
           +MTGKYP   G                            +   V+    P+   G+P  +
Sbjct: 119 IMTGKYPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQ 178

Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
             + + L++AGY T  IGKWHLG    +  P  +GF    G    L    DH        
Sbjct: 179 ITIAEVLRDAGYYTAHIGKWHLGHEYGM-DPMSQGFQDSLGLVGPLYLPEDHP------- 230

Query: 173 YQGLDMRHNMQVDNKTIGI-----------------YSTDLYTEAAINVIAEHNKSKPMF 215
              ++ + + ++D    G+                 Y TD YT+ A+ VI E+NK++P F
Sbjct: 231 -DVVNAKFDTRIDKMIWGMGQYSANFNGGDLFAPDKYVTDYYTDEALKVI-ENNKNRPFF 288

Query: 216 LYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHG 275
           LYL+H A+H      P QA   +  +   +     + Y+GM++ LD SVG +I  L++  
Sbjct: 289 LYLSHWAIH-----NPLQALRSDFEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELD 343

Query: 276 MLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS 335
           +   ++++F +DNG  ++ I  N   N P RG K + +DGG+R    I  P      K S
Sbjct: 344 IYGKTLIIFTSDNGGANY-IELND-INKPYRGWKISFFDGGIRVPYIISWPDEINPGKKS 401

Query: 336 SELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
               H  D  PT+  AAGIE +   LDGV+    +   + +K  + L
Sbjct: 402 ENAVHHFDIFPTILKAAGIE-STNELDGVDLMPFIKNDSSSKPHKTL 447


>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 749

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 112/339 (33%), Positives = 170/339 (50%), Gaps = 26/339 (7%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP++I +L DD G+ D+  +G++++ TP IDALA  G     +Y  A +C+PSR+ L+TG
Sbjct: 28  KPNVIFVLVDDQGYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTG 87

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP  +G    V       G+   E  L +   + GYAT  IGKWHLG F E + P  +G
Sbjct: 88  CYPRRVGNHIWVHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLG-FHEPFLPQNQG 146

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIA 206
           FD ++G    L    D      FE   G+ +  N QV  +       T  YT+ AI+ + 
Sbjct: 147 FDHYFGLLHNL----DPVETVYFEEQGGVPLLRNDQVVQRPADPAELTKQYTDEAISWME 202

Query: 207 EHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGN 266
           +H + +P FLYL H  +H      P    DE    F   S+     Y   +  LD ++G 
Sbjct: 203 QH-RDQPFFLYLPHTMLH-----NPLGVSDE----FRGSSN--WGEYGDAIQELDHNMGR 250

Query: 267 VIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSP 326
           ++  L++  + +N++V++ +DNG         +     L+G K + W+ G+R  A  W P
Sbjct: 251 LMQTLKRLEIDKNTVVVYASDNGR-----GPGRTDQQKLQGRKLSTWEAGIRVPAIAWGP 305

Query: 327 WLK-QTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDG 363
            L  Q+  VSS +    DW PTL + AG E+  D  +DG
Sbjct: 306 GLGLQSGAVSSTVVSAMDWYPTLASLAGFEVPADRVIDG 344


>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 443

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 121/355 (34%), Positives = 178/355 (50%), Gaps = 31/355 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           ++P++++IL DDLG+ D+  +GS  I TP +D +A +G+ L   Y  A +CTP+R+ALMT
Sbjct: 32  QQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRAALMT 91

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G Y   +G+   + +  E  G+  +E  L + +++ GY T  +GKWHLG     Y PT  
Sbjct: 92  GCYATRVGLPTPLHVYDE-IGINESEFTLGEAMQQCGYETVCVGKWHLGHQPRFY-PTEH 149

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS----TDLYTEAAI 202
           GF+ ++G   G         KA        D       D + I        T+  TE A+
Sbjct: 150 GFNHYWGTPLGHMFNRPAVGKAIG------DTSDLFLDDTREIPFPEDADLTERLTEKAV 203

Query: 203 NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDE 262
             I E  + +P FL+LAH   H     EP  A +    KF D S  +   Y  +V  LD 
Sbjct: 204 EFI-EAKRDRPFFLFLAHPMPH-----EPLAASE----KFADRS--QAGLYGDVVECLDW 251

Query: 263 SVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAA 322
           S G V+ AL + G  EN++V F++DNG P  G     G+   LRG K  P++GGMR    
Sbjct: 252 STGEVLDALERTGQAENTLVAFLSDNG-PKKG----HGTTAGLRGFKHDPYEGGMRVPCI 306

Query: 323 IWSPWLKQTQKVSSELFHISDWLPTLC-AAAGIEINDTSLDGVNQWDVLTKGAKT 376
           I +  L    + S E+ ++ DW PTL   A G  +++   DG N   +LT    T
Sbjct: 307 IAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTTVSEQVRDGKNLTPLLTGETAT 361


>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 463

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 115/362 (31%), Positives = 187/362 (51%), Gaps = 35/362 (9%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           +   +L   +LF  +     A  KP++++IL DDLG+ D+S +G++ + +PNID L   G
Sbjct: 11  QLLMSLVAVVLFGVSS-TAQAADKPNVLLILVDDLGYGDLSCYGATDLQSPNIDKLVSRG 69

Query: 66  LILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVIL--EGEPWG-LPLTEKLLPQYLKE 121
           L     Y    +C+P+R+A+++GKYP  +G+  GVI       WG L    +LLP  L+ 
Sbjct: 70  LKFTNFYANCPVCSPTRAAILSGKYPDRVGVP-GVIRTHADNSWGYLAPEAELLPSLLQP 128

Query: 122 AGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLD-MR 179
           AGY +  IGKWHLG       P  RGFD   GY   +  DYYDH          G++ MR
Sbjct: 129 AGYHSAIIGKWHLG-LEAPNRPNDRGFDHFKGYLGDMMDDYYDHR-------RHGINYMR 180

Query: 180 HNMQ-VDNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPD- 236
            N Q +D +    ++TDL+TE + + + E  +  +P FLYLA+ A H      P Q P  
Sbjct: 181 ENEQEIDPEG---HATDLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPPQD 232

Query: 237 --EEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
             E+V K       +R     ++  +D+ +G V+  L + G  ++++V+F +DNG     
Sbjct: 233 WLEKVKKREAGIGEQRAKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGG-QLN 291

Query: 295 IHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           + +N G+    R  K + ++GG++   A +W   ++  Q    +   + D+  T+   AG
Sbjct: 292 VGANNGAT---RDGKQSMYEGGLKVPTAVVWPGKIEAGQTTEFQALSM-DFCSTILDVAG 347

Query: 354 IE 355
           +E
Sbjct: 348 VE 349


>gi|403282838|ref|XP_003932845.1| PREDICTED: arylsulfatase A [Saimiri boliviensis boliviensis]
          Length = 509

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 129/398 (32%), Positives = 191/398 (47%), Gaps = 38/398 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 21  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 80

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  L + L   GY T   GKWHLG   E  + P  
Sbjct: 81  GRLPVRMGMYPGVLVPSSRGGLPLEEVTLAEVLAAQGYLTGMAGKWHLGVGPEGAFLPPH 140

Query: 146 RGFD-------SH-YGYWQGLQDY-----YDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
           +GF        SH  G  Q L  +      D  C     P   + +  N+ V+ +   + 
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPDTPCDGGCDQGLVP---IPLLANLSVEAQPPWLP 197

Query: 193 STDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
             +  Y   A +++A+   + +P FLY A    H    Y  F         F + S   R
Sbjct: 198 GLEARYIAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--R 246

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
             +   +  LD +VG ++ A+   G+LE ++V+F ADNG  +  + S  G +  LR  K 
Sbjct: 247 GPFGDSLMELDAAVGALMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKG 305

Query: 311 TPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDV 369
           T ++GG+R  A A W   +  T  V+ EL    D LPTL A AG  + + +LDG +  D+
Sbjct: 306 TTYEGGVREPALAFWPGHI--TPGVTHELSSSLDLLPTLAALAGAPLPNVTLDGFDLSDL 363

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA 407
           L    K+ R  +         P + +    V + KY A
Sbjct: 364 LLGTGKSPRQSLFF---YPSYPDEVHGVFAVRNGKYKA 398


>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 550

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 132/461 (28%), Positives = 206/461 (44%), Gaps = 96/461 (20%)

Query: 9   FALTCTLLFND--AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           F +    LF+   A L     KKP+II+I+ DD+G++D+  +GS +I TPNID LAY GL
Sbjct: 7   FIIALLALFSSIIAPLVQAQTKKPNIILIMVDDMGYSDLGAYGS-EIQTPNIDKLAYEGL 65

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEP--WGLPLTEKL-LPQYLKEAG 123
            L + Y  ++C P+R++L+TG+YP   G+ +  +  G P   G    E L   + L++AG
Sbjct: 66  RLREFYNNSICAPTRASLITGQYPHKAGVGYFDVNLGIPPYQGYLNKESLTFGEVLRQAG 125

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y+T   GKWH+G    ++ P  RGFD  +G   G  +Y+D         Y  + +  N +
Sbjct: 126 YSTLLSGKWHVG-NDSLHWPKQRGFDRFFGVIGGGSNYFDAEPMPLGRQYPVVILEDNQR 184

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
              K    Y TD  T  A+  + E NK  KP FLYLA+ A H      P QA  E++AK+
Sbjct: 185 QKPKANSYYFTDEITNHAVQFLDEQNKMDKPFFLYLAYTAPHW-----PLQALPEDIAKY 239

Query: 243 -----------------------------------LDISDPE---------------RRT 252
                                               D+  P+                  
Sbjct: 240 RGKYDTGWDALRKERLARQVKLGIITQAQADAAAARDVEVPDWDNLTFEEKQLWKAKMEV 299

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP--------------------S 292
           YA M+ R D+ VG V+A L++    +N++++F++DNGAP                    +
Sbjct: 300 YAAMLDRADQGVGKVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGT 359

Query: 293 FGIHSNKG------SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLP 346
            G   ++G      SN P R  KS  ++GG+      W P   +   ++    H+ D  P
Sbjct: 360 AGSFESQGKPWSYVSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAP 419

Query: 347 TLCAAAGIE-------INDTSLDGVNQWDVLTKGAKTKRSE 380
           T    AG++       ++  +L G +  ++L K    +R E
Sbjct: 420 TFYELAGLKYPSTYNGVSTNALQGKSLSNLLFKTEAWERGE 460


>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 651

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 135/449 (30%), Positives = 201/449 (44%), Gaps = 94/449 (20%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQ-------IPTPN 57
           +KY F +    +       +T   +P+II+I+ADDLG+ DV F+  +        IPTP 
Sbjct: 3   KKYLFFIFLLTVVGQYNAQSTKGSRPNIIVIVADDLGYADVGFNRDANFPAEKGVIPTPE 62

Query: 58  IDALAYNGLIL-NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILE-GEPWGLPLTEKLL 115
           +D LA NG+I  N H     C PSR+ALMTG  P  IG+Q+ +  +     G+PL E   
Sbjct: 63  LDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSRIGVQYNLPNDINTSLGIPLEETYF 122

Query: 116 PQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATF----- 170
           P+ L++  Y T A GKWHLGF +  Y P  RGFD  +G+  G + Y++   +  +     
Sbjct: 123 PKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYFFGFLGGGKAYFEREYEDLYYRRLG 182

Query: 171 ------EPYQGLDMRHNMQV--DNKTIGIYSTDLYTEAAINVIAEH-NKSKPMFLYLAHL 221
                   YQ    R    V  D      Y TD+ T+ AIN IAE+  KS P F+Y+A+ 
Sbjct: 183 GSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILTDEAINYIAENKTKSDPFFMYVAYN 242

Query: 222 AVHAGNTYEPFQAPDEEVAKF----------LDISDPERRTYAGMVSRLDESVGNV---- 267
           A H      P QAP EE+A+F          +  SD  + + A  + +++  +  V    
Sbjct: 243 APHT-----PLQAPAEEIAQFKEENPNFESLVRNSDYLKESGAYELDKVERELKKVKDYD 297

Query: 268 ------------IAALRKH---GMLENSIV------------------------------ 282
                       +A  R+H      E+ IV                              
Sbjct: 298 YNSLSSSEKEEKVADYREHLLDKFTESRIVYATMVSNMDKNIGRIVAELKKDQKVFDNTM 357

Query: 283 -LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFH 340
            +F++DNG  ++   S    NHPL  +K + ++GG R    I W   +K  +K   ++  
Sbjct: 358 IVFLSDNGGYTY---SKGAVNHPLYALKGSVFEGGHRVPFFIHWPNKIKSPKKYKYQISS 414

Query: 341 ISDWLPTLCAAAGIEINDT-SLDGVNQWD 368
           + D  PTL   AG  +  T  LDG++  D
Sbjct: 415 L-DLYPTLVDLAGASVPSTKKLDGLSFMD 442


>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
 gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
          Length = 454

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 125/360 (34%), Positives = 180/360 (50%), Gaps = 35/360 (9%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           +TTA   P+I+ ILADDLGW D+  +G+S   TPN+D LA  G+   Q Y   A+C+ +R
Sbjct: 7   DTTA--APNILFILADDLGWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATR 64

Query: 82  SALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
            AL+TG+Y   +       +     GLP     LP  L+EAGY T  IGKWHLG     Y
Sbjct: 65  IALITGRYQYRLPAGLEEPIARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLG-KPPTY 123

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM---QVDNKTIGIYSTDLYT 198
            P   G+D  +G   G  DY+ H      +P  G  +  ++    V  +  G Y+  L  
Sbjct: 124 GPLKSGYDRFFGNIGGALDYFTH------KPGVGAQVPRDLWEGDVPVERTGYYTNILGD 177

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP-DEEVAKFLDISD------PERR 251
           EA+  V A  ++ KP FL L   A H      P++ P DE V++  +I+D          
Sbjct: 178 EASAYVRAREDEKKPFFLSLHFTAPHW-----PWEGPGDEAVSR--EITDLFHYDGGNLA 230

Query: 252 TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKST 311
            Y  +V  LD +VG V+AAL + G  +N++V+F +DNG   F          P  G K+ 
Sbjct: 231 KYGELVEALDAAVGQVLAALDETGQADNTLVVFTSDNGGERF------SKTWPFTGQKTE 284

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEIN-DTSLDGVNQWDVL 370
             +GG+R    +  P  +   +V +++   +DWLPTL AAA +  + D   DG N   VL
Sbjct: 285 LLEGGLRVPTLLRWP-ARVRPQVQAQVTITADWLPTLLAAAQVWPHPDYPSDGENILPVL 343


>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
 gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
          Length = 469

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 118/369 (31%), Positives = 178/369 (48%), Gaps = 52/369 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           +KP+I+ I+ADDLG  D+  +G + IPTPN+D +A  G+  N  Y  + +C PSR++LMT
Sbjct: 29  EKPNIVFIIADDLGIGDIGPYGQTIIPTPNLDKMASTGMKFNDFYAGSTVCAPSRASLMT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+   H G  H ++     + L    K+LP+ LKEAGY     GKW LG      +P   
Sbjct: 89  GQ---HTG--HTLVRGNGEYPLESGRKILPERLKEAGYTNAMFGKWGLGLANTHSSPEKV 143

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTI--GIYSTDLYTEAAINV 204
           G+D   G+         H   A ++    LD     ++    +  G Y+ DL+ E +I  
Sbjct: 144 GWDHFVGHL--------HHVSAHYQKPDSLDCIIEGEISRMPVPKGKYANDLFVENSIEF 195

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD----------PE----- 249
           I      +P FLYL+    HA         P+     FLD S           PE     
Sbjct: 196 IETQTDEQPFFLYLSLTVPHA-----ELLVPERYFQPFLDESGKSIFPSEKAHPEGLHYG 250

Query: 250 -----RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG------IHSN 298
                +  YA +VS +D+ VG V++AL K G+ EN++V+F +DNG    G      +   
Sbjct: 251 AQPYPKAAYAALVSSIDDYVGRVMSALEKIGIAENTLVIFTSDNGTHVEGGRRMEDVDYF 310

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW--LPTLCAAAGIEI 356
           K S H  RG+K   +DGG++    +   W  + ++ S+  F  + W   PTL   AGIE+
Sbjct: 311 KSSAH-FRGVKRDLFDGGIK--VPLMVNWPGKIKEGSTSDFRGAFWDLFPTLSHLAGIEV 367

Query: 357 NDTSLDGVN 365
           +    DG++
Sbjct: 368 DRDDTDGIS 376


>gi|408672294|ref|YP_006872042.1| sulfatase [Emticicia oligotrophica DSM 17448]
 gi|387853918|gb|AFK02015.1| sulfatase [Emticicia oligotrophica DSM 17448]
          Length = 478

 Score =  162 bits (411), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 125/375 (33%), Positives = 190/375 (50%), Gaps = 35/375 (9%)

Query: 5   RKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           +K F ++   +L   +   T     P+II+I  DD+G+ D+S +G+    TPNID LA  
Sbjct: 2   KKIFLSILTIILLQSS---TKKELSPNIILIFMDDMGYGDLSCYGAIDYRTPNIDKLASE 58

Query: 65  GL-ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
           G+   N    QA+C+ SR+AL+TG YP  IG+  G ++     G+   E  L + LK+ G
Sbjct: 59  GVRFTNFLAAQAVCSASRAALLTGCYPNRIGIS-GALMPTSKNGINPEETTLAEMLKQKG 117

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYG--YWQGL-QDYYDHSCKATFEPYQGLDMRH 180
           YAT   GKWHLG  +  + P   GFD + G  Y   +   YYD     + +      +  
Sbjct: 118 YATGIFGKWHLGHHK-TFLPLQHGFDEYVGLPYSNDMWPVYYDGKPATSTQHMSAYPVLP 176

Query: 181 NMQVDNKTIGIYSTD-------LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
            +Q ++K   I + D       LYTE A+  I ++NK KP FLYL H   H      P  
Sbjct: 177 LIQNNDKKEEIKTLDDQAKLTTLYTEKAVEFIKKNNK-KPFFLYLPHSMPHV-----PIA 230

Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PS 292
             D+   K       ++  YA ++  +D SVG ++  L+++G+ +N++V+F +DNG   +
Sbjct: 231 VSDKFKGK------SKQGLYADLMMEIDWSVGEIMKTLKENGLDKNTLVIFTSDNGPWLN 284

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQK--VSSELFHISDWLPTLCA 350
           FG H+  GS+   R  K T ++GG R V  I   W   T +  V ++L    D LPT+  
Sbjct: 285 FGNHA--GSSGGFREGKGTSFEGGQR-VPCIMR-WKGTTPEGVVCNQLASTIDVLPTIAR 340

Query: 351 AAGIEINDTSLDGVN 365
             G E+    +DGVN
Sbjct: 341 ITGAELPKNKIDGVN 355


>gi|406830821|ref|ZP_11090415.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 497

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 128/360 (35%), Positives = 174/360 (48%), Gaps = 29/360 (8%)

Query: 10  ALTCTLLFN-DAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           A  C LLF     L       P+II+I+ DD G+ D S HG+  + TP++D L   G+  
Sbjct: 25  ASICLLLFTWGGRLPRLHAAPPNIIVIITDDQGYGDFSCHGNPVLKTPHLDRLHDEGVRF 84

Query: 69  NQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
              +V   C P+RSA+MTG++    G+ H  ILE E   L LT   LPQ LK  GY T  
Sbjct: 85  LDFHVSPTCAPTRSAIMTGRHEFRNGVTH-TILERER--LTLTATTLPQVLKTVGYTTGI 141

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGY-WQGLQDYYDHSC-KATFEPYQGLDMRHNMQVDN 186
            GKWHLG     Y P  RGFD  + +   G+   Y  SC  A    Y    + HN   + 
Sbjct: 142 FGKWHLG-DEPAYRPGQRGFDETFIHGGGGIGQTYPGSCGDAPGNTYFDPAIMHNGMFE- 199

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQA-PDEEVAKFLDI 245
           KT G Y TD++   A+  I      +P F Y+A  A H      P Q  P++E      +
Sbjct: 200 KTQG-YCTDVFFGQAMTWINTVKGRQPFFCYIATNAPHG-----PLQVRPEDEQRYRGKV 253

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP-SFGIHSNKGSNHP 304
           +D     + GM+S +D++VG ++A L   G+   ++V+FM DNG      IH     N  
Sbjct: 254 ADSNVAKFFGMISNIDDNVGRLLAKLTDWGIERETLVVFMNDNGGTVGVAIH-----NAG 308

Query: 305 LRGMKSTPWDGGMRGVAAIWS-PWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
           +RG K TPW GG R  A+ W  P   +   V S   H+ D  PT+      EI +  LDG
Sbjct: 309 MRGQKGTPWLGGTR-AASFWRWPGTLKPVDVRSPCAHL-DVFPTIA-----EIAEIKLDG 361


>gi|354504055|ref|XP_003514094.1| PREDICTED: arylsulfatase A [Cricetulus griseus]
 gi|344258436|gb|EGW14540.1| Arylsulfatase A [Cricetulus griseus]
          Length = 518

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 139/448 (31%), Positives = 210/448 (46%), Gaps = 39/448 (8%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
           P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+TG+
Sbjct: 21  PNILLIFADDLGYGDLGAYGHPSSSTPNLDQLANGGLRFTDFYVPVSLCTPSRAALLTGR 80

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
            P+  GM  GV+      GLPL E  L + L   GY T   GKWHLG   E  + P  +G
Sbjct: 81  LPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTGMAGKWHLGVGPEGAFLPPHQG 140

Query: 148 FD-------SH-YGYWQGLQDY-YDHSCKATF-EPYQGLDMRHNMQVDNKTIGIYSTDL- 196
           F        SH  G  Q L  +  D  C  +  + Y  + +  N+ V+ +   +   +  
Sbjct: 141 FHRFLGIPYSHDQGPCQNLTCFPPDIPCSGSCDQGYVPIPLLANLSVEAQPPWLPGLEAR 200

Query: 197 YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           Y   + +++A+   +S+P FLY A    H    Y  F         F   SD  R  +  
Sbjct: 201 YVSFSRDLMADAQRQSRPFFLYYASHHTH----YPQFSG-----ESFSQQSD--RGPFGD 249

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            +  LD ++G ++ A+   G+L  ++V+F ADNG P     SN G +  LR  K T ++G
Sbjct: 250 SLMELDAAIGALMTAVGDLGLLGETLVIFTADNG-PELMRMSNGGCSGLLRCGKGTTFEG 308

Query: 316 GMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGA 374
           G+R  A A W   +  T  V+ EL    D LPTL A  G  + + +LDGV+   +L    
Sbjct: 309 GVREPALAFWPGHI--TPGVTHELASSLDLLPTLAALTGAPLPNVTLDGVDISPLLLGTG 366

Query: 375 KTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA-----GTDNNGQSDEWYGDTDNEIDKY 429
           K+ R  +         P + +    V D KY A     G+ ++  + +      N +  +
Sbjct: 367 KSPRKSVFF---YPPYPDEVHGVFAVRDGKYKAHFFTQGSAHSDTTSDSACHAANPLTSH 423

Query: 430 SPKEV--LYSKAGITFNALKTKLQIKQK 455
            P  +  L    G  +N L+T  +I  +
Sbjct: 424 EPPLLYDLSEDPGENYNLLETMDEISPE 451


>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 508

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 131/404 (32%), Positives = 183/404 (45%), Gaps = 59/404 (14%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGS----SQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
           A + P++I ILADDLG ND+S  G      ++ TPNID LA  G + +  Y     C PS
Sbjct: 15  AVRPPNVIFILADDLGINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPS 74

Query: 81  RSALMTGKYPIHIGMQHGVILEG-----------------------------EPW---GL 108
           R+ LMTG+YP   G ++     G                              P+   GL
Sbjct: 75  RAMLMTGRYPTRTGFEYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGL 134

Query: 109 PLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGL----QDYYDH 164
           P  E  L + LK+ GY T  IGKWHLG     + P  +GFD       GL     D    
Sbjct: 135 PTEEVTLAEVLKDRGYHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGV 193

Query: 165 SCKATFEP-----YQGLDMRHNMQ-VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYL 218
             +  F+P     +  +D   +    D    G Y TD +T+ ++ VI + NK++P FLYL
Sbjct: 194 EARLDFDPIDKFLWARMDFAASYNGSDWFEPGGYLTDYWTDESLKVI-DANKNRPFFLYL 252

Query: 219 AHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLE 278
           AH  VH      P QA  E+     DI     R YA M+  +D SVG ++  L + G+ +
Sbjct: 253 AHWGVHT-----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEGLAD 307

Query: 279 NSIVLFMADNGAPSF-GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSE 337
           N+I++F +DNG   + GI      N P RG K T ++GG+R    +  P   Q   V  E
Sbjct: 308 NTIIVFSSDNGGAGYIGIPE---VNAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVIDE 364

Query: 338 LFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVLTKGAKTKRSE 380
                D +PTL AA G  + +   +DG N   + T      R +
Sbjct: 365 PVAHIDLMPTLAAATGARLPSSVEIDGQNLLPLATGTGPITRPD 408


>gi|421614583|ref|ZP_16055637.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408494670|gb|EKJ99274.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 478

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 115/356 (32%), Positives = 177/356 (49%), Gaps = 41/356 (11%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
            A + P+ ++I ADDLG+ D+S + SS + TP++D LA  G      +V A +C+PSR+A
Sbjct: 38  VADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAA 97

Query: 84  LMTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREV 140
           L+TG+YP+  GM   ++  + + + +G    E  +P+ L  AGY +  +GKWHLG   E 
Sbjct: 98  LLTGRYPMRCGMPVARNENVAKYKAYGFSPDEITIPELLGPAGYRSLMVGKWHLGMELEG 157

Query: 141 YTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDL 196
             P   GFD + G              + +EP +G +   +    QV+ K +     T  
Sbjct: 158 SHPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKR 205

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           YT+  I+ I E  K  P F+Y++H  VH  N  +P  +PD     F+  S  E+  Y   
Sbjct: 206 YTDEVIDFI-ERQKDDPFFIYVSHHIVH--NPLKP--SPD-----FIGTS--EKGKYGDF 253

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           +  LD S G ++  +R  G+ EN++V+F +DNG       +  GS+  L G K    +GG
Sbjct: 254 IKELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGG 307

Query: 317 MRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
            R      W+  +   Q VS       D LP  C  AG+ I +D  +DG +   VL
Sbjct: 308 HRVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCELAGVPIPDDRQIDGKSILPVL 362


>gi|423315320|ref|ZP_17293250.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
           CL09T03C04]
 gi|423315328|ref|ZP_17293258.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
           CL09T03C04]
 gi|392679557|gb|EIY72938.1| hypothetical protein HMPREF1058_03862 [Bacteroides vulgatus
           CL09T03C04]
 gi|392679565|gb|EIY72946.1| hypothetical protein HMPREF1058_03870 [Bacteroides vulgatus
           CL09T03C04]
          Length = 464

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 129/399 (32%), Positives = 188/399 (47%), Gaps = 40/399 (10%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSAALLSSGLTMARTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGVPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
               + +D K    YS DL  E A+N I E N  KP FLY +    HA     G     +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232

Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
           +    E         +    +P R  YA MV+ LD+SVG +I  L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291

Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
           DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +    D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351

Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGTQKEHDCIY 389


>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
 gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
          Length = 496

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 126/390 (32%), Positives = 181/390 (46%), Gaps = 59/390 (15%)

Query: 27  PKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQAL-CTPSRSALM 85
           P++P+I+++  DDLGW D  F GS    TPN+DALA  G++    Y  A  C PSR+   
Sbjct: 47  PERPNIVVVYVDDLGWLDTGFQGSQYYDTPNLDALADEGVVATSGYANAPNCAPSRACFN 106

Query: 86  TGKYPIHIGMQHGVILEGEPWGL--------PLTEKL------LPQYLKEAGYATHAIGK 131
           TG+Y    G+  G + +  P           P +E+L      + + L +AGYAT  +GK
Sbjct: 107 TGQYTPRHGV--GTVTDPPPRRPEARRLETPPNSERLGTDNATIAEVLSDAGYATGFVGK 164

Query: 132 WHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGI 191
           WHLG     + P  RGFD + G      D         F PY+      N+  D    G 
Sbjct: 165 WHLGEADSRHGPLARGFDRNVG-----GDIVGRPPAGYFSPYR----LPNVTDD----GG 211

Query: 192 YSTDLYTEAAINVIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS---- 246
           Y  D  T+ AI  +A+H     P FL  +  +VH      P QAPD+ V  + D+     
Sbjct: 212 YLPDRLTDDAIEFVADHAGGPDPFFLLFSTYSVHG-----PLQAPDDAVDAYRDVDCWNG 266

Query: 247 --DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG---- 300
             +P   TY  MVS LD +VG ++AAL + G+ E ++V+F +DNG    G +   G    
Sbjct: 267 QCNP---TYGAMVSNLDANVGRLLAALDERGIREETLVVFTSDNGG--LGDYDEIGLPRD 321

Query: 301 -----SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
                S  PLRG K T ++GG+R    +  P +    + SS     +D+ PT    AG  
Sbjct: 322 WKRVTSQGPLRGGKRTLYEGGIRVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAGAH 381

Query: 356 ---INDTSLDGVNQWDVLTKGAKTKRSEIL 382
               +D  +DG +    LT G    R  + 
Sbjct: 382 DAVPDDHVVDGRSLVPWLTDGTPPDRDALF 411


>gi|150002770|ref|YP_001297514.1| arylsulfatase [Bacteroides vulgatus ATCC 8482]
 gi|149931194|gb|ABR37892.1| arylsulfatase precursor [Bacteroides vulgatus ATCC 8482]
          Length = 464

 Score =  162 bits (409), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 129/399 (32%), Positives = 188/399 (47%), Gaps = 40/399 (10%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
               + +D K    YS DL  E A+N I E N  KP FLY +    HA     G     +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232

Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
           +    E         +    +P R  YA MV+ LD+SVG +I  L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291

Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
           DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +    D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351

Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGTQKEHDCIY 389


>gi|212690903|ref|ZP_03299031.1| hypothetical protein BACDOR_00391 [Bacteroides dorei DSM 17855]
 gi|237712609|ref|ZP_04543090.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
 gi|212666135|gb|EEB26707.1| arylsulfatase [Bacteroides dorei DSM 17855]
 gi|229453930|gb|EEO59651.1| arylsulfatase [Bacteroides sp. 9_1_42FAA]
          Length = 464

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 187/402 (46%), Gaps = 46/402 (11%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP+ E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPVGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
               + +D K    YS DL  E A+N I E N  KP FLY +    HA           Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232

Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           E      PF    +            R  YA MV+ LD+SVG +I  L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288

Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           F +DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +   
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D+LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389


>gi|167762012|ref|ZP_02434139.1| hypothetical protein BACSTE_00358 [Bacteroides stercoris ATCC
           43183]
 gi|167700104|gb|EDS16683.1| arylsulfatase [Bacteroides stercoris ATCC 43183]
          Length = 463

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 118/347 (34%), Positives = 174/347 (50%), Gaps = 27/347 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPS 80
           L T    KP+II ILADD+G+ D+S +G+  I TPNID LA  G    Q Y    + +PS
Sbjct: 27  LVTDGGDKPNIIFILADDMGYCDLSCYGNKYIETPNIDRLAATGTAFTQCYAGSGISSPS 86

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLK-----------EAGYATHAI 129
           R ALMTGK   +  ++    + G   GL  T+ +   +L+            AGY T  +
Sbjct: 87  RCALMTGKNTGNTTIRDNFCIAGGIEGLKGTKTIRRMHLQPNDTTIATVLGAAGYRTCLV 146

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMR--HNMQVDNK 187
            KWHL  F    TP  RGFD  YG+         +S    + PY   +     N++ +  
Sbjct: 147 NKWHLDGFNPEATPLNRGFDEFYGWLISTA----YSNDPYYYPYWRFNNEKLENVKENEG 202

Query: 188 TIGI-YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
              I ++TDL TE AI  I   NK+ P FLYLA+ A H     EP+   DE      +  
Sbjct: 203 DKHIKHNTDLSTEDAIKFI-NRNKNNPFFLYLAYDAPH-----EPYNI-DETTWYDDEAW 255

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
           D   + YA +++ +D ++G ++A L + G+ EN++V+F +DNGA         G    L+
Sbjct: 256 DMNTKRYASLITHMDRAIGRLLAELDRLGLRENTLVIFASDNGAAKQAPLEELGCKGSLK 315

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           GMK   ++GG+R    +  P     QK+++ + +  D +PTL A AG
Sbjct: 316 GMKGQLYEGGIRVPFIVNQPGKVPVQKLNN-IIYFPDVMPTLAALAG 361


>gi|32477898|ref|NP_870892.1| arylsulphatase A [Rhodopirellula baltica SH 1]
 gi|32448455|emb|CAD77970.1| arylsulphatase A [Rhodopirellula baltica SH 1]
          Length = 478

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 115/355 (32%), Positives = 177/355 (49%), Gaps = 41/355 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A + P+ ++I ADDLG+ D+S + SS + TP++D LA  G      +V A +C+PSR+AL
Sbjct: 39  ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSKDFFVPANVCSPSRAAL 98

Query: 85  MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +TG+YP+  GM   ++  + + + +G    E  +P+ L  AGY +  +GKWHLG   E  
Sbjct: 99  LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 158

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
            P   GFD + G              + +EP +G +   +    QV+ K +     T  Y
Sbjct: 159 HPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 206

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           T+  I+ I E  K  P F+Y++H  VH  N  +P  +PD     F+  S  E+  Y   +
Sbjct: 207 TDEVIDFI-ERQKDDPFFIYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 254

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD S G ++  +R  G+ EN++V+F +DNG       +  GS+  L G K    +GG 
Sbjct: 255 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 308

Query: 318 RGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
           R      W+  +   Q VS       D LP  C  AG+ I +D  +DG +   VL
Sbjct: 309 RVPGMFRWTSKIAPNQ-VSDVTLTSMDLLPLFCELAGVPIPDDRQIDGKSILPVL 362


>gi|168334944|ref|ZP_02693063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Epulopiscium sp.
           'N.t. morphotype B']
          Length = 594

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 125/391 (31%), Positives = 189/391 (48%), Gaps = 41/391 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTG 87
           KKP++IIIL DD G+ D++ HG+  + TPN+D L  N + L   +   LC PSR+AL++G
Sbjct: 3   KKPNVIIILTDDQGYGDMAAHGNPWLKTPNLDWLNDNAISLESFHTDPLCAPSRAALISG 62

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           KY    G+     L G  +  P   K +  Y K+ GYAT   G WHLG     Y P  RG
Sbjct: 63  KYSFGAGVYS--TLNGRYYMDP-NIKTIANYFKDGGYATAMFGXWHLGDVYP-YRPENRG 118

Query: 148 FDSHYGYWQGL----QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAIN 203
           FD    +  G+     DY+++            D  + +   N     Y TD++ ++A  
Sbjct: 119 FDVATIFGGGVIGETPDYWNNDY---------FDDTYQVNGKNTKFEGYCTDVWFDSATK 169

Query: 204 VIAEH-NKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT-YAGMVSRLD 261
            I +    +KP F YL   A H      PF        K++D+  P +R  + GM+  +D
Sbjct: 170 FIDDQLAATKPFFCYLPTNAPHG-----PFNIAPSYYQKYIDMGVPPKRAKFFGMIEGID 224

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK------GSNHPLRGMKSTPWDG 315
           E++G ++  LR     +++I++F  DNG  + G  ++K      G N  +RG K T ++G
Sbjct: 225 ENIGKLMDHLRAKNAFDDTIIIFFGDNGTAT-GCDTDKNGWVKDGFNAGMRGKKGTTYEG 283

Query: 316 GMRGVAAIWSP--WLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
             R    I +P   L + +KV   L    D LPTL    G+   + +LDGV+    L  G
Sbjct: 284 AHRNTCFITTPKNILGEARKVYG-LTAQFDLLPTLIDICGLPEGE-NLDGVSMAQALKAG 341

Query: 374 A---KTKRSEILHNIDNVDNPQKY--YAALR 399
                  R  ++HN+  +D PQKY  Y  L+
Sbjct: 342 ETHLNEDRVIMIHNM-QLDMPQKYKDYTVLK 371


>gi|255690671|ref|ZP_05414346.1| twin-arginine translocation pathway signal [Bacteroides finegoldii
           DSM 17565]
 gi|260623695|gb|EEX46566.1| arylsulfatase [Bacteroides finegoldii DSM 17565]
          Length = 453

 Score =  162 bits (409), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 117/367 (31%), Positives = 190/367 (51%), Gaps = 31/367 (8%)

Query: 23  NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSR 81
           NT     P++++IL DDLG  D+S   +  + TPNID +   G+ L+  Y   ++ +PSR
Sbjct: 24  NTVDKAIPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSR 83

Query: 82  SALMTGKYPIHIGMQHGVILEG--EPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           +AL+TG +P  +G+  GVI     + WG    +   +P+ LK  GY T  IGKWHLG + 
Sbjct: 84  AALLTGCFPAMVGVP-GVIRPSIDQNWGYFGPSAVTMPEVLKNGGYRTALIGKWHLG-WE 141

Query: 139 EVYTPTFRGFDSHYGYWQGLQ-DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
               P  RGFD  +G+   +  DYY H  +     Y      ++ ++D K    ++T+L+
Sbjct: 142 SPNLPNERGFDHFHGFLADMMDDYYTHRRQGGNYMY-----LNDKEIDPKG---HATELF 193

Query: 198 TEAAINVIAEHNKSK-PMFLYLAHLAVHAGNTYEPFQAPDEEVAKF--LDISDPERRT-Y 253
           T  +++ I +  K K P FLYLA+ A H+     P Q P E V K    D S P +R   
Sbjct: 194 TSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPVKRARL 248

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
             ++  LD ++G VI +L + G L N++V+F +DNG    G   +  +N P RG K   +
Sbjct: 249 IALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNG----GDRGSMANNGPTRGAKGDMF 304

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKG 373
           +GG+    A+  P + +  +  +    + D +PT+C    + +    +DG++  D +   
Sbjct: 305 EGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTICDFVNVPVKH-EIDGISVLDAIK-- 361

Query: 374 AKTKRSE 380
            KT+ +E
Sbjct: 362 GKTQNTE 368


>gi|338210219|ref|YP_004654266.1| Cerebroside-sulfatase [Runella slithyformis DSM 19594]
 gi|336304032|gb|AEI47134.1| Cerebroside-sulfatase [Runella slithyformis DSM 19594]
          Length = 490

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 121/354 (34%), Positives = 185/354 (52%), Gaps = 28/354 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILNQHYVQALCTPSRSALMTGK 88
           P+II+I  DDLG+ D+S +G+ Q  TPN+D LA  G+   N    QA+C+ SR+AL+TG 
Sbjct: 34  PNIILIFMDDLGYGDLSCYGALQYRTPNLDRLAGEGIRFTNFLAAQAVCSASRAALLTGC 93

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP  +G+  G +      GL   E  + + LK+ GYAT   GKWHLG  R  + PT +GF
Sbjct: 94  YPNRVGIS-GALFPNAKVGLNPEETTIAELLKDKGYATGMFGKWHLG-DRPEFLPTRQGF 151

Query: 149 DSHYG--YWQGL-QDYYDHSCKATF-----EPYQGLDMRHNMQV--DNKTIGIYS--TDL 196
           D + G  Y   +   +YD S  +T      + +  L + HN     + KT+   +  T +
Sbjct: 152 DEYVGLPYSNDMWPVHYDGSPISTTSDNRKKNFPFLPLLHNSDTLQEIKTLDDQAKLTGI 211

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGM 256
           YTE A+N I  H KS P F+YL H   H          P    AKF   S  ++ TY  +
Sbjct: 212 YTERAVNFINRHKKS-PFFVYLPHSMPH---------VPIAASAKFKGKS--KQGTYGDV 259

Query: 257 VSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGG 316
           +  +D SVG ++ AL+++G+ +N++V+F +DNG P     ++ GS+  LR  K   ++GG
Sbjct: 260 LMEIDWSVGEIMKALKENGLDKNTVVIFTSDNG-PWLNYGNHAGSSGGLREGKGNSFEGG 318

Query: 317 MRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVL 370
            R    +    +     V ++L    D LPT+    G ++    +DGV+   +L
Sbjct: 319 QRVPCIVRWKGITPAGLVCNKLTSTIDLLPTIAHICGTKLPARPIDGVDILPLL 372


>gi|326438104|gb|EGD83674.1| hypothetical protein PTSG_04280 [Salpingoeca sp. ATCC 50818]
          Length = 693

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 127/412 (30%), Positives = 198/412 (48%), Gaps = 51/412 (12%)

Query: 28  KKPHIIIILADDLGWNDV-------SFHGSSQ-IPTPNIDALAYNGLILNQHYVQ-ALCT 78
           +KP+ I++ ADDLG N++        + G++  I TPN+  LA  G++ +  Y    +C+
Sbjct: 44  RKPNFIVLFADDLGVNEIRLDNVPNGYTGNNHTIATPNLQRLANEGMVFHNWYSAFHVCS 103

Query: 79  PSRSALMTGKYPIHIG-----MQHGVILEGEPW-----------GLPLTEKLLPQYLKEA 122
           PSR+++MTG+Y I  G     +++     G P            GLPL E  + + L   
Sbjct: 104 PSRASMMTGRYSIRSGIGSSPLKYAPDAPGGPLPSQVFSAEAVGGLPLNETTIAEALLPQ 163

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYW----QGLQDY--YDHSCKATFEPYQGL 176
           GYAT AIGKWHLG  R++Y PT RGF+ +YG       G+  +  +D++  A  EP    
Sbjct: 164 GYATMAIGKWHLG-QRDIYLPTARGFEEYYGIPFSQDMGMSFWLPFDYNGIAPMEPTPIP 222

Query: 177 DMRHNMQVDNKTIGIYSTDLYTEAAIN-VIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
            ++ N  V+         D Y + A   V+    K +P FLYL    VH  N+  P    
Sbjct: 223 KLKGNAVVEQPAALDKLADEYAQVAKEFVMRNAQKDRPFFLYLPFNHVHTPNSCSP---- 278

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
                +F   S  +       V  +D  +G V+ AL+  G+ +N++VLF +DNGAP   +
Sbjct: 279 -----RFCSTS--KSGPVGDAVQEMDWVIGEVMDALKHAGVDDNTLVLFTSDNGAP---L 328

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
            +++  N PLR  K++ W+GG +     W P        S  L    D   T+   AG +
Sbjct: 329 ANDEQGNLPLRDGKASTWEGGFKAPGIAWWPNRIPAGSQSKALVSTLDVFTTVITMAGGD 388

Query: 356 I-NDTSLDGVNQWDVLTKGAKTKRSEIL--HNIDNVDNPQKYYAALRVDDLK 404
           +  D  +DG++   +L  G+K     I+  HN   V+   + Y A+R  D K
Sbjct: 389 VPTDRIIDGIDLSPILFNGSKQGHDCIMFYHNAAAVNASGELY-AMRCGDHK 439


>gi|224536571|ref|ZP_03677110.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224521827|gb|EEF90932.1| hypothetical protein BACCELL_01446 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 450

 Score =  161 bits (408), Expect = 7e-37,   Method: Compositional matrix adjust.
 Identities = 121/388 (31%), Positives = 190/388 (48%), Gaps = 38/388 (9%)

Query: 13  CTL-LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQH 71
           CTL L  +A    T  +KP+++ I  DDLGW+DV ++GS+ + TPNID LA  G+     
Sbjct: 13  CTLPLGVNALAAQTQQEKPNVLFIAVDDLGWSDVGYNGSTLVATPNIDRLASMGVSFTDG 72

Query: 72  YVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIG 130
           YV A +  PSR+ +++G Y    GMQ    L+     +P   K LP+ ++EAGY T  +G
Sbjct: 73  YVTAPISGPSRNGMVSGMYSQKYGMQINADLKFAQ--VPAQHKTLPETMREAGYRTALVG 130

Query: 131 KWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIG 190
           KWH+     V       FD  Y       +Y+  S      P   +        +N+   
Sbjct: 131 KWHVCRDANVV------FDEVYDRIDISSNYFPDSTGVYDGPRLPILAVKESTYENE--- 181

Query: 191 IYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER 250
            Y TD  T  AI  + + + + P FLYL + AVH      P+QA  +   +  +I D   
Sbjct: 182 -YMTDKLTSHAIQFMNKQSNATPFFLYLGYNAVH-----NPWQAEKKYYDRLSNIKDEYM 235

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA--------------PSFGIH 296
           R  A +++  DE++G ++  L    +++N+I++F++DNG               PSF   
Sbjct: 236 RVQASLIACADENIGILLNYLENRKLIDNTIIVFVSDNGPAKGGPELKTWEGYDPSF--E 293

Query: 297 SNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
              G    LRG K   ++GG+R    I + P L++  KV  ++    D+ PT+C   G E
Sbjct: 294 YVFGQMKTLRGHKVDLYEGGIRTPMIIAYKPLLREG-KVFRDMISTLDFYPTICEMTGSE 352

Query: 356 IND-TSLDGVNQWDVLTKGAKTKRSEIL 382
           I + T+LDGV+    L +  K +  ++L
Sbjct: 353 IPEGTTLDGVSLVRYLREETKQQPHDML 380


>gi|294778194|ref|ZP_06743624.1| arylsulfatase [Bacteroides vulgatus PC510]
 gi|294447960|gb|EFG16530.1| arylsulfatase [Bacteroides vulgatus PC510]
          Length = 466

 Score =  161 bits (408), Expect = 8e-37,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 6   TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 65

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 66  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 125

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 126 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 178

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
               + +D K    YS DL  E A+N I E N  KP FLY +    HA           Y
Sbjct: 179 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 234

Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           E      PF    +            R  YA MV+ LD+SVG +I  L++ G+ +++I++
Sbjct: 235 EGEFCETPFGGSRDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 290

Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           F +DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +   
Sbjct: 291 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 350

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D+LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 351 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGTQKEHDCIY 391


>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
 gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
          Length = 471

 Score =  161 bits (407), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 131/397 (32%), Positives = 192/397 (48%), Gaps = 65/397 (16%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           KKP+II ILADDLG+ DV F+G   I TPN+D LA  G+  NQ Y   A+C PSRS+LM+
Sbjct: 24  KKPNIIFILADDLGYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMS 83

Query: 87  GKYPIHIGMQH--GVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G++  H  ++   GV  EG+ + +P + + + + LK+AGY T A GKW LG       P 
Sbjct: 84  GQHTGHTFIRGNLGVEPEGQ-YPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPN 142

Query: 145 FRGFDSHYGY-WQGLQDYY------DHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLY 197
            +GFD  YGY  Q L   Y      D+S K      +  D+ ++ Q        Y+ DL 
Sbjct: 143 KQGFDRFYGYNCQSLAHRYYPTHLWDNSEKVVLT--ENGDLEYHKQ--------YAPDLI 192

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER------- 250
            E A+  I EH K +P FL+L ++  HA         PD+ + ++      E+       
Sbjct: 193 QEQALGFIEEH-KDRPFFLFLPYILPHA-----ELLVPDDSIFQYYKGKFEEKPFKGFDY 246

Query: 251 ----------------RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA---- 290
                            T+A MV+RLD  VG +   LR  G+ EN++++F +DNG     
Sbjct: 247 GKGAKSGGYASQQYPHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNGPHKEG 306

Query: 291 ---PSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLP 346
              P+F   S+ G    LRG+K   ++GG+R    A W   ++  Q+ S  +    D  P
Sbjct: 307 GADPNF-FQSSGG----LRGIKRDLYEGGIRVPFVAKWPKAIRAGQQ-SEFIGAFWDIFP 360

Query: 347 TLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           T     G      S DG++  + L      K+ E L+
Sbjct: 361 TFAELGGA-ATAASTDGISFVNTLKGQGTQKQHEYLY 396


>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
 gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
          Length = 478

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 116/348 (33%), Positives = 171/348 (49%), Gaps = 30/348 (8%)

Query: 24  TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRS 82
           T A  +P++I+I  DD+G+ D+  +G+  I TPN+D +A NG+     YV QA+C+ SR+
Sbjct: 27  TLAQDRPNVILIFVDDMGYGDLGAYGAEDIATPNLDQMAENGVKFTNFYVSQAVCSASRA 86

Query: 83  ALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYT 142
           +L+TG Y   +G+ HG +      GL   E  + + LKE GY T   GKWHLG     + 
Sbjct: 87  SLLTGTYANRLGI-HGALDHTAKHGLNPEEITIAEMLKEQGYQTAMFGKWHLG-HHPPFL 144

Query: 143 PTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL-------DMRHNMQVDNKTIGIYSTD 195
           PT +GFD  YG       +  H     + P   L       D  HN Q         +T+
Sbjct: 145 PTKQGFDEFYGIPYSNDMWPQHPQTKNYYPPLPLFFNDNVVDTLHNDQSR------LTTE 198

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L TE A+N I +  K +P FLYLAH   H          P     KF   S  +R  Y  
Sbjct: 199 L-TERAVNFI-QRKKDQPFFLYLAHPMPH---------VPLFVSEKFRGKS--KRGLYGD 245

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
           ++  +D SVG ++  LR +G+ E ++V+F++DNG P      + GS +PLR  K T WDG
Sbjct: 246 VLMEIDWSVGQILKTLRDNGLEEKTLVIFISDNG-PWLSFSGHAGSAYPLREGKGTSWDG 304

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
           G++  A ++         V +      D LPT+      ++    +DG
Sbjct: 305 GVKVPALMYWKGKIPAGTVQTNPAMTIDMLPTIAKLTRADLPKLPIDG 352


>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
 gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
          Length = 468

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/353 (32%), Positives = 179/353 (50%), Gaps = 30/353 (8%)

Query: 25  TAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSA 83
           + P++P+I+ I+ADD+G  D+  +GS  I TP IDA+A  G+     Y  + +C+P+R A
Sbjct: 33  SGPRRPNILYIMADDMGHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIA 92

Query: 84  LMTGKYP--IHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           L+TG+Y     IG++  +   G+   LP   + LP  L++ GYAT  +GKWH+G      
Sbjct: 93  LLTGRYQGRFRIGLEEPIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGEL-PAS 151

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKAT-FEPYQGLDMRHNMQVDNKTIGIYSTDLYTEA 200
           +P   G+D  +G   G  DY+ H+      E  +  + R  +Q        Y TDL    
Sbjct: 152 SPLDHGYDYFFGIASGGTDYFAHATTINGHEMGKLFENRTEIQRPG-----YLTDLLGAK 206

Query: 201 AINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPER------RTY 253
           AI+ + +  +  +P F+ L   A H      P+Q P  + A+    +DP +      R Y
Sbjct: 207 AIDRMQQAARQDRPFFISLHFTAPH-----WPWQGPG-DAAQGSAAADPRQMDGGNVRAY 260

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
           A M+  +D +VG V+A L   G+ E++IV+F +DNG   F        + PL GMK    
Sbjct: 261 AAMMEGMDANVGRVLAELAALGLEEDTIVIFTSDNGGERF------SDSWPLTGMKGELL 314

Query: 314 DGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVN 365
           +GG+R    +  P      + S+++    D LPTL +AAG +      LDG++
Sbjct: 315 EGGIRVPLIVSWPAHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVEGLDGID 367


>gi|291225019|ref|XP_002732502.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 197

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 79/160 (49%), Positives = 99/160 (61%), Gaps = 2/160 (1%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           +PH+I+I  DD+GWNDV +H +  I  PN+  LA +G+I NQ Y    CTPSR+A+M G 
Sbjct: 26  QPHVILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMKGL 84

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP   G QH ++    P G+PL  KLLP+ LKE GY+TH +GKWHLGF ++ Y PT RGF
Sbjct: 85  YPFKTGNQHQMVFNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYQPTNRGF 144

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
           DSHYG W      YD        P  G D R NM V  K+
Sbjct: 145 DSHYGLWTLGVGNYD-KMNGVLSPSAGYDFRDNMGVVPKS 183


>gi|440715401|ref|ZP_20895948.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436439745|gb|ELP33159.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 455

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/354 (31%), Positives = 175/354 (49%), Gaps = 39/354 (11%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A + P+ ++I ADDLG+ D+S + SS + TP++D LA  G      +V A +C+PSR+AL
Sbjct: 16  ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDQLAAEGFRSTDFFVPANVCSPSRAAL 75

Query: 85  MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +TG+YP+  GM   ++  + + + +G    E  +P+ L  AGY +  +GKWHLG   E  
Sbjct: 76  LTGRYPMRCGMPVARNENVAKYKDYGFAPDEITIPELLGPAGYRSLMVGKWHLGMELEGS 135

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD---MRHNMQVDNKTIGIYS-TDLY 197
            P   GFD + G              + +EP +G +   +    QV+ K +     T  Y
Sbjct: 136 HPLDAGFDEYLG------------IPSNYEPRRGKNHNTLYRGKQVEQKNVACEELTKRY 183

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMV 257
           T+  I+ I +  K  P F+Y++H  VH  N  +P  +PD     F+  S  E+  Y   +
Sbjct: 184 TDEVIDFI-KRQKDDPFFVYVSHHIVH--NPLKP--SPD-----FVGTS--EKGKYGDFI 231

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
             LD S G ++  +R  G+ EN++V+F +DNG       +  GS+  L G K    +GG 
Sbjct: 232 KELDHSTGRIMQTIRDAGLDENTLVIFTSDNGP------TRNGSSGELSGGKYCTMEGGH 285

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVNQWDVL 370
           R       P      +VS       D LP  C  AG+ I +D  +DG +   VL
Sbjct: 286 RVPGMFRWPSKIAPNQVSDVTLTSMDLLPLFCELAGVPIPDDRRIDGKSILPVL 339


>gi|265752292|ref|ZP_06088085.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|423229021|ref|ZP_17215426.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
           CL02T00C15]
 gi|423244861|ref|ZP_17225935.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
           CL02T12C06]
 gi|263237084|gb|EEZ22554.1| arylsulfatase [Bacteroides sp. 3_1_33FAA]
 gi|392634774|gb|EIY28686.1| hypothetical protein HMPREF1063_01246 [Bacteroides dorei
           CL02T00C15]
 gi|392640902|gb|EIY34693.1| hypothetical protein HMPREF1064_02141 [Bacteroides dorei
           CL02T12C06]
          Length = 464

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSAALLSSGLTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
               + +D K    YS DL  E A+N I E N  KP FLY +    HA           Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232

Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           E      PF    +            R  YA MV+ LD+SVG +I  L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288

Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           F +DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +   
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D+LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389


>gi|395801059|ref|ZP_10480329.1| sulfatase [Flavobacterium sp. F52]
 gi|395436775|gb|EJG02699.1| sulfatase [Flavobacterium sp. F52]
          Length = 484

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/371 (34%), Positives = 177/371 (47%), Gaps = 34/371 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL-NQHYVQALCTPSRSALMTG 87
           KP+IIII  DD+G+ D   +G + + TP+ + +A  G+ L N +  QA+CT SR+AL+TG
Sbjct: 27  KPNIIIINMDDMGYGDTEPYGMTGVNTPHFNQIAEEGMRLTNFNAGQAICTASRAALLTG 86

Query: 88  KYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
            YP  IGM  GV+L G    L   E+ +   LK+ GY T   GKWHLG     Y PT   
Sbjct: 87  CYPNRIGMS-GVLLPGAKHALNPNEETIASLLKKNGYKTANYGKWHLGNVLP-YWPTNYD 144

Query: 148 FDSHYG------YWQGLQDYYDH-----SCKATFEPYQGLDMRHNMQVDNKTIGIYS--- 193
           FD  +G       W    D Y         +++F P   L +  N  V +    I     
Sbjct: 145 FDEFFGIPYSHDVWPIDYDGYSKVTDSKDMRSSFPP---LPLISNTTVIDTIENIKDASK 201

Query: 194 -TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
            T L+TE AI  I + NK  P FLYL H   HA     P    D    KF   SD     
Sbjct: 202 LTTLFTEKAIGFI-KKNKKNPFFLYLTHPLPHA-----PLAVSD----KFKGKSD--LGL 249

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTP 312
           +  ++  LD S+G +I  L K G+ +N++++  +DNG P      N GS+   R  KST 
Sbjct: 250 FGDVIMELDWSIGEIINTLDKEGIAKNTVLIITSDNG-PWLVFGDNAGSSGGFREGKSTT 308

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           W+GG R    I  P   +   V+S L    D LPT+    G  +    +DG+N   VLT 
Sbjct: 309 WEGGTRVPFLIRWPGKIKGGSVNSSLMTNMDLLPTIAKITGASLPKNQIDGLNFLPVLTG 368

Query: 373 GAKTKRSEILH 383
              +   EI +
Sbjct: 369 KTASGPREIFY 379


>gi|345512484|ref|ZP_08792010.1| arylsulfatase [Bacteroides dorei 5_1_36/D4]
 gi|345456366|gb|EEO45525.2| arylsulfatase [Bacteroides dorei 5_1_36/D4]
          Length = 464

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 186/402 (46%), Gaps = 46/402 (11%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TTA K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSAALLSSGFTMAQTTTAEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGMPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG--------NTY 229
               + +D K    YS DL  E A+N I E N  KP FLY +    HA           Y
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEVMTEY 232

Query: 230 E------PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVL 283
           E      PF    +            R  YA MV+ LD+SVG +I  L++ G+ +++I++
Sbjct: 233 EGEFCETPFGGSKDGYKS----QQTPRAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIV 288

Query: 284 FMADNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHI 341
           F +DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +   
Sbjct: 289 FTSDNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAF 348

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
            D+LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 349 WDFLPTIGELVQADIPQ-NIDGISYLPTLTGKGIQKEHDCIY 389


>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
 gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 488

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 129/378 (34%), Positives = 190/378 (50%), Gaps = 52/378 (13%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
           L T    KP+I+ ILADDLG+ D+S +G  +  TPNID LA +G++  QHY  + +C PS
Sbjct: 27  LKTAIATKPNIVYILADDLGYGDLSSYGQQKFSTPNIDKLAKDGILFTQHYSGSTVCAPS 86

Query: 81  RSALMTGKYPIHIGMQHG--VILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           RSAL+TG +  H  ++    ++ EG+ + LP     L + +K+AGY T A GKW LGF  
Sbjct: 87  RSALLTGMHTGHTVVRGNKEILPEGQ-YPLPDNTYTLAEAMKKAGYITGAFGKWGLGFPG 145

Query: 139 EVYTPTFRGFDSHYGY-WQGL-QDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
               P  +GFD+ +GY  Q L  +YY     A  +    L +  N     +T   Y+ +L
Sbjct: 146 SEGDPLNQGFDAFFGYNCQRLGHNYYPRHLWANKD---SLVLEGNSGFKKET---YAPEL 199

Query: 197 YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPE------- 249
             +  +  I E NK  P FLY+A +  HA  T     AP+E + K+     PE       
Sbjct: 200 IHDKTLEFI-EMNKDNPFFLYVASIIPHAELT-----APEEIMEKYKGKFAPETVYEGVD 253

Query: 250 ----------------RRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
                              +A MV  LD  VG ++A L K G+ +N++V+F +DNG  + 
Sbjct: 254 DGPEYKLGPYGSQKEPHAAFAAMVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNGPHTE 313

Query: 294 GIHSNK--GSNHPLRGMKSTPWDGGMR-GVAAIWSPWLK---QTQKVSSELFHISDWLPT 347
           G    +   S+ P RG K   ++GG+R  + A W   +K   +T+ VS+      D  PT
Sbjct: 314 GGADPEYFNSSGPFRGTKRDLYEGGIRVPMVAAWPGKIKPGSRTELVSA----FWDVFPT 369

Query: 348 LCAAAGIEINDTSLDGVN 365
               AGIE  +  LDG++
Sbjct: 370 FSDIAGIETPE-GLDGIS 386


>gi|296122626|ref|YP_003630404.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014966|gb|ADG68205.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 136/406 (33%), Positives = 197/406 (48%), Gaps = 49/406 (12%)

Query: 8   FFALTCTLLFNDA-FLNTTAPK---KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           +FAL   LLF  A F    A +   KP++++I  DDLG  D+   GSS   TP IDALA 
Sbjct: 6   YFAL---LLFMGAPFFPVEAKEMADKPNVLLIFIDDLGKTDIGIEGSSFYETPRIDALAK 62

Query: 64  NGLILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEA 122
           +G    Q Y    +C+P+R+ALMTGK P  +G+   +  E +   LP +E  + Q  +EA
Sbjct: 63  SGARFTQFYSAHPVCSPTRAALMTGKMPQRLGITDWIRPESD-VALPQSEVTIGQAFQEA 121

Query: 123 GYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPY---QGLDMR 179
           GY T  +GKWHLG   + + P  RGFD   G   G Q        + + PY   Q  D  
Sbjct: 122 GYHTAYLGKWHLGHKPQQH-PAARGFDWTKGVNHGGQP------SSYYFPYKNPQKPDAP 174

Query: 180 HNM-QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQ 233
           +N+   +      Y TD+ T +AI  + + ++++P FL LAH AVH       N  E +Q
Sbjct: 175 NNVPDFEKCQPEDYLTDVLTSSAIEHLQQRDRTRPFFLCLAHYAVHTPIQPPKNLVEKYQ 234

Query: 234 ----------APDEEVAKFLDISDPER--RTYAGMVSRLDESVGNVIAALRKHGMLENSI 281
                     +P E + +   IS  ++    YA MV  LD  VG ++  L+  G+L+ +I
Sbjct: 235 VKLATQKNPKSPGEGIQEGSAISRSQQDHPAYAAMVENLDTQVGRLLDELKTQGILDQTI 294

Query: 282 VLFMADNGAPSF--GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELF 339
           V+F +DNG      G       N PLR  K   ++GG+R    I  P      K+S ++ 
Sbjct: 295 VVFTSDNGGLCTLNGKSPGPTCNLPLRAGKGWTYEGGIRIPTYISWPG-----KISPQVL 349

Query: 340 HI----SDWLPTLCAAAGIEINDTS-LDGVNQWDVLTKGAKTKRSE 380
            I     D  PTL +   I    T  +DG++   +LTK +    SE
Sbjct: 350 DIPAYTCDIYPTLLSLCQIPPRPTQHVDGISLAGLLTKSSSLPESE 395


>gi|417411296|gb|JAA52089.1| Putative arylsulfatase a, partial [Desmodus rotundus]
          Length = 508

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 131/445 (29%), Positives = 206/445 (46%), Gaps = 33/445 (7%)

Query: 30  PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMTGK 88
           P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+TG+
Sbjct: 22  PNIVLIFADDLGYGDLGAYGHPSSTTPNLDQLATAGLRFTDFYVPVSLCTPSRAALLTGR 81

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTFRG 147
            P+  G+  GV+  G   GLPL E  L + L   GY T   GKWHLG   E  + P  +G
Sbjct: 82  LPVRSGLYPGVLEPGSQGGLPLEEVTLAEVLATQGYLTGMAGKWHLGVGPEGAFLPPRQG 141

Query: 148 FDSHYG--YWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVDNKTIGIYSTDL- 196
           F    G  Y        + +C     P  G+         +  N+ V+ +   +   +  
Sbjct: 142 FHRFLGIPYSHDQGPCQNLTCFPPATPCDGICDQGLVPVPLLANLSVEAQPPWLPGLEAR 201

Query: 197 YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           Y   A +++A+   + +P FLY A    H    Y  F                 R  +  
Sbjct: 202 YVAFARDLMADAQRQGRPFFLYYASHHTH----YPQFSGQSSA-------GRSGRGPFGD 250

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            +  LD +VG ++AAL+  G+L  ++V+F ADNG P     S+ G +  LR  K T ++G
Sbjct: 251 SLMELDAAVGALMAALQDLGLLRQTLVIFTADNG-PETMRMSHGGCSGLLRCGKGTTFEG 309

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAK 375
           G+R  A  + P    T  V+ EL    D LPTL A AG  + + +LDGV+   +L    K
Sbjct: 310 GVREPAVAFWPG-HITPGVTHELASSLDLLPTLAALAGAPLPNVTLDGVDLSSLLLGTGK 368

Query: 376 TKRSEILHNIDNVDNPQKYYAALRVDDLK---YVAGTDNNGQSDEWYGDTDNEIDKYSPK 432
           + R  ++      D  +  + A+R    K   +  G+ ++  + +      N +  + P 
Sbjct: 369 SPRQTMVFYPAYPDEVRGVF-AVRKGKYKAHFFTQGSAHSDTTADPACHASNPLTAHEPP 427

Query: 433 EV--LYSKAGITFNALKTKLQIKQK 455
            +  L    G  +N L++  ++  +
Sbjct: 428 LLFDLSEDPGENYNLLRSGAEVAPE 452


>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
 gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
          Length = 426

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/425 (28%), Positives = 192/425 (45%), Gaps = 38/425 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           ++P+++ IL DDLGW D+S +G+ +I TPN+D LA  G+     Y   A C+P+R +L T
Sbjct: 10  RRPNVLFILGDDLGWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYT 69

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+YP          +     GL  T   L   L++AGY T  IGKWH G+  + Y+PT  
Sbjct: 70  GRYPGRTKGGLAEPIADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCGYLPD-YSPTRS 128

Query: 147 GFDSHYGYWQGLQDYYDH-SCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVI 205
           G++  +G + G  +YY   S    ++ Y+G     +++        Y T + TE A+  +
Sbjct: 129 GWEEFFGNFGGALEYYSKLSSGGEYDLYEGDAEYRDLR--------YYTRILTERAVEYV 180

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT-----------YA 254
              +   P  L L     H     +  +    E+ + +   D  RR            Y 
Sbjct: 181 GRDHGRTPWLLNLNFTTPHWPWIADGDEETSAEIVRRIKAGD--RRALWHQDGGSLAKYR 238

Query: 255 GMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWD 314
            +V  LD SVG V+AAL++ G  E+++V F +DNG   F        N PL G K +  D
Sbjct: 239 QLVEDLDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY------NWPLSGNKGSLLD 292

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKG 373
           GG+R  + +  P   +  +VS    +  DW  TL    G   +    LDGV+    L  G
Sbjct: 293 GGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGGARPHPAYPLDGVSLKGYLLHG 352

Query: 374 AKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSDEWYGDTDNEIDKYSPKE 433
           A+    ++   +       +   ALR  + KY  G D   +      D   + DK S + 
Sbjct: 353 AEVPARDLFWRV-------RGERALRRGEWKYYRGKDGRDRLYRLSEDVREQADKASVRP 405

Query: 434 VLYSK 438
            L ++
Sbjct: 406 ELLTE 410


>gi|171912351|ref|ZP_02927821.1| aryl-sulphate sulphohydrolase [Verrucomicrobium spinosum DSM 4136]
          Length = 491

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 126/386 (32%), Positives = 182/386 (47%), Gaps = 55/386 (14%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPS 80
           L+   P +P+I+ ILADDLG+ DV+  GS    TPNID +A  G+   Q Y     C P+
Sbjct: 23  LHAATPARPNIVFILADDLGYTDVATLGSKYYETPNIDQMAAQGVKFTQGYSCGPNCQPT 82

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPW------------GLPLTEKLLPQYLKEAGYATHA 128
           R+ALM+G+Y    G+     ++  PW             LPL +  L Q LK AGYAT  
Sbjct: 83  RAALMSGQYGPRTGVYTVGGIDRFPWQTRPLRPVDNVVNLPLEKVTLAQTLKNAGYATAM 142

Query: 129 IGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKT 188
            GKWHLG   E Y P+ RGFD       G    +D + K                     
Sbjct: 143 FGKWHLGN-GEAYHPSKRGFDEAI-VSNGRHINFDTNPKVAVPE---------------- 184

Query: 189 IGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLD---I 245
              Y  D  T+ A++ I  H +S P FLYL H  VH+     P++A  + +A+F D   +
Sbjct: 185 -DAYLADFLTDRAVDFIQRHKES-PFFLYLPHYGVHS-----PYEAKADLIARFKDKPAV 237

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG----- 300
                  YA M++ +DESVG ++  L    + +N++V+F +DNG    G +  +G     
Sbjct: 238 GGHHSPVYAAMLASVDESVGRILKLLDDLKLSQNTLVVFSSDNGG--VGGYEREGIDAKG 295

Query: 301 --SNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-- 355
              N PLRG K   ++GG+R      W   +K  Q V++   +  D  PTL   AG +  
Sbjct: 296 ITDNAPLRGGKGMLYEGGVRVPYVFQWKGHIKAGQ-VNTTPINSVDLYPTLVELAGAQAP 354

Query: 356 -INDTSLDGVNQWDVLTKGAKTKRSE 380
            + +  LDGV+  + L +   T R  
Sbjct: 355 AVAEQPLDGVSYVNELIEPGATPRGR 380


>gi|167524284|ref|XP_001746478.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775240|gb|EDQ88865.1| predicted protein [Monosiga brevicollis MX1]
          Length = 520

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/400 (28%), Positives = 184/400 (46%), Gaps = 76/400 (19%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQ----IPTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           +PHI +++ DD GW +V +H S +    + TP IDAL   G+ LN+HYV   CTP+    
Sbjct: 30  RPHIFMVIVDDFGWAEVGYHRSEEYKQDVQTPTIDALVQEGIELNRHYVHYACTPN---- 85

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
                P   G            G+P     + ++L+ AGYATH +GKW +G     +TP 
Sbjct: 86  -----PCQPGC-----------GMPQNMTGVAEHLQSAGYATHFVGKWDVGMTTPQHTPK 129

Query: 145 FRGFDSHYGY------------WQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIY 192
            RG+D+   Y            WQG    Y+H  ++   P +      +    +K     
Sbjct: 130 GRGYDTSLNYFGHGNWMWSEAEWQG---SYNH--RSDLPPCEPPACFKDFWETDKPAAEL 184

Query: 193 STDLYTEAAI-----NVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISD 247
           +   Y EA       +++ +H+ S P+FL  A    H      P QAP E   KF  I +
Sbjct: 185 NGTQYEEAIFRDRVYSILEQHDPSTPLFLTYASKVAHY-----PLQAPIEYQEKFSFIEE 239

Query: 248 PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA------------PSFG- 294
           P RR Y  MV+ LD+++ N  + L+  GM  N++++  +DNG              S G 
Sbjct: 240 PHRRVYHAMVAFLDDNLANFTSLLKSKGMWNNTLMVLTSDNGGYVKSFLGDCDNNTSHGY 299

Query: 295 --IHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVS--SELFHISDWLPTLCA 350
              +   G+N+PL+G K +  +GG+R  A +   ++ + ++ +   E+ H++DW  T  A
Sbjct: 300 ACFNGEAGANYPLQGGKYSTLEGGVRVNAFVSGGFVPEERRGTREDEMVHVADWYATFAA 359

Query: 351 AAGIEIND--------TSLDGVNQWDVLTKGAKTKRSEIL 382
            AG++  D          +D +N W +L+    +   + L
Sbjct: 360 LAGVDPTDHRAAQHNLPPIDSLNIWPLLSGANVSSPRQTL 399


>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
 gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
          Length = 470

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 123/390 (31%), Positives = 185/390 (47%), Gaps = 40/390 (10%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           KKP+II ILADDLG+ D+S +G   I TP ID +  +G+ L Q Y  + +C+ +R+A+MT
Sbjct: 34  KKPNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNSPVCSATRTAIMT 93

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y   + +     L G   GLP ++  LP  LK+AGY T  IGKWHLG + + Y P   
Sbjct: 94  GQYQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHLGAYPK-YGPLKS 152

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIA 206
           G+D  YG+      YY+H          G D   +     K    Y TDL  +  + +I 
Sbjct: 153 GYDHFYGFRGSALSYYNH----------GKDFWEDDAPVEKA--GYFTDLLGDKTVELIQ 200

Query: 207 EHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK--------FLDISDPERRTYAGMV 257
           + +   +P F      +VH    + P++AP E+  K                +  Y  +V
Sbjct: 201 KSDSCERPFF-----ASVHFNAPHWPWEAPGEQGKKEAARKKNSLAAFDAGTQDVYRQIV 255

Query: 258 SRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGM 317
            R+D  VG ++ AL    + EN+IV+F +DNG   F       +  P  G KS   +GG+
Sbjct: 256 ERMDFQVGRILQALTDKKIRENTIVVFTSDNGGERF------SNTWPFSGKKSELLEGGI 309

Query: 318 RGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDT-SLDGVNQWDVLTKGAKT 376
           R  + I  P    + + S ++    DWLPTL  AA   ++     DG++   +L      
Sbjct: 310 RIPSLICWPAKIPSGQTSEQVTLSMDWLPTLLEAAETNVDQNHQSDGLSILPMLKGDEGV 369

Query: 377 KRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
           K   +        N Q+   A R+ D KY+
Sbjct: 370 KSRTVFWRYKG--NAQR---AARMGDYKYL 394


>gi|423330334|ref|ZP_17308118.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231950|gb|EKN24798.1| hypothetical protein HMPREF1075_00131 [Parabacteroides distasonis
           CL03T12C09]
          Length = 468

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 185/394 (46%), Gaps = 43/394 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K+   L   L F+ A + T A ++P++II+  DD G+ D+  +GS++  TP+ID +A  G
Sbjct: 2   KFISKLISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGSTKHRTPHIDQMAKEG 61

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ------------HGVILEGEPWGLPLTE 112
           + L   YV  ++ TPSRSAL+TG YP  + M               V+      GL   E
Sbjct: 62  IRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGE 121

Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
             + + +KE GYAT  IGKWHLG  +  + PT +GFD +YG        Y +       P
Sbjct: 122 ITIAELMKEQGYATACIGKWHLG-DQLPFLPTRQGFDYYYGI------PYSNDMD---RP 171

Query: 173 YQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
           Y  L +    +V    +G  S T  YT   +  I  H +S P F+YL H   H      P
Sbjct: 172 YCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSHKES-PFFIYLCHNMTH-----NP 225

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
             A      K       +   Y      LD S+G ++  L++ G+ +N++V+F +DNGA 
Sbjct: 226 LAASPAFKGK------SQNGLYGDATEELDWSMGVLLETLKEEGLDQNTLVIFTSDNGAD 279

Query: 292 S-FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
             FG     G+N PLRG K T ++GG R    +  P      + +  L    D+LPTL  
Sbjct: 280 EHFG-----GTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAH 334

Query: 351 AAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
             G  + +D  +DG N   +L   +    +E  +
Sbjct: 335 YCGYAVPSDRVIDGHNVSGILEGESMASPTETFY 368


>gi|254482912|ref|ZP_05096148.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
 gi|214036784|gb|EEB77455.1| sulfatase, putative [marine gamma proteobacterium HTCC2148]
          Length = 470

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 130/399 (32%), Positives = 197/399 (49%), Gaps = 40/399 (10%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGL 66
           A   TL+F+  F  T A  KP+I++++ D+ G+ +V  +G   +   PTPNID++A  G 
Sbjct: 12  AYCSTLVFSVLFSVTAAASKPNIVMVVMDNFGYGEVGVYGGGMLRGAPTPNIDSIATEGF 71

Query: 67  ILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYAT 126
            L    V+A CTPSRS+LMTG+Y I    +      G  +G+   E  L + L +AGYAT
Sbjct: 72  QLTNFNVEAECTPSRSSLMTGRYGIRTRQRPNDEPRGIWYGITPWEITLAEMLSDAGYAT 131

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSC---KATFEPYQGLDMRHNMQ 183
              GKWHLG     Y PT +GFD     W G+ +  D +      +F+   G++  H M+
Sbjct: 132 GMFGKWHLGDEEGRY-PTDQGFDE----WYGIPNSSDQAFWPDSDSFQKDAGVEFTHVME 186

Query: 184 VDN----KTIGIYSTDL-------YTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEP 231
                  K   +Y  +         T+ AI+ I    K+ KP F YL +   H     EP
Sbjct: 187 SKRGQKPKKKDVYGREKRKTIDREITDRAIDFIKRKAKAGKPFFAYLPYTQTH-----EP 241

Query: 232 FQA-PDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
             A PD + +           ++A ++++ D  VG ++  +   G  +N+I +F +DNG 
Sbjct: 242 VDAHPDFKGST-------GNGSFADVLAQTDSYVGELLKTIDNLGFKDNTIFIFTSDNGR 294

Query: 291 PSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
              GI  + G   P RG    P++G +R    I  P     +KVS+++ H+ D  PT+  
Sbjct: 295 E--GIKRSFGFTGPWRGTMFAPYEGSLRVPFLIRYPDKIPAKKVSNDIVHLIDIFPTIAK 352

Query: 351 AAGIEI-NDTSLDGVNQWDVLT-KGAKTKRSEILHNIDN 387
            +G EI  D  LDGV+Q D LT K  K+ R  ++  I N
Sbjct: 353 LSGGEIPQDRILDGVDQTDFLTGKSEKSARESVIIYIGN 391


>gi|432091635|gb|ELK24657.1| Arylsulfatase A [Myotis davidii]
          Length = 557

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/380 (32%), Positives = 184/380 (48%), Gaps = 31/380 (8%)

Query: 18  NDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-AL 76
           ND  L    P  P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +L
Sbjct: 61  NDNGLAAAYP--PNIVLIFADDLGYGDLGSYGHPTSTTPNLDQLATAGLRFTDFYVPVSL 118

Query: 77  CTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF 136
           CTPSR+AL+TG+ P+  G+  GV+  G   GLPL E  L + L + GY T   GKWHLG 
Sbjct: 119 CTPSRAALLTGRLPVRSGLYPGVLEPGSRGGLPLEEVTLAEILADQGYLTGMAGKWHLGV 178

Query: 137 FRE-VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQGL--------DMRHNMQVD 185
             E  + P  +GF    G  Y        + +C     P  G+         +  N+ V+
Sbjct: 179 GPEGTFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPATPCHGICDQGLVPVPLLANLSVE 238

Query: 186 NKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
            +   +   +  Y   A +++ + H + +P FLY A    H    Y  F         F 
Sbjct: 239 AQPPWLPGLEARYVAFAHDLMVDAHRQGRPFFLYYASHHTH----YPQFSG-----QNFA 289

Query: 244 DISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH 303
             S   R  +   +  LD +VG ++ A+   G+L++++V+F ADNG  +  + S  G + 
Sbjct: 290 GHSG--RGPFGDSLMELDSAVGALMTAVGDLGLLKDTLVIFTADNGPETMRM-SRGGCSG 346

Query: 304 PLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLD 362
           PLR  K T ++GG+R  A A W   +  T  V+ EL    D LPTL A AG  + + +LD
Sbjct: 347 PLRCGKGTTFEGGVREPALAFWPGHI--TPGVTHELASSLDLLPTLAALAGAPLPNVTLD 404

Query: 363 GVNQWDVLTKGAKTKRSEIL 382
           G +   +L    K+ R  + 
Sbjct: 405 GFDLSPLLLGKGKSPRQTMF 424


>gi|67972647|ref|NP_033843.2| arylsulfatase A precursor [Mus musculus]
 gi|341940244|sp|P50428.2|ARSA_MOUSE RecName: Full=Arylsulfatase A; Short=ASA; AltName:
           Full=Cerebroside-sulfatase; Flags: Precursor
 gi|12835776|dbj|BAB23356.1| unnamed protein product [Mus musculus]
 gi|15030075|gb|AAH11284.1| Arylsulfatase A [Mus musculus]
 gi|67867472|gb|AAH98075.1| Arylsulfatase A [Mus musculus]
 gi|74208915|dbj|BAE21207.1| unnamed protein product [Mus musculus]
 gi|148672389|gb|EDL04336.1| arylsulfatase A, isoform CRA_b [Mus musculus]
          Length = 506

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/390 (33%), Positives = 189/390 (48%), Gaps = 33/390 (8%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
            AL    L   A L+T +P  P+I++I ADDLG+ D+  +G     TPN+D LA  GL  
Sbjct: 1   MALGTLFLALAAGLSTASP--PNILLIFADDLGYGDLGSYGHPSSTTPNLDQLAEGGLRF 58

Query: 69  NQHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
              YV  +LCTPSR+AL+TG+ P+  GM  GV+      GLPL E  L + L   GY T 
Sbjct: 59  TDFYVPVSLCTPSRAALLTGRLPVRSGMYPGVLGPSSQGGLPLEEVTLAEVLAARGYLTG 118

Query: 128 AIGKWHLGFFRE-VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL- 176
             GKWHLG   E  + P  +GF        SH  G  Q L  +  D  CK   +  QGL 
Sbjct: 119 MAGKWHLGVGPEGAFLPPHQGFHRFLGIPYSHDQGPCQNLTCFPPDIPCKGGCD--QGLV 176

Query: 177 --DMRHNMQVDNKTIGIYSTDL-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPF 232
              +  N+ V+ +   +   +  Y   + +++A+   + +P FLY A    H    Y  F
Sbjct: 177 PIPLLANLTVEAQPPWLPGLEARYVSFSRDLMADAQRQGRPFFLYYASHHTH----YPQF 232

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
                    F   S   R  +   +  LD +VG ++  +   G+LE ++V+F ADNG P 
Sbjct: 233 SG-----QSFTKRSG--RGPFGDSLMELDGAVGALMTTVGDLGLLEETLVIFTADNG-PE 284

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
               SN G +  LR  K T ++GG+R  A ++ P    T  V+ EL    D LPTL A  
Sbjct: 285 LMRMSNGGCSGLLRCGKGTTFEGGVREPALVYWPG-HITPGVTHELASSLDLLPTLAALT 343

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           G  + + +LDGV+   +L    K+ R  + 
Sbjct: 344 GAPLPNVTLDGVDISPLLLGTGKSPRKSVF 373


>gi|332265013|ref|XP_003281524.1| PREDICTED: arylsulfatase A isoform 1 [Nomascus leucogenys]
          Length = 509

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 191/404 (47%), Gaps = 30/404 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 21  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDRLAAGGLRFTDFYVPVSLCTPSRAALLT 80

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  + + L   GY T   GKWHLG   E  + P  
Sbjct: 81  GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 140

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 200

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 201 ARYVAFARDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 249

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+R  G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 250 GDSLMELDAAVGTLMTAIRDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 308

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPT+ A AG  + + +LDG +   +L  
Sbjct: 309 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTVAALAGAPLPNVTLDGFDLSPLLLG 366

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             K+ R  +       D  +  + A+R    K    T  +  SD
Sbjct: 367 TGKSPRQSLFFYPSYPDEVRGVF-AVRSGKYKAHFFTQGSAHSD 409


>gi|262383797|ref|ZP_06076933.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294695|gb|EEY82627.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 468

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 124/394 (31%), Positives = 185/394 (46%), Gaps = 43/394 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K+   L   L F+ A + T A ++P++II+  DD G+ D+  +GS++  TP+ID +A  G
Sbjct: 2   KFISKLISVLAFSGAAVATQAAERPNVIIVFIDDFGYGDLGCYGSTKHRTPHIDQMAKEG 61

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQ------------HGVILEGEPWGLPLTE 112
           + L   YV  ++ TPSRSAL+TG YP  + M               V+      GL   E
Sbjct: 62  IRLTDFYVGSSVSTPSRSALLTGCYPRRVSMHVNADPTPLMSKGRQVLFPASHKGLNPGE 121

Query: 113 KLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEP 172
             + + +KE GYAT  IGKWHLG  +  + PT +GFD +YG        Y +       P
Sbjct: 122 ITIAELMKEQGYATACIGKWHLG-DQLPFLPTRQGFDYYYGI------PYSNDMD---RP 171

Query: 173 YQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEP 231
           Y  L +    +V    +G  S T  YT   +  I  H +S P F+YL H   H      P
Sbjct: 172 YCPLPLMEQEEVIVAPVGHDSLTIRYTNKTVEFIKSHKES-PFFIYLCHNMTH-----NP 225

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
             A      K       +   Y      LD S+G ++  L++ G+ +N++V+F +DNGA 
Sbjct: 226 LAASPAFKGK------SQNGLYGDATEELDWSMGVLLETLKEEGLDQNTLVIFTSDNGAD 279

Query: 292 S-FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA 350
             FG     G+N PLRG K T ++GG R    +  P      + +  L    D+LPTL  
Sbjct: 280 EHFG-----GTNRPLRGQKGTTYEGGFRVPCIMRWPAKIPAGQETDNLVTSMDFLPTLAH 334

Query: 351 AAGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILH 383
             G  + +D  +DG N   +L   +    +E  +
Sbjct: 335 YCGYAVPSDRVIDGHNVSGILEGESMASPTETFY 368


>gi|340377676|ref|XP_003387355.1| PREDICTED: arylsulfatase A-like [Amphimedon queenslandica]
          Length = 501

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 129/429 (30%), Positives = 189/429 (44%), Gaps = 60/429 (13%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-V 73
            +F    +N  A  KP+III+ ADDLG+ D+  +G     TPN++++A  GL+L Q Y  
Sbjct: 10  CIFVSLRVNGAAVNKPNIIILFADDLGYGDLQIYGHPTSSTPNLNSMASEGLVLTQFYSA 69

Query: 74  QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWH 133
             +C+PSR+ALMTG+YP   G+  GV+      GLP  E  + + LK AGY T  +GKWH
Sbjct: 70  NPVCSPSRAALMTGRYPPRTGIWPGVLFPNSIGGLPHNETTIAEALKTAGYKTGMVGKWH 129

Query: 134 LGFFRE-VYTPTFRGFDSHYGYWQGLQDYYDHS-CKATFEP---------------YQGL 176
           LG      Y PT  GFD    Y+ G+    D   C   + P               Y G 
Sbjct: 130 LGVGENGTYLPTNHGFD----YYLGIPYSNDMGPCTICYYPNEACISPLKNPCNGGYSGC 185

Query: 177 DMRHNMQVDNKTIGIYS-TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP 235
            +  N  +  + +   +  + Y+ AA   I  +    P FLY+A       +T++P Q  
Sbjct: 186 PVFENADIIEQPVNFLTLAERYSNAATGFIKANAGKNPFFLYMAF-----SHTHKP-QFS 239

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
            EE             T  G+ +  D  V             +N+ V F +DNG P+   
Sbjct: 240 GEEFTN---------STLRGLSTLKDTQVD------------DNTFVFFTSDNG-PAQHT 277

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE 355
               G+   LR  K + WDGGMR     W P  K  Q  ++EL    D  PT+ +  G +
Sbjct: 278 PPKGGNGGLLRCGKGSTWDGGMREPGIAWWPG-KIRQGRTAELAATVDLFPTIMSITGAK 336

Query: 356 INDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLK---YVAGTDNN 412
                +DG++   +L     + R   ++   +  NP   Y A+R    K   Y +G    
Sbjct: 337 SGPAFIDGIDMSPILFDSKPSNRESYVY-FPSSYNPSTGYHAIRWKQYKAHYYTSG---- 391

Query: 413 GQSDEWYGD 421
           G  D+ Y D
Sbjct: 392 GVCDKIYPD 400


>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 443

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 140/465 (30%), Positives = 215/465 (46%), Gaps = 53/465 (11%)

Query: 13  CTLLFNDAFL--NTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL-ILN 69
           C  LF   F+  +T   + P+II I+ DDLG+ D+S +G     TPNID LA  G+  +N
Sbjct: 12  CLPLFVLTFMVYSTVHAQHPNIIYIMTDDLGYGDLSGYGRKDFLTPNIDKLAAQGIKFVN 71

Query: 70  QHYVQALCTPSRSALMTGKYPIH--IGMQHGVILEGEPWGLPLTEKL--LPQYLKEAGYA 125
            +    LCTP+R+A MTG+YP    +G+   +        + LT     +   ++ +GY 
Sbjct: 72  AYSAAPLCTPTRTAFMTGRYPARTPVGLMEPLTPSKRDSTVGLTAAFPSVATLMRASGYE 131

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVD 185
           T  IGKWHLGF  +  +P   GFD  +G   G  DY  H               H++  +
Sbjct: 132 TALIGKWHLGFLPQ-NSPVKNGFDYFFGIHSGAADYISHKTGPAGRRI------HDLYEN 184

Query: 186 NKTI--GIYSTDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           ++ +    Y TDL+T+ A+  + + HN  KP FL L + A H      P+Q P+++   +
Sbjct: 185 DQAVYPEGYLTDLFTQKAVTFLKQKHN--KPFFLTLTYNAAHW-----PWQGPNDK--PY 235

Query: 243 LDISD----PERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSN 298
           +D  D         YA M++RLDE +G +I  L +  +   +IV+F  DNG   +     
Sbjct: 236 VDSVDLRIGGSAAVYAAMMNRLDEGIGEIIKTLDEEQLSNQTIVIFTNDNGGEKY----- 290

Query: 299 KGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHIS-DWLPTLCAAAGIEIN 357
              N  L   K   W+GG+R  A +  P  K T   S++   I+ DW  T+ +A G +++
Sbjct: 291 -SDNGGLAKAKGALWEGGIRVPAIVKWPG-KITAGSSTQQVAITMDWTKTILSAGGAKMD 348

Query: 358 -DTSLDGVNQWDVLTKGAK-TKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQS 415
            +  LDG++   +LT   K   R+        +        A+R+ D KY+   D  G+ 
Sbjct: 349 KNFPLDGIDLLPILTGDKKIIPRTIYWRTFQRLKQ-----KAIRMGDWKYL--QDEKGEY 401

Query: 416 DEWYGDTDNEIDKYSPKEVLYSKAGITFNALKTKLQIKQKHAADP 460
                   N I     K  L +K  I F  LK K    +K    P
Sbjct: 402 ------LFNLIADQEEKNDLKAKEEIIFKRLKKKYADWEKTVLQP 440


>gi|340618178|ref|YP_004736631.1| sulfatase [Zobellia galactanivorans]
 gi|339732975|emb|CAZ96350.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 494

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 134/415 (32%), Positives = 201/415 (48%), Gaps = 50/415 (12%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A +KP+II ILADDLG+ D+S +G  +  TP ID LA  G+   QHY  A +C P+RS+L
Sbjct: 42  AQQKPNIIYILADDLGYGDLSSYGQEKFETPRIDELAQRGMKFTQHYSGAPVCAPARSSL 101

Query: 85  MTGKYPIHIGMQHGVILEGEPWG-LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTP 143
           MTG++  H  ++  + LEGE    LP     + + LK+AGY T A GKW LGF      P
Sbjct: 102 MTGQHTGHTPIRGNMELEGEGQTPLPGNSVTIAEMLKKAGYITGAFGKWGLGFIGTEGDP 161

Query: 144 TFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQV------DNKTIGIYSTDLY 197
             +GFD  YGY          +C+     Y    + HN +       D      Y+ D  
Sbjct: 162 VAQGFDEFYGY----------NCQRMSHRYYPTHLWHNEEKVLLEGNDWTKTETYAPDKI 211

Query: 198 TEAAINVIAEHNKSKPMFLYLAHLAVHA------GNTYEPFQAPDEE---VAKFLDISD- 247
             A +N I E NK KP F Y+  +  HA       + ++ F+   EE    A+ L  SD 
Sbjct: 212 QAATLNFI-ETNKEKPFFAYVPFVLPHAELISPNDSIFDKFKDKYEETPYTAENLYTSDY 270

Query: 248 ----------PERR---TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
                     P+ +    +A MV R+D  VG ++  L + G+ +N+IV+F +DNG  + G
Sbjct: 271 GPDIVPQEYAPQLKPHAAFASMVYRMDVYVGQILDKLDELGIADNTIVMFTSDNGPHAEG 330

Query: 295 IHSNK--GSNHPLRGMKSTPWDGGMRG-VAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
               +   S+  L+G+K   ++GG+R    A+W   +K     S  +    D +PTL   
Sbjct: 331 GADPEFFNSSGGLKGIKRDLYEGGIRSPFLAVWPNKIK-AGSTSDHVSAFWDLMPTLAEI 389

Query: 352 AGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYV 406
              ++  +S DG++    L      K+ E L+   ++ N +K   A+R  D K V
Sbjct: 390 TNTDVPASS-DGISFLPTLLGKKDQKQHEYLYWEFSIQNGRK---AVRKGDWKGV 440


>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
           boliviensis]
          Length = 482

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 168/362 (46%), Gaps = 25/362 (6%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
           +GW D+  +G     TPN+D +A  G +    Y    LC+PSR+AL+TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 96  ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
               ++    +    G+P +E+LLP+ LKEAGY T  +GKWHLG  R  + P   GFD  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLGH-RPQFHPLKHGFDEW 119

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           +G        YD+  +     Y+  +M         ++ KT     T +Y + A++ I  
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
             + +P FLY A  A HA         P      FL  S   R  Y   V  +D+S+G +
Sbjct: 180 QARRRPFFLYWAVDATHA---------PVYASKPFLGTS--RRGRYGDAVREMDDSIGKM 228

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L+   + +++ V F +DNGA         GSN P    K T ++GGMR  A  W P 
Sbjct: 229 LQLLQDLRVADSTFVFFTSDNGAALISAPEQGGSNSPFLCGKQTTFEGGMREPALAWWPG 288

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
                +VS +L  I D   T  A AG+   +D ++DG+N    L +G    R    +  D
Sbjct: 289 HITPGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPALLQGRVMDRPVFYYRGD 348

Query: 387 NV 388
            +
Sbjct: 349 TL 350


>gi|372210892|ref|ZP_09498694.1| N-acetylgalactosamine-6-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 463

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 172/346 (49%), Gaps = 35/346 (10%)

Query: 6   KYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNG 65
           K    +   + F    LN TA +KP++I ILADDLG+ D+S +G   + TPNID LA  G
Sbjct: 5   KTVLRIVSLVAFLTLGLNVTAQEKPNVIYILADDLGYGDLSCYGQKILSTPNIDRLAKKG 64

Query: 66  LILNQHYV-QALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGY 124
           +    HY   A+C+PSR+ LMTG+ P H+ +      + EP G+     +LP+  K AGY
Sbjct: 65  MKFTDHYSGNAVCSPSRAVLMTGQNPAHVYITGNS--KAEP-GMDSEMTVLPELFKSAGY 121

Query: 125 ATHAIGKWHLG--FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNM 182
            T A GKW LG    ++   P  +GFD  YG+            + T   Y    M HN 
Sbjct: 122 VTGAFGKWGLGDTTAKDNRNPLHQGFDEFYGW----------KTQVTAHTYYPTKMVHNG 171

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSK-PMFLY----LAHLAVHA---GNTYEPFQA 234
           Q      G Y  D+  E A + I E++K+K P F Y    + H A+HA    +     + 
Sbjct: 172 QEVPIKKGTYVHDVIMEHAFDFIKENSKNKQPFFCYIPTAIPHAAMHAPKKLHKKWRKKL 231

Query: 235 PDEEVAKFLDISDPERR--------TYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
           P  +       +DPE           +  MV  LD  VG ++  L+K+ + +N++++F +
Sbjct: 232 PQFDTIIGTYYADPEPTPDVINPIAGFGAMVEHLDNQVGQILKMLKKYKVDKNTLIIFTS 291

Query: 287 DNGAPSFGIHSNK--GSNHPLRGMKSTPWDGGMRG-VAAIWSPWLK 329
           DNGA   G H+ +   SN PLRG K   ++GG+R  + A W   +K
Sbjct: 292 DNGAHHEGGHNPEFWNSNGPLRGGKRDVYEGGIRAPMLAHWPTKIK 337


>gi|432342876|ref|ZP_19592103.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
 gi|430772100|gb|ELB87901.1| arylsulfatase [Rhodococcus wratislaviensis IFP 2016]
          Length = 459

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 117/375 (31%), Positives = 177/375 (47%), Gaps = 37/375 (9%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+++ IL D++GW D   +G +  PTP ID LA  G+  N + V+A CTP+RSA+MTG+
Sbjct: 6   KPNVVFILVDNVGWGDFGVYGGTT-PTPRIDKLASEGIRFNDYNVEAQCTPTRSAIMTGR 64

Query: 89  YPIHIGMQHGVILEGEPW-GLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRG 147
           +P+  G  + +   GE   G+   E  +   L EAGYAT   GKWHLG   E   PT +G
Sbjct: 65  HPVRSG-TYKLSWPGEGMSGMAPWEYTIADLLSEAGYATALYGKWHLG-NTEGRLPTDQG 122

Query: 148 FDSHYGYWQGLQDYYDHSCKATFEPYQGLDM--------------RHNMQVDNKTIGIYS 193
           FD     W G++D +D +    +  YQ L +                 M +D     I  
Sbjct: 123 FDE----WWGIKDSWDEAGYTAWPLYQELGLPAPMIWEGRKGEQSTPAMPLDLDVRPIVD 178

Query: 194 TDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRT 252
                   +  I ++  + KP F+YL +  +H      P          + + S      
Sbjct: 179 EKYLVPKTVEFIKKNAAAKKPFFVYLGYSQMHPPMIANP---------NYANTSTARSGV 229

Query: 253 YAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRG-MKST 311
           YA  ++ +D+ VG ++ A++  G+ +N+I++  +DN       H   GSN P RG   +T
Sbjct: 230 YADCLAEMDDRVGQIVDAVKAAGIEDNTIIVLSSDNAGGGALPHVGPGSNGPWRGNFFNT 289

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG---IEINDTSLDGVNQWD 368
           P++G MR  A I  P       V+++     DWLPTL    G   +   D  +DGV+   
Sbjct: 290 PFEGSMRVSAMIRWPGAIPAGVVTAQTLAAVDWLPTLAGMVGASELVPRDRPIDGVDASA 349

Query: 369 -VLTKGAKTKRSEIL 382
            +L K  KT R   L
Sbjct: 350 FMLGKSDKTGRDSYL 364


>gi|348551622|ref|XP_003461629.1| PREDICTED: arylsulfatase A [Cavia porcellus]
          Length = 507

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/389 (32%), Positives = 185/389 (47%), Gaps = 53/389 (13%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCT 78
           A L TT+P  P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCT
Sbjct: 13  AGLATTSP--PNIVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCT 70

Query: 79  PSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFR 138
           PSR+AL+TG+ P+ +GM  GV+      GLPL E  L + L   GY T   GKWHLG   
Sbjct: 71  PSRAALLTGRLPVRMGMYPGVLGPKSKGGLPLEEVTLAEVLASHGYLTGMAGKWHLGV-- 128

Query: 139 EVYTPTFRGFDSHYGYWQGLQDYYDH---SCK--ATFEPYQGLD-----------MRHNM 182
               P       H+G+ + L   Y H    C+    F P    D           +  N+
Sbjct: 129 ---GPGGAFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATSCDGTCDQDEVPIPLLANL 185

Query: 183 QVDNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQ 233
            ++ +   +         ++ DL T+A         + +P FLY A    H    Y  F 
Sbjct: 186 TIEAQPPWLPGLEARYVAFARDLMTDA-------QRQGRPFFLYYASHHTH----YPQFS 234

Query: 234 APDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSF 293
                   F+  S   R  +   +  LD +VG ++ ALR  G+L +++V+F +DNG  + 
Sbjct: 235 G-----QSFVGRSG--RGPFGDSLMELDAAVGALMLALRDLGLLRDTLVIFTSDNGPETM 287

Query: 294 GIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
            + S  G++  LR  K T ++GG+R  A  + P    T  V+ EL    D LPTL A +G
Sbjct: 288 RM-SRGGNSGLLRCGKGTTFEGGVREPAVAFWPG-HITPGVTHELASSLDLLPTLAALSG 345

Query: 354 IEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
             + + +LDG++   VL    K+ R  + 
Sbjct: 346 ASLPNVTLDGMDISSVLMSSGKSPRESLF 374


>gi|340619560|ref|YP_004738013.1| sulfatase [Zobellia galactanivorans]
 gi|339734357|emb|CAZ97734.1| Sulfatase, family S1-14 [Zobellia galactanivorans]
          Length = 483

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 128/405 (31%), Positives = 189/405 (46%), Gaps = 36/405 (8%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +   L   L FN +       K+P+++++  DD G+ DV  +G++   TP+ID LA  G 
Sbjct: 6   FALVLVTLLSFNFSIAQNQDSKRPNVVLVFIDDEGYGDVGVYGATGFETPHIDQLAAKGT 65

Query: 67  ILNQHY-VQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYA 125
               +Y  Q +C+ SR+ LMTG YP  IG   G +      GL   E  + +  KE  Y 
Sbjct: 66  RFTNYYSAQPVCSASRAGLMTGCYPNRIGFS-GALFPRHEIGLNKDELTMAEMFKEQDYK 124

Query: 126 THAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ-- 183
           T   GKWH+G+ +E + P   GFD   G      D + H   AT E       R      
Sbjct: 125 TACFGKWHMGWQKE-FLPLQHGFDEFVGLPYS-NDMWPHDS-ATGEHLSAEKNRAKFPDL 181

Query: 184 ---VDNKTIGIYS--------TDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPF 232
                NKTI            T LYTE A++ I   N ++P FLY+ H   H      P 
Sbjct: 182 PIIEGNKTIAYVKSLKDQDKLTTLYTEKAVDFI-NRNANEPFFLYVPHTMGHI-----PL 235

Query: 233 QAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS 292
              D+   K       E+  Y  ++  +D SVG +  AL+++G+ +N+I +F  DNG P 
Sbjct: 236 GVSDKFRGK------SEQGLYGDVMMEIDWSVGEIDKALKQNGIADNTIFIFTTDNG-PW 288

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
               ++ GS+  LR  K+T W+GG R    I  P       VS +L    D LPT  A  
Sbjct: 289 LNFGNHAGSSGGLREGKTTSWEGGQRVPFIIRWPGKTPEGTVSDKLACAVDLLPTFAAIT 348

Query: 353 GIEINDTSLDGVNQWDVLTKG--AKTKRSEIL--HNIDNVDNPQK 393
             +++D  +DGV+    L KG  ++T R+ IL  H  +N++  +K
Sbjct: 349 NGKLSDNKIDGVD-ISPLWKGQFSETPRNTILYYHGKNNLNGVRK 392


>gi|363727329|ref|XP_424471.3| PREDICTED: arylsulfatase A [Gallus gallus]
          Length = 506

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 132/411 (32%), Positives = 198/411 (48%), Gaps = 32/411 (7%)

Query: 22  LNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPS 80
           L   A   P  +++LADDLG+ D+  +G     TPN+D +A  GL     Y   A+C+PS
Sbjct: 18  LRGAAGGPPSFVLLLADDLGFGDLGSYGHPSSATPNLDRMAARGLRFTDFYSSSAVCSPS 77

Query: 81  RSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF-FRE 139
           R+AL+TG++ +  G+  GV   G   GLPL+E  + + LK  GYAT  +GKWHLG   R 
Sbjct: 78  RAALLTGRFQMRSGIYPGVFYPGSRGGLPLSEVTIAEVLKAKGYATAIVGKWHLGLGARG 137

Query: 140 VYTPTFRGFD-------SH-YGYWQGLQDY-YDHSCKATFEPYQGL---DMRHNMQVDNK 187
            + P  +GFD       SH  G  Q L  +  D  C  T +  QGL    +  N  +  +
Sbjct: 138 SFLPIHQGFDHFLGVPYSHDQGPCQNLTCFPPDIKCFGTCD--QGLVPVPLFWNQSIVQQ 195

Query: 188 TIGIYS-TDLYTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
            +       LY + A + IA+   +  P  LY A     + +T+ P  A  E   +    
Sbjct: 196 PVSFPDLVPLYNKFARDFIADCARRGVPFLLYYA-----SHHTHYPQFASQEYAGR---- 246

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
              +R  +   +S  D SVG ++ AL+++G+   + V F +DNG PS    +  GS+  L
Sbjct: 247 --SQRGPFGDALSEFDGSVGQLLQALQENGLENTTFVFFTSDNG-PSTMRMARGGSSGLL 303

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
           +  K T ++GGMR  A  + P  +    V+ EL    D LPTL A AG  + + SLDG +
Sbjct: 304 KCGKGTTYEGGMREPAVAYWPG-RIAPGVTHELASTLDILPTLTALAGAALPNVSLDGYD 362

Query: 366 QWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
              +L +  K+ R  +     + D P     A+R+   K    T  +  SD
Sbjct: 363 LSPLLFESGKSPRQTMFFYPPSPD-PLHGPFAVRLGKYKAHYFTQGSFHSD 412


>gi|372210887|ref|ZP_09498689.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 476

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 128/426 (30%), Positives = 198/426 (46%), Gaps = 60/426 (14%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           K+P+II ILADD+G  D+  +G  +I TPN+DALA  G+   QHY   A+C PSRS+LMT
Sbjct: 29  KQPNIIYILADDMGIGDLGTYGQKKIKTPNLDALAAGGIKFTQHYSGSAVCAPSRSSLMT 88

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEK--LLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G++  H  ++  + L+G     PL E    L + LK+AGY T   GKW LGF      P 
Sbjct: 89  GQHTGHTPIRGNMELKGIGGQQPLPESSITLAEMLKKAGYTTGMFGKWGLGFGEG--DPN 146

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN------KTIGIYSTDLYT 198
            +GFD  YG       Y D      + P     + HN ++D+       +  +Y+ D   
Sbjct: 147 KQGFDEFYG-------YNDQKLAHRYYPTH---LNHNQEIDSLEGNDWTSTEVYAQDKIQ 196

Query: 199 EAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI------------- 245
           +  +  I + NK KP F Y+  +  HA           + V KF ++             
Sbjct: 197 QKTLEFI-QTNKDKPFFAYVPMVLPHAELISPQDSILQQYVGKFDEVPYKNHPYLSDYGP 255

Query: 246 ---------SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIH 296
                     +     YA MV+R+D  +G +++ L+K G+  N+I++F +DNGA     H
Sbjct: 256 NMVYYKYCSQETPYAVYASMVTRMDAYIGQIVSKLKKLGIDNNTIIMFSSDNGA-----H 310

Query: 297 SNKGSN-------HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
           +  G+N         LRG+K   ++GG+R    +  P    +  V+  +    D +PTL 
Sbjct: 311 AAGGANPEFFDSTDGLRGIKRDLYEGGIRAPFIVAWPGTIPSGVVTDHISAFWDVMPTLA 370

Query: 350 AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGT 409
              GI+    S DG++    L    +    E ++   N+   +K   A+R+   K V   
Sbjct: 371 DIVGIKAPKQS-DGISFLPTLLNQKQQNIHEYMYWEYNIKGGRK---AVRLGKWKGVIYK 426

Query: 410 DNNGQS 415
            NN +S
Sbjct: 427 ANNKKS 432


>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 167

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 77/158 (48%), Positives = 107/158 (67%)

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L+T+ A+++I +H+ +KPMFLY +H AVH G+     +AP   +A F  I +  R  +AG
Sbjct: 1   LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            V  LD+SVG V+ AL K GML+NSI++F  DNG   +G   N G N PLRG K T W+G
Sbjct: 61  AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGYNWPLRGSKFTLWEG 120

Query: 316 GMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG 353
           G+RG A +WSP L +T ++S++L HI+DWLPT  +AAG
Sbjct: 121 GVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAG 158


>gi|326432029|gb|EGD77599.1| steroid sulfatase [Salpingoeca sp. ATCC 50818]
          Length = 537

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 121/391 (30%), Positives = 185/391 (47%), Gaps = 47/391 (12%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           + P+ ++++ DDLG+ D+   G   + TP++D+L  +G+ ++Q  V A +CTPSR+  +T
Sbjct: 30  QPPNFLVLMVDDLGYGDLGCFGRKNVSTPHVDSLCRDGVKMDQFLVPASVCTPSRAGFLT 89

Query: 87  GKYPIHIGMQHGVI------LEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGF---- 136
           G+ P   GM   V+        G+P G P +E  LP+ L+E GYAT   GKWHLG     
Sbjct: 90  GRLPRRFGMTANVLPWRVMATTGQPTGFPDSEITLPEVLREKGYATGMSGKWHLGISNLT 149

Query: 137 FREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL---DMRHNMQVDNKTI---- 189
               + P   GFDS    W GL     H C    +PY GL    +R  M + N T+    
Sbjct: 150 HHWAHLPKQHGFDS----WLGLPYTNMHGC----DPYHGLQNESLRDCMVMANNTVVSQP 201

Query: 190 GIYS--TDLYTEAAINVIAEHNK-SKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDIS 246
            IYS  T   T+ AI+ I E    ++P F Y+A++ VH       F +P     KF ++S
Sbjct: 202 TIYSNLTQTLTDHAIHFITEKAALNQPFFFYMAYVHVHT----PLFSSP-----KFKNVS 252

Query: 247 DPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLR 306
                 +   V  +D+SVG ++ AL +  +  N++VL  +DNG       S +G    L+
Sbjct: 253 --AGGVFGDNVEEMDDSVGQILRALDEAHIANNTLVLLTSDNGPFLEEGWSRRGRTGGLK 310

Query: 307 GMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTSLDGVN 365
           G K   ++GG+R       P +     V  E     D +PT  A A      D  +DG N
Sbjct: 311 GGKGQTYEGGIRVPGIARWPGVIPAGTVLHEAVTTLDLMPTFAALADTTAPTDRIIDGAN 370

Query: 366 QWDVLTKGAKTKRSEIL------HNIDNVDN 390
            W+++       R + +      HN+    N
Sbjct: 371 VWEMMVHPHNATRPDPIIWHYCGHNVTAARN 401


>gi|319643165|ref|ZP_07997794.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|345520543|ref|ZP_08799930.1| arylsulfatase [Bacteroides sp. 4_3_47FAA]
 gi|317385206|gb|EFV66156.1| arylsulfatase [Bacteroides sp. 3_1_40A]
 gi|345456873|gb|EET15372.2| arylsulfatase [Bacteroides sp. 4_3_47FAA]
          Length = 464

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 128/399 (32%), Positives = 187/399 (46%), Gaps = 40/399 (10%)

Query: 12  TCTLLFNDAFLN--------TTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
           T  +LF+ A L+        TT  K P++I I+ADDLG  D+  +G  QI TPNID +A 
Sbjct: 4   TRKILFSVALLSSGLTMAQTTTVEKSPNVIYIMADDLGIGDLGCYGQRQIKTPNIDGIAQ 63

Query: 64  NGLILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGE-----PWGLPLTEKLLPQ 117
           NG+   QHY  + +  PSR AL+TGK+  H  ++    + G         LP  E  +  
Sbjct: 64  NGMKFMQHYSGSTVSAPSRCALITGKHMGHAAIRGNAKVAGSDGLLYETPLPAGEVTVAD 123

Query: 118 YLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD 177
             K   Y T  +GKW +G       P   GFD  YGY   L   + HS    F      +
Sbjct: 124 IFKTKNYVTGCVGKWGMGGPGTEGVPGKHGFDYFYGY---LGQRFAHSYYPEFLH----E 176

Query: 178 MRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPF 232
               + +D K    YS DL  E A+N I E N  KP FLY +    HA     G     +
Sbjct: 177 NEQKIMLDGK---YYSHDLMLEKALNFIDE-NAQKPFFLYFSPTIPHADLDIMGEAMTEY 232

Query: 233 QAPDEEVA------KFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMA 286
           +    E         +    +P R  YA MV+ LD+SVG +I  L++ G+ +++I++F +
Sbjct: 233 EGEFCETPFGGSRDGYKSQQNP-RAAYAAMVTYLDKSVGLIIKELKEKGLYDHTIIVFTS 291

Query: 287 DNGAPSFGIH--SNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDW 344
           DNG  S G H  S   SN P RG K   ++GG+R    I  P +     V++ +    D+
Sbjct: 292 DNGVHSEGGHDPSYFDSNGPFRGQKRDLYEGGIRTPFVIQWPGVIPQGVVTNHISAFWDF 351

Query: 345 LPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           LPT+      +I   ++DG++    LT     K  + ++
Sbjct: 352 LPTIGELVQADI-PQNIDGISYLPTLTGKGIQKEHDCIY 389


>gi|255035424|ref|YP_003086045.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948180|gb|ACT92880.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 457

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 109/356 (30%), Positives = 172/356 (48%), Gaps = 25/356 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ-HYVQALCTPSRSALMT 86
           + P+I+ ILADDLG+ D+  HG   + TPNIDALA  G+I    H    +C+PSRS L+T
Sbjct: 21  QSPNIVFILADDLGYGDIGAHGQKLLRTPNIDALAKEGMIFTDIHAGAPVCSPSRSVLIT 80

Query: 87  GKYPIHIGMQHGVILEGEPWG-----------LPLTEKLLPQYLKEAGYATHAIGKWHLG 135
           G +  H  ++    + G   G           L   +  + + + ++GY T   GKWHL 
Sbjct: 81  GLHTGHTTIRGNATIRGGIVGNKGKQTVRRANLAAGDFTVGKLMAQSGYTTALTGKWHLD 140

Query: 136 FFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTD 195
            +  + TP  RGFD   G+       Y +          G+ ++   Q +N   G Y+ D
Sbjct: 141 GYDTLATPIHRGFDQFSGWLIAYPGTYANGYWPAKRYVNGV-LKDVEQNENGRKGYYADD 199

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
           L T+ ++  +A    +K  F+    L ++  + + P  A D    K  D    + + Y  
Sbjct: 200 LTTDESLAFLAAQKDAKKPFV----LMINYNSPHSPLDAADSSAYKDRDWPQ-DMKIYGA 254

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKG------SNHPLRGMK 309
            V  LDE+VG +   L + G+ +N+IV F +DNG  S G            SN  LRG K
Sbjct: 255 QVHHLDENVGKIKKYLTESGLAKNTIVFFCSDNGPRSEGTPQQTAIAEFFDSNGRLRGYK 314

Query: 310 STPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
              ++GG+R    +W+P + +   VSSE  + +D +PT    AG +++ T+ DG +
Sbjct: 315 RDMYEGGIRVPMVVWAPGIVKPGSVSSEPAYFADIMPTFADIAGSKVSYTT-DGAS 369


>gi|424046834|ref|ZP_17784396.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Vibrio cholerae HENC-03]
 gi|408884894|gb|EKM23622.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Vibrio cholerae HENC-03]
          Length = 500

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 138/432 (31%), Positives = 196/432 (45%), Gaps = 58/432 (13%)

Query: 20  AFLNTTAPKKPHIIIILADDLGWNDVS-FH-GSSQIPTPNIDALAYNGLILNQHYVQALC 77
           +F  T A  +P+I +I  DD+G ++VS +H G     TPNID++A  G++L  +Y Q  C
Sbjct: 17  SFAATAAQDQPNIFVIFTDDIGVSNVSAYHRGLMSSQTPNIDSIAEKGMLLTDYYAQPSC 76

Query: 78  TPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFF 137
           T  RSA +TG+ P+  GM H V L G P GL      LP+ LKE GY T   GK HLG  
Sbjct: 77  TAGRSAFLTGQLPVRTGM-HTVGLPGGPVGLNQNTPTLPELLKELGYVTGQFGKNHLG-D 134

Query: 138 REVYTPTFRGFDSHYGYWQGLQ--DYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYST- 194
           R+ + PT  GFD ++G+   L   +Y +       E +Q    R+ +       G  S  
Sbjct: 135 RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDEAFQKFAPRNVIHAKADGKGGQSIK 194

Query: 195 --------------DLYTEAAINVIAEHNKS-KPMFLYL----AHLAVHAGNTYEPFQAP 235
                         D     AI+ I    K+ KP F +      H+  H    YE     
Sbjct: 195 DDGALTIERMRTLDDEVNRHAIDFIERAVKAEKPFFTWYCPSRGHVWTHLSPKYEAMLGT 254

Query: 236 DEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGI 295
           +    + + + D            LD+ VG ++A L + G+ +N+IV+F ADNG P    
Sbjct: 255 NGWGLQEVVMKD------------LDDHVGEMLAKLEELGVADNTIVIFTADNG-PEIMT 301

Query: 296 HSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA--- 352
             + G   P  G K T W+GG+R    I  P      KVS+ +F   D LPTL  AA   
Sbjct: 302 WPDGGMT-PFHGEKGTTWEGGVRAPFLISWPEKIPANKVSNGMFDGMDLLPTLVEAAGGP 360

Query: 353 ---------GIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDL 403
                    G + +   LDG NQ ++LT   K+ R+EI +        +    A+RV D 
Sbjct: 361 ADLKQQMLKGYKGHKAHLDGYNQMEMLTNNGKSNRNEIFY------YERDKLQAVRVGDW 414

Query: 404 KYVAGTDNNGQS 415
           K      N+G S
Sbjct: 415 KAHFVVQNHGWS 426


>gi|32475083|ref|NP_868077.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
           SH 1]
 gi|32445623|emb|CAD75624.1| N-acetylgalactosamine-6-sulfate sulfatase [Rhodopirellula baltica
           SH 1]
          Length = 484

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 131/451 (29%), Positives = 202/451 (44%), Gaps = 52/451 (11%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           +     C+       L   +  +P+I++ILADDLG+ D+  +G+ +  TP +D LA  G+
Sbjct: 16  WMLVALCSHACVPTLLRADSNDRPNIVLILADDLGYGDLGCYGNDEQATPVLDRLATQGV 75

Query: 67  ILNQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEG--------------EPWGLPLT 111
              Q Y     C+P+R+AL+TG+Y  H+G     I  G                 GLP  
Sbjct: 76  RWTQAYANGPECSPTRAALLTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVNELGLPAN 135

Query: 112 EKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATF 170
              L + L   GY T   GKWHLG +   ++P   GFD       G  DYY +    AT+
Sbjct: 136 RPTLAKRLSSVGYETALFGKWHLG-YEAKFSPMMHGFDEALYCIGGAMDYYHYLDSVATY 194

Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTY 229
                 ++ HN +  +     Y TD  T+ A+  I + N + KP FLYL + A H     
Sbjct: 195 ------NLFHNGRPISGE--GYFTDTITDQAVRFIGDRNANDKPFFLYLPYTAPHT---- 242

Query: 230 EPFQAPDEEVAKFLDISDPERR-------TYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
            P+QAP E     L I  P  +        Y  MV  +DE +G V+ A+ +  M + ++V
Sbjct: 243 -PYQAPGESPVDPLPIDSPLWKQNADPPGVYRAMVRHMDEGIGKVLHAIEESKMTDRTLV 301

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHI 341
           +F +DNG       ++   N PLRG K   ++GG+R  + A W   L +   VS ++   
Sbjct: 302 IFASDNGG------TSASRNEPLRGFKGQAFEGGIRVPLIARWPGHLPEG-VVSDQVTIT 354

Query: 342 SDWLPTLCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVD 401
            D   ++ AAAG  I  T  D +   DVL+  A  +  +            + ++ +R  
Sbjct: 355 FDLTASMLAAAG--ITPTQEDAMEGIDVLSLAANDEPVQPRTLYWRKPRDPQVWSGMRDG 412

Query: 402 DLKYV----AGTDNNGQSDEWYGDTDNEIDK 428
           + KYV    A  D      EW  +  ++I +
Sbjct: 413 NWKYVRQEKATVDGRTSIQEWLFNLADDISE 443


>gi|254282184|ref|ZP_04957152.1| arylsulfatase A [gamma proteobacterium NOR51-B]
 gi|219678387|gb|EED34736.1| arylsulfatase A [gamma proteobacterium NOR51-B]
          Length = 535

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 124/427 (29%), Positives = 209/427 (48%), Gaps = 67/427 (15%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQI---PTPNIDALAYNGLILNQHYVQALCTPSRSAL 84
           ++P+I++ILADD+GW ++  +G  ++   PTP +D +A  G+    HY +  CTP+R AL
Sbjct: 68  RRPNILVILADDIGWGELGSYGGGKLKGAPTPALDQMADEGMRFLSHYTEPSCTPTRVAL 127

Query: 85  MTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           MTG++P+  G+   V+  G+  GL   E  L + L EAGYAT   GKWHLG  +E + P 
Sbjct: 128 MTGRHPVRTGLDE-VLFPGQVKGLVADEVTLAEVLSEAGYATGMFGKWHLGELQE-HQPQ 185

Query: 145 FRGFDSHYGY----------WQGLQDYYD-----------------HSCKATFE-PYQGL 176
           ++GFD  Y Y          W+    +YD                  + + TF+ P  G+
Sbjct: 186 YQGFD--YAYYNLYNGAIWPWRENATHYDTENDTGITGPPYFIDIPEAYEETFDIPLHGI 243

Query: 177 DMRHNM-----QVDNKTIGIYST---DLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
            MR        ++D  ++  ++T   +L  E    +  +H    P F Y A       NT
Sbjct: 244 -MRAKRNTPAEEIDPLSLSRFNTFDNELTDEVIAFMRDQHEAGIPFFAYFAT------NT 296

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
            + F  PD + + +LD S+ + R     + + D+++  +  +L   G+ EN++VL+++DN
Sbjct: 297 QQVFSCPDVD-SPYLDKSNCQAR----QLVQHDKNMARLFESLDNMGIDENTLVLWISDN 351

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
           G P    + + G +  LRG KS  ++GG+R       P      +   ++ H+SDW  T+
Sbjct: 352 G-PMNKFYPSTGFSW-LRGYKSEVYEGGVRTPGIAKWPGSIAPGQTPIDIVHVSDWYTTI 409

Query: 349 CAAAGIEI---NDTSLDGVNQWDVLTKGAKTKRSE-------ILHNIDNVDNPQKYYAAL 398
              AG +    +D  +DGV+Q  +L  G    R +       I +   +   P    +A+
Sbjct: 410 ANLAGAKAAIPDDRVIDGVDQRSLLFNGEGYSRRDYVFFYRYIAYKNLSSTGPASMLSAI 469

Query: 399 RVDDLKY 405
           R+ D+K+
Sbjct: 470 RMGDIKF 476


>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
           griseus]
          Length = 493

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 114/356 (32%), Positives = 169/356 (47%), Gaps = 26/356 (7%)

Query: 37  ADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM 95
           A+ +GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+ PI  G 
Sbjct: 8   AEVMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGF 67

Query: 96  -------QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
                  ++    +    G+P +E LLP+ LK+AGY    +GKWHLG  R  + P   GF
Sbjct: 68  YTSNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGF 126

Query: 149 DSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINV 204
           D  +G        YD+  K     Y+  +M         ++ KT     T LY + A++ 
Sbjct: 127 DEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANLTQLYLQEALDF 186

Query: 205 I-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDES 263
           I  +H +  P FLY A  A HA         P      FL  S   R  Y   V  +D+S
Sbjct: 187 IRTQHARQSPFFLYWAIDATHA---------PVYASRPFLGTS--HRGRYGDAVREIDDS 235

Query: 264 VGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI 323
           VG +++ L+  G+ +N+ V F +DNGA         GSN P    K T ++GGMR  A  
Sbjct: 236 VGKILSLLQNLGISKNTFVFFTSDNGAALISAPKEGGSNGPFLCGKQTTFEGGMREPAIA 295

Query: 324 WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKR 378
           W P      +VS +L  I D   T  + AG++  +D  +DG++    + +G    R
Sbjct: 296 WWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRIIDGLDLLPTMLQGHVMDR 351


>gi|395819498|ref|XP_003783120.1| PREDICTED: arylsulfatase A [Otolemur garnettii]
          Length = 507

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 127/395 (32%), Positives = 185/395 (46%), Gaps = 45/395 (11%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           AL   LL   A L T +P  P++++I ADDLG+ D+  +G     TPN+D LA  GL   
Sbjct: 3   ALGAFLLALVAGLATASP--PNVVLIFADDLGYGDLGSYGHPSSTTPNLDQLAAGGLRFT 60

Query: 70  QHYVQ-ALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHA 128
             YV  +LCTPSR+AL+TG+ P+ +GM  GV+      GLPL E  L + L   GY T  
Sbjct: 61  DFYVPVSLCTPSRAALLTGRLPVRMGMYPGVLKPSSQGGLPLEEVTLAEVLAAKGYLTGM 120

Query: 129 IGKWHLGFFRE-VYTPTFRGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LD 177
            GKWHLG   E ++ P   GF    G  Y        + +C     P  G        + 
Sbjct: 121 AGKWHLGVGPEGIFLPPHHGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIP 180

Query: 178 MRHNMQVDNKTIGI---------YSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNT 228
           +  N+ V+ +   +         ++ DL T+A         + +P FLY A    H    
Sbjct: 181 LLANLTVEVQPPWLPGLEARYVAFARDLMTDA-------QRQGRPFFLYYASHHTH---- 229

Query: 229 YEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADN 288
           Y  F         F + S   R  +   +  LD +VG ++ A+   G+LE ++V+F ADN
Sbjct: 230 YPQFSG-----QSFAEHSG--RGPFGDSLMELDAAVGALMTAMEDLGLLEETVVIFTADN 282

Query: 289 GAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPT 347
           G  +  + S  G +  LR  K T ++GG+R  A A W   +  T  V+ EL    D LPT
Sbjct: 283 GPETMRM-SRGGCSGLLRCGKGTTFEGGVREPALAFWPGHI--TPGVTHELASTLDLLPT 339

Query: 348 LCAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
           L A  G  + + +LDG +   +L    K+ R  + 
Sbjct: 340 LAALTGAPLPNVTLDGFDLSPLLLGTGKSPRQSLF 374


>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 518

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 118/373 (31%), Positives = 177/373 (47%), Gaps = 28/373 (7%)

Query: 15  LLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV- 73
           L+F+  +   + P KP++I++L DD+GW D+   G+    TPN+D +A  G ++   Y  
Sbjct: 12  LMFHAFYSTNSNPSKPNVILMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAP 71

Query: 74  QALCTPSRSALMTGKYPIHIGM----QHGVILEGEPW---GLPLTEKLLPQYLKEAGYAT 126
             LC+PSR++L+TG+ PI  G      H        +   G+P +E +LP+ L +AGY +
Sbjct: 72  NPLCSPSRASLLTGRLPIRNGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRS 131

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDM----RHNM 182
             IGKWHLG  +  Y P   GFD ++G        YD+  +     Y+  DM        
Sbjct: 132 KIIGKWHLG-HQTQYHPLKHGFDEYFGAPNCHVGPYDNKKQPNIPVYRDADMIGRYYEEF 190

Query: 183 QVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAK 241
           ++D K+     T ++ E AI  I  +H   +  FLY    A H         +P      
Sbjct: 191 KID-KSGESNLTQMFIEEAIAFIEKQHQTGEQFFLYWTPDASH---------SPHYASKS 240

Query: 242 FLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGS 301
           FL  S  +R  Y   V  LD  VG ++  L++  +  N+ V F +DNGA   G  +  GS
Sbjct: 241 FLGTS--QRELYGDAVMELDYGVGQILGKLKELQIENNTFVFFSSDNGAALSG-GAYVGS 297

Query: 302 NHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEI-NDTS 360
           N P    K T ++GGMR  +  W P   +  KVS +L  I D   T    AG+ +  D  
Sbjct: 298 NGPFLCGKHTTFEGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAGVTLPTDRI 357

Query: 361 LDGVNQWDVLTKG 373
           +DG +    L  G
Sbjct: 358 IDGTSLLPALVNG 370


>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
           laevis]
 gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
          Length = 520

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 120/388 (30%), Positives = 182/388 (46%), Gaps = 34/388 (8%)

Query: 11  LTCTLLFNDAFLNTTAP------KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYN 64
           + C+L    AFLN           +P+I+++L DD+GW D+   G     TPN+D +A  
Sbjct: 5   IMCSLAL--AFLNPVLVLGGMNVTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASE 62

Query: 65  GLILNQHYV-QALCTPSRSALMTGKYPIHIGM-------QHGVILEGEPWGLPLTEKLLP 116
           G++    Y    LC+PSR+AL+TG+ PI  G        ++    +    G+  +E L P
Sbjct: 63  GMLFPNFYTANPLCSPSRAALLTGRLPIRNGFFTTNSHARNAYTPQEIVGGISDSEILFP 122

Query: 117 QYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGL 176
           + LK+AGY    IGKWHLG  R  Y P   GFD  +G        YD+        Y+  
Sbjct: 123 ELLKKAGYINKIIGKWHLGH-RPSYHPLRHGFDEWFGSPNCHFGPYDNKQIPNIPLYRDW 181

Query: 177 DM----RHNMQVDNKTIGIYSTDLYTEAAINVI-AEHNKSKPMFLYLAHLAVHAGNTYEP 231
           +M         +++KT     T +Y + A+  I ++    +P FLY +  A HA      
Sbjct: 182 NMVGRYYEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDATHA------ 235

Query: 232 FQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
              P      FL  S+  R  Y   V  +D S+G ++  L   G+ +++ V F +DNGA 
Sbjct: 236 ---PVYASRPFLGTSN--RGLYGDAVREIDFSIGKILNLLNLTGIAKDTFVFFTSDNGAA 290

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
                   GSN P    K T ++GGMR     W P      +VS +L  I D   T  + 
Sbjct: 291 LISAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSL 350

Query: 352 AGIEI-NDTSLDGVNQWDVLTKGAKTKR 378
           AG+EI  D ++DG++   V+ +  +T+R
Sbjct: 351 AGLEIPRDRAIDGIDLSPVILESKETER 378


>gi|449138380|ref|ZP_21773646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448882925|gb|EMB13473.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 467

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 137/444 (30%), Positives = 205/444 (46%), Gaps = 64/444 (14%)

Query: 7   YFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGL 66
           YF A   T L  D         +P+I++ILADDLG+ D+  +G+    TP +D LA  G+
Sbjct: 5   YFHACVPTSLRADPN------DRPNIVLILADDLGYGDLGCYGNDAQATPTLDRLARQGV 58

Query: 67  ILNQHYVQAL-CTPSRSALMTGKYPIHIGMQHGVILEG--------------EPWGLPLT 111
              Q Y     C+P+R+AL+TG+Y  H+G     I  G                 GLP  
Sbjct: 59  RWTQAYANGPECSPTRAALLTGRYQQHVGGLECAIGVGNVGRYDDAIRLHLVSELGLPAD 118

Query: 112 EKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDH-SCKATF 170
              L + L + GY T   GKWHLG+  + ++P   GF+       G  DYY +    AT+
Sbjct: 119 RPTLAKRLSDVGYETALFGKWHLGYEAQ-FSPMQHGFEEALYCIGGAMDYYHYLDSVATY 177

Query: 171 EPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKS-KPMFLYLAHLAVHAGNTY 229
             +     R+   +  +    Y TD  T+ A+  + + N S +P FLYL + A H     
Sbjct: 178 NLF-----RNGRPISGEG---YFTDTITDHAVQYVRDRNDSEQPFFLYLPYTAPHT---- 225

Query: 230 EPFQAPDEEVAKFLDI-------SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIV 282
            P+Q P +     L +       SD     Y  MV  +DE +G V+ A+ + GM + ++V
Sbjct: 226 -PYQPPGQSPVDPLPLDSPLWKQSDDPPGVYRSMVRHMDEGIGKVLHAIEESGMTDRTLV 284

Query: 283 LFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHI 341
           +F +DNG       ++   N PLRG K   ++GG+R  + A W   L +   VS ++   
Sbjct: 285 IFASDNGG------TSASRNEPLRGFKGQTFEGGIRVPLIARWPGHLPE-DTVSQQVTIT 337

Query: 342 SDWLPTLCAAAGIE-INDTSLDGVNQWDVLTKGAKTKRSE--ILHNIDNVDNPQKYYAAL 398
            D   ++ AAAGI    + +++G+   DVL+  A  +  E   LH     D PQ  +  +
Sbjct: 338 FDLTASMLAAAGITPAQEDAMEGI---DVLSLAASGEPPEPRTLHWRKPRD-PQ-VWGGI 392

Query: 399 RVDDLKYV----AGTDNNGQSDEW 418
           R   LKYV    A TD      EW
Sbjct: 393 RDGKLKYVRQEKATTDGRTTIQEW 416


>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
           paniscus]
          Length = 482

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 169/362 (46%), Gaps = 25/362 (6%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
           +GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 96  ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
               ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG  R  + P   GFD  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 119

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           +G        YD+  +     Y+  +M         ++ KT     T +Y + A++ I  
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
             +  P FLY A  A HA         P      FL  S  +R  Y   V  +D+S+G +
Sbjct: 180 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSIGKM 228

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L+   + +N+ V F +DNGA         GSN P    K T ++GGMR  A  W P 
Sbjct: 229 LELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPG 288

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
                +VS +L  I D   T  A AG+   +D ++DG+N    L +G    R    +  D
Sbjct: 289 HVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGD 348

Query: 387 NV 388
            +
Sbjct: 349 TL 350


>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
 gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
          Length = 456

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 116/346 (33%), Positives = 186/346 (53%), Gaps = 24/346 (6%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMTG 87
           KP+II I+ DD+G+  +  +G   I TP +D +A  GL L  +Y   A+C PSR +LMTG
Sbjct: 19  KPNIIFIMCDDMGYGQLGSYGQKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTG 78

Query: 88  KYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           +   H+G  +    +  P G   +P     + + +KEAGYAT  IGKW LG+      P 
Sbjct: 79  Q---HVGHTYIRGNKEYPTGQEPIPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPN 135

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
            +GFD  +GY    +  ++H  K      + L +++N     K I  YS  + T+ A   
Sbjct: 136 KQGFDYFFGY-NDQKHAHNHFPKFLLRNEETLTLKNN---SGKEIE-YSQYMLTDEAKGF 190

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAP--DEEVAKFLDISDPER-RTYAGMVSRLD 261
           I + NK  P FLYLA++  H+       Q P  DE   ++ D S PE+ + +AGM+SRLD
Sbjct: 191 IKK-NKDNPFFLYLAYVIPHS-----RLQIPGDDECYLQYKDESWPEKQKKHAGMISRLD 244

Query: 262 ESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNH--PLRGMKSTPWDGGMRG 319
           + VG+++  L++  + EN++V+F +DNGA   G    +  N   PL G+K + ++GG+R 
Sbjct: 245 KDVGSILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFNDSGPLSGIKRSMYEGGVRV 304

Query: 320 VAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVN 365
                 P + +  +VS+ +    D +PT C   G++  +  +DG++
Sbjct: 305 PFIAHWPGVIKPGQVSNHIGAHWDLMPTACELGGVQPPE-GIDGIS 349


>gi|440749357|ref|ZP_20928605.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
 gi|436482362|gb|ELP38485.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
          Length = 484

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 130/394 (32%), Positives = 184/394 (46%), Gaps = 57/394 (14%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMTG 87
           KP+II ILADDLG+ D+   G ++I TPN+D LA  G++   HY  A +C PSR+ALMTG
Sbjct: 33  KPNIIFILADDLGYGDLGVTGQTKIETPNLDKLAREGMLFTNHYSGATVCAPSRAALMTG 92

Query: 88  KYPIHIGMQHGVILEGEPWG---LPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
            +  H  ++     E +P G   +P +   LP+ +K AGYAT A GKW LGF      P 
Sbjct: 93  LHTGHTPIRGNK--EIQPEGQHPMPDSVLTLPKMMKRAGYATGAFGKWGLGFVGTSGDPI 150

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN--KTIGIYSTDLYTEAAI 202
            +GFD  +GY    Q Y        F  Y  L+        N       ++ D+  +AA+
Sbjct: 151 NQGFDQFFGY--NCQRY----AHRYFPEYLWLNSEKVFLPGNDWTNKATFAPDVIQKAAL 204

Query: 203 NVIAEHNKSKPMFLYLAHLAVHA------------------GNTYEPFQAPDE------- 237
             I E N   P F+Y+  +  HA                  G+  E F AP         
Sbjct: 205 EFI-EANSGNPFFMYVPMVIPHAELAADEESPIFKKYRERFGDERE-FVAPAGWDYGPGI 262

Query: 238 EVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHS 297
            +  +  +  P R TYA MV +LD  VG +IA L   G+ EN+I++F +DNG      H 
Sbjct: 263 NIPGYQSVKHP-RATYATMVEKLDAHVGEIIAKLEVLGIRENTIIVFASDNGP-----HQ 316

Query: 298 NKG-------SNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLC 349
             G       SN   RG K   ++GG++  + A W P   +   VS  +    D +PT  
Sbjct: 317 EGGNDPDFFDSNGIYRGYKRDLYEGGIKTSLIANW-PRKIKAGSVSDHVSAFWDLMPTFA 375

Query: 350 AAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
             AG+ I D  LDG++    L    +  + E L+
Sbjct: 376 ELAGVSI-DAELDGISMVPTLLGEGRQVQHEYLY 408


>gi|336413808|ref|ZP_08594157.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
           3_8_47FAA]
 gi|335934825|gb|EGM96808.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
           3_8_47FAA]
          Length = 472

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 126/391 (32%), Positives = 187/391 (47%), Gaps = 32/391 (8%)

Query: 11  LTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQ 70
           L   L+   A    +  K P+++ I+ADDLG  D+  +G ++I TP IDALA  G+   Q
Sbjct: 18  LMLPLISGQAQKQDSPSKTPNVVFIMADDLGIGDLGCYGQNRIKTPAIDALAAQGMKFTQ 77

Query: 71  HYVQA-LCTPSRSALMTGKYPIH--IGMQHGVILE-GEPWGLPLT--EKLLPQYLKEAGY 124
           HY  + +  PSR  L+TGK+  H  I    G   E G  + L L   E  + +  K   Y
Sbjct: 78  HYSGSTVSAPSRCVLLTGKHTGHSYIRGNKGYKAEDGRSYDLNLADEEITIGEMFKTKNY 137

Query: 125 ATHAIGKWHLGFFREVYTPTFRGFDSHYGYW-QG-LQDYYDHSCKATFEPYQGLDMRHNM 182
            T  +GKW LG       P  +GFD  +GY  QG   +YY         P Q  +    +
Sbjct: 138 VTACVGKWGLGGPSAEGHPNKQGFDYFFGYLGQGNAHNYY---------PVQLFENNTPV 188

Query: 183 QVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQAPDE 237
            +++K   +YS D+  + A+  I + N  KP F+YL     HA      N    +    E
Sbjct: 189 MLNHK---VYSHDMIMDKALEFI-DKNADKPFFVYLTPTIPHADIIVPNNELFDYDGKFE 244

Query: 238 EV---AKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFG 294
           EV    K     +  R T+A MV+RLD  V  VI  L   G+L+N+I++F +DNG    G
Sbjct: 245 EVPYPGKGYRPQEKPRATFAAMVTRLDRDVQRVIDLLADKGVLDNTIIIFTSDNGTHKEG 304

Query: 295 IHSNK--GSNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAA 352
            H  +   SN P RGMK   ++GG+R    +  P +     VS ++    D++PT+C   
Sbjct: 305 GHDPRYFDSNGPFRGMKRDLYEGGIRTPFIVKWPGVIPAGSVSFQVSAFWDFMPTMCELI 364

Query: 353 GIEINDTSLDGVNQWDVLTKGAKTKRSEILH 383
           G      ++DG++    LT   K K+ + L+
Sbjct: 365 GAPA-PANIDGLSYLPTLTGKGKQKQHDYLY 394


>gi|325110685|ref|YP_004271753.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970953|gb|ADY61731.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 592

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 123/376 (32%), Positives = 179/376 (47%), Gaps = 45/376 (11%)

Query: 4   ARKYFFALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAY 63
            R   F+L+  L F        A  +P++I IL DD GW D+S HG+  I TPNID +A 
Sbjct: 5   VRISVFSLS-ILAFLSYLCVAEAADRPNVIAILTDDQGWGDLSIHGNLAINTPNIDKMAR 63

Query: 64  NGLILNQHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAG 123
            G  L + +V  +C+P+R+  +TG++ +  G+ +     GE   +   E ++ +  K AG
Sbjct: 64  EGAQLERFFVSPVCSPTRAEFLTGRHHVRCGV-YSTSAGGE--RVNADETMIGEVFKRAG 120

Query: 124 YATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ 183
           Y T A GKWH G  +  Y P  RGFD  YG+  G    Y       F P     + HN  
Sbjct: 121 YQTAAFGKWHSG-MQAPYHPNSRGFDEFYGFCSGHWGNY-------FSPM----LEHNG- 167

Query: 184 VDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFL 243
            D  T   +  D  T  A++ I EH + +P F+YL +   H+     P Q P+    K  
Sbjct: 168 -DLVTGEGFIIDDLTNRAMSYIDEH-QQEPFFVYLPYNTPHS-----PMQVPERWYKKLA 220

Query: 244 DI----SDP--------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
           D+     DP          R    M   +D +VG ++  L +  + EN+IVL+  DNG  
Sbjct: 221 DLELTQEDPNASEKQLLHTRAALAMCENIDWNVGRLLEKLDQLNLSENTIVLYFCDNGP- 279

Query: 292 SFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAI-WSPWLKQTQKVSSELFHISDWLPTLCA 350
                + K  N  +RG K +  +GG+R    I W   +K   KV  ++    D LPTL  
Sbjct: 280 -----NGKRYNGGMRGRKGSTDEGGVRSPLFIRWPAKIKPGTKV-PQICSAMDLLPTLAD 333

Query: 351 AAGIEINDTS-LDGVN 365
            AGIE+ DT  LDG++
Sbjct: 334 MAGIEVTDTKPLDGIS 349


>gi|325111172|ref|YP_004272240.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324971440|gb|ADY62218.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 474

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 114/349 (32%), Positives = 174/349 (49%), Gaps = 29/349 (8%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           + P+ +I+  DD G+ DV   G+    TPN+D +A  G      YV QA+C+ SR++L+T
Sbjct: 21  ETPNFVIVFTDDQGYADVGCFGAEGYETPNLDRMAQEGRKFTSFYVSQAVCSASRTSLLT 80

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G Y + +G+ +G +      G+   E  L + +K+ GY T A+GKWHLG+ ++ + P   
Sbjct: 81  GCYNVRVGI-NGALSHKSTVGISTEEMTLGELVKQQGYTTAAVGKWHLGY-QQKFLPLQH 138

Query: 147 GFDSHYG--YWQGLQDYYDHSCKATFEP-----YQGLDMRHNMQVDNKTIGIYS----TD 195
           GFD ++G  Y   +  Y+      T E      Y  L +    ++ +  +        T 
Sbjct: 139 GFDRYFGLPYSNDMWPYHPRLEGMTLEEARVKGYPDLPLYSGNEIIDAQVTPEDQRQLTT 198

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
            YTE A++ I + N  KP  LYLAH   H      P    D    KF D ++  R  Y  
Sbjct: 199 QYTEQALDFI-DDNHDKPFLLYLAHSMPHV-----PLFVSD----KFQDKTEQGR--YGD 246

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWD 314
           ++  +D SVG V+A L+KHG+ +N++V+F +DNG   S+G H+  GS  P R  K T W+
Sbjct: 247 VIQEIDWSVGQVLARLKKHGLDDNTLVIFTSDNGPWLSYGNHA--GSAGPFREGKGTSWE 304

Query: 315 GGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDG 363
           GG+R    +  P          EL    D  PT+ A A   + D  +DG
Sbjct: 305 GGVRVPCIMRWPGKIPAGTSCDELAATIDIFPTVAALADAPLPDHKIDG 353


>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 728

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 168/340 (49%), Gaps = 26/340 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           +KP+I++IL DD G+ D+  +G++++ TP ID +A  G+    +Y  A +C+PSR+ L+T
Sbjct: 4   EKPNIVLILVDDQGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGLLT 63

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G YP  +G    V       G+   E  L +  K+ GY T  IGKWHLG F E + P  +
Sbjct: 64  GCYPRRVGNHVWVHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLG-FHEPFLPQNQ 122

Query: 147 GFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS-TDLYTEAAINVI 205
           GFD ++G    L    D      FE   G+ ++ +  V  +       T LYT  AI+ I
Sbjct: 123 GFDHYFGLLHNL----DPVETVYFEDVGGVPLQRDRDVVKRPADPDELTKLYTNEAIDFI 178

Query: 206 AEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVG 265
            E NK  P  LYL H  +H      P    +E    F   S  +   Y   +  LD  VG
Sbjct: 179 -EANKEGPFLLYLPHTMLH-----NPLGVSEE----FKGTS--QWGEYGDAIQELDHHVG 226

Query: 266 NVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWS 325
            +I  L++  + +N++V++ +DNG         +     +RG K + ++GG+R  A  W 
Sbjct: 227 RIIDVLKRLNIDDNTVVVYASDNGR-----GPGRNPQQKIRGRKLSTYEGGIRVPAIAWG 281

Query: 326 PWLKQTQKVSS-ELFHISDWLPTLCAAAGIEI-NDTSLDG 363
           P L     +SS  +    DW PTL   A IEI ++  +DG
Sbjct: 282 PGLGLRAGISSPAVVRAMDWYPTLATLADIEIPSERVIDG 321


>gi|365877064|ref|ZP_09416570.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
 gi|442586911|ref|ZP_21005733.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
 gi|365755338|gb|EHM97271.1| Cerebroside-sulfatase [Elizabethkingia anophelis Ag1]
 gi|442563318|gb|ELR80531.1| Cerebroside-sulfatase [Elizabethkingia anophelis R26]
          Length = 454

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 127/415 (30%), Positives = 195/415 (46%), Gaps = 42/415 (10%)

Query: 11  LTCTL--LFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           + C L  LF   F N  A K+P+II++L DD+G+ D+  +G+  I TP +D ++ NGL+ 
Sbjct: 5   VRCALIYLFTVLFSNARAQKQPNIIVVLTDDMGYADIGAYGNPVIKTPFLDQMSRNGLMA 64

Query: 69  NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
             + V +  CTPSR++++TG+Y     +    I  G   GLP  E  + + LK  GY T 
Sbjct: 65  TNYVVSSPTCTPSRASMLTGRYSSRYDLPWP-IAPGSKQGLPDDEVTIAEMLKANGYNTG 123

Query: 128 AIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLD----MRHNMQ 183
            +GKWHLG  +    P  +GFD +YG        Y H  KA   PY   D    M  N +
Sbjct: 124 MVGKWHLGDQKAENKPNGQGFDFYYGI------LYSHDYKA---PYVNTDIPIRMFRNTK 174

Query: 184 VD-NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKF 242
           V+  K      T LYT+ +IN I +  K KP FLYLAH   H          P    A+ 
Sbjct: 175 VEIEKPADSLLTRLYTKESINYIRQQKKDKPFFLYLAHNMPH---------LPVYYAAQS 225

Query: 243 LDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNG----------APS 292
             + + +      +++ +DE +  +   L + GM +N+I +F +DNG             
Sbjct: 226 SALKNEKGGALGAVITEMDEGLAAIWKVLEEKGMADNTIFVFSSDNGPWTNFPERMEGDG 285

Query: 293 FGIHSNKGSNHPLRGMKSTPWDGGMR-GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAA 351
             I S+ GS    RG K+  ++GG R      W   +K    ++S + ++ D LPT+   
Sbjct: 286 VTIASHVGSAGIFRGSKAWSYEGGARVPFIVYWKNKIKPGTVLTSAISNL-DLLPTIARW 344

Query: 352 AGIEI-NDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKY 405
               +  +  LDG +   +L    K K SE +H    + N      A++  + KY
Sbjct: 345 TNSPLPKNRELDGQDISQLLEN--KVKESEYIHRPIFIVNGSTKPEAVKYGNWKY 397


>gi|329957723|ref|ZP_08298198.1| arylsulfatase [Bacteroides clarus YIT 12056]
 gi|328522600|gb|EGF49709.1| arylsulfatase [Bacteroides clarus YIT 12056]
          Length = 469

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 124/354 (35%), Positives = 180/354 (50%), Gaps = 34/354 (9%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYV-QALCTPSRSALMT 86
           ++P+I+I+L DD G+ D+  +GS  + TPN+D LA  G      Y   A  TPSR+AL+T
Sbjct: 36  RQPNIVILLTDDQGYGDMKCYGSPHLQTPNLDRLAAEGTRFTNFYAGAAASTPSRAALLT 95

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFR 146
           G+Y   +G+  GV+ +    G+  +E  L +YLK+  YAT  +GKWHLG+  E Y P   
Sbjct: 96  GRYAERVGVP-GVVDDTSENGIKRSEITLAEYLKQNDYATGIVGKWHLGYQPE-YLPLRH 153

Query: 147 GFDSHYG--YWQGLQDYY---DHSCKATFEPYQGLDMRHNMQVD--NKTIGIYSTDLYTE 199
           GF   +G  Y   +  Y+   DH+       Y  L +  N +V   N  +   +T L TE
Sbjct: 154 GFTEFFGLPYSNDMWPYHPAPDHA-------YPALPLYDNDRVIEYNAPVNQMTTRL-TE 205

Query: 200 AAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSR 259
            AI  I +H K++P FLYL +   H      P    D    KF   S      YA ++  
Sbjct: 206 RAIRFINDH-KAEPFFLYLPYTQPHV-----PLGVSD----KFKGKSG--EGLYADVIME 253

Query: 260 LDESVGNVIAALRKHGMLENSIVLFMADNGA-PSFGIHSNKGSNHPLRGMKSTPWDGGMR 318
           +D SVG ++  L+ +G+ EN+IV+F +DNG   S+G H   GSN  LR  K T +DGG R
Sbjct: 254 IDWSVGEIMKTLKANGLDENTIVMFTSDNGPWLSYGNHG--GSNGGLREGKGTTFDGGQR 311

Query: 319 GVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTS-LDGVNQWDVLT 371
               +  P +    KVS +     D  PTL      ++   +  DG + W +LT
Sbjct: 312 VPFIVRMPGVIPAGKVSDQFLSALDITPTLVNLTQSQMPRMNKFDGQDAWGILT 365


>gi|440717773|ref|ZP_20898250.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436437075|gb|ELP30749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 474

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 114/397 (28%), Positives = 195/397 (49%), Gaps = 37/397 (9%)

Query: 9   FALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLIL 68
           FA T  +    +   TT    P++I++++DD GW DV F+G+  + TPN+DA+A  G+  
Sbjct: 12  FAFTLAVPATQSIAETTGTNSPNVILLMSDDQGWGDVGFNGNEVVQTPNLDAMASAGVRF 71

Query: 69  NQHYVQA-LCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATH 127
           ++ Y  A LC+P+R + +TG+YP   G     IL     G+ + E  + + L++ GYAT 
Sbjct: 72  DRFYAAAPLCSPTRGSCLTGRYPFRFG-----ILAAHTGGMRVGEITIAEMLQKRGYATG 126

Query: 128 AIGKWHLGFFR--EVYT------PTFRGFDSHYGYWQGLQDY--------YDHSCKATFE 171
             GKWH+G+ +  EV T      P+  GFD ++     +  +        +D       E
Sbjct: 127 MFGKWHIGWVKPDEVSTRGFYSPPSHHGFDEYFATTSAVPTWDPTITPQDWDSWGNGPGE 186

Query: 172 PYQ-GLDMRHNMQVDNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYE 230
           P++ G    HN +   + +    + +  +  I  I E N++KP F  +   A H     E
Sbjct: 187 PWKGGFPYVHNGREAKENLSGDDSRVIMDRVIPFI-EANQAKPFFATVWFHAPH-----E 240

Query: 231 PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGA 290
           P  A  EE  K    +  +R+ Y G ++ +D+ VG + A LR+ G+ +N++V F +DNG 
Sbjct: 241 PVVA-GEEFKKLYPKAGSKRKNYYGCITAMDQQVGRLRAKLRELGIEDNTVVFFCSDNG- 298

Query: 291 PSFGIHSNKG--SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
           PS G+ + KG  S  P +G K T ++GG+   A    P        +       D+LPT+
Sbjct: 299 PSDGL-AKKGVASAGPFKGHKHTMYEGGLLVPACAEWPGTIPAGTSTEVRCSTVDFLPTV 357

Query: 349 CAAAG---IEINDTSLDGVNQWDVLTKGAKTKRSEIL 382
            +  G   ++     +DG++   ++   AK +  ++ 
Sbjct: 358 ASIVGDSMVQKATRPIDGIDLMPLIRGEAKDRDRDLF 394


>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
          Length = 482

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 113/362 (31%), Positives = 167/362 (46%), Gaps = 25/362 (6%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
           +GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+ PI  G    
Sbjct: 1   MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 60

Query: 96  ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
               ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG  R  + P   GFD  
Sbjct: 61  NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 119

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           +G        YD+  +     Y+  +M         ++ KT     T +Y + A++ I  
Sbjct: 120 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 179

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
             +  P FLY A  A HA         P      FL  S  +R  Y   V  +D+SVG +
Sbjct: 180 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSVGKM 228

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L    + +N+ V F +DNGA         GSN P    K T ++GGMR  A  W P 
Sbjct: 229 LELLHDLHVADNTFVFFTSDNGAALISAPEQAGSNGPFLCGKQTTFEGGMREPALAWWPG 288

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
                +VS +L  I D   T  A AG+    D ++DG+N    L +G    R    +  D
Sbjct: 289 HIAAGQVSHQLGSIMDLFTTSLALAGLTPPRDRAIDGLNLLPALLQGRLMDRPVFYYRGD 348

Query: 387 NV 388
            +
Sbjct: 349 TL 350


>gi|393786360|ref|ZP_10374496.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
           CL02T12C05]
 gi|392659989|gb|EIY53606.1| hypothetical protein HMPREF1068_00776 [Bacteroides nordii
           CL02T12C05]
          Length = 467

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 120/372 (32%), Positives = 180/372 (48%), Gaps = 29/372 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSALMT 86
           K P+++ I+ADDLG  D+  +G  +I TP IDALA  G+   QHY  + +  PSR  L+T
Sbjct: 31  KAPNVVFIMADDLGIGDLGCYGQERIKTPAIDALAKQGMKFTQHYSGSTVSAPSRCVLLT 90

Query: 87  GKYPIH--IGMQHGVILE-GEPWGLPLT--EKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           GK+  H  I    G I E G  +   L   E  + +  K   Y T  +GKW LG      
Sbjct: 91  GKHTGHSFIRGNKGYIAEDGRKYDFNLADEEVTVGEIFKTKNYTTACVGKWGLGGPSAEG 150

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAA 201
            P  +GFD  +GY               + P Q  +    +++D K   +YS D+  + A
Sbjct: 151 HPNRQGFDYFFGYL-------GQGNAHNYFPVQLFENNTPVKLDYK---VYSHDIIMDKA 200

Query: 202 INVIAEHNKSKPMFLYLAHLAVHA-----GNTYEPFQAPDEE---VAKFLDISDPERRTY 253
           +  + +H  ++P FLYL     HA      N    +    EE   V K+     P R T+
Sbjct: 201 LAFLDKH-ANEPFFLYLTPTIPHADIIVPNNELFDYDGKFEEVPYVGKYRPQEKP-RATF 258

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNK--GSNHPLRGMKST 311
           A MV+RLD  +  VI  L + G+LEN+I++F +DNG    G H      SN P RG+K  
Sbjct: 259 AAMVTRLDRDIQRVIDLLAEKGVLENTIIIFTSDNGTHKEGGHDPGYFDSNGPFRGIKRD 318

Query: 312 PWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLT 371
            ++GG+R    +  P +     VS ++    D++PT+C   G +    ++DG++   VLT
Sbjct: 319 LYEGGIRTPFIVRWPGVIPAGSVSYQISTFWDFMPTMCELIGAKA-PANIDGISYLPVLT 377

Query: 372 KGAKTKRSEILH 383
              K K  + L+
Sbjct: 378 GKGKQKEHDHLY 389


>gi|262382591|ref|ZP_06075728.1| arylsulfatase [Bacteroides sp. 2_1_33B]
 gi|262295469|gb|EEY83400.1| arylsulfatase [Bacteroides sp. 2_1_33B]
          Length = 471

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 128/396 (32%), Positives = 191/396 (48%), Gaps = 64/396 (16%)

Query: 9   FALTCTLLFNDAFLNTTAPKKP-HIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLI 67
             ++ TLL         AP++P +++ ILADDLG+ DV  +G   I TPNID +A  G++
Sbjct: 7   LCISGTLLLIGMSGCEKAPERPMNVVYILADDLGYGDVGCYGQQIIKTPNIDRMAKEGML 66

Query: 68  LNQHYVQALCT---PSRSALMTGKYPIHIGMQ--HGVILEGEPWGLPLTEK--LLPQYLK 120
             QHY  A CT   PSR +LMTG +  H  ++    +  EG+    P+ E    L + +K
Sbjct: 67  FTQHY--AGCTVSAPSRCSLMTGLHTGHTQVRGNREITPEGQQ---PMREDTYTLGKLMK 121

Query: 121 EAGYATHAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRH 180
            AGY T   GKW LG    V  P   GFD  YGY          +C+     +    + H
Sbjct: 122 SAGYTTGIFGKWGLGNPGSVSIPNKMGFDEFYGY----------NCQRQSHSFYPDHLWH 171

Query: 181 NMQV------DNKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHA--------- 225
           N +       +N     YS DL  E A+  I +H K +P F  L +   HA         
Sbjct: 172 NEEKVLFPENENNACKTYSQDLIHEQALKFIRDH-KEQPFFAMLTYTLPHAELNLPHDSI 230

Query: 226 ----GNTYE--PFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNVIAALRKHGMLEN 279
                N++E  P+    ++V    + S+    ++A MVSRLDE VG+V+A L++ G+ +N
Sbjct: 231 YKMYENSFEETPYIGKFDKVYGGYNTSEKPLASFAAMVSRLDEYVGDVMAELKELGLDKN 290

Query: 280 SIVLFMADNGAPSFGIHSNKGSN-------HPLRGMKSTPWDGGMRGVAAIWSPWLKQTQ 332
           +IV+F +DNG      H   G++        P RG+K   ++GG+R     W P    T 
Sbjct: 291 TIVIFTSDNGP-----HHEGGADPDFFKSYGPFRGIKRDVYEGGIRIPMVAWCP---GTI 342

Query: 333 KVSSELFHIS---DWLPTLCAAAGIEINDTSLDGVN 365
           K  ++  HIS   D +PTL    G  + +  +DG++
Sbjct: 343 KAGAQSDHISAFWDVMPTLAELTGTVLPE-QMDGIS 377


>gi|114221|sp|P15289.3|ARSA_HUMAN RecName: Full=Arylsulfatase A; Short=ASA; AltName:
           Full=Cerebroside-sulfatase; Contains: RecName:
           Full=Arylsulfatase A component B; Contains: RecName:
           Full=Arylsulfatase A component C; Flags: Precursor
 gi|28858|emb|CAA36399.1| arylsulfatase A [Homo sapiens]
 gi|28860|emb|CAA36398.1| arylsulphatase a [Homo sapiens]
 gi|30016943|gb|AAP03431.1| arylsulfatase A [Homo sapiens]
 gi|189054653|dbj|BAG37503.1| unnamed protein product [Homo sapiens]
          Length = 507

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 19  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 78

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  + + L   GY T   GKWHLG   E  + P  
Sbjct: 79  GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 138

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 139 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 198

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 199 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 247

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+   G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 248 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 306

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 307 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 364

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             K+ R  +       D  +  + A+R    K    T  +  SD
Sbjct: 365 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 407


>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
 gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
          Length = 477

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 126/374 (33%), Positives = 178/374 (47%), Gaps = 40/374 (10%)

Query: 29  KPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQALCTPSRSALMTGK 88
           KP+ +II  DDLG+ D+   GS    TPN++A+A  G+ L   Y   +C+PSR+ALMTG 
Sbjct: 30  KPNFVIINIDDLGYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGC 89

Query: 89  YPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGF 148
           YP        V+  G   G+   E  + + +KE GYAT  IGKWHLG  +  + PT +GF
Sbjct: 90  YPKRALTIPHVLFPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGF 148

Query: 149 DSHYGY---------WQGLQDYYDHSC---KATFEPYQGLDMRHNMQVDNKTIGIYSTDL 196
           D +YG            G++  Y       K   +P   L +  N  V  + +    T+L
Sbjct: 149 DYYYGLPYSNDMGPAADGVKSNYGAPIPQRKGKGQP--PLPLLRNETVLQRVLAKDQTEL 206

Query: 197 ---YTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
              YTE AI  I +H + KP FLYL H AVH      P    D    K           Y
Sbjct: 207 VTNYTEEAIQFIRDH-QEKPFFLYLPHSAVHF-----PMYPGDAFRGKN------SHGLY 254

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPS-FGIHSNKGSNHPLRGMKSTP 312
              V  +D SVG V+ AL+  G+ + ++V+F +DNG  + FG       N PLR  K+T 
Sbjct: 255 NDWVEEVDWSVGQVLQALKDLGLDQRTLVIFTSDNGGQTRFG-----AVNKPLRAGKATT 309

Query: 313 WDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCA-AAGIEINDTSLDGVNQWDVLT 371
           ++GGMR    +  P        S  +  + D LPTL   A G    D  +DG +   +L 
Sbjct: 310 YEGGMRVPTIVRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGGTTPTDRKIDGADIGPIL- 368

Query: 372 KGAKTKRSEILHNI 385
             A  K ++  H++
Sbjct: 369 --AGVKEAKSPHDV 380


>gi|313569791|ref|NP_000478.3| arylsulfatase A isoform a precursor [Homo sapiens]
 gi|313569793|ref|NP_001078894.2| arylsulfatase A isoform a precursor [Homo sapiens]
 gi|313569795|ref|NP_001078895.2| arylsulfatase A isoform a precursor [Homo sapiens]
 gi|313569797|ref|NP_001078896.2| arylsulfatase A isoform a precursor [Homo sapiens]
 gi|1399961|gb|AAB03341.1| arylsulfatase A [Homo sapiens]
 gi|47678297|emb|CAG30269.1| ARSA [Homo sapiens]
 gi|109451022|emb|CAK54372.1| ARSA [synthetic construct]
 gi|109451600|emb|CAK54671.1| ARSA [synthetic construct]
 gi|220983390|dbj|BAH11167.1| arylsulfatase A [Homo sapiens]
 gi|261857504|dbj|BAI45274.1| arylsulfatase A [synthetic construct]
          Length = 509

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 125/404 (30%), Positives = 190/404 (47%), Gaps = 30/404 (7%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQ-ALCTPSRSALMT 86
           + P+I++I ADDLG+ D+  +G     TPN+D LA  GL     YV  +LCTPSR+AL+T
Sbjct: 21  RPPNIVLIFADDLGYGDLGCYGHPSSTTPNLDQLAAGGLRFTDFYVPVSLCTPSRAALLT 80

Query: 87  GKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFRE-VYTPTF 145
           G+ P+ +GM  GV++     GLPL E  + + L   GY T   GKWHLG   E  + P  
Sbjct: 81  GRLPVRMGMYPGVLVPSSRGGLPLEEVTVAEVLAARGYLTGMAGKWHLGVGPEGAFLPPH 140

Query: 146 RGFDSHYG--YWQGLQDYYDHSCKATFEPYQG--------LDMRHNMQVDNKTIGIYSTD 195
           +GF    G  Y        + +C     P  G        + +  N+ V+ +   +   +
Sbjct: 141 QGFHRFLGIPYSHDQGPCQNLTCFPPATPCDGGCDQGLVPIPLLANLSVEAQPPWLPGLE 200

Query: 196 L-YTEAAINVIAE-HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTY 253
             Y   A +++A+   + +P FLY A    H    Y  F         F + S   R  +
Sbjct: 201 ARYMAFAHDLMADAQRQDRPFFLYYASHHTH----YPQFSG-----QSFAERSG--RGPF 249

Query: 254 AGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPW 313
              +  LD +VG ++ A+   G+LE ++V+F ADNG  +  + S  G +  LR  K T +
Sbjct: 250 GDSLMELDAAVGTLMTAIGDLGLLEETLVIFTADNGPETMRM-SRGGCSGLLRCGKGTTY 308

Query: 314 DGGMRGVA-AIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIEINDTSLDGVNQWDVLTK 372
           +GG+R  A A W   +     V+ EL    D LPTL A AG  + + +LDG +   +L  
Sbjct: 309 EGGVREPALAFWPGHI--APGVTHELASSLDLLPTLAALAGAPLPNVTLDGFDLSPLLLG 366

Query: 373 GAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVAGTDNNGQSD 416
             K+ R  +       D  +  + A+R    K    T  +  SD
Sbjct: 367 TGKSPRQSLFFYPSYPDEVRGVF-AVRTGKYKAHFFTQGSAHSD 409


>gi|449132445|ref|ZP_21768528.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448888347|gb|EMB18667.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 478

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 115/358 (32%), Positives = 176/358 (49%), Gaps = 47/358 (13%)

Query: 26  APKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPSRSAL 84
           A + P+ ++I ADDLG+ D+S + SS + TP++DALA  G      +V A +C+PSR+AL
Sbjct: 39  ADRPPNFVLIFADDLGYGDISCYDSSGVKTPHLDALAEEGFRSTDFFVSANVCSPSRAAL 98

Query: 85  MTGKYPIHIGM---QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVY 141
           +TG+YP+  GM   ++  + + + +G    E  +P+ L  AGY +  +GKWHLG   E  
Sbjct: 99  LTGRYPMRCGMPVARNENVAKYQDYGFAPEELTIPELLHPAGYRSLMVGKWHLGMEVEGS 158

Query: 142 TPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYS------TD 195
            P   GFD ++G              + +EP +G +  HN     K +   +      T 
Sbjct: 159 HPLDAGFDEYFG------------IPSNYEPRRGKN--HNTLYRGKAVEQENVPCEVLTK 204

Query: 196 LYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAG 255
            YT+  I  I +  +  P F+Y++H  VH  N   P +        F+  S  E+  Y  
Sbjct: 205 RYTDEVIEFI-DRQQDDPFFIYVSHHIVH--NPLRPSK-------DFVGTS--EKGKYGD 252

Query: 256 MVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDG 315
            +  LD S G ++ A+R  G+ EN++V+F +DNG       +  GS   L G K    +G
Sbjct: 253 FIKELDHSTGRIMQAIRDAGIDENTLVVFTSDNGP------TRNGSAGGLNGGKYCTMEG 306

Query: 316 GMRGVAAI--WSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDGVNQWDVL 370
           G R V  I  W+  +   Q VS       D LP  C  AG++  +D  LDG N   +L
Sbjct: 307 GHR-VPGIFRWAGKIAPNQ-VSDATLTSMDLLPLFCGLAGVQPPDDRKLDGKNILPIL 362


>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 112/362 (30%), Positives = 169/362 (46%), Gaps = 25/362 (6%)

Query: 40  LGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMTGKYPIHIGM--- 95
           +GW D+  +G     TPN+D +A  G++    Y    LC+PSR+AL+TG+ PI  G    
Sbjct: 47  MGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTT 106

Query: 96  ----QHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPTFRGFDSH 151
               ++    +    G+P +E+LLP+ LK+AGY +  +GKWHLG  R  + P   GFD  
Sbjct: 107 NAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLGH-RPQFHPLKHGFDEW 165

Query: 152 YGYWQGLQDYYDHSCKATFEPYQGLDM----RHNMQVDNKTIGIYSTDLYTEAAINVIAE 207
           +G        YD+  +     Y+  +M         ++ KT     T +Y + A++ I  
Sbjct: 166 FGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANLTQIYLQEALDFIKR 225

Query: 208 HNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDISDPERRTYAGMVSRLDESVGNV 267
             +  P FLY A  A HA         P      FL  S  +R  Y   V  +D+S+G +
Sbjct: 226 QARHHPFFLYWAVDATHA---------PVYASKPFLGTS--QRGRYGDAVREIDDSIGKM 274

Query: 268 IAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKSTPWDGGMRGVAAIWSPW 327
           +  L+   + +N+ V F +DNGA         GSN P    K T ++GGMR  A  W P 
Sbjct: 275 LELLQDLHVADNTFVFFTSDNGAALISAPEQGGSNGPFLCGKQTTFEGGMREPALAWWPG 334

Query: 328 LKQTQKVSSELFHISDWLPTLCAAAGI-EINDTSLDGVNQWDVLTKGAKTKRSEILHNID 386
                +VS +L  I D   T  A AG+   +D ++DG+N    L +G    R    +  D
Sbjct: 335 HVAAGQVSHQLGSIMDLFTTSLALAGLTPPSDRAIDGLNLLPTLLQGRLMDRPIFYYRGD 394

Query: 387 NV 388
            +
Sbjct: 395 TL 396


>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
 gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
          Length = 463

 Score =  158 bits (400), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 122/404 (30%), Positives = 189/404 (46%), Gaps = 47/404 (11%)

Query: 28  KKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHY-VQALCTPSRSALMT 86
           +KP++++IL DDLGW D+S +GS  I TP++D LA +G+     Y   ++C+P+R  L T
Sbjct: 47  RKPNLLVILGDDLGWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYT 106

Query: 87  GKYPIHI--GMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAIGKWHLGFFREVYTPT 144
           G+YP  +  G+   +    +  G+PL    L   LKE GYAT   GKWH G+    Y+PT
Sbjct: 107 GRYPGRLAGGLPEPISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCGYL-PWYSPT 165

Query: 145 FRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDNKTIGIYSTDLYTEAAINV 204
             G+D  +G + G  DY+          + G    +  +V+ + +  Y T++ TE A++ 
Sbjct: 166 RVGWDEFFGNFSGGLDYFSKIN------HNGAHDLYEDEVEVEDLRYY-TEIVTERAVSF 218

Query: 205 IAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDE--------------EVAKFLDISDPER 250
           I E +   P  L L     H      P++ P++              E            
Sbjct: 219 I-ERDHDTPWLLNLNFTTPHW-----PWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSL 272

Query: 251 RTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPLRGMKS 310
            TY  MV  LD+S+G VI AL + G L +++V F +DNG   F       +  P  G KS
Sbjct: 273 ATYKTMVEDLDQSIGQVIKALERTGQLRDTVVFFASDNGGERF------SNVWPFTGAKS 326

Query: 311 TPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAG-IEINDTSLDGVNQWDV 369
              +GG+R    +  P + + ++VS E    +DW  T    AG     D  LDG +   +
Sbjct: 327 GLNEGGIRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAGATPAEDYPLDGASLVPL 386

Query: 370 LTKGAKTKRSEILHNIDNVDNPQKYYAALRVDDLKYVA--GTDN 411
           L  G   +  ++   +       +   ALR   LKY+   GTD+
Sbjct: 387 LFGGRPRETQDLFWRM-------RGQRALRRGSLKYLQLDGTDH 423


>gi|417300776|ref|ZP_12087967.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542927|gb|EGF29380.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 450

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 174/359 (48%), Gaps = 37/359 (10%)

Query: 10  ALTCTLLFNDAFLNTTAPKKPHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILN 69
           +L C +L         A  K +I++ L DD G+ D+   GS  + TPNID L   G+   
Sbjct: 17  SLICAVLSAWPTSVVEATDKLNIVVFLTDDQGYGDLGCFGSDSLETPNIDRLCKQGMKFT 76

Query: 70  QHYVQALCTPSRSALMTGKYPIHIGMQHGVILEGEPWGLPLTEKLLPQYLKEAGYATHAI 129
             YV   C+P+R A MTG +    G    VI   +  G+   E   P+ LK AGY T  +
Sbjct: 77  DFYVHQRCSPTRLAFMTGSHAHRAGCTK-VIYNRDRIGIHAGEVTTPELLKTAGYTTGIV 135

Query: 130 GKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQ----VD 185
           GKWHLG + + + PT  GFD  YG+   L               QG  +  N++    + 
Sbjct: 136 GKWHLGEW-DAFNPTRHGFDFFYGFMIDLD--------------QGTGIYRNLKRIESIK 180

Query: 186 NKTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAGNTYEPFQAPDEEVAKFLDI 245
           +KT G +S  L  +AAI  I E NK +P FLY A    H      P+  P+E   +F   
Sbjct: 181 HKTDGKHSPKLL-DAAIGFIKE-NKERPFFLYYASPLPHT-----PW-IPNE---RFKGT 229

Query: 246 SDPERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAPSFGIHSNKGSNHPL 305
           S   R TY  ++  +D  VG ++ AL KH + EN++ +F +DNG P  GI  N G   P 
Sbjct: 230 S--RRSTYGDVIREIDWQVGELMNALDKHEIAENTLFIFASDNG-PVLGI--NGGDAGPY 284

Query: 306 RGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTLCAAAGIE-INDTSLDG 363
           R  K T ++GG+R    +  P   +    +S++  I D LPT CA A ++   D  +DG
Sbjct: 285 RDGKWTDFEGGIRVPCIMRWPGTIEPGSTNSQITGIIDLLPTFCAMAEVDPPGDRVIDG 343


>gi|87309613|ref|ZP_01091747.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87287377|gb|EAQ79277.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 496

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 131/439 (29%), Positives = 201/439 (45%), Gaps = 62/439 (14%)

Query: 23  NTTAPKK-PHIIIILADDLGWNDVSFHGSSQIPTPNIDALAYNGLILNQHYVQA-LCTPS 80
           N+  PK+ P+I+  L DDLGW D+  +GSS   TPN+D LA +G+     Y    +C+P+
Sbjct: 35  NSAEPKRTPNIVFFLVDDLGWKDIGVYGSSFYHTPNVDGLAASGMRFTNAYAACQVCSPT 94

Query: 81  RSALMTGKYPIHIGM------------QHGVILEGEPWG--LPLTEKLLPQYLKEAGYAT 126
           R+++MTGKYP  +G+            +    L   P+   L L E  L + LK+ GYAT
Sbjct: 95  RASIMTGKYPQRVGITDYIGAAQPDKWKRNTPLLPAPYQTRLALEETTLAEALKQRGYAT 154

Query: 127 HAIGKWHLGFFREVYTPTFRGFDSHYGYWQGLQDYYDHSCKATFEPYQGLDMRHNMQVDN 186
              GKWHLG   E   P  +GFD + G   G+     +  K  F PY       N ++ +
Sbjct: 155 FFAGKWHLG--PEGNWPEDQGFDVNIG---GIDRGGPYGGKKYFSPY------GNPRLTD 203

Query: 187 KTIGIYSTDLYTEAAINVIAEHNKSKPMFLYLAHLAVHAG-NTYEPFQAPDEEVAKFLDI 245
              G +  D      +  I +H + +P   YL+  +VH      E  +   +E+ + +  
Sbjct: 204 GPDGEHLPDRLASETVKFIEQH-QDQPFLAYLSFYSVHTPLMAREDLKQKYDEIKQRIRF 262

Query: 246 SDP--------------ERRTYAGMVSRLDESVGNVIAALRKHGMLENSIVLFMADNGAP 291
           + P              E   YAGMV  +D +VG V+ AL +  + +N++V+F +DNG  
Sbjct: 263 AGPIWGEEGKSKLRLVQEHSVYAGMVEAMDAAVGKVLDALDRLKLTDNTLVIFTSDNG-- 320

Query: 292 SFGIHSNKG---SNHPLRGMKSTPWDGGMRGVAAIWSPWLKQTQKVSSELFHISDWLPTL 348
             G+ +++G   SN PLRG K   ++GG+R    +  P +      S  L    D+LPT+
Sbjct: 321 --GLSTSEGHPTSNLPLRGGKGWMYEGGIREPLVVRYPGVTSPGSESDALVTSPDFLPTI 378

Query: 349 CAAAGIEINDTSLDGVNQWDVLTKGAKTKRSEILHNIDNVDNPQKY-YAALRVDDLKYVA 407
            A      +    DGV+    L +G    R  I  +  +  N      AA+R  D K + 
Sbjct: 379 LAVVDKPGDKIDTDGVSIISAL-EGKPLDRGPIFWHYPHYGNQGGSPTAAVREGDWKLI- 436

Query: 408 GTDNNGQSDEWYGDTDNEI 426
                    EWY D   E+
Sbjct: 437 ---------EWYEDGKVEL 446


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.318    0.135    0.420 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,071,010,655
Number of Sequences: 23463169
Number of extensions: 448464816
Number of successful extensions: 1007133
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 5213
Number of HSP's successfully gapped in prelim test: 4571
Number of HSP's that attempted gapping in prelim test: 967143
Number of HSP's gapped (non-prelim): 20612
length of query: 593
length of database: 8,064,228,071
effective HSP length: 148
effective length of query: 445
effective length of database: 8,886,646,355
effective search space: 3954557627975
effective search space used: 3954557627975
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)