BLASTP 2.2.26 [Sep-21-2011]


Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer, 
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997), 
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs",  Nucleic Acids Res. 25:3389-3402.


Reference for compositional score matrix adjustment: Altschul, Stephen F., 
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.

Query= psy12301
         (632 letters)

Database: nr 
           23,463,169 sequences; 8,064,228,071 total letters

Searching..................................................done



>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 567

 Score =  651 bits (1679), Expect = 0.0,   Method: Compositional matrix adjust.
 Identities = 313/553 (56%), Positives = 405/553 (73%), Gaps = 37/553 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWNDLSFHGS+EIPTPNIDALA+NGI+LNN+Y QPVCTPSR +LMTGKYPI  GMQGPP 
Sbjct: 39  GWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPT 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +GAEP G+PL+E+ LPEYLRELGY+T+AIGKWHLGF+++ YTP  RGF+SHFGY  G +S
Sbjct: 99  YGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVS 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD++L D Y    E  G DMRRN + AWD VG+YATD+FT EAV+LI++QP ++PLF+Y
Sbjct: 159 YYDYLLQDVYQNFGEFQGFDMRRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMY 218

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH+A H GN GK+LEAPQ  +N+F +I DPNRR YAAMV KLD+SVG V+ AL  K ML
Sbjct: 219 LAHVAVHTGNRGKYLEAPQSEVNKFNHILDPNRRIYAAMVSKLDESVGRVVEALTEKKML 278

Query: 332 ENSIIIFMSDNGAPTVE-----YR-ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +N+II+FMSDNG+P+ +     +R E     NWGSN+PYRG+KNTLWEGGVK  + +W+P
Sbjct: 279 QNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFPYRGIKNTLWEGGVKSASFIWAP 338

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
             Q+NPRVS Q+MHI+DWLPTLY+AAGG+   LP N+DG+DQW+SL L            
Sbjct: 339 YFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLDGVDQWTSLTL------------ 386

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
                    N PS+RN VL+NIDEK+R A +  D+WKL++G+  NGT+DG++G TR  + 
Sbjct: 387 ---------NLPSQRNFVLLNIDEKQRYAGILKDNWKLIVGSTANGTLDGFFGATRP-RT 436

Query: 506 PLLNFNAIVESKTYQSLQQLSQNI------FLPISNIDKMRSTRQQATIHCGANPA-PMT 558
           P  N  A++ S   ++L +LS ++      F P+S++ ++ + R ++TI C  + A P T
Sbjct: 437 P-YNATAVLYSPAGRALTKLSDSLPFATAGFGPMSSVRELLAMRYESTIRCNPSAAGPRT 495

Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
            SPC  G   LF+L  DPCE NN+A     +S QLYE LKY+RR LVPQ++ +P    A+
Sbjct: 496 QSPCPAGEACLFDLVADPCETNNVAKKYVAVSGQLYEALKYYRRLLVPQTN-RPFDPAAN 554

Query: 619 PKRFNDTWSPWIY 631
           P RFN+TWS W+Y
Sbjct: 555 PARFNNTWSTWMY 567



 Score =  119 bits (297), Expect = 7e-24,   Method: Compositional matrix adjust.
 Identities = 51/80 (63%), Positives = 66/80 (82%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN + AWD VG+YATD+FT EAV+LI++QP ++PLF+YLAH+A H GN GK+LEAPQ 
Sbjct: 179 MRRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQS 238

Query: 61  TINQFQYITDPNRRTYAALT 80
            +N+F +I DPNRR YAA+ 
Sbjct: 239 EVNKFNHILDPNRRIYAAMV 258


>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 610

 Score =  564 bits (1454), Expect = e-158,   Method: Compositional matrix adjust.
 Identities = 287/554 (51%), Positives = 378/554 (68%), Gaps = 46/554 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWNDLSFHGS++I TPN+DALAYNG+ILN+ Y  PVCTPSRA+LMTG YPIH GMQG P+
Sbjct: 52  GWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMYPIHNGMQGLPL 111

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             +EPR +P   + LP YL++LGY+T+ +GKWHLG++++E+TP YRGF+SH GY NG++S
Sbjct: 112 EASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFDSHLGYWNGIVS 170

Query: 212 YYDHILSDQYSRT--VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           YYD+IL +  +R     LNG DMRRN++ A+D  G YAT++FT EAV LI     + PLF
Sbjct: 171 YYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAVHLIRSHNKNTPLF 230

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LY++HLA HAGN GK LEAPQE IN+F +I DPNRRT+A MV+KLD+SVG +I+AL+   
Sbjct: 231 LYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDESVGRLIAALEESN 290

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML NSII+F +DNGAPT+       Y NWGSNYP RG+K TLWEGG +  + +WSP I+ 
Sbjct: 291 MLHNSIIVFSADNGAPTI-----GGYPNWGSNYPLRGIKETLWEGGTRGVSFIWSPLIKN 345

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
            PRVS +++HISDWLPTLY AAGG+   +  N+DG++QW                     
Sbjct: 346 TPRVSNELIHISDWLPTLYAAAGGNMENVK-NLDGVNQW--------------------- 383

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDS----WKLVLGTQENGTMDGYYGQTRSNKV 505
            S+  N PS RN +L+NIDE  +TAA+R++S    WKLV+GT  NGT D Y+G+  +NK 
Sbjct: 384 PSISENLPSERNEILLNIDEILQTAAIRMNSRKYHWKLVVGTVNNGTYDEYFGEG-TNKF 442

Query: 506 PLLN----FNAIVESKTYQSLQQLSQNIFL--PISNIDKMRSTRQQATIHCG---ANPAP 556
           P  N    FNA+  S+ Y++L++L+Q   L  P S  +KM+  R+ ATI C     NP  
Sbjct: 443 PENNPPYDFNAVANSQVYRTLERLAQKRLLTYPTSK-EKMKVLRRDATIRCDYKIINPNR 501

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
             PS   N  C L+++  DPCE NN+    P+++S L+  L  HR +LVPQ  ++     
Sbjct: 502 CNPSDGPNQVC-LYDIKRDPCELNNLTPFYPNVASYLFRALVQHRESLVPQESKELQQDL 560

Query: 617 ADPKRFNDTWSPWI 630
           ADP  FN TW+PWI
Sbjct: 561 ADPVNFNGTWTPWI 574



 Score =  107 bits (267), Expect = 2e-20,   Method: Compositional matrix adjust.
 Identities = 48/81 (59%), Positives = 61/81 (75%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN++ A+D  G YAT++FT EAV LI     + PLFLY++HLA HAGN GK LEAPQE
Sbjct: 193 MRRNITPAYDLQGRYATEMFTDEAVHLIRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQE 252

Query: 61  TINQFQYITDPNRRTYAALTK 81
            IN+F +I DPNRRT+A + +
Sbjct: 253 AINKFLHIADPNRRTFAGMVQ 273


>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 536

 Score =  509 bits (1310), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 257/540 (47%), Positives = 352/540 (65%), Gaps = 34/540 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAYNGIILN+ Y+Q   TPSRA+L+TGKYP+  G+QGP I
Sbjct: 19  GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 78

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AE R +P   + + EY +++GY+T  +GKWHLG  R   TP +RGF+  FG+ NG  S
Sbjct: 79  TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 137

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD++ S+      E +G D+RR+   +W+  G+YATDLF + AV +I+   V+ PLF+ 
Sbjct: 138 YYDYV-SNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMM 196

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +AHLA H GN GK LEAPQET+N+F++I DPNRRTYAAMV KLDDS+G V  AL+ K ML
Sbjct: 197 IAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALEAKNML 256

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+I++F+SDNGAPTV       + NWGSNYP RG+K+TL+EGGV+  A +WSP + Q+ 
Sbjct: 257 QNTIVVFISDNGAPTV-----GPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 311

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS  ++HI+DWLPTL+TA GGD S L  ++DG+DQWS                     S
Sbjct: 312 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWS---------------------S 350

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L+ + PS RN + +NIDEK R AA+R   WKL++GT  NG+ +GY+G   +  +    +N
Sbjct: 351 LVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYN 410

Query: 512 --AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             AI +S   +  ++++ N   P+S  D     R+ AT+ C    A   P    +G   L
Sbjct: 411 TSAINDSPVGRIAKKINYN---PLSETD-FDGLRRVATLKCLDAKAKRNPCDPASGAVCL 466

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           +N+ NDPCE+N++A   P +  ++   L  +R+ LVPQ     D+  ADPK F   W+PW
Sbjct: 467 YNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAWNPW 526



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 65/91 (71%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +RR+   +W+  G+YATDLF + AV +I+   V+ PLF+ +AHLA H GN GK LEAPQE
Sbjct: 157 LRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQE 216

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
           T+N+F++I DPNRRTYAA+      ++  V+
Sbjct: 217 TVNKFKHIRDPNRRTYAAMVSKLDDSIGAVF 247


>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
          Length = 558

 Score =  508 bits (1309), Expect = e-141,   Method: Compositional matrix adjust.
 Identities = 257/540 (47%), Positives = 352/540 (65%), Gaps = 34/540 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAYNGIILN+ Y+Q   TPSRA+L+TGKYP+  G+QGP I
Sbjct: 41  GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AE R +P   + + EY +++GY+T  +GKWHLG  R   TP +RGF+  FG+ NG  S
Sbjct: 101 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 159

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD++ S+      E +G D+RR+   +W+  G+YATDLF + AV +I+   V+ PLF+ 
Sbjct: 160 YYDYV-SNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMM 218

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +AHLA H GN GK LEAPQET+N+F++I DPNRRTYAAMV KLDDS+G V  AL+ K ML
Sbjct: 219 IAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALEAKNML 278

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+I++F+SDNGAPTV       + NWGSNYP RG+K+TL+EGGV+  A +WSP + Q+ 
Sbjct: 279 QNTIVVFISDNGAPTV-----GPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 333

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS  ++HI+DWLPTL+TA GGD S L  ++DG+DQWS                     S
Sbjct: 334 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWS---------------------S 372

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L+ + PS RN + +NIDEK R AA+R   WKL++GT  NG+ +GY+G   +  +    +N
Sbjct: 373 LVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYN 432

Query: 512 --AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             AI +S   +  ++++ N   P+S  D     R+ AT+ C    A   P    +G   L
Sbjct: 433 TSAINDSPVGRIAKKINYN---PLSETD-FDGLRRVATLKCLDAKAKRNPCDPASGAVCL 488

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           +N+ NDPCE+N++A   P +  ++   L  +R+ LVPQ     D+  ADPK F   W+PW
Sbjct: 489 YNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAWNPW 548



 Score =  104 bits (260), Expect = 1e-19,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 65/91 (71%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +RR+   +W+  G+YATDLF + AV +I+   V+ PLF+ +AHLA H GN GK LEAPQE
Sbjct: 179 LRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQE 238

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
           T+N+F++I DPNRRTYAA+      ++  V+
Sbjct: 239 TVNKFKHIRDPNRRTYAAMVSKLDDSIGAVF 269


>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 713

 Score =  504 bits (1299), Expect = e-140,   Method: Compositional matrix adjust.
 Identities = 263/563 (46%), Positives = 346/563 (61%), Gaps = 54/563 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPNIDALAYNGIILNN Y   +CTPSR++LMTGK PIH GMQ   +
Sbjct: 83  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 142

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  EPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+F++EYTP YRGF+SHFGY NG+  
Sbjct: 143 FPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 202

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI  +      E  G DMRRNL+ AWDTVG+Y+TDLFT EA++LI +   D+P+FLY
Sbjct: 203 YYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFLY 262

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAHLA H GN  +   AP E I +F YI DP RR  AA+V KLD SVG V+ AL+ +GML
Sbjct: 263 LAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGDVMDALRNRGML 322

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ENSI++FMSDNGAP+          N GSNYP RG+KN+ WEGG +  A +WSP I+++ 
Sbjct: 323 ENSIVLFMSDNGAPS-----QGILNNKGSNYPLRGIKNSPWEGGTRGVAAIWSPLIKKSK 377

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS QMM ISDWLPTL +AAG +  +L                       NIDG D WS+
Sbjct: 378 RVSNQMMFISDWLPTLLSAAGVNRKQL----------------------GNIDGFDLWSA 415

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT---------MDGYYGQTRS 502
           L+LN  S R+ V++NID+    A+ R  ++K ++G  E G+          +G   Q   
Sbjct: 416 LVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRTETGSDWLGASGDPSEGISPQYDP 475

Query: 503 NKVPL----LNFNAIVESKTYQSLQQ------------LSQNIFLPISNIDKMRSTRQQA 546
            K+      +  + I+ +K    L++            L       I +++ +   R++A
Sbjct: 476 YKILYSKTGVAISGIITAKQAMELKERRKRSIKIYDTALETKFEEKILSVENISELRRKA 535

Query: 547 TIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
            I C        P      PC LFN+  DPCE  N+A  RP I + L  +L  +R T +P
Sbjct: 536 QIKCNVQEKDRIPCEPMTAPC-LFNIEKDPCEMVNLAERRPVIMAILERILMKYRITAIP 594

Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
            S+   D  +ADP  +N+TW+ W
Sbjct: 595 ASNLDGD-PRADPTLWNNTWTSW 616



 Score =  106 bits (265), Expect = 4e-20,   Method: Compositional matrix adjust.
 Identities = 47/80 (58%), Positives = 58/80 (72%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL+ AWDTVG+Y+TDLFT EA++LI +   D+P+FLYLAHLA H GN  +   AP E
Sbjct: 223 MRRNLTVAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFLYLAHLAVHKGNEEQLFRAPDE 282

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F YI DP RR  AA+ 
Sbjct: 283 EIAKFSYILDPERRIQAAVV 302


>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 671

 Score =  502 bits (1293), Expect = e-139,   Method: Compositional matrix adjust.
 Identities = 266/564 (47%), Positives = 350/564 (62%), Gaps = 55/564 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPNIDALAYNGIILNN Y   +CTPSR++LMTGK PIH GMQ   +
Sbjct: 39  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+F++EYTP YRGF+SHFGY NG+  
Sbjct: 99  FPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI  +      E  G DMRRNL+ AWDTVG+Y+TDLFT EAV+LI +  +D+P+FLY
Sbjct: 159 YYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFLY 218

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAHLA H GN  +   AP E I +F YI DP RR  AA+V KLD SVG V+ AL+ +GML
Sbjct: 219 LAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGDVMDALRNRGML 278

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ENSI++FMSDNGAP+          N GSNYP RG+KN+ WEGG +  A +WSP I+++ 
Sbjct: 279 ENSIVVFMSDNGAPS-----QGLLSNEGSNYPLRGIKNSPWEGGTRGVAAIWSPLIKKSK 333

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS QMM +SDWLPTL +AAG D  +L  NIDG D W                     S+
Sbjct: 334 RVSNQMMFMSDWLPTLLSAAGVDRRQLG-NIDGYDLW---------------------SA 371

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM---------DGYYGQTRS 502
           L+LN  S R+ V++NID+    AA R   +K ++G  E G+          +G   Q   
Sbjct: 372 LVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRTETGSAWLGASGDPSEGISPQYDP 431

Query: 503 NKVPL----LNFNAIVESKTYQSLQQ-------------LSQNIFLPISNIDKMRSTRQQ 545
           +K+      +  + I+ +K    L++             L  N    I +++ +   R++
Sbjct: 432 HKILYSKTGVAISGIITAKQAMELKEERRKRSVKIYDTALKTNFEEKILSVENISELRRK 491

Query: 546 ATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
           A I C        P      PC LFN+  DPCE  N+A  RP I + L  +L  +R T++
Sbjct: 492 AQIKCNVKEKDRIPCEPMTAPC-LFNIEKDPCEMVNLAEKRPMIMAILERILMKYRLTVI 550

Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
           P S+   D  +A+P  +N+TW+ W
Sbjct: 551 PPSNLDGD-PRANPSLWNNTWTSW 573



 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 48/80 (60%), Positives = 59/80 (73%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL+ AWDTVG+Y+TDLFT EAV+LI +  +D+P+FLYLAHLA H GN  +   AP E
Sbjct: 179 MRRNLTVAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFLYLAHLAPHKGNEEQLFRAPDE 238

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F YI DP RR  AA+ 
Sbjct: 239 EIAKFSYILDPERRIQAAVV 258


>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 583

 Score =  491 bits (1264), Expect = e-136,   Method: Compositional matrix adjust.
 Identities = 249/540 (46%), Positives = 341/540 (63%), Gaps = 34/540 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+IPTPNIDALAY G++L N Y   +CTPSRA+L+TGKYPIH GMQ   I
Sbjct: 52  GWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKYPIHLGMQHEAI 111

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + +EPRG+PL E+ LP+YL+++ Y T  +GKWHLG+++ EYTPLYRGF++HFGY NG+  
Sbjct: 112 FPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFDTHFGYWNGLQD 171

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H  ++ Y+  +   G DMRRN + AWDT+G+Y+ DL+T EAV+LI     D P+FLY
Sbjct: 172 YYSHKTAEPYTLNI---GMDMRRNFTVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLY 228

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LA +A HAGNA +  +A  E I +F YI DP R+ YAA+V KLD+SVG V+ AL+ + ML
Sbjct: 229 LAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESVGKVVEALKNRNML 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NSII+FMSDNGAPT  +       N GSNYP+RG+K TLWEGGV+  A +WSP I++  
Sbjct: 289 SNSIILFMSDNGAPTSGF-----LSNGGSNYPFRGIKKTLWEGGVRGAAAIWSPLIKKRE 343

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS  +MH++DWLPTL +AAGG+   +                        IDGLD W +
Sbjct: 344 RVSYHLMHVTDWLPTLLSAAGGNKEEI----------------------GTIDGLDMWET 381

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG-TQENGTMDGYYGQTRSNKVPLLNF 510
           ++    S R  VLINID+   + A+R+  +K V G +++N    G  G+  S + P  + 
Sbjct: 382 IVYGDCSPRKEVLINIDDICDSEAIRIGKYKYVRGESKKNNDWVGDSGRNSSEQRPPYSP 441

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
             ++ SK   ++  L  N  + +++  KM   RQQA +HC               PC LF
Sbjct: 442 ETVLNSKVGIAIASLPGNNNVALTSA-KMLELRQQAEVHCNVTEEEKVQCDTQLAPC-LF 499

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           +L  DPCE  NI    P ++ +L E+L  HR T++  ++   D V+A P  +N TW  W+
Sbjct: 500 DLERDPCEMINIIKQEPLVAKELQEVLLQHRSTMLLPNNRIID-VRASPILWNCTWVCWM 558



 Score = 95.1 bits (235), Expect = 1e-16,   Method: Compositional matrix adjust.
 Identities = 43/80 (53%), Positives = 56/80 (70%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN + AWDT+G+Y+ DL+T EAV+LI     D P+FLYLA +A HAGNA +  +A  E
Sbjct: 189 MRRNFTVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLYLAQIAPHAGNANQLPQALPE 248

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F YI DP R+ YAA+ 
Sbjct: 249 EIEKFSYIIDPKRKRYAAVV 268


>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 689

 Score =  486 bits (1252), Expect = e-134,   Method: Compositional matrix adjust.
 Identities = 269/614 (43%), Positives = 366/614 (59%), Gaps = 53/614 (8%)

Query: 39  LYLAHLAAHAGNAG-KHLEAPQETINQFQYITDPNR--RTYAALTKSTTLTLLIV--YGW 93
           L+LA L  + G      L +P+  IN F  +  P +  +      +   + +++    GW
Sbjct: 13  LFLAELTVNVGAIFFSSLLSPKVKIN-FPSLGKPGKDPKHVDGCRQKPHIIIILADDLGW 71

Query: 94  NDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWG 153
           ND+SFHGS++IPTPNIDA+AYNGIILN+ Y   +CTPSR +LMTGK PIH GMQ   +  
Sbjct: 72  NDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKNPIHLGMQHSVLLP 131

Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
           +EPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+FRREYTP +RGF++HFGY NG+  YY
Sbjct: 132 SEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFDTHFGYWNGLQDYY 191

Query: 214 DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLA 273
            HI  +   +  E  G DMRRNL+ AWDT G+Y+TDLFT+EAV+LI +   D P+FLYLA
Sbjct: 192 THITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFLYLA 251

Query: 274 HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
           HLA H GN  + L A  E I +F YI DP RR  AA+V K+D SVG V+ AL+++GMLEN
Sbjct: 252 HLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSVGEVMDALRKRGMLEN 311

Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRV 393
           SI++FMSDNGAPT       +  N GSNYP RG+K++ WEGG +  A +WSP I+++ RV
Sbjct: 312 SIVLFMSDNGAPT-----QGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIWSPLIKESQRV 366

Query: 394 SLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL 453
           S QMM I+DWLPTL +AAG D+ +L                       NIDG D W +L 
Sbjct: 367 SNQMMFIADWLPTLLSAAGVDSKQL----------------------GNIDGFDLWPALR 404

Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM-DGYYGQTRSNKVPLLNFNA 512
            +  S R+ V++NID+  + A++R   +K V+G  E G    G  G       P  + + 
Sbjct: 405 SDKISPRSDVVLNIDDISKYASIRRGDFKYVIGETETGRAWVGATGAPSEGIPPKYDPDK 464

Query: 513 IVESKTYQSLQQL---------------SQNIFL--PISNIDKMRSTRQQATIHCGANPA 555
           ++ SK   ++  +               S  I     I + +K+   R  A + C     
Sbjct: 465 VLYSKAGVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAEKIFQLRNDARVQCNVKEE 524

Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
              P   T  PC LFN+  DPCE  N+A  RP I + +  +L  +R T VP S+   D  
Sbjct: 525 NRIPCDPTAAPC-LFNIEKDPCEMVNLADKRPLILAIMERVLMNYRLTTVPPSNVNGD-P 582

Query: 616 QADPKRFNDTWSPW 629
           +A+P  +N+TW+ W
Sbjct: 583 RANPALWNNTWTTW 596



 Score =  102 bits (254), Expect = 6e-19,   Method: Compositional matrix adjust.
 Identities = 72/203 (35%), Positives = 99/203 (48%), Gaps = 42/203 (20%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL+ AWDT G+Y+TDLFT+EAV+LI +   D P+FLYLAHLA H GN  + L A  E
Sbjct: 210 MRRNLTAAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFLYLAHLAPHKGNPNRLLRASDE 269

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN 120
            I +F YI DP RR  AA+      ++  V                  +DAL   G++ N
Sbjct: 270 DIARFSYILDPERRIQAAVVSKVDQSVGEV------------------MDALRKRGMLEN 311

Query: 121 N--MYAQPVCTPSRASLMT--GKYPIHTGMQGPPIWGAEPRGV-----PLTER------- 164
           +  ++      P++ SL      YP+  G++  P W    RGV     PL +        
Sbjct: 312 SIVLFMSDNGAPTQGSLSNQGSNYPLR-GIKDSP-WEGGTRGVAAIWSPLIKESQRVSNQ 369

Query: 165 ------FLPEYLRELGYSTKAIG 181
                 +LP  L   G  +K +G
Sbjct: 370 MMFIADWLPTLLSAAGVDSKQLG 392


>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
          Length = 648

 Score =  472 bits (1215), Expect = e-130,   Method: Compositional matrix adjust.
 Identities = 255/570 (44%), Positives = 347/570 (60%), Gaps = 50/570 (8%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           A TK   + +++    G+ND+ FHGSNEIPTPNIDALAYNG+ILN+ Y Q +CTPSR++ 
Sbjct: 18  AQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAF 77

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TGKYPIH GMQ   I   EP G+PL E  LP+YL+  GY+T AIGKWHLGFFR+EYTP 
Sbjct: 78  LTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPT 137

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           YRGF+SHFGY  G+  YY H +   ++     +G+DMRRN++  W   G+Y+T LFT EA
Sbjct: 138 YRGFDSHFGYWQGLQDYYKHTVHATFTPE---HGYDMRRNMTVDWSAQGKYSTTLFTDEA 194

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           V+LI +   + P+F+YLAHLA H+GN    L+AP E I +F +I DP RR YAAMV  LD
Sbjct: 195 VRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLD 254

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SVG+VI+AL+ K MLENSII+FMSDNGA     +    + N GSNYP RG KN+ WEG 
Sbjct: 255 KSVGSVIAALRDKHMLENSIIVFMSDNGA-----KPDGIHANHGSNYPLRGNKNSAWEGA 309

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++  A +WSP I++  RVS  +MHISDWLPT YTAAG + + LP                
Sbjct: 310 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP---------------- 353

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
                  +DG+D W+S+     S R  +L NIDE     A+R+ +WK + G+  NG  DG
Sbjct: 354 ------KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDG 407

Query: 496 YYGQTRSNKVPLLNFNAIVESK---------TYQSLQQLSQ---NIFLPISNIDKMRSTR 543
           +YG +  + +   + +A++ S+         TYQ +++  Q   N    + + + +++ R
Sbjct: 408 WYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLR 467

Query: 544 QQATIHC-GANPAPMTPSPCTNG---PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             A + C   N   +  S   N    PC LFN+  DPCEQ N+A+ RP I   +   L  
Sbjct: 468 GAAEVKCPRVNFEEIPESKKCNAVESPC-LFNIKEDPCEQINLAAERPMIVLNMEMALAR 526

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
            ++T +P     P    ADP ++N+TW  W
Sbjct: 527 FKQTALP-IRNVPRDPNADPAKWNNTWVNW 555



 Score = 92.8 bits (229), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 55/80 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN++  W   G+Y+T LFT EAV+LI +   + P+F+YLAHLA H+GN    L+AP E
Sbjct: 171 MRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDE 230

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F +I DP RR YAA+ 
Sbjct: 231 EIAKFGHIADPERRIYAAMV 250


>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
          Length = 646

 Score =  470 bits (1209), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 255/570 (44%), Positives = 345/570 (60%), Gaps = 52/570 (9%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           A TK   + +++    G+ND+ FHGSNEIPTPNIDALAYNG+ILN+ Y Q +CTPSR++ 
Sbjct: 18  AQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAF 77

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TGKYPIH GMQ   I   EP G+PL E  LP+YL+  GY+T AIGKWHLGFFR+EYTP 
Sbjct: 78  LTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPT 137

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           YRGF+SHFGY  G+  YY H +          +G+DMRRN++  W   G+Y+T LFT EA
Sbjct: 138 YRGFDSHFGYWQGLQDYYKHTV-----HFTPEHGYDMRRNMTVDWSAQGKYSTTLFTDEA 192

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           V+LI +   + P+F+YLAHLA H+GN    L+AP E I +F +I DP RR YAAMV  LD
Sbjct: 193 VRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLD 252

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SVG+VI+AL+ K MLENSII+FMSDNGA     +    + N GSNYP RG KN+ WEG 
Sbjct: 253 KSVGSVIAALRDKHMLENSIIVFMSDNGA-----KPDGIHANHGSNYPLRGNKNSAWEGA 307

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++  A +WSP I++  RVS  +MHISDWLPT YTAAG + + LP                
Sbjct: 308 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP---------------- 351

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
                  +DG+D W+S+     S R  +L NIDE     A+R+ +WK + G+  NG  DG
Sbjct: 352 ------KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDG 405

Query: 496 YYGQTRSNKVPLLNFNAIVESK---------TYQSLQQLSQ---NIFLPISNIDKMRSTR 543
           +YG +  + +   + +A++ S+         TYQ +++  Q   N    + + + +++ R
Sbjct: 406 WYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLR 465

Query: 544 QQATIHC-GANPAPMTPSPCTNG---PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
             A + C   N   +  S   N    PC LFN+  DPCEQ N+A+ RP I   +   L  
Sbjct: 466 GAAEVKCPRVNFEEIPESKKCNAVESPC-LFNIKEDPCEQINLAAERPMIVLNMEMALAR 524

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
            ++T +P     P    ADP ++N+TW  W
Sbjct: 525 FKQTALP-IRNVPRDPNADPAKWNNTWVNW 553



 Score = 92.8 bits (229), Expect = 5e-16,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 55/80 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN++  W   G+Y+T LFT EAV+LI +   + P+F+YLAHLA H+GN    L+AP E
Sbjct: 169 MRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDE 228

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F +I DP RR YAA+ 
Sbjct: 229 EIAKFGHIADPERRIYAAMV 248


>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 710

 Score =  468 bits (1203), Expect = e-129,   Method: Compositional matrix adjust.
 Identities = 256/569 (44%), Positives = 348/569 (61%), Gaps = 61/569 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPNIDALAYNG+ILN+ Y   +CTPSR++L+TGKYPIHTGMQ   I
Sbjct: 55  GWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKYPIHTGMQHLVI 114

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRG+PL E+ LP+YL+E GY+T AIGKWH GF RREYTP YRGF+SHFGY  G+  
Sbjct: 115 LEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGYWQGLQD 174

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
           YY H +     +   L G DMRRN+S A DT G+Y+TDLFT EAV+LIE+ +P   P+FL
Sbjct: 175 YYTHEVGSSNPKEGFL-GFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFL 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLAHLA H+GN  + L+AP E + +F Y+ DP RR YAAM+ KLD SVG V+SAL+RK M
Sbjct: 234 YLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGEVVSALRRKNM 293

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+NSI++FM+DNGA T        + N GSNYP RG+K + WEG V+  A +WSP IQ+ 
Sbjct: 294 LQNSIVVFMADNGAAT-----QGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSPLIQRP 348

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            R+  ++M I+DWLPTL +A+G                    L    R ++NIDG+DQW 
Sbjct: 349 KRIYNELMSIADWLPTLLSASG--------------------LRDVVRVSANIDGVDQWP 388

Query: 451 SLLLNTPSR-RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           ++    PS  RN +L+NID     +A+R   +K VLGT  NG  + +YG+T   +   L 
Sbjct: 389 AISGVAPSPPRNEILVNIDPIFNYSALRRGEFKYVLGTVGNG--EEWYGETGRPENQGLE 446

Query: 510 -------------------FNAIVESKTYQSLQQLSQNIFLPISN----------IDKMR 540
                                 ++ +K    ++ + ++ +  + +          +D++ 
Sbjct: 447 GASPTYDPETVLMSKAGTAIAGLLTAKQVSEIRAVRKDTYGRVKDDEMATSKLLTVDELL 506

Query: 541 STRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH 600
             R  A++ C    +          PC LFN+  DPCEQ N+A+SR  I + L E L  +
Sbjct: 507 KLRSSASLRCTVPESERVACHPLQSPC-LFNIKEDPCEQRNLAASRAMILATLEEALLKY 565

Query: 601 RRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           R T +P S+  P+  +A+P  +N TW  W
Sbjct: 566 RVTALPPSN-VPNDPKANPAFWNHTWVNW 593



 Score = 99.8 bits (247), Expect = 4e-18,   Method: Compositional matrix adjust.
 Identities = 47/80 (58%), Positives = 61/80 (76%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           MRRN+S A DT G+Y+TDLFT EAV+LIE+ +P   P+FLYLAHLA H+GN  + L+AP 
Sbjct: 194 MRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFLYLAHLAPHSGNDNEPLQAPD 253

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E + +F Y+ DP RR YAA+
Sbjct: 254 EEVAKFSYVEDPERRIYAAM 273


>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
          Length = 543

 Score =  462 bits (1190), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 252/537 (46%), Positives = 336/537 (62%), Gaps = 46/537 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS +IPTPNIDALAY+G+IL N Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 34  GWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVTPICTPSRSALMTGKYPIHTGMQHTVL 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +GAEPRG+PLTE+ LPEYLRELGY+ + +GKWHLG + +EYTPLYRGF+SH GY  G   
Sbjct: 94  FGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHLGSYTKEYTPLYRGFDSHLGYWTGHQD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH   +         G DMRRN+  A+D  G+Y+TD+FT+EAV++IE+     PLFLY
Sbjct: 154 YYDHTAVENPGW-----GFDMRRNMDLAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLY 208

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH+A H+ N    L AP ET+ +F  I    R+ +AAM+ KLDDSVG V+ +L ++ ML
Sbjct: 209 LAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFAAMLSKLDDSVGAVVESLSKRKML 268

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSIIIF +DNG     +       N  SN+P RGVKNT+WEGGV+   +LWSP I+Q  
Sbjct: 269 KNSIIIFSTDNGGAAAGFN-----LNAASNFPLRGVKNTVWEGGVRGAGLLWSPLIKQPQ 323

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV+ Q MHISDWLPTL +A G ++S    NIDGLD W SL  N                 
Sbjct: 324 RVAQQFMHISDWLPTLLSAVGANSSL--KNIDGLDLWESLSQNK---------------- 365

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
              N+P  R+ +L NID+    AA+ + +WKLV GT  NG  DG+YG +  +     + +
Sbjct: 366 ---NSP--RSEILHNIDDIYGNAALTVGAWKLVKGTTYNGAWDGWYGPSGRDYT--YDLS 418

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN-PAPMTPSPCTNGPCYLF 570
            +V S T  +L+ ++ ++ +P    D++   R  + I C  + P P  P   T   C LF
Sbjct: 419 LVVNSPTGVALKSINMSL-VP----DEITKLRNASRIDCATDQPIPCKPLEET---C-LF 469

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           N+  DPCE  N+A   P I   L E L  +  T +P  +   D  + +PK +N+TW+
Sbjct: 470 NVIQDPCELRNLAKENPKIVMNLEETLNKYNNTALPPGNLPLD-PRGNPKYWNNTWT 525



 Score = 82.4 bits (202), Expect = 7e-13,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 54/79 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN+  A+D  G+Y+TD+FT+EAV++IE+     PLFLYLAH+A H+ N    L AP E
Sbjct: 169 MRRNMDLAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDE 228

Query: 61  TINQFQYITDPNRRTYAAL 79
           T+ +F  I    R+ +AA+
Sbjct: 229 TVEKFSNIPSYKRQRFAAM 247


>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
          Length = 528

 Score =  461 bits (1185), Expect = e-127,   Method: Compositional matrix adjust.
 Identities = 243/546 (44%), Positives = 335/546 (61%), Gaps = 44/546 (8%)

Query: 83  TTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
           + + LL V+GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR++LMTGKYPI
Sbjct: 12  SLILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPI 71

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           HTGMQ   + GAEPRG+PL E+ LPEYLRELGY+T  +GKWHLGF+++EYTP YRGF++H
Sbjct: 72  HTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTH 131

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
            G+  G   Y+DH   +         G D+RR +  AWD  G+Y+TD+FTKEAV+LI++ 
Sbjct: 132 IGFWTGHHDYFDHTAVEN-----PYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNH 186

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
              +P+FLYLAH A H+GN    L AP E + +F  I D NRR +A M+ KLD SVG V+
Sbjct: 187 NSSRPMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVV 246

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL+++ ML +SIIIF SDNG P   +       N  SN+P RGVKNTLWEGGV+   ++
Sbjct: 247 DALRKRDMLRDSIIIFSSDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGTGLI 301

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           WSP++ +  RVS QM+HI+DWLPTL TAAGGD+S L                        
Sbjct: 302 WSPRLVRPGRVSRQMLHITDWLPTLITAAGGDSSNL-----------------------T 338

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           +DG+D W++L  +T S R +VL NID+    +A+ +  WKL+ G+   G  DG+YG   S
Sbjct: 339 VDGMDLWNALSEDTESPRANVLHNIDDIYGVSAITVGDWKLIRGSTYKGVWDGWYGP--S 396

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
            +  + + + ++ S T  ++  +   I       D +   R+ A I C      +     
Sbjct: 397 GREWVYDVDGVISSTTAHAIASVGLGI-----TTDVVLMLRENALIKCPLRNDSLPICKP 451

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKR 621
               C LFN+  DPCE NN+    P I  +L  EL K++   L+P +   P   + DP  
Sbjct: 452 LEEAC-LFNIYQDPCEDNNLVKQFPTIVRKLQDELKKFNSSALLPGN--LPWDSKGDPNL 508

Query: 622 FNDTWS 627
           ++ TW+
Sbjct: 509 WDHTWN 514



 Score = 86.3 bits (212), Expect = 5e-14,   Method: Compositional matrix adjust.
 Identities = 39/79 (49%), Positives = 54/79 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +RR +  AWD  G+Y+TD+FTKEAV+LI++    +P+FLYLAH A H+GN    L AP E
Sbjct: 156 IRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSRPMFLYLAHAAVHSGNPYNPLPAPDE 215

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I D NRR +A +
Sbjct: 216 EVAKFNNIFDYNRRRFAGM 234


>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 532

 Score =  455 bits (1170), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 245/537 (45%), Positives = 332/537 (61%), Gaps = 44/537 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR++LMTGKYPIH GMQ   +
Sbjct: 26  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GAEPRG+PL E+ LPEYLR+LGYST  +GKWHLGF+ +EYTP YRGF SH G+  G   
Sbjct: 86  KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+DH   +         G DMRR++  AWD  G+Y+TD+FTKEA++LI+     +PLFLY
Sbjct: 146 YFDHTAVES-----PYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLY 200

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H+GN    L AP E + +F  I D NRR +A ++ KLD SVG V+  L++KGML
Sbjct: 201 LAHAAVHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGML 260

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            +S+IIF +DNG P   +       N  SN+P RGVKNTLWEGGV+   ++WSP++ ++ 
Sbjct: 261 RDSVIIFSTDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSG 315

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q++HI+DWLPTL TAAGGD S   LNIDG+D WS+L         S+         
Sbjct: 316 RVSRQLLHITDWLPTLITAAGGDPSN--LNIDGMDLWSAL---------SD--------- 355

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
              +T S R SVL NID+    +AV +  WKLV G+  NG  DG+YG   S +  + +  
Sbjct: 356 ---DTESPRTSVLHNIDDIYGVSAVTVGDWKLVQGSTYNGAWDGWYGP--SGREWVYDVG 410

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            ++ S+  +++  +   I       + +R  R++A I C      +        PC LFN
Sbjct: 411 GVIGSEAGRAVASVGLGI-----TAETVRMLREKAMIKCPPKNDSLPLCKPLQEPC-LFN 464

Query: 572 LGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +  DPCE NN+    P I  +L  EL K++   + P +   P   + DP  ++ TW+
Sbjct: 465 VYQDPCEDNNLVRQSPTIVRKLQEELKKFNSSAIYPGN--LPWDNKGDPSLWDHTWN 519



 Score = 86.3 bits (212), Expect = 4e-14,   Method: Compositional matrix adjust.
 Identities = 40/79 (50%), Positives = 54/79 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR++  AWD  G+Y+TD+FTKEA++LI+     +PLFLYLAH A H+GN    L AP E
Sbjct: 161 MRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAHAAVHSGNPYNPLPAPDE 220

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I D NRR +A +
Sbjct: 221 EVAKFDNIFDYNRRRFAGI 239


>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
          Length = 519

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 239/541 (44%), Positives = 329/541 (60%), Gaps = 41/541 (7%)

Query: 87  LLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM 146
           ++   GWND+ FHGS +IPTPNIDALAY+G++L+  Y   +CTPSR++LMTGKYPIHTGM
Sbjct: 5   VVFFEGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGM 64

Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
           Q   + GAEPRG+PL E+ LPEYLRELGYST  +GKWHLGF++REYTP YRGF++H GY 
Sbjct: 65  QHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYW 124

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
            G   YYDH   +         G DMRR +  AWD  GEY+TD+FTKEAV+LI +    +
Sbjct: 125 TGHHDYYDHTAVEN-----PYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSR 179

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           P+FLYLAH A H+GN    L AP E + +F  I D NRR +A ++ KLD SVG V+ AL 
Sbjct: 180 PMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALY 239

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           +  ML++S+I+F +DNG P   +       N  SN+P RGVKNTLWEGGV+   ++WSP+
Sbjct: 240 KNNMLQDSVIVFSTDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGAGLIWSPR 294

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           + +  RVS QM+HI+DWLPTL TAAGG+ S L                      ++IDG+
Sbjct: 295 LVRAGRVSRQMIHITDWLPTLITAAGGNASNL----------------------TSIDGV 332

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           + W +L  +T S R ++L NID+    +A+ +  WK + G+  NG  DG+YG   S +  
Sbjct: 333 NLWDALSEDTESPRTNILHNIDDIYGVSAITVGDWKFIQGSTYNGVWDGWYGP--SGREW 390

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
           + + + ++ S   +++  +  ++   I     + S R+ A I C      +        P
Sbjct: 391 IYDADGVISSTAGRAIASVGLSVTTEI-----ISSLRENAMIKCPPRNDSLPICRPLQEP 445

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LFN+  DPCE NN+    P I  +L E LK    T +   +   D  +ADP  ++ TW
Sbjct: 446 C-LFNIYQDPCEDNNLVKQFPTIVRKLQEELKKFNNTAILPGNLPWD-SKADPNLWDHTW 503

Query: 627 S 627
           +
Sbjct: 504 N 504



 Score = 88.2 bits (217), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 53/79 (67%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR +  AWD  GEY+TD+FTKEAV+LI +    +P+FLYLAH A H+GN    L AP E
Sbjct: 145 MRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAHAAVHSGNPYNPLPAPDE 204

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I D NRR +A +
Sbjct: 205 EVAKFNNIFDYNRRRFAGI 223


>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
 gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
          Length = 533

 Score =  454 bits (1168), Expect = e-125,   Method: Compositional matrix adjust.
 Identities = 253/542 (46%), Positives = 339/542 (62%), Gaps = 42/542 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +HGS+EIPTPNIDALAYNGIILN  Y  PVCTPSR++LMTG++PIH GMQ   +
Sbjct: 21  GWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTGRHPIHNGMQHRVL 80

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G E RG+PLTE+ LPEYL++LGYST  +GKWHLGF+++EYTPLYRGFESH G+  G   
Sbjct: 81  FGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRGFESHIGFWTGHQD 140

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH   ++      L G DMR  +  AW   GEY+T ++T+E+V++I++    KPLFLY
Sbjct: 141 YYDHTAEEE-----RLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLY 195

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +AH A H+GN    L AP +T+++  +I + NRR YAAMV +LD SVG +ISAL+   ML
Sbjct: 196 VAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNIISALKDANML 255

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NSI+IF +DNG P        NY N+ SN+P RGVKNTLWEGGV+   ++WS + + N 
Sbjct: 256 SNSIVIFSTDNGGPA----NGLNY-NYASNWPMRGVKNTLWEGGVRGVGVVWSSKFKPNS 310

Query: 392 --RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS +MMHI DWLPTLY AAGGD   L  N+DG + W+SLL           D L   
Sbjct: 311 GGRVSNRMMHICDWLPTLYAAAGGDVRDLGENLDGYNMWNSLL-----------DDL--- 356

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                  PS RN +L NID+    +A+ +  WKLV GT  NG  D ++G T  N     N
Sbjct: 357 -------PSPRNEILHNIDDIFGNSALTMSQWKLVKGTTYNGFYDSWFGPTGRNPGYFYN 409

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN----G 565
              ++ S + ++L  L     +P+ N + +R  R++ATI CG +    + S  +N     
Sbjct: 410 IGQVLSSLSNKALSTLQT---VPL-NPESIRRIREEATIDCGNHQLISSESRFSNCNPLK 465

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
              LFN+  DPCE +N+A S P +  ++ E L Y   T V  +    DL + +PK +   
Sbjct: 466 ESCLFNVDEDPCEIDNVAKSFPQVVKEMEERLLYWNSTAVKPAKVINDL-RGNPKFWGYV 524

Query: 626 WS 627
           W+
Sbjct: 525 WT 526



 Score = 78.2 bits (191), Expect = 1e-11,   Method: Compositional matrix adjust.
 Identities = 35/80 (43%), Positives = 54/80 (67%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR  +  AW   GEY+T ++T+E+V++I++    KPLFLY+AH A H+GN    L AP +
Sbjct: 156 MRHGMKPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLYVAHAAVHSGNKYNPLPAPDK 215

Query: 61  TINQFQYITDPNRRTYAALT 80
           T+++  +I + NRR YAA+ 
Sbjct: 216 TVDKLDHIQNYNRRRYAAMV 235


>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 579

 Score =  451 bits (1161), Expect = e-124,   Method: Compositional matrix adjust.
 Identities = 245/544 (45%), Positives = 329/544 (60%), Gaps = 46/544 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+IPTPNIDAL YNGIILN  Y  P  TPSR +  TG+YPI  GMQG  I
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  + LPE+LR LGY+TK IGKWH+G++  +YTPL+RGF++  G+ N  IS
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S+Q      ++G+DM R    A+    EYATD+FT EA+ +IE+  +++PL+L 
Sbjct: 162 YYDYNYSNQ-----NMSGYDMHRGDDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQ 216

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L+HLA H+      LE P    N  +  +I +PNRR YA MV KLD+SVG +  AL  KG
Sbjct: 217 LSHLAVHS-----PLEQPANVYNDREPIHIREPNRRKYAKMVSKLDESVGRIFHALGEKG 271

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML++S+I+F++DNGA +V       YRN+GSNYP RG K TL+EGGV+  A LWS ++Q+
Sbjct: 272 MLKDSLILFLTDNGAASV-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLQR 326

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS +++HI+DWLPTLY+AAGGD   L                        IDG+DQW
Sbjct: 327 AARVSNELVHITDWLPTLYSAAGGDLKDL----------------------GEIDGIDQW 364

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L       R+ +L+NIDE  +T       +KL+ GT E+G  DGYYG +      L  
Sbjct: 365 RLLSEGRGRGRDKLLLNIDEVLKTEGAIYSRFKLLRGTIESGYYDGYYGDSGKTLETLPT 424

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
           +  IV   T    + L+ ++  P++    M   RQ AT+ C AN         +PC    
Sbjct: 425 YTDIVMKSTVS--ESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLTPCNVTE 482

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++ NDPCE  NIA +   I+  L   L+ + + LV Q+ +  D   ADP+  N TW
Sbjct: 483 C-LFDIVNDPCETRNIADTYVRIARDLNYYLEQYSQILVKQTKKPIDW-SADPRNKNGTW 540

Query: 627 SPWI 630
            PW+
Sbjct: 541 GPWL 544



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 48/82 (58%), Gaps = 7/82 (8%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R    A+    EYATD+FT EA+ +IE+  +++PL+L L+HLA H+      LE P  
Sbjct: 177 MHRGDDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQLSHLAVHS-----PLEQPAN 231

Query: 61  TINQFQ--YITDPNRRTYAALT 80
             N  +  +I +PNRR YA + 
Sbjct: 232 VYNDREPIHIREPNRRKYAKMV 253


>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
          Length = 544

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 246/555 (44%), Positives = 335/555 (60%), Gaps = 44/555 (7%)

Query: 73  RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           R   AA        L    GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR
Sbjct: 18  RHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSR 77

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           ++LMTGKYPIHTGMQ   +  AEPRG+PL E+ LPEYL+ELGY T  +GKWHLGF+ ++Y
Sbjct: 78  SALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIVGKWHLGFYTKQY 137

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           TP YRGF+SH G+ +G   Y+DH   +         G DMRR +  AWD  G+Y+TD+FT
Sbjct: 138 TPTYRGFDSHIGFWSGHQDYFDHTAVES-----PYWGLDMRRGMEAAWDLHGQYSTDVFT 192

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
            EAV+LI +    KPLFLYLAH A H+GN    L AP   + +F  I D NRR +AAM+ 
Sbjct: 193 SEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKFTNIFDYNRRRFAAMLS 252

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           KLD+SVG V+ ALQRKGML+NSII+F +DNG P   +       N  SN+P RG KNTLW
Sbjct: 253 KLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLW 307

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGGV+   ++WSP++ +  RVS Q+ HISDWLPTL TAAGGD S   L+IDGL       
Sbjct: 308 EGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPSN--LSIDGL------- 358

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
                         D W +L  +T S R SVL NID+    + + +  WK + G+   G 
Sbjct: 359 --------------DLWHALSEDTESPRQSVLHNIDDIFGVSGITIGEWKYIQGSTYEGQ 404

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
            DG+YG   S +  + +   ++ S T +++  +  ++     + + +R  R+QA I C  
Sbjct: 405 WDGWYGP--SGREWVYDIGGVIGSPTGRAVASVGYSL-----SSENVRRLREQAMIKCRP 457

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQ 611
               +        PC LF++  DPCE NN+ +  P+I+  + E L K++   L+P +   
Sbjct: 458 RNDTLPLCKPLIAPC-LFHVLKDPCEDNNLINEYPEIARAMREDLKKWNATALLPGN--L 514

Query: 612 PDLVQADPKRFNDTW 626
           P   +A+P+ ++ TW
Sbjct: 515 PWDKRANPELWDHTW 529



 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 52/79 (65%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR +  AWD  G+Y+TD+FT EAV+LI +    KPLFLYLAH A H+GN    L AP  
Sbjct: 172 MRRGMEAAWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDV 231

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I D NRR +AA+
Sbjct: 232 DVAKFTNIFDYNRRRFAAM 250


>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
          Length = 575

 Score =  449 bits (1155), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 247/548 (45%), Positives = 337/548 (61%), Gaps = 56/548 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDAL YNGIILN  Y  P  TPSR++ +TG YPI  GMQG  I
Sbjct: 38  GWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLYPIRIGMQGDGI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  + LPE+LR+LGY+TK IGKWH+G+   +YTPL+RGF+   G+ N  IS
Sbjct: 98  RGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFDFFLGFYNSHIS 157

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S+Q      ++G+D+ R    A     EYATDLFTKEAV++IE   + +PL+L 
Sbjct: 158 YYDYHYSNQ-----NMSGYDLHRGDDPAHGINREYATDLFTKEAVRMIETHELPRPLYLQ 212

Query: 272 LAHLAAHAGNAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           ++HLA HA      LE P++  N  +F YI +PNRR YA MV +LDDSVG ++ AL  KG
Sbjct: 213 ISHLAVHAP-----LEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDSVGRIVHALGEKG 267

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML +S+I+F++DNGA +V       YRNWGSNYP RG K TL+EGGV+  A LWSP++Q+
Sbjct: 268 MLRDSLILFLTDNGAASV-----GRYRNWGSNYPLRGTKYTLYEGGVRSVAALWSPRLQK 322

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS Q++HI+DWLPTLY+AAGGD   L                        IDG+D W
Sbjct: 323 VARVSNQLIHITDWLPTLYSAAGGDLKDL----------------------GQIDGIDLW 360

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
             L       R+ +L+NIDE ++T       +KL+ G+ ENG  DG+YG + R+ ++P  
Sbjct: 361 RVLSEGYGRGRDKLLLNIDEIQKTEGAIYSRFKLLRGSVENGYYDGFYGDSGRTLEIPPY 420

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP------SPC 562
             + ++ S   QS   ++ ++  P++    M   R+ AT+ C      MT       + C
Sbjct: 421 T-DVVLRSSVSQS---ITYHLGGPVTQPSTMIQLRRDATVQC----QNMTYHYRHAFTTC 472

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
               C LF++ NDPCE  N+A + P I+  L   L+++   L+ Q     D + ADP+R 
Sbjct: 473 NVTEC-LFDIVNDPCETKNVADAYPRIARDLDLYLEHYGHILMRQIKHPVDWL-ADPRRR 530

Query: 623 NDTWSPWI 630
           N+TW PW+
Sbjct: 531 NNTWEPWM 538



 Score = 67.8 bits (164), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 35/69 (50%), Positives = 47/69 (68%), Gaps = 7/69 (10%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN--QFQYITDP 71
           EYATDLFTKEAV++IE   + +PL+L ++HLA HA      LE P++  N  +F YI +P
Sbjct: 186 EYATDLFTKEAVRMIETHELPRPLYLQISHLAVHAP-----LEQPRDEYNDGRFSYIREP 240

Query: 72  NRRTYAALT 80
           NRR YA + 
Sbjct: 241 NRRKYARMV 249


>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
          Length = 545

 Score =  448 bits (1153), Expect = e-123,   Method: Compositional matrix adjust.
 Identities = 237/551 (43%), Positives = 333/551 (60%), Gaps = 42/551 (7%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA+       L    GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR++LM
Sbjct: 23  AAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALM 82

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGK+PIHTGMQ   +  AEPRG+PL E+ LP+YLRELGYST  +GKWHLGF+ +EYTP+Y
Sbjct: 83  TGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMY 142

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGF+SH G+ +G   Y+DH   +         G DMRR L++AWD  G+Y+TD+FTKEAV
Sbjct: 143 RGFDSHIGFWSGHHDYFDHSAVES-----PYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
           +LI D    +P+FLYL+H A H+GN+   L AP + + +F  I +  RR +A M+ KLD+
Sbjct: 198 KLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAGMLSKLDE 257

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           SVG V+ AL++KGML+NS+I+F +DNG P   +       N  SN+P RG KNTLWEGGV
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLWEGGV 312

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
           +   +LWSP++ +  RVS Q+ HISDWLPTL TAAGG+ S L                  
Sbjct: 313 RGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNL------------------ 354

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
                +IDG+D W +L  +T S R  VL NID+    A +    WK + G+  NG  DG+
Sbjct: 355 -----SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGW 409

Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
           YG   S +  + +   ++ S   +++  +   +     + D +R  R+ A I C      
Sbjct: 410 YGP--SGREWVYDIGGVIGSTAGRAVASVGLAL-----SADAIRRLREDAMIKCPPKNDS 462

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
           +        PC LFN+  DPCE NN+ +  P + ++L + L+    T +   +   D  +
Sbjct: 463 LPLCKPLEAPC-LFNVQQDPCEDNNLINKYPSLVTELQDKLRKLNATAILPGNLPWD-NK 520

Query: 617 ADPKRFNDTWS 627
           A+P  ++ TW+
Sbjct: 521 ANPDYWDHTWT 531



 Score = 85.9 bits (211), Expect = 7e-14,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 55/79 (69%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR L++AWD  G+Y+TD+FTKEAV+LI D    +P+FLYL+H A H+GN+   L AP +
Sbjct: 173 MRRGLNSAWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQ 232

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I +  RR +A +
Sbjct: 233 DVAKFTNIFNYERRRFAGM 251


>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
          Length = 545

 Score =  446 bits (1146), Expect = e-122,   Method: Compositional matrix adjust.
 Identities = 236/551 (42%), Positives = 331/551 (60%), Gaps = 42/551 (7%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA+       L    GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR++LM
Sbjct: 23  AAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALM 82

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGK+PIHTGMQ   +  AEPRG+PL E+ LP+YLRELGYST  +GKWHLGF+ +EYTP+Y
Sbjct: 83  TGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMY 142

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGF+SH G+ +G   Y+DH   +         G DMRR L++AWD  G+Y+TD+FTKEAV
Sbjct: 143 RGFDSHIGFWSGHHDYFDHSAVES-----PYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
           +LI D    +P+FLYL H A H+GN+   L  P + + +F  I +  RR +A M+ KLD+
Sbjct: 198 KLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAGMLSKLDE 257

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           SVG V+ AL++KGML+NS+I+F +DNG P   +       N  SN+P RG KNTLWEGGV
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLWEGGV 312

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
           +   +LWSP++ +  RVS Q+ HISDWLPTL TAAGG+ S L                  
Sbjct: 313 RGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNL------------------ 354

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
                +IDG+D W +L  +T S R  VL NID+    A +    WK + G+  NG  DG+
Sbjct: 355 -----SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGW 409

Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
           YG   S +  + +   ++ S   +++  +   +     + D +R  R+ A I C      
Sbjct: 410 YGP--SGREWVYDIGGVIGSTAGRAVASVGLTL-----SADAIRRLREDAMIKCPPKNDS 462

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
           +        PC LFN+  DPCE NN+ +  P + ++L + L+    T +   +   D  +
Sbjct: 463 LPLCKPLEAPC-LFNVQQDPCEDNNLINKYPSLVTELRDRLRKLNATAILPGNLPWD-NR 520

Query: 617 ADPKRFNDTWS 627
           A+P  ++ TW+
Sbjct: 521 ANPDYWDHTWT 531



 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 37/79 (46%), Positives = 53/79 (67%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR L++AWD  G+Y+TD+FTKEAV+LI D    +P+FLYL H A H+GN+   L  P +
Sbjct: 173 MRRGLNSAWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQ 232

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I +  RR +A +
Sbjct: 233 DVAKFTNIFNYERRRFAGM 251


>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
          Length = 581

 Score =  442 bits (1137), Expect = e-121,   Method: Compositional matrix adjust.
 Identities = 251/544 (46%), Positives = 333/544 (61%), Gaps = 44/544 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+IPTPNIDAL YNGIILN  Y  P  TPSR +  TG+YPI  GMQG  I
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  + LPE+LR LGY+TK IGKWH+G++  +YTPL+RGF++  G+ N  IS
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S+Q      ++G+DM R    A+    EYATD+FT+EA+ +IE+  +++PL+L 
Sbjct: 162 YYDYSYSNQ-----NMSGYDMHRGDDPAYGMNREYATDMFTREAINIIENHELNRPLYLQ 216

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L+HLA HA      LE P    N  +  +I +PNRR YA MV KLD+SVG +  AL  KG
Sbjct: 217 LSHLAVHA-----PLEQPMNVYNDREPIHIREPNRRKYAKMVSKLDESVGRIFHALGEKG 271

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML++S+I+F++DNGA +V       YRN+GSNYP RG K TL+EGGV+  A LWS ++Q+
Sbjct: 272 MLKDSLILFLTDNGAASV-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLQK 326

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS +++HI+DWLPTLY+AAGGD   L   IDG+DQW   LL+  S R          
Sbjct: 327 AARVSNELVHITDWLPTLYSAAGGDLKDLG-KIDGIDQWR--LLSEGSGRGRG------- 376

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                     R+ +L+NIDE  +T       +KL+ GT ENG  DGYYG +      L  
Sbjct: 377 ----------RDKLLLNIDEVLKTEGAIYSRFKLLRGTNENGYYDGYYGDSGKTLETLPT 426

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
           +  IV   T    + L+ ++  P++    M   RQ AT+ C AN         +PC    
Sbjct: 427 YTDIVMKSTVS--ESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLTPCNVTE 484

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++ NDPCE  NIA +   I+  L   L+ +   LV Q  +  D   ADP+  NDTW
Sbjct: 485 C-LFDIVNDPCETRNIAETYVRIARDLNYFLEEYSHILVKQMKKPIDW-SADPRNRNDTW 542

Query: 627 SPWI 630
            PW+
Sbjct: 543 GPWV 546



 Score = 63.9 bits (154), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 7/82 (8%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R    A+    EYATD+FT+EA+ +IE+  +++PL+L L+HLA HA      LE P  
Sbjct: 177 MHRGDDPAYGMNREYATDMFTREAINIIENHELNRPLYLQLSHLAVHA-----PLEQPMN 231

Query: 61  TINQFQ--YITDPNRRTYAALT 80
             N  +  +I +PNRR YA + 
Sbjct: 232 VYNDREPIHIREPNRRKYAKMV 253


>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 570

 Score =  440 bits (1132), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 245/540 (45%), Positives = 328/540 (60%), Gaps = 38/540 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+I TPNIDALAYNGIILN+ Y   +CTPSRASLMTGKYP   GMQ   I
Sbjct: 58  GWNDVSFHGSNQIQTPNIDALAYNGIILNSHYVPALCTPSRASLMTGKYPTSLGMQHLVI 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL E  +PEY  + GY+T A+GKWHLGFF++EYTP+YRGF+SHFG+ NG   
Sbjct: 118 LSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQD 177

Query: 212 YYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLF 269
           YYDH  +SD       L G+DMRRN    +   G Y TD+FTKEA+++I++    K PLF
Sbjct: 178 YYDHTTMSDS------LKGYDMRRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLF 231

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL+HLA H+GN     +AP++ I++ + I DP R+ YAAMV KLD+SVG V+SAL++  
Sbjct: 232 LYLSHLAPHSGNPDNPFQAPEDEISKHECINDPGRKIYAAMVTKLDESVGQVVSALEKNK 291

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML NSIIIFMSDNGA T        + N GSNYP RG+K + WEGGV+  A +WSP + +
Sbjct: 292 MLNNSIIIFMSDNGAATYGL-----HSNRGSNYPLRGLKESPWEGGVRGTAAIWSPFLNK 346

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS Q+MH+SDWLPTL TAA            GL+  S+ L+N        IDG+D W
Sbjct: 347 TKRVSKQLMHMSDWLPTLLTAA------------GLNYSSTQLINK-------IDGIDMW 387

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           + L  + PS R  V  N DE +  +++ +DSWK V GT + G  D ++ +   N      
Sbjct: 388 NVLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVEGTAQEGKADYWFEEPSRNNCSEYR 447

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
               V ++    L++ S  I    +    +  TR   T         +T  P     C L
Sbjct: 448 ----VSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVKNKTKYVLTCDPLLKRFC-L 502

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FNL +DPCE+ N+A   PD+  ++   L   ++++V +   +P+   ++P  +N TW  W
Sbjct: 503 FNLNDDPCERLNLADVFPDVVKRIKNRLLELKKSVV-KPLNKPEDPYSNPMFYNGTWVNW 561



 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MRRN    +   G Y TD+FTKEA+++I++    K PLFLYL+HLA H+GN     +AP+
Sbjct: 193 MRRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPE 252

Query: 60  ETINQFQYITDPNRRTYAALT 80
           + I++ + I DP R+ YAA+ 
Sbjct: 253 DEISKHECINDPGRKIYAAMV 273


>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 593

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 237/548 (43%), Positives = 331/548 (60%), Gaps = 43/548 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+IPTPNIDAL YNGIILN  Y  P  TPSRA+  TG YPI  GMQG  I
Sbjct: 42  GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  R LPEYLR LGY+TK IGKWH+G+   ++TPL+RGF++  G+ N  +S
Sbjct: 102 QGGEPRGLPLNIRILPEYLRGLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVS 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S Q      ++G+DM R  + A+    EYATDLFT EA+++I+     +PL+L 
Sbjct: 162 YYDYRYSYQ-----NMSGYDMHRGDNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQ 216

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           ++HLA HA       +       +F++I + NRR YA+MV +LD+SVG ++  L  +GML
Sbjct: 217 ISHLAVHAPIESPDDDHRNSNRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGML 276

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            +S+I+F++DNGAPT+       +RNWGSN P RG+K TL+EGGV+  A+LWSP++++  
Sbjct: 277 RDSLILFLTDNGAPTI-----GKFRNWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAA 331

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV  +++HI+DWLPT Y+AAGGD   L                        IDG++QW  
Sbjct: 332 RVCNELVHITDWLPTFYSAAGGDLKDL----------------------GEIDGVNQWRM 369

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF- 510
           L    P  R S+L+NIDE  RT      S+KL+ G+ E G  DGY+     + +  +   
Sbjct: 370 LNDGEPVARQSLLLNIDEVSRTEGAIYKSFKLLRGSIEGGHYDGYHTIREPSWIEHVEHK 429

Query: 511 -NAIVESKTY-----QSL--QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
            N   E  +Y     +SL  Q ++ ++  P++    +   RQ+AT+HC +N +    + C
Sbjct: 430 KNEPTEVPSYTETVLKSLVSQSITYHLGGPVTQPSTINQLRQEATVHCRSNGSYNRFTTC 489

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
               C LF++ NDPCE  NIA   P I+ +L   L+ + R +  Q     D + ADP+R 
Sbjct: 490 NVTEC-LFDINNDPCETKNIAQQYPRIARELDLFLEKYGRNVTRQMKVPVDWL-ADPRRT 547

Query: 623 NDTWSPWI 630
           N TW PW+
Sbjct: 548 NGTWEPWM 555



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 46/80 (57%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R  + A+    EYATDLFT EA+++I+     +PL+L ++HLA HA       +    
Sbjct: 177 MHRGDNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHAPIESPDDDHRNS 236

Query: 61  TINQFQYITDPNRRTYAALT 80
              +F++I + NRR YA++ 
Sbjct: 237 NRERFKHIPEVNRRNYASMV 256


>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
          Length = 543

 Score =  440 bits (1131), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 233/535 (43%), Positives = 322/535 (60%), Gaps = 42/535 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS +IPTPNIDALAY+G++L+  Y  P+CTPSR++LMTGK+PIHTGMQ   +
Sbjct: 36  GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRG+PL E+ LPEY R+LGYST  +GKWHLGF+ +EYTP+YRGF+SH G+ +G   
Sbjct: 96  KCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+DH   ++        G DMRR L  AWD  G+Y+TD+FTKEAV+LI++    +P+FLY
Sbjct: 156 YFDHSAVEE-----PYWGLDMRRGLEPAWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLY 210

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H+GN    L A  + + +F  I + NRR +A ++ KLD+SVG V+ AL+R+ ML
Sbjct: 211 LAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRRRFAGILSKLDESVGLVVEALRRRKML 270

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ENS+I+F +DNG     +       N  SN+P RG KNTLWEGGV+   +LWSP++ +  
Sbjct: 271 ENSVIVFSTDNGGAPAGFN-----LNAASNWPLRGTKNTLWEGGVRGVGLLWSPKLIKPG 325

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV  Q++HISDWLPTL TAAGGD S L                        IDG+D W +
Sbjct: 326 RVGRQLIHISDWLPTLLTAAGGDPSNL-----------------------TIDGMDIWHA 362

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  +T S R  VL NID+    A +    WK + G+  NG  DG+YG   S +  + +  
Sbjct: 363 LSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQGSTYNGQWDGWYGP--SGREWVYDVG 420

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            ++ S   +++  +  ++     + D +R  R+ A I C      +        PC LFN
Sbjct: 421 GVIGSAAGRAVASVGLSL-----SADNIRKLREDAMIKCPPRNDNLPLCKPLEAPC-LFN 474

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           +  DPCE NN+    P + S L E L+    + V   +   D  +A+P  ++ TW
Sbjct: 475 VRQDPCEDNNLIEKYPSVVSDLKERLRVLNASAVLPGNLPWD-KRANPDYWDHTW 528



 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 53/79 (67%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR L  AWD  G+Y+TD+FTKEAV+LI++    +P+FLYLAH A H+GN    L A  +
Sbjct: 171 MRRGLEPAWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQ 230

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I + NRR +A +
Sbjct: 231 DVAKFTNIFNYNRRRFAGI 249


>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
          Length = 581

 Score =  438 bits (1127), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 245/547 (44%), Positives = 331/547 (60%), Gaps = 45/547 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG+NEIPTPNIDALAYNG+ILN  Y  P+CTPSR++LMTG+YPI  GMQG P+
Sbjct: 46  GWNDVSFHGANEIPTPNIDALAYNGVILNKYYTMPICTPSRSALMTGRYPIRDGMQGTPM 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRG+PL    +PE +R LGY T+ +GKWHLG+   +YTP+ RGF++ FGY NG IS
Sbjct: 106 RPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGYYNGFIS 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDKPLF 269
           YYD+ +   ++ T E+ G+D+ R+ S +++     EY TDL T EA ++I +    KPLF
Sbjct: 166 YYDYWIG--WNDTNEVTGYDLHRDESDSFELAHSSEYFTDLITDEAEKIIRNNKNAKPLF 223

Query: 270 LYLAHLAAHAGNAGKHLEAPQET-----IN-QFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
           L ++HLA HAG+  K  + P E      +N  F YI D   R YA M+  LD+SVG V+ 
Sbjct: 224 LEISHLAVHAGS--KVHDDPLEVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVGRVVK 281

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           AL+   MLENSIIIFMSDNGAPTV       Y N GSNYP RG+K  ++EG  +  A ++
Sbjct: 282 ALKEAEMLENSIIIFMSDNGAPTVGL-----YNNTGSNYPMRGIKGGMFEGAARAAACIF 336

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
           SP I+ + RVS ++MHI DWLPTLYTAAGG+   L    DG     +L L+         
Sbjct: 337 SPLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQSQFDG-----ALPLD--------- 382

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
            G+ QWSS++   PS R S+L+NIDE +   A  +   KLV G  +    DGYYG + ++
Sbjct: 383 -GVSQWSSIVAGGPSSRQSLLVNIDEAQGFEAAIIGRHKLVKGMTKE---DGYYGNSGND 438

Query: 504 -KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
              P  N   ++ S    S+ +L+        +  +    RQ++ I C     P T +  
Sbjct: 439 PSFPAYNVKKVLSSTAGASIGKLAG---FASPSARRALWLRQKSVITC----KPFTSAAN 491

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
            +G C LF+L  DPCE  +++S  P I  +L   L  +RR L+PQ++   D     PK F
Sbjct: 492 CSGTC-LFDLSKDPCETRDLSSKLPLIVKKLESFLGEYRRVLMPQTNSPQDACGL-PKYF 549

Query: 623 NDTWSPW 629
           N  + PW
Sbjct: 550 NGVYMPW 556



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 8/75 (10%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-----IN-QFQY 67
           EY TDL T EA ++I +    KPLFL ++HLA HAG+  K  + P E      +N  F Y
Sbjct: 199 EYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGS--KVHDDPLEVRRTDDVNASFPY 256

Query: 68  ITDPNRRTYAALTKS 82
           I D   R YA +  +
Sbjct: 257 IEDYQHRKYAGMMAA 271


>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
          Length = 563

 Score =  438 bits (1126), Expect = e-120,   Method: Compositional matrix adjust.
 Identities = 233/529 (44%), Positives = 322/529 (60%), Gaps = 43/529 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNID +A++G+ L+N Y  P+CTPSRA+LMTGKYPIHTGMQ   I
Sbjct: 60  GWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKYPIHTGMQHTVI 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PLTE+ LP+YL+ELGY T  +GKWHLG +++EY PL RGF+SH G+ NG I 
Sbjct: 120 FAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFDSHLGFWNGKID 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            YDH   ++        G D RR+ STA D  G+YATD++T EAV++I+      PLFL 
Sbjct: 180 MYDHTNQEK-----GYWGFDFRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLM 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+H A H GN  + + AP++    F +I D  RR +AA++ KLD+SVG V++AL+ KG+L
Sbjct: 235 LSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQRRKFAAVLTKLDESVGEVVAALKAKGVL 294

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            +SI++F +DNG     + +     N  SNYP RGVKNTLWEGGV+    LWSP I +  
Sbjct: 295 NDSIVVFTTDNGGAAAGFND-----NAASNYPLRGVKNTLWEGGVRGAGWLWSPFIDKRS 349

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV+ Q MH+ DWLPTL +AAG + S +  +IDG+DQW                      +
Sbjct: 350 RVATQRMHLVDWLPTLLSAAGMNVSSIK-HIDGVDQW---------------------CA 387

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  + PS R S++ NID++  +A++    WK+  GT   G+ DG+YG   + +    +  
Sbjct: 388 LSQDLPSARESLVHNIDDESGSASITYKQWKVHKGTNYGGSWDGWYGP--AGREGAYDTT 445

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            ++ SK   +L  +     LP  + + +   R +AT+ CG   A   P      PC LFN
Sbjct: 446 RLLASKAAGALLDIG---MLP--DTEHILRLRSEATVECGDREA--LPCRPLEAPC-LFN 497

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
           +  DPCE  N+A   PD+   + + L    RT VP ++ QP     DPK
Sbjct: 498 IDEDPCETRNLADIHPDVLQVMLKELDRVNRTAVPPNN-QPLTPGGDPK 545



 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 52/82 (63%), Gaps = 1/82 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+ STA D  G+YATD++T EAV++I+      PLFL L+H A H GN  + + AP++
Sbjct: 195 FRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLMLSHSAVHTGNPSEPIRAPEK 254

Query: 61  TINQFQYITDPNRRTYAA-LTK 81
               F +I D  RR +AA LTK
Sbjct: 255 LFVNFTHIQDFQRRKFAAVLTK 276


>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
 gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
          Length = 585

 Score =  436 bits (1122), Expect = e-119,   Method: Compositional matrix adjust.
 Identities = 241/541 (44%), Positives = 321/541 (59%), Gaps = 43/541 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS +IPTPNIDALAY+GIILN  YA P+CTPSRASLMTGK+PI+ GMQ   I
Sbjct: 42  GWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +PEY RE GY T+ +GKWHLGFFR+ YTP  RGF+SH+GY+   I 
Sbjct: 102 EVDEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGYIGPYID 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED--QPVDKPLF 269
           Y+DH L  Q S T    G DMRRNL   +   G YATDLFT+EAV+LI D  Q    PLF
Sbjct: 162 YWDHSL--QMSNT-STRGLDMRRNLQVDYSARGTYATDLFTREAVRLIHDHNQTSANPLF 218

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L + HLA H GN    ++AP+E +  F +I DP RR  AAM+ KLDD VG ++ AL++ G
Sbjct: 219 LVVTHLAPHTGNEDDPMQAPEEDVELFSFIKDPKRRVLAAMISKLDDGVGQIVQALKKSG 278

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML N++++F +DNGAPT+       + N GSN+P RG K + WEG V+  A +WSP +  
Sbjct: 279 MLNNTVVLFYADNGAPTI-----GKHSNGGSNFPLRGQKYSPWEGAVRTAAAIWSPLLNN 333

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS Q +H+SDWLPTL  AAG + +     IDG DQW +L        N N+ G    
Sbjct: 334 TGRVSNQWIHVSDWLPTLARAAGIEANFSGSEIDGQDQWEAL-------NNPNLTG---- 382

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                     R +V+ NID+ +  ++   + WKLV GT   G  D + G           
Sbjct: 383 ----------RTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEGAFDDWMGSLDEED----- 427

Query: 510 FNAIVESKTYQSLQQLSQNI-FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
                 S+   S++ +S  +  L   ++D +   R+ AT+HC   P           PC 
Sbjct: 428 ----ELSEEEYSVRLISSVVGRLTKVDLDDVARIRRDATVHCPYIPGGTIDCEPQKSPC- 482

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+  DPCE NNIA+ RPD+  QL + ++ +R +   +   QP    +DP+ +N TW+ 
Sbjct: 483 LFNILEDPCETNNIANQRPDLLKQLGQDVERYRLSAT-EPRNQPADPYSDPRYYNRTWTW 541

Query: 629 W 629
           W
Sbjct: 542 W 542



 Score = 83.2 bits (204), Expect = 4e-13,   Method: Compositional matrix adjust.
 Identities = 41/81 (50%), Positives = 51/81 (62%), Gaps = 2/81 (2%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED--QPVDKPLFLYLAHLAAHAGNAGKHLEAP 58
           MRRNL   +   G YATDLFT+EAV+LI D  Q    PLFL + HLA H GN    ++AP
Sbjct: 179 MRRNLQVDYSARGTYATDLFTREAVRLIHDHNQTSANPLFLVVTHLAPHTGNEDDPMQAP 238

Query: 59  QETINQFQYITDPNRRTYAAL 79
           +E +  F +I DP RR  AA+
Sbjct: 239 EEDVELFSFIKDPKRRVLAAM 259


>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
          Length = 546

 Score =  433 bits (1114), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 235/536 (43%), Positives = 319/536 (59%), Gaps = 42/536 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHG ++IPTPNIDALAY G++L+  Y  P+CTPSR++LMTGK+PIHTGMQ   +
Sbjct: 39  GWNDVGFHGLSQIPTPNIDALAYTGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRG+PL E+ LPEYLR LGYST  +GKWHLGF+ +EYTP YRGF+SH G+ +G   
Sbjct: 99  KCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKWHLGFYTKEYTPTYRGFDSHLGFWSGHHD 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+DH   ++        G DMRR L  AWD  G+Y+TD+FTKEAV+LI++    +P+FLY
Sbjct: 159 YFDHTAVEE-----PYWGLDMRRGLEPAWDLHGQYSTDVFTKEAVRLIDNHNTSRPMFLY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+H A H+GN    L A    + +F  I D NRR +A M+ KLD+SVG V+ AL+++ ML
Sbjct: 214 LSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRRRFAGMLSKLDESVGLVVEALRKRKML 273

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ENS+I+F +DNG P   +       N  SN+P RG KNTLWEGGV+   ++WSP++    
Sbjct: 274 ENSVIVFSTDNGGPPAGFN-----LNAASNFPLRGTKNTLWEGGVRGTGLVWSPKLINPG 328

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R+S Q++HISDWLPTL TA GGD S L                        IDG+D W +
Sbjct: 329 RISRQLIHISDWLPTLLTAVGGDPSNLA-----------------------IDGVDMWRA 365

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  +T S R  VL NID+    A +    WK + G+  NG  D +YG   S +    +  
Sbjct: 366 LNEDTISPRKMVLHNIDDIYGIAGITYGDWKFIQGSTYNGQWDDWYGP--SGRDWAYDPY 423

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
           A++ES   ++    S  + L   NI+K+   +Q A I C      +        PC LFN
Sbjct: 424 AVMESAAGRA--AASVGLSLTAGNIEKL---QQGAAIKCSTRNENLPLCKPLEAPC-LFN 477

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +  DPCE+NN+    P I S L E L     + +   +   D  + +P  ++ TW+
Sbjct: 478 VREDPCEKNNLIEKYPSIVSDLKEKLSLLNASAILPGNLPWD-KRGNPDYWDHTWT 532



 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 38/79 (48%), Positives = 52/79 (65%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR L  AWD  G+Y+TD+FTKEAV+LI++    +P+FLYL+H A H+GN    L A   
Sbjct: 174 MRRGLEPAWDLHGQYSTDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDH 233

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F  I D NRR +A +
Sbjct: 234 DVAKFPKILDYNRRRFAGM 252


>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
          Length = 557

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 239/560 (42%), Positives = 328/560 (58%), Gaps = 63/560 (11%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GWND+SFHGS +IPTPN+DALA++G+IL N Y  P+CTPSR++LMTGK+PIHTGMQ   
Sbjct: 36  FGWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDV 95

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           ++G    G+PL+E  LPEYL++LGY    +GKWHLG ++  YTPL+RGF+SH+GY  G  
Sbjct: 96  LYGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQ 155

Query: 211 SYYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            YYDH         VE N  G+DMRRN S  W   G+Y T L T EA  +I    V  PL
Sbjct: 156 DYYDHT-------AVEWNAWGYDMRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVSSPL 208

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY+AHLA H+ N    L+AP+ET+  F  I +  RR YAAMV +LD SVG ++ +L   
Sbjct: 209 FLYVAHLAVHSANPYSPLQAPEETVEMFSSIENLQRRRYAAMVHELDVSVGKIVKSLGDN 268

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            MLEN++I+F +DNG P   + +     N  SN+P +GVKNT WEGGV+   ++WSP I 
Sbjct: 269 NMLENTVIVFSTDNGGPAEGFNQ-----NAASNWPLKGVKNTPWEGGVRAAGLIWSPLIP 323

Query: 389 QNPR--VSLQMMHISDWLPTLYTAAG-----GDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           ++ R  V   +M ISDWLPTL+ AAG     GD + L   +DG+  W             
Sbjct: 324 KSRRGQVMSNLMDISDWLPTLFEAAGQSRICGDVATLK-GLDGVSHW------------- 369

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT- 500
                   +++L + PS RN VL NID++   AA+R ++WKLV GT   G+ DG+YG + 
Sbjct: 370 --------NTVLYDKPSARNHVLHNIDDQLGYAAIRKENWKLVKGTTYEGSWDGWYGPSG 421

Query: 501 RSNK-----------VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
           R+N+           V L+  NA+++S   Q +  L       +S    +      A I 
Sbjct: 422 RTNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLG------MSATQNLYQQLLDAEIS 475

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           CG  P    P    +  C LF++  DPCE NN+A   P I+  L  LL ++  T V   +
Sbjct: 476 CGERPVDAHPCLPLDALC-LFDIEQDPCEYNNLAEKMPHITEDLLSLLSWYNSTAVAPLN 534

Query: 610 EQPDLVQADPKRFNDTWSPW 629
             PD + ++PK +N T++ W
Sbjct: 535 TSPDPM-SNPKYWNYTYTNW 553



 Score = 80.1 bits (196), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 38/87 (43%), Positives = 50/87 (57%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN S  W   G+Y T L T EA  +I    V  PLFLY+AHLA H+ N    L+AP+E
Sbjct: 172 MRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHLAVHSANPYSPLQAPEE 231

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTL 87
           T+  F  I +  RR YAA+     +++
Sbjct: 232 TVEMFSSIENLQRRRYAAMVHELDVSV 258


>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
          Length = 564

 Score =  432 bits (1111), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 239/544 (43%), Positives = 329/544 (60%), Gaps = 48/544 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAYNGIILN  Y  P  TPSR +  TG YPI  GMQG  I
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  + LPE+LR LGY TK IGKWH+GF   +YTPL+RGF++ FG+ N  I+
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHIT 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S+Q      + G+DM      A+    EYATDLFT EA+++IE+  + +PL+L 
Sbjct: 162 YYDYEYSNQ-----NMTGYDMHCGDDPAYGMKREYATDLFTNEAIKIIENHELPRPLYLQ 216

Query: 272 LAHLAAHAGNAGKHLEAPQETI-NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           ++HLA HA      +E P ++  ++   I +PNRR YA MV KLD+SVG V+ AL  KGM
Sbjct: 217 ISHLAVHAP-----IEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVGRVVHALGEKGM 271

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L +S+I+F++DNGA ++       YRN+GSNYP RG K TL+EGGV+  A LWS ++++ 
Sbjct: 272 LRDSLILFLTDNGAASI-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKG 326

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RV  +++HI+DWLPTLY+AAGGD   L                        IDG+DQW 
Sbjct: 327 ARVFKKLIHITDWLPTLYSAAGGDLKDL----------------------GKIDGIDQWR 364

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLN 509
            L       R  +L+NIDE   T       +KL+ G+  NG  D YYG + R+ + P   
Sbjct: 365 VLSEGQGHGREKLLLNIDEVMITEGAIYSRFKLLRGSVGNGYYDKYYGDSGRTLETPPYT 424

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
              +++S   QS   ++ ++  P++    M   R++AT+ C  N +     + + C    
Sbjct: 425 -EVVLKSAVSQS---ITYHLGGPVTQPSTMVQLRREATVQCHPNMSYYYRHSFTFCNVTE 480

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++ NDPCE  NIA +   I+  L   L+++ R L+ Q  +  D + ADPKR N+TW
Sbjct: 481 C-LFDIVNDPCETKNIAEAYARIARDLDLYLEHYGRVLMKQIRKPVDWL-ADPKRRNNTW 538

Query: 627 SPWI 630
            PW+
Sbjct: 539 EPWM 542



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/74 (40%), Positives = 47/74 (63%), Gaps = 6/74 (8%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQ 66
           A+    EYATDLFT EA+++IE+  + +PL+L ++HLA HA      +E P ++  ++  
Sbjct: 184 AYGMKREYATDLFTNEAIKIIENHELPRPLYLQISHLAVHAP-----IEQPDDSSRDEIV 238

Query: 67  YITDPNRRTYAALT 80
            I +PNRR YA + 
Sbjct: 239 QIREPNRRKYAKMV 252


>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 579

 Score =  431 bits (1109), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 237/544 (43%), Positives = 329/544 (60%), Gaps = 48/544 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAYNGIILN  Y  P  TPSR +  TG YPI  GMQG  I
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  + LPE+LR LGY+TK IGKWH+G+   +YTPL+RGF++ FG+ N  I+
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHIT 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  S+Q      + G+DM R    A     EY TDLFTKEA+++IE+  + +PL+L 
Sbjct: 162 YYDYEYSNQ-----NMTGYDMHRGDDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQ 216

Query: 272 LAHLAAHAGNAGKHLEAPQE-TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           ++HLA HA      +E P + + ++   I +PNRR YA MV KLD+SVG ++ AL  KGM
Sbjct: 217 ISHLAVHAP-----IEQPDDSSSDEIIQIREPNRRKYAKMVSKLDESVGRIVHALGEKGM 271

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L +S+I+F++DNGA ++       YRN+GSNYP RG K TL+EGGV+  A LWS ++++ 
Sbjct: 272 LRDSLILFLTDNGAASI-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKA 326

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RV  +++HI+DWLPTLY+AAGGD   L                        IDG+DQW 
Sbjct: 327 ARVFKKLIHITDWLPTLYSAAGGDLKDL----------------------GEIDGIDQWR 364

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLN 509
            L       R  +L+NIDE  +T       +KL+ G+  NG  D YYG + R+ + P   
Sbjct: 365 VLSEEQGRGREKLLLNIDEVMKTEGAIYSRFKLLRGSIGNGYYDKYYGDSGRTLETPPYT 424

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
              +++S   QS   ++ ++  P++    M   R++AT+ C  N       + + C    
Sbjct: 425 -EVVLKSAVSQS---ITYHLGGPVTQPSTMIQLRREATVQCHPNVTYYYRHSFTFCNVTE 480

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++ NDPCE  NIA +   I+  L   L+ + R L+ Q  +  D + ADP+R N+TW
Sbjct: 481 C-LFDIVNDPCETKNIAEAYARIARDLDLYLEQYGRVLMKQIRKPVDWL-ADPRRRNNTW 538

Query: 627 SPWI 630
            PW+
Sbjct: 539 EPWM 542



 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R    A     EY TDLFTKEA+++IE+  + +PL+L ++HLA HA      +E P +
Sbjct: 177 MHRGDDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQISHLAVHAP-----IEQPDD 231

Query: 61  -TINQFQYITDPNRRTYAALT 80
            + ++   I +PNRR YA + 
Sbjct: 232 SSSDEIIQIREPNRRKYAKMV 252


>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 545

 Score =  431 bits (1108), Expect = e-118,   Method: Compositional matrix adjust.
 Identities = 242/572 (42%), Positives = 332/572 (58%), Gaps = 48/572 (8%)

Query: 62  INQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGI 117
           I  F Y+        AA  ++T   ++ +     GWND+ FHGS +IPTPNIDALAY+G+
Sbjct: 5   IFLFSYLLAARNWIVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGL 64

Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYST 177
           ILN  Y  P+CTPSR++LMTGKYPIHTGMQ   + GAEPRG+PL E+ LPEYLRELGY T
Sbjct: 65  ILNRYYVSPICTPSRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRT 124

Query: 178 KAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
             +GKWHLGF+ +EYTP YRGFESH GY  G   YYDH   ++        G DMRRN+ 
Sbjct: 125 HIVGKWHLGFYTKEYTPTYRGFESHLGYWTGHQDYYDHSAVEE-----PYWGMDMRRNME 179

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
            AWD  G+Y+TD+FTKEAV+LI+     +P+FLYLAH A H+ N    L    E +++ +
Sbjct: 180 PAWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDKIK 239

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
            I +  RR +A M   +D SVG V+ AL+++ ML + +I+F +DNG P   +       N
Sbjct: 240 -IPEYKRRRFAGMASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFN-----LN 293

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
             SN+P RGVK+TL+EGGV+   ++WS ++ +  R+S    HI+DWLPTL  AAGG    
Sbjct: 294 AASNWPLRGVKDTLFEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAGGY--- 350

Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
             +N  G                  IDGLD W +L  +  S R  VL NID+   TAA+ 
Sbjct: 351 --VNATG-----------------KIDGLDAWEALADDKESPRRVVLHNIDDVLGTAAIS 391

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESK-TYQSLQQLSQNIFLPISNI 536
           +D WKL+ G+   G  + +YG   S +    +  A+ E     ++L++L + +     + 
Sbjct: 392 VDQWKLIKGSTYRGRWNQWYGP--SGRDYPYDVEAVAEGCLVAKALKRLGRGL-----DR 444

Query: 537 DKMRSTRQQATIHCGANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
            K+   R QATIHCG  P  P          C L ++  DPCE+ N+A  +PD+  +L E
Sbjct: 445 GKLVELRNQATIHCGEQPQQPQKACNPLEAAC-LIDVRADPCERRNLADEQPDVVRRLEE 503

Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
            L     + VP  +   D  QADP  ++ TW+
Sbjct: 504 ELARLNASAVPPGNLPWD-SQADPALWDHTWN 534



 Score = 74.7 bits (182), Expect = 1e-10,   Method: Compositional matrix adjust.
 Identities = 36/90 (40%), Positives = 55/90 (61%), Gaps = 1/90 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN+  AWD  G+Y+TD+FTKEAV+LI+     +P+FLYLAH A H+ N    L    E
Sbjct: 174 MRRNMEPAWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDE 233

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIV 90
            +++ + I +  RR +A +  +   ++  V
Sbjct: 234 IVDKIK-IPEYKRRRFAGMASAMDASVGAV 262


>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
 gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
          Length = 564

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 231/569 (40%), Positives = 331/569 (58%), Gaps = 42/569 (7%)

Query: 62  INQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGI 117
             Q + I+ P+ RT  +   S    ++ +     GWND+ FHGS +IPTPN+DALAY+GI
Sbjct: 16  FQQGEAISCPSNRTENSSMDSARPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGI 75

Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYST 177
           ILN  Y  P+CTPSR++LMTGKYPIHTGMQ   ++  EPRG+PL+E+ LP+YL++LGYS 
Sbjct: 76  ILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSN 135

Query: 178 KAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
             +GKWHLG ++  +TP+ RGF+SH G+  G      H ++D  +      G DMRR   
Sbjct: 136 HIVGKWHLGHYQLRFTPMQRGFDSHTGFWTG-----HHHMNDHTAVEHGHWGLDMRRGYD 190

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
            A+D  G+Y T +   EA+ +++      PLFLY+AH A H+ N    L AP ET+    
Sbjct: 191 VAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLG 250

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
           +I +  RR +AAM+ +LD +VG+++ AL  + MLEN+II+F SDNG P   + +     N
Sbjct: 251 HIENYRRRKFAAMMVELDRTVGSLVDALHARDMLENTIIVFSSDNGGPADGFND-----N 305

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
             SN+P RGVKNTLWEGG++    +WSP +Q   RVS QM+ + DWLPTLY AAGG+ S 
Sbjct: 306 AASNWPLRGVKNTLWEGGLRAAGFIWSPLLQNVSRVSHQMVQVCDWLPTLYEAAGGNVSA 365

Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
           LP ++DG+  W  L                  SS     P+RR  +L NID+   TAA+ 
Sbjct: 366 LPTDLDGISVWHEL------------------SS---GAPTRRIEILHNIDDIWGTAALT 404

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           + +WKLV G+  N T DG+YG            + +  S   + +++LS    LP    +
Sbjct: 405 VGNWKLVKGSHYNRTWDGWYGPAGIRDEKAYALDKLANSPAGKVMEELS---LLPTP--E 459

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
           ++   R++AT+ CGA     +       PC L+++ +DPCE NN+A     I   L   L
Sbjct: 460 RITQLRREATVSCGAGAHMASECDPLERPC-LYDVESDPCEYNNLAEEHLHILQSLLARL 518

Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
             +  T VP S+ + D  + +P+R+N TW
Sbjct: 519 ADYNSTAVPPSNLEDD-PRGEPQRWNFTW 546



 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 44/79 (55%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR    A+D  G+Y T +   EA+ +++      PLFLY+AH A H+ N    L AP E
Sbjct: 185 MRRGYDVAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHAAVHSANPYDFLPAPDE 244

Query: 61  TINQFQYITDPNRRTYAAL 79
           T+    +I +  RR +AA+
Sbjct: 245 TVANLGHIENYRRRKFAAM 263


>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
 gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
          Length = 675

 Score =  428 bits (1101), Expect = e-117,   Method: Compositional matrix adjust.
 Identities = 236/538 (43%), Positives = 319/538 (59%), Gaps = 34/538 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAY+GIILN  Y  P+CTPSRASLMTGK PI+ GMQ   I
Sbjct: 43  GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +PEY +E GY T  +GKWHLGF  ++YTP  RGF++H GYL   + 
Sbjct: 103 VSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGYLGPYVD 162

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+D+ L  ++S      G+DMR NL+  +D+ G YATD FTK A  +IE      PLFL 
Sbjct: 163 YWDYTL--KFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLV 220

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           + HLA HA N    L+AP+E I +F YI+D  RR YAAMV KLDDSVG + ++L+ K ML
Sbjct: 221 VNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRSKNML 280

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSII+FMSDNGAPT        + N GSNYP RG+K+  WE   +  A +WSP +Q+  
Sbjct: 281 DNSIILFMSDNGAPTAAL-----HANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQ 335

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q +HISDWLPTL +AAG D   +P                 S+ +S IDG DQW +
Sbjct: 336 RVSNQFIHISDWLPTLASAAGID---IPF----------------SKDHSEIDGQDQWEA 376

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  +T + R  VL  IDE    ++   + +K V GT  NG+ DG+YGQ  ++   L +  
Sbjct: 377 LSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQ 436

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            I      +  +   + I     + D ++  R+ A ++C   P     +P    PC LF+
Sbjct: 437 YIDLVLQTEITRWAGETI-----SRDTIKYLRKHARVNCNHQPEANKCNPLKR-PC-LFD 489

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           + NDPCE N+++   P    +L   ++ +RR L  +   +P    A+P  F   W+ W
Sbjct: 490 IINDPCELNDLSHKFPMKFRELRSTVQTYRR-LATKPRNKPADPAANPANFGGVWTWW 546



 Score = 82.4 bits (202), Expect = 6e-13,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 50/80 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR NL+  +D+ G YATD FTK A  +IE      PLFL + HLA HA N    L+AP+E
Sbjct: 181 MRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLVVNHLAPHAANDDDPLQAPEE 240

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F YI+D  RR YAA+ 
Sbjct: 241 DIRKFDYISDERRRIYAAMV 260


>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
 gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
          Length = 607

 Score =  427 bits (1097), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 236/538 (43%), Positives = 319/538 (59%), Gaps = 34/538 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAY+GIILN  Y  P+CTPSRASLMTGK PI+ GMQ   I
Sbjct: 43  GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +PEY +E GY T  +GKWHLGF  ++YTP  RGF++H GYL   + 
Sbjct: 103 VSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGYLGPYVD 162

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+D+ L  ++S      G+DMR NL+  +D+ G YATD FTK A  +IE      PLFL 
Sbjct: 163 YWDYTL--KFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLV 220

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           + HLA HA N    L+AP+E I +F YI+D  RR YAAMV KLDDSVG + ++L+ K ML
Sbjct: 221 VNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRSKNML 280

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSII+FMSDNGAPT        + N GSNYP RG+K+  WE   +  A +WSP +Q+  
Sbjct: 281 DNSIILFMSDNGAPTAAL-----HANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQ 335

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q +HISDWLPTL +AAG D   +P                 S+ +S IDG DQW +
Sbjct: 336 RVSNQFIHISDWLPTLASAAGID---IPF----------------SKDHSEIDGQDQWEA 376

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  +T + R  VL  IDE    ++   + +K V GT  NG+ DG+YGQ  ++   L +  
Sbjct: 377 LSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQ 436

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            I      +  +   + I     + D ++  R+ A ++C   P     +P    PC LF+
Sbjct: 437 YIDLVLQTEITRWAGETI-----SRDTIKYLRKHARVNCNHQPEANKCNPLKR-PC-LFD 489

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           + NDPCE N+++   P    +L   ++ +RR L  +   +P    A+P  F   W+ W
Sbjct: 490 IINDPCELNDLSHKFPMKFRELRSTVQTYRR-LATKPRNKPADPAANPANFGGVWTWW 546



 Score = 82.4 bits (202), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 50/80 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR NL+  +D+ G YATD FTK A  +IE      PLFL + HLA HA N    L+AP+E
Sbjct: 181 MRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLVVNHLAPHAANDDDPLQAPEE 240

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F YI+D  RR YAA+ 
Sbjct: 241 DIRKFDYISDERRRIYAAMV 260


>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 552

 Score =  426 bits (1095), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 231/535 (43%), Positives = 314/535 (58%), Gaps = 40/535 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS +IPTPN+DALAY+GIILN  Y  P+CTPSRA+LMTG+YPIHTGMQ   +
Sbjct: 36  GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVL 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G EPRG+PL E+ LPEYLRELGY    +GKWHLG + R YTPL RGF+SH G+  G   
Sbjct: 96  YGMEPRGLPLEEKLLPEYLRELGYKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTG--- 152

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
              H + D  +   E  G DMRR    A+D  G+Y T +   EAV  I +  +  PLFLY
Sbjct: 153 --HHHMFDHSAVETETWGLDMRRGYDVAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLY 210

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +AH A H+ N    L AP  T+   +++    RR +AAM+ +LD+SVG ++ AL+ KGML
Sbjct: 211 VAHAAVHSANPYDFLPAPDVTVAGLEHVEPYPRRKFAAMLSELDESVGAIVEALKIKGML 270

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+II+F SDNG P   +       N  SN+P RGVKNTLWEGGV+    +WSP I ++ 
Sbjct: 271 DNTIIVFSSDNGGPAEGFNS-----NAASNWPLRGVKNTLWEGGVRAAGFIWSPLIAESR 325

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS QM+HISDWLPTL  AAG +   LP  +DG++ W SLL       N N         
Sbjct: 326 RVSQQMIHISDWLPTLLDAAGYNVETLPKTLDGINVWKSLL-------NGN--------- 369

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
                 + R  +L NID+   +AA+ +++WK++ GT   G  D +YG          N  
Sbjct: 370 -----STERREILHNIDDIWGSAALTVNNWKILKGTNYKGQWDNWYGPPGDRDPEAYNIT 424

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
           A+    T ++L +++    LP      +R+ R  AT+HC +     T +P    PC L++
Sbjct: 425 AVQHCPTGKALSEMN---LLPTEA--TIRNLRLNATVHC-SKEVRATCNPLEQ-PC-LYD 476

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           +  DPCE +N+A   P I   + + L  +  T VP  +   D  + DP+ +  TW
Sbjct: 477 VLQDPCELDNLAERFPAILDSMLKKLDAYNATAVPPGNLALD-TRGDPRFWGYTW 530



 Score = 61.2 bits (147), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 43/79 (54%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR    A+D  G+Y T +   EAV  I +  +  PLFLY+AH A H+ N    L AP  
Sbjct: 171 MRRGYDVAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDV 230

Query: 61  TINQFQYITDPNRRTYAAL 79
           T+   +++    RR +AA+
Sbjct: 231 TVAGLEHVEPYPRRKFAAM 249


>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
 gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
 gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
          Length = 562

 Score =  424 bits (1091), Expect = e-116,   Method: Compositional matrix adjust.
 Identities = 239/541 (44%), Positives = 323/541 (59%), Gaps = 43/541 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS EIPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPLYRGF SH G+ +G   
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQD 156

Query: 212 YYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PL 268
           Y DH         VE N  G DMR     A+D  G Y TD+ T  +V++I +    K PL
Sbjct: 157 YNDHT-------AVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPL 209

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY+AH A H+ N    L  P   + +  +I +  RR +AAMV K+D+SVG ++  L++ 
Sbjct: 210 FLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKS 269

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            MLENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++
Sbjct: 270 NMLENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLK 324

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           ++ RVS Q MHI DWLPTL  AAGG  +                    S  +  IDG   
Sbjct: 325 KSQRVSNQTMHIIDWLPTLLEAAGGQPAL-------------------SNLSKQIDGQSI 365

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
           W +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+YG        L 
Sbjct: 366 WRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLY 425

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGP 566
           ++  +  S+  ++L+ L     LP S  D+ R  R  AT+ C    +  T    T  + P
Sbjct: 426 DWQLVGRSRAGKALEALK---MLP-SRADQQR-IRAAATVSCPGQSSQGTSCVATAFSAP 480

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++ +DPCEQ N+A   P++ + L   L+    T VP S++  D  +ADP+ +N TW
Sbjct: 481 C-LFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAVPPSNKPAD-PRADPRFWNYTW 538

Query: 627 S 627
           +
Sbjct: 539 T 539



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T  +V++I +    K PLFLY+AH A H+ N    L  P 
Sbjct: 172 MRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 231

Query: 60  ETINQFQYITDPNRRTYAALT 80
             + +  +I +  RR +AA+ 
Sbjct: 232 NDVIKMSHIPNYKRRKFAAMV 252


>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
          Length = 591

 Score =  423 bits (1088), Expect = e-115,   Method: Compositional matrix adjust.
 Identities = 237/557 (42%), Positives = 338/557 (60%), Gaps = 60/557 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ FHGS++I TPNIDAL YNGIILN  Y  P  TPSR +  TG+YPI  GMQG  I
Sbjct: 41  GWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRMGMQGEDI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+PL  R LPE+LR+LGY TK IGKWHLG++  ++TPL RGF+S  G+ N  +S
Sbjct: 101 QGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFDSFLGFYNSHVS 160

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY++  S Q      ++G+DM R  + A+ +  +Y TD FT EA+++IE     +PL+L 
Sbjct: 161 YYNYKYSFQ-----NMSGYDMHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQ 215

Query: 272 LAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           ++HLA HA      LE PQ+     +QF +I + NRR YA MV +LD+SVG ++ AL  K
Sbjct: 216 ISHLAVHA-----PLEGPQDYNHYDSQFLHIREINRRKYAKMVLRLDNSVGRIVQALGNK 270

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           GML+NS+I+F++DNGA  +       +RN+GSNYP RG+K TL+EGGV+  A+LWSP+++
Sbjct: 271 GMLKNSLILFLTDNGAAPI-----GKFRNYGSNYPLRGMKYTLYEGGVRGVALLWSPRLR 325

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +  RVS ++MH++DWLPTLY+ AGGD   L   IDG+DQW  L        N ++     
Sbjct: 326 KTARVSDELMHVTDWLPTLYSIAGGDVRDLG-EIDGIDQWCML--------NGSL----- 371

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
                   PS R+ +L+NIDE  +T       +KLV G+ E G  DGY+       VP++
Sbjct: 372 --------PSARDRLLLNIDEVSKTEGAIYKQFKLVRGSIEGGLYDGYFYDHIERNVPIM 423

Query: 509 NF-----NAIVESKTYQSLQQLSQNI-------FL--PISNIDKMRSTRQQATIHCGA-N 553
                    IV+       + + +++       FL  P++    +   R++AT++C + N
Sbjct: 424 LIMPSDPKTIVQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREATVNCRSRN 483

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
            + +T   C    C LF++ NDPCE NNIA     IS +L + L+ +   +V +  + P 
Sbjct: 484 RSFLT---CNVTEC-LFDVNNDPCETNNIAEQYSRISRELDKSLEQYWH-IVKRQMKIPV 538

Query: 614 LVQADPKRFNDTWSPWI 630
              ADP++ N+TW PWI
Sbjct: 539 DWLADPRKTNNTWEPWI 555



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 8/83 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R  + A+ +  +Y TD FT EA+++IE     +PL+L ++HLA HA      LE PQ+
Sbjct: 176 MHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQISHLAVHA-----PLEGPQD 230

Query: 61  ---TINQFQYITDPNRRTYAALT 80
                +QF +I + NRR YA + 
Sbjct: 231 YNHYDSQFLHIREINRRKYAKMV 253


>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
          Length = 546

 Score =  419 bits (1077), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 230/542 (42%), Positives = 330/542 (60%), Gaps = 44/542 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG+N+IPTPNIDALAYNG+IL   Y  P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 35  GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPR +PL    LPEYLR+LGY+T  +GKWH+G++   +TP  RGF++ FGY NG IS
Sbjct: 95  KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYNGYIS 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H +        +L+ H+  +NLS A++   EY TDL T+ A  +I++    KPL+L 
Sbjct: 155 YFNHTIKQNNHVGYDLHYHN-SKNLSVAYNF--EYITDLITERAENIIKNHDRRKPLYLQ 211

Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L+HLA H+ +A + +E    QET    +YI D NRR YA +V  +D+SVG VI AL +  
Sbjct: 212 LSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMDESVGRVIKALGQSS 271

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSII+FMSDNGA T    E     N+GSNYP RG+K TL+EGG++  A ++S  IQ 
Sbjct: 272 MLENSIIVFMSDNGAQTEGLLE-----NYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQN 326

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           + R+S +++HI+DWLPT Y+AAGG+   L  N+DG+DQW +++    S R          
Sbjct: 327 SSRISNELIHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIV----SGRE--------- 373

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
                   S+R SVL+NIDE +  ++  L  +KL+ G  +N     YYG    S   P  
Sbjct: 374 --------SKRGSVLLNIDEVENVSSALLGKYKLING--KNIQYSDYYGDNGTSVSYPEY 423

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           N ++++ S    ++   + N  L I ++ K+R+   +A + C       T  P     C 
Sbjct: 424 NVSSVLHSFVGSTIFH-TTNFTLNIEDVIKLRN---KARVVCNN----FTSYPKCVDKC- 474

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LF++ NDPCE  +++   P+I  +L   +  + + L+ Q++   D   + P+ FN TW P
Sbjct: 475 LFDVYNDPCETTDLSDEYPEIVKKLNYFITEYEKVLLHQTNAPVD-PHSYPENFNGTWMP 533

Query: 629 WI 630
           WI
Sbjct: 534 WI 535



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 4/82 (4%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQE 60
           +NLS A++   EY TDL T+ A  +I++    KPL+L L+HLA H+ +A + +E    QE
Sbjct: 176 KNLSVAYNF--EYITDLITERAENIIKNHDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQE 233

Query: 61  TINQFQYITDPNRRTYAALTKS 82
           T    +YI D NRR YA +  +
Sbjct: 234 TNATLEYIEDYNRRKYAGVVTA 255


>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
 gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
          Length = 545

 Score =  418 bits (1074), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 235/543 (43%), Positives = 318/543 (58%), Gaps = 62/543 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL +LGY++   GKWHLG ++ EYTPL+RGF SH         
Sbjct: 97  YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSH--------- 147

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                          L G DMR     A++  G+Y TD+ TK +V +I +    K PLFL
Sbjct: 148 -------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFL 194

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+GN    L  P + + +  +I    RR YAAMV K+D+SVG ++  L++  M
Sbjct: 195 YVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQLRKSNM 254

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP RGVKNTLWEGGV+   ++WSP +++ 
Sbjct: 255 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEKR 309

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN----IDGL 446
            RV  + MHISDWLPTL  AAGG                      P+  N+N    +DG+
Sbjct: 310 QRVVDETMHISDWLPTLIEAAGG---------------------IPALANANLSQSLDGM 348

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
             W +L+   PS R +VL NID+   +AA+ +  WK++ GT  NG  DG+YG        
Sbjct: 349 SIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKMIQGTNYNGRWDGWYGPGGERDPH 408

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--N 564
           L ++N +++S + +++  L     LP S  D+ R  R+ AT+ C       T    T  +
Sbjct: 409 LYDWNLVLKSSSGKAMTGLK---MLP-SRADQQR-LRETATVTCQGQSTQGTSCVATAFS 463

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
            PC LF++ +DPCEQ N+A   P +   L   L++   T VP S+ QPD  +A P  FN 
Sbjct: 464 APC-LFHVRDDPCEQFNLAEQYPKVLDLLMAELQHINSTAVPPSN-QPDDPRASPMYFNY 521

Query: 625 TWS 627
           TW+
Sbjct: 522 TWT 524



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A++  G+Y TD+ TK +V +I +    K PLFLY+AH A H+GN    L  P 
Sbjct: 155 MRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAHAAGHSGNPYNPLPVPD 214

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
           + + +  +I    RR YAA+      ++ ++
Sbjct: 215 DDVMKLDHILHYKRRRYAAMVAKMDESVGVI 245


>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
 gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
          Length = 545

 Score =  417 bits (1073), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 235/543 (43%), Positives = 318/543 (58%), Gaps = 62/543 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL +LGY++   GKWHLG ++ EYTPL+RGF SH         
Sbjct: 97  YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSH--------- 147

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                          L G DMR     A++  G+Y TD+ TK +V +I +    K PLFL
Sbjct: 148 -------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFL 194

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+GN    L  P + + +  +I    RR YAAMV K+D+SVG ++  L++  M
Sbjct: 195 YVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQLRKSNM 254

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP RGVKNTLWEGGV+   ++WSP +++ 
Sbjct: 255 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEKR 309

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN----IDGL 446
            RV  + MHISDWLPTL  AAGG                      P+  N+N    +DG+
Sbjct: 310 QRVVDETMHISDWLPTLIEAAGG---------------------IPALANANLSQSLDGM 348

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
             W +L+   PS R +VL NID+   +AA+ +  WK++ GT  NG  DG+YG        
Sbjct: 349 SIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKMIQGTNYNGRWDGWYGPGGERDPH 408

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--N 564
           L ++N +++S + +++  L     LP S  D+ R  R+ AT+ C       T    T  +
Sbjct: 409 LYDWNLVLKSSSGKAMTGLK---MLP-SRADQQR-LRETATVTCQGQSTQGTSCVATAFS 463

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
            PC LF++ +DPCEQ N+A   P +   L   L++   T VP S+ QPD  +A P  FN 
Sbjct: 464 APC-LFHVRDDPCEQFNLAEQYPKVLDLLVAELQHINSTAVPPSN-QPDDPRASPMYFNY 521

Query: 625 TWS 627
           TW+
Sbjct: 522 TWT 524



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A++  G+Y TD+ TK +V +I +    K PLFLY+AH A H+GN    L  P 
Sbjct: 155 MRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAHAAGHSGNPYNPLPVPD 214

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
           + + +  +I    RR YAA+      ++ ++
Sbjct: 215 DDVMKLDHILHYKRRRYAAMVAKMDESVGVI 245


>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
 gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
          Length = 573

 Score =  417 bits (1072), Expect = e-114,   Method: Compositional matrix adjust.
 Identities = 236/540 (43%), Positives = 314/540 (58%), Gaps = 38/540 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN+IPTPNIDALAY+GIILN  Y  P+CTPSRASLMTGK+P++ GMQ   I
Sbjct: 39  GWNDVSFHGSNQIPTPNIDALAYDGIILNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVI 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +P+Y RE GY T  +GKWHLGFFRR YTP YRGF+SHFGYL   I 
Sbjct: 99  ISDEPWGLGLDQKLMPQYFREAGYRTHLVGKWHLGFFRRAYTPTYRGFDSHFGYLGPYID 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+DH L  Q + T    G DMRRN +  +D  G YATDLF  EAV+LI+     KPLFL 
Sbjct: 159 YWDHSL--QMNET-SARGLDMRRNTAVNYDANGTYATDLFNDEAVRLIDSHNRSKPLFLV 215

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L HLA H GN    L+AP + I +F YI DP RRT AAMV ++D  VG +  +L+R+ ML
Sbjct: 216 LTHLAPHTGNEDDPLQAPADEIAKFDYIQDPKRRTLAAMVSRIDTGVGRIYRSLERRRML 275

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            N+II+F +DNGAPT+     S     GSNYP RG K + WEG V+  A++WS  + +  
Sbjct: 276 NNTIILFYADNGAPTLGIHANS-----GSNYPLRGQKESPWEGAVRGAALIWSRLLPRKG 330

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            VS Q +H+SDWLPTL  AAG       + I              S  NS IDG +QWS+
Sbjct: 331 IVSNQWLHVSDWLPTLGHAAG-------IRI--------------SPNNSPIDGQNQWST 369

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN-- 509
           L  ++ S R  V+ N   +   ++     WK V GT   G  D + GQ +  +       
Sbjct: 370 LTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKDGEQISHEEY 429

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           +  ++ S+T  S  QL+++    + +   +  +  Q T++  A P      PC      L
Sbjct: 430 YKRLIASQTIGSSMQLTRDDVKRLRSKAMVECSSAQGTVYVQAIPCEPLVKPC------L 483

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DPCE+ N+A   P++   L   ++  +R  +P    QP   ++DP   N TW+ W
Sbjct: 484 FNVVEDPCERVNLADQYPELLKDLQSDVERFKRDALP-PRNQPSDSRSDPALHNYTWTWW 542



 Score = 87.8 bits (216), Expect = 1e-14,   Method: Compositional matrix adjust.
 Identities = 44/91 (48%), Positives = 54/91 (59%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRN +  +D  G YATDLF  EAV+LI+     KPLFL L HLA H GN    L+AP +
Sbjct: 176 MRRNTAVNYDANGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPAD 235

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
            I +F YI DP RRT AA+       +  +Y
Sbjct: 236 EIAKFDYIQDPKRRTLAAMVSRIDTGVGRIY 266


>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
 gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
          Length = 542

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 228/539 (42%), Positives = 314/539 (58%), Gaps = 54/539 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ F GS +IPTPNIDALAY+G+ILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 43  GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL  + LP+YL ELGY++   GKWHLG ++R YTPLYRGF SH GY      
Sbjct: 103 YAAEPRGLPLDLKILPQYLNELGYTSHIAGKWHLGHWKRVYTPLYRGFSSHHGYW----- 157

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
                            G DMR     A+D  G+Y TD+ T+ ++Q+I + +P   PLFL
Sbjct: 158 -----------------GLDMRNGTEIAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFL 200

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+GN    L A  + + +   I    RR YAA+V  +D+SVG ++  L++  M
Sbjct: 201 YVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRRKYAALVTAMDESVGKIVEQLRKSRM 260

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSII+F +DNG P   +       N+ SNYP RGVKNTLWEGGV+  A+LWSPQ+ + 
Sbjct: 261 LENSIIVFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEGGVRGAALLWSPQLTKR 315

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
           PR + Q+MH+ DWLPTL  AAGG  +   L                    + +DG+  W 
Sbjct: 316 PRTAEQLMHMVDWLPTLVEAAGGKVALSKLEA------------------AKLDGVSLWQ 357

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ N  S RNS+L NID+   +AA+ +  WKLV G+  NG+ DG+YG          N+
Sbjct: 358 ALVRNEASPRNSILHNIDDIWGSAALSVGVWKLVQGSHYNGSWDGWYGPAGLRNPHDYNW 417

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN--GPCY 568
             + +    Q++QQL     LP    D +R  R+ AT+ C       +P    N      
Sbjct: 418 QLVTKCAAGQAMQQLK---MLP-QQADIIR-LRKAATVSCKEAKVKPSPDIACNPLKEIC 472

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           LFN+ +DPCEQ N+A   P + + L   L+    T +P ++ +PD  + DP+ +N TW+
Sbjct: 473 LFNVLDDPCEQRNLAKQHPKVLALLLNELQLFNATAIPPAN-KPDDPRGDPRLWNHTWT 530



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 1/83 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G+Y TD+ T+ ++Q+I + +P   PLFLY+AH A H+GN    L A  
Sbjct: 161 MRNGTEIAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASD 220

Query: 60  ETINQFQYITDPNRRTYAALTKS 82
           + + +   I    RR YAAL  +
Sbjct: 221 DAVRRLDKIQHYKRRKYAALVTA 243


>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 578

 Score =  416 bits (1070), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 225/545 (41%), Positives = 325/545 (59%), Gaps = 44/545 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHG+ +IPTPNIDAL YNGIILN  Y  P C+P+RA+ +TGKYPI  GMQG  I
Sbjct: 36  GWNDVGFHGATQIPTPNIDALGYNGIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G EPRG+P+  + LPEYL+ LGY T  IGKWH+G+   ++ P  RGF+  +G+ N  I 
Sbjct: 96  AGGEPRGLPVHVQTLPEYLQGLGYETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIG 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+    +YS+   ++G DM  N  TA+ T G YATD FT+ A+ +I    ++ P++L 
Sbjct: 156 YYDY----RYSQG-NMSGFDMHINGETAYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQ 210

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           ++HLA HA       + P +  ++F++I++P RR YA MV +LDDS+G ++S+L  +GML
Sbjct: 211 VSHLAPHAPMDVPFEDNPYD--DEFRHISEPKRRAYAKMVARLDDSLGRIVSSLGDRGML 268

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+I+FMSDNG  T+       +RNW SN+P RG K TL+EGGV+  A +WSP++    
Sbjct: 269 KNSVIVFMSDNGGATI-----GKFRNWASNWPLRGTKYTLFEGGVRSVAAMWSPKLLSKG 323

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q+ H++DWLPTL+  AGGD   L                        IDG+ QW  
Sbjct: 324 RVSEQLFHVTDWLPTLFQVAGGDMRDL----------------------GPIDGVSQWDV 361

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLNF 510
           L + T + R+ +L+NIDE  +T A     +K+V G+  +G  D   G+  R +K P+ N 
Sbjct: 362 LSIGTGNVRDKILLNIDEYTQTEAAIDKRFKIVRGSLYHGYYDNVEGEIGRGHKNPMYNT 421

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS-----PCTNG 565
           + I++S    +++   +++ +P++    M   ++QAT+ C  N   M  +     P  N 
Sbjct: 422 SRILKSAVSDAIR---EHLGVPVTQESVMWELQRQATVLCRPNMTNMQYTSRYTLPSCNQ 478

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
              LF+LG DPCE NN+A S P    +L EL       L+ +    P    ADP+R+N T
Sbjct: 479 TACLFDLGIDPCETNNVAKSYPAQLQEL-ELFLDKYSNLLTRQIRMPVDYMADPRRWNYT 537

Query: 626 WSPWI 630
           W PW+
Sbjct: 538 WQPWL 542



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 48/80 (60%), Gaps = 2/80 (2%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M  N  TA+ T G YATD FT+ A+ +I    ++ P++L ++HLA HA       + P +
Sbjct: 171 MHINGETAYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQVSHLAPHAPMDVPFEDNPYD 230

Query: 61  TINQFQYITDPNRRTYAALT 80
             ++F++I++P RR YA + 
Sbjct: 231 --DEFRHISEPKRRAYAKMV 248


>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  415 bits (1067), Expect = e-113,   Method: Compositional matrix adjust.
 Identities = 238/544 (43%), Positives = 329/544 (60%), Gaps = 49/544 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG++EIPTPNIDALAYNG+ILN  Y  P+CTPSR + +TGKYPI TGMQG P+
Sbjct: 15  GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGKYPIRTGMQGYPL 74

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GAEPRG+ L    LPEYL++LGY+T  +GKWH+G+  R Y P +RGF++  GY NG I 
Sbjct: 75  QGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGFDTFAGYYNGYIQ 134

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           Y++H L +      E  G+D+ R +      +   +Y TDL T EA  +I      KPL+
Sbjct: 135 YFNHTLYES-----EQLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSHNPAKPLY 189

Query: 270 LYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           L +AHLAAH+ +A + +E    +ET     YI D NRR YA++V  LD+SVG VI AL++
Sbjct: 190 LQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLDESVGRVIDALKK 249

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             ML NSII+F+SDNGA T  + +     N+GSNYP RG+K +L+EGG++  A ++SP I
Sbjct: 250 TDMLNNSIIVFISDNGAQTEGFLQ-----NYGSNYPLRGLKFSLFEGGIRGAACIYSPLI 304

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
            +  RVS Q+ HI+DWLPTLY+AAGG++S L   +DG DQWS++     S +N       
Sbjct: 305 DRPSRVSNQLFHITDWLPTLYSAAGGNSSDLK-QLDGFDQWSAI----KSAKN------- 352

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV-P 506
                     S+R+S+LINIDEK+ + A  +  +KLV  T E      YY  + +N + P
Sbjct: 353 ----------SKRSSILINIDEKENSEAALIGHYKLVTDTSE---YQKYYNYSGNNALYP 399

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
             N   I+ S    ++  +S +      N  K+   R++ATI C  N      S CTN  
Sbjct: 400 KYNVTTILASPAASAIASISISTL----NASKIMQLRKEATIIC-KNFTDF--SNCTNRS 452

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF+L  DPCE  ++ S  P    +L   +  ++  L+ Q++   D     P+ FNDTW
Sbjct: 453 C-LFDLNKDPCETTDLFSKHPKDVKRLNMFIDRYKSVLMNQTNAPVDPA-GFPRNFNDTW 510

Query: 627 SPWI 630
            PW+
Sbjct: 511 MPWL 514



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 2/71 (2%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
           +Y TDL T EA  +I      KPL+L +AHLAAH+ +A + +E    +ET     YI D 
Sbjct: 165 DYMTDLITDEAENIISSHNPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDI 224

Query: 72  NRRTYAALTKS 82
           NRR YA++  +
Sbjct: 225 NRRKYASVVAT 235


>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
 gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
          Length = 542

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 233/539 (43%), Positives = 315/539 (58%), Gaps = 59/539 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS EIPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPLYRGF SH+G       
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                              DMR     A+D  G Y TD+ T ++V++I +    K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFL 191

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+ N    L  P   + +  +I +  RR +AAMV K+DDSVG ++  L++  M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q MHI DWLPTL  AAGG  +   L                   +  IDG   W 
Sbjct: 307 QRVSNQTMHIVDWLPTLLEAAGGQPALANL-------------------SKQIDGQSIWR 347

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+YG        L ++
Sbjct: 348 ALVHDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDW 407

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
             +  S+  ++L+ L     LP S  D+ R  R  AT+ C    +  T    T  + PC 
Sbjct: 408 QLVGRSRAGKALEALK---MLP-SRADQQR-IRAAATVSCPGQSSQGTSCVATAFSAPC- 461

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           LF++ +DPCEQ N+A   P++ S L   L+    T VP S++  D  +ADP+ +N TW+
Sbjct: 462 LFHIRDDPCEQYNLAKQYPEVVSALMAELERFNATAVPPSNKPAD-PRADPRFWNFTWT 519



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T ++V++I +    K PLFLY+AH A H+ N    L  P 
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211

Query: 60  ETINQFQYITDPNRRTYAALT 80
             + +  +I +  RR +AA+ 
Sbjct: 212 NDVIKMSHIPNYKRRKFAAMV 232


>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
          Length = 562

 Score =  414 bits (1063), Expect = e-112,   Method: Compositional matrix adjust.
 Identities = 226/542 (41%), Positives = 334/542 (61%), Gaps = 43/542 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG+N+IPTPNIDALAYNG+IL   Y  P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50  GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPR +PL    LPEYLR+LGY+T  +GKWH+G++   +TP  RGF++ FGY +G IS
Sbjct: 110 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYIS 169

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H +        +L+ +D  +NLS  ++   EY TDL T+ A  +I++    KPL+L 
Sbjct: 170 YFNHTIKQDDHIGYDLH-YDNSKNLSIDYNF--EYTTDLITERAENIIKNHDRRKPLYLQ 226

Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L HLAAH+ +A + +E    QET    +YI D NRR YA +V  +D+SVG VI AL +  
Sbjct: 227 LCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALGQSS 286

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSII+F+SDNGA T    E     N+GSNYP RG+K TL+EGG++  A ++S  IQ 
Sbjct: 287 MLENSIIVFISDNGAQTEGLLE-----NYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQN 341

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           + R+S ++MHI+DWLPT Y+AAGG+   L  N+DG+DQW ++           + G +  
Sbjct: 342 SSRISNELMHITDWLPTFYSAAGGNLENLEENMDGVDQWDTI-----------VSGKE-- 388

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
                   S+R SVL+NIDE +  ++  +  +KL++  + N   + YYG    S   P  
Sbjct: 389 --------SKRESVLLNIDEVEDVSSALIGKYKLIINGK-NIQYNDYYGDNGTSVSYPEY 439

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           N ++++ S    ++   + N  L I ++ K+R+   +A + C       + S C +  C 
Sbjct: 440 NVSSVLHSIAGSTIFH-ATNFTLNIKDVIKLRN---KARVVCN---NFTSYSKCVD-KC- 490

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LF++ NDPCE  ++++  P+I  +L   +  + + L+ Q++   D  ++ P+ FN TW P
Sbjct: 491 LFDIYNDPCETTDLSNEHPEIVKKLNYFIAEYEKVLLHQTNAPVD-PRSYPENFNGTWMP 549

Query: 629 WI 630
           WI
Sbjct: 550 WI 551



 Score = 60.1 bits (144), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 4/82 (4%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQE 60
           +NLS  ++   EY TDL T+ A  +I++    KPL+L L HLAAH+ +A + +E    QE
Sbjct: 191 KNLSIDYNF--EYTTDLITERAENIIKNHDRRKPLYLQLCHLAAHSSDAKEVMEVRDEQE 248

Query: 61  TINQFQYITDPNRRTYAALTKS 82
           T    +YI D NRR YA +  +
Sbjct: 249 TNATLKYIEDYNRRKYAGVVTA 270


>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 575

 Score =  410 bits (1053), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 234/556 (42%), Positives = 321/556 (57%), Gaps = 47/556 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS +IPTPNIDALAYNG ILN  Y QP CTPSRA+L+TGKYPI  G+QGPPI
Sbjct: 44  GWNDVGFHGSIQIPTPNIDALAYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPI 103

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +   +P  E+ LP+YL+ELGYST  +GKWHLG +++ +TP  RGF+SHFGY NG IS
Sbjct: 104 ASGKASALPTNEKILPQYLKELGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYIS 163

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDK-PL 268
           Y +   S   +RT  ++G D RR    A + +    YATD+FT+EA ++IE    +   +
Sbjct: 164 YRN---STHATRT--MSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEM 218

Query: 269 FLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           FL ++HLA H+GN+G  HLE   +T N   F YI + NRR YA MV  LDDSVG +I +L
Sbjct: 219 FLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESL 278

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
               MLENSI++F+SDNGAPT +      + N+GSN+P RG K ++ EGGV+  A +WSP
Sbjct: 279 HEHEMLENSIVVFISDNGAPTDD--PLWGHDNFGSNWPLRGKKASVLEGGVRGVAAIWSP 336

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
            +++   VS  + HI+DWLPTLYTAAGG+   L                        IDG
Sbjct: 337 WLKEKNTVSNNLFHITDWLPTLYTAAGGNFEDL----------------------GKIDG 374

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ-ENGTMDGYYGQTRSNK 504
           ++QW SL   +   R+ +LINIDE +   A+  + WK+V   +  N   +  Y     N 
Sbjct: 375 VNQWKSLTDTSKQVRSKILINIDETRGEEALIFNQWKVVKSNRTSNMVFNLKYAGDPGNA 434

Query: 505 VPLLNFNAIVESKTYQSLQQ---LSQNIFLPISNI-DKMRSTRQQATIH---CG---ANP 554
            P  N + +V S   + L +   L+Q+  +  S   D    +R  A +    C    +NP
Sbjct: 435 GPKYNMSGVVASTVGKRLSKINCLTQDSCMNASTARDVFYVSRSLAKVDGQTCPERVSNP 494

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
            P     C +G C LF++  DPCE  N+A   P        +L+  ++ +  Q+ +    
Sbjct: 495 NPDQQYECFDGYC-LFDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKEMTMQAPDPIID 553

Query: 615 VQADPKRFNDTWSPWI 630
             ADP+RF   W PW+
Sbjct: 554 PDADPRRFAGYWEPWL 569



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 4/72 (5%)

Query: 15  YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITD 70
           YATD+FT+EA ++IE    +   +FL ++HLA H+GN+G  HLE   +T N   F YI +
Sbjct: 195 YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIEN 254

Query: 71  PNRRTYAALTKS 82
            NRR YA +  S
Sbjct: 255 ENRRLYAGMVTS 266


>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
 gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
          Length = 544

 Score =  409 bits (1050), Expect = e-111,   Method: Compositional matrix adjust.
 Identities = 233/539 (43%), Positives = 314/539 (58%), Gaps = 59/539 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS EIPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPL+RGF SH+G       
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFSSHWGL------ 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                              DMR     A+D  G Y TD+ T  +V++I      K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFL 191

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+ N    L  P   + +  +I    RR +AAMV K+DDSVG ++  L++  M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVLKMSHIPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q MHI DWLPTL  AAGG     P  +  L+Q               IDG   W 
Sbjct: 307 QRVSNQTMHIIDWLPTLLEAAGG-----PPAVANLNQ--------------QIDGQSIWR 347

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+YG          ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRQYDW 407

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
             + +S+  ++L  L     LP S  D+ R  R  AT+ C A  +  T    T  + PC 
Sbjct: 408 QLVAKSRVGKALDALK---MLP-SRADQQR-IRAAATVTCPAQSSQGTSCLATAFSAPC- 461

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           LF++ +DPCEQ N+A   P++ + L   L+    T VP S++  D  +ADP+ +N TW+
Sbjct: 462 LFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAVPPSNKPAD-PRADPRFWNYTWT 519



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T  +V++I      K PLFLY+AH A H+ N    L  P 
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211

Query: 60  ETINQFQYITDPNRRTYAALT 80
             + +  +I    RR +AA+ 
Sbjct: 212 NDVLKMSHIPHYKRRKFAAMV 232


>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
 gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
          Length = 558

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 231/544 (42%), Positives = 323/544 (59%), Gaps = 43/544 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G ND+    +N+IPTPNIDAL YNG++L+  Y Q  CTPSRA+ +TG YPI + MQG PI
Sbjct: 38  GHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNYPIRSAMQGLPI 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +PL    +P++L+ LGY T  +GKWHLG   R  TP  +GF+SHFGY NG   
Sbjct: 97  VAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGYWNGFTG 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYD+  +D  S  +E  G D+     T     G+YAT +FT+ A+ +IE     +PLFL 
Sbjct: 157 YYDY-FTDFNSTAIE--GFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPLFLL 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           + HLAAHAG  G  L  P E   Q  + YI DP RR YA +V +LD S+G V+  L  + 
Sbjct: 214 MTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQVVRKLSERQ 273

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSII+F SDNGAPTV       Y N GSN+P RG+K T +EGG++  A ++SP +++
Sbjct: 274 MLENSIILFFSDNGAPTV-----GPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPLLKK 328

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V+ +++H+SDWLPT Y AAGG+ + L   IDG++QW                     
Sbjct: 329 RGYVNKELIHVSDWLPTFYAAAGGNLADLG-PIDGVNQW--------------------- 366

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDS--WKLVLGTQENGTMDGYYGQT-RSNKVP 506
            +L L+TPS R+ +L+NI+E+  T ++  D+  +KLV G  E GT DGY+G + RS   P
Sbjct: 367 PTLSLDTPSPRSEILVNINEQDNTTSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDTP 426

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
             +  A+++S+T  ++Q+L+Q    PI+   ++R TR Q  +    N +   P  C+  P
Sbjct: 427 PYDPFAVLQSETNIAIQELTQT---PITR-QQIRVTRAQIDLSWCRNDSFRPPLNCSQ-P 481

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF+L NDPCE  NI    P+I+ +L   +      LVPQ     D   A+P+R+N+TW
Sbjct: 482 C-LFDLENDPCETTNIIEENPEIAERLLRRIADFFEVLVPQGDSTID-PNANPQRYNNTW 539

Query: 627 SPWI 630
             W+
Sbjct: 540 CTWL 543



 Score = 67.4 bits (163), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 33/70 (47%), Positives = 42/70 (60%), Gaps = 2/70 (2%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ--FQYITD 70
           G+YAT +FT+ A+ +IE     +PLFL + HLAAHAG  G  L  P E   Q  + YI D
Sbjct: 186 GQYATRVFTERALDIIEGHNTTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQD 245

Query: 71  PNRRTYAALT 80
           P RR YA + 
Sbjct: 246 PRRRLYAEIV 255


>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
          Length = 571

 Score =  407 bits (1045), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/550 (41%), Positives = 317/550 (57%), Gaps = 60/550 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSNEIPTPNIDALAY G+ILN  YA P CTPSR + +TG++PI  G+QG P+
Sbjct: 37  GWNDVGFHGSNEIPTPNIDALAYGGVILNRHYALPTCTPSRTAFLTGRHPIRMGLQGIPM 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRGVPL ER LPEYLRELGY T+ +GKWHLG++  ++TP  RGF+S  GY  GVI+
Sbjct: 97  NVAEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVIT 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--------EYATDLFTKEAVQLIEDQP 263
           Y++H ++            D    +   WDT G        +Y TD  + +A  +I++  
Sbjct: 157 YFNHTVT-----------KDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHD 205

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQ--ETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
             KPLFL LAH+AAHA      +E     E  +   YI D NRR YA +V  +DDSVG V
Sbjct: 206 RKKPLFLQLAHVAAHASENRDPIEVRNMTEVNDTLSYIPDINRRKYAGVVTAMDDSVGRV 265

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+   ML NSIIIFMSDNG+PT E    + Y N+GSNYP RG+K T++EGGV+VPA 
Sbjct: 266 VKALKDANMLSNSIIIFMSDNGSPTAE----APYTNYGSNYPLRGIKATVFEGGVRVPAC 321

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           ++SP+++   RVS ++ HI+DW PTLY  AGGD S++  ++DG+DQWSS+   + S++  
Sbjct: 322 VFSPRLKDRFRVSDELFHITDWFPTLYKLAGGDLSKIQ-DLDGVDQWSSI---SGSQK-- 375

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
                           S R S+L+NIDE     A     +KL+ G       D YYG+  
Sbjct: 376 ----------------SNRESLLVNIDEVSNPEAAISGYYKLIRGINR---YDDYYGKDG 416

Query: 502 SNKVP-LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
           ++  P   +   ++ S   +++  L      P   I ++R+   +AT+ C          
Sbjct: 417 NDYSPKTYDVTGVLSSLAGRAIASLGNQYLPPQKRITELRN---KATLRCEKK----DDR 469

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
           P     C LF++  DPCE  NI    P++   L   ++ ++  L+ Q++   D     P+
Sbjct: 470 PSCRDTC-LFDIVKDPCETTNIIDEHPEMEIVLNTYIQGYKAVLMNQTNTYIDPAGY-PE 527

Query: 621 RFNDTWSPWI 630
            FN  W PW+
Sbjct: 528 NFNGYWMPWM 537



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 10/84 (11%)

Query: 9   WDTVG--------EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ- 59
           WDT G        +Y TD  + +A  +I++    KPLFL LAH+AAHA      +E    
Sbjct: 174 WDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNM 233

Query: 60  -ETINQFQYITDPNRRTYAALTKS 82
            E  +   YI D NRR YA +  +
Sbjct: 234 TEVNDTLSYIPDINRRKYAGVVTA 257


>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
          Length = 563

 Score =  406 bits (1043), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 218/542 (40%), Positives = 320/542 (59%), Gaps = 44/542 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG+++IPTPNIDALAYNG+IL   Y  P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50  GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +PL    LPEYLR+LGY+T  +GKWH+G++   +TP YRGF++  GY +G I+
Sbjct: 110 KAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGYIT 169

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H +        +L+ +D+  NLS  +    EY TDL T+ A  +I +    KPL+L 
Sbjct: 170 YFKHTIEQNLHVGYDLH-YDVAGNLSVKYSH--EYMTDLITERAEDIIFNHNRSKPLYLQ 226

Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L+H+AAH+ +A  ++E    +ET     YI D +RR  A +V  +D+SVG V+ AL++  
Sbjct: 227 LSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDESVGRVVQALRQTN 286

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSII+FMSDNGA TV         N+GSNYP RG+K TL+EGGV+  A ++SP I+ 
Sbjct: 287 MLENSIIVFMSDNGAQTVGL-----LNNYGSNYPLRGLKFTLFEGGVRGVACVYSPAIKN 341

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R+S Q++H++DWLPT Y+AAGG+   L  N+DG+D                     QW
Sbjct: 342 PSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVD---------------------QW 380

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
           ++++    +RR +VL+NIDEK+  +   +  +KL+ G   N +   YYG +  S   P  
Sbjct: 381 ATIVSEKKTRRKNVLLNIDEKQALSGALMGRYKLINGA--NVSYGDYYGDSGTSESYPEY 438

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           N + ++ S    +L  +S   F    N +++   R+ AT+ C       T  P     C 
Sbjct: 439 NVSNVLHSAAGSALASVSNATF----NSERIIELRRAATVVCDN----FTSYPRCLDRC- 489

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LF++  DPCE  +++   P++  +L   +  +   L  Q+    D  ++ P+ FN TW P
Sbjct: 490 LFDVYKDPCETTDVSREHPEVVDELNRFIAEYFAVLKTQTSGLVD-PRSYPEHFNGTWMP 548

Query: 629 WI 630
           WI
Sbjct: 549 WI 550



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 45/81 (55%), Gaps = 4/81 (4%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQET 61
           NLS  +    EY TDL T+ A  +I +    KPL+L L+H+AAH+ +A  ++E    +ET
Sbjct: 192 NLSVKYSH--EYMTDLITERAEDIIFNHNRSKPLYLQLSHVAAHSSDAKANMEVRDEEET 249

Query: 62  INQFQYITDPNRRTYAALTKS 82
                YI D +RR  A +  +
Sbjct: 250 NATLGYIEDFDRRKLAGVVTA 270


>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
 gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
          Length = 556

 Score =  404 bits (1038), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 228/540 (42%), Positives = 311/540 (57%), Gaps = 57/540 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTPNIDALAY+G+ILN  Y  P+CTPSR+SLMTGKY IHTGMQ   +
Sbjct: 37  GFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGKYAIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +GAEPRG+PL E+ LP+YL +LGY++   GKWHLG ++  YTPL RGF SH G+      
Sbjct: 97  YGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGFNSHHGFW----- 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                            G DMR     A++  G+Y TD+ T+ ++ +I + PV K PLFL
Sbjct: 152 -----------------GLDMRNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFL 194

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH AAH+GN    L    + + +   I    RR YAAMV KLD+SVG ++  L+R  M
Sbjct: 195 YVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRYAAMVSKLDESVGIIVDQLRRSNM 254

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSII+F +DNG     +       N+ SNYP RGVKNTLWEGGV+   +LWS +  + 
Sbjct: 255 LENSIIVFATDNGGAAEGFN-----LNFASNYPLRGVKNTLWEGGVRGVGLLWSQRYLKR 309

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
           PRV+ Q MHI+DWLPT    AGG  +                    +   +++DG   W 
Sbjct: 310 PRVAEQTMHITDWLPTFVEVAGGKEAL-------------------ANFTTSLDGQSIWQ 350

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ +  S R SVL NID+   ++A+ +  WKL+ GT   G  DG+YG        L N+
Sbjct: 351 ALVQDEASPRKSVLHNIDDIWNSSALTVGDWKLIKGTNYKGAWDGWYGPAGERDPQLYNW 410

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN---GPC 567
             + +S   +++ QL     LP S  D+ R  R  AT++C    +        N    PC
Sbjct: 411 QQVAKSLAGKAMLQLK---VLP-SRADQQR-LRSAATVNCPNQQSNTQKGSVCNPLQTPC 465

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
            LFNL +DPCEQ N+A+  P + + L   LK+   T +P S+  PD  + +PK +N TW+
Sbjct: 466 -LFNLRDDPCEQYNLATQYPLVLNSLLSELKHFNSTAIPPSN-LPDDPRGNPKFWNYTWT 523



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A++  G+Y TD+ T+ ++ +I + PV K PLFLY+AH AAH+GN    L    
Sbjct: 155 MRNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFLYVAHAAAHSGNPYNPLPVAD 214

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
           + + +   I    RR YAA+      ++ I+
Sbjct: 215 DEVIKLGRIEHYKRRRYAAMVSKLDESVGII 245


>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
          Length = 531

 Score =  404 bits (1037), Expect = e-110,   Method: Compositional matrix adjust.
 Identities = 224/542 (41%), Positives = 310/542 (57%), Gaps = 64/542 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+SFHGS++IPTPN+  +A  GIIL+  Y Q  CTPSR +L+TG+YPI +GMQG P+
Sbjct: 35  GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +PL    +P + + LGY T  +GKWHLG   +E TPL +GF+SHFGY NG + 
Sbjct: 95  KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+D++   +      + G D+       W + G YAT+LFT+ ++ +IE   V  PLFL 
Sbjct: 155 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 214

Query: 272 LAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           ++HLAAH G  G  L  P   +T ++F YI DP RR YA +V  LD S+G +++ L  K 
Sbjct: 215 VSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQ 274

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSI++F SDNGA TV       Y N GSN+P RGVK + +EGGV+V A ++SP   +
Sbjct: 275 MLDNSIVLFFSDNGAQTV-----GMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHK 329

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              VS  ++HISDWLPTLY+AAGGD + L   IDG+DQW +L         +N       
Sbjct: 330 KGYVSEHLIHISDWLPTLYSAAGGDVAHLG-QIDGIDQWDAL---------TN------- 372

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
                N PS R  +LINIDE     A+  D +KL+ G+   GT D YYG + R  + P  
Sbjct: 373 -----NNPSNRTEILINIDEVDENFAIIRDKFKLIQGSYHEGTFDQYYGDSGRGPENPTP 427

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           N N                      +++   R+  Q   ++            CT G C 
Sbjct: 428 NPNHTT-------------------TDLSWCRAPDQTPILN------------CTKG-C- 454

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LF+L  DPCE  NI  S P+I++QLYE +    + LVPQ ++  D  ++DP  +N+TW  
Sbjct: 455 LFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKELVPQRNKDTD-PKSDPIFYNNTWCT 513

Query: 629 WI 630
           W+
Sbjct: 514 WL 515



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 46/75 (61%), Gaps = 2/75 (2%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQF 65
            W + G YAT+LFT+ ++ +IE   V  PLFL ++HLAAH G  G  L  P   +T ++F
Sbjct: 182 VWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEF 241

Query: 66  QYITDPNRRTYAALT 80
            YI DP RR YA + 
Sbjct: 242 SYIQDPRRRLYAGVV 256


>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
 gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
          Length = 531

 Score =  403 bits (1036), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 234/540 (43%), Positives = 315/540 (58%), Gaps = 56/540 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ F GS +IPTPNIDALAY+G+ILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 32  GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL  + LP+YL +LGY++   GKWHLG ++R YTPLYRGF SH GY      
Sbjct: 92  YAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFSSHHGYW----- 146

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                            G DMR     A+D  G+Y+TD+ T+ ++ +I      K PLFL
Sbjct: 147 -----------------GLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKHNATKGPLFL 189

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+GN    L    + + +   I    RR YAA+V  +D+SVG ++  L++  M
Sbjct: 190 YVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKIVEQLRKSRM 249

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LE SII+F +DNG P   +       N+ SNYP RGVKNTLWEGGV+  A+LWSPQ+ + 
Sbjct: 250 LETSIIVFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEGGVRGAALLWSPQLTKR 304

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
           PR + Q MHI DWLPTL  AAGG+ +   L                    + +DG+  W 
Sbjct: 305 PRTAEQTMHIVDWLPTLVEAAGGEAALAKLG------------------TAKLDGISLWQ 346

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +LL N  S R SVL NID+   +AA+ +  WKLV GT  NG+ DG+YG          ++
Sbjct: 347 ALLKNETSPRKSVLHNIDDIWGSAALSVGDWKLVKGTNYNGSWDGWYGPAGMRNPHDYDW 406

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANP--APMTPSPCTNGPC 567
            ++ +S T Q++QQL     LP    D +R  R+ A + C  ANP  AP       + PC
Sbjct: 407 QSVAKSATGQAMQQLK---MLP-KTADMIR-LREAANVICKDANPKDAPHIACNPLSEPC 461

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
            LFN+  DPCEQ N A   P +   L + L+    T VP ++ +PD  +ADP+ +N TW+
Sbjct: 462 -LFNVREDPCEQRNQAKQHPKVLELLLKELQLLNATAVPPAN-KPDDPRADPRLWNHTWT 519



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 1/83 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G+Y+TD+ T+ ++ +I      K PLFLY+AH A H+GN    L    
Sbjct: 150 MRNGTEVAYDLHGQYSTDVITQHSLNVISKHNATKGPLFLYVAHAAVHSGNPYNPLPVKD 209

Query: 60  ETINQFQYITDPNRRTYAALTKS 82
           + + +   I    RR YAAL  +
Sbjct: 210 DAVRRLDTIQHYKRRKYAALVTA 232


>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 599

 Score =  402 bits (1032), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 240/568 (42%), Positives = 329/568 (57%), Gaps = 56/568 (9%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           +LIV    GWND+ FHGS +IPTPNIDALAYNG+ILN  Y QP CTPSRA+L+TGKYPI 
Sbjct: 37  ILIVADDLGWNDVGFHGSIQIPTPNIDALAYNGVILNRHYVQPTCTPSRAALLTGKYPIR 96

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            G+QG PI    P  +PL E+ LP+YL++LGYST  +GKWHLG  + ++TP+ RGF+SHF
Sbjct: 97  YGLQGFPIIAGVPLALPLNEKILPQYLKDLGYSTHLVGKWHLGANKNQHTPIKRGFDSHF 156

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIE-D 261
           GY NG ISY +   S     T  + G D RR    A D  V  YATD+FT EA ++I+  
Sbjct: 157 GYWNGFISYRNSTHS-----TGLMVGKDARRGFERAGDEMVDRYATDIFTDEANKVIKLC 211

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSV 318
           +  DKP+FL ++HLA H G  G + LE   +T N  +F YI +  RR YA M+  LD+SV
Sbjct: 212 KNHDKPMFLMVSHLAVHTGVPGPNILEVSNKTHNDIRFDYIENKERRLYAGMLTSLDESV 271

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G++I +L   GMLE+SI++F+SDNGAP  +      Y N GSN+P RG K  + +GGV+ 
Sbjct: 272 GSIIESLDNNGMLEDSIVLFISDNGAPADD--PIWGYGNSGSNWPLRGEKGAVLDGGVRG 329

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
            A +WSP +++  R+   + HI+DWLPTLYTAAGGD   L   IDG+DQW S  L   S+
Sbjct: 330 VAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAGGDFEDLG-QIDGVDQWES--LTETSK 386

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD-GYY 497
           R                    R+ VLINIDE     A+  + WK+V   + N T D  YY
Sbjct: 387 RT-------------------RSMVLINIDETSGEEALIFNHWKVV---KSNRTSDIAYY 424

Query: 498 GQTR---SNKVPLLNFNAIVESKTYQSLQQLS-----QNIFLPISNIDKMRSTRQQATI- 548
            +      N  P  N  A+ ES     L +++      +    ++  D   + R QA I 
Sbjct: 425 LKYSGEPGNVGPDYNMTAVAESLAGSRLSKINCLVSKADCMDTLTTYDLFYNVRSQAKID 484

Query: 549 -HCG---ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
             C    ++P P     C + PC LF++  DPCE  NIAS  P+  +   ++L   +  +
Sbjct: 485 GKCSERISDPDPNARYECFDSPC-LFDIQRDPCEYQNIASRNPEALNMTADILAQFKNEM 543

Query: 605 VPQSHEQPDLVQADPKRFNDTWSPWIYR 632
             Q++ + D  ++DP+ F+  W  W+ +
Sbjct: 544 TMQNYPETD-PKSDPRFFDGYWDTWLEK 570



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 5/86 (5%)

Query: 2   RRNLSTAWD-TVGEYATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKH-LEAP 58
           RR    A D  V  YATD+FT EA ++I+  +  DKP+FL ++HLA H G  G + LE  
Sbjct: 181 RRGFERAGDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSHLAVHTGVPGPNILEVS 240

Query: 59  QETIN--QFQYITDPNRRTYAALTKS 82
            +T N  +F YI +  RR YA +  S
Sbjct: 241 NKTHNDIRFDYIENKERRLYAGMLTS 266


>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
          Length = 563

 Score =  401 bits (1031), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 218/542 (40%), Positives = 326/542 (60%), Gaps = 44/542 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG+++IPTPNIDALAYNG+IL   Y  P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50  GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPL 109

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +PL    LPEYLR+LGY+T  +GKWH+G++   +TP YRGF++  GY +G ++
Sbjct: 110 DPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMT 169

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H +   +    +L+ +D+  NLS  +    EY TDL T+ A  +I +    KPL+L 
Sbjct: 170 YFNHTIEQNHHVGYDLH-YDVAGNLSVKYSH--EYMTDLITERAEDIILNHNHSKPLYLQ 226

Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L+H+AAH+ N  K +E    +ET     YI D +RR  A +V  LD+SVG V+ AL++  
Sbjct: 227 LSHIAAHSSNINKTVEVRDEEETNATLGYIEDFDRRKLAGVVTALDESVGRVVQALKQAN 286

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSII+FMSDNGA TV   + S     GSNYP RG+K TL+EGGV+  A ++SP I+ 
Sbjct: 287 MLENSIILFMSDNGAQTVGLLDNS-----GSNYPLRGMKFTLFEGGVRGAACVYSPAIKN 341

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R+S Q++H++DWLPT Y+AAGG+   L  N+DG+DQW++++    S  N         
Sbjct: 342 PSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIV----SEEN--------- 388

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
                   +RR ++L+NIDEK+  +   +  +KL+ G +       YYG +  S   P  
Sbjct: 389 --------TRRKNILLNIDEKEDLSGALMGRYKLINGAKLQ--YGDYYGDSGMSESYPEY 438

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           N + ++ S    ++  +S N+ L   N +++   R+ A + C       T  P     C 
Sbjct: 439 NVSNVLHSAAGFAVASVS-NVTL---NSERIIELRRAAMVVCDN----FTSYPKCLDRC- 489

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LF++ NDPCE  +++   P++  ++   +  +   L+ Q+  + D  ++ P+ FN TW P
Sbjct: 490 LFDVYNDPCETTDVSREHPEVVDEINRFIAEYFTVLMTQTPTKTD-PRSYPENFNGTWMP 548

Query: 629 WI 630
           WI
Sbjct: 549 WI 550



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 4/81 (4%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQET 61
           NLS  +    EY TDL T+ A  +I +    KPL+L L+H+AAH+ N  K +E    +ET
Sbjct: 192 NLSVKYSH--EYMTDLITERAEDIILNHNHSKPLYLQLSHIAAHSSNINKTVEVRDEEET 249

Query: 62  INQFQYITDPNRRTYAALTKS 82
                YI D +RR  A +  +
Sbjct: 250 NATLGYIEDFDRRKLAGVVTA 270


>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 502

 Score =  400 bits (1029), Expect = e-109,   Method: Compositional matrix adjust.
 Identities = 226/540 (41%), Positives = 321/540 (59%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSNEIPTPNID LA NG+IL+N Y  P+CTPSR+++MTG+YPIHTGMQ   I
Sbjct: 31  GWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 90

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  P GV L E+FLP+YL++ GY T  +GKWHLGFF ++YTP YRGF+S++G   G   
Sbjct: 91  FGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTPTYRGFDSYYGSYLGKGD 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H  ++ YS      G D+  N +  +   G Y+T+++T EA+  I +    +PLFLY
Sbjct: 151 YWNHSNTETYS------GLDLHDNENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLY 204

Query: 272 LAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           LA+ A H+ N  +  L+APQE I++F YI    RR YAAM+  +D  VG V  AL +K M
Sbjct: 205 LAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAKKKM 264

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+NSIIIF +DNG P   +       NW +N+P RGVK TL+EGGV+  + ++S  I ++
Sbjct: 265 LDNSIIIFTTDNGGPANGFD-----YNWANNFPLRGVKATLFEGGVRGVSFVYSKLI-ES 318

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
           PRVS +++HI+DWLPTL   AGG  S      DG                  +DG DQW+
Sbjct: 319 PRVSHELIHITDWLPTLVNLAGGKVS------DGF-----------------LDGFDQWA 355

Query: 451 SLLLNTPSRRNSVLINIDEKK-RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           +L     S+RN VL+NIDEK  +  A+R+ SWK++   +E    DG+Y        P  +
Sbjct: 356 TLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKII---KEGNYWDGWY--------PPPS 404

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           FN            + S N F  +S           +T+ CG +  P+  + C +  C L
Sbjct: 405 FN------------EQSNNSFSYLS-----------STVKCGHD-IPIVINHC-DSYC-L 438

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           F++  DPCE N+++   P++ ++L   L  +R+++VP  +      ++DPK  N  W PW
Sbjct: 439 FHIDEDPCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMTIDPRSDPKLHNGVWQPW 498



 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDP 71
           G Y+T+++T EA+  I +    +PLFLYLA+ A H+ N  +  L+APQE I++F YI   
Sbjct: 177 GNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHE 236

Query: 72  NRRTYAAL 79
            RR YAA+
Sbjct: 237 QRRKYAAM 244


>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
          Length = 881

 Score =  397 bits (1020), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 234/543 (43%), Positives = 311/543 (57%), Gaps = 46/543 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G ND+ FHGSN+IPTPNIDALAY+GIILN  Y+ P+CTPSRA+LMTG++P++ GMQ   I
Sbjct: 45  GLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVI 104

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L E+ LP+Y RE GY T  IGKWHLGF+   Y P  RGF++H GYL   I 
Sbjct: 105 DSDEPWGLGLQEKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGYLGPYID 164

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           Y+ +I S   S T E  G+D+R+NL+  +   G YATD FT+ AV++I       + + L
Sbjct: 165 YWSYI-SKMDSATFE--GYDLRQNLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLL 221

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L HLA H GN    L+AP+ETI +F YI D +RRTYAAM+ KLDDSVG V  ALQ+ GM
Sbjct: 222 VLNHLAPHTGNDDAPLQAPEETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVYRALQQHGM 281

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSII+F+SDNG  T        + N GSNYP RG K++ WEG V+  A++WSP ++  
Sbjct: 282 LENSIIVFISDNGGVT-----RGMHSNSGSNYPLRGQKHSPWEGAVRTAALIWSPLLKDR 336

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN---IDGLD 447
            RVS Q  HISDWLPTL +AAG                       P+ R  N   IDG+D
Sbjct: 337 QRVSNQWFHISDWLPTLASAAG----------------------IPAYRGDNLSDIDGID 374

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
           QW +L   T + R  V+ N+DE     +   +  K V GT  +G  DG+YG        +
Sbjct: 375 QWEALAYGTGNPRQRVMNNLDEIFGYTSYVDNGLKYVNGTTLDGVNDGWYGA-------I 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
            + + +  +  Y      S    L   +   +   R+ A I CG +      +P    PC
Sbjct: 428 DDTDTLPSNAEYLRTVMASHVGRLTKPDEKVVSYLREHARIDCGNDIGNRDCNPLRR-PC 486

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELL-KYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            LF++ NDPCE +++    PD   Q+ E + KY  R   P++  +P    ADP +F   W
Sbjct: 487 -LFDIINDPCEMHDLFERYPDKVRQIQERVEKYRLRAAKPRN--RPHDPAADPSKFGGVW 543

Query: 627 SPW 629
           + W
Sbjct: 544 NWW 546



 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 55/92 (59%), Gaps = 1/92 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +R+NL+  +   G YATD FT+ AV++I       + + L L HLA H GN    L+AP+
Sbjct: 182 LRQNLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLLVLNHLAPHTGNDDAPLQAPE 241

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
           ETI +F YI D +RRTYAA+      ++  VY
Sbjct: 242 ETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVY 273


>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 559

 Score =  396 bits (1017), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 195/408 (47%), Positives = 264/408 (64%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS++IPTPNID LAY+G++L+N Y QP+CTP+RA+LMTG++PIH G+Q   I
Sbjct: 39  GWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAALMTGRHPIHLGLQDGVI 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             + P G+PL E  +P+YL+ LGY T  +GKWHLGFF  +YTPLYRGF++HFGY NG   
Sbjct: 99  VASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTPLYRGFDTHFGYYNGEEG 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH   +      +  G D R N        GEY+T+LFT  A ++I +   +KPLFLY
Sbjct: 159 YYDHTAEEP-----KYIGLDFRNNTELFKSAYGEYSTELFTSYAEKIIHNHNKNKPLFLY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H+GN+   LEAP +  ++F YI D  RRT+A MV  LDDSVG +  AL+  G+ 
Sbjct: 214 LAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFAGMVSALDDSVGNITRALKHSGLY 273

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           EN II+F +DNG P   +       N+ SN+P RG+K+TLWEGGV+    + SP +++  
Sbjct: 274 ENCIILFSTDNGGPAAGFDA-----NYASNWPLRGIKHTLWEGGVRGDGFIHSPLLEKPG 328

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R+S  M+H+ DWLPT+Y  AGG+ S LP N+DG + W +L     SR             
Sbjct: 329 RISTDMIHVCDWLPTIYHIAGGNVSSLPTNLDGFNVWDTL-----SR------------- 370

Query: 452 LLLNTPSRRNSVLINIDEKK--RTAAVRLDSWKLVLGTQENGTMDGYY 497
               TPS R  +L NID  K    AA+R+  +K++LGT+E+G  DG+Y
Sbjct: 371 ---GTPSPRTEILHNIDPHKNHNAAALRVGEFKIILGTEEHGEWDGWY 415



 Score = 73.9 bits (180), Expect = 3e-10,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 50/82 (60%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            R N        GEY+T+LFT  A ++I +   +KPLFLYLAH A H+GN+   LEAP +
Sbjct: 174 FRNNTELFKSAYGEYSTELFTSYAEKIIHNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYK 233

Query: 61  TINQFQYITDPNRRTYAALTKS 82
             ++F YI D  RRT+A +  +
Sbjct: 234 YTSRFPYIQDERRRTFAGMVSA 255


>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
 gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
          Length = 541

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 229/539 (42%), Positives = 309/539 (57%), Gaps = 59/539 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 38  GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVL 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+ L E+ LP+YL +LGY++   GKWHLG ++ +YTPL+RGF SH+G       
Sbjct: 98  YAAEPRGLSLKEKILPQYLNDLGYTSHIAGKWHLGHWKLKYTPLFRGFSSHWGL------ 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
                              DMR     A+D  G Y TD+ T  AV++I +      PLFL
Sbjct: 152 -------------------DMRNGTEVAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFL 192

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+ N    L  P   + +  +I    RR +AAMV K+D+SVG ++  L++  M
Sbjct: 193 YVAHAACHSSNPYNPLPVPDNEVMKLGHIPHYKRRKFAAMVTKMDESVGLIVDQLRKSNM 252

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++  
Sbjct: 253 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLIWSPLLKSR 307

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q +HISDWLPTL  AAGG  +                    S  +  IDG   W 
Sbjct: 308 QRVSTQTIHISDWLPTLLEAAGGAAAL-------------------SNLSKGIDGQSIWQ 348

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ + PS R  V+ NID+   +A + +  WKLV GT   G  DG+YG        L ++
Sbjct: 349 ALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYGKWDGWYGPAGERDPHLYDW 408

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANP-APMTPSPCTNGPCY 568
             I +S++ ++L  L     LP S  ++ R  R  AT+ C G +P      +   + PC 
Sbjct: 409 QQIPKSRSGKALNGLK---MLP-SQAEQQR-LRAAATVKCLGQSPQGSSCKATAFSAPC- 462

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           LFN+ +DPCEQ N+A   P +   L E LKY   T VP ++ +P   +A PK +N TW+
Sbjct: 463 LFNIHDDPCEQFNLAEQYPKVLKALMEELKYQNSTAVPPAN-KPGDRRASPKYWNYTWT 520



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 34/92 (36%), Positives = 46/92 (50%), Gaps = 2/92 (2%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T  AV++I +      PLFLY+AH A H+ N    L  P 
Sbjct: 153 MRNGTEVAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPD 212

Query: 60  ETINQFQYITDPNRRTYAAL-TKSTTLTLLIV 90
             + +  +I    RR +AA+ TK      LIV
Sbjct: 213 NEVMKLGHIPHYKRRKFAAMVTKMDESVGLIV 244


>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
          Length = 563

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 229/561 (40%), Positives = 310/561 (55%), Gaps = 85/561 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+SFHGS++IPTPN+  +A  GIIL+  Y Q  CTPSR +L+TG+YPI +GMQG P+
Sbjct: 50  GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 109

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +PL    +P + + LGY T  +GKWHLG   +E TPL +GF+SHFGY NG + 
Sbjct: 110 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 169

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+D++   +      + G D+       W + G YAT+LFT+ ++ +IE   V  PLFL 
Sbjct: 170 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 229

Query: 272 LAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           ++HLAAH G  G  L  P   +T ++F YI DP RR YA +V  LD S+G +++ L  K 
Sbjct: 230 VSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQ 289

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSI++F SDNGA TV       Y N GSN+P RGVK + +EGGV+V A ++SP   +
Sbjct: 290 MLDNSIVLFFSDNGAQTV-----GMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHK 344

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              VS  ++HISDWLPTLY+AAGGD + L   IDG+DQW +L         +N       
Sbjct: 345 KGYVSEHLIHISDWLPTLYSAAGGDVAHLG-QIDGIDQWDAL---------TN------- 387

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                N PS R  +LINIDE     A+  D +KL+      GT + YY QT       L 
Sbjct: 388 -----NNPSNRTEILINIDEVDENFAIIRDKFKLI-----QGTPNYYYQQT-------LL 430

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP-------- 561
           FN   +S +Y                      T  Q     G  P   TP+P        
Sbjct: 431 FNC--KSGSYH-------------------EGTFDQYYGDSGRGPENPTPNPNHTTTDLS 469

Query: 562 ------------CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
                       CT G C LF+L  DPCE  NI  S P+I++QLYE +    + LVPQ +
Sbjct: 470 WCRAPDQTPILNCTKG-C-LFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKELVPQRN 527

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
           +  D  ++DP  +N+TW  W+
Sbjct: 528 KDTD-PKSDPIFYNNTWCTWL 547



 Score = 71.2 bits (173), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 46/75 (61%), Gaps = 2/75 (2%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQF 65
            W + G YAT+LFT+ ++ +IE   V  PLFL ++HLAAH G  G  L  P   +T ++F
Sbjct: 197 VWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEF 256

Query: 66  QYITDPNRRTYAALT 80
            YI DP RR YA + 
Sbjct: 257 SYIQDPRRRLYAGVV 271


>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
 gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
          Length = 528

 Score =  395 bits (1015), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 233/556 (41%), Positives = 315/556 (56%), Gaps = 59/556 (10%)

Query: 76  YAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           YA         L    G+ND+ F GS +IPTPNIDALAY+G+ILN  Y  P+CTPSR++L
Sbjct: 16  YAIAQPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSAL 75

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           MT KYPIHTGMQ   ++ AEPRG+PL  + LP+YL +LGY++   GKWHLG ++R YTPL
Sbjct: 76  MTAKYPIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPL 135

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           YRGF SH+G                          DMR     A+D  G+Y TD+ T+ +
Sbjct: 136 YRGFSSHWGL-------------------------DMRNGTELAYDLHGQYTTDVITQHS 170

Query: 256 VQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           + +I      K PLFLY+AH A H+GN    L A  + + +   I D  RR YAA+V  +
Sbjct: 171 LNVIAKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAM 230

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           DDSVG ++  L++  ML+NSIIIF +DNG P   +       N+ SNYP RGVKNTLWEG
Sbjct: 231 DDSVGRIVEQLRKSHMLDNSIIIFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEG 285

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           GV+   ++WS Q+ + PRV+ Q MHI+DWLPTL  AAGG+ +   LN             
Sbjct: 286 GVRGAGLVWSAQLSKRPRVADQTMHIADWLPTLLEAAGGEAALAQLN------------- 332

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
                N+ +DG+  W +LL N  S R SVL NID+   +AA+ +  WKLV GT  NG+ D
Sbjct: 333 -----NAQLDGMSVWQALLKNEVSPRKSVLHNIDDIWGSAALSVGDWKLVKGTNYNGSWD 387

Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
           G+YG   +  +   ++  + +S   Q+LQQL     LP    D MR+ R+ A + C    
Sbjct: 388 GWYGPAGTRNLRDYDWYLVAKSLAGQALQQLK---VLP-KTADMMRA-REAANVICKDAR 442

Query: 555 APMTPSPCTN---GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
               P+   N    PC LFN+  DPCEQ N A   P +   + + L+    T VP ++ +
Sbjct: 443 LKDAPNISCNPLKAPC-LFNIREDPCEQRNQADVYPKVLELMLKELQLINSTAVPPAN-K 500

Query: 612 PDLVQADPKRFNDTWS 627
           PD     P+ +N TW+
Sbjct: 501 PDDPLGYPRFWNYTWT 516



 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 1/83 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G+Y TD+ T+ ++ +I      K PLFLY+AH A H+GN    L A  
Sbjct: 147 MRNGTELAYDLHGQYTTDVITQHSLNVIAKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKD 206

Query: 60  ETINQFQYITDPNRRTYAALTKS 82
           + + +   I D  RR YAAL  +
Sbjct: 207 DIVRRLGTIQDYKRRKYAALVTA 229


>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
 gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
          Length = 512

 Score =  395 bits (1014), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 218/563 (38%), Positives = 313/563 (55%), Gaps = 81/563 (14%)

Query: 76  YAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
           + ALT +    ++ +     GW+D+SFHGS +IPTPNID LA  G+ILNN Y  P+CTP+
Sbjct: 15  FVALTANKKPHIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSPICTPT 74

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           R+++MTGKYPIHTGMQ   I  A+P G+ L E  +P+YL+ LGY+T  +GKWHLGFF+ E
Sbjct: 75  RSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYE 134

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
           YTP+ RGF+S+FGY  G   Y+DH  +++Y   ++L  HD  +++ T W   G Y++DLF
Sbjct: 135 YTPIQRGFDSYFGYWCGKGDYWDHSNNEKYGWGLDL--HDSEQDVWTEW---GHYSSDLF 189

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
            ++AV +I       PLFLYL   A H+ N  + L+AP + I++F+ I D  RR +AAMV
Sbjct: 190 AEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMV 249

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D ++  V+ +L+ + M  NSII+F +DNG P   +       N  SN+P RGVK TL
Sbjct: 250 SSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDS-----NMASNFPLRGVKRTL 304

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGG++  A + SP I +  RV  ++MH+SDWLPTLYT AGGD   L             
Sbjct: 305 WEGGIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHDL------------- 351

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
                     N+DG D W S+  +  S R  ++ NID     AA R   WK+V+   +  
Sbjct: 352 ---------QNLDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYRFREWKIVV--NQTK 400

Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR---STRQQATI 548
            M G+Y                                  P+ NI++     +T + A +
Sbjct: 401 YMSGWY----------------------------------PLPNIEEREPHPATLRDAVV 426

Query: 549 HCGANPAPMTPSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
            CG  P P  P  CT  +GPC LFN+ NDPCE  N+A    +I + +   L+ +++ +VP
Sbjct: 427 KCG--PPPEIPVNCTASDGPC-LFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMVP 483

Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
             +   D   A+P  +   W+ W
Sbjct: 484 IRNTPLD-PSANPANYGGVWTAW 505



 Score = 63.5 bits (153), Expect = 3e-07,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
            W   G Y++DLF ++AV +I       PLFLYL   A H+ N  + L+AP + I++F+ 
Sbjct: 177 VWTEWGHYSSDLFAEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKN 236

Query: 68  ITDPNRRTYAALTKS 82
           I D  RR +AA+  S
Sbjct: 237 IKDERRRIFAAMVSS 251


>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
          Length = 557

 Score =  394 bits (1013), Expect = e-107,   Method: Compositional matrix adjust.
 Identities = 219/544 (40%), Positives = 313/544 (57%), Gaps = 49/544 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SF+G +EIPTPNID+LAYNG+ILN  Y  P+CTPSR +  TG+YPI +GMQG P+
Sbjct: 48  GWNDVSFNGGDEIPTPNIDSLAYNGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPL 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GAEPR +PL    LP+YLR+LGY+T  +GKWH+G+    +TP  RGF++  GY +G I 
Sbjct: 108 QGAEPRSIPLNNILLPQYLRKLGYATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIE 167

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           Y+ H L +        +G+D+ R++      +   +Y TDL T EA  +I      KPL+
Sbjct: 168 YFSHNLVENGQ-----SGYDIHRSVGDNHTIEYRYDYMTDLITDEAENIISSHNPAKPLY 222

Query: 270 LYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           L LAHLA HA      +E    + T +   YI D NRR +A+ V  +D+S+G V+ AL+R
Sbjct: 223 LQLAHLAPHASTVDDVIEVRSWKATNDTLGYIRDINRRKFASAVVTMDESIGRVVDALRR 282

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             ML+NSII+FMSDNGAPT    +     N+GSNYP RG+K + +EG V+  A ++SP I
Sbjct: 283 ADMLKNSIIVFMSDNGAPT----KDQILYNFGSNYPLRGMKQSFFEGAVRGVACIYSPLI 338

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
               RVS Q+ HI+DWLPTLY AAGGD S L                        +DG+D
Sbjct: 339 DFPSRVSTQLFHITDWLPTLYAAAGGDASDL----------------------KALDGID 376

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVP 506
           QWS +      RR  +L+NI+EK  + A  +  +KL+   ++      YYG T +     
Sbjct: 377 QWSMIKNADNGRRQFLLVNINEKTDSKAALIGRYKLI---RDQSEYTKYYGSTGKDPSYC 433

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
             N   ++ S    ++  +S++      +IDK++  R+++ + C  +      S CT   
Sbjct: 434 EYNVTNVLASPVASAIADISRSSL----DIDKIKQLREKSRVVCRVSRF----SRCTKRT 485

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LF++  DPCE  N++S  P I  +L  L+  ++  LV Q+++  D     P  FNDTW
Sbjct: 486 C-LFDIKKDPCETTNVSSKYPKILKKLNALIDSYQSILVNQTNKPVDPA-GFPCNFNDTW 543

Query: 627 SPWI 630
            PW+
Sbjct: 544 MPWL 547



 Score = 49.3 bits (116), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/67 (41%), Positives = 36/67 (53%), Gaps = 2/67 (2%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
           +Y TDL T EA  +I      KPL+L LAHLA HA      +E    + T +   YI D 
Sbjct: 198 DYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATNDTLGYIRDI 257

Query: 72  NRRTYAA 78
           NRR +A+
Sbjct: 258 NRRKFAS 264


>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
 gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
          Length = 478

 Score =  393 bits (1010), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 219/495 (44%), Positives = 290/495 (58%), Gaps = 58/495 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPLYRGF SH+G       
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                              DMR     A+D  G Y TD+ T  +V++I      K PLFL
Sbjct: 151 -------------------DMRNGTEVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFL 191

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+ N    L  P   + +  +I    RR +AAMV K+DDSVG ++  L++  M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMAHIPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q MHI DWLPTL  AAGG     P  +  L +               IDG   W 
Sbjct: 307 QRVSNQTMHIIDWLPTLLEAAGG-----PAAVANLSK--------------QIDGQSIWR 347

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+YG        L ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERNPRLYDW 407

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
             +  S+  ++L+ L     LP S  D+ R  R  AT+ C A  +  T    T  + PC 
Sbjct: 408 QMVARSRAGKALEALK---MLP-SQADQQR-IRAAATVSCPAQSSKGTSCLATAFSAPC- 461

Query: 569 LFNLGNDPCEQNNIA 583
           LF++ +DPCEQ N+A
Sbjct: 462 LFHIRDDPCEQYNLA 476



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T  +V++I      K PLFLY+AH A H+ N    L  P 
Sbjct: 152 MRNGTEVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211

Query: 60  ETINQFQYITDPNRRTYAALT 80
             + +  +I    RR +AA+ 
Sbjct: 212 NDVIKMAHIPHYKRRKFAAMV 232


>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
 gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
          Length = 498

 Score =  392 bits (1007), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 227/546 (41%), Positives = 313/546 (57%), Gaps = 73/546 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTPN+DALA +G+ILNN Y  P+ TPSRAS MTGKYPIH G+Q   +
Sbjct: 12  GWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKYPIHMGVQHDTL 71

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P GVPLTE+FLPE+LRE+GY T A+GKW LGFF +EYTP YRGF+S FG+      
Sbjct: 72  HNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFDSFFGFWTSHED 131

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY+H+ +D         G D+RRNL  + D  G Y T+L  +EA ++IE+   DKPLFLY
Sbjct: 132 YYNHVANDGGY------GIDLRRNLDVSNDHTGVYGTELLAREADEVIENHSGDKPLFLY 185

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H GN  + L+AP+  +++F+YITD  RRT+A MV  LD+S+  ++++L+RKG+ 
Sbjct: 186 LAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVTSLKRKGLY 245

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSIIIF +DNG        ++     GSN+P RG KNTLWEGGV+  A + SP I++  
Sbjct: 246 QNSIIIFTTDNGGAAGGLDMSA-----GSNFPLRGNKNTLWEGGVRGVAFVHSPLIKRPG 300

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV   +MH SDWLPTL+  AGG + +L   IDG D W  +   TPS R   + G+D    
Sbjct: 301 RVYDGLMHASDWLPTLHLLAGG-SPKLLGQIDGYDLWDGISKGTPSPRYEVLHGIDTM-- 357

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
                             +   AA+R+  +K++L        D Y               
Sbjct: 358 ------------------QGNRAALRVGDYKVILNQDLKFYSDWY--------------- 384

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTR--QQATIHCG-ANPAPM--TPSPCTN-- 564
                K  +   +L           D +R  R    AT+ C   +P P+  T +P  N  
Sbjct: 385 -----KRPEGEHEL-----------DYIRKPRLLSNATVDCTIKSPHPLLYTHAPLCNPQ 428

Query: 565 -GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
             PC LFN+  DPCE +N+A+  P+    L + LK++   + P  + + D + A+P R  
Sbjct: 429 KKPC-LFNIKWDPCEYHNLAAFMPNTLKVLLDRLKFYAEGMKPVIYPRADDL-ANPDRLG 486

Query: 624 DTWSPW 629
             W+PW
Sbjct: 487 GVWTPW 492



 Score = 88.6 bits (218), Expect = 9e-15,   Method: Compositional matrix adjust.
 Identities = 40/82 (48%), Positives = 56/82 (68%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +RRNL  + D  G Y T+L  +EA ++IE+   DKPLFLYLAH A H GN  + L+AP+ 
Sbjct: 146 LRRNLDVSNDHTGVYGTELLAREADEVIENHSGDKPLFLYLAHQAVHVGNMDEPLQAPKR 205

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            +++F+YITD  RRT+A +  S
Sbjct: 206 HVDKFKYITDERRRTFAGMVSS 227


>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
 gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
          Length = 657

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 220/538 (40%), Positives = 307/538 (57%), Gaps = 54/538 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALAY GIILN  Y  P+CTPSRA++MTG+ PI  GMQ   I
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTGRNPISVGMQHYVI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +PEY RE GY T  +GKWHLGFF ++YTP  RGF+SH  Y      
Sbjct: 102 DSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRGFDSHTNY------ 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
                            G+DMR NL+  +D  G YATD FT  A ++I+     +PLFL 
Sbjct: 156 ----------------TGYDMRHNLAVDYDANGTYATDHFTDAASRIIDKHNPSEPLFLM 199

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           + HLA H GN    L+AP+E I +F++I+D NRR YAAMV KLDDSVG +  +L+ K M+
Sbjct: 200 VNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSVGAIFKSLRAKNMI 259

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSII+F+SDNGAP+        + N GSN+P RG KN+ WEG  +  A +WSP +++  
Sbjct: 260 QNSIILFISDNGAPSFGL-----HSNSGSNFPLRGQKNSPWEGATRNVAAIWSPLLEERQ 314

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q +HISDWLPT  + AG    RLP                 +R +S +DG+DQW +
Sbjct: 315 RVSNQYIHISDWLPTFASIAG---IRLP----------------QARNSSEVDGVDQWEA 355

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L  +T   R  +L  IDE    ++   + +K V GT   G  DG+YGQ  +N+ P  +  
Sbjct: 356 LSYDTGYPRRVILNMIDEIFGYSSYMENGFKYVNGTTGGGNFDGWYGQL-NNEDPHPSDE 414

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
             +++     + QLS ++     + + ++  R+ + + C     P T +P     C LF+
Sbjct: 415 EYIKTVLETEIAQLSGDVL----SSELIKHLRKHSRVVCHKPLHPTTCNPLKRA-C-LFD 468

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           + NDPCE N+I++  P    +L   ++  RR L  +   +P     +P      W+ W
Sbjct: 469 IVNDPCELNDISAREPFKFRELRTKVETLRR-LAKKPRNKPHDPAGNPANHGGVWTWW 525



 Score = 81.3 bits (199), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 38/80 (47%), Positives = 52/80 (65%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR NL+  +D  G YATD FT  A ++I+     +PLFL + HLA H GN    L+AP+E
Sbjct: 160 MRHNLAVDYDANGTYATDHFTDAASRIIDKHNPSEPLFLMVNHLAPHTGNDNDPLQAPEE 219

Query: 61  TINQFQYITDPNRRTYAALT 80
            I +F++I+D NRR YAA+ 
Sbjct: 220 RIKKFEHISDENRRIYAAMV 239


>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
 gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
          Length = 579

 Score =  392 bits (1006), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 234/559 (41%), Positives = 311/559 (55%), Gaps = 46/559 (8%)

Query: 80  TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           TK   + +LI   G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TG
Sbjct: 26  TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           KYPIHTGMQ   I   EP G+P  ER +PE  R+ GYST  +GKWHLGF+R++ TP  RG
Sbjct: 86  KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145

Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F+ HFGY NG I YYDH   +L   YS      G D RR+L    +  G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRLLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
            ++IE     KPLF+ L+HLA H GN    ++AP+E + +F +I DP RRTYA MV  LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMVSSLD 260

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SV   I AL+  GML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++    LWSP +++   VS Q +H  DWLPTL  AAG                 SL  + 
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
           P      +DG++ W  L  N   +  +++  +DE    ++   D+ K V G+   G  D 
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFEGRYDQ 413

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
           + G+  +N+   L+  +  +      +Q L  N  L     D++R  R +AT  C    G
Sbjct: 414 WLGELETNEDDPLS-ESYEQHVLASDVQSLLGNRGL---TEDRIRQMRSEATETCPPVEG 469

Query: 552 ANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
            NP  P         PC+ F+L  DPCE+ N+A   P    QL + ++  R+T +P +  
Sbjct: 470 QNPLEPHFKCEPLKAPCF-FDLAKDPCERYNLAQIYPLQLQQLADEVEQIRKTAIPSARV 528

Query: 611 QPDLVQADPKRFNDTWSPW 629
                +A+P   N  W  W
Sbjct: 529 PHSDSRANPTFHNGNWEWW 547



 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     KPLF+ L+HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMVSS 258


>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
          Length = 525

 Score =  391 bits (1005), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 223/545 (40%), Positives = 318/545 (58%), Gaps = 64/545 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHG++EIPTPNIDALAYNG+ILN  Y  P+CTPSR + +TGKYPI TGMQG  +
Sbjct: 25  GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGKYPIRTGMQGYVL 84

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEPRG+PL +  LPEYLR+LGY+T  +GKWH+G+  + YTP  RGF++  GY NG I 
Sbjct: 85  QPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGFDTFLGYYNGYIH 144

Query: 212 YYDHILSDQYSRTVELNGHDM---RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           Y++H + D+  + +  + H +    R +   +D    Y TD+ T E   +I      KPL
Sbjct: 145 YFNHTILDEEQKYLGYDFHRIVGENRTIEYRYD----YITDIITDEVENIIFSHNPAKPL 200

Query: 269 FLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           +L ++H AAH+G  G  ++    +ET     YI D NRR YA++V  LD+SVG +I AL+
Sbjct: 201 YLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVVATLDESVGRIIDALR 260

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           +  ML+NSII+F+SDNGA T  + +     N+GSNYP RG+K +L+EGGV+  A ++SP 
Sbjct: 261 KTDMLKNSIIVFISDNGAQTEGFLQ-----NYGSNYPLRGLKFSLFEGGVRGAAYIYSPL 315

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           I +  RVS Q+ HI+DWLPTLY+AAGG++S L   +DG DQWS++         S  D  
Sbjct: 316 IDRLSRVSTQLFHITDWLPTLYSAAGGNSSDLK-QLDGFDQWSAI--------KSAED-- 364

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV- 505
                      S+R S+LINIDE+  +   +                  YY  + +N + 
Sbjct: 365 -----------SKRKSILINIDERDTSEYQK------------------YYNYSGNNALY 395

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
           P  N   I+ S    ++  +S ++     N +K+   R++ATI C      M  S CTN 
Sbjct: 396 PKYNAINILASPVASAIASISTSML----NANKIMQLRKEATIVC---KNFMDFSNCTNR 448

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
            C LF++  DPCE  ++++  P    +L   +  ++  LV Q +   D        FN+T
Sbjct: 449 SC-LFDVKEDPCETTDLSAKYPKEVERLNMFIDRYKSVLVKQPNTPIDPA-GYAYHFNNT 506

Query: 626 WSPWI 630
           W PW+
Sbjct: 507 WIPWL 511



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 2/71 (2%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
           +Y TD+ T E   +I      KPL+L ++H AAH+G  G  ++    +ET     YI D 
Sbjct: 177 DYITDIITDEVENIIFSHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDI 236

Query: 72  NRRTYAALTKS 82
           NRR YA++  +
Sbjct: 237 NRRKYASVVAT 247


>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
          Length = 513

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 224/544 (41%), Positives = 316/544 (58%), Gaps = 50/544 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G ND+ F+GS +IPTP+IDALAYNGIIL+  Y Q  CTPSRA+L+TG+YPI  GMQG PI
Sbjct: 6   GRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGMQGLPI 65

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E + +PL    +P+YL+ LGY T  +GKWHLG+   E TPL RGF+SHFGY NG + 
Sbjct: 66  RAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYWNGFVG 125

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+++    + +    + G D+   +  AW   G+YATDLFT +A+++I++   +KPLFL 
Sbjct: 126 YFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEKPLFLV 185

Query: 272 LAHLAAHAGNAGKHLEAPQ--ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LAHLA H G  G  L  P   +   +F +I +P RR YA +V +LDDSVG ++  L  K 
Sbjct: 186 LAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIVEKLDEKN 245

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           +L+NSII+F+SDNGA T+       Y N GSN+P RG+K T +EGG++ PA ++SP   Q
Sbjct: 246 LLDNSIILFLSDNGAQTI-----GVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLFHQ 300

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              VS +++HI+D+LPT Y AAGG+   L   IDG++QW  L     S++          
Sbjct: 301 KGYVSKELIHITDFLPTFYAAAGGNLEYLG-KIDGVNQWEVL-----SQQK--------- 345

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
                  P +R+ +L+N +E    +A+ +  +KLV G         Y+G + R+ + P  
Sbjct: 346 -------PGKRDEILVNFNEIDNVSAIIMGQYKLVQGRLAFPEYGDYHGDSGRNPEDPPY 398

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH-C-GANPAPMTPSPCTNGP 566
           NF A+           LS+NI     N +K+ S R Q  +  C G    P+    CT G 
Sbjct: 399 NFTAV-----------LSKNI--NQQNRNKIESLRNQTDLSWCRGEKNQPLI--DCTEG- 442

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
           C LFNL +DPCE  NI      I+ +L   L    + LVPQ     D  +++P+ +N TW
Sbjct: 443 C-LFNLESDPCETTNIIQEEKIIAGKLLGRLFEFSQILVPQQVAIFD-PESNPEFYNGTW 500

Query: 627 SPWI 630
             W+
Sbjct: 501 CTWL 504



 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 34/75 (45%), Positives = 47/75 (62%), Gaps = 2/75 (2%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ--ETINQF 65
           AW   G+YATDLFT +A+++I++   +KPLFL LAHLA H G  G  L  P   +   +F
Sbjct: 153 AWQERGKYATDLFTHKAMKIIDEHNSEKPLFLVLAHLAGHTGEDGVELGVPDVAQAETRF 212

Query: 66  QYITDPNRRTYAALT 80
            +I +P RR YA + 
Sbjct: 213 SFIKNPKRRLYADIV 227


>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
          Length = 526

 Score =  390 bits (1003), Expect = e-106,   Method: Compositional matrix adjust.
 Identities = 225/547 (41%), Positives = 306/547 (55%), Gaps = 78/547 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS++IPTPNID LAY+G++L+N Y QPVCTP+R +L+TG+YP+H G+Q   I
Sbjct: 38  GWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTGRYPMHLGLQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL E  LP YL++LGYST  +GKWHLGFF +EYTP YRGF+SH+GY  G   
Sbjct: 98  TPNEPVGLPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRGFDSHYGYFLGHQD 157

Query: 212 YYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           YY H  + ++Q+       G D+R N+     T GEY  +LFT +A +LI D    KPLF
Sbjct: 158 YYTHNALWNNQW-------GFDLRHNMDLQRSTFGEYGPELFTTQAEKLIYDHDHKKPLF 210

Query: 270 LYLAHLAAHAGNAGKH---LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           LY AH A H GN+G +   LEAP +  ++F +I D  RR YA MV  LDDSVG +  AL+
Sbjct: 211 LYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADHQRRIYAGMVSALDDSVGNITRALK 270

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           + G+  NSII+F +DNG P     E   +     N P RG KNTLWEGG++  A + S  
Sbjct: 271 KSGLYNNSIIVFSTDNGGP----HECEAF-----NTPLRGTKNTLWEGGIRGAAFIHSVL 321

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           +++  RV   MMH+ DWLPT+Y  AGG+ S L                       N+DG 
Sbjct: 322 LEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDL----------------------HNLDGF 359

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           + W +L     S R  +L NID   +TAA+R+ + K+++G                NK  
Sbjct: 360 NMWDTLSSGVSSPRTEILHNIDPITQTAALRVGNLKVIVG---------------ENKSD 404

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC--TN 564
                 I   K   +L +             +  S  +   +HC + P P+    C    
Sbjct: 405 WFVPPQITFGKNGTNLTK-------------QFGSDHRGIIVHCSSVP-PVAYKNCDPKQ 450

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDI-SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
            PC L+N+ NDPCE  N+A   PD+  S LY+L  Y+  ++ P     P   +++P   N
Sbjct: 451 KPC-LYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNSTSVTPWV--IPPDPRSNPSLHN 507

Query: 624 DTWSPWI 630
           DTWSPWI
Sbjct: 508 DTWSPWI 514



 Score = 72.8 bits (177), Expect = 6e-10,   Method: Compositional matrix adjust.
 Identities = 37/85 (43%), Positives = 50/85 (58%), Gaps = 3/85 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH---LEA 57
           +R N+     T GEY  +LFT +A +LI D    KPLFLY AH A H GN+G +   LEA
Sbjct: 173 LRHNMDLQRSTFGEYGPELFTTQAEKLIYDHDHKKPLFLYFAHQAVHYGNSGPNGTLLEA 232

Query: 58  PQETINQFQYITDPNRRTYAALTKS 82
           P +  ++F +I D  RR YA +  +
Sbjct: 233 PYKYTSRFPHIADHQRRIYAGMVSA 257


>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
 gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
          Length = 579

 Score =  390 bits (1001), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 234/560 (41%), Positives = 311/560 (55%), Gaps = 48/560 (8%)

Query: 80  TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           TK   + +LI   G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TG
Sbjct: 26  TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           KYPIHTGMQ   I   EP G+P  ER +PE  R+ GYST  +GKWHLGF+R++ TP  RG
Sbjct: 86  KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145

Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F+ HFGY NG I YYDH   +L   YS      G D RR+L    +  G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRMLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
            ++IE     KPLF+ L+HLA H GN    ++AP+E + +F +I DP RRTYA M+  LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLD 260

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SV   I AL+  GML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++    LWSP +++   VS Q +H  DWLPTL  AAG                 SL  + 
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG----------------VSLPQDL 359

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
           P      +DG++ W  L  N   +  +++  +DE    ++   D+ K V G+   G  D 
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQ 413

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
           + G+  +N+   L   +  +      +Q L  N  L     D++R  R +AT  C    G
Sbjct: 414 WLGELETNEDDPLG-ESYEQHVLASDVQSLLGNRGL---TKDRIRQMRSEATETCPPIEG 469

Query: 552 ANPAPMTPSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
            NP   +   C     PC+ F+L  DPCE+ N+A   P    QL + L+  R+T +P + 
Sbjct: 470 QNPLE-SHFKCEPLKAPCF-FDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSAR 527

Query: 610 EQPDLVQADPKRFNDTWSPW 629
                 +A+P   N  W  W
Sbjct: 528 VPHSDSRANPTFHNGNWEWW 547



 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     KPLF+ L+HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258


>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
 gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
          Length = 550

 Score =  389 bits (1000), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 229/567 (40%), Positives = 314/567 (55%), Gaps = 56/567 (9%)

Query: 78  ALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           A T ST   ++I+     G+ND+SFHGSN+I TPNIDALAYNG++LN +Y   +CTPSRA
Sbjct: 20  AETPSTKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRA 79

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           +L+TGKYPIHTGMQ   I   EP G+P  ER +PE+ R+ GYST+ IGKWHLGF+ ++ T
Sbjct: 80  TLLTGKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLT 139

Query: 194 PLYRGFESHFGYLNGVISYYD---HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
           P  RGF+ H+GY NG I YYD   H+L+  Y++ V     D RR+L       G YATD 
Sbjct: 140 PTMRGFDHHYGYYNGYIDYYDHTLHMLTKNYTKGV-----DFRRDLDPCPQDNGTYATDA 194

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           FT EA ++IE     +PLF+ L+HLA H GN    ++AP+E + +F +ITDP RRTYA M
Sbjct: 195 FTAEAKRVIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGM 254

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           +  LD SVG  + AL   GML NSII+  SDNGAPTV       + N GSNYP+RG K +
Sbjct: 255 ISSLDKSVGQTVKALSDNGMLNNSIILLYSDNGAPTVGI-----HSNQGSNYPFRGQKES 309

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            WEGG++    +WSP +QQ   VS Q++H  DWLPTL  AA      LP N+        
Sbjct: 310 PWEGGIRSAGAIWSPLLQQRGYVSNQIIHAIDWLPTLAAAA---NVELPENL-------- 358

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
                       +DG++ W SL  N   +   +L  +D+    ++   D+ K V G+   
Sbjct: 359 -----------KLDGVNMWPSLSGNIEPQPRRLLHVMDDVFGYSSYMRDNLKYVNGSSFE 407

Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNI----DKMRSTRQQA 546
           G  D + G   S +   L+         Y + Q LS  +   I +     +++   R++A
Sbjct: 408 GQYDQWLGDLESGQDDPLS--------PYYAQQVLSSEVHAVIGSQSVTNERIEQMREEA 459

Query: 547 TIHCGANP----APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRR 602
           T  C  N       +        PC+ FNL  DPCE+ N+A+  P     L + ++  R+
Sbjct: 460 THRCPLNEQDHLQEIYLCEPLKAPCF-FNLAKDPCERFNLANLYPLQLQFLSQEVEQFRQ 518

Query: 603 TLVPQSHEQPDLVQADPKRFNDTWSPW 629
             VP +       +A+PK     W  W
Sbjct: 519 NAVPSARVPISDERANPKLHGGYWEWW 545



 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 38/82 (46%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L       G YATD FT EA ++IE     +PLF+ L+HLA H GN    ++AP+E
Sbjct: 176 FRRDLDPCPQDNGTYATDAFTAEAKRVIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEE 235

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +ITDP RRTYA +  S
Sbjct: 236 EVAKFAHITDPKRRTYAGMISS 257


>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
 gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
          Length = 577

 Score =  389 bits (999), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 226/554 (40%), Positives = 311/554 (56%), Gaps = 45/554 (8%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
            + L+   G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TGKYPIH
Sbjct: 31  VIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIH 90

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TGMQ   I   EP G+P  ER +PE  R+ GYST  +GKWHLGF+R++ TP  RGF+ HF
Sbjct: 91  TGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHF 150

Query: 204 GYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           GY NG I YYDH   +L   YS     +G D RR+L    +  G YAT+ FT EA ++IE
Sbjct: 151 GYYNGYIDYYDHQVRMLDRNYS-----HGLDFRRDLEPCPEANGTYATEAFTSEAKRIIE 205

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
                KPLF+ ++HLA H GN    ++AP+E + +F +I DP RRTYA M+  LD SV  
Sbjct: 206 QHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQ 265

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
            I AL+   ML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG++   
Sbjct: 266 TIGALKDNDMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGGIRSAG 320

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
            LWSP +++   VS Q +H  DWLPTL  AAG                 SL  + P    
Sbjct: 321 ALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDLP---- 360

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
             +DG++ W  L  N   +  +++  +DE    ++   D+ K + G+   G  D + G+ 
Sbjct: 361 --LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYINGSSFQGRYDHWLGEL 418

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----GANP-A 555
           ++N+   L+  +  +      +Q +  N +L     D++R  R +AT  C    G NP  
Sbjct: 419 KTNEDHPLS-ESYEQHVLASDVQSILGNRWL---TEDRIRQMRSEATEMCPPVEGLNPLE 474

Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
           P         PC+ F+L  DPCE+ N+A + P    QL + ++  R+T +P +       
Sbjct: 475 PQFKCEPLKAPCF-FDLAKDPCERYNLAETYPLQLQQLADEVEQIRKTAIPSARVPYSDS 533

Query: 616 QADPKRFNDTWSPW 629
           +A+P   N  W  W
Sbjct: 534 RANPTFHNGNWEWW 547



 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     KPLF+ ++HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258


>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 594

 Score =  388 bits (997), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 228/548 (41%), Positives = 310/548 (56%), Gaps = 57/548 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SF GS +IPTPN+DALA  G+IL N Y QP CTPSRA+L+TG YPIH+GMQ   I
Sbjct: 94  GWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPIHSGMQHYVI 153

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL  + LP++L++LGY T  IGKWHLG F++E+ P  RGF+SH GY NG I 
Sbjct: 154 RSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSHLGYYNGYID 213

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  +  Y R   L+         +      EYAT LFT  A ++I D  VD PLFLY
Sbjct: 214 YFTH--NHTYKRDSALDFFKDEVPYHSE-----EYATRLFTDRAEEIIRDHDVDNPLFLY 266

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            AHLA H        +APQETI++F Y+ D NR T+AAM+ +LD SVG V+ AL +KG+L
Sbjct: 267 FAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVEALAKKGIL 326

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+II+F SDNG      + T+   N G+N+P RG K TL+EGG +VPA +WSP I++  
Sbjct: 327 DNTIILFSSDNGG-----QATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIRRPR 381

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RV   M+HI DWLPT+Y+ AGGD    P N+                    IDG++ W +
Sbjct: 382 RVFYDMVHIVDWLPTIYSLAGGD----PRNL------------------GRIDGMNVWGA 419

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           +     S R  +L NID  +R  A+R+  +KL+ G   N T      +   N+      +
Sbjct: 420 VSDGMASPRTEILYNIDPVQRDLAIRVGRYKLLYGPMYNLT------EWYDNRGFGYASH 473

Query: 512 AIVE-----SKTYQSLQQLSQNIFLPISNIDKMRST----RQQATIHCGANPAPMTPSPC 562
           +++E     S   + L+ L    +      D  R      R++  +HCG NP+  T    
Sbjct: 474 SVLERYMDKSAVARVLRDLG--YWWKGREEDYSRRPTWQWREETRVHCG-NPSSATACNP 530

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
           +  PC LF+L  DPCE NN+      ++  L   L+ H R+  P ++ +PD     P   
Sbjct: 531 SEQPC-LFDLATDPCEYNNLYGQLTQVARYLLSRLEAHARSARPINN-KPDDYSFQP--- 585

Query: 623 NDTWSPWI 630
           N TW PWI
Sbjct: 586 NATWGPWI 593



 Score = 77.8 bits (190), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 18/109 (16%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EYAT LFT  A ++I D  VD PLFLY AHLA H        +APQETI++F Y+ D NR
Sbjct: 240 EYATRLFTDRAEEIIRDHDVDNPLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNR 299

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
            T+AA+     +++  V                  ++ALA  GI+ N +
Sbjct: 300 TTFAAMLAELDVSVGRV------------------VEALAKKGILDNTI 330


>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
 gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
          Length = 485

 Score =  387 bits (994), Expect = e-105,   Method: Compositional matrix adjust.
 Identities = 193/408 (47%), Positives = 261/408 (63%), Gaps = 30/408 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPN+D+LAY+G+IL N Y  P+CTP+R+++MTG++PIHTG+Q   I
Sbjct: 3   GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GA P G+PL E  LP+YL+ LGY+T  +GKWHLG    E+TP +RGF+S+FGYL G  +
Sbjct: 63  SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122

Query: 212 YYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           YYDH   D+ +   EL   G D+R      W   G Y+T+LF  EA ++I      KPLF
Sbjct: 123 YYDHT-DDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLF 181

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL H A H+GN    L+APQ+ I++F +I  P RRT+AAMV  LDD+VG V  AL  +G
Sbjct: 182 LYLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARG 241

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENS+IIF +DNG P   + +     N+ SN+P RGVKNTLWEGGV     + SP I+Q
Sbjct: 242 MLENSVIIFTTDNGGPAAGFDQ-----NYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQ 296

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R + +++H+ D LPT+Y  AGGD++ L                       N+DG + W
Sbjct: 297 PKRTTHELLHVCDLLPTIYELAGGDSTEL----------------------KNLDGTNVW 334

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
            ++     S R  VL NID K++TAA+R   +K++LG    G  DG+Y
Sbjct: 335 ETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKIILGEAYKGAWDGWY 382



 Score = 76.3 bits (186), Expect = 5e-11,   Method: Compositional matrix adjust.
 Identities = 35/82 (42%), Positives = 49/82 (59%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R      W   G Y+T+LF  EA ++I      KPLFLYL H A H+GN    L+APQ+
Sbjct: 144 LRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFLYLPHQAVHSGNPDNPLQAPQK 203

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            I++F +I  P RRT+AA+  +
Sbjct: 204 YIDKFPHIQHPGRRTFAAMVSA 225



 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 1/56 (1%)

Query: 575 DPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           DPCE NNIA     + + L   L+ +  T VP    QP   + +PK     WSPWI
Sbjct: 399 DPCEFNNIADWNKPLVNFLMGRLEAYSETAVP-IRNQPTDPRGNPKYHGGVWSPWI 453


>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
 gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
          Length = 579

 Score =  387 bits (994), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 232/559 (41%), Positives = 311/559 (55%), Gaps = 46/559 (8%)

Query: 80  TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           TK   + +LI   G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TG
Sbjct: 26  TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           KYPI+TGMQ   I   EP G+P  ER +PE  R+ GYST  +GKWHLGF+R++ TP  RG
Sbjct: 86  KYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145

Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F+ HFGY NG I YYDH   +L   YS      G D RR+L    +  G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRLLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
            ++IE     KPLF+ L+HLA H GN    ++AP+E + +F +I DP RRTYA MV  LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMVSSLD 260

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SV   I AL+  GML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++    LWSP +++   VS Q +H  DWLPTL  AAG                 SL  + 
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
           P      +DG++ W  L  N   +  +++  +DE    ++   D+ K V G+   G  D 
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFEGRYDQ 413

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
           + G+  +N+   L+  +  +      +Q L  N  L     D++R  R +AT  C    G
Sbjct: 414 WLGELETNEDDPLS-ESYEKHVLASDVQSLLGNRGL---TEDRIRQMRSEATETCPPVEG 469

Query: 552 ANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
            NP  P         PC+ F+L  DPCE+ N+A   P    +L + ++  R+T +P +  
Sbjct: 470 QNPLEPHFKCEPLKAPCF-FDLAKDPCERYNLAQIYPLQLQELADEVEQIRKTAIPSARV 528

Query: 611 QPDLVQADPKRFNDTWSPW 629
                +A+P   N  W  W
Sbjct: 529 PHSDSRANPTFHNANWEWW 547



 Score = 80.9 bits (198), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     KPLF+ L+HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMVSS 258


>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
 gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
          Length = 578

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 230/560 (41%), Positives = 308/560 (55%), Gaps = 48/560 (8%)

Query: 80  TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           TK   + +LI   G ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+TG
Sbjct: 26  TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           KYPIHTGMQ   I   EP G+P  ER +PE  RE GYST  +GKWHLGF+ ++ TP  RG
Sbjct: 86  KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPTRRG 145

Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F+ HFGY NG I YYDH   +L   YS      G D RR+L    +  G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRMLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTAEA 200

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
            ++IE     KPLF+ ++HLA H GN    ++AP+E + +F +I DP RRTYA M+  LD
Sbjct: 201 KRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLD 260

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            SV   + AL+  GML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTVGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           ++    LWSP +++   VS Q +H  DWLPTL  AAG                 SL  + 
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
           P      +DG++ W  L  N   +  +++  +DE    ++   D+ K V G+   G  D 
Sbjct: 360 P------LDGINLWPLLSGNEQPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFQGRYDQ 413

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPIS-NIDKMRSTRQQATIHC---- 550
           + G+  ++       + + ES     L    Q+I        D++R  R +AT  C    
Sbjct: 414 WLGELETSAD-----DPLSESYEQHVLASDVQSILGNRGLTEDRIRQMRSEATELCPPVE 468

Query: 551 GANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           G NP  P         PC+ F+L NDPCE+ N+A + P    QL + +   R+T +  + 
Sbjct: 469 GQNPLEPHFKCEPLKAPCF-FDLANDPCERYNLARTYPLQLQQLADEVVLIRKTAISSAR 527

Query: 610 EQPDLVQADPKRFNDTWSPW 629
                 +A+P   N  W  W
Sbjct: 528 VAHSDSRANPAYHNGNWEWW 547



 Score = 79.7 bits (195), Expect = 4e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     KPLF+ ++HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTAEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258


>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
 gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
          Length = 634

 Score =  384 bits (987), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 231/575 (40%), Positives = 328/575 (57%), Gaps = 53/575 (9%)

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIIL 119
            T+    Y  +P +R +  L  +  +      GWND+ FHGSN+I TP+IDALAY+G+IL
Sbjct: 19  RTLAVAAYGQNPPKRPHIVLITADDM------GWNDVGFHGSNQIATPHIDALAYDGVIL 72

Query: 120 NNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKA 179
           N  Y+ P+CTPSRA+LMTG++PI+ GMQ   I   EP G+ L +R +P+Y R  GY T  
Sbjct: 73  NRHYSAPMCTPSRAALMTGRHPINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHM 132

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           IGKWHLGFF   Y P  RGF++H GYL   + Y+ ++ S   S T E  G+DMR+N    
Sbjct: 133 IGKWHLGFFTEHYIPTNRGFDTHIGYLGPYVDYWSYV-SKMNSGTFE--GYDMRQNQFVN 189

Query: 240 WDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
           +   G YATD FT  A  +I       +P+ L + HLA HAGN    L+APQETI++F Y
Sbjct: 190 YAANGTYATDYFTSAARDIIAQHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAY 249

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           I + +RRTYAAM+ KLDDSVG V  +L+  GML+NSII+F+SDNG  T        + N 
Sbjct: 250 IGNRDRRTYAAMMSKLDDSVGAVYESLRANGMLDNSIIVFLSDNGGVT-----RGMHSNT 304

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG---GDT 415
           GSNYP+RG K++ WEG V+  A++WSP ++   RVS Q  HISDWLPTL +AAG   G++
Sbjct: 305 GSNYPFRGQKHSPWEGAVRTAALIWSPLLKDTQRVSNQWFHISDWLPTLASAAGIALGNS 364

Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
           S    +IDG+DQW +L   T                     P +R  ++ N+DE     +
Sbjct: 365 SF--SDIDGIDQWEALAYGT-------------------GNPRQR--LMNNLDEIFGYTS 401

Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
              + +K + GT  NG  D +YG+T     PL   + + + + Y S    ++       +
Sbjct: 402 YMENGFKYINGTTLNGVNDAWYGETD----PL---DQLPDDEQYLSSVMATRIARAGRLD 454

Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
              +   R+ AT++CG + A    +P    PC LF++ NDPCE ++I+   P     +  
Sbjct: 455 EKLISYLRRHATVNCGNSIATKPCNPLVK-PC-LFDIINDPCEMHDISDEYPKRLRGIRN 512

Query: 596 LLKYHR-RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
            L+ +R R   P++  +P   +ADP +F   W+ W
Sbjct: 513 RLELYRLRAAKPRN--RPHDPEADPSKFGGVWNWW 545



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 39/92 (42%), Positives = 53/92 (57%), Gaps = 1/92 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR+N    +   G YATD FT  A  +I       +P+ L + HLA HAGN    L+APQ
Sbjct: 182 MRQNQFVNYAANGTYATDYFTSAARDIIAQHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQ 241

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
           ETI++F YI + +RRTYAA+      ++  VY
Sbjct: 242 ETIDRFAYIGNRDRRTYAAMMSKLDDSVGAVY 273


>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 514

 Score =  384 bits (986), Expect = e-104,   Method: Compositional matrix adjust.
 Identities = 195/399 (48%), Positives = 253/399 (63%), Gaps = 33/399 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGSN+IPTPNIDALA+ GIILNN Y  PVCTPSR++L+TGKYPIHTG+Q   I
Sbjct: 42  GWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKYPIHTGLQHGVI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G+ P G+ L E+ LPEYLR L Y T+ +GKWHLG F+++YTP YRGF+SH+GY  G   
Sbjct: 102 HGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFDSHYGYWTGHQD 161

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH   +         G+DMRR ++      G Y TDLFT EAV++I+    +KPLFLY
Sbjct: 162 YYDHTAIENPG----FWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDSNKPLFLY 217

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAHLA H+GN    L+AP ET+ +F YI D NRR +A M+ KLD+SVG V+ AL    M+
Sbjct: 218 LAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVVEALADSNMI 277

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            N +I+F +DNG P   +       N  SN+P RGVK+TLWEGGV+    +WSP +  + 
Sbjct: 278 NNCVILFSTDNGGPAGGFN-----LNAASNWPLRGVKDTLWEGGVRGVGFIWSPFLPSS- 331

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           +VS  M+HI+DWLPTL                       L L   S   S+IDG++ W +
Sbjct: 332 KVSNAMIHITDWLPTL-----------------------LSLTNASNSISDIDGINVWPN 368

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
           L    PS R  +L+NID +++ AA+R  +WK   G   N
Sbjct: 369 LSKGLPSVRKEILLNIDTERKIAALRYKNWKYKKGNANN 407



 Score = 89.0 bits (219), Expect = 7e-15,   Method: Compositional matrix adjust.
 Identities = 41/79 (51%), Positives = 53/79 (67%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR ++      G Y TDLFT EAV++I+    +KPLFLYLAHLA H+GN    L+AP E
Sbjct: 178 MRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDSNKPLFLYLAHLATHSGNKYSPLQAPAE 237

Query: 61  TINQFQYITDPNRRTYAAL 79
           T+ +F YI D NRR +A +
Sbjct: 238 TVAKFNYIKDKNRRLFAGM 256


>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
          Length = 561

 Score =  384 bits (985), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 219/560 (39%), Positives = 310/560 (55%), Gaps = 58/560 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS ++ TPN+DALAY+G+IL N Y QP+CTPSRA+LMTG++PIHTGMQ   I
Sbjct: 37  GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNVI 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEP G+ L E   P+YL+++GY T  +GKWHLGFF  +YTP+ RGF+SH+GY  G   
Sbjct: 97  YSAEPYGLGLNEIIFPQYLKQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMGAED 156

Query: 212 YYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           Y+ HI  + +  +   + G D RRN        G+Y+T+LFT EA  +I      +PLF+
Sbjct: 157 YWVHIAGNAHEVSFNASWGLDFRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFM 216

Query: 271 YLAHLAAHAGN---AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           Y+A  A H+ N       LEAP +   +F +I +  RR +AAMV  LDDSVG +  AL  
Sbjct: 217 YVAQQAVHSANPYTGDAELEAPFKYYEKFPHIKNEKRRKFAAMVSALDDSVGNITKALHA 276

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           KG+LEN+II F +DNG P   +       N  SN+P RGVK +LWEGGV+    +WSP +
Sbjct: 277 KGILENTIICFSTDNGGPASGFD-----FNDASNWPLRGVKESLWEGGVRGTGFVWSPLM 331

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           + +   S  MM I D+LPTL  AAG D S+L  ++DG+DQW  +     S R   +    
Sbjct: 332 KTSGYTSPAMMQIFDFLPTLLKAAGYDMSKLSPDLDGIDQWEII-----SERKDAV---- 382

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT--------MDGYYG- 498
                       R  +L NID  + T A+R++  K+V+ +Q   T         D +YG 
Sbjct: 383 ------------RTEMLHNIDNTEGTRALRINEMKVVV-SQGGSTRSWPLWYSTDQFYGG 429

Query: 499 ------QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
                    ++     + ++ V  +    L  + +N+  P+ +   ++       IHCG 
Sbjct: 430 YNHTDHSVSADLSAKWDIDSEVAPRVNSDLTSVLRNLGRPLRSGTPLK-------IHCGP 482

Query: 553 NPAPMTPSPCTNG--PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
            P P     C  G  PC LFN+  DPCE +N+A  RPD+ S + + L  ++ T V   ++
Sbjct: 483 VP-PSARGSCDVGKKPC-LFNITADPCEFHNLADERPDMLSTMLDRLDQYQSTAVAPRNK 540

Query: 611 QPDLVQADPKRFNDTWSPWI 630
             D     P +    W PWI
Sbjct: 541 PVD-PNGLPSKHGGIWEPWI 559



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 3/85 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN---AGKHLEA 57
            RRN        G+Y+T+LFT EA  +I      +PLF+Y+A  A H+ N       LEA
Sbjct: 178 FRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAELEA 237

Query: 58  PQETINQFQYITDPNRRTYAALTKS 82
           P +   +F +I +  RR +AA+  +
Sbjct: 238 PFKYYEKFPHIKNEKRRKFAAMVSA 262


>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
           purpuratus]
          Length = 514

 Score =  383 bits (983), Expect = e-103,   Method: Compositional matrix adjust.
 Identities = 215/546 (39%), Positives = 306/546 (56%), Gaps = 72/546 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+S HGS++IPTPNID LA +G+ L N Y  P+CTPSR+++MTG++PIHTG+Q   I
Sbjct: 22  GWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGRHPIHTGLQFGVI 81

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+ L E+ + +YL+ LGYST A+GKWHLG+F +EYTP +RGF+S FG+ NG   
Sbjct: 82  SPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGFDSFFGFYNGRGD 141

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H          E++G+D+ +N        G+Y+TD+F +EA Q+I+     +PLFLY
Sbjct: 142 YYTH-----EEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNASQPLFLY 196

Query: 272 LAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           LAH A HAG    + L+AP +   +F +I    RR YAAMV  LDDSVG++   L+  G+
Sbjct: 197 LAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGSISQTLRDAGL 256

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSII+F +DNG  T ++ + ++  NW    P RG K+TLWEGGV+  A + SP I++ 
Sbjct: 257 YDNSIIVFTTDNGGATYDFFDGTHASNW----PLRGGKHTLWEGGVRGTAFVNSPLIKKP 312

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            R S QMMH+ DWLPTL++ AGGD   L                       N DG DQW 
Sbjct: 313 RRFSDQMMHVCDWLPTLHSVAGGDPKEL----------------------KNSDGFDQWD 350

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L  N PS RN +L NI+  ++ AA+R+  +KL  G                       F
Sbjct: 351 ALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKLHYGA----------------------F 388

Query: 511 NAIVESKTYQSL---QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG-- 565
              + SK +      +  + N+   + N  ++R   +QA             + C  G  
Sbjct: 389 GGKIGSKYFSGWYPPEGATSNMAEYVPNSFRVRCPSKQAN----------ASTNCDYGDR 438

Query: 566 -PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
            PC L+N+ +DPCE NNIA    DI   +   L+ +RR+ V +   QP   Q++PK   +
Sbjct: 439 KPC-LYNIRDDPCEYNNIADWNQDIVEAMTARLREYRRSAV-EPRNQPLDPQSNPKLHGN 496

Query: 625 TWSPWI 630
            W PW+
Sbjct: 497 VWEPWV 502



 Score = 62.4 bits (150), Expect = 7e-07,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQ 59
           + +N        G+Y+TD+F +EA Q+I+     +PLFLYLAH A HAG    + L+AP 
Sbjct: 157 LHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNASQPLFLYLAHQAVHAGVYPDRLLQAPD 216

Query: 60  ETINQFQYITDPNRRTYAALTKS 82
           +   +F +I    RR YAA+  +
Sbjct: 217 KYYQRFPHIETEGRRMYAAMVSA 239


>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
          Length = 502

 Score =  379 bits (973), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 213/505 (42%), Positives = 292/505 (57%), Gaps = 43/505 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS ++PTPNIDALA +GIIL+N Y   +C+PSR +LMTGK+PI  G+Q   I
Sbjct: 35  GWDDVSFHGSRQVPTPNIDALASDGIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVI 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A+P G+ L E+ LPEYL  LGY +  +GKWHLG    EYTP++RGFESHFG+  G  S
Sbjct: 95  MPAQPSGLGLKEKLLPEYLNTLGYKSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCES 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y  H+  +        +G D   N        G+Y+T LFT +A QL+ +     P+FLY
Sbjct: 155 YTTHMCGN--------SGLDFWLNEEPDHSAGGQYSTSLFTAKAEQLLAEHDTASPMFLY 206

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H GN  +   AP +  ++  +I+D  RR  A M+  LD+SVG V  AL  KGML
Sbjct: 207 LAHQAVHVGNQDQKFYAPDKYTDKLSFISDDRRRQMAGMLTALDESVGKVTKALHAKGML 266

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           EN+II+F SDNGAP +   +       GSN+P RG K ++WEGG +VPA +WSP  +++ 
Sbjct: 267 ENTIIVFSSDNGAPHIGNSD----HTVGSNFPLRGGKPSIWEGGSRVPAFVWSPLFKKSG 322

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            +S Q+MHISDWLPTL  AAG DTS LP N+ G+ QW                      +
Sbjct: 323 YISKQLMHISDWLPTLLQAAGYDTSALPTNLYGVSQW---------------------QT 361

Query: 452 LLLNTPSRRNSVLINI--DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV--PL 507
           L  N PS R+SVL N+    K ++ A+R+   KL+    ++    G+Y    S+ +  P 
Sbjct: 362 LQENGPSPRDSVLYNVIAGCKDKSWALRVGDMKLLRAQGKSKKNAGWYQPFSSDILSQPK 421

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NG 565
               AI+     + LQ     I   +   +K  ST  +  I CGA PA +  + C   + 
Sbjct: 422 SRDPAIISVVPAELLQSQVTPILRELGRGEKSSSTEHR--IDCGAPPAGVVYN-CIGMDK 478

Query: 566 PCYLFNLGNDPCEQNNIASSRPDIS 590
           PC LFN+  DPCE +N+A+  P+I+
Sbjct: 479 PC-LFNITADPCEYHNLANDMPEIA 502



 Score = 62.0 bits (149), Expect = 9e-07,   Method: Compositional matrix adjust.
 Identities = 27/67 (40%), Positives = 40/67 (59%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y+T LFT +A QL+ +     P+FLYLAH A H GN  +   AP +  ++  +I+D  
Sbjct: 179 GQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFISDDR 238

Query: 73  RRTYAAL 79
           RR  A +
Sbjct: 239 RRQMAGM 245


>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
 gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
          Length = 577

 Score =  379 bits (972), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 226/563 (40%), Positives = 312/563 (55%), Gaps = 50/563 (8%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           A+   +  + L+   G+ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKYPIHTGMQ   I   EP G+P  ER +PE  R+ GY+T  +GKWHLGF+R++ TP  
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 197 RGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           RGF+ HFGY NG + YYD    +L   YS      G D RR+L    +  G YAT+ FT 
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYS-----TGLDFRRDLEPCREAEGTYATEAFTT 198

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           EA ++IE    ++PLF+ L+HLA H GN    ++AP+E + +F +I DP RRTYA M+  
Sbjct: 199 EARKVIERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISS 258

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD SVG  + AL   GML NSI++  SDNGAPTV       + N GSNYP+RG K + WE
Sbjct: 259 LDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVGI-----HSNAGSNYPFRGQKESPWE 313

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG++    LWSP +QQ   VS Q +H  DWLPTL  AAG                    +
Sbjct: 314 GGIRSVGALWSPLLQQRGYVSNQAIHAIDWLPTLAAAAG--------------------V 353

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
           + PS  +  +DGL+ W SL  +   +  ++L  +D+    ++   D+ K V G+   G  
Sbjct: 354 SLPS--DLRLDGLNLWPSLSASAQPQPRNLLHVLDDVFGYSSYTQDTLKYVNGSSFEGRY 411

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS--QNIFLPIS-NIDKMRSTRQQATIHC 550
           D + G+       L +      S+ Y  L + S  Q +   ++   D++R  R +AT  C
Sbjct: 412 DHWLGE-------LEDGEEDPNSRDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC 464

Query: 551 GANPA----PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
                    P+         C LF+L NDPCE+ N+A+  P     L + ++  R T +P
Sbjct: 465 PIQERDPEDPVYLCEPLKAAC-LFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALP 523

Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
            +      V+ADP   +  W  W
Sbjct: 524 SARVPRADVRADPALHDGHWEWW 546



 Score = 80.1 bits (196), Expect = 3e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 52/82 (63%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE    ++PLF+ L+HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCREAEGTYATEAFTTEARKVIERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFAHIRDPKRRTYAGMISS 258


>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 554

 Score =  378 bits (970), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 217/541 (40%), Positives = 299/541 (55%), Gaps = 44/541 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS++IPTPN+D LA +G+ILNN Y  P CTPSRA+LMTG YPI TGMQG PI
Sbjct: 47  GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP G+P   R LP+YL+E GY T  +GKWHLG ++   TP  RGF+S +GY  G   
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H +S +        G D   N    W  +G Y+T +FTK A  +IE++   KPL L 
Sbjct: 167 YFAHTISYE-----NHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLV 221

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           + H A H     + L+APQE I++F YI + NR  YA MV  LD SVG V+ AL   GML
Sbjct: 222 ITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGML 281

Query: 332 ENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           EN+II+F SDN G P  E+    N R  G N+P RG K T+WEGG +  A +WSP + + 
Sbjct: 282 ENTIIVFSSDNGGGPWGEH----NSR--GFNWPLRGAKGTVWEGGTRAAAFVWSPLLARR 335

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q+MHI+DWLPT Y+ AGG+ ++L                        +DG D W 
Sbjct: 336 RRVSHQLMHITDWLPTFYSIAGGNAAKL----------------------VELDGHDMWR 373

Query: 451 SLLLNTPSRRNSVLINIDEK-KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
            L     S R  +L N D K  ++AA+R   +KLVL   E G     Y +      P  +
Sbjct: 374 PLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVL--DETGRFADRY-KVPGGSRPRHD 430

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
            + +    T  S+ +     F     ++  +S R++A + CG           T+   YL
Sbjct: 431 LDYLAAKSTAASVLR----DFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS--VYL 484

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           F++  DPCE NN+AS  P++ + L + L  +    +P+ +      ++ P+    TW+PW
Sbjct: 485 FDVVTDPCELNNLASKLPNVVASLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAPW 544

Query: 630 I 630
           +
Sbjct: 545 V 545



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 45/79 (56%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           N    W  +G Y+T +FTK A  +IE++   KPL L + H A H     + L+APQE I+
Sbjct: 185 NKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENID 244

Query: 64  QFQYITDPNRRTYAALTKS 82
           +F YI + NR  YA +  +
Sbjct: 245 KFPYIGEKNRTIYAGMVDA 263


>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 511

 Score =  377 bits (968), Expect = e-102,   Method: Compositional matrix adjust.
 Identities = 213/501 (42%), Positives = 291/501 (58%), Gaps = 44/501 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTPN+D LA +GIILNN Y QP CTPSRA+LMTG YPIHTGMQ   +
Sbjct: 2   GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP G+PL+   +P+YL+ LGY T  +GKW+LG ++  YTP +RGF+S +GY + V  
Sbjct: 62  SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121

Query: 212 YYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY+H +L D  +      G D   N     +  G Y+T L+T+    LIE++ V KP FL
Sbjct: 122 YYNHTVLWDNQT------GLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFL 175

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YL + A H GN    L+APQE I++F YI + NR  +A M   LD+S+G V  AL    M
Sbjct: 176 YLPYQAVHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADM 235

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+N+II+  +DNG     +       N G N+P RG K TLWEGG +  A +WSP +++ 
Sbjct: 236 LDNTIIVLSADNGGLPFGFES-----NRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRK 290

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             VS QMMHI+DW+PTLY+AAGG+   L                        +DG D W 
Sbjct: 291 GVVSDQMMHITDWVPTLYSAAGGNPQTL----------------------GKLDGTDMWV 328

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            L   + S R  +L NID   R +A+R  ++KLVLG    G  D  Y  T  ++ P  + 
Sbjct: 329 PLAYGSASPRYEILYNIDPIDRNSALRYTNYKLVLGIANGGAYDQRYLTTGGSR-PYGDL 387

Query: 511 NAIV-ESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           + ++ +S+  Q L++     +L   +       RQ+AT+ CG N  P      +  P YL
Sbjct: 388 DLLMAQSEAAQVLREYYGTGYLKFPD-----QWRQRATLTCG-NKVPKNFVSLS--PPYL 439

Query: 570 FNLGNDPCEQNNIASSRPDIS 590
           F+L  DPCE NNIA+S+P +S
Sbjct: 440 FDLTADPCEINNIANSQPGVS 460



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/70 (45%), Positives = 45/70 (64%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y+T L+T+    LIE++ V KP FLYL + A H GN    L+APQE I++F YI + N
Sbjct: 149 GIYSTQLYTERTKFLIENRDVSKPFFLYLPYQAVHCGNFDDPLQAPQENIDKFPYIGEEN 208

Query: 73  RRTYAALTKS 82
           R  +A +T +
Sbjct: 209 RTIFAGMTDA 218


>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
          Length = 545

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 214/546 (39%), Positives = 299/546 (54%), Gaps = 47/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+S HGS +IPTPNIDALA +GI+LNN Y QP+CTPSRA+L+TGK+P+H G+Q   I
Sbjct: 38  GWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTGKHPVHLGLQHNTI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+ L ER LPEYL  LGY +  +GKWHLG+F  ++TP YRGF+SHFGYLNG   
Sbjct: 98  PAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRGFKSHFGYLNGCED 157

Query: 212 YYDHILSDQYSRTVELNGH--DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           Y DH L+  +S T+ ++G   D   +      + G+Y+T++FT  A +LI      +PLF
Sbjct: 158 YLDHTLAYDFS-TLGMDGWGLDFWNDTKIHRTSFGQYSTEIFTTRAEELIRSN-TGEPLF 215

Query: 270 LYLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           LY++H A H+GN    G  L+AP +  N+F YI    RR  A M+  LDDSVG +   L 
Sbjct: 216 LYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNYIQSDERRRLAGMISALDDSVGNITRTLH 275

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
            KGML N+II+F +DNG P   + +     N  SN+P RG K T+WEGG++    +WSP 
Sbjct: 276 AKGMLNNTIIVFSTDNGGPADRFDD-----NCASNWPLRGGKRTMWEGGLRGNGFVWSPL 330

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           ++ +  VS   M I DWLPTL  AAG D  +LP +++G+ QW                  
Sbjct: 331 LKHSNYVSEHFMSIMDWLPTLLDAAGYDLGKLPRDVEGMSQW------------------ 372

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
               SL  N+PS R+ +L NID       +RL   K+V G        G+Y        P
Sbjct: 373 ---KSLSENSPSPRDYLLHNIDPLTGEGGLRLGDMKVVFGGGNTKAHRGWY-------PP 422

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN-- 564
            + +     S      +    +IF  +      +    Q  + CG  P  +T + C +  
Sbjct: 423 EMRWGNETVSAPSHLFETDLWDIFHDLGRPPVQKKISVQ--VECGPAPPDVTTN-CKSWI 479

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
            PC LFN+  DPCE +N++  +PD+  +L  ++K    T+ P  ++  D     P R   
Sbjct: 480 EPC-LFNISADPCEYHNLSEDQPDMLGKLLSIVKAINATVTP-PYDNSDDPNGYPGRHGG 537

Query: 625 TWSPWI 630
            W PWI
Sbjct: 538 VWKPWI 543



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 4/75 (5%)

Query: 11  TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQY 67
           + G+Y+T++FT  A +LI      +PLFLY++H A H+GN    G  L+AP +  N+F Y
Sbjct: 189 SFGQYSTEIFTTRAEELIRSN-TGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNY 247

Query: 68  ITDPNRRTYAALTKS 82
           I    RR  A +  +
Sbjct: 248 IQSDERRRLAGMISA 262


>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
          Length = 512

 Score =  377 bits (967), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 220/556 (39%), Positives = 303/556 (54%), Gaps = 67/556 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S+HG  +I TPNIDALA+NGI L   YAQP+CTPSRA+L+TG+YPI+ G+Q   I
Sbjct: 1   GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  EPRG+PL++  LP++L +LGY T  +GKWH+GFF++EYTP  RGFE H G+    I 
Sbjct: 61  YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H     Y       G DMRRNL  A +  G Y T L TKEA+++IE+ PVDKPLFLY
Sbjct: 121 YYKH--EKAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLY 178

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAHLA H+      L+ P + I Q++ I    R  YA MV  LD+SVG V  AL + GML
Sbjct: 179 LAHLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGML 238

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            N++ +F SDNGA            N+ S++P RG K T WEGGV+  A+LWS  +   P
Sbjct: 239 SNTLFVFTSDNGA-----DAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHAL--GP 291

Query: 392 RVSL-----QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           RV        + HI+DWLPTLY  AGGD + L                       +IDG+
Sbjct: 292 RVRAGGDYEHLFHITDWLPTLYQLAGGDPAHL----------------------GDIDGV 329

Query: 447 DQWSSLLLNTPSR-----RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
              ++L+ +   +     R  VL+NID     +A+    +K V G  E G  + +Y    
Sbjct: 330 SHVNALVGDASPQPPQPPRTEVLVNIDPIDNYSAIIDGRFKFVAGVAERGIYEQWY---- 385

Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK-------MRSTRQQATIHCGANP 554
                      I+ +  + S + L Q    P++ + K          +   A I CG   
Sbjct: 386 ----------PILGNVDWNSTEPLEQCEASPVARVLKSFGPEPACGPSGGAAPIDCGVRD 435

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD-ISSQLYELLKYHRRTLVPQSHEQPD 613
                 P T  PC LF+L  DPCE N++++ RP+ + S L +L +Y   ++ P +     
Sbjct: 436 PAKACQP-TVAPC-LFDLREDPCEYNDVSAERPELVKSLLTKLRRYEDASMPPANIPHDP 493

Query: 614 LVQADPKRFNDTWSPW 629
           L  +DP  +++ W PW
Sbjct: 494 L--SDPGLYDNAWIPW 507



 Score = 84.7 bits (208), Expect = 1e-13,   Method: Compositional matrix adjust.
 Identities = 46/123 (37%), Positives = 67/123 (54%), Gaps = 18/123 (14%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL  A +  G Y T L TKEA+++IE+ PVDKPLFLYLAHLA H+      L+ P +
Sbjct: 139 MRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYLAHLAPHSAGPQDPLQVPDK 198

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN 120
            I Q++ I    R  YA +  +   ++ +V+                  +ALA NG++ N
Sbjct: 199 YIEQYRDIGSTERMKYAGMVSALDESVGLVF------------------EALAKNGMLSN 240

Query: 121 NMY 123
            ++
Sbjct: 241 TLF 243


>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 510

 Score =  375 bits (964), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 217/544 (39%), Positives = 301/544 (55%), Gaps = 53/544 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+SFHGS +IPTPNID LA +G+ILNN Y  P CTPSRA+LMTG YPIHTGMQ   I
Sbjct: 14  GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLYPIHTGMQSDII 73

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A P G+PL  + LP++ R+LGY    IGKWHLGFF+  Y P+ RGF++ FG+  G   
Sbjct: 74  EPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 133

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY+H    + +      G  +   L+ AW  + +    L T   + +       +P F Y
Sbjct: 134 YYNHTSGSENNE----QGVSLVATLTAAWGRIAQGVEPLLTDSPLNVYP-----QPFFCY 184

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            +H A H+    +  +AP   + +F YI + NR  YA M+  LD+SVG V+ AL   GML
Sbjct: 185 FSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRVVEALDNAGML 244

Query: 332 ENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           E++II+F SDN GAP  E        N G N+P RG K TLWEG V+VPA +WSP+  + 
Sbjct: 245 EDTIIVFSSDNGGAPCGE------ASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPKFLKQ 298

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q+MHISDWLPTLYTAAGGD S L + IDG+                     D W 
Sbjct: 299 SRVSNQLMHISDWLPTLYTAAGGDVSALGV-IDGV---------------------DMWH 336

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY---GQTR-SNKVP 506
           SL  ++ S+R  +L+NID     + +R   +KLV+G   +G ++ +Y   G TR +N + 
Sbjct: 337 SLTHSSASKRKEILLNIDASFNISGLRYKQYKLVVGGGFDGELNDHYYFTGGTRPTNNID 396

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
            L  N    ++  +   +  +  + P++  + +    ++  +    N  PM P       
Sbjct: 397 ELR-NESTMARVLKKFYEKQRRSWSPLTWSENVAVDCRRDRLT--ENFVPMQPP------ 447

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            YLFN+  DPCE NN+A ++      L + L  +  T+VP S  QPD  +A P+     W
Sbjct: 448 -YLFNIEQDPCELNNLAKTKVHTLRFLMKKLDAYNSTVVP-SLFQPDDPRAYPEYHGGIW 505

Query: 627 SPWI 630
           SPW+
Sbjct: 506 SPWL 509



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 5/75 (6%)

Query: 5   LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           L+ AW  + +    L T   + +       +P F Y +H A H+    +  +AP   + +
Sbjct: 154 LTAAWGRIAQGVEPLLTDSPLNVYP-----QPFFCYFSHHAVHSALMAEPFQAPARNVLK 208

Query: 65  FQYITDPNRRTYAAL 79
           F YI + NR  YA +
Sbjct: 209 FPYIGESNRTIYAGM 223


>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
 gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
          Length = 542

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 224/547 (40%), Positives = 302/547 (55%), Gaps = 47/547 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G ND+SFHGSN+I TPNIDALAYNG++LN  Y   +CTPSRA+L+TGKYPIHTGMQ   I
Sbjct: 2   GMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWVI 61

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+P  ER +PE  RE GYST  +GKWHLGF+R++ TP  RGF+ H+GY NG I 
Sbjct: 62  ITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYID 121

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YYDH   +L   YS      G D RR+        G YAT+ FT EA ++IE+    KPL
Sbjct: 122 YYDHQVRLLGTNYSA-----GLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPL 176

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           F+ L+HLA H GN    ++AP+E + +F +I DP RRTYA M+  LD SVG  + AL+  
Sbjct: 177 FMVLSHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDN 236

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML N+I++  SDNGAPTV       + N GSNYPYRG K + WEGG++    LWSP ++
Sbjct: 237 DMLNNTIVLLYSDNGAPTV-----GIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLE 291

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   VS Q +H  DWLPTL  AAG     LP +++                   +DG + 
Sbjct: 292 KQGYVSNQAIHAIDWLPTLAAAAG---VSLPQDLE-------------------LDGENL 329

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
           W SL      +    +  +DE    ++   D+ K V G+  +G  D + G   +N+   L
Sbjct: 330 WPSLSEKAEPKPRKFIHVLDEVFGYSSYMSDTLKYVNGSSFDGQYDYWLGDLEANEDDPL 389

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA--NPAPMTP----SPC 562
           +     E      +Q +  +  L    I +MRS   +AT  C A     P+ P     P 
Sbjct: 390 S-GIYGEQVLASEVQSVLGSHGLTEDRILQMRS---EATQKCPAIEGQDPLQPHYRCEPL 445

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
              PC+ F+L  DPCE+ N+A   P    +L + ++  R+T +P +       +ADP   
Sbjct: 446 V-APCF-FDLAQDPCERYNLARLYPLQLEKLRQEVEEFRQTAIPSARVPFSDERADPIFH 503

Query: 623 NDTWSPW 629
           N  W  W
Sbjct: 504 NGNWEWW 510



 Score = 79.0 bits (193), Expect = 8e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 50/82 (60%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+        G YAT+ FT EA ++IE+    KPLF+ L+HLA H GN    ++AP+E
Sbjct: 140 FRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVLSHLAVHTGNEDNPMQAPEE 199

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 200 EVAKFSHIKDPKRRTYAGMISS 221


>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 533

 Score =  375 bits (962), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 222/561 (39%), Positives = 312/561 (55%), Gaps = 55/561 (9%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           +A  K   +  ++V   GW+D+SFHGS +IPTPN+D LA +G+ILNN Y Q  CTPSRA+
Sbjct: 20  SASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPSRAA 79

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           LMTG YPIH G+Q   I  A+P G+PL  + +PE+ +++GY T  IGKWHLG+FR+EYTP
Sbjct: 80  LMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKEYTP 139

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
            YRGF+S +GY NG   YY+H ++   S++     +++R+   T                
Sbjct: 140 TYRGFDSFYGYYNGAEDYYNHSITKVISQS-----YNIRQGSVTKKRIENYIKNTELVLL 194

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           ++         +PLFLYLA+ + H       LEAP+E I +F YI + NR  +A  V  L
Sbjct: 195 SLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAVDAL 249

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D SVG V+  L    ML+N+II+  +DNG      +      N G N+P RG K TLWEG
Sbjct: 250 DQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQS-----NRGYNFPLRGAKGTLWEG 304

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLL 433
           G +  A +WSP + QN RVS QM+HI+DWLPTLY+AAG   + + L  +DG D       
Sbjct: 305 GTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYD------- 357

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
                          W+ L  N PS R  VL NID     +A+R  ++KLVLGT E+G  
Sbjct: 358 --------------MWTQLSYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVLGTYEDGEF 403

Query: 494 DGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQ--NIFLPISNIDKMRSTRQQATIHC 550
           D  +  T  ++ P  +   A+ +SK  + L+      N  LP       R+ RQ+AT++C
Sbjct: 404 DQRFRTTGGSR-PYGDLEIAMAQSKAARVLKGFYNRGNFLLP-------RNWRQKATVNC 455

Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK-YHRRTLVPQSH 609
            +N  P   +  +  P YLF+L  DPCE NNIAS+   +   L + L+ Y   ++ P + 
Sbjct: 456 VSNNGP--KNFVSQAPPYLFDLTKDPCELNNIASAEAGLVDFLKQRLQSYASSSIAPLN- 512

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
            +P   +  P+  N TW PWI
Sbjct: 513 -KPTDPRGYPENLNGTWGPWI 532



 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 9/61 (14%)

Query: 35  KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA----ALTKSTTLTLLIV 90
           +PLFLYLA+ + H       LEAP+E I +F YI + NR  +A    AL +S    L  +
Sbjct: 206 QPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAVDALDQSVGEVLETL 260

Query: 91  Y 91
           Y
Sbjct: 261 Y 261


>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 588

 Score =  374 bits (961), Expect = e-101,   Method: Compositional matrix adjust.
 Identities = 214/539 (39%), Positives = 301/539 (55%), Gaps = 38/539 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S++G  +I TPNIDALA+NGI L   Y QP+CTPSRA+LMTG+YPIHTGMQ   I
Sbjct: 76  GWNDVSYNGCPQIRTPNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVI 135

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPRG+PL  + LP++L +LGY ++ +GKWHLGF+++EYTP  RGF+ H G   G + 
Sbjct: 136 LQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVD 195

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI  ++       +G D R+ LS   +  G+Y T+  T+ A ++IE+ P++KPLFLY
Sbjct: 196 YYSHIRFNKIG--FSHSGLDFRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLY 253

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAHLA H  N    L+ P++  +++  I   NR  YA MV  LD+SVG V+ AL ++GML
Sbjct: 254 LAHLAPHGANRHDPLQVPKKYSDKYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGML 313

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            +++++F SDNG  T          N+ S++P++G K TLWEGG+ VP  +WSP      
Sbjct: 314 SDTVLVFSSDNGGDT-----NGENPNYASSWPFKGQKRTLWEGGIHVPGFIWSPLFSGMR 368

Query: 392 RVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
                 + HISDWLPTLY  AGGD S L                       +IDG+    
Sbjct: 369 GFDYNNIFHISDWLPTLYQLAGGDPSDL----------------------GDIDGISHLD 406

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           SL   + + R  +LINID  +  +A+    +KLV G    G +D ++ Q   N     ++
Sbjct: 407 SLSRRSETPRKELLINIDPIENVSAIIEGHFKLVSGVVRGGVLDEWF-QVPGNIT--WDY 463

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           N   +      + ++ +N    ++   +  S      I CG    P  P   T  PC LF
Sbjct: 464 NRARQECETSLVARVLRNAGHDVACGSEDGSF--PTPIKCGKR-DPSKPCVPTVAPC-LF 519

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           +L  DPCE NNIA    ++  +L   L+ +R T VP  +  P   QAD    N+ WS W
Sbjct: 520 DLSKDPCEYNNIAEQHNEVLQRLLGKLEGYRETSVPPGN-LPSDEQADAALHNNVWSSW 577



 Score = 66.2 bits (160), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 31/82 (37%), Positives = 49/82 (59%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            R+ LS   +  G+Y T+  T+ A ++IE+ P++KPLFLYLAHLA H  N    L+ P++
Sbjct: 214 FRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQVPKK 273

Query: 61  TINQFQYITDPNRRTYAALTKS 82
             +++  I   NR  YA +  +
Sbjct: 274 YSDKYHDIGHWNRTMYAGMVSA 295


>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
 gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
          Length = 633

 Score =  373 bits (957), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 204/460 (44%), Positives = 271/460 (58%), Gaps = 55/460 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS EIPTPNIDALAY+GIILN  Y  P+CTPSR++LMTGKYPIHTGMQ   +
Sbjct: 37  GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + AEPRG+PL E+ LP+YL ELGY++   GKWHLG ++ +YTPLYRGF SH+G       
Sbjct: 97  YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
                              DMR     A+D  G Y TD+ T+ +V++I +    K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFL 191

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H+ N    L  P   + +  +I +  RR +AAMV K+DDSVG ++  L++  M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSIIIF SDNG P   +       N+ SNYP +GVKNTLWEGGV+   ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS Q MHI DWLPTL  AAGG  +   L                   +  IDG   W 
Sbjct: 307 QRVSNQTMHIVDWLPTLLEAAGGQPALANL-------------------SKQIDGQSIWR 347

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           +L+ +  S R +VL NID+   +AA+ +  WKLV GT   G+ DG+YG        L ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAAISVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDW 407

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
             +  S+  ++L+ L     LP S  D+ R     AT+ C
Sbjct: 408 QLVGRSRAGKALEGLK---MLP-SRADQQR-IPAAATVSC 442



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
           MR     A+D  G Y TD+ T+ +V++I +    K PLFLY+AH A H+ N    L  P 
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211

Query: 60  ETINQFQYITDPNRRTYAALT 80
             + +  +I +  RR +AA+ 
Sbjct: 212 NDVIKMSHIPNYKRRKFAAMV 232


>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
          Length = 593

 Score =  372 bits (954), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 217/575 (37%), Positives = 301/575 (52%), Gaps = 73/575 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS++IPTPN+D LA +G+ILNN Y  P CTPSRA+LMTG YPI TGMQG PI
Sbjct: 47  GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP G+P   R LP+YL+E GY T  +GKWHLG ++   TP  RGF+S +GY  G   
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166

Query: 212 YYDHILS-------------------------------DQYSRTVELNGH---DMRRNLS 237
           Y+ H +S                               D ++ T+    H   D   N  
Sbjct: 167 YFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYENHTGLDFWLNKK 226

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
             W  +G Y+T +FTK A  +IE++   KPL L + H A H     + L+APQE I++F 
Sbjct: 227 PVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENIDKFP 286

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYR 356
           YI + NR  YA MV  LD SVG V+ AL   GMLEN+II+F SDN G P  E+       
Sbjct: 287 YIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGEHNSR---- 342

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
             G N+P RG K T+WEGG +  A +WSP + +  RVS Q+MHI+DWLPT Y+ AGG+ +
Sbjct: 343 --GFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAGGNAA 400

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK-KRTAA 475
           +L                        +DG D W  L     S R  +L N D K  ++AA
Sbjct: 401 KL----------------------VELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAA 438

Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
           +R   +KLVL   E G     Y +      P  + + +    T  S+ +     F     
Sbjct: 439 LRNHRYKLVL--DETGRFADRY-KVPGGSRPRHDLDYLAAKSTAASVLR----DFYKKKQ 491

Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
           ++  +S R++A + CG           T+   YLF++  DPCE NN+AS  P++ + L +
Sbjct: 492 LNFPKSWRRKARLTCGKQKKINFSWSATS--VYLFDVVTDPCELNNLASKLPNVVASLKK 549

Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
            L  +    +P+ +      ++ P+    TW+PW+
Sbjct: 550 RLDAYGAAALPERNNDTYDPRSFPENHGGTWAPWV 584



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 31/79 (39%), Positives = 45/79 (56%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           N    W  +G Y+T +FTK A  +IE++   KPL L + H A H     + L+APQE I+
Sbjct: 224 NKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENID 283

Query: 64  QFQYITDPNRRTYAALTKS 82
           +F YI + NR  YA +  +
Sbjct: 284 KFPYIGEKNRTIYAGMVDA 302


>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 527

 Score =  370 bits (951), Expect = e-100,   Method: Compositional matrix adjust.
 Identities = 220/558 (39%), Positives = 302/558 (54%), Gaps = 54/558 (9%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           + L  +    L+   GW+D+SFHGS +IPTPN+DALA +GIILNN Y QPVCTPSRA+LM
Sbjct: 19  SKLPPNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALM 78

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG YPIHTG+Q   +  AEP G+PL  + +PEY ++LGY T  IGKW+LG++ +EYTP Y
Sbjct: 79  TGMYPIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTY 138

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW--DTVGEYATDLFTKE 254
           RGF+S +G+ N    Y+ H            N   + ++ +  W   +   Y T L    
Sbjct: 139 RGFDSFYGFYNYEEDYFTH------------NLEFVNQSNAMVWRPSSFCVYLT-LSPDT 185

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
              L+       P FLYL+H + H  +    L+AP+E I +F YI D  R  YA MV  L
Sbjct: 186 GFDLLSASIERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDAL 245

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D+S G V+ AL   GML N+II+  S NG        +    N GSN+P RG K  LWEG
Sbjct: 246 DESAGDVLEALYEAGMLANTIIVMSSANGG-----LSSGVESNAGSNFPLRGGKGALWEG 300

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G +  A +WSP + Q  RVS QMMHI+DWLPTLY AA G+    P N+            
Sbjct: 301 GTRASAFIWSPLLYQKNRVSDQMMHITDWLPTLYAAAQGN----PFNL------------ 344

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
                   +DG + W+ L  N PS R  +L NID   +T+A+R  ++KLVLG       +
Sbjct: 345 ------GTLDGQNLWNHLSYNLPSPRYELLYNIDPIDQTSAIRFLNYKLVLGESSGSEEN 398

Query: 495 GYYGQTRSNKVPLLNFN-AIVESKTYQSLQQL-SQNIFLPISNIDKMRSTRQQATIHCGA 552
            +  +T  N  P  + + A+ +SK  + L+   ++ I    SN       RQ+A ++CG 
Sbjct: 399 EHRLKTPGNSRPYRDLDIAMAQSKAARVLRNFHNRGILFMDSN------WRQKAALNCGD 452

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
           +      +     P YLF+L  DPCE  NI    P   S L   L  + +T +  S++Q 
Sbjct: 453 DE---RENVVPGAPPYLFDLAKDPCEMRNIVLIEPGWVSILTLRLHIYEQTSIAPSNKQI 509

Query: 613 DLVQADPKRFNDTWSPWI 630
           D  +  P+  N TW+PWI
Sbjct: 510 D-PRGFPENLNGTWAPWI 526



 Score = 46.2 bits (108), Expect = 0.057,   Method: Compositional matrix adjust.
 Identities = 19/47 (40%), Positives = 27/47 (57%)

Query: 36  PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
           P FLYL+H + H  +    L+AP+E I +F YI D  R  YA +  +
Sbjct: 198 PFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDA 244


>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
          Length = 542

 Score =  369 bits (946), Expect = 4e-99,   Method: Compositional matrix adjust.
 Identities = 215/551 (39%), Positives = 304/551 (55%), Gaps = 55/551 (9%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI  G+Q   
Sbjct: 34  FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 93

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I+ A+P G+ L E+ LPEYL+ LGY +  +GKWHLGFF  EYTP+ RGF+SH+G+     
Sbjct: 94  IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 153

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y  HI          + G D   N        G Y+T LFT +A QL+ +    +P+FL
Sbjct: 154 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 205

Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           Y +H A H  +    G  L AP +  ++F +I D   +  A M+  LD+SVG +  +L  
Sbjct: 206 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHT 265

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            GMLEN+IIIF SDNGAP        N     SNYP RG K ++WEGG +VPA +WSP +
Sbjct: 266 NGMLENTIIIFSSDNGAP------HENTDVCSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 319

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +++  VS QMMHISDWLPTL  AAG + S LP ++ G+ QW                   
Sbjct: 320 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQW------------------- 360

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-AVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
              +L  N PS RNS+L N D KK+ + A+R+   KL+   Q  G    + G  +   + 
Sbjct: 361 --QALQENGPSARNSMLYNADHKKQGSWALRVGDMKLL---QAEGATQLHSGWQQPFAIN 415

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDK-MRSTRQQATIHCGANPAPMTPSPC--T 563
           L +     E      +Q+    +   +  + + MR++  +  ++CG   + +T   C  T
Sbjct: 416 LQSQLPSNEPNNTNYVQEEQSQVASILEELGRGMRNSTTKYFVNCGFISSGVTSHSCIDT 475

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDI----SSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
             PC L+N+  DPCE +N+A   P++     + L +LLK +   +      +P    A P
Sbjct: 476 GRPC-LYNVTADPCEYHNLAKDLPEVVERLEASLDDLLKMYIEPI-----NKPADPAAYP 529

Query: 620 KRFNDTWSPWI 630
           K     W PWI
Sbjct: 530 KNHGGVWMPWI 540



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
           G Y+T LFT +A QL+ +    +P+FLY +H A H  +    G  L AP +  ++F +I 
Sbjct: 179 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 238

Query: 70  DPNRRTYAALTKS 82
           D   +  A +  +
Sbjct: 239 DVECQHMAGMISA 251


>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
 gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
          Length = 525

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 223/550 (40%), Positives = 314/550 (57%), Gaps = 52/550 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y   +CTPSR++L+TGKYPI TGMQ   I
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGMQHYVI 65

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + E  ++ GY T  +GKWHLG  +R +TP  RGF++H GYL   I 
Sbjct: 66  VNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYLGAYID 125

Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KP 267
           YYD      S  YSR     GHD R NL  + D VG Y TD+ +  AV+LIE   V  KP
Sbjct: 126 YYDQTYLQSSQNYSR-----GHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLA HA N    L+AP E ++QF+YI + + R YAAMV +LD SVG VI AL  
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           + ML NSI++F+SDNG P+V       + N  SNYP RG K+T WEGG++  A +WS + 
Sbjct: 241 REMLRNSIVLFLSDNGGPSV-----GEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEF 295

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           ++   V  Q ++I D LPTL +AAG   S  P +++DGL+ WS+L     S     +  +
Sbjct: 296 ERLGSVWKQRIYIGDLLPTLCSAAG--ISLDPSMHLDGLNLWSALKYGYESVEREIVHSI 353

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           D                   +D +   A +R   WK++ GT  +G  DG+ GQ  +++V 
Sbjct: 354 DA------------------VDPQPHLAYMR-GKWKIINGTTNSGQYDGWLGQRITDEVD 394

Query: 507 --LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
              ++++ ++ + T     QL  +      NI ++   RQQA I C   P P    PC  
Sbjct: 395 PRSVHYDQLIRNTTVWQQLQLLADKPQQERNITEL---RQQAQILC---PTPANAKPCMP 448

Query: 565 --GPCYLFNLGNDPCEQNNIASSRPDISS--QLYELL-KYHRRTLVPQSHEQPDLVQADP 619
              PC+ F++  DPCEQNN+     + S    L+E + +Y R   +P +  +P   + DP
Sbjct: 449 LIAPCF-FDIEVDPCEQNNLYDEYRNTSIFLDLWERIDQYARNAHLPNN--KPADPKCDP 505

Query: 620 KRFNDTWSPW 629
           + + + W+ W
Sbjct: 506 RYYRNEWTWW 515



 Score = 76.6 bits (187), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQ 59
            R NL  + D VG Y TD+ +  AV+LIE   V  KPLFL L+HLA HA N    L+AP 
Sbjct: 144 FRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKPLFLLLSHLAPHAANDNDPLQAPM 203

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E ++QF+YI + + R YAA+ 
Sbjct: 204 EELSQFEYIQNKSHRYYAAMV 224


>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 529

 Score =  368 bits (945), Expect = 5e-99,   Method: Compositional matrix adjust.
 Identities = 221/561 (39%), Positives = 308/561 (54%), Gaps = 50/561 (8%)

Query: 77  AALTKSTTLTLLIVY------GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           A  +KS+T+   I++      GW D+SF G  +IPTPN+D LA  GIILNN Y QP+C P
Sbjct: 10  AVFSKSSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAP 69

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR +LM+G YPIHTG+Q       EP G+P     +PEYL+ LGY+T  IGKWHLG+ + 
Sbjct: 70  SRGALMSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKE 129

Query: 191 EYTPLYRGFESHFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
            YTP  RGF+S +GYLNG   YYDH IL    S      G D   N +   +    Y+T+
Sbjct: 130 SYTPTRRGFDSFYGYLNGGEDYYDHTILWSNAS------GLDFWENTTPVRNEGNHYSTE 183

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           LFTK+A  LI+     KP+FLY +H A H G+    LEAP   I  F YI + NR  +A 
Sbjct: 184 LFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNRSIHAG 243

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V +LD SVG V+ AL ++GML NSI+IF +DNG         S Y     N+P RG K 
Sbjct: 244 AVYELDKSVGLVMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGY-----NWPLRGSKE 298

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           T WEGG +  A +WSP + ++ R+S QMMHI+DWLPTLY+AAGG+ S L           
Sbjct: 299 TNWEGGARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAGGNVSTL----------- 347

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
                       NIDG D W +L  +  S R  VLINID  + ++A+ +   K+VLG+  
Sbjct: 348 -----------GNIDGKDMWKALSEDLESPRQEVLINIDPIENSSALIVGRHKVVLGSFN 396

Query: 490 NGTMDGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
            G+ D        ++ P+   +  ++ S+T + L+      F  +  +    + R +A +
Sbjct: 397 EGSHDMRMKAPGGSR-PVDGLDQMMLSSRTGKVLKD-----FYNVRQLTVRPNWRNEAVV 450

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
            C     P      +  P Y F+L +DPCE NN+A+S      +L + +K + + +VP +
Sbjct: 451 RCDRYAPPNNFVAAS--PPYYFDLEHDPCELNNLAASNVTELEELIKKIKEYAKGMVPPA 508

Query: 609 HEQPDLVQADPKRFNDTWSPW 629
           ++  D  +  PK  +  W PW
Sbjct: 509 NKPLD-PRGLPKYNHGLWGPW 528



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 4/78 (5%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
            Y+T+LFTK+A  LI+     KP+FLY +H A H G+    LEAP   I  F YI + NR
Sbjct: 179 HYSTELFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNR 238

Query: 74  RTYAA----LTKSTTLTL 87
             +A     L KS  L +
Sbjct: 239 SIHAGAVYELDKSVGLVM 256


>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 502

 Score =  364 bits (935), Expect = 7e-98,   Method: Compositional matrix adjust.
 Identities = 194/503 (38%), Positives = 290/503 (57%), Gaps = 44/503 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S+HGS +I TPNIDALA+NGI LN  Y QP+CTPSR++ +TG YP++TGMQ   I
Sbjct: 37  GWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCYPMNTGMQHSVI 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPRG+PL  + LP++L + GY ++ +GKWHLG+++ EYTP  RGF+SH G   G   
Sbjct: 97  LTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQSHVGSWEGFSD 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI+   + +T  ++GHD RR++  + +  G Y T + T+EA+++I+D P +KPLFLY
Sbjct: 157 YYSHIMDFSW-QTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLY 215

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +AHLA H+GN  + L+AP +  + +  I  P+R  YA M+  LD+SVG V  AL R+G+L
Sbjct: 216 IAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVGAVFEALGRRGIL 275

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
             +I++  +DNGA       T+ + ++GS +P+RG K T WEGGV+VPA++WSP   ++ 
Sbjct: 276 NETIVVLTTDNGAGI-----TTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWSPLFSEHR 330

Query: 392 RVSLQ-MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              ++ + H+SDWLPT Y  AGGD S L                        IDG+ Q  
Sbjct: 331 GAVVESLFHVSDWLPTFYELAGGDPSNL----------------------GEIDGISQLD 368

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY----GQTRSNKVP 506
           +L       R  +++NID  +  +A+ +  +KL+ G  + G  D +Y      TR+ K  
Sbjct: 369 TLRHREKLPRTEIVLNIDPIENVSAIIVYPFKLMQGDDQGGHYDDWYPFMGNVTRTQKDA 428

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
                A V  +       +  + F+     D +       ++ CG         P    P
Sbjct: 429 REGCEASVVYRV------MKHSGFVVTCGQDPVTYA---TSVKCGEKDPTKACKPIVK-P 478

Query: 567 CYLFNLGNDPCEQNNIASSRPDI 589
           C LF+L  DPCE N+I++   ++
Sbjct: 479 C-LFDLSKDPCEYNDISAENIEV 500



 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 32/79 (40%), Positives = 52/79 (65%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR++  + +  G Y T + T+EA+++I+D P +KPLFLY+AHLA H+GN  + L+AP +
Sbjct: 176 FRRDMQKSKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLYIAHLAVHSGNQPEPLKAPTK 235

Query: 61  TINQFQYITDPNRRTYAAL 79
             + +  I  P+R  YA +
Sbjct: 236 YTDPYMDIGHPSRTLYAGM 254


>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
 gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
          Length = 472

 Score =  363 bits (933), Expect = 1e-97,   Method: Compositional matrix adjust.
 Identities = 207/493 (41%), Positives = 283/493 (57%), Gaps = 44/493 (8%)

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           MTGKYPIHTGMQ   ++G EPRG+PLTE+ LP+YL+ELGY     GKWHLG + R++TPL
Sbjct: 1   MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTK 253
            RGF+SH G+  G    +DH         VE N  G DMRR    A+D  G Y T +   
Sbjct: 61  ERGFDSHVGFWTGHHHMFDHT-------AVETNAWGLDMRRGFDVAYDLHGYYTTHVIRD 113

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           E+V  I      +P+FLY++H A H+ N    L AP ET+ +   I++ +RR +AAM+ +
Sbjct: 114 ESVAAIRAHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTE 173

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD SVG +I AL  +GML+NSII+F +DNG P   +       N  SN+P RG KNTLWE
Sbjct: 174 LDQSVGAIIVALSERGMLDNSIIVFSTDNGGPAEGFN-----NNAASNWPLRGTKNTLWE 228

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GGV+    +WSP I++  RVS Q+MHISDWLPTL  AAG D ++LP N+DG+  W     
Sbjct: 229 GGVRGAGCIWSPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLNKLPSNLDGISVW----- 283

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
             P  RN ++              +RR+ +L NID+   +AA+ +D WK+V GT   G  
Sbjct: 284 --PELRNGDV--------------TRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEGKW 327

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
           D +YG    +     N ++I    T ++L +L   +  P   I K+   R++AT+ C AN
Sbjct: 328 DAWYGPAGDHDTKSYNLSSIFGCPTGKALSKLK--MLPPEEEIIKL---RREATVQC-AN 381

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
                 +P  + PC LF+L NDPCE  N+A   PDI   L + L  +  T VP  +  P 
Sbjct: 382 GTRNKCNPLES-PC-LFDLFNDPCEFENLADQYPDILEVLLKKLSDYNATAVPPGN-MPL 438

Query: 614 LVQADPKRFNDTW 626
             + DP+ +  TW
Sbjct: 439 DSRGDPRFWGYTW 451



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 44/79 (55%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR    A+D  G Y T +   E+V  I      +P+FLY++H A H+ N    L AP E
Sbjct: 92  MRRGFDVAYDLHGYYTTHVIRDESVAAIRAHNTSQPMFLYVSHAATHSANPYDFLPAPDE 151

Query: 61  TINQFQYITDPNRRTYAAL 79
           T+ +   I++ +RR +AA+
Sbjct: 152 TVERLAGISNYSRRKFAAM 170


>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
          Length = 549

 Score =  363 bits (931), Expect = 2e-97,   Method: Compositional matrix adjust.
 Identities = 220/581 (37%), Positives = 306/581 (52%), Gaps = 84/581 (14%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI  G+Q   
Sbjct: 11  FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 70

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I+ A+P G+ L E+ LPEYL+ LGY +  +GKWHLGFF  EYTP+ RGF+SH+G+     
Sbjct: 71  IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 130

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y  HI          + G D   N        G Y+T LFT +A QL+ +    +P+FL
Sbjct: 131 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 182

Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           Y +H A H  +    G  L AP +  ++F +I D   +  A M+  LD+SVG +  +L  
Sbjct: 183 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHT 242

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            GMLEN+IIIF SDNGAP        N     SNYP RG K ++WEGG +VPA +WSP +
Sbjct: 243 NGMLENTIIIFSSDNGAPH------ENTDVCSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 296

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +++  VS QMMHISDWLPTL  AAG + S LP                      ++ G+ 
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALP---------------------GDLYGVS 335

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-AVRLDSWKLVLGTQENGTMDG----------- 495
           QW +L  N PS RNS+L N D KK+ + A+R+   KL+          G           
Sbjct: 336 QWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQS 395

Query: 496 -----------YYGQTRSNKVPLLNFNAIVESKTYQS---------LQQLSQNIFLPISN 535
                      Y  + +S    +L  +  +  K              +QL++   +  + 
Sbjct: 396 QLPSNEPNNTNYVQEEQSQVASILEEHETLAGKRRHEATFWLASTLCKQLAEPAVIQQAR 455

Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNG--PCYLFNLGNDPCEQNNIASSRPDI---- 589
            +K RS R     +CG     + P  C N   PC L+N+  DPCE  N+A   P++    
Sbjct: 456 QNKRRSGR--TVPNCGFASTGVAPRSCVNTGRPC-LYNVTADPCEYYNLAEYLPEVVERL 512

Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
            ++L +LLK   R++ P +  +P    A PK     W PWI
Sbjct: 513 ETRLDDLLK---RSIEPIN--KPADPAAYPKNHGGVWMPWI 548



 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
           G Y+T LFT +A QL+ +    +P+FLY +H A H  +    G  L AP +  ++F +I 
Sbjct: 156 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 215

Query: 70  DPNRRTYAALTKS 82
           D   +  A +  +
Sbjct: 216 DVECQHMAGMISA 228


>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 557

 Score =  361 bits (926), Expect = 8e-97,   Method: Compositional matrix adjust.
 Identities = 208/518 (40%), Positives = 295/518 (56%), Gaps = 47/518 (9%)

Query: 81  KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           K T + +++    GWND SFHGS EIPTPN+DALA +G+IL + YAQP+CTP+RA+L+TG
Sbjct: 78  KPTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTG 137

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
            YP HTGMQ   I   EP G+PL  + LP YL E  Y +  +GKWHLG     + P  R 
Sbjct: 138 LYPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARH 197

Query: 199 FESHFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           F +H GY NG I Y+ H  +S      + L+ H     ++   +    YAT LFTK AV 
Sbjct: 198 FNTHVGYYNGFIDYFTHEHISPGNDSLIGLDWH-----INEENENEEGYATHLFTKRAVN 252

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           LIE+    +PLF+ L+HLA HAG      +AP+E+I +F +I D NR+ YAAMV +LD S
Sbjct: 253 LIENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQS 312

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           VG V+ AL RK +L++++I+F+SDNG  T     T    N GSNYP+RG K TL+EGG +
Sbjct: 313 VGQVVEALYRKRLLDSTMIVFLSDNGGQT-----TGVMNNTGSNYPFRGQKRTLFEGGTR 367

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           V A +WS  I + PR+   +MH++DWLPT+   AG      P  +DG D           
Sbjct: 368 VSAFVWSTDIVKKPRIESGLMHVTDWLPTILKRAG---LSYPRELDGRD----------- 413

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
                     QWS L    PS R  VL N++   ++AA+R D +KL++G +   ++   +
Sbjct: 414 ----------QWSMLSEGAPSARAEVLYNLNYVDQSAALRNDRFKLIVGPRNGVSLLNDW 463

Query: 498 GQTRSNKVPLLNFNAIV-ESKTYQSLQQLSQNIFLPISNIDKMRST----RQQATIHCGA 552
            Q    K      + ++ ES T ++L++    ++  + + D + +     R +A I CG+
Sbjct: 464 DQLHKGKHSRAQLDGMMDESLTARTLKE--AGLWWKMRHGDTLSADHSTWRDKAEIKCGS 521

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS 590
           +    T       PC LF+L NDPCE NN+ +  P +S
Sbjct: 522 HDK--TLCDVHRSPC-LFDLENDPCELNNVIAKYPAVS 556



 Score = 73.2 bits (178), Expect = 4e-10,   Method: Compositional matrix adjust.
 Identities = 34/66 (51%), Positives = 45/66 (68%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           YAT LFTK AV LIE+    +PLF+ L+HLA HAG      +AP+E+I +F +I D NR+
Sbjct: 241 YATHLFTKRAVNLIENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRK 300

Query: 75  TYAALT 80
            YAA+ 
Sbjct: 301 VYAAMV 306


>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
 gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
          Length = 584

 Score =  360 bits (924), Expect = 1e-96,   Method: Compositional matrix adjust.
 Identities = 214/549 (38%), Positives = 306/549 (55%), Gaps = 51/549 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + E  RE GY T  +GKWHLGF +R +TP  RGF+ H GYL   + 
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPL 268
           YY      Q   +   NGHD R NL ++ D VG Y TD+ T  AV+ IED       +PL
Sbjct: 166 YYTQSYEQQ---SKGYNGHDFRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP E +++F+YI +   R YAAMV KLD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDNPMQAPAEEVSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSI++F+SDNG PT     T+      SNYP RG K++ WEG ++  A +WS + +
Sbjct: 283 EMLQNSIVLFLSDNGGPTQGQHSTT-----ASNYPLRGQKDSPWEGALRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   V  Q ++I D LPTL  AA G +    L++DGL+ WS+L     S     +  +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
                              D      +     WK++ GT   G  DG+ G   +++V   
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPR 437

Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
            + +  +V  +  +Q LQQ++       S    +   R QA+I C   P P T   PC  
Sbjct: 438 AVEYEELVRNTSVWQQLQQVT-------SGERNISELRDQASIEC---PNPATGVRPCLP 487

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISS--QLYELLKYHRRTLVPQSHEQPDLVQADPK 620
              PC LF++  DPCE++N+ +   + ++   L+  ++   +   P +++  D    DP+
Sbjct: 488 LEAPC-LFDIEADPCERSNLYAEYKNSTTFLDLWARIQQFAKQAQPPNNKPAD-PDCDPR 545

Query: 621 RFNDTWSPW 629
            F++ W+ W
Sbjct: 546 FFHNEWTWW 554



 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL ++ D VG Y TD+ T  AV+ IED       +PLFL L HLA HA N    ++A
Sbjct: 183 FRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAANDDNPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI +   R YAA+ 
Sbjct: 243 PAEEVSRFEYIRNKTHRYYAAMV 265


>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
          Length = 500

 Score =  357 bits (917), Expect = 8e-96,   Method: Compositional matrix adjust.
 Identities = 213/544 (39%), Positives = 284/544 (52%), Gaps = 86/544 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+S HGS EIPTPNID LA +GI+LNN Y QP+CTPSRA+LMTG++P+H G+Q   I
Sbjct: 36  GWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMTGRHPVHLGLQHDVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A+P G+ L E  LP+YL+ LGYST  +GKWHLGF+ +E+TP  RGF+SH GY  G   
Sbjct: 96  VWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKRGFDSHLGYYTGCED 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH     +  T +  G D   +        G+Y+T++FT EA ++I +  V KPLFLY
Sbjct: 156 YYDHT----WGFTKQDWGLDFWHDREVDRSAFGQYSTEVFTSEAERVIAEHDVSKPLFLY 211

Query: 272 LAHLAAHAGNAGK--HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LA  A H+GN G    L+AP + +  F  I    RR +A MV  LDDSV  V  AL  KG
Sbjct: 212 LAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSEERRVFAGMVSALDDSVSNVTKALHEKG 271

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           +L ++IIIF +DNG P        +Y +   N+P RG K T+WEGGV+    +WSP ++ 
Sbjct: 272 ILNDTIIIFSTDNGGPA----NRQDYND-ACNWPLRGSKRTMWEGGVRGNGFIWSPLLEN 326

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           +  VS  +M I DW+PT+  AAG D S+LP ++DGL QW +L     SR  S        
Sbjct: 327 SGYVSEHLMQIVDWVPTVLEAAGFDMSKLPPSLDGLSQWKAL-----SRHES-------- 373

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                   S R  +L NID      A+R    K+V G  +     G+Y            
Sbjct: 374 --------SPRKVLLHNIDPINGDGALRTTDMKIVFGGGDTERYPGWYEPEE-------- 417

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A++ S + +                                 P+     P    PC +
Sbjct: 418 --ALIRSPSKR---------------------------------PSSTACKPWLK-PC-V 440

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF----NDT 625
           FN+  DPCE  NIA   PDI  +L + +    +T VP     P  V +DPK      N  
Sbjct: 441 FNITADPCEYYNIADFHPDILQELLQKVDEFNQTAVP-----PRNVPSDPKGLPINNNGV 495

Query: 626 WSPW 629
           W PW
Sbjct: 496 WGPW 499



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 30/72 (41%), Positives = 44/72 (61%), Gaps = 2/72 (2%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK--HLEAPQETINQFQYITD 70
           G+Y+T++FT EA ++I +  V KPLFLYLA  A H+GN G    L+AP + +  F  I  
Sbjct: 184 GQYSTEVFTSEAERVIAEHDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKS 243

Query: 71  PNRRTYAALTKS 82
             RR +A +  +
Sbjct: 244 EERRVFAGMVSA 255


>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
          Length = 1007

 Score =  357 bits (916), Expect = 1e-95,   Method: Compositional matrix adjust.
 Identities = 216/561 (38%), Positives = 296/561 (52%), Gaps = 62/561 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFH S +I TPNID LAY+G+ILN  Y  P  T S+ +LMTG +P+  G Q    
Sbjct: 467 GWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQSASN 526

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +    + +PEY R+ GY+T  IGKW LGF R++YTP  RGF+SHFG+L   I 
Sbjct: 527 EPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGPYID 586

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+DH L     R     G DMRRNL   +   G YATDLF  EAV+LI +    KPL L 
Sbjct: 587 YWDHSLR---LRNTSTRGLDMRRNLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLV 643

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L HLA H GN    ++AP E + +F YI D  RR  AAM+ K+D+ VG +I  L  + ML
Sbjct: 644 LTHLAPHTGNEDDPMQAPAEEVEKFDYIRDEKRRVLAAMISKVDEGVGQIIQTLAERDML 703

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV---------------------KNT 370
           +NSI++F +DNGAPTV       + N GSN+P RGV                     K +
Sbjct: 704 DNSIVLFYADNGAPTV-----GMHSNSGSNFPLRGVCIDNWSTLHIRCTNGCFSFQQKYS 758

Query: 371 LWEGGVKVPAILWSPQIQQNP-RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
            WEG V+  A +WSP +     RVS Q +H+SDWLPTL  AAG         I+G+    
Sbjct: 759 PWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAHAAG---------IEGIPI-- 807

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
                      S IDG +QW +L     + RN V+ NIDE    ++   + WK V GT  
Sbjct: 808 ----------GSEIDGRNQWEALKNPAITVRNVVMNNIDELHHYSSYSRNGWKYVNGTSW 857

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
            G  D + G+             +        L        +P+ +++ +   R+ AT+ 
Sbjct: 858 EGKFDNWMGELEDEDELSEEEYVV-------RLAGSVVGRMMPL-DLEHVARLRRAATVE 909

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           C          P    PC LFNL +DPCE+NN+A  + DI  +L E ++ +R+T V +  
Sbjct: 910 CEVEGQVKACQP-MESPC-LFNLMDDPCERNNVALDQLDILEELREEVERYRQTAV-EPR 966

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
            +P   ++DP  +N+TW+ W+
Sbjct: 967 NKPADPRSDPGFYNNTWTWWL 987



 Score = 81.6 bits (200), Expect = 1e-12,   Method: Compositional matrix adjust.
 Identities = 39/79 (49%), Positives = 47/79 (59%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL   +   G YATDLF  EAV+LI +    KPL L L HLA H GN    ++AP E
Sbjct: 604 MRRNLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLVLTHLAPHTGNEDDPMQAPAE 663

Query: 61  TINQFQYITDPNRRTYAAL 79
            + +F YI D  RR  AA+
Sbjct: 664 EVEKFDYIRDEKRRVLAAM 682


>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
 gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
 gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
 gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
 gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
 gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
          Length = 585

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 216/550 (39%), Positives = 305/550 (55%), Gaps = 53/550 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + E  RE GY T  +GKWHLG  +R +TP  RGF+ H GYL   + 
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
           YY      Q       NGHD R +L +  D VG Y TDL T  AV+ IED   +   +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP E +++F+YI++   R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSII+F+SDNG PT     T+      SNYP RG KN+ WEG ++  A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTQGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   V  Q ++I D LPTL  AA G +    L++DGL+ WS+L     S     +  +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
                              D  +   +     WK++ GT   G  DG+ G   +++V   
Sbjct: 397 -------------------DVAEPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPR 437

Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
            + +  +V  +  +  LQQ+S        NI ++   R Q+ I C   P P T   PC  
Sbjct: 438 AVEYEELVRNTSVWLQLQQVS----FGERNISEL---RDQSRIEC---PDPATGVKPCLP 487

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADP 619
             GPC LF++  DPCE++N+ +   +  S ++  L    +    Q+H    +P     DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWSRIQQFAKQAHPPNNKPGDPNCDP 544

Query: 620 KRFNDTWSPW 629
           + +++ W+ W
Sbjct: 545 RFYHNEWTWW 554



 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
            R +L +  D VG Y TDL T  AV+ IED   +   +PLFL L HLA HA N    ++A
Sbjct: 183 FRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI++   R YAA+ 
Sbjct: 243 PAEEVSRFEYISNKTHRYYAAMV 265


>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
 gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
          Length = 591

 Score =  356 bits (914), Expect = 2e-95,   Method: Compositional matrix adjust.
 Identities = 215/548 (39%), Positives = 300/548 (54%), Gaps = 49/548 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + E  RE GY T  +GKWHLGF +R +TP  RGF+ HFGYL   + 
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHFGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
           YY      Q       NGHD R NL +  D VG Y TD+ T  AV+ IED   +   +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDHVGRYITDVLTDAAVKEIEDHGSKNSSQPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP E +++F+YI +   R YAAMV KLD SVG VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDNPMQAPAEEVSRFEYIGNKTHRYYAAMVSKLDQSVGYVIDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSI++F+SDNG PT     T+      SNYP RG K++ WEG ++  A +WS + +
Sbjct: 283 EMLQNSIVLFLSDNGGPTQGQHSTT-----ASNYPLRGQKDSPWEGALRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +   V  Q ++I D LPTL  AAG   S  P L++DGL+ WS+L     S     +  +D
Sbjct: 338 RLGSVWKQQIYIGDLLPTL--AAGAGISPDPALHLDGLNLWSALKYGYESVEREIVHVID 395

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV-- 505
           +                   D      +     WK++ GT   G  DG+ G   +++V  
Sbjct: 396 E-------------------DVAMPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDP 436

Query: 506 -PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
              +    +  +  ++ LQQ+S        NI ++   R QA+I C      + P     
Sbjct: 437 RAAVYEELVRNTSVWRQLQQVS----FGERNISEL---RDQASIECPDTANGVRPCLPLE 489

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADPKR 621
            PC LF++  DPCE++N+ +   +  S  +  L    +    QSH    +P     DP+ 
Sbjct: 490 APC-LFDIEADPCERSNLYAEYKN--STTFLDLWARIQQFAKQSHPPNNKPGDPDCDPRF 546

Query: 622 FNDTWSPW 629
           +++ W+ W
Sbjct: 547 YHNEWTWW 554



 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL +  D VG Y TD+ T  AV+ IED   +   +PLFL L HLA HA N    ++A
Sbjct: 183 FRDNLKSTHDHVGRYITDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDNPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI +   R YAA+ 
Sbjct: 243 PAEEVSRFEYIGNKTHRYYAAMV 265


>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
 gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
          Length = 585

 Score =  356 bits (913), Expect = 3e-95,   Method: Compositional matrix adjust.
 Identities = 213/550 (38%), Positives = 303/550 (55%), Gaps = 53/550 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GS+   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+P+ E  + E  RE GY T  +GKWHLG  +R +TP  RGF+ HFGYL   + 
Sbjct: 106 VNDQPWGLPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
           YY      Q       NGHD R NL +  D VG Y TD+ T  AV+ IED   +   +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP E +++F+YI++   R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSII+F+SDNG PT     T+      SNYP RG KN+ WEG ++  A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTEGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   V  Q ++I D LPTL  AA G +    L++DGL+ WS+L     S     +  +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
                              D      +     WK++ GT   G  DG+ G   +++V   
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHRETSEVDPR 437

Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
            + +  ++  +  +  LQQ+S       S    +   R Q+ I C   P P T   PC  
Sbjct: 438 AVEYEELIRNTSVWLQLQQVS-------SGERNISELRYQSRIEC---PDPATGVKPCLP 487

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE---QPDLVQADP 619
             GPC LF++  DPCE++N+ +   +  S ++  L    +    Q+H    +P     DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWARIQQFAKQAHSPNNKPGDPNCDP 544

Query: 620 KRFNDTWSPW 629
           + +++ W+ W
Sbjct: 545 RFYHNEWTWW 554



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL +  D VG Y TD+ T  AV+ IED   +   +PLFL L HLA HA N    ++A
Sbjct: 183 FRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI++   R YAA+ 
Sbjct: 243 PAEEVSRFEYISNKTHRYYAAMV 265


>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
 gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
          Length = 548

 Score =  355 bits (911), Expect = 4e-95,   Method: Compositional matrix adjust.
 Identities = 213/550 (38%), Positives = 295/550 (53%), Gaps = 50/550 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y   +CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+P  E+ + E  RE GY T  +GKWHLG  +R +TP  RGF+ H GYL   + 
Sbjct: 66  VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125

Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VD 265
           YYD         Y+R     GHD R NL+   D VG Y TD+ +  AV+LIE        
Sbjct: 126 YYDQTYQQNGKNYAR-----GHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +PLFL L+HLA HA NA   ++AP E + +F+YI +   R YAAMV +LD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
            R+ ML++SI++F+SDNG PTV    ++      SNYP RG KN+ WEGG++  A +WS 
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSST-----ASNYPLRGQKNSPWEGGLRSSAAIWST 295

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           + ++   V  Q  +I D LPTL  AA G      L +DG++ WS+L     S     +  
Sbjct: 296 EFERLGTVWKQQTYIGDLLPTL-AAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHN 354

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
           +D                   +D      A     WK+V GT      DG+ G+  + +V
Sbjct: 355 ID-------------------MDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREV 395

Query: 506 PLLNFN---AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
              + N    I  +  +Q LQ     + +   N   + S RQQA I C   P    P   
Sbjct: 396 DPRSANYEELIRNTSVWQHLQ-----LLMGSQNERNITSLRQQAQIECPEPPGATKPCLP 450

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADP 619
              PC LF++  DPCE+NN+     + +  L  L +  +     Q+H     P+    DP
Sbjct: 451 LLAPC-LFDIDADPCERNNLYEDYRNTTIFLDLLQRIEQ--FASQAHAPNNMPEDPDCDP 507

Query: 620 KRFNDTWSPW 629
           + +++ W+ W
Sbjct: 508 RFYHNEWTWW 517



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL+   D VG Y TD+ +  AV+LIE        +PLFL L+HLA HA NA   ++A
Sbjct: 144 FRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSSQPLFLLLSHLAPHAANADDPMQA 203

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E + +F+YI +   R YAA+ 
Sbjct: 204 PAEELAKFEYIRNETHRYYAAMV 226


>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
 gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
          Length = 585

 Score =  355 bits (910), Expect = 5e-95,   Method: Compositional matrix adjust.
 Identities = 213/550 (38%), Positives = 303/550 (55%), Gaps = 53/550 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GS+   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + E  RE GY T  +GKWHLG  +R +TP  RGF+ HFGYL   + 
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
           YY      Q       NGHD R NL +  D VG Y TD+ T  AV+ IED   +   +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP E +++F+YI++   R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVGSVIDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSII+F+SDNG PT     T+      SNYP RG KN+ WEG ++  A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTEGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   V  Q ++I D LPTL  AA G +    L++DGL+ WS+L     S     +  +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
                              D      +     WK++ GT   G  DG+ G   +++V   
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHRETSEVDPG 437

Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
            + +  +V  +  +  LQQ++       S    +   R ++ I C   P P T   PC  
Sbjct: 438 AVEYEELVRNTSVWLQLQQVT-------SGERNISELRDKSRIEC---PDPATGVKPCLP 487

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE---QPDLVQADP 619
             GPC LF++  DPCE++N+ +   +  S ++  L    +    Q+H    +P     DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWERIQQFAKQAHSPNNKPGDPNCDP 544

Query: 620 KRFNDTWSPW 629
           + +++ W+ W
Sbjct: 545 RFYHNEWTWW 554



 Score = 70.1 bits (170), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL +  D VG Y TD+ T  AV+ IED   +   +PLFL L HLA HA N    ++A
Sbjct: 183 FRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI++   R YAA+ 
Sbjct: 243 PAEEVSRFEYISNKAHRYYAAMV 265


>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
           purpuratus]
          Length = 479

 Score =  354 bits (909), Expect = 7e-95,   Method: Compositional matrix adjust.
 Identities = 194/493 (39%), Positives = 274/493 (55%), Gaps = 60/493 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTP+IDALA  G+IL N Y  P+CTP+R+++MTGK+PIHTG+Q   I
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+   E  +P+YLR LGY T  +GKWHLGF+ +E+TP+ RGFES FGY  G   
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFYSKEHTPIERGFESTFGYYLGQQD 159

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H    Q  R   L G D   N S      G+Y+T+++T++  ++I +    +PL++Y
Sbjct: 160 YFTH--ETQVKRKHTLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYIY 217

Query: 272 LAHLAAHAGN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           LAH A H+ N  G+ L+AP +   +F  IT+ NRR +AAMV  LDDS+G +   L+   +
Sbjct: 218 LAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESSL 277

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++I+F +DNG P   +       N+ +N+P RGVK+T WEGG++    LW   I++ 
Sbjct: 278 YNNTVIVFTTDNGGPAHGFDA-----NYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKP 332

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            R S  MMH+ DW+PTLY  AGG+TS L                       ++DG+D W 
Sbjct: 333 GRTSDGMMHVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWP 370

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            L    PS R  +L NID  +  +A+R+  +KLV G   NG+   +Y     + V     
Sbjct: 371 MLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGSWSDWYPPEGESSVD---- 426

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
              V+SK              P+ N   +    + A      +P      PC      LF
Sbjct: 427 ---VDSK--------------PVPNAFVVSCPSKPANASTNCDPK---EKPC------LF 460

Query: 571 NLGNDPCEQNNIA 583
           N+ +DPCE NNIA
Sbjct: 461 NIRHDPCEFNNIA 473



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 48/71 (67%), Gaps = 1/71 (1%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYITDP 71
           G+Y+T+++T++  ++I +    +PL++YLAH A H+ N  G+ L+AP +   +F  IT+ 
Sbjct: 190 GQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNITNE 249

Query: 72  NRRTYAALTKS 82
           NRR +AA+  +
Sbjct: 250 NRRKFAAMVSA 260


>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
 gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
          Length = 548

 Score =  354 bits (908), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 214/552 (38%), Positives = 296/552 (53%), Gaps = 54/552 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y   +CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 6   GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+P  E+ + E  RE GY T  +GKWHLG  +R +TP  RGF+ H GYL   + 
Sbjct: 66  VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125

Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VD 265
           YYD         Y+R     GHD R NL+   D VG Y TD+ +  AV+LIE        
Sbjct: 126 YYDQTYQQNGKNYAR-----GHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +PLFL L+HLA HA NA   ++AP E + +F+YI +   R YAAMV +LD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
            R+ ML++SI++F+SDNG PTV    T+      SNYP RG KN+ WEGG++  A +WS 
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTT-----ASNYPLRGQKNSPWEGGLRSSAAIWST 295

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           + ++   V  Q  +I D LPTL  AAG                   +   P  +   +DG
Sbjct: 296 EFERLGTVWKQQTYIGDLLPTLAAAAG-------------------IAPDPDLK---LDG 333

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQ--TR 501
           ++ WS+L     S    ++ NID        A     WK+V GT      DG+ G+  TR
Sbjct: 334 MNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTR 393

Query: 502 SNKVPLLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
                  N+  ++  +  +Q LQ     + +   N   + S RQQA I C        P 
Sbjct: 394 EMDPRSANYEELIRNTSVWQHLQ-----LLMGSQNERNITSLRQQAQIECPEPSGATKPC 448

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQA 617
                PC LF++  DPCE+NN+     + +  L  L +  +     Q+H     P+    
Sbjct: 449 LPLLAPC-LFDIDADPCERNNLYEDYRNTTIFLDLLQRIEQ--FASQAHAPNNMPEDPDC 505

Query: 618 DPKRFNDTWSPW 629
           DP+ +++ W+ W
Sbjct: 506 DPRFYHNEWTWW 517



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL+   D VG Y TD+ +  AV+LIE        +PLFL L+HLA HA NA   ++A
Sbjct: 144 FRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSSQPLFLLLSHLAPHAANADDPMQA 203

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E + +F+YI +   R YAA+ 
Sbjct: 204 PAEELAKFEYIRNETHRHYAAMV 226


>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
          Length = 531

 Score =  353 bits (907), Expect = 1e-94,   Method: Compositional matrix adjust.
 Identities = 212/536 (39%), Positives = 302/536 (56%), Gaps = 53/536 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D S HGS +IPTPN+DALA  G++LNN Y QP+C+PSR +LMTG YP H G++ P +
Sbjct: 37  GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GA+  G+PL  + LPE+L++LGY T  +GKWHLG+F   YTP YRGF++ FG+ NG I 
Sbjct: 97  -GAQVAGLPLQFKILPEHLKDLGYETHIVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPID 155

Query: 212 YYDHILSDQYSRTVELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY  I+        E  GH   D     S        YAT      A  +I ++   KPL
Sbjct: 156 YYRGIM--------EQEGHVGLDFWNGTSALPLKERTYATARLQNHAKSIIANRNTSKPL 207

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLAH A H+  + + L+AP E   +F YI D +R+  A M+  LD SVG ++  L++ 
Sbjct: 208 FLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSRKILAGMMDALDQSVGELVEELEKA 267

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+LE++I++F +DNG     +  T    N G N+P RG+K T+WEGG++  A +WSP+++
Sbjct: 268 GILEDTILVFSTDNGG----HPYTRADPNRGFNWPLRGIKGTVWEGGIRGTAFIWSPRLE 323

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
            + RVS Q+MHISDWLPTLY+AAGGD   L PL  DG+                     D
Sbjct: 324 VSRRVSTQLMHISDWLPTLYSAAGGDVQTLGPL--DGI---------------------D 360

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
            W SL    PS R  VL+  + +   AA+R  + K+VLGT  +     Y  +      P 
Sbjct: 361 MWQSLSTGGPSPRIEVLLEFNNETDRAALRYLNHKIVLGTFSDEVSKRY--EVIGGVRPG 418

Query: 508 LNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
           ++ +A+   SK  + L++   N  L +   DK  S R++AT+ C  +      S   +  
Sbjct: 419 IDLDALQRNSKAGRVLKRFYGNPQLFLHGFDK--SWRRRATVRCNEHGRTNFNS---SSK 473

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
            +LF+L +DPCE  N+A S   + + L+  L  + RTL P+ ++     + DPK F
Sbjct: 474 VFLFDLESDPCELRNLADSNVTLLTDLFVKLGAYLRTLRPEINK-----EIDPKGF 524



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 38/68 (55%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           YAT      A  +I ++   KPLFLYLAH A H+  + + L+AP E   +F YI D +R+
Sbjct: 185 YATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSRK 244

Query: 75  TYAALTKS 82
             A +  +
Sbjct: 245 ILAGMMDA 252


>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
 gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
          Length = 567

 Score =  352 bits (903), Expect = 4e-94,   Method: Compositional matrix adjust.
 Identities = 215/571 (37%), Positives = 301/571 (52%), Gaps = 47/571 (8%)

Query: 75  TYAALTKSTTLT----LLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
           T     K+TT T    ++I+     G++D+SF G  E  TPNIDALAY+G IL+N+YA  
Sbjct: 14  TICCRVKATTTTKRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPA 73

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +CTPSR +L++G+YP HTG Q   I   EP G+ L    +PE  +E GYST  IGKWHLG
Sbjct: 74  MCTPSRGALLSGRYPAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLG 133

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
           F   EYTP  RGF+ H+GY    I YY      Q        G+D RRN+       G Y
Sbjct: 134 FASPEYTPTRRGFDYHYGYWGAYIDYYQR--RSQMPVANYSMGYDFRRNMDLECQNRGVY 191

Query: 247 ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
            TDL T+EA  +I E    ++  FL L+HLA H  N    L+AP+E I +F YI DP RR
Sbjct: 192 ITDLLTQEAEHVIREKAAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRR 251

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            YAAMV KLD SVG +I+ L +   LENSI+IF SDNGAP+V       + N GSN+P R
Sbjct: 252 KYAAMVAKLDQSVGRIITTLAQTEQLENSIVIFYSDNGAPSV-----GQFANTGSNWPLR 306

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           G KNT WEGG++V   +WS Q+Q       Q ++++DWLPTL  AAG D     L +DGL
Sbjct: 307 GQKNTPWEGGIRVAGAIWSTQLQARGNKFTQPIYVADWLPTLAHAAGIDLPESSLQLDGL 366

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
           D W  L     S+  +++                   +L  +D+    +++ L  WK + 
Sbjct: 367 DLWPQL---AGSKDAAHVP----------------REILHALDDVWHVSSMLLGHWKYIN 407

Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQ 545
           GT  +GT D        + +   N   +VE +   + + L+      +    K+ S R  
Sbjct: 408 GTTASGTYDQVLTYRELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQ-QKITSLRHS 466

Query: 546 ATIHCGANPAPMTP--SPCTNGPCYLFNLGNDPCEQNNIASSRPD--ISSQLYELLKYHR 601
           A++ CG       P    C      L+++  DPCEQNN+  S+    +   L + L+  R
Sbjct: 467 ASVRCGNLQRGCNPLVEEC------LYDIAADPCEQNNLIYSKQHEAVLMALRKRLQEFR 520

Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPWIYR 632
            + +P  + +P + Q+DP     TW  ++ +
Sbjct: 521 LSALPPGN-RPSMRQSDPSLHECTWDNFLVK 550



 Score = 71.2 bits (173), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T+EA  +I E    ++  FL L+HLA H  N    L+AP+
Sbjct: 177 FRRNMDLECQNRGVYITDLLTQEAEHVIREKAAANETFFLMLSHLATHTANDNDPLQAPE 236

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E I +F YI DP RR YAA+ 
Sbjct: 237 EEIRKFAYIKDPRRRKYAAMV 257


>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
           purpuratus]
          Length = 476

 Score =  352 bits (902), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 191/493 (38%), Positives = 274/493 (55%), Gaps = 63/493 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTP+IDALA  G+IL N Y  P+CTP+R+++MTGK+PIHTG+Q   I
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+   E  +P+YLR LGY T  +GKWHLGFF+   TP +RGFES++GY  G+  
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGYYGGMQD 159

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  ++       L G D   N S      G+Y+T+++T++  ++I +    +PL++Y
Sbjct: 160 YFTHESTEH-----TLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYIY 214

Query: 272 LAHLAAHAGN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           LAH A H+ N  G+ L+AP +   +F  IT+ NRR +AAMV  LDDS+G +   L+   +
Sbjct: 215 LAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESSL 274

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++I+F +DNG P   +       N+ +N+P RGVK+T WEGG++    LW   I++ 
Sbjct: 275 YNNTVIVFTTDNGGPAHGFDA-----NYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKP 329

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            R S  MMH+ DW+PTLY  AGG+TS L                       ++DG+D W 
Sbjct: 330 GRTSDGMMHVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWP 367

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            L    PS R  +L NID  +  +A+R+  +KLV G   NG+   +Y     + V     
Sbjct: 368 MLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGSWSDWYPPEGESSVD---- 423

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
              V+SK              P+ N   +    + A      +P      PC      LF
Sbjct: 424 ---VDSK--------------PVPNAFVVSCPSKPANASTNCDPK---EKPC------LF 457

Query: 571 NLGNDPCEQNNIA 583
           N+ +DPCE NNIA
Sbjct: 458 NIRHDPCEFNNIA 470



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 48/71 (67%), Gaps = 1/71 (1%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYITDP 71
           G+Y+T+++T++  ++I +    +PL++YLAH A H+ N  G+ L+AP +   +F  IT+ 
Sbjct: 187 GQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNITNE 246

Query: 72  NRRTYAALTKS 82
           NRR +AA+  +
Sbjct: 247 NRRKFAAMVSA 257


>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
 gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
          Length = 583

 Score =  351 bits (901), Expect = 5e-94,   Method: Compositional matrix adjust.
 Identities = 214/551 (38%), Positives = 303/551 (54%), Gaps = 54/551 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF GSN   TPNIDALAY+G+ILNN+Y  P+CTPSRA+L+TGKYPI+TGMQ   I
Sbjct: 46  GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAALLTGKYPINTGMQHYVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL E  + +  R  GY T  IGKWHLG  +R YTP  RGF+ H GYL   + 
Sbjct: 106 VNDQPWGLPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTPTLRGFDYHLGYLGAYVD 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPL 268
           YY+    +Q S+     GHD R NL    + V +Y TD+ T  AV+ I+D       KPL
Sbjct: 166 YYNQSY-EQVSKGYR--GHDFRENLKPNHEHVDKYVTDILTDAAVREIDDHAAKNNSKPL 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL L HLA HA N    ++AP + ++ F+YI D   R YAAMV +LD SVG V+ AL R+
Sbjct: 223 FLLLNHLAPHAANDADPMQAPADELSGFEYIRDETHRYYAAMVSRLDKSVGHVVDALARQ 282

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            ML+NSI++F+SDNG PT+    T+      SNYP RG KN+ WEGG++  A +WS + +
Sbjct: 283 DMLQNSILLFLSDNGGPTLGEHATT-----ASNYPLRGQKNSPWEGGIRSSAAIWSTEFE 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +   V  Q ++I D LPTL  AAG                   +   P  R   +DGL+ 
Sbjct: 338 RLGSVWKQQIYIGDLLPTLAAAAG-------------------ISPDPDLR---LDGLNL 375

Query: 449 WSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           WS+L     S    ++  IDE+  +   A     WK+V GT  +G  DG+ G+  + +V 
Sbjct: 376 WSALKYGYESVEREIVHVIDEEAPEPHLAYTRGKWKVVSGTTNSGRYDGWLGERVTTEVD 435

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNID----KMRSTRQQATIHCGANPA-PMTPSP 561
                    S+ Y++L Q + +++L +  +      +   RQQA I C A      TP  
Sbjct: 436 -------PRSQEYEALIQ-NTSVWLQLQKVSGGERNISGLRQQARIECPATTEDEHTPCL 487

Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQAD 618
               PC LF++  DPCEQNN+     + +  +++ L    +    Q+H    +P     +
Sbjct: 488 PLEAPC-LFDIEADPCEQNNLYEEYKNTT--VFQDLWQRVQQFTAQAHPPNNKPGDPSCN 544

Query: 619 PKRFNDTWSPW 629
           P  +++ W+ W
Sbjct: 545 PAFYHNEWTWW 555



 Score = 64.7 bits (156), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
            R NL    + V +Y TD+ T  AV+ I+D       KPLFL L HLA HA N    ++A
Sbjct: 183 FRENLKPNHEHVDKYVTDILTDAAVREIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQA 242

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P + ++ F+YI D   R YAA+ 
Sbjct: 243 PADELSGFEYIRDETHRYYAAMV 265


>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
          Length = 502

 Score =  350 bits (899), Expect = 9e-94,   Method: Compositional matrix adjust.
 Identities = 200/504 (39%), Positives = 286/504 (56%), Gaps = 57/504 (11%)

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           GMQG  I G EPRG+PL+ R LPE+LR LGY+TK IGKWHLG+   ++TPL+RGF+S  G
Sbjct: 2   GMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLG 61

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           + N  +SYYD+  S Q      ++G+DM R  + A+    +Y TDLFT EAV++I+    
Sbjct: 62  FYNSHVSYYDYKYSYQ-----NMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDP 116

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETIN---QFQYITDPNRRTYAAMVKKLDDSVGTV 321
            +PL+L ++HLA HA      LE PQ+  +   +F +I + NRR YA MV KLDDSVG +
Sbjct: 117 SRPLYLQISHLAVHAP-----LENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLI 171

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           ++ L  +GML +S+I+F++DNGA  V       +RN+GSN+P RG+K TL+EGGV+  A+
Sbjct: 172 VNTLGSRGMLRDSVILFLTDNGAAPV-----GKFRNYGSNFPLRGMKYTLYEGGVRGVAV 226

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           LWSP++ +  RV   +MHI+DWLPTLY AA GD   L                       
Sbjct: 227 LWSPRLSKAARVCNNLMHITDWLPTLYAAADGDLRDL----------------------G 264

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
            IDG++QW  L    P  R+ +L+NIDE  +T       +KL+ G+ E+G  DG++    
Sbjct: 265 KIDGVNQWRMLNDGYPIVRDKLLLNIDEISKTEGAIYKQYKLLRGSIEDGHYDGHHRHDL 324

Query: 502 SNKVPLL----------NFNAIVESKTYQSL-----QQLSQNIFLPISNIDKMRSTRQQA 546
                +L          N    V   T   L     Q ++ ++  P++    M   R +A
Sbjct: 325 ERGYNILPPFFGNNSRKNIQENVPPYTETILISPVSQSITYHLGGPVTQPSTMVQLRNEA 384

Query: 547 TIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
           T+ C  N +  +   C    C LF++ NDPCE  NIA   P I+ +L ++L+ + R +  
Sbjct: 385 TVSCRFNVSYYSFVTCNVTEC-LFDIENDPCETKNIAEQYPRIARELDQILEQYARNVAM 443

Query: 607 QSHEQPDLVQADPKRFNDTWSPWI 630
           Q     D + ADP+R N+TW PWI
Sbjct: 444 QIKPSVDWL-ADPRRTNNTWEPWI 466



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 8/83 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M R  + A+    +Y TDLFT EAV++I+     +PL+L ++HLA HA      LE PQ+
Sbjct: 84  MHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLYLQISHLAVHAP-----LENPQD 138

Query: 61  TIN---QFQYITDPNRRTYAALT 80
             +   +F +I + NRR YA + 
Sbjct: 139 YDHYDKRFMHIVEQNRRKYAKMV 161


>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
 gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
          Length = 554

 Score =  348 bits (893), Expect = 5e-93,   Method: Compositional matrix adjust.
 Identities = 207/546 (37%), Positives = 290/546 (53%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +E GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+D    ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI DPNRR YAAM+ KLD SVG +IS L  
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIISTLSS 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              L+NSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS ++
Sbjct: 273 TDQLDNSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AAG +     L +DG+D W  L                
Sbjct: 328 QARGSIFSQPLYVADWLPTLSRAAGIELDS-SLKLDGIDLWPEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R +A+++  WK V GT  +G  D        + +  
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTADGRYDSVLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
                 V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++  DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLTALRRRVR-ELRASASRPGNRASMPEADPTLHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+D    ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255


>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
 gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
          Length = 560

 Score =  347 bits (890), Expect = 1e-92,   Method: Compositional matrix adjust.
 Identities = 217/565 (38%), Positives = 298/565 (52%), Gaps = 90/565 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+S  G+ E  TPNIDALAY+G +L+ +YA  +CTPSR +L++GKYPIHTG Q   I
Sbjct: 44  GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHSVI 103

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L    +PE  R+ GYST  +GKWHLGF R EYTP YRGF+ H+GY  G I 
Sbjct: 104 LNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTYRGFDYHYGYWGGYID 163

Query: 212 YY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKP 267
           YY     + SD YS      G+D RRN+       G Y TDL T EA ++I +QP  ++P
Sbjct: 164 YYQRRSQMPSDNYSM-----GYDFRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQP 218

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLA H GN    L+AP+E + +F +I DPNRR YAAMV KLD+SVG VI+AL R
Sbjct: 219 LFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDESVGRVITALAR 278

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P +G K++ WEGG++V   +WSP +
Sbjct: 279 TDQLENSIVIFYSDNGAPSVGI-----FANTGSNWPLKGQKDSPWEGGLRVAGAIWSPLL 333

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +    +  Q +++ D+LPTL  AAG +     L++DG+D W    L+ P           
Sbjct: 334 KARGNLFTQPIYVGDFLPTLAHAAGIELDS-SLHLDGIDLWPQ--LSGPK---------- 380

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD------------- 494
                  + P     +L  +D+  R  ++ L  +K V GT   G  D             
Sbjct: 381 -------DAPHVPREILHQLDDVGRMQSLLLGQYKYVNGTTSKGQYDQLLTYRKLDDLDP 433

Query: 495 --GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-- 550
               YG    N       +A+ +      +QQ++Q         D++    + A + C  
Sbjct: 434 RASIYGMAVRN-------SAVSKELASFDIQQITQ---------DRIDILSRSAAVICSD 477

Query: 551 ---GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR------ 601
              G N        C      L++L  DPCEQNN+A S+     Q  E+L   R      
Sbjct: 478 MQRGCN---ALLEEC------LYDLSTDPCEQNNLAYSK-----QHSEILAIMRQRVQQL 523

Query: 602 RTLVPQSHEQPDLVQADPKRFNDTW 626
           R+       +P+   ADP     TW
Sbjct: 524 RSGAVAPGNRPNERDADPSLHECTW 548



 Score = 80.5 bits (197), Expect = 2e-12,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 52/81 (64%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I +QP  ++PLFL L+HLA H GN    L+AP+
Sbjct: 182 FRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFLILSHLAPHTGNEIDPLQAPE 241

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E + +F +I DPNRR YAA+ 
Sbjct: 242 EELQKFVHINDPNRRLYAAMV 262


>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
 gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
          Length = 565

 Score =  347 bits (889), Expect = 2e-92,   Method: Compositional matrix adjust.
 Identities = 208/546 (38%), Positives = 289/546 (52%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +E GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+     ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKGHAGKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI DPNRR YAAM+ KLD SVG ++S+L  
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIMSSLAA 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AAG       L +DG+D W+ L                
Sbjct: 328 QARGSIFSQPLYVADWLPTLCRAAGIKLDS-TLKLDGIDLWAEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R AA++L  WK V GT   G  D        + +  
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
                 V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSRP--DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++  DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDISTDPCEQNNLVYSEKHSDVLAALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 79.0 bits (193), Expect = 7e-12,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 50/80 (62%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+     ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMELECRDRGVYVTDLLTAEAERLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255


>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
 gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
 gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
          Length = 564

 Score =  345 bits (884), Expect = 5e-92,   Method: Compositional matrix adjust.
 Identities = 207/546 (37%), Positives = 291/546 (53%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +E GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+D    ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI DPNRR YAAM+ KLD SVG +I+AL  
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIITALSS 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS  +
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSGL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AA  +     L +DG+D W  L                
Sbjct: 328 QARGSIFRQPLYVADWLPTLSRAADIELDS-SLKLDGIDLWPEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R +A+++  WK V GT  +G  D        + +  
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTASGRYDSVLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
            +    V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RDSRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++ +DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDILSDPCEQNNLVYSERHSDVLTALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 82.0 bits (201), Expect = 9e-13,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+D    ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255


>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
 gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
          Length = 565

 Score =  344 bits (883), Expect = 7e-92,   Method: Compositional matrix adjust.
 Identities = 207/546 (37%), Positives = 287/546 (52%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+  YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +  GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+D    ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI DPNRR YAA++ KLD SVG ++S L  
Sbjct: 213 LFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLDQSVGRIMSTLAA 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AAG +     L +DG+D W  L                
Sbjct: 328 QARGSIFSQPLYVADWLPTLSHAAGIELDS-SLKLDGIDLWPEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R AA++L  WK V GT   G  D        + +  
Sbjct: 371 ---SGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
                 V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++  DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLAALRRRVR-ELRASASRPGNRASMPEADPALHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+D    ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFAYIKDPNRRKYAAI 255


>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
 gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
          Length = 559

 Score =  344 bits (883), Expect = 8e-92,   Method: Compositional matrix adjust.
 Identities = 217/563 (38%), Positives = 297/563 (52%), Gaps = 72/563 (12%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           A+   +  + L+   G+ND+SFHGSN+I TPNIDALAYNGI+LN  Y   +CTPSRA+L+
Sbjct: 24  ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKYPIHTGMQ   I   EP G+P  ER +PE  R+ GY+T  +GKWHLGF+R++ TP  
Sbjct: 84  TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143

Query: 197 RGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           RGF+ HFGY NG + YYD    +L   YS      G D RR+L    +  G YAT+ FT 
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYS-----TGLDFRRDLEPCREAEGTYATEAFTT 198

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           EA ++IE     +PLF+ L+HLA H GN    ++AP+E + +F +I DP RRTYA M+  
Sbjct: 199 EARKVIERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISS 258

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD SVG  + AL   GML NSI++  SDNGAPTV       + N GSNYP+RG       
Sbjct: 259 LDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVGI-----HSNAGSNYPFRG------- 306

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
                          Q   VS Q +H  DWLPTL  AAG                    +
Sbjct: 307 ---------------QRGYVSNQAIHAIDWLPTLAAAAG--------------------V 331

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
           + PS  +  +DGL+ W SL  +   +  ++L  +D+    ++   D+ K V G+   G  
Sbjct: 332 SLPS--DLRLDGLNLWPSLSASAQPQPRNLLHVLDDVFGYSSYTQDTLKYVNGSSFEGRY 389

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS--QNIFLPIS-NIDKMRSTRQQATIHC 550
           D + G+       L +      SK Y  L + S  Q +   ++   D++R  R +AT  C
Sbjct: 390 DHWLGE-------LEDGEEDPNSKDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC 442

Query: 551 GANPA----PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
                    P+         C LF+L NDPCE+ N+A+  P     L + ++  R T +P
Sbjct: 443 PIQERDPEDPVYLCEPLKAAC-LFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALP 501

Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
            +      V+ADP   +  W  W
Sbjct: 502 SARVPRADVRADPALHDGHWEWW 524



 Score = 79.3 bits (194), Expect = 5e-12,   Method: Compositional matrix adjust.
 Identities = 36/82 (43%), Positives = 51/82 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L    +  G YAT+ FT EA ++IE     +PLF+ L+HLA H GN    ++AP+E
Sbjct: 177 FRRDLEPCREAEGTYATEAFTTEARKVIERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I DP RRTYA +  S
Sbjct: 237 EVAKFAHIRDPKRRTYAGMISS 258


>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
 gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
          Length = 575

 Score =  344 bits (882), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 197/499 (39%), Positives = 280/499 (56%), Gaps = 41/499 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALA++G IL+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 48  GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L    +PE  ++ GYST  IGKWHLGF R EYTP  RGF+ H+GY    I 
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
           YY    S   +R   L G+D RRN+       G Y TDL T EA ++I E +  ++PLFL
Sbjct: 168 YYQR-RSKMPARNYSL-GYDFRRNMELECRDRGVYVTDLLTNEAERVIREREGQEEPLFL 225

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+HLA H  N    L+AP+E I +F YI DPNRR YAAMV KLD SVG ++SAL   G 
Sbjct: 226 VLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQ 285

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS Q+Q  
Sbjct: 286 LENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSAQLQAR 340

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             +  Q ++++DWLP+L  AAG +   LP  ++                   +DG+D W 
Sbjct: 341 GNIFTQPIYVADWLPSLAHAAGIE---LPHTLE-------------------LDGIDLWP 378

Query: 451 SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
            L   ++ P     +L  +D++ R  + ++  WK + GT   G  D        + +   
Sbjct: 379 QLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTYRELDDLDPR 438

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNGP 566
               +V  +   + + L+++    ++   ++ + R+ A + CG       P    C    
Sbjct: 439 ESRYVVTVRNSPTSRALARHDIRRMTQ-QRVMTLRRSAAVRCGDLQRGCNPLVEEC---- 493

Query: 567 CYLFNLGNDPCEQNNIASS 585
             ++++ +DPCE+NN+  S
Sbjct: 494 --VYDIWDDPCEKNNLVYS 510



 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I E +  ++PLFL L+HLA H  N    L+AP+
Sbjct: 186 FRRNMELECRDRGVYVTDLLTNEAERVIREREGQEEPLFLVLSHLATHTANEDDPLQAPE 245

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E I +F YI DPNRR YAA+ 
Sbjct: 246 EEIRKFAYIKDPNRRKYAAMV 266


>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
 gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
          Length = 554

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 206/546 (37%), Positives = 288/546 (52%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +E GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+D    ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI D NRR YAAM+ KLD SVG +IS L  
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAMISKLDQSVGRIISTLSS 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +W+ ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWNSKL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AAG +     L +DG+D W  L                
Sbjct: 328 QARGSIFSQPLYVADWLPTLSRAAGIELDS-SLKLDGIDLWPEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R +A+++  WK V GT   G  D        + +  
Sbjct: 371 ---SGTADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTAAGRYDSVLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
                 V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++  DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLTALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 78.6 bits (192), Expect = 9e-12,   Method: Compositional matrix adjust.
 Identities = 40/80 (50%), Positives = 50/80 (62%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+D    ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI D NRR YAA+
Sbjct: 236 EEIQKFSYIKDSNRRKYAAM 255


>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
          Length = 500

 Score =  343 bits (881), Expect = 1e-91,   Method: Compositional matrix adjust.
 Identities = 197/495 (39%), Positives = 284/495 (57%), Gaps = 40/495 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D S HGS +IPTPN+DALA  G++LNN Y QP+C+PSRA+LMTG YP H G++  P+
Sbjct: 37  GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRAALMTGLYPAHNGIR-MPL 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            GA+  G+PL  + LPE+L++LGY T  +GKWHLG     YTP YRGF++ FG+ NG I 
Sbjct: 96  MGAQVAGLPLQFKILPEHLKDLGYETHMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPID 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY  I+  +    + L+  +  R L         YAT  F   A  +I ++   KPLFLY
Sbjct: 156 YYHGIMEQE--GHIGLDFWNGTRALPLEERI---YATTRFQDHANYIIANRNASKPLFLY 210

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LAH A H+    + L+AP E   +F ++ D +R++ A MV  LD SVG ++  L++ G+L
Sbjct: 211 LAHQAVHSAYEPEFLQAPGENTKKFPFLGDASRKSLAGMVDALDQSVGELVEELEKAGIL 270

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           E++I++F +DNG     +  T +  N G N+P RG+K T+WEGG++  A +WSP+++ + 
Sbjct: 271 EDTILVFSTDNGG----HPYTRSDPNRGFNWPLRGIKGTVWEGGIRGTAFIWSPRLELSS 326

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           RVS Q+MHISDWLPTLY+AAGG+   L   +DG+                     D W S
Sbjct: 327 RVSTQLMHISDWLPTLYSAAGGNVQNLG-RLDGI---------------------DMWQS 364

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L    PS R  VL+  + +   AA+R  + K+VLG   N   + Y  +      P ++ +
Sbjct: 365 LSTGGPSPRTEVLLEFNNETDRAALRYLNHKIVLGIYSNEVSERY--EVVGGVRPGIDLD 422

Query: 512 AIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
            +   SK  + L +   N  L +   DK  S R +AT+ C  N  P   +  +    +LF
Sbjct: 423 VLQRNSKAGRVLARFYGNPQLFLRGFDK--SWRSRATVRCNEN-GPTNFNSSSQ--VFLF 477

Query: 571 NLGNDPCEQNNIASS 585
           +L  DPCE +N+A S
Sbjct: 478 DLETDPCELHNLADS 492



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 39/68 (57%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           YAT  F   A  +I ++   KPLFLYLAH A H+    + L+AP E   +F ++ D +R+
Sbjct: 185 YATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFLGDASRK 244

Query: 75  TYAALTKS 82
           + A +  +
Sbjct: 245 SLAGMVDA 252


>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
 gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
          Length = 565

 Score =  343 bits (880), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 206/546 (37%), Positives = 287/546 (52%), Gaps = 46/546 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+  YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 38  GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  + L    +PE  +  GYST  +GKWHLGF R EYTP  RGF+ HFGY    I 
Sbjct: 98  SNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           Y+      + S+    N   G+D RRN+       G Y TDL T EA +LI+D    ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLAAH  N    L+AP+E I +F YI DPNRR YAA++ KLD SVG ++S L  
Sbjct: 213 LFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLDQSVGRIMSTLAA 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP++       + N GSN+P RG KNT WEGGV+V   +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSL-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    +  Q ++++DWLPTL  AAG +     L +DG+D W  L                
Sbjct: 328 QARGSIFSQPLYVADWLPTLSHAAGIELDS-SLKLDGIDLWPEL---------------- 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              S   + P     +L  +D+  R AA++L  WK V GT   G  D        + +  
Sbjct: 371 ---SGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
                 V  +   + + LS+     ++   ++  TR+ A + CG       P    C   
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483

Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
              L+++  DPCEQNN+  S    D+ + L   ++   R    +   +  + +ADP    
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLAALRRRVR-ELRASASRPGNRASMPEADPALHT 539

Query: 624 DTWSPW 629
             W  +
Sbjct: 540 CAWESF 545



 Score = 82.0 bits (201), Expect = 8e-13,   Method: Compositional matrix adjust.
 Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI+D    ++PLFL L+HLAAH  N    L+AP+
Sbjct: 176 FRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPE 235

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFAYIKDPNRRKYAAI 255


>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
 gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
          Length = 575

 Score =  343 bits (879), Expect = 2e-91,   Method: Compositional matrix adjust.
 Identities = 204/542 (37%), Positives = 296/542 (54%), Gaps = 44/542 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALA++G IL+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 48  GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L    +PE  ++ GYST  IGKWHLGF R EYTP  RGF+ H+GY    I 
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
           YY    S   +R   L G+D RRN+       G Y TDL T EA ++I E +  ++PLFL
Sbjct: 168 YYQR-RSKMPARNYSL-GYDFRRNMELECRDRGVYVTDLLTNEAERVIREREGQEQPLFL 225

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+HLA H  N    L+AP+E I +F YI DPNRR YAAMV KLD SVG ++SAL   G 
Sbjct: 226 VLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQ 285

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS Q+Q  
Sbjct: 286 LENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSAQLQAR 340

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             +  Q ++++DWLP+L  AAG     LP  ++                   +DG+D W 
Sbjct: 341 GNIFTQPIYVADWLPSLAHAAG---IGLPHTLE-------------------LDGIDLWP 378

Query: 451 SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
            L   ++ P     +L  +D++ R  + ++  WK + GT   G  D        + +   
Sbjct: 379 QLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTYRELDDLDPR 438

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNGP 566
               +V  +   + + L+++    ++   ++ + R+ A + CG       P    C    
Sbjct: 439 ESRYVVTVRNSPTSRALARHDIRRMTQ-QRVMTLRRSAAVRCGDLQRGCNPLVEEC---- 493

Query: 567 CYLFNLGNDPCEQNNIASS--RPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
             ++++ +DPCE+NN+  S    ++ + L + ++  R       + Q  + ++DP     
Sbjct: 494 --VYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQMLREGASAPGNRQ-GIRRSDPTLHKC 550

Query: 625 TW 626
           TW
Sbjct: 551 TW 552



 Score = 77.4 bits (189), Expect = 2e-11,   Method: Compositional matrix adjust.
 Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I E +  ++PLFL L+HLA H  N    L+AP+
Sbjct: 186 FRRNMELECRDRGVYVTDLLTNEAERVIREREGQEQPLFLVLSHLATHTANEDDPLQAPE 245

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E I +F YI DPNRR YAA+ 
Sbjct: 246 EEIRKFAYIKDPNRRKYAAMV 266


>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
 gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
          Length = 570

 Score =  342 bits (878), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 213/547 (38%), Positives = 286/547 (52%), Gaps = 52/547 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+SF G  E  TPNIDALAY+G +L+ +YA  +CTPSR +L++G+YPIHTG Q   I
Sbjct: 41  GFDDVSFRGGREFITPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  +      +PE  +  GYST  +GKWHLGF R EYTP +RGF+ H+GY    I 
Sbjct: 101 SNEEPWALDSNATLMPEIFQRAGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYID 160

Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
           YY      + S+    N   G+D RRN+       G Y TDL T EA +LI++Q   DKP
Sbjct: 161 YY-----QRRSKMPVANYSLGYDFRRNMELECRDRGTYVTDLLTTEAERLIKEQAGKDKP 215

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL L+HLA H  N    L+AP+E I +F YI DPNRR YAAMV KLD SVG +IS L  
Sbjct: 216 LFLMLSHLATHTANEDDPLQAPEEEIRKFSYIKDPNRRKYAAMVSKLDQSVGRIISTLAS 275

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
              LENSI+IF SDNGAP+V       + N GSN+P RG KNT WEGGV+V   +WS ++
Sbjct: 276 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAVWSSKL 330

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +    +  Q +++ DWLPTL  AA  D    P  + GL                  DG+D
Sbjct: 331 EARGSIFTQPIYVGDWLPTLAHAA--DIELEPKILAGL------------------DGID 370

Query: 448 QWSSL--LLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
            W  L    + P     +L ++D+  +  ++ +  WK V GT   G  D        + +
Sbjct: 371 LWPQLSGAADAPHVPREILHSLDDVWKVGSLLMGHWKYVNGTTSAGQYDSVLTYREFDDI 430

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR--STRQQATIHCGANPAPMTP--SP 561
                   V  +   + + L++   L +  + + R    R+ A +HCG       P    
Sbjct: 431 DPRESRYAVTIRNSPASRALTR---LDLRRLTQQRIAGLRRSAAVHCGDFQRSCNPLLED 487

Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR--RTLVPQSHEQPDLVQADP 619
           C      LF++  DPCEQNN+A S    S  L  L K  R  R    +   +  +  ADP
Sbjct: 488 C------LFDISADPCEQNNLAYSERH-SDVLAALRKRFRELRASASRPGNRASMPSADP 540

Query: 620 KRFNDTW 626
                TW
Sbjct: 541 TWHTCTW 547



 Score = 83.6 bits (205), Expect = 3e-13,   Method: Compositional matrix adjust.
 Identities = 42/81 (51%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA +LI++Q   DKPLFL L+HLA H  N    L+AP+
Sbjct: 179 FRRNMELECRDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPE 238

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E I +F YI DPNRR YAA+ 
Sbjct: 239 EEIRKFSYIKDPNRRKYAAMV 259


>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
 gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
          Length = 619

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 214/548 (39%), Positives = 293/548 (53%), Gaps = 56/548 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+S  G+ E  TPNIDALAY+G +L+ +YA  +CTPSR +L++GKYPIHTG Q   I
Sbjct: 44  GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHYVI 103

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L    +PE  RE GYST  +GKWHLGF R EYTP +RGF++H+GY    I 
Sbjct: 104 GNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFDNHYGYWGAYID 163

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFL 270
           YY      Q        G+D RRN+       G Y TDL T EA ++I +QP  ++PLFL
Sbjct: 164 YYQR--RSQMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFL 221

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+HLA H GN  K L+AP+E + +F +I DPNRR YAAMV KLD+SVG VI+AL R   
Sbjct: 222 ILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDESVGRVITALARTDQ 281

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           LENSI+IF SDNGAP+V       + N GSN+P +G KN+ WEGG++V   +WSP ++  
Sbjct: 282 LENSIVIFYSDNGAPSV-----GMFSNTGSNWPLKGQKNSPWEGGLRVAGAIWSPLLKAR 336

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             +  Q +++ D+LPTL  AAG +     L++DG+D W  L                   
Sbjct: 337 GNLFTQPIYVGDFLPTLAHAAGIELDS-SLHLDGIDLWPQL------------------- 376

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           S   + P     +L  +D+  R  ++    +K V GT   G  D        + +     
Sbjct: 377 SGPKDAPHVPREILHQLDDVWRVQSLFQGQYKYVNGTTLAGQFDQVLTYRELDDLDPRAS 436

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC--- 567
             +VE +   + + L++         D  R T  QA I+  +  A +  S    G C   
Sbjct: 437 RYVVEVRNSPASRALAR--------YDLQRIT--QARINTISRSAAVICSDMQRG-CNAL 485

Query: 568 ---YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR------RTLVPQSHEQPDLVQAD 618
               L++L  DPCEQNN+A S+     Q  E+L   R      R      + +P    AD
Sbjct: 486 LEECLYDLSTDPCEQNNLAYSK-----QHSEILTSMRQRVQQLRAGAMAPNNRPSERDAD 540

Query: 619 PKRFNDTW 626
           P     TW
Sbjct: 541 PSLHECTW 548



 Score = 84.3 bits (207), Expect = 2e-13,   Method: Compositional matrix adjust.
 Identities = 40/81 (49%), Positives = 53/81 (65%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I +QP  ++PLFL L+HLA H GN  K L+AP+
Sbjct: 182 FRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFLILSHLAPHTGNTNKPLQAPE 241

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E + +F +I DPNRR YAA+ 
Sbjct: 242 EELRKFTHIKDPNRRLYAAMV 262


>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 548

 Score =  342 bits (877), Expect = 3e-91,   Method: Compositional matrix adjust.
 Identities = 203/544 (37%), Positives = 288/544 (52%), Gaps = 49/544 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS++IPTPNID LA +GI L+N Y QP+CTPSRA+LMTG YPI TGMQ   I
Sbjct: 48  GWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQPMCTPSRAALMTGLYPIRTGMQHWVI 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGK--WHLGFFRREYTPLYRGFESHFGYLNGV 209
              EP G+PL  + +PE+L++LGYST  +GK  + L  F      L     +   +    
Sbjct: 108 RSPEPWGLPLELKLMPEHLKDLGYSTHLVGKVLFDLKKFIVSVNRLCINESTEVCH---- 163

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            ++   +    Y       G D R       +  G+YAT LFT  A+ +IE     KPLF
Sbjct: 164 -TFVSAVTLCIYFVYKSHAGLDFRNGEEPFHNDTGQYATTLFTDRAISIIEQHNQTKPLF 222

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL+HLA H     + L+AP E + +F YI + +R  YA MV  LD SVG V+ AL R G
Sbjct: 223 LYLSHLAPHGATHDEPLQAPDENVEKFDYIGEEDRTIYAGMVDALDVSVGRVVEALSRAG 282

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           MLENSI+ F SDNGA    +R      N G N+P RG+K ++WEGGV+VP  +WSP +++
Sbjct: 283 MLENSIVAFSSDNGAVPFGFRS-----NRGFNWPLRGIKASVWEGGVRVPGFVWSPLLRK 337

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           + R+S QMMHI+DWLPTLY AAGG+T  L                        +DG + W
Sbjct: 338 SARLSTQMMHITDWLPTLYAAAGGNTKNL----------------------GQLDGHNMW 375

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
            +L     S R  +L+NID    +A +R    KL++    NGT D +        VP   
Sbjct: 376 EALSRGWRSPRTEILLNIDPITGSAGLRFGKHKLLI-PGVNGTTDAH--------VPTTG 426

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKM---RSTRQQATIHCGANPAPMTPSPCTNGP 566
           +    E       Q L+  +     N+D +   R     A + CG  PA    +  +   
Sbjct: 427 YPRPTEDMDTMMQQSLAAKVLRRFYNVDGIWFRRGWENSAEVDCG--PAEWQSNFVSGRV 484

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
             LF++  DPCE  N+A++R      +   L+ +   ++P  +++ D  ++ P++    W
Sbjct: 485 YNLFDIERDPCEMINLAATRRGTVRMMMRKLQSYAAAMIPPRNKEED-PRSFPEKHGGVW 543

Query: 627 SPWI 630
           + W+
Sbjct: 544 TTWL 547



 Score = 70.1 bits (170), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 31/75 (41%), Positives = 46/75 (61%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+YAT LFT  A+ +IE     KPLFLYL+HLA H     + L+AP E + +F YI + +
Sbjct: 197 GQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIGEED 256

Query: 73  RRTYAALTKSTTLTL 87
           R  YA +  +  +++
Sbjct: 257 RTIYAGMVDALDVSV 271


>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
          Length = 613

 Score =  341 bits (874), Expect = 9e-91,   Method: Compositional matrix adjust.
 Identities = 216/612 (35%), Positives = 323/612 (52%), Gaps = 91/612 (14%)

Query: 73  RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           +R Y     +    L    GWND+SFHGS++IPTP+IDALA  G+IL N Y  P+CTP+R
Sbjct: 20  QRVYTDQPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTR 79

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK---------- 182
           +++MTGK+PIHTG++   I  + P G+ L E+ +P+YLRELGY T  +GK          
Sbjct: 80  SAIMTGKHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGV 139

Query: 183 ---WHLGF-------FRRE------------YTPLYRGFESHFGYLNGVISYYDH-ILSD 219
              + LG        + RE             TP +RGFES +GY  G+  YY H I SD
Sbjct: 140 SHPYGLGLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSD 199

Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
                  + G D   + S      G+Y+T++FT+   ++I      +PL++YLAH A H+
Sbjct: 200 G-----NMTGFDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHS 254

Query: 280 GN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
            N  G+ L+AP E   +F  IT  NRR YAAMV  LDDS+G +  +L+   +  N++I+F
Sbjct: 255 ANYDGQRLQAPHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSLKESSLYNNTLIVF 314

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMM 398
            +DNG P   +       N+ SN+P RG+K+T WEGG++    LW   I++  R+S  MM
Sbjct: 315 TTDNGGPANGFD-----FNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKPGRMSDGMM 369

Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
           H+ DW+PTLY  AGG+TS L                       ++DG+D W  L     S
Sbjct: 370 HVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWPMLSRGKQS 407

Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY------GQTRSNKV-----PL 507
            R  +L  ID     +A+R+  +KLV G +  G  +G+Y       Q+   ++     PL
Sbjct: 408 PRQEILHTIDPLLDLSAIRIGDYKLVQGQRGTGR-NGWYPPEGVGKQSPRQEILHTIDPL 466

Query: 508 LNFNAIV--ESKTYQSLQQLSQNIFLP-----ISNIDKMRSTRQQATIHCGANPAPMTPS 560
           L+ +AI   + K  Q  +   +N + P     ++++D  +       + C   PA  + +
Sbjct: 467 LDLSAIRIGDYKLVQGQRGTGRNGWYPPEGVSLADLDS-KPVPNAFVVSCPYKPANASTN 525

Query: 561 PCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
            C     PC LFN+ +DPCE NNIA+   DI   L   ++ +R T+VP  +++PD  ++ 
Sbjct: 526 -CNPYKKPC-LFNIRHDPCEFNNIANWNQDIVELLLTRIEEYRATMVPIRNKEPD-PRSY 582

Query: 619 PKRFNDTWSPWI 630
           PK  ++ W PW+
Sbjct: 583 PKYHDNCWVPWV 594



 Score = 60.8 bits (146), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 45/73 (61%), Gaps = 1/73 (1%)

Query: 11  TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYIT 69
             G+Y+T++FT+   ++I      +PL++YLAH A H+ N  G+ L+AP E   +F  IT
Sbjct: 217 VFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQRLQAPHEYYKRFPNIT 276

Query: 70  DPNRRTYAALTKS 82
             NRR YAA+  +
Sbjct: 277 HENRRKYAAMVAA 289


>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
          Length = 438

 Score =  340 bits (872), Expect = 1e-90,   Method: Compositional matrix adjust.
 Identities = 186/470 (39%), Positives = 281/470 (59%), Gaps = 50/470 (10%)

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           PVCTPSR + ++G++P+ TGMQG P+  AEPRG+ L +  LPEYLR+LGY+T  +GKWH+
Sbjct: 2   PVCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHV 61

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTV 243
           G+  R Y P  RGF++  GY NGVI Y++H + +      E  G+D+ R +      +  
Sbjct: 62  GYLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEEN-----EQVGYDLHRIVGDNHTVEYR 116

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITD 301
            +Y T+L T+EA  +I     +KP++L LAHLA+HA NA + +E    +ET     YI D
Sbjct: 117 YDYMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQD 176

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
            NRR +A++V  LD S+G V+ AL +K ML+N+II+F+SDNGA T    E   Y+N+GSN
Sbjct: 177 VNRRKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQT----EGPLYQNFGSN 232

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
           YP RG+K +L+EGGV+  A ++SP I+ + +VS Q+ HI+DWLPTLY+AAGG++S L   
Sbjct: 233 YPLRGLKFSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDL-YQ 291

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
           +DG+DQ                     WS++     S+R SV+INI++   +AA+ +  +
Sbjct: 292 LDGIDQ---------------------WSTIKTQEASKRKSVVINIEDVDYSAAL-MGRY 329

Query: 482 KLVLGTQENGTMDGYYGQTRSN--KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKM 539
           KL+   Q+      +Y   + N    P  N   I+ES    ++  +S +I       +K+
Sbjct: 330 KLI---QDKSDFQKHYTNYKGNDPSYPKYNVTNILESPAASAIANISNHIVTA----NKI 382

Query: 540 RSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
              R++AT+ C      +  S C +  C LF++  DPCE  +I+S  P++
Sbjct: 383 EELRKEATVVC----ENLMTSDCESRTC-LFDIYEDPCEVTDISSQYPEV 427



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
           +Y T+L T+EA  +I     +KP++L LAHLA+HA NA + +E    +ET     YI D 
Sbjct: 118 DYMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDV 177

Query: 72  NRRTYAALTKS 82
           NRR +A++  +
Sbjct: 178 NRRKFASVVTT 188


>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
          Length = 354

 Score =  338 bits (868), Expect = 4e-90,   Method: Compositional matrix adjust.
 Identities = 171/356 (48%), Positives = 227/356 (63%), Gaps = 17/356 (4%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI  G+Q   
Sbjct: 11  FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 70

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I+ A+P G+ L E+ LPEYL+ LGY +  +GKWHLGFF  EYTP+ RGF+SH+G+     
Sbjct: 71  IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 130

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y  HI          + G D   N        G Y+T LFT +A QL+ +    +P+FL
Sbjct: 131 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 182

Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           Y +H A H  +    G  L AP +  ++F +I D   +  A M+  LD+SVG +  +L  
Sbjct: 183 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHA 242

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            GMLEN+IIIF SDNGAP        N     SNYP RG K+++WEGG +VPA +WSP +
Sbjct: 243 NGMLENTIIIFSSDNGAP------HGNTDVCSSNYPLRGAKSSVWEGGTRVPAFVWSPLL 296

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
           +++  VS QMMHISDWLPTL  AAG D S LP ++ G+ QW +L  N PS RNS +
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLKAAGYDMSALPGDLYGVSQWQALQENGPSARNSML 352



 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
           G Y+T LFT +A QL+ +    +P+FLY +H A H  +    G  L AP +  ++F +I 
Sbjct: 156 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 215

Query: 70  DPNRRTYAALTKS 82
           D   +  A +  +
Sbjct: 216 DVECQHMAGMISA 228


>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 374

 Score =  338 bits (866), Expect = 6e-90,   Method: Compositional matrix adjust.
 Identities = 171/344 (49%), Positives = 221/344 (64%), Gaps = 12/344 (3%)

Query: 89  IVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
           I  GW+D+S HGS++I TPNID LA  G+ L N Y  P+CTP+R+++MTGK+PIHTGMQ 
Sbjct: 40  IEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQH 99

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             I   EP G+ L E+ + ++L+ LGYST A+GKWHLG+F  +Y P  RGF+S FGY NG
Sbjct: 100 DTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNG 159

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              YY H  ++         G+D+ +N    W   G+Y+T++FT+EA Q+I+     +PL
Sbjct: 160 RGDYYTHEDTEG-----GFGGYDLHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPL 214

Query: 269 FLYLAHLAAHAGNAGKHL-EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           FLYLAH A HAG  GK L EAP +    F  I    RR +AAMV  LDDSVG +   LQ 
Sbjct: 215 FLYLAHQAVHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAGLDDSVGNISQTLQD 274

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+  NSII+F +DNG PT  Y       N  SN+P RG K+TLWEGGV+  A + SP I
Sbjct: 275 AGLYNNSIIVFTTDNGGPTNGYD-----GNHASNWPLRGCKHTLWEGGVRGTAFVNSPLI 329

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           ++  R S +MMH+ DWLPTLY  AGGD   L  N+DG DQW +L
Sbjct: 330 EKPRRFSDRMMHVCDWLPTLYGVAGGDPKEL-TNLDGFDQWDAL 372



 Score = 69.7 bits (169), Expect = 4e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHL-EAPQ 59
           + +N    W   G+Y+T++FT+EA Q+I+     +PLFLYLAH A HAG  GK L EAP 
Sbjct: 178 LHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPLFLYLAHQAVHAGVYGKDLVEAPH 237

Query: 60  ETINQFQYITDPNRRTYAALT 80
           +    F  I    RR +AA+ 
Sbjct: 238 KYYQMFPNIKTEGRRMFAAMV 258


>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 513

 Score =  337 bits (865), Expect = 9e-90,   Method: Compositional matrix adjust.
 Identities = 204/554 (36%), Positives = 297/554 (53%), Gaps = 78/554 (14%)

Query: 80  TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMT 137
           TK+  + L++V   GW+D+  HGS++IPTPNID LA  G++L+N Y QP+CTPSRASLMT
Sbjct: 26  TKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAEEGVVLDNYYTQPICTPSRASLMT 85

Query: 138 GKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
           GKYP+  G+Q   I  A P G+P   + +P+YL +  Y    +GKWHLG  R E+ P  R
Sbjct: 86  GKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKNYDCHIVGKWHLGHSRSEFLPTRR 145

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           GF+ HFGY  G   +Y H  +D       L  G D+  N   A +  G+Y+TD++T  + 
Sbjct: 146 GFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLWHNEVPAKEFNGKYSTDIYTHRST 205

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLD 315
            ++      +PLFLYLA+ A HAGN  + L+APQ  +++F   I +  RR YAAMV  +D
Sbjct: 206 DILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVDRFSSSIRNDRRRRYAAMVSAVD 265

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            ++G V+ A++  G   N+++ F +DNG P       +N R+ GSNYP R  K TLWEGG
Sbjct: 266 TAIGNVMGAIRANGFAGNTLVFFTNDNGGPI-----NANDRSPGSNYPLRAGKFTLWEGG 320

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           V+   I W+PQ+ + P     + HI D LPT+ +AAG   S +P  +DG+  W SL    
Sbjct: 321 VRGTGIFWAPQVLK-PGKFGGLSHIVDVLPTILSAAG--MSSVP-ELDGVSLWKSL---- 372

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
             +R+ N                 R  +L+NID    TAA RL  +KL+   + NG MDG
Sbjct: 373 SEKRSEN----------------PRTELLLNIDPVDHTAAYRLGDFKLI---KSNGLMDG 413

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA 555
           ++      + P    +A   S+T +   +L   +F+    +D+                 
Sbjct: 414 WF------RTP----DAANHSRTIKC--KLDDPVFI-CRGVDR----------------- 443

Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
                PC      LFN+  DPCEQ+++ +  PDI  ++ E L  +R  + P   +    +
Sbjct: 444 -----PC------LFNIAEDPCEQDDVYADNPDIVKEIMERLHRYRHEMHPVEDKNVTRL 492

Query: 616 QADPKRFNDTWSPW 629
            ADP RF+  W PW
Sbjct: 493 -ADPARFDYIWMPW 505



 Score = 61.6 bits (148), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 1/81 (1%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDP 71
           G+Y+TD++T  +  ++      +PLFLYLA+ A HAGN  + L+APQ  +++F   I + 
Sbjct: 193 GKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVDRFSSSIRND 252

Query: 72  NRRTYAALTKSTTLTLLIVYG 92
            RR YAA+  +    +  V G
Sbjct: 253 RRRRYAAMVSAVDTAIGNVMG 273


>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
 gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
          Length = 532

 Score =  333 bits (855), Expect = 1e-88,   Method: Compositional matrix adjust.
 Identities = 203/533 (38%), Positives = 307/533 (57%), Gaps = 51/533 (9%)

Query: 77  AALTKSTT------LTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCT 129
           A + KST       +T+LI   GW D  FHGS++I TPN+DALAY+G+ILN  Y  P CT
Sbjct: 17  AVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYSGMILNRHYVLPSCT 76

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           PSR++L+TG YPI TGMQG P+ G + R +PL+ +  PE+L+ LGY T  +GKWHLG+  
Sbjct: 77  PSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGYRTHLVGKWHLGYRT 136

Query: 190 REYTPLYRGFESHFGYLNGVISYYD--HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
             + P  RGF+S FGY NG + Y+   H  +    +     G+D+ RN          YA
Sbjct: 137 INHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQTDKDTYA 196

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-INQ-FQYITDPNRR 305
           T LFT+EA ++I++    +PL+LY +HLA H G+    +E P++  +N+ + +I    RR
Sbjct: 197 TRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGHIKHYGRR 256

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            +A  +++LD SVG V+ AL+ K ML+NSII+ MSDNG  TV         NW SN+P  
Sbjct: 257 AFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDLPP---NWSSNWPLG 313

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           G K TL+EGGV+  A++WSP + +   ++   +HI+DWLPTLY+AAGG+   L +  DG+
Sbjct: 314 GTKFTLFEGGVRSVALIWSPLLPKG-VINDDFIHITDWLPTLYSAAGGNPKELGI-FDGI 371

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS-WKLV 484
           DQW +L      +RNS +                R ++L+NID+K  +  V +++ WK++
Sbjct: 372 DQWEAL------KRNSAV---------------ARENILLNIDDKTGSEGVIIENRWKVL 410

Query: 485 LGTQEN-GTMDGYYGQTRS-NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
            G  +  G+ D ++G+  + +  P  + N I+ S+T Q +  L    +    N   + + 
Sbjct: 411 KGGYKTYGSYDFHWGKPENIHPNPKYDPNDIINSQTGQIVSNLKH--YKNSLNHSSIETL 468

Query: 543 RQQATIHCG--ANPAPMT------PSPCTNGPCY-LFNLGNDPCEQNNIASSR 586
           R+++T+ C    N  PM       PSP      + LF++ ND CE N + ++ 
Sbjct: 469 RKKSTVTCQPLVNNIPMNDYGFIEPSPEVRCKDFCLFDIINDQCEVNPVENNE 521



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 2/66 (3%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-INQ-FQYITDPN 72
           YAT LFT+EA ++I++    +PL+LY +HLA H G+    +E P++  +N+ + +I    
Sbjct: 195 YATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGHIKHYG 254

Query: 73  RRTYAA 78
           RR +A 
Sbjct: 255 RRAFAG 260


>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
 gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
          Length = 380

 Score =  333 bits (854), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 167/359 (46%), Positives = 230/359 (64%), Gaps = 11/359 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTPN+D LA  G+ILNN Y  P+CTP+RASLMTGKYPIH GMQ   I
Sbjct: 16  GWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKYPIHLGMQHFVI 75

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + A+P G+PL E  LP+YL+  GY T  IGKWHLGFF +EYTP YRGF+S +G  +    
Sbjct: 76  YAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFDSFYGMWSAKAD 135

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H      S      G DMR N+        +YAT++FT+EA+++IE+    +PLFLY
Sbjct: 136 YWNHT-----SFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKVIENHNKSEPLFLY 190

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDP-NRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +AH A H+ N    L+AP++ + +F  + D   R+ YAAMV  +DDS+G V  AL++  M
Sbjct: 191 IAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDSIGEVFRALEKNRM 250

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L NS+I+F +DNG     +      RN GSNYP RG K+ +WEGGV+  A ++S  I+  
Sbjct: 251 LNNSVILFTTDNGGAPHGFN-----RNQGSNYPLRGGKDMMWEGGVRGTAFIYSDLIKHK 305

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
            RVS  ++ ++DW+PTLY  AGG    L  ++DG + W ++    PS R   +  +D W
Sbjct: 306 GRVSTDLIDVTDWVPTLYYLAGGTPGYLEPHMDGKNVWETISNKEPSPRKEVLHNIDPW 364



 Score = 65.5 bits (158), Expect = 9e-08,   Method: Compositional matrix adjust.
 Identities = 31/81 (38%), Positives = 51/81 (62%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR N+        +YAT++FT+EA+++IE+    +PLFLY+AH A H+ N    L+AP++
Sbjct: 151 MRNNMEPVTTDKDKYATEVFTREALKVIENHNKSEPLFLYIAHQAPHSANPHDPLQAPED 210

Query: 61  TINQFQYITDP-NRRTYAALT 80
            + +F  + D   R+ YAA+ 
Sbjct: 211 KVKKFSGVIDKIERQQYAAMV 231


>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 490

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 190/498 (38%), Positives = 269/498 (54%), Gaps = 55/498 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+SFHGS +IPTPNID LA +G+ILNN Y  P+CTPSRA+LMTG YPI  GMQ   I
Sbjct: 37  GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSI 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A P G+PL  + LP++ ++LGY    IGKWHLGFF+  Y P+ RGF++ FG+  G   
Sbjct: 97  QAAGPWGLPLENKILPQHFKDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL--- 268
           YY+H     + + + +         S   +T+ +    L+    + +     +  PL   
Sbjct: 157 YYNHTSGSSHRKILAVTS-------SVQVNTLEKGRLSLWGPRELSVCFLHQIYSPLNFY 209

Query: 269 ----FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
               F Y++H A H     +  +AP   + +F YI +PNR  YA M+  LD+S+G V+ A
Sbjct: 210 LQPFFCYISHQAVHHALNAEMFQAPARNVLKFSYIGEPNRTIYAGMLDALDESMGRVVEA 269

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L   G+LE++II+F SDNG             N G N+P RG K TLWEG V+VPA +W+
Sbjct: 270 LSSAGILEDTIIVFSSDNGGSPYGLES-----NRGYNWPLRGAKFTLWEGSVRVPAFVWN 324

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
           P++ +  RVS Q+MHISDWLPTLYTAAGG+ S L   IDG+                   
Sbjct: 325 PKLLKKSRVSNQLMHISDWLPTLYTAAGGNASALG-GIDGV------------------- 364

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY---GQTR 501
             D W SL   + S+R  +L+NID     + +R   +KLV+G   +G +D +Y   G TR
Sbjct: 365 --DMWHSLTHGSASKRKEILLNIDSLYNISGLRYKQYKLVVGGGFDGELDDHYYFPGGTR 422

Query: 502 -SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
            +N +  L  N    ++  + L +     + P+       +  +   + C  N   +T +
Sbjct: 423 PTNNIDQLR-NQSTMARVLKELYEKQDRSWSPL-------AWSKNVAVDCRRN--RLTEN 472

Query: 561 PCTNGPCYLFNLGNDPCE 578
                P YLFN+  DPCE
Sbjct: 473 FVPMQPPYLFNIEEDPCE 490



 Score = 42.4 bits (98), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 16/45 (35%), Positives = 25/45 (55%)

Query: 35  KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
           +P F Y++H A H     +  +AP   + +F YI +PNR  YA +
Sbjct: 211 QPFFCYISHQAVHHALNAEMFQAPARNVLKFSYIGEPNRTIYAGM 255


>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
          Length = 633

 Score =  333 bits (853), Expect = 2e-88,   Method: Compositional matrix adjust.
 Identities = 208/590 (35%), Positives = 306/590 (51%), Gaps = 74/590 (12%)

Query: 81  KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           ++  + L IV   GW+D+SFH + +IPTPNIDA+  + ++LN+ Y Q  CTPSR +L+TG
Sbjct: 55  RAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTG 114

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           KYPI  G+Q   I       + L  R LP+YLR+LGY+T  +GKWHLGF+  +Y P  RG
Sbjct: 115 KYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRG 174

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           F+S +G+ NG  +YY+H  SD   R     G+D   N  +  D  G+YATD+ T+    L
Sbjct: 175 FDSFYGFYNGAGTYYNHSASDADGRI----GYDWHLNKESDPDAHGKYATDIITQRVKHL 230

Query: 259 IEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLD 315
           I+ +  +KP+FL ++H+A H G N  +  E  ++ I   +  +I   +R  YA M++ LD
Sbjct: 231 IQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIEDPEIAHIMVESRTKYAGMIRALD 290

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           D+ G  + AL+   ML+NS+I+F SDNG P++      +Y+   SN+P RG K ++WEGG
Sbjct: 291 DAFGQTVVALKEAEMLQNSVILFTSDNGGPSI---VDESYKTGASNWPLRGQKFSMWEGG 347

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           V+V + LWSP ++    ++  + HI+DWLPT Y  AGG+   LP++IDG D W S+  NT
Sbjct: 348 VRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELRDLPIDIDGHDIWDSIDRNT 407

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
            S R + +   D   +   N   RR +  +   +         DSW    G    GTMD 
Sbjct: 408 DSPRKNLLINYDTGGT--ENFAVRRGNFKLMHGKPPPEVDPWYDSWYATAG----GTMDD 461

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST-RQQATIHCGANP 554
           +    ++ K          +S TY  L +          + ++ R   R+   + C  N 
Sbjct: 462 FMTLQKAMK----------KSVTYDILSRR--------PDFEERRDDWREAGKVKCKINR 503

Query: 555 APMTPSP------------------------CTNG---PCYLFNLGNDPCEQNNIASS-R 586
             M  SP                        CTN    PC LF+L  DPCE    + + R
Sbjct: 504 G-MADSPRGLDYRTYTVDDMLKASVIKNDEICTNSNEQPC-LFDLLEDPCELRQYSGADR 561

Query: 587 PDISSQLYEL----LKYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPWI 630
             I  +L E+    L+     + P S   E  D   ADP++FN TW P+I
Sbjct: 562 EQIIRELSEIAEQYLRLKEAAVAPWSSRMEHTD-ADADPQKFNGTWMPYI 610



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 3/76 (3%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETIN--QFQ 66
           D  G+YATD+ T+    LI+ +  +KP+FL ++H+A H G N  +  E  ++ I   +  
Sbjct: 213 DAHGKYATDIITQRVKHLIQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIEDPEIA 272

Query: 67  YITDPNRRTYAALTKS 82
           +I   +R  YA + ++
Sbjct: 273 HIMVESRTKYAGMIRA 288


>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
 gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
          Length = 574

 Score =  332 bits (850), Expect = 5e-88,   Method: Compositional matrix adjust.
 Identities = 201/567 (35%), Positives = 296/567 (52%), Gaps = 42/567 (7%)

Query: 67  YITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA 124
           ++    + TY    K   + +++    G++D+S  G  E  TPNIDAL ++G +L+ +YA
Sbjct: 19  WVKASEKNTYRTGNKLPNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYA 78

Query: 125 QPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
             +CTPSR +L++GKYPIHTG Q   I   EP  + L    +PE  R  GYST  +GKWH
Sbjct: 79  PAMCTPSRGALLSGKYPIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWH 138

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           LGF R EYTP +RGF+ H+GY    I YY      Q      + G+D RRN+       G
Sbjct: 139 LGFARPEYTPTHRGFDYHYGYWGAYIDYYQR--RSQMPEKTYIVGYDFRRNMEVECTDRG 196

Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
            Y TDL T EA ++I++     KPLFL + HLA H  N    L+AP+E + +F +I DPN
Sbjct: 197 VYVTDLLTNEAERIIQETAAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPN 256

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
            R YAAMV KLD SVG VI+AL R   LENSI+IF SDNGAP+V       + N GSN+P
Sbjct: 257 HRKYAAMVSKLDQSVGRVITALARAEQLENSIVIFYSDNGAPSV-----GMFANTGSNWP 311

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
            +G KN+ WEGGV+V  ++WS  +     +  Q +++ D+LPTL  AAG +     L +D
Sbjct: 312 LKGQKNSPWEGGVRVAGVIWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIELPE-SLQLD 370

Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
           G+D W  L  +                    + P     +L ++D+    +++ L  WK 
Sbjct: 371 GIDLWPQLAGSK-------------------DAPHVPREILHSLDDVSHFSSLLLGQWKY 411

Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
           V GT   G  D        +++       +VE +   + + L++     ++   +M + R
Sbjct: 412 VNGTTLAGQFDQQLTHRELDELDPRARRYVVEVRNSPASKALARYDLQRLTQ-HRMHTLR 470

Query: 544 QQATIHCG--ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKY 599
           + AT+ CG            C      L++L  DPCE NN+A S    +I + + + ++ 
Sbjct: 471 RLATVRCGDMQRGCNALLEEC------LYDLSVDPCELNNLAYSERHAEILATMRQRVQQ 524

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTW 626
            R   +P  + +P +  A+P     TW
Sbjct: 525 LRAEALPPGN-RPGVSYANPNLHECTW 550



 Score = 71.6 bits (174), Expect = 1e-09,   Method: Compositional matrix adjust.
 Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 1/81 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I++     KPLFL + HLA H  N    L+AP+
Sbjct: 184 FRRNMEVECTDRGVYVTDLLTNEAERIIQETAAKQKPLFLMINHLATHTANDNDPLQAPE 243

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E + +F +I DPN R YAA+ 
Sbjct: 244 EEVQKFLHIKDPNHRKYAAMV 264


>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
          Length = 547

 Score =  331 bits (848), Expect = 9e-88,   Method: Compositional matrix adjust.
 Identities = 197/561 (35%), Positives = 305/561 (54%), Gaps = 48/561 (8%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA      + +    GW+D S HGS  + TPN+D L  +G+ L+  Y   +C+P+R +++
Sbjct: 19  AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKY    GMQG P+  AE RG+PL ER + +YL++ GY T+ +GKWH+G    E  P Y
Sbjct: 79  TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGFE+HFG   G I YY++   +Q      + G  +  +L   W T G Y TD++T+++ 
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            +IE+  V +PL+L L H A H GN    L+AP E +   +++    RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           S+G +++ L++KGMLEN+II F +DNGAPTV     S     GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311

Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +  A++W+ P++        +V    MH +DW+PTL  A G    ++P  +DG+  WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
           + N PS R                        +  ID+    ++V L   KLV GT +  
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405

Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
           ++  +YG+     +  P      + +SK ++SL    + I +P+ + D M + R +A + 
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           CG N  P   SP     C L+++  DPCE  +++   P ++  L   L+     ++P+  
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
           +     ++ PK FN TW  ++
Sbjct: 518 QYVADPKSAPKYFNYTWDAYL 538



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           +L   W T G Y TD++T+++  +IE+  V +PL+L L H A H GN    L+AP E + 
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234

Query: 64  QFQYITDPNRRTYAALTK 81
             +++    RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252


>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 528

 Score =  328 bits (842), Expect = 4e-87,   Method: Compositional matrix adjust.
 Identities = 203/562 (36%), Positives = 280/562 (49%), Gaps = 75/562 (13%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA + +  L ++   GW+D+S HGS++IPTPNID LA  G++L N Y Q +CTPSR +LM
Sbjct: 17  AASSPNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALM 76

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKYPIH G+Q   I GA+P G+P   + +P+YL    Y +  IGKWHLG  R E  P  
Sbjct: 77  TGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTR 136

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           RGF SHFG+  G   Y++H   +      E+  G D+  N        G YA DLFTK A
Sbjct: 137 RGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRA 196

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKL 314
           + ++E      PLFLYLAH A H G+    L+AP   + +F+  I D  R+ YAAMV  +
Sbjct: 197 ISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDEKRKHYAAMVSAM 256

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D S+G ++  +   G  EN+II   +DNG PT     + +     SN+P RG K TLWEG
Sbjct: 257 DASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGS-----SNHPLRGCKYTLWEG 311

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G +  A+ W P+ +        M HI+D LPTL TA+G   +    N DG D W +L  N
Sbjct: 312 GTRGSALFWYPK-KLGAGTYKGMAHITDVLPTLLTASGNAVAG---NFDGYDLWDALSTN 367

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRR-------NSVLINIDEKKRTAAVRLDSWKLVLGT 487
              R+ +               P R        +S+L NID    + AVR   +KL+   
Sbjct: 368 AQPRKET-------------VEPQRTYGNEHDFHSLLYNIDPVTHSWAVRSGDFKLITDN 414

Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
             NG   G+Y                                      ++ +  TR   T
Sbjct: 415 ALNGEYSGWY------------------------------------PTVESVNLTR---T 435

Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
           + CG+       S     PC LFNL  DPCE+N+I   RPD+  +L   L  ++ ++ P 
Sbjct: 436 VKCGSASKACDGS---KRPC-LFNLAADPCEENDIFDERPDMVRKLMVKLNEYQTSMRPM 491

Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
            + +P+   ADP RF+  W PW
Sbjct: 492 EN-RPNSAAADPARFDFVWMPW 512



 Score = 63.2 bits (152), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 1/76 (1%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
           G YA DLFTK A+ ++E      PLFLYLAH A H G+    L+AP   + +F+  I D 
Sbjct: 185 GTYANDLFTKRAISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDE 244

Query: 72  NRRTYAALTKSTTLTL 87
            R+ YAA+  +   ++
Sbjct: 245 KRKHYAAMVSAMDASI 260


>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
 gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
          Length = 576

 Score =  327 bits (837), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 206/566 (36%), Positives = 295/566 (52%), Gaps = 48/566 (8%)

Query: 72  NRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV 127
           +R T    T S    ++I+     G++DLS  G  E  TPNIDALA++G +L+ +YA  +
Sbjct: 22  DRETNTTRTGSKLPNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAM 81

Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           CTPSR +L++GKYPIHTG Q   I   EP  + L    +PE  R  GYST  +GKWHLG+
Sbjct: 82  CTPSRGALLSGKYPIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGY 141

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
            R E+TP +RGF+ H+GY    I YY      Q      + G+D RRN+       G Y 
Sbjct: 142 ARPEFTPTHRGFDYHYGYWGAYIDYYQR--RSQMPDKTYIMGYDFRRNMEVECADRGVYM 199

Query: 248 TDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           TDL T EA ++I++     +PLFL + HLA H GN    L+ P+E + +F +I DPN R 
Sbjct: 200 TDLLTNEAERVIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRK 259

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YAAMV KLD SVG VI AL R   LENSI+IF SDNGAP+V       + N GSN+P +G
Sbjct: 260 YAAMVSKLDKSVGRVIGALARAEQLENSIVIFYSDNGAPSV-----GMFANTGSNWPLKG 314

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDG 424
            KN+ WEGG++V   +WSP ++    V  Q +++ D+LPTL  AAG +   LP  L +DG
Sbjct: 315 QKNSPWEGGLRVAGAIWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIE---LPDSLQLDG 371

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           +D W    L+ P                    P     +L ++D+    +++ L  WK V
Sbjct: 372 IDLWPQ--LSGPK-----------------EAPHVPREILHSLDDVWNVSSLLLGHWKYV 412

Query: 485 LGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQ 544
            GT   G  D    Q   + +       +VE +   + + L++     ++   +M++   
Sbjct: 413 NGTTLAGQYDQQLVQRELDDLDPRANRYVVEVRNSLASRALARFDQQRLTQ-HRMKNIMI 471

Query: 545 QATIHCG--ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYH 600
            A + CG            C      L+++  DPCE+NN+A S    +I + + + +K  
Sbjct: 472 DAKVDCGDMQRGCNALLEEC------LYDMSVDPCERNNLAYSEQHAEILATMRQRIKQL 525

Query: 601 RRTLVPQSHEQPDLVQADPKRFNDTW 626
           R   VP     P +  A+P     TW
Sbjct: 526 RAGAVP-IQNLPGVAYANPTLHECTW 550



 Score = 70.5 bits (171), Expect = 2e-09,   Method: Compositional matrix adjust.
 Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 1/93 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
            RRN+       G Y TDL T EA ++I++     +PLFL + HLA H GN    L+ P+
Sbjct: 184 FRRNMEVECADRGVYMTDLLTNEAERVIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPE 243

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIVYG 92
           E + +F +I DPN R YAA+      ++  V G
Sbjct: 244 EELQKFTHIKDPNHRKYAAMVSKLDKSVGRVIG 276


>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  326 bits (836), Expect = 2e-86,   Method: Compositional matrix adjust.
 Identities = 195/554 (35%), Positives = 301/554 (54%), Gaps = 48/554 (8%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA      + +    GW+D S HGS  + TPN+D L  +G+ L+  Y   +C+P+R +++
Sbjct: 19  AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKY    GMQG P+  AE RG+PL ER + +YL++ GY T+ +GKWH+G    E  P Y
Sbjct: 79  TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGFE+HFG   G I YY++   +Q      + G  +  +L   W T G Y TD++T+++ 
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            +IE+  V +PL+L L H A H GN    L+AP E +   +++    RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           S+G +++ L++KGMLEN+II F +DNGAPTV     S     GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311

Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +  A++W+ P++        +V    MH +DW+PTL  A G    ++P  +DG+  WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
           + N PS R                        +  ID+    ++V L   KLV GT +  
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405

Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
           ++  +YG+     +  P      + +SK ++SL    + I +P+ + D M + R +A + 
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           CG N  P   SP     C L+++  DPCE  +++   P ++  L   L+     ++P+  
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517

Query: 610 EQPDLVQADPKRFN 623
           +     ++ PK FN
Sbjct: 518 QYVSDPKSAPKYFN 531



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           +L   W T G Y TD++T+++  +IE+  V +PL+L L H A H GN    L+AP E + 
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234

Query: 64  QFQYITDPNRRTYAALTK 81
             +++    RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252


>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
 gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
          Length = 532

 Score =  326 bits (835), Expect = 3e-86,   Method: Compositional matrix adjust.
 Identities = 195/554 (35%), Positives = 301/554 (54%), Gaps = 48/554 (8%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           AA      + +    GW+D S HGS  + TPN+D L  +G+ L+  Y   +C+P+R +++
Sbjct: 19  AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TGKY    GMQG P+  AE RG+PL ER + +YL++ GY T+ +GKWH+G    E  P Y
Sbjct: 79  TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGFE+HFG   G I YY++   +Q      + G  +  +L   W T G Y TD++T+++ 
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            +IE+  V +PL+L L H A H GN    L+AP E +   +++    RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           S+G +++ L++KGMLEN+II F +DNGAPTV     S     GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311

Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +  A++W+ P++        +V    MH +DW+PTL  A G    ++P  +DG+  WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
           + N PS R                        +  ID+    ++V L   KLV GT +  
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405

Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
           ++  +YG+     +  P      + +SK ++SL    + I +P+ + D M + R +A + 
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           CG N  P   SP     C L+++  DPCE  +++   P ++  L   L+     ++P+  
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517

Query: 610 EQPDLVQADPKRFN 623
           +     ++ PK FN
Sbjct: 518 QYVADPKSAPKYFN 531



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           +L   W T G Y TD++T+++  +IE+  V +PL+L L H A H GN    L+AP E + 
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234

Query: 64  QFQYITDPNRRTYAALTK 81
             +++    RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252


>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 470

 Score =  325 bits (834), Expect = 4e-86,   Method: Compositional matrix adjust.
 Identities = 195/497 (39%), Positives = 272/497 (54%), Gaps = 46/497 (9%)

Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
           P+    Q   I   EP G+PL E  LPE+L +LGY++ A+GKWHLG+F++ YTP YRGF+
Sbjct: 5   PLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFK 64

Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           S +G+ NG   YY H++   ++      G DMRR+L+  W +VG+Y+T LFTKEA  +I 
Sbjct: 65  SFYGFWNGYQDYYTHMVQATFA---SFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIIT 121

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
                 PLFLYLAHLA HAG     L+APQE IN FQ I D  RR YA M+K LD+SV  
Sbjct: 122 KHNNSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAE 181

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V +AL    ML+N+I +F+SDNGAPT        +RN+GSN+P +G K T WEGG++  A
Sbjct: 182 VFNALHTNNMLDNTIFVFVSDNGAPT-----NGIHRNYGSNWPLKGEKATPWEGGIRTAA 236

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
            +WS  + +   +   +MHISDWLPTLY AAGG+T  L                      
Sbjct: 237 FVWSKLLSKKKTLPNPLMHISDWLPTLYQAAGGNTQDL---------------------- 274

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
             IDG+  W S   N  S R  +L NID+    AA+R  ++K + G+   G +D Y+  +
Sbjct: 275 GAIDGISMWESFSNNIKSPRKQILHNIDDITGYAAIRDTNFKYIKGSTFLGYLD-YWSGS 333

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
            S      N +AI+ S T   L Q++ +   P      + S R  +T+ C      M P 
Sbjct: 334 LSPSTIRYNVDAILNSTTAGILSQINGDSLTP----SLIESLRNSSTVRCDFE-KDMKPC 388

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK--YHRRTL--VPQSHEQPDLVQ 616
                PC LF++  DPCE  N+ + +P+   + +   K  Y   +L    +S  +P  V+
Sbjct: 389 KPFIKPC-LFHIVKDPCELVNL-NYKPNSKMRKFVEAKIDYFETSLGKFRESASKPMNVR 446

Query: 617 ----ADPKRFNDTWSPW 629
               A+P  +N+TW  W
Sbjct: 447 GTKDANPALYNNTWISW 463



 Score = 90.9 bits (224), Expect = 2e-15,   Method: Compositional matrix adjust.
 Identities = 46/91 (50%), Positives = 58/91 (63%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRR+L+  W +VG+Y+T LFTKEA  +I       PLFLYLAHLA HAG     L+APQE
Sbjct: 93  MRRDLNPDWSSVGKYSTHLFTKEATDIITKHNNSVPLFLYLAHLAPHAGTYENPLQAPQE 152

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
            IN FQ I D  RR YA + K+   ++  V+
Sbjct: 153 DINSFQSIKDKYRRKYAGMMKNLDNSVAEVF 183


>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
          Length = 784

 Score =  314 bits (805), Expect = 7e-83,   Method: Compositional matrix adjust.
 Identities = 197/549 (35%), Positives = 288/549 (52%), Gaps = 77/549 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTPNID+LA +G+ILNN Y  P    ++   MTGKY  H G Q   +
Sbjct: 1   GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV-I 210
              +P G+P TE+ LP+YL+E GY+  A+GKW LG+++ E  P  RGF+  +G L     
Sbjct: 61  HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYGGLTSSGK 120

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            YY H   D      E  G D+RRN     +  G Y T+++T+EAV +I++   +KPLFL
Sbjct: 121 DYYTHSAFD------ENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFL 174

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+AH A H GNA   L+AP+  + +  +I +  R+ +A MV  LD+SV  +  AL +KG+
Sbjct: 175 YVAHQAVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGL 234

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+N++I+F +D G   V  +E S+     SN+P RG K T+WEGG++  A ++S  I+  
Sbjct: 235 LDNTVIVFTTDAGG-AVGGQELSS----ASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNK 289

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVSL+M H +DWL T    AG       +N  G+                 +DG D W 
Sbjct: 290 GRVSLEMYHSTDWLLTTLGLAG-------INTKGIK----------------VDGFDVWK 326

Query: 451 SLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQ--TRSNKVPL 507
           S+     S R  +L   D   K   ++R+  +KLV+         G+YG    R  ++  
Sbjct: 327 SISEGEVSPRFEILHQFDPLTKNLCSLRVGDYKLVINQD-----IGFYGDWYPRPAEIGE 381

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN-PAP---MTPSPC- 562
           L +   +E+              LP             A + C  N P P   M   PC 
Sbjct: 382 LRYLKRIET--------------LP------------NAKVTCDKNYPHPFLEMHAPPCD 415

Query: 563 -TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
               PC +FN+  DPCE +N+A   P+    L + L+++R TLV  ++   D  QADP+ 
Sbjct: 416 PMKKPC-MFNIQWDPCEFHNLAEFMPNTLKVLLDRLQFYRLTLVKPTYPSVD-SQADPEN 473

Query: 622 FNDTWSPWI 630
            +  WSPW+
Sbjct: 474 SDGVWSPWV 482



 Score = 68.9 bits (167), Expect = 7e-09,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 50/80 (62%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +RRN     +  G Y T+++T+EAV +I++   +KPLFLY+AH A H GNA   L+AP+ 
Sbjct: 136 LRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFLYVAHQAVHTGNADDPLQAPES 195

Query: 61  TINQFQYITDPNRRTYAALT 80
            + +  +I +  R+ +A + 
Sbjct: 196 YLKKLNHIKNIKRKLFAGMV 215


>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
 gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
          Length = 520

 Score =  311 bits (797), Expect = 7e-82,   Method: Compositional matrix adjust.
 Identities = 193/541 (35%), Positives = 281/541 (51%), Gaps = 75/541 (13%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGWND+ +HGS  I TPN+D LA  G+ L N Y QP+CTPSR+ LMTG+Y IH G+Q   
Sbjct: 52  YGWNDIGYHGS-VIRTPNLDRLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSI 110

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P G+PL E  LP+ L+E GYST  +GKWHLGF++ EYTPL+RGF++ +G+L G  
Sbjct: 111 IWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSE 170

Query: 211 SYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           ++Y H  S          NG D+R       D  G Y+T LF K+A+++I  Q   KP+F
Sbjct: 171 NHYSHRNSGGMPGFRPGWNGLDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMF 230

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL   A HA      L+APQ+ I+ +++I D NRR YAAM   +D++VG +  AL++ G
Sbjct: 231 LYLPFQAVHA-----PLQAPQKYISMYRHINDYNRRMYAAMTTAMDEAVGNITDALKQYG 285

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +N++++F +DNG   +           G+N+P RG K +LWEGG++    + SP I+ 
Sbjct: 286 LWDNTVLVFSTDNGGQVMAG---------GNNWPLRGWKGSLWEGGIRGVGFVNSPLIKA 336

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ- 448
             R S  ++HISDW PTL   AGG T+     +DG D W ++    PS R   +  +D  
Sbjct: 337 KGRTSDALIHISDWFPTLVGLAGGSTNGTK-PLDGHDVWEAISEGKPSPRKEILHNIDPL 395

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
           + ++L   P +      N       AA+R   WKL         + GY G    + +P  
Sbjct: 396 FRTILYLPPHQWGDDTFNTSMH---AAIRSGDWKL---------LTGYPGN--GSWMPPP 441

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           + + I E       Q L                                          +
Sbjct: 442 SSDLICEESDDPPGQHL------------------------------------------W 459

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+  DP E+ +++   P +   L E L  ++++ VP  +  PD  +A+P    D WSP
Sbjct: 460 LFNIREDPEERTDLSKKYPLVVQDLLEKLAAYKKSSVPIFYPPPD-PRANPALHGDLWSP 518

Query: 629 W 629
           W
Sbjct: 519 W 519



 Score = 64.3 bits (155), Expect = 2e-07,   Method: Compositional matrix adjust.
 Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 5/73 (6%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF K+A+++I  Q   KP+FLYL   A HA      L+APQ+ I+ +++I 
Sbjct: 202 DQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAPQKYISMYRHIN 256

Query: 70  DPNRRTYAALTKS 82
           D NRR YAA+T +
Sbjct: 257 DYNRRMYAAMTTA 269


>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
 gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
          Length = 491

 Score =  310 bits (795), Expect = 1e-81,   Method: Compositional matrix adjust.
 Identities = 193/539 (35%), Positives = 282/539 (52%), Gaps = 83/539 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ FHGS +I TPNID LA NG+IL+N Y QPVCTP+RASLMTGKYPIHTG+Q   I
Sbjct: 36  GWSDVGFHGS-KIQTPNIDRLAANGVILDNYYVQPVCTPTRASLMTGKYPIHTGLQHGII 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+PL    LP+ LR+ GYST  +GKWHLGF+  E TP YRGF++ +G+ +G  +
Sbjct: 95  HNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGFYSGAEN 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           +Y H+  D Y         D+R N     D  G Y+  LFTK A Q++       PLF+Y
Sbjct: 155 HYTHV-QDHYL--------DLRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMY 205

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A    H+      ++AP+E I+++ +I DP RRTYAAMV  +DD++G +  A  + G+ 
Sbjct: 206 MAFQNVHSP-----VQAPKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGLW 260

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           EN+I+IF +DNG            +N G +YP RG K+TLWEGGV+  A +    ++Q+ 
Sbjct: 261 ENTILIFSTDNGGVP---------KNGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSG 311

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
                +MH++DW PTL + AGG           LD+            + ++DG D W S
Sbjct: 312 VKCKALMHVTDWYPTLVSLAGGS----------LDE------------DEDLDGYDVWES 349

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           +     S R  +L NID       + +      LG    G         R   + LL   
Sbjct: 350 ISHGVESPRKELLHNID------TINIPPGDGSLGFSTTGI------GLRVGDMKLL--- 394

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
             V + +Y          F+P  +    R+      IH   N  PM           L+N
Sbjct: 395 MAVPNISY----------FIPPED----RNGSVDWYIHSN-NKVPMV-------EVALYN 432

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           +  DP E++++    PD+ ++L   ++++R+T VP +++  D       + N  W+PW+
Sbjct: 433 ITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAKQNGAWTPWV 491



 Score = 58.9 bits (141), Expect = 8e-06,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 45/80 (56%), Gaps = 5/80 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N     D  G Y+  LFTK A Q++       PLF+Y+A    H+      ++AP+E
Sbjct: 166 LRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMYMAFQNVHSP-----VQAPKE 220

Query: 61  TINQFQYITDPNRRTYAALT 80
            I+++ +I DP RRTYAA+ 
Sbjct: 221 YIDRYSFIKDPLRRTYAAMV 240


>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
          Length = 491

 Score =  310 bits (793), Expect = 2e-81,   Method: Compositional matrix adjust.
 Identities = 189/548 (34%), Positives = 286/548 (52%), Gaps = 89/548 (16%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LLI+   +G+ND+ +HGS EI TPN+D LA  G+ L N Y QP+CTP+R+ LM+G+Y IH
Sbjct: 27  LLIIADDFGYNDIGYHGS-EIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSGRYQIH 85

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   I   +P G+PL    LP+ L+E+GYST A+GKWHLGF++ EY P  RGF+SHF
Sbjct: 86  TGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFDSHF 145

Query: 204 GYLNGVISYYDHILSDQYSRTV-ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           GYL G   Y+ H       R    + G D+R N++ A +  G Y+T LF ++A +++ + 
Sbjct: 146 GYLTGAEDYFKH------DRCFGAMCGTDLRDNMNPA-NYTGVYSTHLFAQKAAEVVNNH 198

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             DKPLFLYL   + HA      L+ P++    + +I D  RRTYA MV  +D+++G + 
Sbjct: 199 NTDKPLFLYLPFQSVHAP-----LQVPEQYTKPYMHIQDKQRRTYAGMVSAMDEAIGNLT 253

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
              +++GM  N++++F +DNG               G+N+P RG K++LWEGGV    ++
Sbjct: 254 DVFKKRGMWNNTLMVFSTDNGG---------QIHAGGNNFPLRGWKDSLWEGGVHGVGLV 304

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL-LLNTPSRRNS 441
               +++   VS  ++H+SDW PTL +  GG T      +DG+DQW ++   ++P  R S
Sbjct: 305 HGQMLKRKGSVSRDLIHVSDWFPTLISMVGG-TLNGTKPLDGMDQWKTINEGSSPGPRQS 363

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
            +  +D      +  P  +       D + R AAVR+  WKLV G   NG+         
Sbjct: 364 ILHNIDP-----MYPPVGQRVPSSPFDNRIR-AAVRMGDWKLVTGFPGNGSW-------- 409

Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
              VP  + ++I  SK        S+NI                                
Sbjct: 410 ---VPPPHMDSITLSK-----DSASKNI-------------------------------- 429

Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
                 +LFN+  DP E N+++ +   +   + ELL  + +T VP  +   D  +A+PK 
Sbjct: 430 ------WLFNITEDPNEHNDVSGTYQHVVINMLELLAQYNKTAVPCRYPAMD-SKANPKY 482

Query: 622 FNDTWSPW 629
               W PW
Sbjct: 483 HGGYWGPW 490



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 6/82 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N++ A +  G Y+T LF ++A +++ +   DKPLFLYL   + HA      L+ P++
Sbjct: 169 LRDNMNPA-NYTGVYSTHLFAQKAAEVVNNHNTDKPLFLYLPFQSVHAP-----LQVPEQ 222

Query: 61  TINQFQYITDPNRRTYAALTKS 82
               + +I D  RRTYA +  +
Sbjct: 223 YTKPYMHIQDKQRRTYAGMVSA 244


>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
 gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
          Length = 584

 Score =  309 bits (792), Expect = 3e-81,   Method: Compositional matrix adjust.
 Identities = 190/549 (34%), Positives = 280/549 (51%), Gaps = 110/549 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTP ID  A  G+ILNN Y  P+CTPSRAS+MTGKYPI+ GM     
Sbjct: 38  GWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKYPINLGM----- 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                                          WHLGFF +EYTP+YRGF+S +G+ N    
Sbjct: 93  -------------------------------WHLGFFTKEYTPVYRGFDSFYGFWNAKTD 121

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H      S      G D+R N+       G Y T+LFT+EAV++IE      PLFLY
Sbjct: 122 YWNH-----SSYENNFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTSTPLFLY 176

Query: 272 LAHLAAHAGNAGKHLEAPQETIN--------QFQ-YITDPNRRTYAAMVKKLDDSVGTVI 322
           +AH A H  N  + L+APQ+ I+        +F+  I D  R+ YAAMV  LD SVG + 
Sbjct: 177 VAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVTSLDQSVGDIF 236

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPA 380
           +AL ++ ML +S++IF +DNG             NW  GSN+P RG K+ LWEGGVK  A
Sbjct: 237 AALSKRHMLRDSVVIFTTDNGGAPYGL-------NWNRGSNFPLRGGKDMLWEGGVKGVA 289

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
            ++S  I+Q  RVS +++ ++DW+PT+Y  AGG    L  N+DG                
Sbjct: 290 FVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDG---------------- 333

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
            N+     WS++    PS R+ +L NID  ++ A +R   +K+V G  +     G+Y + 
Sbjct: 334 KNV-----WSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQGMDDTYKGVGWYDRY 388

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
             +                 +L  + Q   LP + ID  ++  ++    C +        
Sbjct: 389 PGH-----------------ALSSMKQPELLPGAVIDCKKTFDEERK--CDS-------- 421

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
             ++G   LF++  DPCE +++++  P++ +++   L+Y++   +P     P    A+P 
Sbjct: 422 --SDGKFCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFP-PINKAANPA 478

Query: 621 RFNDTWSPW 629
            F   WSPW
Sbjct: 479 NFGGFWSPW 487



 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 36/91 (39%), Positives = 51/91 (56%), Gaps = 9/91 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N+       G Y T+LFT+EAV++IE      PLFLY+AH A H  N  + L+APQ+
Sbjct: 137 LRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTSTPLFLYVAHQAVHTANPNEPLQAPQD 196

Query: 61  TIN--------QFQ-YITDPNRRTYAALTKS 82
            I+        +F+  I D  R+ YAA+  S
Sbjct: 197 KIDVSLKQRQQRFKGTIDDDQRQVYAAMVTS 227


>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
 gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
          Length = 324

 Score =  309 bits (791), Expect = 4e-81,   Method: Compositional matrix adjust.
 Identities = 154/302 (50%), Positives = 202/302 (66%), Gaps = 17/302 (5%)

Query: 74  RTYAALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCT 129
           R  A +T+ T   ++I+     G+ND+SFHGSN+I TPNIDA AYNG+ILN  Y   +CT
Sbjct: 17  RQVAGVTQPTNPNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCT 76

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           PSRA+L+TGKYPIH GMQ       EP G+PL ER +P++ R+ GYST+ +GKWHLGF+R
Sbjct: 77  PSRAALLTGKYPIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQLVGKWHLGFWR 136

Query: 190 REYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
           +++TP+ RGF+ HFGY NG I YYDH   +L   Y+      G D RR+L       G Y
Sbjct: 137 QDHTPIMRGFDHHFGYYNGYIDYYDHTHYMLDRNYTA-----GADFRRDLQRCHSDNGTY 191

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           AT+ FTKEA ++IE   + +PLF+ L+HLA H GN  + ++AP E + +F +I+DP RRT
Sbjct: 192 ATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHISDPKRRT 251

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  LD SVG  I AL  +GML NSI++  SDNGAPT+     S     GSN+P+RG
Sbjct: 252 YAGMVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPTLGIHSNS-----GSNHPFRG 306

Query: 367 VK 368
           V 
Sbjct: 307 VS 308



 Score = 82.8 bits (203), Expect = 5e-13,   Method: Compositional matrix adjust.
 Identities = 37/82 (45%), Positives = 53/82 (64%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+L       G YAT+ FTKEA ++IE   + +PLF+ L+HLA H GN  + ++AP E
Sbjct: 177 FRRDLQRCHSDNGTYATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYE 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            + +F +I+DP RRTYA +  S
Sbjct: 237 EVAKFVHISDPKRRTYAGMVSS 258


>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
          Length = 775

 Score =  307 bits (787), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 189/502 (37%), Positives = 272/502 (54%), Gaps = 62/502 (12%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y +  GMQG P+ GAE RG+PL    LPEYLR LGY+T  +GKWH+G+  + + P  RGF
Sbjct: 295 YKLLRGMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGF 354

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---------YATDL 250
           ++  GY  G+I Y++H L +         G+D+ R        VGE         Y TDL
Sbjct: 355 DTFLGYYTGMIQYFNHTLYESGQL-----GYDLHR-------IVGENHTVEYRYDYMTDL 402

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYA 308
            T E   +I     +KP++L L+HLA HA +A + +E    +ET + F YI D NRR YA
Sbjct: 403 LTDEVESIISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYA 462

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           ++V  LD SVG V+ AL+RK +L N+IIIFM+DNGA T  Y       N+GSNYP RGVK
Sbjct: 463 SVVATLDKSVGRVVDALKRKDILNNTIIIFMADNGAQTEGYLA-----NYGSNYPLRGVK 517

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            TL+EG V+  A ++SP I ++ RVS ++ HI+DWLPTLY+AAGG+ S        LDQ 
Sbjct: 518 FTLFEGAVRGVACIYSPLIDRSSRVSTKLFHITDWLPTLYSAAGGNPS-------DLDQ- 569

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
                         +DG++QWS++       R S+L+NI E    A   +  +KLV    
Sbjct: 570 --------------LDGINQWSTIKKVKIDERRSILVNIQEDNEAAL--MGKYKLVRDRS 613

Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
           +N     YY     +  P  N   ++ S    ++  +S ++        K+++ R++ATI
Sbjct: 614 DNQKHYDYYSGNNVS-YPKYNVTNVLTSPAASAIASVSAHVLTAA----KIKALRKKATI 668

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
            C        PS C N  C LFN+  DPCE  +++S    +  +L   +  ++  L+ +S
Sbjct: 669 VC---KNLTNPSSCKNRTC-LFNIYKDPCEFTDLSSKNTKVVERLSVFIDSYKSVLMKES 724

Query: 609 HEQPDLVQADPKRFNDTWSPWI 630
           +   D     P  FN+TW PW+
Sbjct: 725 NTPVDPA-GFPCHFNNTWMPWL 745



 Score =  258 bits (660), Expect = 5e-66,   Method: Compositional matrix adjust.
 Identities = 144/346 (41%), Positives = 203/346 (58%), Gaps = 47/346 (13%)

Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           MQG  I G EPRG+PL  + LPE+L+ LGY+T  IGKWHLG+   ++TP YRGF+   G+
Sbjct: 1   MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
            N  +SY+D+  S Q      ++G+DM    + A+    +Y TDLFT +AV++IE+    
Sbjct: 61  YNSHVSYHDYKYSYQ-----NMSGYDMHCGDAPAYGLNDKYVTDLFTDKAVKIIENHDSF 115

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
           +PL+L ++HLA HA      LE PQ+   +  +F +I + +RR YA MV KLDDSVG ++
Sbjct: 116 RPLYLQISHLAVHAP-----LENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIV 170

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             L  +GML +S+I+F++DNGA  +       +RN+GSNYP RG       GGV+  ++L
Sbjct: 171 HTLGNRGMLRDSLILFLTDNGAAPI-----GKFRNFGSNYPLRG-------GGVRGVSVL 218

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           WSP++ +  RV   ++HISDWLPTLY AA GD S L                        
Sbjct: 219 WSPRLNKPARVCDNLIHISDWLPTLYAAADGDLSDL----------------------GE 256

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
           IDG+DQW  L    P  R+ +L+NIDE  +T       +KL+ G Q
Sbjct: 257 IDGVDQWRMLNDGHPIVRDKLLLNIDEISKTEGAIYKHYKLLRGMQ 302



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 29/71 (40%), Positives = 43/71 (60%), Gaps = 2/71 (2%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
           +Y TDL T E   +I     +KP++L L+HLA HA +A + +E    +ET + F YI D 
Sbjct: 397 DYMTDLLTDEVESIISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDL 456

Query: 72  NRRTYAALTKS 82
           NRR YA++  +
Sbjct: 457 NRRKYASVVAT 467



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/70 (40%), Positives = 44/70 (62%), Gaps = 8/70 (11%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITD 70
           +Y TDLFT +AV++IE+    +PL+L ++HLA HA      LE PQ+   +  +F +I +
Sbjct: 95  KYVTDLFTDKAVKIIENHDSFRPLYLQISHLAVHAP-----LENPQDYDHSDRRFIHIRE 149

Query: 71  PNRRTYAALT 80
            +RR YA + 
Sbjct: 150 QHRRKYARMV 159


>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 311

 Score =  307 bits (786), Expect = 1e-80,   Method: Compositional matrix adjust.
 Identities = 152/321 (47%), Positives = 200/321 (62%), Gaps = 10/321 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D S HGS++IPTPN+DA+A +GIILN  Y QP+CTPSRA+LMTG+YP H GMQ   I
Sbjct: 1   GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP  +PL    +PEY R LGY T  +GKWHLG++ R+Y P+ RGF++  G+ N  + 
Sbjct: 61  KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY+   +          GHD R      W    EYAT  +T + V++I       P+FL+
Sbjct: 121 YYNQNFTGN-----NHTGHDFRCGDQNYWAEEKEYATYYYTNKTVEIIRRHNKSAPMFLF 175

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+H A H       L+ P   +  F YI + NR  +A MV  LD SVG V+  LQ+ GML
Sbjct: 176 LSHQAPHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGML 235

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            ++II+F SDNGA    ++      N G N+P RG K TL+EGGV+  A LWS +++   
Sbjct: 236 NDTIIVFCSDNGALPWGFKS-----NRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTK 290

Query: 392 RVSLQMMHISDWLPTLYTAAG 412
           RVS Q+MHISDWLPTLY+AAG
Sbjct: 291 RVSNQLMHISDWLPTLYSAAG 311



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 40/82 (48%)

Query: 9   WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
           W    EYAT  +T + V++I       P+FL+L+H A H       L+ P   +  F YI
Sbjct: 144 WAEEKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQAPHVSGGRPLLQVPTHGVRNFSYI 203

Query: 69  TDPNRRTYAALTKSTTLTLLIV 90
            + NR  +A +  +   ++ IV
Sbjct: 204 GENNRTLFAGMVDALDQSVGIV 225


>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
 gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
 gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
 gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
          Length = 528

 Score =  304 bits (779), Expect = 7e-80,   Method: Compositional matrix adjust.
 Identities = 198/572 (34%), Positives = 286/572 (50%), Gaps = 85/572 (14%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + AA        L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 30  PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L++ GY+T  +GKWHLG +R+
Sbjct: 89  SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 206 STNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRI 260

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  LD++VG V  AL+ +G+  N+++IF +DNG  T         R+ G+N+P RG
Sbjct: 261 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 311

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T      +DG D
Sbjct: 312 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTK-PLDGFD 370

Query: 427 QWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVRL 478
            W ++   +PS R     N + D  D       NT   +N           T   A +R 
Sbjct: 371 VWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRY 430

Query: 479 DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
            +WKL+ G    G       Q+  ++VP              S+   ++ ++L       
Sbjct: 431 KNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SVDSPTKTLWL------- 469

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                                          F++  DP E+++++   P I   L   L+
Sbjct: 470 -------------------------------FDINRDPEERHDVSREHPHIVQNLLSRLQ 498

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP S+  P   + DPK     WSPW+
Sbjct: 499 YYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 528



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 205 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 259

Query: 75  TYAALT 80
            YA + 
Sbjct: 260 IYAGMV 265


>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
 gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
          Length = 478

 Score =  302 bits (774), Expect = 3e-79,   Method: Compositional matrix adjust.
 Identities = 195/558 (34%), Positives = 274/558 (49%), Gaps = 110/558 (19%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGWND+ +HGS  I TPN+D LA  G+ L N Y QP+C+PSR  LMTG+Y IH G+Q   
Sbjct: 13  YGWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSV 71

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    P G+PL E  LP+ L+E+GYST  +GKWHLGFFR+EY PL RGF++ +G+L G  
Sbjct: 72  ITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFDTFYGFLTGGE 131

Query: 211 SYYDHILSDQYSRTV-ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            Y+ H   + YS    E +G D+R       D  G Y+T LF ++A+ +I     +KP+F
Sbjct: 132 DYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDIIAHHDRNKPMF 191

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL+  A HA      L+AP+  I +++ + +   R YAAMV  +D++VG V  AL+  G
Sbjct: 192 LYLSFQAVHAP-----LQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAVGNVTDALKGSG 246

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +N+++IF +DNGA           R+ GSN+P RG KNTLWEGG++    + S  +++
Sbjct: 247 LWDNTVLIFSTDNGA----------RRSAGSNWPLRGWKNTLWEGGIRGVCFVTSNLLER 296

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
               S  ++HISDW PTL   A G T                        +  +DG D W
Sbjct: 297 KGTKSDALIHISDWFPTLIHLAQGST----------------------HDSKPLDGFDVW 334

Query: 450 SSLLLNTPSRRNSVLINIDEKKRT-----------------AAVRLDSWKLVLGTQENGT 492
           +S+     S R  +L+NID  K                   AA+R   WKL+ G Q+ G 
Sbjct: 335 NSISTGAKSTREEILLNIDPLKCKIGFTCASGGGNFNTLTYAAIRSRDWKLLTGFQDQGG 394

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
                                                        K R T        G 
Sbjct: 395 W--------------------------------------------KHRKT-------SGN 403

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
           +    T SP  N   +LFN+ NDP E+ +++   PD+   L   LK + RT VP     P
Sbjct: 404 HFEKPTDSP--NKHLWLFNIRNDPQERTDLSEKYPDVVQDLLMKLKTYNRTAVP-PFWPP 460

Query: 613 DLVQADPKRFNDTWSPWI 630
              +A+P    D + PW+
Sbjct: 461 RDPRANPVLHGDVFGPWL 478



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 5/73 (6%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF ++A+ +I     +KP+FLYL+  A HA      L+AP+  I +++ + 
Sbjct: 163 DQNGTYSTHLFQRKAIDIIAHHDRNKPMFLYLSFQAVHAP-----LQAPEGYIEKYKSVQ 217

Query: 70  DPNRRTYAALTKS 82
           +   R YAA+  +
Sbjct: 218 NILIRKYAAMVTA 230


>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
          Length = 516

 Score =  301 bits (771), Expect = 6e-79,   Method: Compositional matrix adjust.
 Identities = 184/541 (34%), Positives = 281/541 (51%), Gaps = 62/541 (11%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GW D+ +H S EI TP +D L+  G+ L N Y QP+CTPSR  LMTG+Y IHTGMQ   
Sbjct: 35  FGWYDVGYHLS-EIRTPILDKLSSGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQI 93

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P  VPL E+ LP+ ++E GY+T  +GKWHLG ++++  P  RGF+S+ GYL G  
Sbjct: 94  IWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSE 153

Query: 211 SYYDHILSDQYSR-TVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY HI     S   +     D+R   + A    G Y+T+LF++ AV +IE     +PLF
Sbjct: 154 DYYTHIRCHPISALNLTRCALDLREAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLF 213

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LY+A  A HA      L+ P+  +  + +I D +RR+YA MV  +D++VG +  ALQ++G
Sbjct: 214 LYVAFQAVHA-----PLQVPERYVAPYSFIQDHSRRSYAGMVSAMDEAVGNITLALQKRG 268

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + EN++++F +DNG  T+         + GSN+P RG K +LWEGGV+    + SP ++Q
Sbjct: 269 LWENTVLVFSTDNGGQTL---------SGGSNWPLRGRKWSLWEGGVRGVGFVASPLLEQ 319

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              +S Q++HISDWLPTL   AGG T+                          +DG D W
Sbjct: 320 PGTISRQLIHISDWLPTLVGLAGGSTN----------------------GTKPLDGFDVW 357

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           +++     S R  +L NID      A    + + +   Q  G      G +R+N    ++
Sbjct: 358 NTISKGFASPRLELLHNIDPLYNDIAPCPGTRRPLTLAQVAG------GASRTNSGFNVS 411

Query: 510 FNAIVESKTYQSLQQL-SQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
            +A +    ++ L      +I+ P    +   ST  +A +                 P  
Sbjct: 412 IHAAIRYSNWKLLTGYPGCDIWFPRPGHNGSDSTPSEAELQ---------------KPVR 456

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           L+N+  DP E+N +++  P +  +L   L +H+++ VP +    D  + DP      W P
Sbjct: 457 LYNIEKDPEERNEVSAQFPAVVDRLLARLYHHQKSAVPINFPDDD-PKCDPGP-TGAWGP 514

Query: 629 W 629
           W
Sbjct: 515 W 515



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 46/82 (56%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R   + A    G Y+T+LF++ AV +IE     +PLFLY+A  A HA      L+ P+ 
Sbjct: 176 LREAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPER 230

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            +  + +I D +RR+YA +  +
Sbjct: 231 YVAPYSFIQDHSRRSYAGMVSA 252


>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 406

 Score =  301 bits (770), Expect = 9e-79,   Method: Compositional matrix adjust.
 Identities = 157/396 (39%), Positives = 225/396 (56%), Gaps = 33/396 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+S+HGS++IPTPNID LA +GIIL + Y QP+ TP+RA+L+TG YPIHTG Q   I
Sbjct: 40  GWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLSTPTRAALLTGLYPIHTGTQRLDI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGK---WHLGFFRREYTPLYRGFESHFGYLNG 208
             A+P G+      LP+    L  +  ++G    WHLGF + E+ P  RGF++ +G  NG
Sbjct: 100 GSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLGFCKDEFKPTKRGFDTFYGIYNG 159

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              Y+ H   D     ++++GH ++       +  G Y T L   +AVQLI ++P +KP 
Sbjct: 160 DSDYWTHFARDN---NIDVSGHALKDEKRALVEEAGRYLTSLLANQAVQLIHNRPKNKPF 216

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY A  A H G +   L+AP+E I++F Y+ D +R+ +A  + +LD SVG ++ AL   
Sbjct: 217 FLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDRQLFAGSLAELDKSVGLIVEALYVS 276

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G L N++I F +DNG   V +       N   N+P RG K T+ EGGV+ P  LWS  + 
Sbjct: 277 GQLNNTVIAFSTDNGGAPVGFSA-----NTSPNWPLRGTKGTVAEGGVRGPGFLWSSSLT 331

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
              RV+ Q+ H++DW+PT YTAAGG    L +                      IDG+DQ
Sbjct: 332 TRGRVTQQLFHVTDWMPTFYTAAGGQAKDLTM----------------------IDGVDQ 369

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           W SL    PS R  VL NID     +A+R + +K++
Sbjct: 370 WESLTRGAPSPRKEVLYNIDPISDQSAIRGERYKIM 405



 Score = 67.8 bits (164), Expect = 2e-08,   Method: Compositional matrix adjust.
 Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           +  G Y T L   +AVQLI ++P +KP FLY A  A H G +   L+AP+E I++F Y+ 
Sbjct: 189 EEAGRYLTSLLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLA 248

Query: 70  DPNRRTYAA----LTKSTTLTLLIVY 91
           D +R+ +A     L KS  L +  +Y
Sbjct: 249 DYDRQLFAGSLAELDKSVGLIVEALY 274


>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
 gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
          Length = 598

 Score =  300 bits (769), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 194/588 (32%), Positives = 286/588 (48%), Gaps = 105/588 (17%)

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIV------YGWNDLSFHGSNEIPTPNIDALAY 114
           +++ FQ         Y+ + +S++    IV      YGWND+ +HGS  I TPN+D LA 
Sbjct: 96  SVSYFQKSVRRQDVVYSDIQESSSGKPNIVFILADDYGWNDIGYHGS-VIRTPNLDRLAA 154

Query: 115 NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELG 174
            G+ L N Y QP+C+PSR  LMTG+Y I  G+Q   IW  +P G+PL E  LP+ L+E G
Sbjct: 155 EGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGG 214

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY-SRTVELNGHDMR 233
           YST  +GKWHLGF++++YTP +RGF++ +GYL G   Y+ H        +    +G D+R
Sbjct: 215 YSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLR 274

Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
                  D  G Y+T LF  +A+++I  Q  +KP+FL+L+  A H       L+AP+E I
Sbjct: 275 DQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDI 329

Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
           +++ +I+D NRR YAAM   +D +VG V  AL++ G+ +N+++IF +DNG          
Sbjct: 330 SRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGG--------- 380

Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
                G N+P RG K +LWEGGV+    + SP I+   R S  ++HISDW PTL   A G
Sbjct: 381 RVDRGGINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASG 440

Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD------------QWSSLLLNTPSRRN 461
            T+     +DG D W ++    PS R   +  +D            QW   + NT     
Sbjct: 441 STNGT-KPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVH-- 497

Query: 462 SVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQS 521
                       AA+R   WKL         + GY G T                     
Sbjct: 498 ------------AAIRSGDWKL---------LTGYPGNT--------------------- 515

Query: 522 LQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNN 581
                       S +    ST+++             P+       +LFN+  DP E+ +
Sbjct: 516 ------------SRVPPPSSTKEE-------------PADTPGKHLWLFNIREDPEERTD 550

Query: 582 IASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           ++   P +  +L E L  + RT VP  +   D  QA+P    + W PW
Sbjct: 551 LSQKHPGVVQELLEKLARYNRTAVPVFYPSFD-PQANPALHGNIWGPW 597



 Score = 63.5 bits (153), Expect = 4e-07,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 47/71 (66%), Gaps = 5/71 (7%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF  +A+++I  Q  +KP+FL+L+  A H       L+AP+E I+++ +I+
Sbjct: 282 DQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDISRYSHIS 336

Query: 70  DPNRRTYAALT 80
           D NRR YAA+T
Sbjct: 337 DTNRRVYAAMT 347


>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
          Length = 301

 Score =  300 bits (768), Expect = 1e-78,   Method: Compositional matrix adjust.
 Identities = 136/217 (62%), Positives = 168/217 (77%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPNIDALAYNGIILNN Y   +CTPSR++LMTGK PIH GMQ   +
Sbjct: 85  GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 144

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  EPRG+PL+E+ LP+YL+E+GY T A+GKWHLG+F+++YTP YRGF+SHFGY NG+  
Sbjct: 145 YPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFDSHFGYWNGLED 204

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI  +  S+  E  G DMRRNL+ AWDT G+YATDLFT EAV+LI +   ++P+FLY
Sbjct: 205 YYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRLINEHDTERPMFLY 264

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
           LAHLA H GN  + L AP E I +F YI DP RR  A
Sbjct: 265 LAHLAVHKGNENQLLRAPDEEIAKFSYILDPERRIQA 301



 Score =  102 bits (255), Expect = 5e-19,   Method: Compositional matrix adjust.
 Identities = 47/77 (61%), Positives = 56/77 (72%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MRRNL+ AWDT G+YATDLFT EAV+LI +   ++P+FLYLAHLA H GN  + L AP E
Sbjct: 225 MRRNLTVAWDTAGKYATDLFTNEAVRLINEHDTERPMFLYLAHLAVHKGNENQLLRAPDE 284

Query: 61  TINQFQYITDPNRRTYA 77
            I +F YI DP RR  A
Sbjct: 285 EIAKFSYILDPERRIQA 301


>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
 gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
 gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
          Length = 534

 Score =  299 bits (766), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T+          
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                           +DG + W ++    PS R  +L NID+         D +  +  
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406

Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
             +N T        + +  PL +  FN  + +   Y++ + L+ +      F P S    
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                        +N + + P        +LF++  DP E+++++   P I   L   L+
Sbjct: 458 -------------SNVSEIPPVDPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP SH  P   + DPK     WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534



 Score = 55.5 bits (132), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
          Length = 534

 Score =  299 bits (765), Expect = 3e-78,   Method: Compositional matrix adjust.
 Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T+          
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                           +DG + W ++    PS R  +L NID+         D +  +  
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406

Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
             +N T        + +  PL +  FN  + +   Y++ + L+ +      F P S    
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                        +N + + P        +LF++  DP E+++++   P I   L   L+
Sbjct: 458 -------------SNVSEIPPVGPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP SH  P   + DPK     WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
 gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
          Length = 492

 Score =  298 bits (763), Expect = 6e-78,   Method: Compositional matrix adjust.
 Identities = 170/416 (40%), Positives = 234/416 (56%), Gaps = 36/416 (8%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGWND+ +H S+ I TPN+D LA  G+ L N Y QP+C+PSRA LMTG+Y I  GMQ   
Sbjct: 26  YGWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRYQIRYGMQHSV 84

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           +    P G+PL E  LP+ L+E GY+T  +GKWHLG F++EY P +RGF++ FG+L G  
Sbjct: 85  LMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFDTFFGFLGGGE 144

Query: 211 SYYDHILSDQYSRTVE-LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            Y+ H + ++   T E     D            G Y+T +F ++++ LI     DKP+F
Sbjct: 145 DYFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSIDLISRHDKDKPMF 204

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL   A HA      LEAP+E IN++ +I   N RTYA +   +D++VG +  AL+  G
Sbjct: 205 LYLPFQAVHAP-----LEAPEEFINKYTHIRSKNMRTYAGVTTAMDEAVGNITRALKDHG 259

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           M EN+++IF +DNGA              GSN+P RG KNTLWEGGV+    + SP ++ 
Sbjct: 260 MWENTVLIFSTDNGASKA-----------GSNWPLRGFKNTLWEGGVRGVGFVSSPLLKV 308

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
             R+S  ++HISDW PT+   AGG T    PL  DG D W ++   TPS R   +  +D 
Sbjct: 309 KQRISDALLHISDWFPTIVRIAGGSTFGTKPL--DGYDVWDTISEGTPSPRTEILHNID- 365

Query: 449 WSSLLLNTPSRRNSV----LINIDEKKRTAAVRLDSWKLVLGTQENG-----TMDG 495
              L+  T S  + +      N   K   AA+R+  WKL+ G   NG     T DG
Sbjct: 366 --PLIAATNSDASKIHGHSTFNTSVK---AAIRVRDWKLLTGNPGNGDWQLQTFDG 416



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y+T +F ++++ LI     DKP+FLYL   A HA      LEAP+E IN++ +I   N
Sbjct: 179 GSYSTHVFARKSIDLISRHDKDKPMFLYLPFQAVHAP-----LEAPEEFINKYTHIRSKN 233

Query: 73  RRTYAALTKS 82
            RTYA +T +
Sbjct: 234 MRTYAGVTTA 243


>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
 gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
          Length = 478

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 76/542 (14%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGW+D+ +HGS  I TP +D LA  G+ L N Y QP+C+PSR  LMTG+Y I  G+Q   
Sbjct: 11  YGWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRYQIRYGLQHSV 69

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    P G+PL E  LP+ L+E GYST  +GKWHLGFFR+E+ PL RGF+  +GYL G  
Sbjct: 70  ITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFDKFYGYLTGGE 129

Query: 211 SYYDHILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            Y+ H   + Y++  +  +G D+R       D  G Y+T LF K+A+++I++    KP+F
Sbjct: 130 DYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEIIQNHERSKPMF 189

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL   A H+      LEAP E + +++ + +P  +TYAAMV  +D++VG V  AL+R  
Sbjct: 190 LYLPFQAVHSP-----LEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVGNVTDALKRTR 244

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +N+++IF +DNGA           RN GSN+P RG KNTLWEGGV+    + S  +++
Sbjct: 245 LWDNTVLIFSTDNGA----------RRNAGSNWPLRGWKNTLWEGGVRGVGFVTSKLLRK 294

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R    ++HISDW PTL   +                       + S     +DG D W
Sbjct: 295 GKRKCDALIHISDWFPTLLRIS----------------------RSVSNGTKPLDGFDIW 332

Query: 450 SSLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
            ++   TPS R  +L NID    +T    LDS        ++      Y   RS    LL
Sbjct: 333 DTISSGTPSPRKEILHNIDPLHPKTDLSSLDS------LHDHIFNPSVYAAIRSGDWKLL 386

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
             +  +    Y              ++ +K     +    H                  +
Sbjct: 387 TGDQGLGGWKYSG------------TSGNKYEDAVEPPNKHL-----------------W 417

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+ +DP E+N+++   PD+  +L E L  + +T +P      D  +A+P    D + P
Sbjct: 418 LFNIRSDPQERNDLSVQYPDVVQELLERLAAYNKTALPPFWPSRD-PRANPALHGDLFGP 476

Query: 629 WI 630
           W+
Sbjct: 477 WL 478



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/73 (36%), Positives = 45/73 (61%), Gaps = 5/73 (6%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF K+A+++I++    KP+FLYL   A H+      LEAP E + +++ + 
Sbjct: 161 DQNGTYSTHLFAKKAIEIIQNHERSKPMFLYLPFQAVHSP-----LEAPTEYVEKYKDVK 215

Query: 70  DPNRRTYAALTKS 82
           +P  +TYAA+  +
Sbjct: 216 NPLVKTYAAMVSA 228


>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 537

 Score =  296 bits (757), Expect = 3e-77,   Method: Compositional matrix adjust.
 Identities = 194/598 (32%), Positives = 287/598 (47%), Gaps = 113/598 (18%)

Query: 73  RRTYAALTKSTTLTLLIV---------------------YGWNDLSFHGSNEIPTPNIDA 111
           R + A L ++ T+T+LI                      YGW D+ +H S  I TPN+D 
Sbjct: 11  RCSCAFLVEALTVTVLICTGLIKGATGKPPHIVFIVADDYGWFDVGYHNST-IKTPNLDL 69

Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
           LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I   +P  +PL E  LP+ L+
Sbjct: 70  LASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLK 129

Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN--- 228
           E GY+T  +GKWHLGF++ E  PL RGF+S FGYL+G+  Y+ H  S  +    E N   
Sbjct: 130 ESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGFPEGNHWL 189

Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
           G D   N   AW+  G Y+  +FT+ A ++I+    ++PLFLYL   + H       L+ 
Sbjct: 190 GIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQV 244

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
           P++ +  + +  D  R+TYA MV  +D++VG V+ +LQ  G+  +++++F +DNG    +
Sbjct: 245 PEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTPGK 304

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPT 406
                     G+N+P RG KNTLWEGGV     +  P I    +  VS   MHISDW PT
Sbjct: 305 S---------GNNWPLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPT 355

Query: 407 LYTA-AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS---NID-----------GLDQWSS 451
           L    AGG+T+ L L  D  + W+S+   TPS R     NID           G D+ + 
Sbjct: 356 LIEGVAGGNTAGLAL--DSYNMWNSITKGTPSPRKELLHNIDPYIRADHPFGYGYDEETD 413

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           ++         +        R AA+R+  WKL         + G+ G++     P  N +
Sbjct: 414 MIYPLSGLYPKMAAEFSTDMR-AAIRVGEWKL---------LTGFPGRSGWYPPPEWNIH 463

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
            I                                             P    N   +LFN
Sbjct: 464 PI--------------------------------------------DPVEAANKVTWLFN 479

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           +  DPCE+N+++   P++ ++L   L+ + +T VP        V+ADP   +  W PW
Sbjct: 480 ITADPCEKNDLSYQHPEVVTELVGRLEAYYKTSVPVRFPN-QTVKADPALHHGVWGPW 536



 Score = 46.6 bits (109), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 5/79 (6%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           N   AW+  G Y+  +FT+ A ++I+    ++PLFLYL   + H       L+ P++ + 
Sbjct: 196 NNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQVPEKYMK 250

Query: 64  QFQYITDPNRRTYAALTKS 82
            + +  D  R+TYA +  +
Sbjct: 251 PYAHFQDVGRQTYAGMVAT 269


>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
          Length = 534

 Score =  295 bits (755), Expect = 4e-77,   Method: Compositional matrix adjust.
 Identities = 188/552 (34%), Positives = 280/552 (50%), Gaps = 87/552 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 57  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A  LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATTLITNHPPEKPL 233

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 288

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D 
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTISEGSPSPRVELLHNID- 397

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYYG 498
               + ++P  RNS     D+              AA+R  +WKL+ G    G       
Sbjct: 398 -PDFVDSSPCPRNSTAPAKDDSSLPEHSAFNTSVHAAIRHRNWKLLTGYPGCGYWFPPPS 456

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
           Q   +++P              SL   ++ ++L                           
Sbjct: 457 QYNVSEIP--------------SLDPPTKTLWL--------------------------- 475

Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
                      F++  DP E+++++   P I ++L   L+++ +  VP      D  + D
Sbjct: 476 -----------FDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCD 523

Query: 619 PKRFNDTWSPWI 630
           PK     W PW+
Sbjct: 524 PKA-TGVWGPWM 534



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 YYAGMV 271


>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
          Length = 532

 Score =  294 bits (753), Expect = 8e-77,   Method: Compositional matrix adjust.
 Identities = 187/550 (34%), Positives = 287/550 (52%), Gaps = 83/550 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS+ I TP++DALA  G+ L+N Y QP+CTPSR+ L++G+Y IHTG+Q   I
Sbjct: 55  GWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGRYQIHTGLQHQII 113

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FT+ A  LI + P +KPL
Sbjct: 174 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATALIANHPPEKPL 231

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NRR YA MV  +D++VG V +AL+++
Sbjct: 232 FLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLMDEAVGNVTAALKKR 286

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+ +N++ IF +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++
Sbjct: 287 GLWDNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGVGFVASPLLK 337

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD- 447
           Q    + +++HISDWLPTL   AGG T+     +DG D W ++   +PS R   +  +D 
Sbjct: 338 QKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTVSEGSPSPRVELLHNIDP 396

Query: 448 ---QWSSLLLNT--PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
                S    N+  P++ +S   +      +  AA+R  +WKL+ G    G       Q+
Sbjct: 397 NFVDISPCPGNSTAPAKGDSSFPDYTAFNTSFHAAIRHRNWKLLTGYPGCGYWFPPPSQS 456

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
             +++P              SL   ++ ++L                             
Sbjct: 457 NVSEIP--------------SLDPPTKTLWL----------------------------- 473

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
                    F++  DP E+++++   P +  QL   L+++ +  VP  +   D  + DP+
Sbjct: 474 ---------FDIEQDPEERHDLSREYPHVIKQLLSRLQFYHKHSVPVYYPAQD-PRCDPE 523

Query: 621 RFNDTWSPWI 630
                W PW+
Sbjct: 524 S-TGAWGPWM 532



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 209 YSTNVFTERATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRR 263

Query: 75  TYAALT 80
            YA + 
Sbjct: 264 HYAGMV 269


>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
          Length = 458

 Score =  294 bits (752), Expect = 1e-76,   Method: Compositional matrix adjust.
 Identities = 169/428 (39%), Positives = 242/428 (56%), Gaps = 31/428 (7%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHISDWLPTL   AGG T+     +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTK-PLDGFN 376

Query: 427 QWSSLLLNTPSRRNSNIDGLDQ--WSSLLL----NTPSRRNSVLINIDEKKRT--AAVRL 478
            W ++    PS R   +  +DQ  +  L       TP++ +S  +       +  A +R 
Sbjct: 377 MWKTISEGHPSPRVELLHNIDQDFFDGLPCPGKNMTPAKDDSFPLEHSAFNTSIHAGIRY 436

Query: 479 DSWKLVLG 486
            +WKL+ G
Sbjct: 437 KNWKLLTG 444



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 425

 Score =  293 bits (750), Expect = 2e-76,   Method: Compositional matrix adjust.
 Identities = 178/467 (38%), Positives = 247/467 (52%), Gaps = 61/467 (13%)

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           P+CTPSR++LMTG YPIH GMQ   I  A P G+PL  + LP++ ++LGY    IGKWHL
Sbjct: 2   PLCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWHL 61

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           GFF+  Y P+ RGF++ FG   G   YY+H       +  ++ G            TVG 
Sbjct: 62  GFFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITG--------ALCLTVGG 113

Query: 246 YATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
                 T     L+ D P++   +P F Y +H A H     +  +AP   + +F YI + 
Sbjct: 114 -----ITHGVEPLLTDSPLNFYPQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEES 168

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
           NR  YA M+  LD+S+G V+ AL   GMLE++II+F SDNG             N G N+
Sbjct: 169 NRTIYAGMLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIES-----NRGFNW 223

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
           P RG K T+WEGGV+VPA +WSP+  +  RVS Q+MHISDWLPTLYTAAGGD S L   I
Sbjct: 224 PLRGAKFTVWEGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAGGDASALG-TI 282

Query: 423 DGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
           DG+D W+SL     + R++                S+R  +L+N +     + +R   +K
Sbjct: 283 DGVDMWNSL-----THRSA----------------SKRKEILLNSNSLYNISGLRYKQYK 321

Query: 483 LVLGTQENGTMDGYY---GQTR-SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI--SNI 536
           LV+G   +G +D +Y   G TR +N +  L  N    ++  +   +     + P+  SN 
Sbjct: 322 LVVGGGFDGELDDHYYFPGGTRPTNDIDQLR-NESTTARVLRKFYEKQDRSWSPLAWSNN 380

Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
             +   R + T     N  PM P        YLFN+  DPCE NN+A
Sbjct: 381 VAVDCRRNKLT----ENFVPMQPP-------YLFNIEKDPCELNNLA 416



 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 3/56 (5%)

Query: 27  LIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
           L+ D P++   +P F Y +H A H     +  +AP   + +F YI + NR  YA +
Sbjct: 121 LLTDSPLNFYPQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIYAGM 176


>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
          Length = 522

 Score =  293 bits (749), Expect = 3e-76,   Method: Compositional matrix adjust.
 Identities = 191/548 (34%), Positives = 275/548 (50%), Gaps = 79/548 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H SN I TP++DAL+  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 45  GWNDVGYHDSN-IFTPHLDALSAQGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 103

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  +PL E+ LPE LRE GY T  +GKWHLG FR+E  P  RGF++ FGYL G   
Sbjct: 104 WPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 163

Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H       +  V     D R     A      Y+T++FT+ AV LI + P  KPLFL
Sbjct: 164 YYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERAVNLIANHPAQKPLFL 223

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  + H     + L+ P+E +  + +I + NR+ YA MV  +D++VG V  ALQ+ G+
Sbjct: 224 YLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMDEAVGNVTRALQKYGL 278

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++ IF +DNG  T+           G+N+P RG K TLWEGG++    + SP +++ 
Sbjct: 279 WNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRGVGFVSSPLLKRK 329

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW- 449
              S +++HISDWLPTL   AGG T+     +DG D W ++   +PS R   +  +D   
Sbjct: 330 RVKSSELIHISDWLPTLVNLAGGRTNGTK-PLDGFDVWKTISEGSPSPRMELLHNIDPGF 388

Query: 450 -SSLLLNTPSRRNSVLINIDEKKRT------AAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
             +   + P+   S          T      AA+R  +WKL         + GY G    
Sbjct: 389 VDTAPCDRPASPPSTRDPFPSAHSTFNVSIHAAIRYGNWKL---------LTGYPGCGHW 439

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
              P L                         SN+ ++ S                 P P 
Sbjct: 440 FPPPSL-------------------------SNVTELPS-----------------PDPP 457

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
           T    +LF++  DP E+ +++S  P I  +L + L+Y+ +  VP  +   D    DP+  
Sbjct: 458 TKN-LWLFDIQQDPEERFDLSSQHPKIVKKLLDRLQYYHKNSVPVFYPDED-PHCDPEAA 515

Query: 623 NDTWSPWI 630
              W PW+
Sbjct: 516 G-VWGPWM 522



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ AV LI + P  KPLFLYLA  + H     + L+ P+E +  + +I + NR+
Sbjct: 199 YSTNVFTERAVNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQ 253

Query: 75  TYAALT 80
            YA + 
Sbjct: 254 HYAGMV 259


>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
 gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
          Length = 533

 Score =  292 bits (747), Expect = 4e-76,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 237/408 (58%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP +DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 56  GWNDVGFHGS-AIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FT+ A  LI + P +KPL
Sbjct: 175 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPL 232

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NRR YA M   +D++VG V +AL+R+
Sbjct: 233 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERR 287

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 288 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 338

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +    + +++HISDWLPTL   AGG T+     +DG D W+++   +PS R   +  +D 
Sbjct: 339 RKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSPSPRMELLHNID- 396

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             + +   P   NS+ +  DE              AAVR  +WKL+ G
Sbjct: 397 -PNFVDTAPCPGNSMALAKDESSLLEYSAFNTSIHAAVRHQNWKLLTG 443



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 210 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 YYAGMA 270


>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
          Length = 533

 Score =  291 bits (746), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTI 380

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438

Query: 482 KLVLG 486
           KL+ G
Sbjct: 439 KLLTG 443



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270


>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 574

 Score =  291 bits (746), Expect = 6e-76,   Method: Compositional matrix adjust.
 Identities = 184/540 (34%), Positives = 274/540 (50%), Gaps = 73/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H    I TP +D LA  G+ L N Y QP+CTPSR+ LMTG+Y IHTG+Q   I
Sbjct: 57  GFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHSII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             ++P  +P     LPE LR+ GYST  +GKWHLGF+R+   P  +GF++ FG L G + 
Sbjct: 116 RPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFDTFFGSLTGSVD 175

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           +Y+++  D       + G+D+      AW   G+Y+T LFT+ A +++E   P +KPLFL
Sbjct: 176 HYNYLSCDGPG----VCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFL 231

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+  A H       L+ P+  I  ++ + +  RR  AAMV  +D++V  V  AL++ G 
Sbjct: 232 LLSLQAVHTP-----LQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTYALRKYGF 286

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NS+II+ +DNGA              GSN+P RG K T WEGG++  A + SP +++ 
Sbjct: 287 YRNSVIIYSTDNGAQPF---------TGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLKRR 337

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGG+ S+    +DG D W ++     S R   +  +D   
Sbjct: 338 RRVSKALLHITDWFPTLVGLAGGNISQ-SRGLDGFDVWPTISEGRDSPRQEILHNID--- 393

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
                 P  ++    ++ +    AA+R+  WKL+ G   +G           + VP    
Sbjct: 394 ------PLHKHVTSESLWDTSIQAAIRVGDWKLLTGYPGHG-----------DWVP---- 432

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
              V          L +NI  P  N                           T    +LF
Sbjct: 433 -PQVLPTLPGRWWDLERNILSPYKN--------------------------STYKNIWLF 465

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           N+  DPCE++++A  RPD+  QL   L Y+ +T VP  +  PD  +ADP +    W PW+
Sbjct: 466 NITADPCERHDLADQRPDVVLQLLARLAYYNQTAVP-VYFPPDDPRADPSQHEGAWVPWV 524



 Score = 45.4 bits (106), Expect = 0.083,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 6/79 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
             AW   G+Y+T LFT+ A +++E   P +KPLFL L+  A H       L+ P+  I  
Sbjct: 197 GVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLLSLQAVHTP-----LQTPKSYIYP 251

Query: 65  FQYITDPNRRTYAALTKST 83
           ++ + +  RR  AA+  + 
Sbjct: 252 YRDMANIARRKLAAMVSTV 270


>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
 gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
 gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
 gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
 gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  291 bits (745), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439

Query: 482 KLVLG 486
           KL+ G
Sbjct: 440 KLLTG 444



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
 gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  291 bits (745), Expect = 7e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 383 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 440

Query: 482 KLVLG 486
           KL+ G
Sbjct: 441 KLLTG 445



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
 gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
 gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
 gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
          Length = 533

 Score =  291 bits (744), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438

Query: 482 KLVLG 486
           KL+ G
Sbjct: 439 KLLTG 443



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270


>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
 gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
          Length = 535

 Score =  291 bits (744), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 383 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 440

Query: 482 KLVLG 486
           KL+ G
Sbjct: 441 KLLTG 445



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
 gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
 gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
 gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
          Length = 534

 Score =  291 bits (744), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439

Query: 482 KLVLG 486
           KL+ G
Sbjct: 440 KLLTG 444



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
 gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
          Length = 533

 Score =  291 bits (744), Expect = 9e-76,   Method: Compositional matrix adjust.
 Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTNGTK-PLDGFDVWKTI 380

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438

Query: 482 KLVLG 486
           KL+ G
Sbjct: 439 KLLTG 443



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270


>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
 gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
          Length = 528

 Score =  291 bits (744), Expect = 1e-75,   Method: Compositional matrix adjust.
 Identities = 188/547 (34%), Positives = 274/547 (50%), Gaps = 66/547 (12%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GWND+ +HGS EI TPN+D L+  G+ L N Y QP+CTPSR  LMTG+Y I TGMQ   
Sbjct: 39  FGWNDVGYHGS-EIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGRYQIRTGMQHQI 97

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P  VPL E+ LP+ +RE GY+T  +GKWHLG +R++  P  RGF+S+FGYL G  
Sbjct: 98  IWPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGFDSYFGYLTGSE 157

Query: 211 SYYDHILSDQYSRTVELN----GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
            Y+ H    + S    LN      D+R     A    G Y+T LFT     +I  Q   K
Sbjct: 158 DYFSH---QRCSYVPPLNVTRCAVDLREGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKK 214

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLY+A  A HA      L+ P+  +  + +I DPNRR YA MV  +D++VG +  ALQ
Sbjct: 215 PLFLYVALQAVHA-----PLQVPERYVAPYSFIKDPNRRLYAGMVSAMDEAVGNITLALQ 269

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
             G+ +N++++F +DNG  T+           GSN+P RG K +LWEGGV+    + SP 
Sbjct: 270 ETGLWDNTVLVFSTDNGGQTLAG---------GSNWPLRGRKWSLWEGGVRGVGFVTSPL 320

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           ++Q   VS Q++HISDWLPTL   AGG T+     +DG + W+++     S R   +  +
Sbjct: 321 LKQPGTVSRQLIHISDWLPTLVGLAGGSTNGTK-PLDGYNVWNAISKGYASPRLELLHNI 379

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           D               + ++I     +A+ RL           N  + G  G + SN  P
Sbjct: 380 D--------------PLYVDIAPCPGSASQRLPV--------SNAPVSG--GGSTSN--P 413

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS----TRQQATIHCGANPAPMTPSPC 562
               N+      + +++ +   +       D+  +         TI    NP        
Sbjct: 414 WSWSNSGFNVSIHAAIRSMQWKLLTGYPGCDRWFTRPGLENTSGTIR-SVNPLK------ 466

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
              P  LF++  DP E++ ++   P++   L   L ++ R   P +    D  + DP   
Sbjct: 467 ---PVMLFDVEADPEERDEVSDQHPNVVDNLLRRLGHYLRRSQPITFPDDD-PRCDPG-- 520

Query: 623 NDTWSPW 629
              W PW
Sbjct: 521 TGAWGPW 527



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R     A    G Y+T LFT     +I  Q   KPLFLY+A  A HA      L+ P+ 
Sbjct: 180 LREGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKKPLFLYVALQAVHA-----PLQVPER 234

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            +  + +I DPNRR YA +  +
Sbjct: 235 YVAPYSFIKDPNRRLYAGMVSA 256


>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
          Length = 534

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 166/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 40  AGASQPPHLVFLLADDLGWNDVGFHGS-RIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGG++    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 323 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
              +PS R   +  +D   + + ++P  RNS+    D+              AA+R  +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439

Query: 482 KLVLG 486
           KL+ G
Sbjct: 440 KLLTG 444



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
          Length = 564

 Score =  290 bits (741), Expect = 2e-75,   Method: Compositional matrix adjust.
 Identities = 180/541 (33%), Positives = 277/541 (51%), Gaps = 76/541 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+C+PSR+  +TGKY IHTG+Q   I
Sbjct: 58  GYRDIGYHGS-EIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             ++P  +PL    LP+ L++ GY T  +GKWHLGF+++E  P  RGF+S FG L G   
Sbjct: 117 RPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFDSFFGSLLGSGD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY+H   D    +  + G+D+  N + AWD   G Y+T+++T+  +Q++     +KPLFL
Sbjct: 177 YYNHYKCD----SPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFL 232

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP   ++ +  I + NRR YAAM+  LDD+V  V +AL++ G 
Sbjct: 233 YIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNVTTALKKYGF 287

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSIII+ SDNG   +           G+N+P RG K T WEGG++    + SP ++  
Sbjct: 288 YDNSIIIYSSDNGGQPMA---------GGNNWPLRGSKGTYWEGGIRAVGFVHSPILKMK 338

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             +  +++HI+DW PTL T AGG+     + +DG D W ++     S R   +  +D   
Sbjct: 339 GYICKELVHITDWFPTLVTLAGGEIDE-EIQLDGYDIWETISEGKRSPRQEILHNIDP-- 395

Query: 451 SLLLNTPSRRNS--VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
              + T +R  S      I      +A+R++ WKL+ G    G           + +P  
Sbjct: 396 ---MYTKARNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNPGYG-----------DWIPPQ 441

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
           +F                       SN+   R   ++ T+  G +              +
Sbjct: 442 SF-----------------------SNVGPNRWHNERVTLASGKS-------------VW 465

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+  DP E+ +++   PDI  QL   L    +T VP  +  P   +++PK     W P
Sbjct: 466 LFNITADPYERIDLSDRYPDIVKQLLRRLAQFNKTAVPVRYP-PRDPRSNPKLNGGVWGP 524

Query: 629 W 629
           W
Sbjct: 525 W 525



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 12/105 (11%)

Query: 1   MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T+++T+  +Q++     +KPLFLY+A+ A H+      L+AP 
Sbjct: 193 LYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFLYIAYQAVHS-----PLQAPG 247

Query: 60  ETINQFQYITDPNRRTYAALTK------STTLTLLIVYGWNDLSF 98
             ++ +  I + NRR YAA+        +   T L  YG+ D S 
Sbjct: 248 NYLDNYNSILNENRRRYAAMLSCLDDAVNNVTTALKKYGFYDNSI 292


>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
          Length = 535

 Score =  289 bits (740), Expect = 3e-75,   Method: Compositional matrix adjust.
 Identities = 164/408 (40%), Positives = 236/408 (57%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 58  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D 
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID- 398

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             + + ++P  RNS     D+              AA+R  +WKL+ G
Sbjct: 399 -PNFVDSSPCPRNSTAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 445



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
          Length = 503

 Score =  288 bits (737), Expect = 6e-75,   Method: Compositional matrix adjust.
 Identities = 165/415 (39%), Positives = 235/415 (56%), Gaps = 36/415 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  + TP++DALA  G+ L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 27  GWNDVGFHGS-RLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 85

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +++E  P  RGF+++FGYL G   
Sbjct: 86  WPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGFDTYFGYLLGSED 145

Query: 212 YYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           YY H     I +   +R       D R     A      Y+T++FT  A  LI + P +K
Sbjct: 146 YYTHEHCVFIKALNVTRCAL----DFRDGEEVATQYKNLYSTNIFTNRATSLIANHPPEK 201

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+
Sbjct: 202 PLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEAVGNVTAALK 256

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
             G+  N++ IF +DNG  T+           G+N+P RG K TLWEGGV+    + SP 
Sbjct: 257 SHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGVGFVASPL 307

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           ++Q    + +++HISDWLPTL   AGGDT      +DG D W ++   +PS R   +  +
Sbjct: 308 LKQKGVKNRELIHISDWLPTLVKLAGGDTHGTK-PLDGFDVWRTISEGSPSPRMELLHNI 366

Query: 447 DQWSSLLLNTPSRRNSVLINIDEK---------KRTAAVRLDSWKLVLGTQENGT 492
           D   S +  +P   NS     D+             AA+R  +WKL+ G    G+
Sbjct: 367 D--PSFVDLSPCPGNSTTPAKDDSFPEYSTFNTSVHAAIRHGNWKLLTGFPGCGS 419



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT  A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 181 YSTNIFTNRATSLIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 235

Query: 75  TYAALT 80
            YA + 
Sbjct: 236 LYAGMV 241


>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
          Length = 534

 Score =  288 bits (736), Expect = 8e-75,   Method: Compositional matrix adjust.
 Identities = 163/408 (39%), Positives = 235/408 (57%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 57  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A  LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   AGG  +     +DG D W ++   +PS R   +  +D 
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLAGGHINGTK-PLDGFDVWKTISEGSPSPRVELLHNID- 397

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             + + ++P   NS+    D+              AA+R  +WKL+ G
Sbjct: 398 -PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 444



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
 gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
           Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
           Flags: Precursor
 gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
          Length = 535

 Score =  288 bits (736), Expect = 9e-75,   Method: Compositional matrix adjust.
 Identities = 164/414 (39%), Positives = 234/414 (56%), Gaps = 33/414 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGSN I TP++D LA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  S  V     D R     A      Y+T++FT+ A  LI   P +KPL
Sbjct: 177 YYSHERCALID--SLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATALITSHPPEKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGG++    + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T      +DG D W ++   +PS R   +  +D 
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGSTKGTK-PLDGFDVWKTISEGSPSPRKELLHNID- 398

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
             + +  +P    S+    D+              AA+R  +WKL+ G    G 
Sbjct: 399 -PNFVDISPCPGKSLAPAKDDSSHPAYLAFNTSLHAAIRHGNWKLLTGYPGCGC 451



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI   P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTERATALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 YYAGMV 272


>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
 gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
          Length = 535

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 163/414 (39%), Positives = 236/414 (57%), Gaps = 33/414 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GWHDVGFHGS-RIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FT+ A  LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALISNHPPEKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NRR YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGNVTAALKSH 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ +F +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 290 GLWNNTVFVFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +    S +++HISDWLPTL   AGG T      +DG D W ++   +PS R   +  +D 
Sbjct: 341 RKGVKSRELVHISDWLPTLVGLAGGSTKGTK-PLDGFDVWRTISEGSPSPRMELLHNID- 398

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
             + +  +P    S+    D+              AA+R  +WKL+ G    G 
Sbjct: 399 -PNFVDISPCPGQSLAPAKDDSSHPGYFSFNTSLHAAIRHGNWKLLTGYPGCGC 451



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 212 YSTNIFTERATALISNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRR 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 YYAGMV 272


>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
          Length = 506

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 163/408 (39%), Positives = 235/408 (57%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 29  GWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 87

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 88  WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 147

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A  LI + P +KPL
Sbjct: 148 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 205

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 206 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 260

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 261 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 311

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   AGG  +     +DG D W ++   +PS R   +  +D 
Sbjct: 312 QKGVKNRELIHISDWLPTLVKLAGGHINGTK-PLDGFDVWKTISEGSPSPRVELLHNID- 369

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             + + ++P   NS+    D+              AA+R  +WKL+ G
Sbjct: 370 -PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 416



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 183 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 237

Query: 75  TYAALT 80
            YA + 
Sbjct: 238 HYAGMV 243


>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
 gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
          Length = 488

 Score =  287 bits (735), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 190/581 (32%), Positives = 281/581 (48%), Gaps = 102/581 (17%)

Query: 54  HLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALA 113
           H+ A    +   Q +++ N     A+       L+   GW DL +HGS  I TPNI+ LA
Sbjct: 3   HINALSLVLLSAQILSEAN-----AIPPHIVFILVDDLGWFDLGYHGS-VIRTPNINQLA 56

Query: 114 YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLREL 173
            +GIIL+N Y QP+CTP+R++LMTGKYPIH G Q   I   +P G+PL    LPE L++ 
Sbjct: 57  GDGIILDNYYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQ 116

Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR 233
           GY+T  +GKWHLGF++ ++ P  RGF+S +GY  G   ++ H +         L   D R
Sbjct: 117 GYATHIVGKWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNV---------LGFLDFR 167

Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
            N     D  G Y T  FTK AV  I       PLFLYL     H       ++AP E I
Sbjct: 168 DNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYI 222

Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
           +++ +I D  RRT+AAMV  +D+++G V SAL+  G+ EN++++F +DNG          
Sbjct: 223 DKYSFIKDKTRRTHAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGG--------- 273

Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
            +   G NYP RG KNTLWEGGV+    +  P   ++  +   +MH++DW PTL   AGG
Sbjct: 274 IHTAGGYNYPLRGEKNTLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGG 333

Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
                                       ++DG+D W +L  +T S R  +L NID K   
Sbjct: 334 SM------------------------QDSLDGVDLWDALKQDTASPRKEILHNIDLKIDV 369

Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
               +               +G   ++R  K+ L   NA     T+    +L +     I
Sbjct: 370 PTAFV--------------FEGVALRSRDMKLLLKVPNA-----TWLVPPELREP---GI 407

Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
           +  + + S + Q  I                    L+N+  DP E+++++   PDI +++
Sbjct: 408 TANEGLGSAKMQEVIEVA-----------------LYNITADPTERHDLSGKFPDIVNEM 450

Query: 594 YELLKYHRRTLVPQSHEQPDLVQADPKRF-----NDTWSPW 629
            + + ++R  LVP     P + + DPK       N  WSPW
Sbjct: 451 KKRVDFYRGGLVP-----PIIKKNDPKAVLTAIKNGAWSPW 486



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/80 (37%), Positives = 38/80 (47%), Gaps = 5/80 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            R N     D  G Y T  FTK AV  I       PLFLYL     H       ++AP E
Sbjct: 166 FRDNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPE 220

Query: 61  TINQFQYITDPNRRTYAALT 80
            I+++ +I D  RRT+AA+ 
Sbjct: 221 YIDKYSFIKDKTRRTHAAMV 240


>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
          Length = 431

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 154/365 (42%), Positives = 216/365 (59%), Gaps = 23/365 (6%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHISDWLPTL   AGG T+     +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTK-PLDGFN 376

Query: 427 QWSSL 431
            W ++
Sbjct: 377 MWKTI 381



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
          Length = 521

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 190/550 (34%), Positives = 276/550 (50%), Gaps = 83/550 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ +HGS +I TP +DAL+  G+ L   Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 44  GWQDVGWHGS-QIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRYQIHTGLQHEII 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LPE L+E GY T  +GKWHLG +R E  P  RGF+++FGYL G   
Sbjct: 103 WPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFDTYFGYLLGSED 162

Query: 212 YYDHI-LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H       S+ V     D+R     A      Y+T++FT+ A  LI +   +KPLFL
Sbjct: 163 YYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDLIANHQPEKPLFL 222

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  + H     + L+ P++ +  + +I D  RR YA MV  LD++VG +  AL+ +G+
Sbjct: 223 YLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAVGNITEALKTRGL 277

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N+++IF +DNG  T+           G+N+P RG K TLWEGGV+    + SP ++Q 
Sbjct: 278 WNNTVLIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGAGFVASPLLKQR 328

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              S +++HISDWLPTL   AGG T+     +DG D W +L    PS R   +  +D   
Sbjct: 329 GVESHELIHISDWLPTLVKLAGGSTNDTK-PLDGFDVWKTLSEGEPSPRKELLHNID--P 385

Query: 451 SLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYYGQT 500
             +  +P + N   + +D               AA+R   WKL+ G           G +
Sbjct: 386 IFVDPSPCKGNRSCLAVDYAAALKYSIFNISIHAAIRQGKWKLLTGNP---------GCS 436

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
                P L FN   ESKT                                      M+  
Sbjct: 437 HWFPPPSL-FN---ESKT--------------------------------------MSSD 454

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
           P T    +LF++ +D  E+N+++   PDI  +L   L+++ +  VP  +   D  + DP+
Sbjct: 455 PLTKK-LWLFDIVHDAEERNDLSDQYPDIVKKLLIRLQHYYKNSVPVFYPDDD-PRCDPE 512

Query: 621 RFNDTWSPWI 630
                W PW+
Sbjct: 513 A-TGAWGPWM 521



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI +   +KPLFLYLA  + H     + L+ P++ +  + +I D  RR
Sbjct: 198 YSTNVFTQRAQDLIANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRR 252

Query: 75  TYAALT 80
            YA + 
Sbjct: 253 KYAGMV 258


>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
          Length = 517

 Score =  287 bits (734), Expect = 1e-74,   Method: Compositional matrix adjust.
 Identities = 194/573 (33%), Positives = 279/573 (48%), Gaps = 98/573 (17%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + AA        L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 30  PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L++ GY+T  +GKWHLG +R+
Sbjct: 89  SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  LI + P +K +                 L+ P+E +  + +I D +RR 
Sbjct: 206 STNIFTKRATTLIANHPPEKSVH--------------DPLQVPEEYMEPYDFIQDKHRRI 251

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  LD++VG V  AL+ +G+  N+++IF +DNG  T         R+ G+N+P RG
Sbjct: 252 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 302

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGL 425
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T    PL  DG 
Sbjct: 303 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPL--DGF 360

Query: 426 DQWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVR 477
           D W ++   +PS R     N + D  D       NT   +N           T   A +R
Sbjct: 361 DVWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIR 420

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
             +WKL+ G    G +                        T+  LQ       L + ++D
Sbjct: 421 YKNWKLLTGYPGCGGL-----------------------LTWVRLQ-------LQVPSVD 450

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
               T                         +LF++  DP E+++++   P I   L   L
Sbjct: 451 SPTKT------------------------LWLFDINRDPEERHDVSREHPHIVQNLLSRL 486

Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           +Y+    VP S+  P   + DPK     WSPW+
Sbjct: 487 QYYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 517


>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
          Length = 429

 Score =  287 bits (734), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 156/381 (40%), Positives = 227/381 (59%), Gaps = 23/381 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP +DALA  G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 56  GWNDVGFHGS-AIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FT+ A  LI + P +KPL
Sbjct: 175 YYSHERCTLIDALN--VTRCALDFRDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPL 232

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NRR YA M   +D++VG V +AL+R+
Sbjct: 233 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERR 287

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 288 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 338

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +    + +++HISDWLPTL   AGG T+     +DG D W+++   +PS R   +  +D 
Sbjct: 339 RKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSPSPRMELLHNID- 396

Query: 449 WSSLLLNTPSRRNSVLINIDE 469
             + +   P   NS+ +  DE
Sbjct: 397 -PNFVDTAPCPGNSMALAKDE 416



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 210 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 YYAGMA 270


>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
          Length = 519

 Score =  286 bits (733), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 166/428 (38%), Positives = 239/428 (55%), Gaps = 36/428 (8%)

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           +T AA        L   +GW D+ +HGS EI TPN+D L+  G+ L N Y QP+CTPSR 
Sbjct: 23  QTLAARQPHVVFILADDFGWYDIGYHGS-EIRTPNLDKLSAGGVRLENYYVQPLCTPSRN 81

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            LMTG+Y IHTGMQ   IW  +P  VPL E+ LP+ ++E GY+T  +GKWHLG ++++  
Sbjct: 82  QLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCL 141

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           P  RGF+++ GYL G   Y+ H    Q  S  +     D+R     A    G Y+T+L +
Sbjct: 142 PTRRGFDTYLGYLTGSEDYFTHFRCYQSPSLNLSRCALDLRDGEEVATGYKGVYSTELLS 201

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           + A+ +IE     KPLF+Y+A  A HA      L+ P+  +  + +I D NRR YA MV 
Sbjct: 202 QRAISIIERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTNRRLYAGMVS 256

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            +D++VG +  ALQ++G+  N++++F +DNG  T+         + GSN+P RG K +LW
Sbjct: 257 AMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTL---------SGGSNWPLRGRKWSLW 307

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGGV+    + SP ++Q   V+ +++HISDWLPTL   AGG T+     +DG D W+++ 
Sbjct: 308 EGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAGGSTTGTK-PLDGFDVWNAIS 366

Query: 433 LNTPSRRNS---NIDGL-------DQWSSLLLNTPSR----RNSVLINIDEKKRTAAVRL 478
               S R     NID L        Q    L    SR     +S  ++I      AA+R 
Sbjct: 367 KGFASPRLELLHNIDPLYYDTAPCKQHKLTLAQVVSRDMWANSSFNVSIH-----AAIRS 421

Query: 479 DSWKLVLG 486
             WKL+ G
Sbjct: 422 SKWKLLTG 429



 Score = 53.5 bits (127), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 40/70 (57%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y+T+L ++ A+ +IE     KPLF+Y+A  A HA      L+ P+  +  + +I D N
Sbjct: 193 GVYSTELLSQRAISIIERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTN 247

Query: 73  RRTYAALTKS 82
           RR YA +  +
Sbjct: 248 RRLYAGMVSA 257


>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
          Length = 573

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 187/542 (34%), Positives = 271/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 119 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLY 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 289

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NSIIIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 290 NNSIIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 340

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 341 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 396

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 397 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 443

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 444 LAAFPGS----------------WWNLERMASARQA---------------------VWL 466

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L Y+ RT +P  +  E P   +A P      W 
Sbjct: 467 FNISADPYEREDLAGQRPDVVRALLARLAYYNRTAIPVRYPAENP---RAHPDFNGGAWG 523

Query: 628 PW 629
           PW
Sbjct: 524 PW 525



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 200 SVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 254

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 255 RTMGNVARRKYAAMV 269


>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
          Length = 532

 Score =  286 bits (732), Expect = 2e-74,   Method: Compositional matrix adjust.
 Identities = 159/406 (39%), Positives = 234/406 (57%), Gaps = 29/406 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS+ I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I  G+Q   I
Sbjct: 55  GWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRMGLQHQII 113

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173

Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H       +  V     D R     A      Y+T++FT+ A  LI + P +KPLFL
Sbjct: 174 YYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFL 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  + H     + L+ P+E +  + +I D NR  YA MV  +D+++G V +AL+  G+
Sbjct: 234 YLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAIGNVTAALKSSGL 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++Q 
Sbjct: 289 WNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQK 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              + +++HISDWLPTL   AGG T+     +DG D W ++   +PS R   +  +D   
Sbjct: 340 GVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWETISEGSPSPRVELLHNID--P 396

Query: 451 SLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
           + + ++P   NS+    D+              AA+R  +WKL+ G
Sbjct: 397 NFVDSSPCLGNSITPAKDDSSLPGYSAFNTSVHAAIRHGNWKLLTG 442



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 209 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 263

Query: 75  TYAALT 80
            YA + 
Sbjct: 264 HYAGMV 269


>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
          Length = 431

 Score =  286 bits (731), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 153/365 (41%), Positives = 216/365 (59%), Gaps = 23/365 (6%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 36  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 95  SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR 
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T+     +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTK-PLDGFN 376

Query: 427 QWSSL 431
            W ++
Sbjct: 377 MWKTI 381



 Score = 55.5 bits (132), Expect = 9e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 IYAGMV 271


>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
          Length = 492

 Score =  286 bits (731), Expect = 3e-74,   Method: Compositional matrix adjust.
 Identities = 163/408 (39%), Positives = 236/408 (57%), Gaps = 33/408 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+ TPSR+ L+TG+Y I TG+Q   I
Sbjct: 15  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTGRYQIRTGLQHQII 73

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 74  WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 133

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A+ LI + P +KPL
Sbjct: 134 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 191

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 192 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 246

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 247 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 297

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D 
Sbjct: 298 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID- 355

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
             + + ++P  RNS+    D+              AA+R  +WKL+ G
Sbjct: 356 -PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 402



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 169 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 223

Query: 75  TYAALT 80
            YA + 
Sbjct: 224 HYAGMV 229


>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
          Length = 573

 Score =  285 bits (730), Expect = 4e-74,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 272/540 (50%), Gaps = 72/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 61  GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+RRE  P  RGF++ FG L G   
Sbjct: 120 RPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSGD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q++      KP+FL
Sbjct: 180 YYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 235

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD+++  V  AL++ G 
Sbjct: 236 YIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNVTLALKKYGY 290

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSIII+ SDNG   +           GSN+P RG K T WEGG++    + SP ++  
Sbjct: 291 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 341

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V  +++HI+DW PTL T A G      + +DG D W ++   +  RR+  +D L    
Sbjct: 342 GSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEGRRSPRVDILHNID 397

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R++ WKL+ G        GY     S+ VP   F
Sbjct: 398 PIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQAF 446

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                  SN+   R   ++ +   G                +LF
Sbjct: 447 -----------------------SNMGPNRWHNERVSWSAGKT-------------VWLF 470

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           N+  DP E+ ++++  PD+  QL   L    +T VP  +   D  +++PK     W PW 
Sbjct: 471 NITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPKLNGGVWGPWF 529



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 196 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 250

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 251 KYFEHYRSINNINRRRYAAM 270


>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
          Length = 573

 Score =  285 bits (730), Expect = 5e-74,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 272/540 (50%), Gaps = 72/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 61  GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+RRE  P  RGF++ FG L G   
Sbjct: 120 RPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSGD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q++      KP+FL
Sbjct: 180 YYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 235

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD+++  V  AL++ G 
Sbjct: 236 YIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNVTLALKKYGY 290

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSIII+ SDNG   +           GSN+P RG K T WEGG++    + SP ++  
Sbjct: 291 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 341

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V  +++HI+DW PTL T A G      + +DG D W ++   +  RR+  +D L    
Sbjct: 342 GSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEGRRSPRVDILHNID 397

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R++ WKL+ G        GY     S+ VP   F
Sbjct: 398 PIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQAF 446

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                  SN+   R   ++ +   G                +LF
Sbjct: 447 -----------------------SNVGPNRWHNERVSWSAGKT-------------VWLF 470

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           N+  DP E+ ++++  PD+  QL   L    +T VP  +   D  +++PK     W PW 
Sbjct: 471 NITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPKLNGGVWGPWF 529



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 196 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 250

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 251 KYFEHYRSINNINRRRYAAM 270


>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
          Length = 520

 Score =  285 bits (728), Expect = 7e-74,   Method: Compositional matrix adjust.
 Identities = 161/424 (37%), Positives = 236/424 (55%), Gaps = 37/424 (8%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
            L L    GWND+ FHGS  + TP++DALA  G+ L+N Y QP+CTPSR+ L+TG+Y IH
Sbjct: 35  VLVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGRYQIH 93

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++F
Sbjct: 94  TGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYF 153

Query: 204 GYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           GYL G   Y+ H     I +   +R       D R     A +    Y+ ++F   A+ L
Sbjct: 154 GYLLGSEDYFSHEHCVFIKALNVTRCAL----DFRDGEEVATEYKNMYSANIFANRAISL 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR YA MV  +D++V
Sbjct: 210 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEAV 264

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K TLWEGG++ 
Sbjct: 265 GNVTAALESHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRG 315

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++Q    + +++HISDWLPTL   AGG T      +DG + W ++   +PS 
Sbjct: 316 VGFVASPLLKQKGVRTRELIHISDWLPTLVKLAGGSTQGTK-ALDGFNVWETISEGSPSP 374

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDE----------KKRTAAVRLDSWKLVLGTQ 488
           R   +  +D +   +  +P   N      D+              AA+R  +WKL+ G  
Sbjct: 375 RVELLHNIDPY--FVDPSPCPNNGTAPAKDDYSLLEYSTFNTSVHAAIRHRNWKLLTGFP 432

Query: 489 ENGT 492
             G+
Sbjct: 433 GCGS 436



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+ ++F   A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 197 YSANIFANRAISLIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRR 251

Query: 75  TYAALT 80
            YA + 
Sbjct: 252 LYAGMV 257


>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
          Length = 473

 Score =  284 bits (727), Expect = 9e-74,   Method: Compositional matrix adjust.
 Identities = 189/545 (34%), Positives = 274/545 (50%), Gaps = 87/545 (15%)

Query: 99  HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
           HGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y IH G+Q   I   +P  
Sbjct: 3   HGS-VIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNC 61

Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS 218
           VPL E+ LP+ L++ G ST  +GKWHLG +R+E  P  RGF+++FGYL G   YY H   
Sbjct: 62  VPLDEKLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH--- 118

Query: 219 DQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAH 274
           +  +    LNG     D+R     A +    Y+T++FTK A  LI + P +KPLFLYLA 
Sbjct: 119 EACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAF 178

Query: 275 LAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENS 334
            + H       L+ P+E +  + +I D +RR YA MV  LD++VG V  AL+ +G+  N+
Sbjct: 179 QSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNT 233

Query: 335 IIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS 394
           ++IF +DNG  T         R+ G+N+P RG K TLWEGG++    + SP ++Q    S
Sbjct: 234 VLIFSTDNGGQT---------RSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKS 284

Query: 395 LQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRR-----NSNIDGLDQ 448
            ++MHI+DWLPTL   AGG T    PL  DG D W ++   +PS R     N + D  D 
Sbjct: 285 RELMHITDWLPTLVNLAGGSTHGTKPL--DGFDVWETISEGSPSPRVELLLNIDPDFFDG 342

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT---AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
                 NT   +N           T   A +R  +WKL+ G    G       Q+  ++V
Sbjct: 343 LPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRYKNWKLLTGYPGCGYWFPPPSQSNISEV 402

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
           P              S+   ++ ++L                                  
Sbjct: 403 P--------------SVDSPTKTLWL---------------------------------- 414

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
               F++  DP E+++++   P I   L   L+Y+    VP S+  P   + DPK     
Sbjct: 415 ----FDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVP-SYFPPLDPRCDPKG-TGV 468

Query: 626 WSPWI 630
           WSPW+
Sbjct: 469 WSPWM 473



 Score = 56.6 bits (135), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 150 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 204

Query: 75  TYAALT 80
            YA + 
Sbjct: 205 IYAGMV 210


>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
 gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
 gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
          Length = 573

 Score =  284 bits (727), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 185/540 (34%), Positives = 272/540 (50%), Gaps = 76/540 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      KPLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W PW
Sbjct: 465 FNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      KPLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
          Length = 572

 Score =  284 bits (726), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 275/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++ +    +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P+     W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAAYNRTAIPVRYPAENP---RAHPEFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++ +    +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
          Length = 569

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDMWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
          Length = 414

 Score =  283 bits (725), Expect = 1e-73,   Method: Compositional matrix adjust.
 Identities = 157/380 (41%), Positives = 220/380 (57%), Gaps = 42/380 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 57  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A  LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           Q    + +++HISDWLPTL   AGG       +I+G                  +DG D 
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLAGG-------HING---------------TKPLDGFDV 377

Query: 449 WSSLLLNTPSRRNSVLINID 468
           W ++   +PS R  +L NID
Sbjct: 378 WKTISEGSPSPRVELLHNID 397



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
 gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
 gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
          Length = 569

 Score =  283 bits (725), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
          Length = 408

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 216/359 (60%), Gaps = 21/359 (5%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 58  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPREKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID 398



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
          Length = 413

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTI 380

Query: 432 LLNTPSRRNSNIDGLD 447
              +PS R   +  +D
Sbjct: 381 SEGSPSPRIELLHNID 396



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270


>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
 gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381

Query: 432 LLNTPSRRNSNIDGLD 447
              +PS R   +  +D
Sbjct: 382 SEGSPSPRIELLHNID 397



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
          Length = 415

 Score =  283 bits (724), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 41  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382

Query: 432 LLNTPSRRNSNIDGLD 447
              +PS R   +  +D
Sbjct: 383 SEGSPSPRIELLHNID 398



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
 gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
 gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
 gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
 gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
          Length = 569

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
          Length = 569

 Score =  283 bits (723), Expect = 2e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P+     W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPEFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
          Length = 569

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++ +    +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++ +    +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
          Length = 415

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 152/359 (42%), Positives = 216/359 (60%), Gaps = 21/359 (5%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 58  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 234

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           Q    + +++HISDWLPTL   A G T+     +DG D W ++   +PS R   +  +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID 398



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266

Query: 75  TYAALT 80
            YA + 
Sbjct: 267 HYAGMV 272


>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
 gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
          Length = 414

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 40  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 99  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381

Query: 432 LLNTPSRRNSNIDGLD 447
              +PS R   +  +D
Sbjct: 382 SEGSPSPRIELLHNID 397



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 570

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 276/542 (50%), Gaps = 77/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+  +TG+Y IHTG+Q   I
Sbjct: 59  GFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P  +  LP+ L+ELGYST  +GKWHLGF+++E  P  RGF+++FG L G ++
Sbjct: 118 RPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D       L G D+    S AW   G+Y+T L+T+   +++    P  +PLF+
Sbjct: 178 YYTYDGCD----GAGLCGFDLHEGESVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFI 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H       L+ P E I  +  + +  RR YAAMV  +D++V  +  AL++ G 
Sbjct: 234 FLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSAVDEAVRNITYALRKYGY 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NS+IIF +DNG   +         + GSN+P RG K T WEGG++    + SP +++ 
Sbjct: 289 YQNSVIIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGIRGLGFVHSPLLRKK 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGGD S L   +DG + W ++     S R   +  +D   
Sbjct: 340 RRVSKALVHITDWYPTLVGLAGGDES-LTEGVDGYNVWEAISEGKESPRLEILHNIDP-- 396

Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
              L   +R  S+     I      A++R   WKL+ G       D  YG     ++ L 
Sbjct: 397 ---LYNHARTGSLQKGYGIWNTAIQASIRAGDWKLLTG-------DPGYGDWIPPQI-LP 445

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
            F                     P S   + R TR + T+                   +
Sbjct: 446 GF---------------------PSSWWKQERHTRPRKTV-------------------W 465

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+  DP E+ +++  RPD+  QL   L Y+ RT VP  +   D  +ADP      W P
Sbjct: 466 LFNISGDPYERYDLSEQRPDVVKQLLARLVYYNRTAVPVRYPSED-PRADPYLNGGAWVP 524

Query: 629 WI 630
           W+
Sbjct: 525 WV 526



 Score = 43.1 bits (100), Expect = 0.45,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 6/79 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AW   G+Y+T L+T+   +++    P  +PLF++L+  A H       L+ P E I  
Sbjct: 199 SVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYP 253

Query: 65  FQYITDPNRRTYAALTKST 83
           +  + +  RR YAA+  + 
Sbjct: 254 YLGLENVARRKYAAMVSAV 272


>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
 gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
 gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
          Length = 413

 Score =  283 bits (723), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           A  ++   L  L+    GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ 
Sbjct: 39  AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P
Sbjct: 98  LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157

Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
             RGF+++FGYL G   YY H    L D  +  V     D R     A      Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           TK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D++VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGGV+    + SP ++Q    + +++HISDWLPTL   A G T+     +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380

Query: 432 LLNTPSRRNSNIDGLD 447
              +PS R   +  +D
Sbjct: 381 SEGSPSPRIELLHNID 396



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264

Query: 75  TYAALT 80
            YA + 
Sbjct: 265 HYAGMV 270


>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
          Length = 520

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 183/562 (32%), Positives = 281/562 (50%), Gaps = 72/562 (12%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           T AA        +   +GWND+ +H S EI TPN+D L+  G+ L N Y QP+C+PSR  
Sbjct: 23  TLAAPRPHIVFIMADDFGWNDVGYHNS-EIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQ 81

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           LMTG+Y IHTGMQ   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG ++++  P
Sbjct: 82  LMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLP 141

Query: 195 LYRGFESHFGYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
            +RGF+S+FGY  G   YY H     I +   +R       D+R     A    G Y+T+
Sbjct: 142 THRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCAL----DLRDGEEVATAYKGAYSTE 197

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           LF++ AV +I      +PLFLY+A  A H     + L+ P+  +  + +I D +RR YA 
Sbjct: 198 LFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVSRRKYAG 252

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV  +D++VG +  AL++ G+ +N++++F +DNG  T         R  GSN+P RG K 
Sbjct: 253 MVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQT---------RAGGSNWPLRGRKW 303

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
            LWEGG++    + SP ++    V+ ++MHISDWLPTL   AGG TS             
Sbjct: 304 ALWEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGSTS------------- 350

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
                        +DG + W+++     S R  +L NID      +          G Q+
Sbjct: 351 ---------GTKPLDGFNMWNTISKGFASPRLELLHNIDTLYEDTSP-------CPGRQQ 394

Query: 490 NGTM-DGYYGQTRSNKVPLLNFNAIVESKT-YQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
             T+  G+ G+T +      +FN  + +   Y + + L+     P        +T + ++
Sbjct: 395 QLTLAQGFRGETWARS----SFNVSIHAAIRYLNWKLLTGYPGCPTWFPRPEHNTSESSS 450

Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
                   P+ P         LF++  DP E+N +++  P +   L   L+ H +  +P 
Sbjct: 451 TSVD----PLKP-------VMLFDIQKDPEERNEVSAQFPAVVDYLLTRLQSHLKNSLPI 499

Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
           +    D  + DP   +  W PW
Sbjct: 500 NFPDDD-PRCDPGP-SGAWGPW 519



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 40/70 (57%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y+T+LF++ AV +I      +PLFLY+A  A H     + L+ P+  +  + +I D +
Sbjct: 192 GAYSTELFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVS 246

Query: 73  RRTYAALTKS 82
           RR YA +  +
Sbjct: 247 RRKYAGMVSA 256


>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
          Length = 569

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 272/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP  L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
 gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
          Length = 470

 Score =  282 bits (722), Expect = 3e-73,   Method: Compositional matrix adjust.
 Identities = 178/540 (32%), Positives = 266/540 (49%), Gaps = 80/540 (14%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGW+D+ +H ++ I TPN+D LA  G+ L N Y QPVC+PSR  LMTG+Y IH G+Q   
Sbjct: 11  YGWDDIGYH-NHFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRYQIHYGLQHGV 69

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    P G+PL E  LP+ L++ GY T  +GKWHLGF ++EYTPLYRGF+  +G+L G  
Sbjct: 70  IRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFDKFYGFLTGSE 129

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y+ H       R   + G D+R       D  G Y+T LF ++A  +I     ++P+FL
Sbjct: 130 DYWTH------RRYKGVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHDQNQPMFL 183

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YL   A H       L+ P++ + ++ +I     R YAA V  +D++VG V +AL+  G+
Sbjct: 184 YLPFQAVHGP-----LQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTNALKNSGL 238

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            EN+++IF +DNGA           R+ GSN+P RG KNTLWEGGV+    + S  +++ 
Sbjct: 239 WENTVLIFSTDNGA----------RRHTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLKRK 288

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            + S  ++HISDW PTL   + G T+                          +DG D W 
Sbjct: 289 GKTSDALIHISDWFPTLLRISDGSTTG----------------------TKPLDGFDVWD 326

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           ++     S R  +L NID                           +Y     +  PL ++
Sbjct: 327 AISKGKTSPRKELLHNIDP--------------------------FYDDQTGSYSPL-SY 359

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           + I  + +Y +L+     +          R  R+        +P P       N   +LF
Sbjct: 360 DNIFNTSSYSALRSGDWKLLTGYQGAGWNR--RRPYNHEFEESPDP------PNKRLWLF 411

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           N+  DP E+N+++   PDI   L E L  + +T VP     PD  + +P+   D + PW+
Sbjct: 412 NVREDPGERNDLSKMHPDIVQDLLEKLAVYNKTAVPIFWPDPD-PRMNPQLHGDVFGPWL 470



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 5/69 (7%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF ++A  +I     ++P+FLYL   A H       L+ P++ + ++ +I 
Sbjct: 154 DENGTYSTHLFARKATDMILKHDQNQPMFLYLPFQAVHGP-----LQVPEKYLQEYMHIN 208

Query: 70  DPNRRTYAA 78
               R YAA
Sbjct: 209 FTVDRIYAA 217


>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
 gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
 gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
          Length = 569

 Score =  282 bits (721), Expect = 4e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
          Length = 572

 Score =  282 bits (721), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 441

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 442 LAAFPGS----------------WWNLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
          Length = 358

 Score =  281 bits (720), Expect = 5e-73,   Method: Compositional matrix adjust.
 Identities = 150/341 (43%), Positives = 205/341 (60%), Gaps = 17/341 (4%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TP++D LA  G+ L+N Y QP+CTPSR  LMTG+Y I TG+Q   I
Sbjct: 34  GWNDVGFHGS-EIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTGRYQIRTGLQHQII 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ LRE GY T  +GKWHLG F+++  P +RGF+S FGYL G   
Sbjct: 93  WPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGYLTGSED 152

Query: 212 YYDHI-LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   S      V     D+R   + A +  G Y+T+L T+ A  +I     D+PLFL
Sbjct: 153 YYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATHIITQHTPDQPLFL 212

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A  A HA      L+ P   I  + +I DP+RR YA MV  +D++VG +   LQ  G+
Sbjct: 213 YVALQAVHAP-----LQVPDHYIAPYSFIQDPHRRRYAGMVSAMDEAVGNITHTLQETGL 267

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +N+++IF +DNG  T+           G+N+P RG K TLWEGGV+  A +  P I++ 
Sbjct: 268 WDNTVLIFSTDNGGQTLYG---------GNNWPLRGRKWTLWEGGVRGVAFVSGPLIEKP 318

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
             V+ +++HISDWLPT+   AG  T+     +DG D W S+
Sbjct: 319 GAVNRELIHISDWLPTIAGLAGASTNGTK-PLDGFDVWQSI 358



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R   + A +  G Y+T+L T+ A  +I     D+PLFLY+A  A HA      L+ P  
Sbjct: 174 LRDGDAVALNYSGRYSTELLTERATHIITQHTPDQPLFLYVALQAVHAP-----LQVPDH 228

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            I  + +I DP+RR YA +  +
Sbjct: 229 YIAPYSFIQDPHRRRYAGMVSA 250


>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
          Length = 564

 Score =  281 bits (720), Expect = 6e-73,   Method: Compositional matrix adjust.
 Identities = 182/541 (33%), Positives = 272/541 (50%), Gaps = 74/541 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TG+Y IHTG+Q   I
Sbjct: 54  GFRDVGYHGS-EIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQIHTGLQHSVI 112

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 113 RPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDTFFGSLLGSGD 172

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q++      KP+FL
Sbjct: 173 YYTHYKCD----SPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 228

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP     +++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 229 YIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINNVTLALKRYGY 283

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NS+II+ SDNG  PT            G+N+P RG K T WEGG++    + SP ++ 
Sbjct: 284 YNNSVIIYSSDNGGQPTAG----------GNNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 333

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL T A G      L +DG D W ++   + SRR+  +D L   
Sbjct: 334 KGCVCKELIHITDWFPTLITLAEGQIDEDTL-LDGYDVWETI---SESRRSPRMDILHNI 389

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R++ WKL+ G        GY     S+ VP  +
Sbjct: 390 DPIYTKAKNGSWAAGFGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQS 438

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ +   G                +L
Sbjct: 439 F-----------------------SNVGPSRWHNERVSWTAGKT-------------VWL 462

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  PD+  QL   L    +T VP  +   D  +++P      W PW
Sbjct: 463 FNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPNFNGGVWGPW 521

Query: 630 I 630
            
Sbjct: 522 F 522



 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 189 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 243

Query: 60  ETINQFQYITDPNRRTYAAL 79
               +++ I + NRR YAA+
Sbjct: 244 MYYERYRSINNINRRRYAAM 263


>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
          Length = 584

 Score =  281 bits (719), Expect = 7e-73,   Method: Compositional matrix adjust.
 Identities = 182/543 (33%), Positives = 280/543 (51%), Gaps = 79/543 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 65  GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 123

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E+GYST  +GKWHLGF+++   P  RGF++  G L G + 
Sbjct: 124 RPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNVD 183

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+    + AW+  G+Y+T L+ + A Q++      +PLFLY
Sbjct: 184 YYTYDNCD----GPGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLY 239

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 240 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGYY 294

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+IIF +DNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 295 DNSVIIFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLKRKR 345

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL T AGG  S     +DG + W ++     S R   +  +D    
Sbjct: 346 RTSRALVHITDWYPTLVTLAGGSVSEAE-GLDGYNVWPAISEGQVSPRMEILHNIDP--- 401

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A++R+  WKL+ G  + G  D    QT      L N
Sbjct: 402 --LYNHAKHGSLEGGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LAN 451

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F       ++ +L++L+               TRQ                       +L
Sbjct: 452 FPG-----SWWNLERLADG-------------TRQSV---------------------WL 472

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A+ RPD+   L   L Y+ RT +P  +  E P   +A P      W 
Sbjct: 473 FNISADPYEREDLAAQRPDVVRSLLARLVYYNRTAIPVRYPAENP---RAHPDFNGGAWG 529

Query: 628 PWI 630
           PW+
Sbjct: 530 PWV 532



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW+  G+Y+T L+ + A Q++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 205 NVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLYVAFQAVHT-----PLQSPREYLYRY 259

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 260 RTMGNVARRKYAAMV 274


>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
 gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
 gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
 gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
 gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
 gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
          Length = 573

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 184/540 (34%), Positives = 271/540 (50%), Gaps = 76/540 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++       PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG TS     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W PW
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523



 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++       PLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
 gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
          Length = 574

 Score =  281 bits (718), Expect = 9e-73,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+ELGYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 180 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 236 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 291 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 342 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 398 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 444

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 445 LAAFPGS----------------WWNLERMASARQA---------------------VWL 467

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 468 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 524

Query: 628 PW 629
           PW
Sbjct: 525 PW 526



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 201 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 255

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 256 RTMGNVARRKYAAMV 270


>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
 gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
          Length = 575

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 180 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  + SAL+R G  
Sbjct: 236 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 291 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 342 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 398 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 444

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 445 LAAFPGS----------------WWNLERMASARQA---------------------VWL 467

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 468 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 524

Query: 628 PW 629
           PW
Sbjct: 525 PW 526



 Score = 45.8 bits (107), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 201 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 255

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 256 RTMGNVARRKYAAMV 270


>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
          Length = 573

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 183/542 (33%), Positives = 271/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L++LGY+T  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 119 RPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ +   +++      +PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLY 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGALKRYGFY 289

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 290 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRTR 340

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 341 RTSRALLHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 396

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 397 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 443

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 444 LAAFPGSWW----------------NLERMASARQA---------------------VWL 466

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 467 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 523

Query: 628 PW 629
           PW
Sbjct: 524 PW 525



 Score = 47.0 bits (110), Expect = 0.035,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ +   +++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 200 SVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 254

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 255 RGMGNVARRKYAAMV 269


>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
          Length = 573

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 185/542 (34%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ +++ +RR YAAMV  +D++V  +  AL+R G  
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITWALKRYGFY 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NSIIIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 289 NNSIIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRKR 339

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   A G  S     +DG D W ++     S R   +  +D    
Sbjct: 340 RTSHALVHITDWYPTLVGLARGTVSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 395

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 396 --LYNHARHGSLEGGFGIWNTAIQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 443 LAAFPGSWW----------------NLERMASARQA---------------------VWL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 522

Query: 628 PW 629
           PW
Sbjct: 523 PW 524



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 253

Query: 66  QYITDPNRRTYAALT 80
           + +++ +RR YAA+ 
Sbjct: 254 RTMSNVSRRKYAAMV 268


>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
          Length = 573

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  + SAL+R G  
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522

Query: 628 PW 629
           PW
Sbjct: 523 PW 524



 Score = 45.8 bits (107), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 199 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 254 RTMGNVARRKYAAMV 268


>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
          Length = 600

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 188/580 (32%), Positives = 280/580 (48%), Gaps = 74/580 (12%)

Query: 52  GKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDA 111
           G+ LE  +E   + Q    P   T A         L    G+ D+ +HGS EI TP +D 
Sbjct: 47  GQGLEEEEEATFRAQSGERPEPSTAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDR 105

Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
           LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I   +P  +PL    LP+ L+
Sbjct: 106 LAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLK 165

Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
           E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   YY H   D    +  + G+D
Sbjct: 166 EVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYD 221

Query: 232 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 276

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEY 349
                ++ I + NRR YAAM+  LD+++  V  AL+  G   NSII++ SDNG  PT   
Sbjct: 277 RYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIVYSSDNGGQPTAG- 335

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                    GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL +
Sbjct: 336 ---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLIS 386

Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
            A G      + +DG D W ++   +   R+  +D L     +     +   +    I  
Sbjct: 387 LAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWN 442

Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
               +A+R+  WKL+ G        GY     S+ VP  +F                   
Sbjct: 443 TAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF------------------- 472

Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
               SN+   R   ++ T+  G +              +LFN+  DP E+ +++   P I
Sbjct: 473 ----SNLGPNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSHRYPGI 515

Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
             QL   L    +T VP  +   D  +++P+     W PW
Sbjct: 516 VKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
          Length = 565

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 52  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 110

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+ELGYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 111 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 170

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 171 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 226

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 227 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 281

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 282 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 332

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 333 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 388

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 389 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 435

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 436 LAAFPGS----------------WWNLERMASARQA---------------------VWL 458

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 459 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 515

Query: 628 PW 629
           PW
Sbjct: 516 PW 517



 Score = 44.7 bits (104), Expect = 0.14,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 192 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 246

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 247 RTMGNVARRKYAAMV 261


>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
          Length = 601

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 186/557 (33%), Positives = 275/557 (49%), Gaps = 76/557 (13%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           AA T    L  ++    G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+ 
Sbjct: 72  AAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
            +TGKY IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTK 253
             RGF++ FG L G   YY H   D    +  + G+D+  N S AWD   G Y+T ++T+
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDSAAWDHDNGIYSTQMYTQ 246

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
              Q++      KP+FLY+A+ A H+      L+AP      ++ I + NRR YAAM+  
Sbjct: 247 RVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSC 301

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           LD++V  V  AL+  G   NSI+I+ SDNG  PT            GSN+P RG K T W
Sbjct: 302 LDEAVSNVTLALKTYGFYNNSILIYSSDNGGQPTAG----------GSNWPLRGSKGTYW 351

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG++    + SP ++    V  +++HI+DW PTL + A G      + +DG D W ++ 
Sbjct: 352 EGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI- 409

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
             +   R+  +D L     +     +   +    I      +A+R+  WKL+ G      
Sbjct: 410 --SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN----- 462

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
             GY     S+ VP  +F                       SN+   R   ++ T+  G 
Sbjct: 463 -PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGK 493

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
           +              +LFN+  DP E+ +++S  P I  QL   L    +T VP  +   
Sbjct: 494 S-------------IWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYPPK 540

Query: 613 DLVQADPKRFNDTWSPW 629
           D  +++P+     W PW
Sbjct: 541 D-PRSNPRLNGGVWGPW 556



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N S AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 224 LYENDSAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 278

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 279 RYFEHYRSIININRRRYAAM 298


>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
          Length = 484

 Score =  281 bits (718), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 174/538 (32%), Positives = 267/538 (49%), Gaps = 96/538 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++ + EI TPN+D LA NG+ILN  YA   C+PSR +L+TG+YP   G+Q   +
Sbjct: 39  GWNDVGWN-NPEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRYPFKLGLQHGVV 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL    LP+ L+ LGYST AIGKWHLGF R EYTP +RGF+S +G+ +G   
Sbjct: 98  KKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFDSFYGFYSGSED 157

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY         +T  + G+D R N       + +Y+T  + + AV++I+     +PLFLY
Sbjct: 158 YYKR-------KTAAIRGYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQAHKRTEPLFLY 210

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +   A H+      L+ P+    +++ I D NRR ++ MV  +D ++G  I AL+++ ML
Sbjct: 211 MPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMTIKALKQRNML 265

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQN 390
            N++I+F+SDNG               G+N+P RG K TLWEGG +VP+ +W    +Q+ 
Sbjct: 266 TNTMIVFLSDNGGTPFFG---------GNNWPLRGSKATLWEGGTRVPSFVWGKMLLQKA 316

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              S +M+H  DW PTL + AGG   +   ++DG+DQW  L     S+R           
Sbjct: 317 GYTSNEMIHAVDWFPTLVSLAGGTPEK---DVDGVDQWQML-----SQRQ---------- 358

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
                 PSRR   +  I+     AA+R   +KL+ GT   G   G+Y             
Sbjct: 359 ------PSRRKEFVYGINSITGRAAIRRGKYKLIEGTA--GHYSGWYPA----------- 399

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                P S + KM+  +Q    H  +                L+
Sbjct: 400 ---------------------PSSRVVKMKRMKQLNKRHVNS--------------TLLY 424

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           N+  DP E N++++  P ++  +   L  ++   V   + + +   + PK +   WSP
Sbjct: 425 NIQVDPIESNDLSAKYPHLTQFMLRRLNRYKAQEVEAFNPRKERY-SQPKFWGGNWSP 481


>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
          Length = 598

 Score =  280 bits (717), Expect = 1e-72,   Method: Compositional matrix adjust.
 Identities = 187/580 (32%), Positives = 282/580 (48%), Gaps = 74/580 (12%)

Query: 52  GKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDA 111
           G+ LE  +E   + Q    P   T +         L    G+ D+ +HGS EI TP +D 
Sbjct: 47  GQGLEDEEEAAFRAQAGERPEPSTASTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDK 105

Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
           LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I   +P  +PL    LP+ L+
Sbjct: 106 LAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLK 165

Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
           E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   YY H   D    +  + G+D
Sbjct: 166 EVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYD 221

Query: 232 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPS 276

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEY 349
                ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG  PT   
Sbjct: 277 RYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG- 335

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                    GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL +
Sbjct: 336 ---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLIS 386

Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
            A G      + +DG D W ++   +   R+  +D L     +     +   +    I  
Sbjct: 387 LAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSLAAGYGIWN 442

Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
               +A+R+  WKL+ G        GY     S+ VP  +F                   
Sbjct: 443 TAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF------------------- 472

Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
               SN+   R   ++ T+  G +              +LFN+  DP E+ ++++  P I
Sbjct: 473 ----SNLGPNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSNRYPGI 515

Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
             +L   L  + +T VP  +   D  +++P+     W PW
Sbjct: 516 VKKLLRRLSQYNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPS 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIVNINRRRYAAM 296


>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
          Length = 601

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 182/540 (33%), Positives = 270/540 (50%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 89  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 147

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 148 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 207

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD+  G Y+T ++T+   Q++      KPLFL
Sbjct: 208 YYTHYKCD----SPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFL 263

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 264 YLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 318

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 319 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 368

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + IDG D W ++   +   R+  +D L   
Sbjct: 369 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQIDGYDIWETI---SEGLRSPRVDILHNI 424

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 425 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 473

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 474 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 497

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 498 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 556



 Score = 52.0 bits (123), Expect = 9e-04,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD+  G Y+T ++T+   Q++      KPLFLYLA+ A H+      L+AP 
Sbjct: 224 LYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFLYLAYQAVHS-----PLQAPG 278

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 279 RYFEHYRSIININRRRYAAM 298


>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
           melanoleuca]
          Length = 573

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522

Query: 628 PW 629
           PW
Sbjct: 523 PW 524



 Score = 47.4 bits (111), Expect = 0.027,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 254 RAMGNVARRKYAAMV 268


>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
          Length = 571

 Score =  280 bits (716), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 272/542 (50%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG  S     +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGSASAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASVRQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RTMGNVARRKYAAMV 267


>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
          Length = 570

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 56  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 114

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 115 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 174

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 175 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 230

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 231 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 285

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 286 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 336

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 337 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 392

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 393 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 439

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 440 LAAFPGS----------------WWNLERMASARQA---------------------VWL 462

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 463 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 519

Query: 628 PW 629
           PW
Sbjct: 520 PW 521



 Score = 47.0 bits (110), Expect = 0.028,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 196 SVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 250

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 251 RTMGNVARRKYAAMV 265


>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 568

 Score =  280 bits (715), Expect = 2e-72,   Method: Compositional matrix adjust.
 Identities = 178/557 (31%), Positives = 265/557 (47%), Gaps = 107/557 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H S EI +P +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 55  GFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGRYQIHTGLQHSII 113

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF ++FG L G + 
Sbjct: 114 RPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGFHTYFGSLTGSVD 173

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D  S    L G D+    S AW   G+Y+T L+T+   +++    P  +PLF+
Sbjct: 174 YYTYGSCDGKS----LCGFDLHEGESVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFI 229

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H       L+ P+E I  ++ I +  RR YAAMV  +D++V  +  AL++ G 
Sbjct: 230 FLSLQAVHTP-----LKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEAVRNITYALRKYGF 284

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NS++IF +DNGA  +           GSN+P RG K T WEGG++    + SP I+  
Sbjct: 285 YRNSVVIFSTDNGAQPL---------TGGSNWPLRGCKGTYWEGGIRGVGFVHSPLIRHR 335

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            R+S  ++HI+DW PTL   AGG+ S                      ++  +DG D W 
Sbjct: 336 RRISRDLIHITDWYPTLVGLAGGNVS----------------------QHQGLDGFDVWP 373

Query: 451 SLLLNTPSRRNSVLINIDEKKR-----------------TAAVRLDSWKLVLGTQENGTM 493
           ++     S R  +L NID   R                  AA+R+  WKL+ G   NG  
Sbjct: 374 TISEGKESPRLEILHNIDPLNRRSRGSLKDGYGLWDTTVQAAIRVGDWKLLTGDPGNGD- 432

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
                                               ++P   +    S+        G  
Sbjct: 433 ------------------------------------WVPPHVLTHFPSSWWHLERDIGEK 456

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
              +     T  PC          E++++A  RPD+  +L   L +H RT +P  +  PD
Sbjct: 457 RKSVWLFNVTGDPC----------ERHDLAVHRPDVVRELLARLAFHNRTAIPVRYP-PD 505

Query: 614 LVQADPKRFNDTWSPWI 630
             +A+P      W PW+
Sbjct: 506 DTRANPSANGGAWRPWV 522



 Score = 45.8 bits (107), Expect = 0.069,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 6/76 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AW   G+Y+T L+T+   +++    P  +PLF++L+  A H       L+ P+E I  
Sbjct: 195 SVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFIFLSLQAVHTP-----LKPPKEYIYP 249

Query: 65  FQYITDPNRRTYAALT 80
           ++ I +  RR YAA+ 
Sbjct: 250 YRNIGNVPRRKYAAMV 265


>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 498

 Score =  279 bits (714), Expect = 3e-72,   Method: Compositional matrix adjust.
 Identities = 172/538 (31%), Positives = 270/538 (50%), Gaps = 87/538 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++  + I TP +D LA  G+ILN  Y  P+CTP RASLM+G Y    G+Q   +
Sbjct: 44  GWNDIGYNNPS-IFTPTLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVL 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP G+PL    +P+ ++E GY+T  +GKWHLGF + EYTP YRGF+  +G+ N    
Sbjct: 103 DHAEPAGLPLNFTLIPQRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAED 162

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H  S            D+R      W   G Y+T ++ ++A + I       P+++Y
Sbjct: 163 YFNHTTSKYL---------DLRNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L   + H       +EAPQ+ ++ + +I D NRR  +AMV  +DD+V  VI+AL+R  + 
Sbjct: 214 LPFQSVHG-----VIEAPQKYLDMYTFIHDTNRRIKSAMVTAMDDAVKVVIAALERYELW 268

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N++++F +DNG P              +N+P RG K TLWEGG +    +    +++  
Sbjct: 269 DNTLLVFTTDNGGPANPNNAG-------NNWPLRGSKLTLWEGGTRGVGFVHGKMLEKKG 321

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            V+ +MMH++DW PTL   AGG                        + +S++DGL+ W +
Sbjct: 322 YVNNEMMHVTDWYPTLLHIAGG------------------------KADSDMDGLNVWDT 357

Query: 452 LLLNTPSRRNSVLINIDEKKRT-AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
           L    PS R   + NIDE + T AA+R+  +KL++G  + G  DG+        +P    
Sbjct: 358 LNKGYPSPRKEFIYNIDEIEPTGAAIRVGDYKLIVG--KAGHPDGW--------IPAPTI 407

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           + + E               LP++  + +   R                    +   YLF
Sbjct: 408 DGVWEE-------------ILPMNEYEGLLDGRLSK-----------------DNDTYLF 437

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           N+  DP E+NN+A   P+  ++L   L+  ++ LVP     PD   +DP++F   WSP
Sbjct: 438 NIKADPTERNNLAEQMPEKVAELSARLEELKQKLVPAIRPAPDPKGSDPRKFGGAWSP 495



 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 5/90 (5%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R      W   G Y+T ++ ++A + I       P+++YL   + H       +EAPQ+
Sbjct: 174 LRNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHG-----VIEAPQK 228

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIV 90
            ++ + +I D NRR  +A+  +    + +V
Sbjct: 229 YLDMYTFIHDTNRRIKSAMVTAMDDAVKVV 258


>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 562

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 174/546 (31%), Positives = 271/546 (49%), Gaps = 75/546 (13%)

Query: 91  YGWNDLSFHGS---NEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           +G+ND+ +H     +++ TP +D+LA  G+IL N Y QP+C+P+R  L+TG+Y IHTG+ 
Sbjct: 44  FGFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQLLTGRYQIHTGLA 103

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              I  A+P G+PL    L + LR+ GY T  +GKWHLGFFR EY P  RGF++ FG+LN
Sbjct: 104 HGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPWNRGFQNFFGFLN 163

Query: 208 GVISYYDHILSDQYSRTVELNGHDM---RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           G ++++     +   +T    G+DM   R   + A  T GEY+T+LF +++ ++I+    
Sbjct: 164 GGVNHFTRYHCEP-KKTRRFCGYDMIDSRYGPTNA--TYGEYSTNLFIRKSKEMIDKHNK 220

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
            KP+FLYL+  A H       L+ P + + +F++I D NRR YA MV  +D  +  ++  
Sbjct: 221 QKPMFLYLSLQAVHGP-----LQVPNQYLKRFKHIRDKNRRIYAGMVYAMDRGIRQLVKH 275

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L+R  M +N+I IF +DNG  T            G+N+P RG K TLWEGG++    +  
Sbjct: 276 LKRARMWKNTIFIFSTDNGGQTTR---------GGNNWPLRGKKGTLWEGGIRGVGFVHG 326

Query: 385 PQIQ-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
             +Q   PRV+ +++H+SDW PT+ +A           IDG DQW +L  N  S+R   +
Sbjct: 327 KPLQVTTPRVNKELLHVSDWYPTIMSATHCPYVVGTPPIDGYDQWETLRSNKTSKRTEIL 386

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
             +D     L  +  R        D   R AA+R   WKL         + GY G     
Sbjct: 387 HNIDPLHKKLTESEFRE-----GFDVSVR-AAIRSGDWKL---------LTGYAGMNEWV 431

Query: 504 KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT 563
             P      I       ++ Q+  +  +  ++   +R                       
Sbjct: 432 VPPECTHGNI-------TIGQIRDDFHVYPNSTKNVR----------------------- 461

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
                LF +  DP E+  ++   P++ + L   LK +  + +P  +   D  ++DPKR  
Sbjct: 462 -----LFKITEDPHEEREVSDQYPEVVNLLLARLKEYNTSSLPVDYPHND-KKSDPKRHG 515

Query: 624 DTWSPW 629
             W PW
Sbjct: 516 GFWLPW 521



 Score = 57.8 bits (138), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 26/70 (37%), Positives = 43/70 (61%), Gaps = 5/70 (7%)

Query: 11  TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
           T GEY+T+LF +++ ++I+     KP+FLYL+  A H       L+ P + + +F++I D
Sbjct: 198 TYGEYSTNLFIRKSKEMIDKHNKQKPMFLYLSLQAVHGP-----LQVPNQYLKRFKHIRD 252

Query: 71  PNRRTYAALT 80
            NRR YA + 
Sbjct: 253 KNRRIYAGMV 262


>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
          Length = 607

 Score =  279 bits (713), Expect = 4e-72,   Method: Compositional matrix adjust.
 Identities = 178/539 (33%), Positives = 268/539 (49%), Gaps = 72/539 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 95  GFRDVGYHGS-EIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 153

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 154 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 213

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q++      KP+FL
Sbjct: 214 YYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFL 269

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 270 YIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKRYGF 324

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSIII+ SDNG   +           GSN+P RG K T WEGG++    + SP ++  
Sbjct: 325 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 375

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             +  +++HI+DW PTL T A G        +DG D W ++   +  RR+  +D L    
Sbjct: 376 GTICKELVHITDWYPTLITLAEGWIDE-DTQLDGYDIWETI---SEGRRSPRVDILHNID 431

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R+  WKL+ G    G  D    Q+ SN  P    
Sbjct: 432 PIYTKAKNGSWAAGFGIWNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNAGPNRWH 489

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           N  +   T +S+                                             +LF
Sbjct: 490 NERISWSTGKSV---------------------------------------------WLF 504

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           N+  DP E+ +++S  P I  +L + L    +T VP  +   D  +++P+     W PW
Sbjct: 505 NITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 562



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 230 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 284

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 285 KYFEHYRSIININRRRYAAM 304


>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
          Length = 556

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 188/572 (32%), Positives = 281/572 (49%), Gaps = 94/572 (16%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P R + A         L    GWNDL FHGS  I TP++DALA  G++L+N Y QP+CTP
Sbjct: 67  PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 125

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+ L+TG+Y IH G+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+
Sbjct: 126 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 185

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
           E  P  RGF+++FGYL G   YY H   +  +    LNG     D+R     A +    Y
Sbjct: 186 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 242

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T++FTK A  +I + P +K              +    L+ P+E +  + +I D +RR 
Sbjct: 243 STNIFTKRATTVIANHPPEK--------------SVHDPLQVPEEYMEPYGFIQDKHRRI 288

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T         R+ G+N+P RG
Sbjct: 289 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 339

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   AGG T+          
Sbjct: 340 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 390

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                           +DG + W ++    PS R  +L NID+         D +  +  
Sbjct: 391 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 428

Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQN-----IFLPISNIDK 538
             +N T        + +  PL +  FN  + +   Y++ + L+ +      F P S    
Sbjct: 429 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 479

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
                        +N + + P        +LF++  DP E+++++   P I   L   L+
Sbjct: 480 -------------SNVSEIPPVDPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 526

Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           Y+    VP SH  P   + DPK     WSPW+
Sbjct: 527 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 556


>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
          Length = 503

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 152/403 (37%), Positives = 236/403 (58%), Gaps = 29/403 (7%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +G++D+ +HGS EI TP +DAL+ +G+ L N Y QP+CTP+R+ LM+G+Y IHTG+Q   
Sbjct: 44  FGFHDVGYHGS-EIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGRYQIHTGLQHGI 102

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +P  +P     L + L+E GY+T  +GKWHLGF+++EY P  RGF+++FGYLN   
Sbjct: 103 INSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGYLNAAE 162

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y++H +  +  R +     D+R N     +  G+Y+  LFT +A+ +++     KPLFL
Sbjct: 163 DYFNHNVPWRQVRYL-----DLRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFL 217

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA+ + HA      LE P++  ++++ ITD NRRT+A MV  LD+ V  +  AL+ KG+
Sbjct: 218 YLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGL 272

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N+++IF +DNG               G+NYP RG K +LWEGG      +    ++++
Sbjct: 273 WNNTVLIFSTDNGG---------QIHAGGNNYPLRGWKASLWEGGFHGVGFVSGGALKRS 323

Query: 391 PRVSLQMMHISDWLPTLYTAAGGD-TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             VS  ++H+SDW PTL T AGG+     PL  DG +QW ++   TPS R   +  +D  
Sbjct: 324 GAVSKGLIHVSDWFPTLVTLAGGNLNGTKPL--DGFNQWDTISNETPSPREILLHNID-- 379

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
               +  P +   +  N  + +  AA+R+  +KL+ G   NG+
Sbjct: 380 ----ILYPQKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNGS 418



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 29/82 (35%), Positives = 49/82 (59%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N     +  G+Y+  LFT +A+ +++     KPLFLYLA+ + HA      LE P++
Sbjct: 179 LRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFLYLAYQSVHA-----PLEVPEK 233

Query: 61  TINQFQYITDPNRRTYAALTKS 82
             ++++ ITD NRRT+A +  +
Sbjct: 234 YEHKYRNITDKNRRTFAGMVSA 255



 Score = 44.3 bits (103), Expect = 0.18,   Method: Compositional matrix adjust.
 Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 1/63 (1%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LFN+  DP E N+++S +P    +L ++L     T VP  +  PD  + DP    D W
Sbjct: 441 VWLFNITADPNEHNDLSSEKPLEVLRLLQILVQFNNTAVPPRYPAPD-PRCDPALHGDVW 499

Query: 627 SPW 629
            PW
Sbjct: 500 GPW 502


>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 500

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 179/557 (32%), Positives = 282/557 (50%), Gaps = 89/557 (15%)

Query: 80  TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           TK   + +LI   G+ND+ +HGS EI TPNID LA  G+ L N Y QP+CTP+R+ L++G
Sbjct: 25  TKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSG 83

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +Y IHTG+Q   I  A+P  +P       + LRE GY+T A+GKWHLGF+++E  P  RG
Sbjct: 84  RYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRG 143

Query: 199 FESHFGYLNGVISYYDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           F+S+FGYL G   Y+ H  +       +    G D+  N   AW+ VG Y+  +FT++  
Sbjct: 144 FDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNEKVAWEYVGNYSAFVFTEKIQ 203

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
           Q++   PV++P  LYL   + H+      L+ P     +++ I + NRR YA M+  LD+
Sbjct: 204 QIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERYKNIKNTNRRIYAGMMTCLDE 258

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG ++ +LQ+ G+ +N+++IF +DNG               G+N+P RG K ++WEGG+
Sbjct: 259 AVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAAG---------GNNWPLRGWKRSIWEGGM 309

Query: 377 K----VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           +    V + L    +Q    V+ Q++H+SDW PTL     G +S   + +DG + W ++ 
Sbjct: 310 RGVGFVNSRLLPASVQGT--VNKQLIHVSDWFPTLVQGVAG-SSLDNITLDGFNMWKTIS 366

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
            NT S R   +  +D W     N P   N+ +        +AA+R+  WKL+ G  + G+
Sbjct: 367 SNTSSPRQELLHNMDPWP----NQPKEFNANI--------SAAIRVGHWKLLTG--DPGS 412

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
            +G                               Q I +P S I                
Sbjct: 413 NNG-------------------------------QWIPVPDSGI---------------- 425

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
              P+ P   T    +LFN+  DP E +++++S P++  +L   LK +    VP  +   
Sbjct: 426 --IPIHPLKITGKDTWLFNITADPNEYHDLSTSHPEVVEELMAKLKKYYADSVPVQYPS- 482

Query: 613 DLVQADPKRFNDTWSPW 629
           ++ +A+P      W PW
Sbjct: 483 EVEEANPAYHQGVWGPW 499



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 5/76 (6%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           N   AW+ VG Y+  +FT++  Q++   PV++P  LYL   + H+      L+ P     
Sbjct: 182 NEKVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEE 236

Query: 64  QFQYITDPNRRTYAAL 79
           +++ I + NRR YA +
Sbjct: 237 RYKNIKNTNRRIYAGM 252


>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
          Length = 571

 Score =  278 bits (712), Expect = 5e-72,   Method: Compositional matrix adjust.
 Identities = 183/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 58  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 177 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 232

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG  S +   +DG D W ++     S R   +  +D    
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTVS-VADGLDGYDVWPAISEGRASPRTEILHNIDP--- 394

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 441

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 442 LAAFPGSWW----------------NLERMASARQA---------------------VWL 464

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 465 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 521

Query: 628 PW 629
           PW
Sbjct: 522 PW 523



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 252

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 253 RSMGNVARRKYAAMV 267


>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
 gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
 gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
          Length = 597

 Score =  278 bits (710), Expect = 7e-72,   Method: Compositional matrix adjust.
 Identities = 184/556 (33%), Positives = 275/556 (49%), Gaps = 76/556 (13%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           A+T    L  ++    G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  
Sbjct: 69  AVTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TGKY IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P 
Sbjct: 128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKE 254
            RGF++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T+ 
Sbjct: 188 KRGFDTFFGSLLGSGDYYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQR 243

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
             Q++      KPLFLY+A+ A H+      L+AP      ++ I + NRR YAAM+  L
Sbjct: 244 VQQILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCL 298

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           D+++  V  AL+R G   NSIII+ SDNG  PT            GSN+P RG K T WE
Sbjct: 299 DEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWE 348

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG++    + SP ++    V  +++HI+DW PTL + A G      + +DG D W ++  
Sbjct: 349 GGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI-- 405

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
            +   R+  +D L     +     +   +    I      +A+R+  WKL+ G       
Sbjct: 406 -SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------ 458

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
            GY     S+ VP   F                       SN+   R   ++ T+  G +
Sbjct: 459 PGY-----SDWVPPQAF-----------------------SNLGPNRWHNERITLSTGKS 490

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
                         +LFN+  DP E+ +++S  P I  +L   L    +T VP  +   D
Sbjct: 491 -------------IWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD 537

Query: 614 LVQADPKRFNDTWSPW 629
             +++P+     W PW
Sbjct: 538 -PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
 gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
 gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
          Length = 599

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 88  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 146

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 147 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 206

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 207 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFL 262

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 263 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGF 317

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 318 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAIGFVHSPLLKN 367

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 368 KGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SEGLRSPRVDILHNI 423

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 424 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVKHWKLLTGN------PGY-----SDWVPPQS 472

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 473 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 496

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  QL   L    +T VP  +   D  +++P+     W PW
Sbjct: 497 FNITADPYERVDLSNRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 555



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 223 LYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 277

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 278 RYFEHYRSIVNINRRRYAAM 297


>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
          Length = 491

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 163/420 (38%), Positives = 235/420 (55%), Gaps = 39/420 (9%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
            L L    GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR+ L++G+Y IH
Sbjct: 4   VLVLADDLGWGDVGWHGS-AIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIH 62

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   IW  +P  +PL E+ LPE L+E GY T  +GKWHLG +R+E  P +RGF+++F
Sbjct: 63  TGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFDTYF 122

Query: 204 GYLNGVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
           GYL G   YY H   D+     ++ V     D R     A      Y+T+LFT+ A+ +I
Sbjct: 123 GYLLGSEDYYTH---DRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179

Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
            +   +KPLFLYLA  + H     + LE P++ +  +  I D  RR YA MV  +D++VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            +  AL+  G+ +N++ +F +DNG  T+           G+N+P RG K TLWEGGV+  
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGV 285

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
             + SP +++    S +++HISDWLPTL   AGG T+     +DG D W+++    PS R
Sbjct: 286 GFVASPLLKRKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWNTISEGRPSPR 344

Query: 440 NS---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                NID          G+D+ +SL  ++ S  +    NI      AA+R   WKL+ G
Sbjct: 345 VELLHNIDPMFVDDFPCLGIDKRTSLPQSSSSPWDDTFFNISIH---AAIRHGKWKLLTG 401



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T+LFT+ A+ +I +   +KPLFLYLA  + H     + LE P++ +  +  I D  RR
Sbjct: 166 YSTNLFTERAIDVIANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRR 220

Query: 75  TYAALT 80
            YA + 
Sbjct: 221 HYAGMV 226


>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
 gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
          Length = 598

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 204 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 260 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++   P I  QL   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
          Length = 600

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 185/561 (32%), Positives = 273/561 (48%), Gaps = 74/561 (13%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P   T AA        L    G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTP
Sbjct: 66  PEPSTTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTP 124

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+  +TGKY IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+
Sbjct: 125 SRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRK 184

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATD 249
           E  P  RGF++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T 
Sbjct: 185 ECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQ 240

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           ++T+   Q++      KPLFLY+A+ A H+      L+AP      ++ I + NRR YAA
Sbjct: 241 MYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAA 295

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVK 368
           M+  LD+++  V  AL+  G   NSIII+ SDNG  PT            GSN+P RG K
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSK 345

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T WEGG++    + SP ++    V  +++HI+DW PTL + A G      + +DG D W
Sbjct: 346 GTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVW 404

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
            ++   +   R+  +D L     +     +   +    I      +A+R+  WKL+ G  
Sbjct: 405 ETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN- 460

Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
                 GY     S+ VP  +F                       SN+   R   ++ ++
Sbjct: 461 -----PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERISL 487

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
             G +              +LFN+  DP E+ +++   P I  QL   L    +T VP  
Sbjct: 488 STGKS-------------VWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVR 534

Query: 609 HEQPDLVQADPKRFNDTWSPW 629
           +   D  +++P+     W PW
Sbjct: 535 YPPKD-PRSNPRLNGGVWGPW 554



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
          Length = 572

 Score =  278 bits (710), Expect = 8e-72,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 59  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 118 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 178 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 234 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 288

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 289 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 338

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 339 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 394

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 395 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 443

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 444 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 467

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 468 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 526



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 194 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 248

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 249 RYFEHYRSIININRRRYAAM 268


>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
          Length = 599

 Score =  278 bits (710), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 88  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 146

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 147 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 206

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 207 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFL 262

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 263 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGF 317

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 318 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 367

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 368 KGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SEGLRSPRVDILHNI 423

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 424 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 472

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 473 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 496

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++   P I  QL   L    +T VP  +   D  +++P+     W PW
Sbjct: 497 FNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 555



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 223 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 277

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 278 RYFEHYRSIVNINRRRYAAM 297


>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
          Length = 595

 Score =  278 bits (710), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 185/561 (32%), Positives = 273/561 (48%), Gaps = 74/561 (13%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           P   T AA        L    G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTP
Sbjct: 66  PEPSTTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTP 124

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR+  +TGKY IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+
Sbjct: 125 SRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRK 184

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATD 249
           E  P  RGF++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T 
Sbjct: 185 ECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQ 240

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           ++T+   Q++      KPLFLY+A+ A H+      L+AP      ++ I + NRR YAA
Sbjct: 241 MYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAA 295

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVK 368
           M+  LD+++  V  AL+  G   NSIII+ SDNG  PT            GSN+P RG K
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSK 345

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T WEGG++    + SP ++    V  +++HI+DW PTL + A G      + +DG D W
Sbjct: 346 GTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVW 404

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
            ++   +   R+  +D L     +     +   +    I      +A+R+  WKL+ G  
Sbjct: 405 ETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN- 460

Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
                 GY     S+ VP  +F                       SN+   R   ++ ++
Sbjct: 461 -----PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERISL 487

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
             G +              +LFN+  DP E+ +++   P I  QL   L    +T VP  
Sbjct: 488 STGKS-------------VWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVR 534

Query: 609 HEQPDLVQADPKRFNDTWSPW 629
           +   D  +++P+     W PW
Sbjct: 535 YPPKD-PRSNPRLNGGVWGPW 554



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
 gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
 gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
 gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
          Length = 598

 Score =  278 bits (710), Expect = 9e-72,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
          Length = 570

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 182/554 (32%), Positives = 272/554 (49%), Gaps = 114/554 (20%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           L IV   YGWND+ F  + ++ TPNID LA++G++LN+ Y  PVCTPSR S MTG Y   
Sbjct: 28  LFIVADDYGWNDVGFR-NPDVLTPNIDKLAHSGMVLNSSYVMPVCTPSRNSFMTGHYAFK 86

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           +G+Q   I   +    PL   FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ F
Sbjct: 87  SGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFF 146

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           G+ NG   YY          T+ + G    R+  T  +  GEY+T L+ + A  +I++  
Sbjct: 147 GFYNGQEDYY----------TLSVAGGKDFRDNRTPVNATGEYSTFLYARRAESIIKEHD 196

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
             KP+++YL   + H     + L+ PQ+  + +  +   +RRTY  MV  +D+++G +  
Sbjct: 197 ASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTYLGMVTAMDEAIGNITK 251

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW----GSNYPYRGVKNTLWEGGVKVP 379
           AL+ KGM  +++IIF +DNG              W    G+NYP RG K T++EGG +  
Sbjct: 252 ALKTKGMFNDTLIIFTADNGG-------------WITFAGNNYPLRGGKATVFEGGTRAT 298

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
           A +    IQ++  V   M+H  DW+PT+  AAGG                     TP   
Sbjct: 299 AFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAGG---------------------TP--- 334

Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDG 495
           +  +DG+DQW SL     S+R   + NID+ +      AA+R+  +KLV G        G
Sbjct: 335 DPTLDGIDQWDSLRTAGESKRTEFIYNIDDLRPEICGHAAIRMGDYKLVDG------FPG 388

Query: 496 YY-GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
           +Y G  + ++V             Y   + +S N   P    DK+               
Sbjct: 389 FYPGWYKPDQV------------DYGLYEDVSTNYTRPF--FDKVM-------------- 420

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
                         LFNL +DP E  +++   PDI  +L   L+Y+ + +VP +  +   
Sbjct: 421 --------------LFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSA 466

Query: 615 VQADPKRFNDTWSP 628
             A P  +   W+P
Sbjct: 467 SSA-PTNYGGFWTP 479



 Score = 42.7 bits (99), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 21/80 (26%), Positives = 42/80 (52%), Gaps = 5/80 (6%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
           R+  T  +  GEY+T L+ + A  +I++    KP+++YL   + H     + L+ PQ+  
Sbjct: 167 RDNRTPVNATGEYSTFLYARRAESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYS 221

Query: 63  NQFQYITDPNRRTYAALTKS 82
           + +  +   +RRTY  +  +
Sbjct: 222 DMYSKVHTESRRTYLGMVTA 241


>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
          Length = 613

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 180/542 (33%), Positives = 263/542 (48%), Gaps = 84/542 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 103 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 161

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 162 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 221

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 222 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLY 277

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 278 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 332

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 333 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 383

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG  S +   +DG D W ++     S R   +  +D    
Sbjct: 384 RTSRALVHITDWYPTLVGLAGGVVSAVD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 439

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G   NG           + +P   
Sbjct: 440 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGNG-----------DWIPPQT 486

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A        S +Q+                                           L
Sbjct: 487 LAAFPGXXXXPSFRQVVG-----------------------------------------L 505

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 506 FNISADPYERKDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 562

Query: 628 PW 629
           PW
Sbjct: 563 PW 564



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 243 SVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 297

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 298 RTMGNVARRKYAAMV 312


>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 585

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 185/569 (32%), Positives = 288/569 (50%), Gaps = 87/569 (15%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ 125
           +T   R     L K   + L++    GWND+ ++ ++ +PTP ++ LA NG+ILNN Y+Q
Sbjct: 93  MTTAERPPCGDLGKPPHIVLVLADDVGWNDVGWN-NDFMPTPILNELASNGVILNNTYSQ 151

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           P C+PSRA+L+TGKYP + G+Q   +    P  +PL    L   L+ELGY   AIGKWHL
Sbjct: 152 PACSPSRAALLTGKYPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHL 211

Query: 186 GFFRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           GF   +YTPL+RGF+S +G  NG +S Y  HI+   +      +G D+R N        G
Sbjct: 212 GFCNWKYTPLWRGFDSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENG 271

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
            + T LFT+ A ++I +     PLFLY A L  H       L+AP    N  + + D  R
Sbjct: 272 THVTYLFTERAERIIRNHNPAAPLFLYTALLVPHG-----PLDAPPGFENSVE-VNDTRR 325

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
           R YA M+  +D++VG + +AL+ KGM  +++ IF+SDNG               G+NYP+
Sbjct: 326 REYAGMISAMDEAVGNITNALKDKGMWNDTLFIFLSDNGGDMFYA---------GNNYPF 376

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           RG K +LWEG VK+P  ++   +++    + +++H +D   TL +AAGG           
Sbjct: 377 RGNKASLWEGSVKIPGFIYGNMLEKKGYSNNELIHFTDLFATLLSAAGG----------- 425

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINID--EKKRTAAVRLDSWK 482
                     TP   + +IDG++QW ++     S+R+ +L+++D  E  R AA+R++ +K
Sbjct: 426 ----------TP---DDDIDGINQWDTVCRGDASKRSEILLHLDTFEATRGAALRMNDYK 472

Query: 483 LVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
            + G  E               VP       VE+     L Q     ++P  + D     
Sbjct: 473 YIEGITE--------------LVP-----QRVEAHERFHLDQW----YVPAEHPDA---- 505

Query: 543 RQQATIHCGANPAPMTPSPCTN-GPCYLFNLGNDPCEQNNIASS--RPDISSQLYELLKY 599
                      P P  P    N    +LFNL +DP E  N+A+     DI  ++   L+Y
Sbjct: 506 -----------PIPDAPLISQNVNNTFLFNLRDDPLETTNLATDPEMQDILEEMQARLQY 554

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           ++    P  + + ++ ++DPK F+  W+P
Sbjct: 555 YKDNAAPLWYPE-NIDESDPKYFDGAWTP 582


>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
          Length = 555

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
          Length = 614

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R++  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQA 469

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294


>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
          Length = 382

 Score =  277 bits (709), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 147/387 (37%), Positives = 219/387 (56%), Gaps = 44/387 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ F  + ++ TP++D LA  G+ILN+ Y QP+C+PSR   M+G +P HTG+Q   I
Sbjct: 36  GWNDVGFR-NPQVLTPHLDKLAKAGVILNSSYVQPLCSPSRNCFMSGYFPYHTGLQDGVI 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A P  VP+   FLP+ L+ELGYST A+GKWHLGF   +YTP YRGF++  GY  G   
Sbjct: 95  RPASPGFVPIKFTFLPQKLKELGYSTHAVGKWHLGFCNLKYTPTYRGFDTFVGYYIGAED 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H  + +Y +    +G+D+R N S   +  G+Y+T +F + AV +I+    D PL+LY
Sbjct: 155 YYKH--TREYDK---FSGYDLRFNTSVYTEAKGKYSTRVFAERAVDIIKSHDTDTPLYLY 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L   A HA      LE P E  N +++I D  RRTY  M+  LD++V  + +AL+  G++
Sbjct: 210 LPFQAVHA-----PLEVPPEYENLYKHIHDLPRRTYCGMISALDEAVANITNALEETGLI 264

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI-QQN 390
           +N +++F +DNG P         +    +N P RG K TLWEGG K    ++S  + ++ 
Sbjct: 265 DNLLLVFTTDNGGP---------FNGAANNLPLRGCKATLWEGGTKGAGFIYSKTLLKKT 315

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             ++  MMH  DW PT    AGG+ +                       + N+DG+ Q+ 
Sbjct: 316 GYLNTGMMHAVDWYPTFVELAGGENT-----------------------DPNMDGISQYD 352

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVR 477
            L+   PS+R   + NI +K ++AA+R
Sbjct: 353 MLINGGPSKRKEFVYNIYDKSKSAAIR 379



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N S   +  G+Y+T +F + AV +I+    D PL+LYL   A HA      LE P E
Sbjct: 170 LRFNTSVYTEAKGKYSTRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPE 224

Query: 61  TINQFQYITDPNRRTYAALTKS 82
             N +++I D  RRTY  +  +
Sbjct: 225 YENLYKHIHDLPRRTYCGMISA 246


>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
          Length = 571

 Score =  277 bits (708), Expect = 1e-71,   Method: Compositional matrix adjust.
 Identities = 178/539 (33%), Positives = 267/539 (49%), Gaps = 72/539 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 59  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 118 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 178 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFL 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 234 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKRYGF 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NSIII+ SDNG   +           GSN+P RG K T WEGG++    + SP ++  
Sbjct: 289 YNNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L    
Sbjct: 340 GTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 395

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   F
Sbjct: 396 PIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQAF 444

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                  SN+   R   ++ T+  G +              +LF
Sbjct: 445 -----------------------SNLGPNRWHNERITLSTGKS-------------IWLF 468

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           N+  DP E+ +++   P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 469 NITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 526



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 194 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPG 248

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 249 RYFEHYRSIVNINRRRYAAM 268


>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
          Length = 503

 Score =  277 bits (708), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 152/405 (37%), Positives = 229/405 (56%), Gaps = 29/405 (7%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG++D+ +HGS  I TPNID LA+ G+ L N Y QP+CTP+R+ L++G+Y IHTG+Q   
Sbjct: 41  YGYHDVGYHGS-AIRTPNIDRLAFEGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSI 99

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW A+P  +P     L + LRE GY+   +GKWHLGF++ EY P  RGF+S +GYL G  
Sbjct: 100 IWAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTNRGFDSFYGYLTGSE 159

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKP 267
            YY+           ++ G D R N  +  ++  EY+T L+ +   QL+ D      D+P
Sbjct: 160 FYYNKTYCLAQINRSDVCGLDFRENDRSINES--EYSTHLYAERTKQLVADHTSAHPDQP 217

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFLYLA  + H       LE P +    +++I D NR+ YA MV  +D++V  V  A ++
Sbjct: 218 LFLYLALQSVHG-----PLEVPAQYRTPYKHIKDENRQIYAGMVSCMDEAVKNVTDAFKK 272

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+ +N+I++F +DNG    E          G+N+P RG K +LW+GG++    + S  +
Sbjct: 273 YGLWDNTILVFSTDNGGQVAE---------GGNNWPLRGWKGSLWDGGMRGVGFVHSTLL 323

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +Q   V+ Q+MH+SDW PTL   AGG+ S + L  DG DQW+++       R   +  +D
Sbjct: 324 KQKGAVAHQLMHVSDWFPTLLNQAGGNASSVKL--DGFDQWAAISDGAEGPRTEILHNID 381

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
                 LN   +R      + +    AA+R++ WKL+ G   NG+
Sbjct: 382 P-----LN--GKRGKRTEEVFDNTVRAALRMNKWKLLTGEPGNGS 419



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 8/70 (11%)

Query: 14  EYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
           EY+T L+ +   QL+ D      D+PLFLYLA  + H       LE P +    +++I D
Sbjct: 192 EYSTHLYAERTKQLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHIKD 246

Query: 71  PNRRTYAALT 80
            NR+ YA + 
Sbjct: 247 ENRQIYAGMV 256


>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
          Length = 597

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 178/539 (33%), Positives = 267/539 (49%), Gaps = 72/539 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 85  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 203

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 204 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+R G 
Sbjct: 260 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKRYGF 314

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NSIII+ SDNG   +           GSN+P RG K T WEGG++    + SP ++  
Sbjct: 315 YNNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 365

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L    
Sbjct: 366 GTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 421

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R+  WKL+ G        GY     S+ VP   F
Sbjct: 422 PIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQAF 470

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                  SN+   R   ++ T+  G +              +LF
Sbjct: 471 -----------------------SNLGPNRWHNERITLSTGKS-------------IWLF 494

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           N+  DP E+ +++   P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 495 NITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPG 274

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIVNINRRRYAAM 294


>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
          Length = 596

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
          Length = 597

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 269/540 (49%), Gaps = 75/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 86  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 144

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 145 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 204

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 205 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 260

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 261 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 315

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 316 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 365

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 366 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 421

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 422 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 470

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 471 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 494

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP S   P   +++P+     W PW
Sbjct: 495 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVS--IPQDPRSNPRLNGGVWGPW 552



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 221 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 275

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 276 RYFEHYRSIININRRRYAAM 295


>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
          Length = 502

 Score =  276 bits (707), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 167/411 (40%), Positives = 232/411 (56%), Gaps = 36/411 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TP +D L+  G+ L   Y QP+CTPSR+ L++G+Y IHTG+Q   I
Sbjct: 24  GWNDVGFHGS-EILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 82

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P   PL ++ LPE L+E GY T  +GKWHLG ++ +  P  RGF+S+FGY  G   
Sbjct: 83  WPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTRRGFDSYFGYWTGGED 142

Query: 212 YYDHILSDQYSRTVELN----GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           YY H   ++      LN      D R     A D   +Y+T LFT  AV LI +   +KP
Sbjct: 143 YYSH---ERCYLITTLNITRCALDFRDGEVPATDYQMKYSTHLFTDRAVDLITNHNPEKP 199

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFLYLA+ A H+      L+ P + I  +  I D NRR YA MV  LD++V  V +AL++
Sbjct: 200 LFLYLAYQAVHSP-----LQVPDQYIEPYTSIHDKNRRLYAGMVSVLDEAVANVTNALKK 254

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           +G+  N++ IF +DNG  T+         + G+N+P RG K TLWEGG++    + S  +
Sbjct: 255 QGLWNNTVFIFSTDNGGQTL---------SGGNNWPLRGRKWTLWEGGIRGVGFVTSSLL 305

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNS---NI 443
           +Q    S +++HISDWLPTL   AGG T S  PL  DG D W ++   +PS R     NI
Sbjct: 306 KQKGVTSKELIHISDWLPTLVNLAGGSTNSTKPL--DGFDVWKTISEGSPSPRVELLHNI 363

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRT--------AAVRLDSWKLVLG 486
           D L + S     +P+ R+    +   K +         AA+R  +WKL+ G
Sbjct: 364 DPLFEDSFPCSVSPNTRDQAPTHAATKWQDSPFNVSVHAAIRHGNWKLLTG 414



 Score = 53.9 bits (128), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 5/67 (7%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y+T LFT  AV LI +   +KPLFLYLA+ A H+      L+ P + I  +  I D NR
Sbjct: 177 KYSTHLFTDRAVDLITNHNPEKPLFLYLAYQAVHSP-----LQVPDQYIEPYTSIHDKNR 231

Query: 74  RTYAALT 80
           R YA + 
Sbjct: 232 RLYAGMV 238


>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
          Length = 605

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 93  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 151

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 152 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 211

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   +++      KP+FL
Sbjct: 212 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFL 267

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 268 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 322

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 323 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 372

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + IDG D W ++   +   R+  +D L   
Sbjct: 373 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQIDGYDIWETI---SEGLRSPRVDILHNI 428

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 429 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 477

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 478 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 501

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 502 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 560



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   +++      KP+FLY+A+ A H+      L+AP 
Sbjct: 228 LYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFLYIAYQAVHS-----PLQAPG 282

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 283 RYFEHYRSIININRRRYAAM 302


>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
          Length = 599

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
 gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
 gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
          Length = 598

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
 gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
 gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
 gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
 gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
 gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
          Length = 599

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
          Length = 506

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 180/556 (32%), Positives = 273/556 (49%), Gaps = 75/556 (13%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
            A+ K   + +LI    GWND+S H S  + TPN+D L    + L + Y  P+CTP+R+ 
Sbjct: 13  VAIAKKPNVLILIADDLGWNDISLHNS-YLSTPNVDGLIQESLHLQSYYVNPICTPTRSV 71

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           LM+G+Y IHTG+Q   I GA+P G+PLT+   PE  ++ GY T  +GKWHLGF+  +  P
Sbjct: 72  LMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVP 131

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
             RGFESH+GYL G   +Y+H    Q+ +    NG D R   ++   + G+Y+ DLF K 
Sbjct: 132 ENRGFESHYGYLIGAEGHYNH---SQFMQG--QNGVDFRDGGASTNSSWGQYSADLFAKR 186

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
              L+E   V++ L++Y+     H       LEAPQ  ++QF +I D +RR YAAM K +
Sbjct: 187 VEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRDRRVYAAMTKSM 241

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           DD+VG +I++ + KG+ E++I+ F +DNG  T+           G+N+P RG+KNTLWEG
Sbjct: 242 DDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTLYG---------GNNWPLRGLKNTLWEG 292

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G++    +  P +  + R   Q+MH+ D  PTL                 LD     L +
Sbjct: 293 GIRGIGSIKIPGVSADKRD--QLMHVVDMFPTL-----------------LD-----LTS 328

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
            P+ R    DG  Q   L+ N  S+ +S LINID  ++       +W      +    + 
Sbjct: 329 CPNTRKVKFDGKSQAKMLVDNDESQSDSFLINIDPLRKNKGADNRTWHSDFDVRVQAGL- 387

Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
                 R  +  LL       S  Y      + +I+ P    D  ++++           
Sbjct: 388 ------RWQQWKLL--TGAPSSDNYP-----TGHIYPPEWPQDGRKNSQSNFE------- 427

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
                   +N    LF++ NDP E   I++   +I  ++  +L  +  T VP     PD 
Sbjct: 428 -------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYNSTAVPCRWPNPDP 480

Query: 615 VQADPKRFNDTWSPWI 630
               P +    W PWI
Sbjct: 481 AGL-PDQNGGFWKPWI 495



 Score = 60.5 bits (145), Expect = 3e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y+ DLF K    L+E   V++ L++Y+     H       LEAPQ  ++QF +I D +
Sbjct: 176 GQYSADLFAKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRD 230

Query: 73  RRTYAALTKSTTLTL 87
           RR YAA+TKS   T+
Sbjct: 231 RRVYAAMTKSMDDTV 245


>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
          Length = 572

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 181/542 (33%), Positives = 267/542 (49%), Gaps = 80/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 288

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRKR 339

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             S  ++HI+DW PTL   AGG  S     +DG D W ++     S R   +  +D    
Sbjct: 340 WTSHALVHITDWYPTLVGLAGGTVSAAD-GLDGYDVWPAVSEGRASPRTEILHNIDP--- 395

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G   +G           + +P   
Sbjct: 396 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGSG-----------DWIPPQT 442

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 443 LAAFPGS----------------WWNLERMASVRQA---------------------VWL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 522

Query: 628 PW 629
           PW
Sbjct: 523 PW 524



 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 253

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 254 RTMGNVARRKYAAMV 268


>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
          Length = 599

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 267/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N   AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLACLDEAINNVTLALETYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL T A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLITLAEGQIDE-DVQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G                +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKT-------------IWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 48.5 bits (114), Expect = 0.010,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N   AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
          Length = 597

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
          Length = 599

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
          Length = 593

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 180/540 (33%), Positives = 270/540 (50%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 81  GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 139

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 140 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 199

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 200 YYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFL 255

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 256 YIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGF 310

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 311 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 360

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G        +DG D W ++   +   R+  +D L   
Sbjct: 361 KGTVCKELVHITDWYPTLISLAEGQIDE-DTQLDGYDVWETI---SEGLRSPRVDILHNI 416

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 417 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 465

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 466 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 489

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++S  P I  +L + L    +T VP  +   D  +++P+     W PW
Sbjct: 490 FNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 548



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 216 LYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 270

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 271 KYFEHYRSIININRRRYAAM 290


>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
          Length = 596

 Score =  276 bits (706), Expect = 2e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
          Length = 599

 Score =  276 bits (706), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
          Length = 596

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +              +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 187/549 (34%), Positives = 281/549 (51%), Gaps = 82/549 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H    I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 55  GFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRYQIHTGLQHSII 113

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P     LPE LR  GYST  +GKWHLGF+R+   P  +GF++ FG L G + 
Sbjct: 114 RSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFDTFFGSLTGSVD 173

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   +       + G+D+  N   AW   G+Y+T LFT+ A ++++   P D+PLFL
Sbjct: 174 YYSYGSCNGPG----VCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFL 229

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+  A H       L+ P+  +  ++ + + +RR +AAMV  +D++V  V  AL++ G 
Sbjct: 230 LLSLQAVHGP-----LQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNVTYALRKYGY 284

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
            +NS+II+ +DNGA P V           GSN+P RG K T WEGGV+  A + SP ++ 
Sbjct: 285 YKNSVIIYSTDNGAQPYVG----------GSNWPLRGRKGTYWEGGVRGVAFVHSPLLKH 334

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             RVS  ++HI+DW PTL   AGG+ S+ P  +DG D W ++     S R   +  ++  
Sbjct: 335 RRRVSTALLHITDWFPTLVGLAGGNISQSP-GLDGFDVWPTISEGKESPRKEILHNINP- 392

Query: 450 SSLLLNTPSRRN--------SVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
              L   PS+R         S L  + +    AA+R+  WKL+ G   +G          
Sbjct: 393 ---LHRAPSKRGICNHSMLPSGLSAVWDTSVQAAIRVGDWKLLTGDPGDG---------- 439

Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
            + VP+                Q+      P  N+++       ++IH            
Sbjct: 440 -DWVPV----------------QMLSTPPRPWWNLERT----LMSSIHKDV--------- 469

Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
                 +LFN+  DP E+ +++  RPD+  QL   L ++ +T VP  +  PD  QADP R
Sbjct: 470 ------WLFNITADPYERQDLSQQRPDVVLQLLGRLAFYNQTAVP-VYFPPDDPQADPDR 522

Query: 622 FNDTWSPWI 630
               W PW+
Sbjct: 523 HQGAWVPWV 531



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 45/81 (55%), Gaps = 6/81 (7%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
           N   AW   G+Y+T LFT+ A ++++   P D+PLFL L+  A H       L+ P+  +
Sbjct: 193 NERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFLLLSLQAVHGP-----LQPPKSFV 247

Query: 63  NQFQYITDPNRRTYAALTKST 83
             ++ + + +RR +AA+  + 
Sbjct: 248 YLYRDMANVDRRKFAAMVSTV 268


>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
          Length = 514

 Score =  276 bits (705), Expect = 3e-71,   Method: Compositional matrix adjust.
 Identities = 165/418 (39%), Positives = 231/418 (55%), Gaps = 35/418 (8%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
            L L    GW D+ +HGS  I TP +DAL   G+ L   Y QP+CTPSR+ L++G+Y IH
Sbjct: 27  VLVLADDLGWGDVGWHGS-AIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIH 85

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   IW  +P  +PL E+ LPE L++ GY T  +GKWHLG +R+E  P  RGF+++F
Sbjct: 86  TGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYF 145

Query: 204 GYLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
           GYL G   YY  DH +  + ++ V     D R     A      Y+T+LFT+ A+ LI +
Sbjct: 146 GYLLGSEDYYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIAN 204

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              +KPLFLYLA  + H     + LE   E +  +  I D  RR YA MV  +D++VG +
Sbjct: 205 HKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNL 259

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
             AL+  G+  N++++F +DNG  T+           G+N+P RG K TLWEGGV+    
Sbjct: 260 TDALKEYGLWNNTVLVFSTDNGGQTMAG---------GNNWPLRGRKWTLWEGGVRGVGF 310

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           + SP ++Q    S +++HISDWLPTL   AGG T+     +DG D W ++    PS R  
Sbjct: 311 VASPLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWKTISEGRPSPRVE 369

Query: 442 ---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
              NID          G+D  +SL     S  ++ L NI      AA+R   WKL+ G
Sbjct: 370 LLHNIDPMFVDDYPCLGIDNRTSLPQINSSPWDNTLFNISVH---AAIRHGKWKLLTG 424



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T+LFT+ A+ LI +   +KPLFLYLA  + H     + LE   E +  +  I D  RR
Sbjct: 189 YSTNLFTERAIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRR 243

Query: 75  TYAALT 80
            YA + 
Sbjct: 244 RYAGMV 249


>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
          Length = 661

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 177/539 (32%), Positives = 266/539 (49%), Gaps = 72/539 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 149 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 207

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 208 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 267

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 268 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 323

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 324 YIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKAYGF 378

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NSIII+ SDNG               GSN+P RG K T WEGG++    + SP ++  
Sbjct: 379 YNNSIIIYSSDNGGQPAA---------GGSNWPLRGSKGTYWEGGIRAIGFVHSPLLKNK 429

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L    
Sbjct: 430 GTVCRELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 485

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +F
Sbjct: 486 PIYTKAKNGSLAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF 534

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
                                  SN+   R   ++ T+  G +              +LF
Sbjct: 535 -----------------------SNLGPTRWHNERITLSTGKS-------------VWLF 558

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           N+  DP E+ +++   P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 559 NITADPYERVDLSHRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 616



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 284 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPV 338

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 339 RYFEHYRSIVNINRRRYAAM 358


>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
          Length = 575

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 175/539 (32%), Positives = 269/539 (49%), Gaps = 73/539 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GFHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct: 119 RPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AWD  G+Y+T L+ +   Q++      +P+F+Y
Sbjct: 179 YYSYDNCDGPG----VCGFDLHEGENVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIY 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I  ++ + +  RR YAAMV  +D +V  +  AL++ G  
Sbjct: 235 VAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVTCMDAAVKNITRALKKYGYY 289

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+I+F SDNG  T          + GSN+P RG K T WEGGV+    + SP I++  
Sbjct: 290 SNSVIVFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLIKKKR 340

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG+ S    + +GL                  DG D W +
Sbjct: 341 RTSRALMHITDWYPTLVKLAGGNIS----SAEGL------------------DGYDVWGA 378

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           +     S R  +L NID     A           G+ E G   G +       + + +F 
Sbjct: 379 ISEGKESPRTEILHNIDPLYNLAKS---------GSLEEG--QGVWNTAVQASIRVKDFK 427

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANPAPMTPSPCTNGPCYLF 570
            +     Y          ++P   +     +      H  GA  +            +LF
Sbjct: 428 LLTGDPGYSD--------WIPPQTLTNFPGSWWNLERHTDGARKS-----------LWLF 468

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           N+  DP E+++++SS+P++  +L   L Y+ RT +P  +   D  + +P+     W PW
Sbjct: 469 NITADPFERHDLSSSKPEVVKELLIRLGYYNRTAIPVRYPAED-PRGNPELNGGAWGPW 526



 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AWD  G+Y+T L+ +   Q++      +P+F+Y+A  A H       L++P+E I  +
Sbjct: 200 NVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIYVAFQAVHT-----PLQSPREYIYPY 254

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 255 RGMGNVARRKYAAMV 269


>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
          Length = 513

 Score =  275 bits (703), Expect = 5e-71,   Method: Compositional matrix adjust.
 Identities = 155/394 (39%), Positives = 227/394 (57%), Gaps = 32/394 (8%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   IW  +P  VPL E+ 
Sbjct: 49  TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYS 222
           LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   YY H    L D  +
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLID--A 166

Query: 223 RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA 282
             V     D R     A      Y+T++FTK A+ LI + P +KPLFLYLA  + H    
Sbjct: 167 LNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH---- 222

Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
            + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  G+  N++ IF +DN
Sbjct: 223 -EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDN 281

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
           G  T+           G+N+P RG K +LWEGGV+    + SP ++Q    + +++HISD
Sbjct: 282 GGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISD 332

Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
           WLPTL   A G T+     +DG D W ++   +PS R   +  +D   + + ++P  RNS
Sbjct: 333 WLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID--PNFVDSSPCPRNS 389

Query: 463 VLINIDEKKRT----------AAVRLDSWKLVLG 486
           +    D+              AA+R  +WKL+ G
Sbjct: 390 MAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 423



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A+ LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 190 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 244

Query: 75  TYAALT 80
            YA + 
Sbjct: 245 HYAGMV 250


>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
          Length = 578

 Score =  275 bits (702), Expect = 7e-71,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++   +   R+  +D L   
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             +     +   +    I      +A+R+  WKL+ G        GY     S+ VP  +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F                       SN+   R   ++ T+  G +  P             
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKSVWP------------- 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ ++++  P I  +L   L    +T VP  +   D  +++P+     W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
          Length = 597

 Score =  274 bits (700), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 184/542 (33%), Positives = 271/542 (50%), Gaps = 85/542 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 89  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 147

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+ELGYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 148 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 207

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 208 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 263

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 264 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 318

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 319 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 369

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 370 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 425

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 426 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 472

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
             A   S                  N+++M S RQ                       +L
Sbjct: 473 LAAFPGS----------------WWNLERMASARQA---------------------VWL 495

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD +     L+ Y+ RT +P  +  E P   +A P      W 
Sbjct: 496 FNISADPYEREDLAGQRPDXA----RLVDYN-RTAIPVRYPAENP---RAHPDFNGGAWG 547

Query: 628 PW 629
           PW
Sbjct: 548 PW 549



 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 229 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 283

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 284 RTMGNVARRKYAAMV 298


>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
          Length = 483

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 180/543 (33%), Positives = 267/543 (49%), Gaps = 102/543 (18%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGWND+ F   + + TPNID LA +G+ILN+ Y  PVCTPSR S MTG+Y   +G+Q   
Sbjct: 35  YGWNDVGFRNPS-VLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIV 93

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +    PL   FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ +GY NG  
Sbjct: 94  ILPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAE 153

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            YY+  ++          G D R N  T  +  GEY+T L+ + A  +I+D    KP+++
Sbjct: 154 DYYNLSIA---------GGKDFRDN-RTPVNATGEYSTILYARRAESIIKDHDASKPMYM 203

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQ-FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL   + H     + L+   E+ +  +  +   +RRTY  MV  +D+++G +  AL+ KG
Sbjct: 204 YLPFQSVH-----EPLQVSLESSSDIYSKVHTESRRTYLGMVTAMDEAIGNITKALKTKG 258

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           M  +++I+F +DNG   + Y         G+NYP RG K T++EGG +  A +    IQ+
Sbjct: 259 MFNDTLIVFTADNGG-WITYA--------GNNYPLRGGKTTVFEGGTRATAFVSGAGIQK 309

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           +  V   M+H  DW+PT+ +AAGG                     TP   +  +DG+DQW
Sbjct: 310 SNYVYPGMIHAVDWMPTVLSAAGG---------------------TP---DPTLDGIDQW 345

Query: 450 SSLLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
            SL     S+R   + NID+ +      AA+R+  +KL+         DGY G  +    
Sbjct: 346 DSLRTAGESKRTEFIYNIDDLRPEICGHAAIRMGDYKLI---------DGYPGAFQDWYK 396

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
           P       V S  Y+ +       F      DKM                          
Sbjct: 397 P-----DQVVSDLYEDVSLNDTRPFF-----DKMM------------------------- 421

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
              LFNL +DP E  +++   PDI  +L   L+Y+ + +VP +  +     A P  +   
Sbjct: 422 ---LFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSA-PTNYGGF 477

Query: 626 WSP 628
           W+P
Sbjct: 478 WTP 480


>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 558

 Score =  273 bits (699), Expect = 1e-70,   Method: Compositional matrix adjust.
 Identities = 177/541 (32%), Positives = 271/541 (50%), Gaps = 77/541 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H + +I TP +D LA  G+ L N Y QP+CTPSR+  +TG+Y IHTG+Q   I
Sbjct: 49  GFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P   R LP+ L+E GY+T  +GKWHLGF++R+  P  RGF ++FG L G + 
Sbjct: 108 RSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFNTYFGSLTGSVD 167

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D      ++ G D+      AW   G Y+T L+T+   +++    P  +PLF+
Sbjct: 168 YYTYKSCD----GPKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFI 223

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H       L++P+E +  ++ + +  RR YA MV  +D++V  V  AL++ G 
Sbjct: 224 FLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRNVTYALRKYGY 278

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +N++I F +DNG   +           GSN+P RG K T WEGGV+    + SP +++ 
Sbjct: 279 YKNTVIFFSTDNGGQPLFG---------GSNWPLRGRKGTYWEGGVRGIGFVHSPLLRRR 329

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGG+ S+    +DG D W ++     S R   +  +D   
Sbjct: 330 RRVSRALIHITDWYPTLMRLAGGNVSQAE-GLDGFDMWDTISDGKESPRFEILHNIDP-- 386

Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
              L   +R  S+     I      AAVR+  WKL+ G    G           + +P  
Sbjct: 387 ---LYNAARHGSLKNGYGIWNTAVQAAVRVGDWKLLTGNPGYG-----------DWIPPQ 432

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
                 ES  + SL+                R T  + ++                   +
Sbjct: 433 MLGNFPES--WWSLE----------------RHTEAKKSL-------------------W 455

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+ +DP E+ ++A  RPD+  QL   L ++ RT VP  +   D  +ADP R    W P
Sbjct: 456 LFNVADDPYERFDLAERRPDVVKQLLARLSFYNRTAVPVRYPSED-PRADPSRNGGAWGP 514

Query: 629 W 629
           W
Sbjct: 515 W 515



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 6/78 (7%)

Query: 7   TAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
            AW   G Y+T L+T+   +++    P  +PLF++L+  A H       L++P+E +  +
Sbjct: 190 VAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFIFLSFQAVHT-----PLQSPKEYVYPY 244

Query: 66  QYITDPNRRTYAALTKST 83
           + + +  RR YA +  + 
Sbjct: 245 RRMGNMFRRKYAGMVSAV 262


>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
          Length = 574

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 177/542 (32%), Positives = 280/542 (51%), Gaps = 79/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+    + AWD  G+Y+T L+ +   +++      +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+I+F +DNG  T          + GSN+P RG K T WEGGV+    + SP I++  
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL + A G+ S +P  +DG + W ++     S R   +  +D    
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A++R+  WKL+ G  + G  D    QT      L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F       ++ +L++L+          D +R +                         +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ +++  RPDI   L   L ++ RT +P  +  E P   +A P      W 
Sbjct: 469 FNITADPYERYDLSEQRPDIVRALLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525

Query: 628 PW 629
           PW
Sbjct: 526 PW 527



 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AWD  G+Y+T L+ +   +++      +P+F+Y+A  A H       L++P+E I ++
Sbjct: 201 NVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRY 255

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 256 RSMGNVARRKYAAMV 270


>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
           (N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
           intestinalis]
          Length = 522

 Score =  273 bits (698), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 177/557 (31%), Positives = 283/557 (50%), Gaps = 79/557 (14%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LLIV    G+ND+ +H   +I TPNI+ LA +G+IL + Y QP+CTPSR+ LMTG+Y IH
Sbjct: 30  LLIVADDLGFNDVGYHNP-DIYTPNINKLAKDGVILESYYVQPICTPSRSQLMTGRYQIH 88

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   I+  +P  +P+ E  LP+ L+E GY+T A+GKWHLGF+++E  P  RGF++ +
Sbjct: 89  TGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLGFYKKECLPTSRGFDTFY 148

Query: 204 GYLNGVISYYDHILSDQY---SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           GY  G   YY   +   +   ++T  ++G D   N  T W+  G Y++ L+   AV++I+
Sbjct: 149 GYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEANGTYSSYLYRDRAVRIIK 208

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
                 PLF+YL   + H       L+ P + I ++++I D  RRT++AMV  +D+++G+
Sbjct: 209 SHNSSIPLFMYLPFQSVHFP-----LQVPAKYIKRYRHIKDRKRRTFSAMVTAMDEAIGS 263

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V+ AL+ KG+ ++++++F +DNG  T+           G+N+P RG K +LWEGGV+   
Sbjct: 264 VVDALKWKGIWQDTLVVFTTDNGGQTLF---------GGNNWPLRGRKASLWEGGVRGVG 314

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLL-LNTPSR 438
           ++    I+   R S +++HISDW PTL   AG + S   PL  D  + W ++   +  S 
Sbjct: 315 LVRGYGIRDKGRSSNELVHISDWFPTLLYIAGANASGTKPL--DSYNVWDAIANKDVVSP 372

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLI------NIDEKKRTAAVRLDSWKLVLGTQENGT 492
           R   +  +D     L   P      LI       ++  K  AA+R   WKL  G      
Sbjct: 373 RVELLHNIDP----LFTRPYTPTPKLIPYPNKHGVNTTKGHAALRYKHWKLHTGDPGYDK 428

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
            D                  +  +K+Y +L  +  N         K +ST    ++H   
Sbjct: 429 WD------------------LPPTKSYCNLSGMHTN------TNSKYKSTHLVPSVH--- 461

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
                           LF++ NDP E+ N+A   P +   L   +  + ++ VP +    
Sbjct: 462 ----------------LFDIENDPQERKNLAQIYPKVVDLLLGKIAMYNKSAVPINFPPY 505

Query: 613 DLVQADPKRFNDTWSPW 629
           D  +++P      W PW
Sbjct: 506 D-PKSNPSLHGGVWEPW 521



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 5/79 (6%)

Query: 4   NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           N  T W+  G Y++ L+   AV++I+      PLF+YL   + H       L+ P + I 
Sbjct: 183 NSRTEWEANGTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHFP-----LQVPAKYIK 237

Query: 64  QFQYITDPNRRTYAALTKS 82
           ++++I D  RRT++A+  +
Sbjct: 238 RYRHIKDRKRRTFSAMVTA 256


>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
          Length = 481

 Score =  273 bits (697), Expect = 2e-70,   Method: Compositional matrix adjust.
 Identities = 179/542 (33%), Positives = 267/542 (49%), Gaps = 105/542 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FH   +I TPNID LA  G++LN+ Y QP+C+PSRA+ M+G YP  TG+Q   I
Sbjct: 37  GWNDIGFHNP-DIITPNIDKLAREGLLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+ELGY+T  +GKWH GF     TP YRGF+S FGY   +  
Sbjct: 96  LENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMED 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H++             D R N +  W   G Y+T  FT  A  +IE     +PLFLY
Sbjct: 156 YYTHVIRGFL---------DYRNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLY 206

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LA+ A +       +E P +    +  I   NRR ++ MV  LD++VG V   L+++G++
Sbjct: 207 LAYQAVYG-----PIEVPAKYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLM 261

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++++I+F +DNG    E          G+NYP RG K T++EGG +    ++   +Q+  
Sbjct: 262 DDTLILFTADNGGGVDES---------GNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTG 312

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            V   M+H  DWLPTL  AAGG                     TP    S+ DG++ W S
Sbjct: 313 TVFDGMIHAVDWLPTLTAAAGG---------------------TPV---SDRDGINLWPS 348

Query: 452 LLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
           L   +PS R  V+ N D   +     AA+R+  +KL+         DGY G       P 
Sbjct: 349 LSTASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLI---------DGYPGPFPDWYKP- 398

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
                          +Q++ ++          R +R  A  +                  
Sbjct: 399 ---------------EQVTSSL--------NTRFSRDSANQY------------------ 417

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH-EQPDLVQADPKRFNDTW 626
            LFNL +DP E+N++++ RPD+  +L   L ++++  VP +  E PD + ++P  + + W
Sbjct: 418 QLFNLKDDPNERNDLSNFRPDMVKKLAARLAWYKKQAVPPNFPETPDDL-SNPALYGNVW 476

Query: 627 SP 628
           SP
Sbjct: 477 SP 478



 Score = 43.9 bits (102), Expect = 0.29,   Method: Compositional matrix adjust.
 Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 5/81 (6%)

Query: 2   RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           R N +  W   G Y+T  FT  A  +IE     +PLFLYLA+ A +       +E P + 
Sbjct: 168 RNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQAVYG-----PIEVPAKY 222

Query: 62  INQFQYITDPNRRTYAALTKS 82
              +  I   NRR ++ +  +
Sbjct: 223 EAMYPNIKSENRRKFSGMVSA 243


>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
          Length = 300

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 146/308 (47%), Positives = 192/308 (62%), Gaps = 12/308 (3%)

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +CTPSRA+L+TGKYPI+TGMQ   I   +P G+PL E  + E  RE GY T  +GKWHLG
Sbjct: 1   MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
             +R +TP  RGF+ H GYL   + YY      Q       NGHD R +L +  D VG Y
Sbjct: 61  LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQ---NKGYNGHDFRDSLKSTHDHVGHY 117

Query: 247 ATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
            TDL T  AV+ IED   +   +PLFL L HLA HA N    ++AP E +++F+YI++  
Sbjct: 118 VTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKT 177

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
            R YAAMV +LD SVG+VI AL R+ ML+NSII+F+SDNG PT     T+      SNYP
Sbjct: 178 HRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTT-----ASNYP 232

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
            RG KN+ WEG ++  A +WS + ++   V  Q ++I D LPTL  AA G +    L++D
Sbjct: 233 LRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLD 291

Query: 424 GLDQWSSL 431
           GL+ WS+L
Sbjct: 292 GLNLWSAL 299



 Score = 69.3 bits (168), Expect = 6e-09,   Method: Compositional matrix adjust.
 Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
            R +L +  D VG Y TDL T  AV+ IED   +   +PLFL L HLA HA N    ++A
Sbjct: 103 FRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 162

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +++F+YI++   R YAA+ 
Sbjct: 163 PAEEVSRFEYISNKTHRYYAAMV 185


>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
          Length = 476

 Score =  273 bits (697), Expect = 3e-70,   Method: Compositional matrix adjust.
 Identities = 180/550 (32%), Positives = 274/550 (49%), Gaps = 84/550 (15%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LLIV    G +D+ +HGS  I TPNID LAY G+ L N Y QP+CTPSR+ LMTG+Y IH
Sbjct: 4   LLIVADDLGHHDVGYHGS-VIKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRYQIH 62

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           TG+Q   I   +P  VPL    LPE L++ GYST  +GKWHLGF++ E  P+ RGF+S++
Sbjct: 63  TGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFDSYY 122

Query: 204 GYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           GYL G   Y+ H            + G D+R +    W+  G+Y+T LF ++A  ++   
Sbjct: 123 GYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVVRKH 182

Query: 263 PV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
            V   D+PLFLY+A  A HA N     + P E +  +  I DP RR  A MV  +D++VG
Sbjct: 183 AVHQPDQPLFLYVAFQAVHAPN-----QVPNEYLKPYD-IDDPKRRLLAGMVTCMDEAVG 236

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            + +A +  G+ +N++++F +DNG    E        N G+N+P RG K++LWEGG++  
Sbjct: 237 NITAAFKDTGLWDNTVVVFTTDNGGDVNE--------NSGNNWPLRGWKHSLWEGGMRGV 288

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
             + SP ++        +MH+SDW PTL   AG +TS +   +DG D W S+     S R
Sbjct: 289 GFVNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSIS-GLDGHDIWKSIAEREDSPR 347

Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQ 499
              +  +D      L  P  R S L +  + +  +A+R+  +K++ G    G+       
Sbjct: 348 KELLHNIDP-----LYPP--RGSKLNHTYDVRVRSALRVGDFKIITGDPFTGSW------ 394

Query: 500 TRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
                VP   ++     KT       S+N+                              
Sbjct: 395 -----VPPPEYH----KKTIPDPDPRSKNV------------------------------ 415

Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
                   +LFN+  DPCE  +++ + P + + + + L  +  T VP      D  +++P
Sbjct: 416 --------WLFNVHEDPCETTDLSVALPGVLTMMLDKLLAYNATAVPCVFPDDD-PRSNP 466

Query: 620 KRFNDTWSPW 629
           +     + PW
Sbjct: 467 EEHGGAFGPW 476



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 9/83 (10%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
           +R +    W+  G+Y+T LF ++A  ++    V   D+PLFLY+A  A HA N     + 
Sbjct: 152 LRNDFEVDWNQTGKYSTQLFAEKAEDVVRKHAVHQPDQPLFLYVAFQAVHAPN-----QV 206

Query: 58  PQETINQFQYITDPNRRTYAALT 80
           P E +  +  I DP RR  A + 
Sbjct: 207 PNEYLKPYD-IDDPKRRLLAGMV 228


>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
          Length = 574

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 176/542 (32%), Positives = 280/542 (51%), Gaps = 79/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+    + AWD  G+Y+T L+ +   +++      +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+I+F +DNG  T          + GSN+P RG K T WEGGV+    + SP I++  
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL + A G+ S +P  +DG + W ++     S R   +  +D    
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A++R+  WKL+ G  + G  D    QT      L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F       ++ +L++L+          D +R +                         +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ +++  RPD+   L   L ++ RT +P  +  E P   +A P      W 
Sbjct: 469 FNITADPYERYDLSEQRPDVVRALLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525

Query: 628 PW 629
           PW
Sbjct: 526 PW 527



 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AWD  G+Y+T L+ +   +++      +P+F+Y+A  A H       L++P+E I ++
Sbjct: 201 NVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRY 255

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 256 RSMGNVARRKYAAMV 270


>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
           caballus]
          Length = 523

 Score =  272 bits (696), Expect = 4e-70,   Method: Compositional matrix adjust.
 Identities = 179/550 (32%), Positives = 273/550 (49%), Gaps = 83/550 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G +D+  H S    TP +DALA  G++L+N Y QP+C PSR+ L++G+Y IHTG+Q   I
Sbjct: 46  GXHDVGLHES-RFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQIHTGLQHQII 104

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 105 WPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 164

Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H       +  V     D R     A      Y+T +FT+ A  LI + P +KPLFL
Sbjct: 165 YYSHERCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFTERATALITNHPPEKPLFL 224

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  + H     + L+ P+E +  + +I D NR  YA MV  LD++VG V +AL+ +G+
Sbjct: 225 YLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVSLLDEAVGNVTAALKSRGL 279

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++ IF +DNG  T+           G+N+P RG K +LWEGGV+    + SP ++Q 
Sbjct: 280 WNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQK 330

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              + +++HISDWLPTL   AGG T+     +DG + W ++   +PS R   +  +D   
Sbjct: 331 GVKNRELIHISDWLPTLVKLAGGSTNGTK-PLDGFNVWKTISEGSPSPRVELLHNID--P 387

Query: 451 SLLLNTP--------SRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
           + + ++P        ++ +S L        +  AA+R  +WKL         + GY G  
Sbjct: 388 NFVDDSPCPGRSMAPAKHDSFLPEYSTFNTSIHAAIRHKNWKL---------LTGYPG-- 436

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
                                        F P S                  N + + PS
Sbjct: 437 -------------------------CGYWFPPPSQY----------------NGSAIPPS 455

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
                  +LF++  DP E+++++   P +  QL   L+++ +  VP      D  + DPK
Sbjct: 456 DPPTKTLWLFDIDRDPEERHDVSREHPHVVRQLLSRLQFYHKHSVPVYFPAQD-PRCDPK 514

Query: 621 RFNDTWSPWI 630
                W PW+
Sbjct: 515 S-TGAWGPWM 523



 Score = 53.1 bits (126), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T +FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 200 YSTSVFTERATALITNHPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRY 254

Query: 75  TYAALT 80
            YA + 
Sbjct: 255 HYAGMV 260


>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
          Length = 482

 Score =  271 bits (694), Expect = 6e-70,   Method: Compositional matrix adjust.
 Identities = 175/537 (32%), Positives = 257/537 (47%), Gaps = 93/537 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ F    ++ TPNID LA +G+ILN+ Y  PVCTPSR S MTG Y   +G+Q   I
Sbjct: 36  GWNDVGFRNP-DVLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +    PL   FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ FG+ NG   
Sbjct: 95  NPQQATCAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNGQED 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY          T+ + G    R+     +  GEY+T L+++ A  +I++    KP+++Y
Sbjct: 155 YY----------TLSVAGGKDFRDNKVPVNATGEYSTFLYSRRAESIIKEHDASKPIYMY 204

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L   + HA      L+ P    + +  +   +RRTY  MV  +D+++G +  AL+ KG+ 
Sbjct: 205 LPFQSVHA-----PLQVPNRYTDMYTNVHTKSRRTYMGMVTAMDEAIGNITKALKTKGIF 259

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            +++IIF +DNG            +  G+NYP RG K TL+EGG +  A +    IQ+  
Sbjct: 260 NDTLIIFTTDNGGAV---------KFAGNNYPLRGGKATLFEGGTRATAFVTGAGIQKFN 310

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
                ++H  DW+PT+ +AAGG        +DG+DQW SL     S+RN  I  +D +  
Sbjct: 311 YEYSGLIHAVDWMPTVLSAAGGTADS---KLDGIDQWDSLRTARESKRNEFIYNIDDFRP 367

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
                            E    AA+R+  +KL+         DGY G  +    P     
Sbjct: 368 -----------------EIGGHAAIRIGDYKLI---------DGYPGLHQGWYKP----- 396

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
              +   Y   + +S N   P    DKM                             LFN
Sbjct: 397 ---DHVGYDLNEDVSTNCMRPF--FDKMM----------------------------LFN 423

Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           L +DP E  +++   PDI  +L   L+Y+   +VP +  +     A P  +   WSP
Sbjct: 424 LQDDPNEHVDLSRKHPDIVKKLKSRLEYYHNQIVPANFPKNSASSA-PTSYGGFWSP 479



 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 19/73 (26%), Positives = 38/73 (52%), Gaps = 5/73 (6%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           +  GEY+T L+++ A  +I++    KP+++YL   + HA      L+ P    + +  + 
Sbjct: 174 NATGEYSTFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVH 228

Query: 70  DPNRRTYAALTKS 82
             +RRTY  +  +
Sbjct: 229 TKSRRTYMGMVTA 241


>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
          Length = 562

 Score =  271 bits (692), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 181/561 (32%), Positives = 263/561 (46%), Gaps = 111/561 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +HGS EI TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 41  GYNDIGYHGS-EIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P     LPE L+E GYST  +GKWHLGF   E  P  RGF+S  G L G   
Sbjct: 100 RARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQSFLGSLTGSGD 159

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++    S Q     E  G D+      AW+  G Y+T L+T+    ++      KPLFLY
Sbjct: 160 HF----SFQSCDGTEACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRHDHRKPLFLY 215

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L+AP   + ++Q + +  RR YAAMV  +D+SVG ++S L+ +G  
Sbjct: 216 VALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIVSELRERGYY 270

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS++I+ SDNG   +         + G N+P RG K + WEGGV+    + SP +++  
Sbjct: 271 DNSVLIYSSDNGGQPL---------SGGCNWPLRGGKGSYWEGGVRAVGFVHSPLLKRKG 321

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            VS  ++HISDW PTL + AG   S                       ++++DG D W +
Sbjct: 322 VVSHALIHISDWYPTLLSLAGYRES----------------------DSNHVDGQDVWET 359

Query: 452 LLLNTPSRRNSVLINIDEKKR----------------------TAAVRLDSWKLVLGTQE 489
           +    P  R  +L NID   R                       AA+R   WKL+ G   
Sbjct: 360 ISSGLPCPRTEILFNIDPVSRRHGDINPKLLTLNGFGIWDTGVRAAIRAGDWKLLTGNIG 419

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
           +G  D +  QT                + +Q ++              K R  R ++   
Sbjct: 420 DG--DWFPPQTMPGG-----------PQQWQGME--------------KRRDQRGKSV-- 450

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
                             +LFN+  DP E+ ++A +RP++   L   L  + RT V    
Sbjct: 451 ------------------WLFNITADPYERADLAEARPEVVKVLLTRLSEYNRTAV-SPR 491

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
             PD   ADP+     W+PW+
Sbjct: 492 NPPDDPMADPQLHGGVWTPWL 512



 Score = 47.4 bits (111), Expect = 0.026,   Method: Compositional matrix adjust.
 Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 5/76 (6%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
           AW+  G Y+T L+T+    ++      KPLFLY+A  A H       L+AP   + ++Q 
Sbjct: 183 AWELRGNYSTRLYTERVKDILRRHDHRKPLFLYVALQAVHT-----PLQAPGHLLRRYQA 237

Query: 68  ITDPNRRTYAALTKST 83
           + +  RR YAA+    
Sbjct: 238 LGNRPRRHYAAMVSGV 253


>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  270 bits (691), Expect = 1e-69,   Method: Compositional matrix adjust.
 Identities = 180/564 (31%), Positives = 274/564 (48%), Gaps = 88/564 (15%)

Query: 75  TYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
           ++  ++ S T  + I+   YG++D+ +H S  I TPN+D LA  G+ L N Y QP CTP+
Sbjct: 5   SWRYISSSPTHIVFILADDYGFHDIGYHDS-IIKTPNLDRLASEGVKLENYYVQPKCTPT 63

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           R+ LM+G+Y IHTG+Q   IW  +P  +P+ E  +P+ L+E GY+T  +GKWHLG +++E
Sbjct: 64  RSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKE 123

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDL 250
             P  RGF++ FGYL G   YY H  S  Y +    +G D R N+        G+Y+T +
Sbjct: 124 CLPTERGFDTFFGYLTGSEDYYTHNRS--YDK---FHGMDFRENMQIVQPQYNGQYSTHV 178

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           F ++A  +I+      PLFLYL   A H       L+ P +    +  IT+  RRTYA M
Sbjct: 179 FAEKAKNIIKSHDPQIPLFLYLPLHAVHGP-----LQVPDQYEKPYTNITNKQRRTYAGM 233

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V  +D++VG +   L+  G+ +NS+IIF +DNG               G+N+P RG K +
Sbjct: 234 VSCMDEAVGNITQTLEDAGLYDNSVIIFSTDNGGQVTA---------GGNNWPLRGWKGS 284

Query: 371 LWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
           LWEGG+     + SP I    R   S +++H++DW PT+   AGG  S            
Sbjct: 285 LWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVSG----------- 333

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
                         +DG +QW ++     S R  +L NID K+ T               
Sbjct: 334 -----------TQPLDGYNQWETISNGKKSPRKELLHNIDPKRTTL-------------- 368

Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
                     QT +N       N I     + +  Q +  +        K+ +     + 
Sbjct: 369 ----------QTSNN-------NMIYRKDMFDASIQAAMRV-----GDWKILTGNPGNSS 406

Query: 549 HCGANPAPMTPSPCTNGP---CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
                 + +TP   T+ P    +LFN+  DP E N+++ SRPD  ++L E+L+ + +T V
Sbjct: 407 WIPPPESSITPPHPTDIPGKNIWLFNIAEDPNEYNDLSLSRPDKVNELLEILQQYYKTSV 466

Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
           P  H   D    DP + +  W PW
Sbjct: 467 P-VHYPDDEPACDPDKNDGAWGPW 489



 Score = 42.4 bits (98), Expect = 0.85,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y+T +F ++A  +I+      PLFLYL   A H       L+ P +    +  IT+  
Sbjct: 172 GQYSTHVFAEKAKNIIKSHDPQIPLFLYLPLHAVHGP-----LQVPDQYEKPYTNITNKQ 226

Query: 73  RRTYAALT 80
           RRTYA + 
Sbjct: 227 RRTYAGMV 234


>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
          Length = 573

 Score =  270 bits (690), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 175/542 (32%), Positives = 279/542 (51%), Gaps = 79/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+      AWD  G+Y+T L+ +   +++      +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+I+F +DNG  T          + GSN+P RG K T WEGGV+    + SP I++  
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL + A G+ S +P  +DG + W ++     S R   +  +D    
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A++R+  WKL+ G  + G  D    QT      L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F       ++ +L++L+          D +R +                         +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ +++  RPD+   L   L ++ +T +P  +  E P   +A P      W 
Sbjct: 469 FNITADPYERYDLSDQRPDVVRTLLMRLVHYNQTAIPVRYPAENP---RAHPDFNGGAWG 525

Query: 628 PW 629
           PW
Sbjct: 526 PW 527



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 5/74 (6%)

Query: 7   TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
            AWD  G+Y+T L+ +   +++      +P+F+Y+A  A H       L++P+E I +++
Sbjct: 202 VAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYR 256

Query: 67  YITDPNRRTYAALT 80
            + +  RR YAA+ 
Sbjct: 257 SMGNVARRKYAAMV 270


>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
          Length = 545

 Score =  270 bits (689), Expect = 2e-69,   Method: Compositional matrix adjust.
 Identities = 168/551 (30%), Positives = 268/551 (48%), Gaps = 62/551 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I TPN+D +A  G+ILN+ Y  P+CTPSR S +TG YP   G+ G  I
Sbjct: 39  GWNDVGWH-NPDIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              + R + L    LPE L++LGYST  IGKWHLGF    YTP  RGF+S  G+  G   
Sbjct: 98  TPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQD 157

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
           YY H  +          G+D R N +  +    +Y+T  + K AV +I +    K PLFL
Sbjct: 158 YYKHTTA---------KGYDFRFNQTVFYPPKKQYSTKTYAKRAVDIITEHKRKKNPLFL 208

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  + H       L+ P++   Q+  I + +RR Y+ MV  +DD++G V++A ++   
Sbjct: 209 YLAFQSVHTP-----LQVPKKYEKQYNNIKNKDRRVYSGMVTAMDDAIGAVMAAFRKHKF 263

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQ 389
           + N I++F +DNG               G+N+P RG K T+WEGG +  + ++S   +++
Sbjct: 264 MGNLIVVFTTDNGGAA---------HIVGNNFPLRGSKTTVWEGGTRAVSFVYSKNLLKK 314

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
                  + H +DW PT+   AGG                        ++   +DG++QW
Sbjct: 315 TGYTHDGLFHATDWFPTILAVAGGK----------------------PKQEKQLDGVNQW 352

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK-VPLL 508
             +    PSRR   +  ID+ +  +A+R+  +KL++G    G  + +Y   +SN   P+ 
Sbjct: 353 PMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLIVGNP--GDYNNWYRIAKSNSFCPIY 410

Query: 509 NFNAIVESK---TYQSLQQLSQN----IFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
           + N     +    Y+SL  + +N    +    S +   R  R+            +    
Sbjct: 411 DENETDSDEDIFIYESLNNIVRNAKGYVKWLSSKLPGYRMVRKWHRYRQLERCRKLKKRD 470

Query: 562 CTN--GPCY-LFNLGNDPCEQNNIASSRPDISSQLYELLK-YHRRTLVPQSHEQPDLVQA 617
                 P Y L++L  DP E+ NIA  +P +  ++   L+ Y ++T       +  +  A
Sbjct: 471 KAKWRFPRYLLYDLEADPSERKNIAGYKPVLVRKMKARLEHYQKKTKHVAGQSKRKVAAA 530

Query: 618 DPKRFNDTWSP 628
           DPK  +  W+P
Sbjct: 531 DPKHHHGVWTP 541



 Score = 41.2 bits (95), Expect = 1.7,   Method: Compositional matrix adjust.
 Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
            R N +  +    +Y+T  + K AV +I +    K PLFLYLA  + H       L+ P+
Sbjct: 169 FRFNQTVFYPPKKQYSTKTYAKRAVDIITEHKRKKNPLFLYLAFQSVHTP-----LQVPK 223

Query: 60  ETINQFQYITDPNRRTYAALT 80
           +   Q+  I + +RR Y+ + 
Sbjct: 224 KYEKQYNNIKNKDRRVYSGMV 244


>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
          Length = 476

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 153/405 (37%), Positives = 225/405 (55%), Gaps = 33/405 (8%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGW+D+ +HGS +I TP +D LAYNGI L N Y QP+C+P+R+  M+G Y IHTG+Q   
Sbjct: 18  YGWHDIGYHGS-KIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGVYQIHTGLQHNV 76

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW A+  G+PL    + + +RE GY+T   GKWHLG+++ EY P  RGF++++GYLNG  
Sbjct: 77  IWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGFDTYYGYLNGCE 136

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKP 267
            YYD      Y       G+D R N    W+   +Y+T L+     +++ +  +   DKP
Sbjct: 137 DYYDKSYCHPYC------GYDFRLNDDIQWNLT-DYSTYLYVSRVNEILLNHKIYSPDKP 189

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFLYL   + H     + LE P+E  +++ +I D NRRTYA MV  +D++VG +    ++
Sbjct: 190 LFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAVGKIRDLFKK 244

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             M + +++ F +DNG               G+N+P RG K +LWEGG++    + SP +
Sbjct: 245 YEMWDETVLAFSTDNGG---------QIHRGGNNWPLRGWKISLWEGGMRGVGFVHSPLL 295

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             +   S  +MH+SDWLPTL   AGG+T+ + L  DG + W  L   T S R   +  +D
Sbjct: 296 PNSGGKSKGLMHVSDWLPTLVHVAGGNTTGMKL--DGYNVWDMLTKGTESPRKEILHNID 353

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
                 L  P          D + R AA+R   WKL+ G   NG+
Sbjct: 354 P-----LMEPKGHARPNSTFDTRVR-AAIRYGDWKLITGDPGNGS 392



 Score = 46.2 bits (108), Expect = 0.054,   Method: Compositional matrix adjust.
 Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 9/85 (10%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
            R N    W+   +Y+T L+     +++ +  +   DKPLFLYL   + H     + LE 
Sbjct: 152 FRLNDDIQWNLT-DYSTYLYVSRVNEILLNHKIYSPDKPLFLYLPLQSVH-----EPLEV 205

Query: 58  PQETINQFQYITDPNRRTYAALTKS 82
           P+E  +++ +I D NRRTYA +  +
Sbjct: 206 PKEYSDKYSHIKDNNRRTYAGMVAA 230


>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
          Length = 575

 Score =  269 bits (688), Expect = 3e-69,   Method: Compositional matrix adjust.
 Identities = 174/542 (32%), Positives = 279/542 (51%), Gaps = 79/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 61  GYHDIGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RGF++  G L G + 
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D      ++ G+D+      AWD  G+Y+T L+ +   +++      +P+F+Y
Sbjct: 180 YYTYDNCD----GPDVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+++F +DNG  T          + GSN+P RG K T WEGGV+    + SP I++  
Sbjct: 291 DNSVMVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL + A G+ S +   +DG D W ++     S R   +  +D    
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVQ-GLDGYDVWPAISEGKDSPRTEILHNIDP--- 397

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A++R+  WKL+ G  + G  D    QT      L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F       ++ +L++L+          D ++ +                         +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLKKS------------------------MWL 468

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ +++  RPD+   L   L ++ RT +P  +  E P   +A P      W 
Sbjct: 469 FNITADPYERYDLSEQRPDVVRTLLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525

Query: 628 PW 629
           PW
Sbjct: 526 PW 527



 Score = 47.8 bits (112), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 5/74 (6%)

Query: 7   TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
            AWD  G+Y+T L+ +   +++      +P+F+Y+A  A H       L++P+E I +++
Sbjct: 202 VAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYR 256

Query: 67  YITDPNRRTYAALT 80
            + +  RR YAA+ 
Sbjct: 257 SMGNVARRKYAAMV 270


>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
 gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
          Length = 449

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 178/523 (34%), Positives = 259/523 (49%), Gaps = 79/523 (15%)

Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
           LA  G+ L N Y QP+C+PSR  LMTG+Y I  G+Q   I    P G+PL E  LP+ LR
Sbjct: 2   LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61

Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT-VELNGH 230
           E GY +  +GKWHLGFFR+EY PL RGF+  +GYL G   Y+ H   + Y+R     +G 
Sbjct: 62  ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121

Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
           D+R       D  G Y+T LF ++A++ I      KP+FLYL   A H+      LEAP 
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHSP-----LEAPT 176

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
           + + +++ +T P  + YAAMV  +D++VG V  AL+R G+ +N+++IF +DNGA     R
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA-----R 231

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
           E     N GSN+P RG KNTLWEGGV+    + S  +++  R S  ++HISDW PTL   
Sbjct: 232 E-----NAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRI 286

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINI 467
           +    S     +DG D W ++   TPS R     NID L + ++      S  N +  +I
Sbjct: 287 SRS-VSNGTKPLDGFDIWDTISTGTPSPRKEILHNIDPLHRKTA-----SSSFNLLYDDI 340

Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQ 527
                 AA+R   WKL+       T D  +G  + +      F   V+  T         
Sbjct: 341 FNISVYAAIRSGDWKLL-------TGDQGHGGWKDSDTSGSKFEYAVDPPTKH------- 386

Query: 528 NIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP 587
                                                   +LFN+ NDP E+++++   P
Sbjct: 387 ---------------------------------------LWLFNIRNDPQERSDLSDRYP 407

Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           D+  +L + L  + +T +P      D  +A+P    D + PW+
Sbjct: 408 DVVQELLQKLAAYNKTALPPFWPSRD-PRANPALHGDLFGPWL 449



 Score = 49.3 bits (116), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 5/73 (6%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF ++A++ I      KP+FLYL   A H+      LEAP + + +++ +T
Sbjct: 132 DQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHSP-----LEAPTKYVEKYKDVT 186

Query: 70  DPNRRTYAALTKS 82
            P  + YAA+  +
Sbjct: 187 HPLIKIYAAMVTA 199


>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 451

 Score =  269 bits (687), Expect = 4e-69,   Method: Compositional matrix adjust.
 Identities = 179/510 (35%), Positives = 249/510 (48%), Gaps = 81/510 (15%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           I +GMQ   I  A P G+PL  + +P++ ++LGY    IGKWHLGFF+  YTP  RGF+S
Sbjct: 1   ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD--TVGEYATDLFTKEAVQLI 259
            FGY  G   YY+H           LN   +     T      + +Y+T  +      L+
Sbjct: 61  FFGYYTGENDYYNHTAG-------TLNRPPLYGTFKTKQTRCVLAKYSTHTWG-----LL 108

Query: 260 EDQPV-----DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           E  P+      +P F Y ++ A HA       +AP+  + +F YI + NR  YA M+  L
Sbjct: 109 EAFPLLLNLHSQPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSL 168

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D+S+G V+  L   GMLE++II F SDNG   + Y   SN    G N+P RG K TLWEG
Sbjct: 169 DESIGRVVEILGNAGMLEDTIIAFSSDNGG--LPYNAFSNR---GFNWPLRGAKGTLWEG 223

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           GV+VPA +WS +  + PRVS Q+MHISDWLPTLY+AAGGD S L                
Sbjct: 224 GVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNL---------------- 267

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
                  ++DG++ W  L L   S R  VL+ ID     A +R  ++KLV+G   +  +D
Sbjct: 268 ------GSLDGVNMWRYLTLGLGSARKEVLLTIDPMYNLAGLRYKNYKLVVGEGFDEELD 321

Query: 495 GYY----GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
             Y    G   +N VP L      +SK  + L+                +  R  + +  
Sbjct: 322 DRYAFPGGLRPTNDVPQLR----KDSKVARVLKTFYG------------KQNRNWSPLDW 365

Query: 551 GANPAP------MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
             NP+       +T +     P YLFNL  DPCE NN+A S+  I   +   L  +  T 
Sbjct: 366 SKNPSVDCRERWLTNNFVARRPPYLFNLARDPCELNNLAGSKRIILHFMMHKLAAYNATA 425

Query: 605 VPQSHEQPDLVQADPKRFND----TWSPWI 630
           VP     P  +  DP+ + +     WSPW+
Sbjct: 426 VP-----PRFLPEDPRGYPEYHGGVWSPWL 450


>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
          Length = 551

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 179/540 (33%), Positives = 273/540 (50%), Gaps = 72/540 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS +I TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 35  GFRDVGYHGS-DIKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSVI 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A+P  +PL    LP+ L+  GYST  +GKWHLGF++R   P  RGF++ FG L G   
Sbjct: 94  RAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLF 269
           YY H      S    + G+D+      AW+   G Y+T++FT++AV ++ +  P  +PLF
Sbjct: 154 YYSHYKCQGPS----MCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSILANHNPRKQPLF 209

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYLA+ A H+      L+ P   + +++ I +P RR YAAMV  LD+++  +  AL+  G
Sbjct: 210 LYLAYQAVHSP-----LQVPARYLERYKGIPNPYRRKYAAMVSCLDEAIHNLTLALKHYG 264

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             +N++I++ SDNG   +           GSN+P RG K T WEGG++    + SP +  
Sbjct: 265 YYDNTVIVYSSDNGGQPLAG---------GSNWPLRGSKATYWEGGIRTVGFVHSPLLMN 315

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
                  ++HI+DW PTL T   G T    LN+DG D W ++    PS R+  +  +D  
Sbjct: 316 KGTKCRSLIHITDWFPTLVTLGEG-TLDEDLNLDGYDVWETISEGRPSPRHDILHNIDPI 374

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                N   +    L N   +   AA+R+  WKL+ G      + GY     S+ VP   
Sbjct: 375 YIKAKNGSWKAGYGLWNTAIQ---AALRVGHWKLLTG------VPGY-----SDWVP--- 417

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                                 P + +++  +TR    +H    P     S       +L
Sbjct: 418 ----------------------PQTFLNQRLTTR----LHNERVPWDRGKS------IWL 445

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           FN+  DP E+ +++   P I  ++   L  + +T VP  +   DL +++P+     W PW
Sbjct: 446 FNITADPYERVDLSQRYPHIVKKMLMRLAQYNKTAVPVCYPTKDL-RSNPQYNGGVWGPW 504



 Score = 51.6 bits (122), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 7/76 (9%)

Query: 7   TAWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
            AW+   G Y+T++FT++AV ++ +  P  +PLFLYLA+ A H+      L+ P   + +
Sbjct: 176 AAWEQDRGLYSTEMFTQKAVSILANHNPRKQPLFLYLAYQAVHSP-----LQVPARYLER 230

Query: 65  FQYITDPNRRTYAALT 80
           ++ I +P RR YAA+ 
Sbjct: 231 YKGIPNPYRRKYAAMV 246


>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 517

 Score =  268 bits (686), Expect = 5e-69,   Method: Compositional matrix adjust.
 Identities = 179/561 (31%), Positives = 269/561 (47%), Gaps = 71/561 (12%)

Query: 77  AALTKSTTLTLLIV-YGWNDLSFHG---SNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           A LTK   + +L   YG+ND+ +H     +++ TP +D+LA  G+ L N Y QP+C+PSR
Sbjct: 19  AELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSLAMAGVRLENYYIQPICSPSR 78

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           + LM+G+Y IHTG+Q   I   +  G+PL    LPE L + GY T  +GKWHLGF++ EY
Sbjct: 79  SVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHKCGYDTHMVGKWHLGFYKDEY 138

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
            P  RGF S+FGYL G   YY     D      +L G+DM         T G+Y+ +LF 
Sbjct: 139 LPWKRGFNSYFGYLTGGEDYYTKWRCDG-----KLCGYDMTSEKGPTNATYGQYSANLFA 193

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
            +A + I+     KPLFLY+A  + H+      +E P+     F YI + NR+ Y  MV 
Sbjct: 194 NKANEAIDKHDKTKPLFLYVAFQSVHSP-----MEVPESYAKPFDYIKNHNRKMYGGMVA 248

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            +D++V  +   LQ  G+ +N+I++F +DNG  T+         + G+N+P RG K TLW
Sbjct: 249 AMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTL---------SGGNNWPLRGRKLTLW 299

Query: 373 EGGVKVPAILWSPQIQ-QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           EGG+K    +    +   NP   V+ +M+HISDW PT+  A           +DG++QW 
Sbjct: 300 EGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEATQCPYVEGTQKLDGVNQWD 359

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
           ++   T S R   +  +D   + L    + + ++ +  D     A +R   WKL+ G Q 
Sbjct: 360 TIFGKTVSARKEILLNIDPLFTPLFYKETPKQNIQMGFDTSVH-AGIRYGEWKLLTGLQ- 417

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
                   G  R  + P        E K                   +K    +Q+ T H
Sbjct: 418 --------GDDRWIEPP--------EWK-------------------NKAEEEQQEDTEH 442

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
            G                 L+N+ NDP EQ  ++   P + + L   L  +  T VP ++
Sbjct: 443 PGF-------QKFATKSVQLYNIVNDPYEQVEVSDLYPSVVNDLLSKLAAYNATAVPVNY 495

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
              D   +DP      W PW+
Sbjct: 496 PPSDPT-SDPSLHGGFWGPWV 515



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 5/72 (6%)

Query: 11  TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
           T G+Y+ +LF  +A + I+     KPLFLY+A  + H+      +E P+     F YI +
Sbjct: 183 TYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHSP-----MEVPESYAKPFDYIKN 237

Query: 71  PNRRTYAALTKS 82
            NR+ Y  +  +
Sbjct: 238 HNRKMYGGMVAA 249


>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
          Length = 481

 Score =  268 bits (684), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 172/502 (34%), Positives = 258/502 (51%), Gaps = 75/502 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 46  GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 104

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 105 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 164

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++      +PLFLY
Sbjct: 165 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 220

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 221 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 275

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 276 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 326

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  +MHI+DW PTL   AGG TS     +DG + W ++     S R   +  +D    
Sbjct: 327 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYNVWPAISEGRASPRTEILHNIDP--- 382

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 383 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 427

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 428 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 452

Query: 570 FNLGNDPCEQNNIASSRPDISS 591
           FN+   P E+ ++A  RPD+S+
Sbjct: 453 FNISAXPYEREDLAGQRPDVSA 474



 Score = 47.8 bits (112), Expect = 0.019,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 186 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 240

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 241 RTMGNVARRKYAAMV 255


>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 514

 Score =  268 bits (684), Expect = 8e-69,   Method: Compositional matrix adjust.
 Identities = 181/546 (33%), Positives = 277/546 (50%), Gaps = 93/546 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H ++ +PTPN++ LA  G+IL+NMY+QP+CTPSR +LMTGKYP   GMQ   +
Sbjct: 50  GWNDVGWH-NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPAKVGMQHFVV 108

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P  +P     L E L+E GY+   +GKWHLG    +YTP++RGF+SH+G   GV S
Sbjct: 109 LPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSHYGCHEGVTS 168

Query: 212 YYD-HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            ++ H+L+  +   V ++G D+R N        G + T LF++ A +++++   + PLFL
Sbjct: 169 NFETHMLT--WPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVKNHNPESPLFL 226

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+ ++A H       L+APQ      Q + D +RRT+A M+  +D  VG +  AL+ KGM
Sbjct: 227 YVPYMAPHF-----PLQAPQGFEEAVQ-LDDTDRRTFAGMMAAMDQGVGNITQALKDKGM 280

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            ++++ +F+SDNG   +           GSNYPYRG K T+WEGGVKVPA +    ++ +
Sbjct: 281 WDDTLFVFLSDNGGDKMFA---------GSNYPYRGNKATMWEGGVKVPAFIRGSMLKTS 331

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
             V+ ++ H +D   TL   AGG       +IDG +QW                      
Sbjct: 332 GTVNNELYHFTDIFATLLKVAGGTPDE---DIDGKNQW---------------------D 367

Query: 451 SLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
           S     PS+R+ +L+++DE   T  AA+R+  +K + G   N              VP+L
Sbjct: 368 SFCKGKPSKRSEILLHLDEHPVTNGAALRMGDYKYMEGVTIN--------------VPML 413

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS--PCTNGP 566
                                 LP ++I+        A      NP   TP   P T   
Sbjct: 414 ----------------------LPRNDIEHFDQWFTPA-----ENPDASTPDVPPITENV 446

Query: 567 C--YLFNLGNDPCEQNNIAS--SRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
              YLFN+ +DP E NN+ +  ++ +I +++ + L   R ++ P      D    +P  F
Sbjct: 447 NNRYLFNIKDDPLETNNLYNDPAKQEIINEMKQRLDEFRTSVPPFWFPGED-ESGNPDNF 505

Query: 623 NDTWSP 628
              WSP
Sbjct: 506 GGAWSP 511



 Score = 39.7 bits (91), Expect = 4.9,   Method: Compositional matrix adjust.
 Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 6/82 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N        G + T LF++ A +++++   + PLFLY+ ++A H       L+APQ 
Sbjct: 188 LRDNTGLVTHENGTHNTMLFSERAERIVKNHNPESPLFLYVPYMAPHF-----PLQAPQG 242

Query: 61  TINQFQYITDPNRRTYAALTKS 82
                Q + D +RRT+A +  +
Sbjct: 243 FEEAVQ-LDDTDRRTFAGMMAA 263


>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
          Length = 631

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 178/539 (33%), Positives = 271/539 (50%), Gaps = 70/539 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM-QGPP 150
           G++D+SFHG+ +IPTPN+DA+A +G+ILN  YA    TPSR +  TGK P+  G+ +GP 
Sbjct: 53  GYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKLPLRIGLNEGPI 112

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           + G    G+ L    LP Y+R+LGY T  IG+W LGF++    P  RGF++H+G  +   
Sbjct: 113 LKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGFDTHYGPYSDSA 172

Query: 211 SYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           SY  H+  +   +++ +   +D+ R+    +   G Y TDL+     ++IE +   KPLF
Sbjct: 173 SYSSHLSREDAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFERIIEQR--RKPLF 230

Query: 270 LYLAHLAAHAGNAGKHLEAPQET--INQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           + L+H   H  + G   + P  T   +QF +I D +RR+YA  V+ LDDS   ++  LQR
Sbjct: 231 IVLSHQTPHGASFGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESLDDSFAHMVDVLQR 290

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             +L  S+I+F SDNG  T    +   ++   SN+P RG KNT WEGGV+V + +WS QI
Sbjct: 291 NNILNESVILFSSDNGGTTF---KDPYFKTGASNWPLRGQKNTQWEGGVRVSSFVWSSQI 347

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
            + P VS ++ H  DWLPT     GGD   L  +IDG D W S++  T            
Sbjct: 348 TE-PYVSEELYHFVDWLPTFRRLGGGDIGDLK-DIDGFDIWDSIVNRT------------ 393

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
                   T S R  +L+  D   R   V    +  ++G  +      + G+ ++N    
Sbjct: 394 --------TASPRTKILLAYD---RDFTVDSPGFAAIIGKFKY-VSGKWSGRDQTNHYVP 441

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDK---MRSTRQQATIHC--GAN--------- 553
            +F    +S+  + L+  S+ ++  + +I +       R+QA I C  GA+         
Sbjct: 442 ASFKYSEDSQVLERLRNDSK-VWKNLRDIGRPGPAADWREQARIKCSWGAHLTCNGVSCD 500

Query: 554 ----------------PAPMTPSPCTN---GPCYLFNLGNDPCEQNNIASSRPDISSQL 593
                             P+  S   +    PC LF+L NDPCE  NIA+  P I   +
Sbjct: 501 KIMETFRGSRGNVEIRVEPVNVSHACDFAHEPC-LFDLENDPCETTNIANDMPQIRDDI 558



 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 4/84 (4%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           + R+    +   G Y TDL+     ++IE +   KPLF+ L+H   H  + G   + P  
Sbjct: 195 LHRDGKPDFSGFGSYVTDLYKGRFERIIEQR--RKPLFIVLSHQTPHGASFGPLHQPPPR 252

Query: 61  T--INQFQYITDPNRRTYAALTKS 82
           T   +QF +I D +RR+YA   +S
Sbjct: 253 TNRASQFLHIKDRSRRSYARTVES 276


>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 437

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 142/400 (35%), Positives = 223/400 (55%), Gaps = 34/400 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS +IP PNIDALA +G+ILN  YAQP    SR  L+TG YP  TG+ G  +
Sbjct: 33  GWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIYPYRTGV-GRVM 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P   R LP + + LGY T  +G W+LGF+++ +TP+ RGF++ +    G   
Sbjct: 92  LPCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGD 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA--VQLIEDQPVDKPLF 269
           Y+ H +        ++ G D+R N    W+  G Y+T LFT+ A   + +    + +PL 
Sbjct: 152 YWTHDMQ------TKMQGFDLRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFVLHQPLL 205

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           L L+H A H  N    L+ P E ++ F +I D  RR +A  + ++D S+G +  AL  + 
Sbjct: 206 LILSHQALHTANYHGSLQCPLEHLDSFGFIRDRKRRIFAGALTEVDQSLGQIFEALHNRR 265

Query: 330 MLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           +L N+I++F+S +G  P  ++    ++     N+P RG K T +EGGV+VPA +WSP + 
Sbjct: 266 LLNNTILVFVSASGGMPVGDFTNNLSF-----NFPLRGTKGTCFEGGVRVPAFVWSPLLN 320

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +  R S Q+MH++DWLPTL++AAGGD  +L    D                   +D ++ 
Sbjct: 321 KTRRTSSQLMHVTDWLPTLFSAAGGDIKKLGAVFD-------------------LDSVNM 361

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
           W +L  +  S R  ++ NID      A++  ++K++  T+
Sbjct: 362 WEALSGDLQSPRTEIIHNIDLAGHNFALQEGNYKVIAVTK 401



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 3/84 (3%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEA--VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 58
           +R N    W+  G Y+T LFT+ A   + +    + +PL L L+H A H  N    L+ P
Sbjct: 166 LRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFVLHQPLLLILSHQALHTANYHGSLQCP 225

Query: 59  QETINQFQYITDPNRRTYA-ALTK 81
            E ++ F +I D  RR +A ALT+
Sbjct: 226 LEHLDSFGFIRDRKRRIFAGALTE 249


>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 673

 Score =  266 bits (681), Expect = 2e-68,   Method: Compositional matrix adjust.
 Identities = 149/403 (36%), Positives = 232/403 (57%), Gaps = 28/403 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H S++I TP +D LA  G+ L N Y QP+CTPSR+  +TG+Y IHTG+Q   I
Sbjct: 59  GFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P  +  LP+ L++LGYST  +GKWHLGF+++E  P  RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
           YY +   D      EL G D+    S AWD  G+Y+T LFT+   +++    P  +PLF+
Sbjct: 178 YYTYSSCD----GPELCGFDLHEGESVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFI 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H+      L++P+E I  +  + +  RR YAAMV  +D++V  +  AL++ G 
Sbjct: 234 FLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAAMVSTVDEAVRNITYALRKYGY 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NS++IF +DNG   +         + GSN+P RG K T WEGG++    + SP +++ 
Sbjct: 289 YQNSVLIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGIRGLGFVHSPLLKKR 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGG+ S +   +DG + W ++     S R   +  +D   
Sbjct: 340 KRVSKALVHITDWYPTLVGLAGGNES-MTDGLDGYNVWEAISEGKESPRLEILHNIDP-- 396

Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENG 491
              L   +R  S+     I      A++R+  WKL+ G   +G
Sbjct: 397 ---LFNHARSGSLQQGFGIWNTAVQASIRMGDWKLLTGNPGHG 436



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 6/79 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AWD  G+Y+T LFT+   +++    P  +PLF++L+  A H+      L++P+E I  
Sbjct: 199 SVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICP 253

Query: 65  FQYITDPNRRTYAALTKST 83
           +  + +  RR YAA+  + 
Sbjct: 254 YSGLENVARRKYAAMVSTV 272



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 1/64 (1%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LFN+  DP E+ +++  RPD+  QL   L Y+ RT VP      D  +ADP      W
Sbjct: 464 VWLFNISEDPYERFDLSEQRPDVVKQLLARLAYYNRTAVPVRFPSED-PRADPHLNGGAW 522

Query: 627 SPWI 630
            PW+
Sbjct: 523 VPWV 526


>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
 gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
          Length = 494

 Score =  265 bits (678), Expect = 4e-68,   Method: Compositional matrix adjust.
 Identities = 172/542 (31%), Positives = 253/542 (46%), Gaps = 89/542 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + ++ TP +D LA+ G+ILN  Y   VCTPSR + MTG +P H G Q    
Sbjct: 34  GWNDVGWH-NPDVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVF 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +     FLPE L+ LGY+T  +GKWHLGF   ++TP +RGF+S++GY +G   
Sbjct: 93  RPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGFDSYYGYYSGAED 152

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  S +        G D   N     D  G Y+  LF++ A  ++     + PLFLY
Sbjct: 153 YFTHFRSIRNGT----GGIDFHDNKDVVTDQNGTYSAYLFSQRAADIVNKHDPNTPLFLY 208

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L     HA      LE P+   + +  + D NRR    MV  LD++VG V  A+++ G+ 
Sbjct: 209 LPFQNVHA-----PLEVPKRFEDMYANVQDENRRKLLGMVSALDEAVGNVTMAMKKTGLW 263

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+++IF +DNG   ++          G+NYP RG K TLWEGG +  A      +Q+  
Sbjct: 264 DNTLLIFSTDNGGMVIQS---------GNNYPLRGGKTTLWEGGTRGVAFAHGKMLQKTG 314

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             + +M+H  DW PTL  AAGG                          +  +DG+ Q  +
Sbjct: 315 YTNNEMIHAVDWFPTLLAAAGGTA------------------------DPGMDGVSQLDT 350

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           +L   PS R   + NID+   T                                      
Sbjct: 351 ILSGKPSPRTEFVYNIDDIFPT-------------------------------------- 372

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP-MTPS-PCTNGPC-- 567
             V+  T+       Q    P  +++  +     A I  G  P P MT S P +  P   
Sbjct: 373 --VQDTTFAGGGHKYQKSMSPPVSVEYYKLIEGFAGIFNGWYPPPNMTASHPNSGDPING 430

Query: 568 -YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LFNL +DP E +N+A S PD   ++   L  +R+TLVP  + + D  ++DPK +   W
Sbjct: 431 TWLFNLKDDPTEHHNLADSMPDKLKEMQAKLDGYRKTLVPAINPKSD-PKSDPKNWGGAW 489

Query: 627 SP 628
           SP
Sbjct: 490 SP 491


>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  265 bits (676), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 173/543 (31%), Positives = 267/543 (49%), Gaps = 82/543 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +HGS  I TP+IDALA  G+ L+N Y   +CTPSR+ LMTG+Y IHTG+Q   I
Sbjct: 33  GWSDVGYHGS-VIKTPHIDALASEGVKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTI 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P+ E  LP+ L++ GY+T  +GKWHLGF+R+E  P  RGF++  G+   +  
Sbjct: 92  DMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGD 151

Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H +S     T + NG D RR N   A    G+Y+T +F  EA  +I     D PLFL
Sbjct: 152 YYYHNVS-----TGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDIISKHNPDVPLFL 206

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYIT--DPNRRTYAAMVKKLDDSVGTVISALQRK 328
           +L+  A H       L+ P    + +  +     +RRTYA MV  +D++VG V++ L+  
Sbjct: 207 FLSFQAIHFP-----LQVPSRYADIYNTLIPNSADRRTYAGMVTCMDEAVGGVVATLKHS 261

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+ ENS+++F +DNG             + G N+P RG K +L+EGGV+  A + SP + 
Sbjct: 262 GLWENSVLVFSTDNGG----------IHSLGGNWPLRGGKASLYEGGVRGVAFVTSPLLP 311

Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
              R  ++ +++H++DW PTL   AGG                          N  +DG 
Sbjct: 312 SRVRGTINKELLHMTDWFPTLVRLAGGHL----------------------MGNKRLDGY 349

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           +QW ++   TPS RN +L+ I+  KR               + N  +D  Y  T S  + 
Sbjct: 350 NQWDTISQETPSLRNEILLTINPLKR---------------KPNHFVDDPYADTDSFDIL 394

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
           +    A +    ++ +   + N            S   +   H                 
Sbjct: 395 M---RAAIRVGDWKLITGPTGNGSWVTPAEFGFNSLHPEERTHS---------------- 435

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
             L+N+  DP E+++ +  RPDI   L + + ++ +T VP  +   ++ +ADP      W
Sbjct: 436 VELYNVIADPNERHDYSRDRPDIVGALLDRIGHYYKTSVPVLNPPQEVTRADPALHGGAW 495

Query: 627 SPW 629
           SPW
Sbjct: 496 SPW 498


>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 588

 Score =  264 bits (675), Expect = 8e-68,   Method: Compositional matrix adjust.
 Identities = 186/547 (34%), Positives = 269/547 (49%), Gaps = 69/547 (12%)

Query: 92  GWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
           G+ND+ +H   G + I TPNID +AY+G+ L N Y QPVCTP+R+ L+TG+Y IHTGMQ 
Sbjct: 105 GYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLITGRYQIHTGMQH 164

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             ++   P  +PL E  L + L++ GYST A+GKWHLG+  ++  P  RGFES FG + G
Sbjct: 165 LNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGNIMG 224

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              ++ H  +  +   + + G  M  N    W   G ++T L+T  A QLI  QP +KPL
Sbjct: 225 SADHWSHNKTALFGDKLVM-GKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPL 283

Query: 269 FLYLAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           FLYL++ A H   N  +    P E I     I +  RR YA +V  LD++V  V  AL+ 
Sbjct: 284 FLYLSYEAVHTPLNVPEQYAKPYEGI-----IHNSKRRRYAGLVNILDEAVRNVTEALKY 338

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+ +NS+IIF +DNG            R+ G+N+P RG K+TLWEGG++    + SP I
Sbjct: 339 NGLYDNSVIIFTTDNGG-------RPKPRSVGNNWPLRGGKSTLWEGGIRGVGFVHSPLI 391

Query: 388 QQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
               R  V+ Q++H+SDW PT+     G                 L+ N P      +DG
Sbjct: 392 PWELRGTVNRQLIHVSDWFPTIVXGIAG---------------GKLVTNKP------LDG 430

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
             QW ++   T S R+ +L NID     A      W     T+EN    G          
Sbjct: 431 XHQWKTISKGTESNRHEILHNIDPIYPAA-----HW-----TRENERDFGALSN------ 474

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
             L FNA + +       +LS    LP  +  +     +         P  M     +  
Sbjct: 475 --LPFNATMRASIRVGNWKLSTG--LPHEDFWEPPKESEM--------PPEMNDIRWST- 521

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
           P  L+N+  DP E+ N+A  +  I  +L + L+ ++ T V   H  P   + +PK     
Sbjct: 522 PVRLYNIKKDPNERQNMAPYQKKIVYRLLKRLQDYQNTAVTPIHLGPKDERGNPKYHGGA 581

Query: 626 WSPWIYR 632
           W PW+ R
Sbjct: 582 WVPWMDR 588



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 6/81 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQ 59
           M  N    W   G ++T L+T  A QLI  QP +KPLFLYL++ A H   N  +    P 
Sbjct: 247 MYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPLFLYLSYEAVHTPLNVPEQYAKPY 306

Query: 60  ETINQFQYITDPNRRTYAALT 80
           E I     I +  RR YA L 
Sbjct: 307 EGI-----IHNSKRRRYAGLV 322


>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 527

 Score =  263 bits (672), Expect = 2e-67,   Method: Compositional matrix adjust.
 Identities = 151/398 (37%), Positives = 227/398 (57%), Gaps = 28/398 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P  +  LP+ L+ELGYST  +GKWHLGF+++E  P  RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D       + G D+    S AW   G+Y+T L+T+   +++    P  +PLF+
Sbjct: 178 YYTYDSCD----GPGVCGFDLHEGESVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFI 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H       L+ PQE I  ++ + +  RR YAAMV  +D++V  +   L++ G 
Sbjct: 234 FLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGY 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +NSIIIF +DNG   +         + GSN+P RG K T WEGGV+    + SP +++ 
Sbjct: 289 YQNSIIIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLRKK 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGGD S +   +DG D W ++     S R   +  +D   
Sbjct: 340 RRVSKALVHITDWYPTLVGLAGGDESHIE-GVDGYDVWEAISDGKESPRLEILHNIDP-- 396

Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLG 486
              L   +R  S+     I      A++R   WKL+ G
Sbjct: 397 ---LYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG 431



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LFN+  DP E+ +++  RPD+  QL   L Y+ RT VP  +   DL +ADP      W
Sbjct: 464 VWLFNISGDPYERFDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDL-RADPHLNGGAW 522

Query: 627 SPWI 630
            PW+
Sbjct: 523 VPWV 526



 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 7/101 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AW   G+Y+T L+T+   +++    P  +PLF++L+  A H       L+ PQE I  
Sbjct: 199 SVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFIFLSFQAVHT-----PLQCPQEYIYP 253

Query: 65  FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFHGSNEI 104
           ++ + +  RR YAA+  +    +  I YG     ++ ++ I
Sbjct: 254 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYYQNSII 294


>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  263 bits (671), Expect = 3e-67,   Method: Compositional matrix adjust.
 Identities = 178/560 (31%), Positives = 277/560 (49%), Gaps = 104/560 (18%)

Query: 82  STTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGK 139
           S  + L ++   GWND S+ GS +I TPNID LA  GI L   Y QP+C+PSR++L+ GK
Sbjct: 23  SPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGK 81

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           YP H G+    I    P G+ L E  + ++L++ GYST A+GKW LG  + E+TP YRGF
Sbjct: 82  YPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGF 141

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           ++ +GY +    YY H          ++ G+ D R N     D  G Y+T LFTK A++ 
Sbjct: 142 DTFYGYYDADEDYYTH----------KVGGYLDFRNNTDPVKDEDGTYSTFLFTK-AIED 190

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
             +   D P F+Y A+ + H+      LEAP   + +  +   PNR+ +  M+  LD+ +
Sbjct: 191 AINAKSDSPFFIYGAYQSVHS-----PLEAPDTYLEKC-HSPYPNRKIFCGMMLALDEGI 244

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             + S LQ KG+L+++III  +DNG  T             SN+P RG K T++EGGV+ 
Sbjct: 245 SNITSLLQTKGLLDDTIIILTTDNGGQTALGS---------SNWPLRGNKATVFEGGVRG 295

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
            + +WS +++++   +  MMHI+DW PT+     G T    L+  GL             
Sbjct: 296 ISFVWSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMT----LDTKGL------------- 338

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK---RT-----AAVRLDSWKLVLGTQEN 490
                DG + W ++  +TPS R  +L  +D  K   RT     AA+R   WKL++G    
Sbjct: 339 -----DGFNMWDTINDDTPSPRTEILHQLDPPKYDERTQFIGQAALRQGDWKLIIGQPNC 393

Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
               G +       +  ++ N  +E                                   
Sbjct: 394 SEKLGPHTVYDLCPIGWIHLNGTIEEPE-------------------------------- 421

Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
             NP+            +LFN+ +DP E+N +++S P+I S+L EL++ +  T + Q  +
Sbjct: 422 -VNPS----------LTWLFNVKDDPNERNELSASHPEIVSKLKELIEAYNATHIVQM-D 469

Query: 611 QPDLVQADPKRFNDTWSPWI 630
            P   ++DPK+F + W+PW+
Sbjct: 470 PPIDPRSDPKKFGNIWTPWL 489


>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 1410

 Score =  262 bits (670), Expect = 4e-67,   Method: Compositional matrix adjust.
 Identities = 145/419 (34%), Positives = 232/419 (55%), Gaps = 66/419 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +HGS+ I TP++D LA  G  L N Y   +C+PSR   +TGK+ IH GM+G  I
Sbjct: 38  GWNDVGYHGSS-ISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMIHLGMEGGII 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E + +P+ E  + + L+   YST AIGKWH+G++++   P  RGF++ FG + G   
Sbjct: 97  MPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTFFGIIGGCAD 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           +Y H    + +   EL     R N+S A +  G Y+TDL+ +EA  +I +    KP+F+Y
Sbjct: 157 HYTH----KNTHWWEL----YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMY 208

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LA  AAH       L+AP++ I+ +  I DP+RR YAAMV  +DD++G V+  L+  G+ 
Sbjct: 209 LAFQAAHLP-----LQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQLREAGLW 263

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            N+++I  +DNG   +           G+N+P +G K +LWEGGV+  A + SP +  + 
Sbjct: 264 NNTVLIVSTDNGGAKIS----------GNNWPLKGSKASLWEGGVRGVAFVTSPFLADHV 313

Query: 392 R--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           R   + Q+MH++DW PT+   AG               +S ++      ++  +DG++QW
Sbjct: 314 RGTSNHQLMHVTDWFPTMLHVAG--------------VYSDVI------KSMTLDGVNQW 353

Query: 450 SSLLLNTPSRRNSVLINIDEKKRT--------------------AAVRLDSWKLVLGTQ 488
           +++  N  + RN VLIN++  ++                     +A+RL +WKL+ G+Q
Sbjct: 354 NTISENRQTERNDVLINLNVNRKVKHVHYEEPWLDNEHFDVRIKSALRLGNWKLLTGSQ 412



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 5/79 (6%)

Query: 2   RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           R N+S A +  G Y+TDL+ +EA  +I +    KP+F+YLA  AAH       L+AP++ 
Sbjct: 170 RNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMYLAFQAAHLP-----LQAPRKY 224

Query: 62  INQFQYITDPNRRTYAALT 80
           I+ +  I DP+RR YAA+ 
Sbjct: 225 IDMYSNIEDPDRRVYAAMV 243


>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
          Length = 572

 Score =  262 bits (669), Expect = 5e-67,   Method: Compositional matrix adjust.
 Identities = 173/542 (31%), Positives = 268/542 (49%), Gaps = 74/542 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS ++ TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 55  GYADIGYHGS-DVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 113

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P     LPE L + GY T  +GKWHLGF R    P  RGF+S FG L G   
Sbjct: 114 RPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQSFFGTLTGSGD 173

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++ +    Q     E  G D+      AW+  G Y+T L+ +   Q++ +   + PLFLY
Sbjct: 174 HFSY----QSCDGAEACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRNHDPNTPLFLY 229

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+  AAH       L+ P   +N ++   + +RR YAAM+  LDD V  V+  L+  G+ 
Sbjct: 230 LSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQVVKELKTTGIY 284

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NSI+I+ SDNG   +         + GSN+P RG K + WEGG++    + SP +++  
Sbjct: 285 DNSILIYSSDNGGQPL---------SGGSNWPLRGGKGSYWEGGIRAVGFVHSPLLKKKG 335

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            +S +M+H+SDW PTL   AG   S     +DG D W S+   +PS R+  +  +D   S
Sbjct: 336 VISKEMIHVSDWYPTLLGLAGALQSH--NGLDGHDVWESISEGSPSPRSEILFNIDP-VS 392

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                P  +  VL    I +    AA+R+  W L+ G   +G           + +PL +
Sbjct: 393 RRPGEPYEKALVLNGFGIWDTAVRAALRVGDWNLLTGNVGDG-----------DWIPLQS 441

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                E   +Q L++               R  +   ++                   +L
Sbjct: 442 LTGGPER--WQKLEK---------------RRNKMGKSV-------------------WL 465

Query: 570 FNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           FN+  DP E++++A   P++   L   L +Y++  ++  ++  PD   ADP+     WSP
Sbjct: 466 FNVSGDPYERSDLAELHPEVVKHLLTRLAEYNQTAMI--ANNPPDDPMADPELHGGVWSP 523

Query: 629 WI 630
           W+
Sbjct: 524 WL 525



 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
           AW+  G Y+T L+ +   Q++ +   + PLFLYL+  AAH       L+ P   +N ++ 
Sbjct: 197 AWEMRGNYSTLLYIERVKQILRNHDPNTPLFLYLSLQAAHT-----PLQVPAHFLNPYKS 251

Query: 68  ITDPNRRTYAAL 79
             + +RR YAA+
Sbjct: 252 QNNRHRRHYAAM 263


>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 507

 Score =  261 bits (668), Expect = 6e-67,   Method: Compositional matrix adjust.
 Identities = 176/550 (32%), Positives = 262/550 (47%), Gaps = 95/550 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +H S  I TPNID LA  G+ L N Y  P+CTP+RA LMTG+Y IHTGMQ   +
Sbjct: 40  GWHDVGYHDSI-IRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVL 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPR +P  E  +P+ L+E GY+T  +GKWHLGF++   TP +RGF++ FG       
Sbjct: 99  MAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGFDTFFGMYLAGGD 158

Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           Y++H    +      L   D+R  +   A + VG+Y+T +F ++A ++I+      PLFL
Sbjct: 159 YFNHT---RLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFL 215

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL+  A HA      L+ P+  IN ++  I D +RR YA M   +D+++G V   L R+G
Sbjct: 216 YLSFQAVHAP-----LQVPERYINMYKDDIRDESRRIYAGMTTCMDEAIGNVTRTLARRG 270

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           +L+N+II+F SDNG   +            SNYP RG K TL+EGGV+ PA ++SP +  
Sbjct: 271 LLDNTIIVFSSDNGGAKL---------YGASNYPLRGQKGTLYEGGVRAPAFVYSPLLSN 321

Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             R   + ++M++ DW PT    AGG       N+ G+                 +DG++
Sbjct: 322 RVRGTATHELMYVGDWFPTFLHLAGG-------NLHGVKP---------------LDGVN 359

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
           QW ++    PS            KRT  V                          N  PL
Sbjct: 360 QWHTVSAAMPS------------KRTEIVH-------------------------NIEPL 382

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
                 V+ + + +      N +  I  +  +R    +       +P    P     G  
Sbjct: 383 RGKPPRVKDEPWLT------NPYFDIRVMASIRVGEWKLITGRKGDPTWTIPPELGGGSV 436

Query: 568 --------YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
                    L+++  DP E+ +++  RPDI ++L E L+YH  T        P    +DP
Sbjct: 437 DPGRPTIVELYHIKEDPLERTDLSRDRPDIVNKLLERLRYHSDTSRLPETPTPVFDTSDP 496

Query: 620 KRFNDTWSPW 629
              N  W PW
Sbjct: 497 LHHNMAWGPW 506



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 27/70 (38%), Positives = 42/70 (60%), Gaps = 6/70 (8%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
           VG+Y+T +F ++A ++I+      PLFLYL+  A HA      L+ P+  IN ++  I D
Sbjct: 188 VGKYSTIVFAEKAQEIIKKHDPTNPLFLYLSFQAVHAP-----LQVPERYINMYKDDIRD 242

Query: 71  PNRRTYAALT 80
            +RR YA +T
Sbjct: 243 ESRRIYAGMT 252


>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
          Length = 560

 Score =  261 bits (666), Expect = 9e-67,   Method: Compositional matrix adjust.
 Identities = 172/541 (31%), Positives = 271/541 (50%), Gaps = 74/541 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 49  GFRDVGYHGS-EIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSII 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP  L++ GY+T  +GKWHLGF++R   P  RGF++ FG L G   
Sbjct: 108 RATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGD 167

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLI-EDQPVDKPLF 269
           +Y H   +       + G+D+      AW+   G Y+T +FT++A+ ++ +  P  KPLF
Sbjct: 168 HYSHYKCE----APGMCGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLF 223

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYLA+ A H+      L+ P   + +++ I++ +RR YAAMV  LD+++  +  AL+R G
Sbjct: 224 LYLAYQAVHSP-----LQVPSRYLERYKGISNVHRRKYAAMVSCLDEAIRNLTLALKRYG 278

Query: 330 MLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
             +N+++++ SDNG  P +           GSN+P RG K + WEGG++    + SP ++
Sbjct: 279 YYDNTVLVYSSDNGGQPLLG----------GSNWPLRGSKASYWEGGIRAVGFVHSPLLR 328

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
                   ++HI+DW PTL  + G  T    LN+DG D W ++    PS R+  +  +D 
Sbjct: 329 NKGTKCRSLIHITDWFPTL-VSLGEGTLEEHLNLDGYDVWEAISEGRPSPRHDILHNID- 386

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
              + +   +        I      AA+R+  WKL+ G      + GY     S+ VP  
Sbjct: 387 --PIYIKAKNGSWKAGYGIWNTAIQAALRVGHWKLLTG------IPGY-----SDWVPPQ 433

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
            F          S QQ+           D+ +S                          +
Sbjct: 434 TF----------SNQQMGSRWHNEHIRFDRGKS-------------------------LW 458

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+  DP E+ +++   P I  ++   L  + +T VP  +   DL +++P+     W P
Sbjct: 459 LFNITADPYERVDLSQRYPQIVKKMLIRLAQYNKTAVPVRYPAKDL-RSNPQYNGGVWGP 517

Query: 629 W 629
           W
Sbjct: 518 W 518



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 27/76 (35%), Positives = 47/76 (61%), Gaps = 7/76 (9%)

Query: 7   TAWDT-VGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
            AW+   G Y+T +FT++A+ ++ +  P  KPLFLYLA+ A H+      L+ P   + +
Sbjct: 190 AAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYLAYQAVHSP-----LQVPSRYLER 244

Query: 65  FQYITDPNRRTYAALT 80
           ++ I++ +RR YAA+ 
Sbjct: 245 YKGISNVHRRKYAAMV 260


>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 491

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 178/551 (32%), Positives = 272/551 (49%), Gaps = 107/551 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND S+ GS +I TPNID LA  GI L   Y QP+C+PSR++L+ GKYP H G+    I
Sbjct: 35  GWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+ L E  + ++L++ GYST A+GKW LG  + E+TP YRGF++ +GY +    
Sbjct: 94  TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDADED 153

Query: 212 YYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H          ++ G+ D R N     D  G Y+T LFTK A++   +   D P F+
Sbjct: 154 YYTH----------KVGGYLDFRNNTDPVKDEDGTYSTFLFTK-AIEDAINAKSDSPFFI 202

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y A+ + H       LEAP   +N+   I  PNR+ +  M+  LD+ +  + S LQ KG+
Sbjct: 203 YGAYQSVHG-----PLEAPDIYLNKCN-IPYPNRKIFCGMMLALDEGISNITSLLQTKGL 256

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+++III  +DNG  T             SN+P RG K T++EGGV+  + +WS +++++
Sbjct: 257 LDDTIIILTTDNGGQTALGS---------SNWPLRGNKATVFEGGVRGISFVWSTKLRKS 307

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
              +  MMHI+DW PT+     G T    L+  GL                  DG + W 
Sbjct: 308 NYDNNAMMHITDWYPTIIEGIAGMT----LDTKGL------------------DGFNMWD 345

Query: 451 SLLLNTPSRRNSVLINID----EKKRT-----AAVRLDSWKLVLGTQENGTMDGYYGQTR 501
           ++  +TPS R  +L  +D    + +R      AA+R   WKL++G        G +    
Sbjct: 346 TINDDTPSPRTEILHQLDPPRYDNERNPFIGQAALRQGDWKLIIGQPNCSEKIGPH--LV 403

Query: 502 SNKVP--LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
            NK P   ++ N  +E                                     NP+    
Sbjct: 404 GNKCPNGWVHLNGTIEEPE---------------------------------VNPS---- 426

Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
                   +LFN+ +DP E+N +++S P+I S+L E ++ +  T + Q     D  ++DP
Sbjct: 427 ------LTWLFNVKDDPNERNELSASHPEIVSKLKERIEAYNATHIVQMSPPID-PRSDP 479

Query: 620 KRFNDTWSPWI 630
           K+F + W+PW+
Sbjct: 480 KKFGNVWTPWL 490


>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
          Length = 477

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 170/538 (31%), Positives = 259/538 (48%), Gaps = 97/538 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +    +I TP +D LA +G+ILN+ Y QP+C+PSR   M+GK+P HTG+Q   +
Sbjct: 35  GWNDVGWINP-DIKTPTLDRLARSGVILNSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVV 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  +P+ +P     +P+ L+ LGY T  +GKWHLGF   +YTP YRGF++  GY  G+  
Sbjct: 94  FIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTPTYRGFDTFMGYYAGMED 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H+  +      + NG+D R          GEY+T +F   A+ +I +    KP++LY
Sbjct: 154 YFTHVRDE----VADYNGYDFRNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLY 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L   A H       L+ P +  +++ +I D NR+ Y+ MV  LDD+V  + + L+  G +
Sbjct: 210 LPFQAVHV-----PLQVPTKYSDRYSHIHDLNRKVYSGMVSALDDAVYNITTVLEDLGFM 264

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQN 390
           +N +++F +DNG P      T    NW    P RG KNTLWEGG +  A ++S   +++ 
Sbjct: 265 DNLLLVFTTDNGGP-----PTRGANNW----PLRGGKNTLWEGGTRGAAFVYSKTLLKKK 315

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
                 + H  DW PTL   AGGD+S+L                       +IDGL Q  
Sbjct: 316 GYTHPGLFHAVDWYPTLLDIAGGDSSQL-----------------------DIDGLSQVD 352

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            +L    S RN  + NI +    AA+R    KL++G   N    G+Y        P   F
Sbjct: 353 MILNGDSSVRNQFVYNIYDDHYAAALRWGDLKLIVG---NPGYSGWY--------PPPGF 401

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           +  V                 PI +                      TP P       L+
Sbjct: 402 DLDV-----------------PIMDD---------------------TPLPS----VMLY 419

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           N+  DP E  +I+   PD+ + L ++L     T VP + + P   +++P  F+ +WSP
Sbjct: 420 NISTDPTEHEDISKDYPDLVNFLLQMLARKNGTRVP-AMDPPSDPRSNPVNFDGSWSP 476



 Score = 45.8 bits (107), Expect = 0.075,   Method: Compositional matrix adjust.
 Identities = 23/81 (28%), Positives = 43/81 (53%), Gaps = 6/81 (7%)

Query: 3   RNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           RN++  +    GEY+T +F   A+ +I +    KP++LYL   A H       L+ P + 
Sbjct: 171 RNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKY 225

Query: 62  INQFQYITDPNRRTYAALTKS 82
            +++ +I D NR+ Y+ +  +
Sbjct: 226 SDRYSHIHDLNRKVYSGMVSA 246


>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 648

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 169/544 (31%), Positives = 275/544 (50%), Gaps = 83/544 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +H S  I TPNID LA  G+ L N Y  P+CTP+RA LMTG+Y IHT MQ   +
Sbjct: 39  GWHDVGYHDSI-IRTPNIDKLAAEGVKLENYYVTPICTPTRAVLMTGRYQIHTTMQHGVL 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R +P  E  +P+ L+E GYST  +GKWHLGF++ + TP +RGF++ FG+      
Sbjct: 98  MAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFYKWDCTPNHRGFDTFFGFYLAGGE 157

Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           Y+ H       R   L+  D+R  +     +  GEY+T L+ ++A ++I  Q  + P+FL
Sbjct: 158 YFTHTRKCHGHR---LDAWDLRDGDKMVGPEYTGEYSTMLYARKAQEVIRKQDPNVPMFL 214

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           Y++  A HA      LE P    + + + I D +R+ YA M   +D+++G V   L+ +G
Sbjct: 215 YVSFQAVHAP-----LEVPDSYADAYGKDIYDQSRKLYAGMTSCMDEAIGNVTQTLKDRG 269

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           M +N++++F +DNG  T           +G SNYP RG K   +EGGV+ PA + SP ++
Sbjct: 270 MWDNTVVVFSTDNGGAT----------GFGASNYPLRGQKCGHYEGGVRGPAFVSSPLLK 319

Query: 389 QNPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
            + R +   ++M++ DWLPT    AGG T                           +DG+
Sbjct: 320 PHVRGTKNTELMYVGDWLPTFVHLAGGTT----------------------HGTKPLDGV 357

Query: 447 DQWSSLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
           +QW ++    PSRR  +L NID  + ++  ++ + W           ++ Y+       +
Sbjct: 358 NQWEAISEGLPSRRKEILHNIDPIRAKSPTLKTEPW----------MLNPYFDIRVMAAI 407

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
            + ++  +   K   + Q+  +     + N    R T                 +P TN 
Sbjct: 408 RVGHWKLLTGQKGNANWQEPPE-----LGN----RLT-----------------NPYTNN 441

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
              L+++  DP E+N+I+++RPD+  ++   L+    T V  +H  P+   +DP + +  
Sbjct: 442 ILQLYHIKEDPEERNDISATRPDVVYRILLRLQALNSTAVLPTHVDPEFSLSDPVKHDMA 501

Query: 626 WSPW 629
           WSPW
Sbjct: 502 WSPW 505



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 45/81 (55%)

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
           H  A    ++ S  +N    L+++G DP E+  I+S RPD+  +L   L+YH  T V   
Sbjct: 567 HLPAEIGGISRSSYSNEVLQLYHIGEDPEERYEISSLRPDVVIRLLRRLQYHDSTAVLPV 626

Query: 609 HEQPDLVQADPKRFNDTWSPW 629
             +P+ +++DP + +  WSPW
Sbjct: 627 ETEPEFIESDPVKHDMAWSPW 647



 Score = 42.4 bits (98), Expect = 0.87,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 6/69 (8%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDP 71
           GEY+T L+ ++A ++I  Q  + P+FLY++  A HA      LE P    + + + I D 
Sbjct: 188 GEYSTMLYARKAQEVIRKQDPNVPMFLYVSFQAVHAP-----LEVPDSYADAYGKDIYDQ 242

Query: 72  NRRTYAALT 80
           +R+ YA +T
Sbjct: 243 SRKLYAGMT 251


>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
          Length = 684

 Score =  261 bits (666), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 175/557 (31%), Positives = 266/557 (47%), Gaps = 94/557 (16%)

Query: 73  RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           R+   A +      +    GWND+ FH    + TPNID LA  G+ILN  Y QP+C+PSR
Sbjct: 218 RKQVEAKSPHILFIVADDLGWNDVGFHNP-AMKTPNIDKLAREGLILNQTYLQPLCSPSR 276

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            +LMTG YP H G+Q   I   +P   PL  +FLP+ L+++GY+T  +GKWHLGF     
Sbjct: 277 HALMTGYYPYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNC 336

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           TP YRGF+S FGY N    +Y H            N  D R N     +  G Y+T  F 
Sbjct: 337 TPTYRGFDSFFGYYNAQGDHYSHTW---------YNYLDYRDNEKPVKNLNGTYSTFTFV 387

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
             A  +I+     +PLFLY+A    H       ++ P++  + +  I    RR ++ MV 
Sbjct: 388 SRAQDIIKKHNSSQPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVS 442

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTL 371
            LD+++G +   L+  GM ++++IIF SDNG  P          + +G+NYP RG K T+
Sbjct: 443 ALDEAIGNITDQLRESGMYDDTLIIFTSDNGGWP----------KYYGNNYPLRGGKFTV 492

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +EGG +V + +    +Q+       +MH  DW  TL  AAGG   +  L  DG+ QWSS+
Sbjct: 493 YEGGTRVVSFVHGAGLQKTGEAYESIMHAVDWSDTLVEAAGG---KPDLMGDGVSQWSSI 549

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
             N P  R   +   D                  ++  ++  AA+R+  +KL+       
Sbjct: 550 ANNFPGPRTELVYNFDS-----------------DLAPEEGHAAIRVGDYKLI------- 585

Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG 551
             DGY G                          L  + + P    D++   + + +I   
Sbjct: 586 --DGYPG--------------------------LYPDWYKP----DQLYKFKPEPSI--- 610

Query: 552 ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
               P T +    G   LFNL +DP E N+++ SRPDI +++ + L+   +  VP ++  
Sbjct: 611 ----PETQAHA--GRFQLFNLKDDPTEHNDLSKSRPDIVTEMSQRLRLLTQNAVPPNYPL 664

Query: 612 PDLVQADPKRFNDTWSP 628
               +++P +F+D WSP
Sbjct: 665 VPDPKSNPSKFDDVWSP 681


>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 524

 Score =  260 bits (665), Expect = 1e-66,   Method: Compositional matrix adjust.
 Identities = 176/577 (30%), Positives = 267/577 (46%), Gaps = 98/577 (16%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHG---SNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
           +YA    +    L   YG+ND+ +HG    + I TPN+D LA  G+ L N Y QP+C+P+
Sbjct: 23  SYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICSPT 82

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           R+ LM+G+Y IHTG+Q   I   +P  +PL E  LP+ L+E GY+T  +GKWH+GF+   
Sbjct: 83  RSQLMSGRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHIGFYLDA 142

Query: 192 YTPLYRGFESHFGYLNGVISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
             P  RGF+S+F       ++ DH                       N +  +   G+Y+
Sbjct: 143 CLPTERGFDSYF-------AWEDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYS 195

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           T LFT + + +IE     KPLF+YLA+ A H       LE P   ++ +  ITD NRRTY
Sbjct: 196 THLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKNRRTY 250

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           A MV  +D+ VG V  AL+  G+ +N+IIIF +DNG P          ++  +N+P RG 
Sbjct: 251 AGMVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGPI---------QSGANNWPLRGS 301

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           K +LWEGG+     + SP +  + +  V+ +++H+SDWLPT+     G        +DG 
Sbjct: 302 KGSLWEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTIVAGIAGGALNGTKPLDGY 361

Query: 426 DQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNS----------VLINIDEKKR 472
           + WSS+     S R     NID L     +L   PS  +           VL +  +   
Sbjct: 362 NVWSSISGTASSPRKEILHNIDSLYIAPEIL---PSYFDGEDFLWRFLARVLEDKFDTSI 418

Query: 473 TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLP 532
            AA+R+  WK+  G   +                         S  +Q   +   +   P
Sbjct: 419 RAALRVGDWKIKTGPDASA------------------------SDVWQPPAESGLHTIYP 454

Query: 533 ISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ 592
           I                              N   +LFN+  DP E+ +++   PD+  +
Sbjct: 455 IEK---------------------------PNQKVWLFNITADPLERTDLSGKYPDVVDE 487

Query: 593 LYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
             + L+ + +  VP  +  PDL +ADP     +W PW
Sbjct: 488 FLQRLEEYYKGSVPVRYPDPDL-EADPALHGGSWGPW 523



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y+T LFT + + +IE     KPLF+YLA+ A H       LE P   ++ +  ITD N
Sbjct: 192 GQYSTHLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKN 246

Query: 73  RRTYAALTKS 82
           RRTYA +   
Sbjct: 247 RRTYAGMVSC 256


>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
          Length = 570

 Score =  260 bits (665), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 147/396 (37%), Positives = 225/396 (56%), Gaps = 24/396 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 59  GFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P  +  LP+ L+ELGYST  +GKWHLGF+++E  P  RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D       + G D+    S AW   G+Y+T L+T+   +++    P  +PLF+
Sbjct: 178 YYTYDSCDGPG----MCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFI 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +L+  A H       L+ P+E I  ++ + +  RR YAAMV  +D++V  +   L++ G 
Sbjct: 234 FLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGY 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            ENSI+IF +DNG   +         + GSN+P RG K T WEGGV+    + SP +++ 
Sbjct: 289 YENSIMIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLRKK 339

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  ++HI+DW PTL   AGG  S +   +DG D W ++   +  R +  ++ L    
Sbjct: 340 KRVSKALVHITDWYPTLVGLAGGKESHIE-GVDGYDVWEAI---SDGRESPRLEILHNID 395

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            L  +  S        I      A++R   WKL+ G
Sbjct: 396 PLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG 431



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LFN+  DP E+ +++  RPD+  QL   L Y+ RT VP  +   DL +ADP      W
Sbjct: 464 VWLFNISGDPYERLDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDL-RADPHLNGGAW 522

Query: 627 SPWI 630
            PW+
Sbjct: 523 VPWV 526



 Score = 45.8 bits (107), Expect = 0.068,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 49/96 (51%), Gaps = 7/96 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AW   G+Y+T L+T+   +++    P  +PLF++L+  A H       L+ P+E I  
Sbjct: 199 SVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYP 253

Query: 65  FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFH 99
           ++ + +  RR YAA+  +    +  I YG     ++
Sbjct: 254 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYY 289


>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
 gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
          Length = 270

 Score =  260 bits (664), Expect = 2e-66,   Method: Compositional matrix adjust.
 Identities = 124/256 (48%), Positives = 168/256 (65%), Gaps = 9/256 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM---QG 148
           GW+D+SFHGS++IPTP ID LA  G+ILN+ Y  P+CTP+RASLMTGK+P++ GM     
Sbjct: 14  GWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKHPMNLGMLIHTH 73

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             ++G +P G+PL E   P+Y++ LGY T  IGKWHLGFF +EYTP YRGF+S +G+ NG
Sbjct: 74  ATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWNG 133

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              Y+DH      S   ++ G D+R N     +  G Y T+LF + A Q+I      KPL
Sbjct: 134 KEDYWDH------SSQEDVWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPL 187

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           +LYLA    H+ N  + L+AP+  I +F +I+ P RR YAAMV  LD+SV TV  AL   
Sbjct: 188 YLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVHKALSET 247

Query: 329 GMLENSIIIFMSDNGA 344
           GML N++++F +DNG 
Sbjct: 248 GMLNNTVLVFTTDNGG 263



 Score = 68.2 bits (165), Expect = 1e-08,   Method: Compositional matrix adjust.
 Identities = 32/82 (39%), Positives = 46/82 (56%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +R N     +  G Y T+LF + A Q+I      KPL+LYLA    H+ N  + L+AP+ 
Sbjct: 151 LRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYLAQQGVHSANGNEPLQAPKR 210

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            I +F +I+ P RR YAA+  S
Sbjct: 211 LIKKFSHISSPKRRIYAAMVSS 232


>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 784

 Score =  259 bits (661), Expect = 4e-66,   Method: Compositional matrix adjust.
 Identities = 158/413 (38%), Positives = 232/413 (56%), Gaps = 44/413 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +HGS  I TPNIDALA  G+ L+N Y   +CTPSR  L++G+Y IHTG+Q   I
Sbjct: 53  GWSDVGYHGS-VIKTPNIDALASEGVKLDNYYMSLLCTPSRGQLLSGRYEIHTGLQHRTI 111

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P+ E  LP+ L+E GY+T  +GKWHLGF+R+E  P YRGF++  G+  G+  
Sbjct: 112 DMMQPLCLPIDETILPQKLKERGYATHMVGKWHLGFYRKECLPNYRGFDTFMGFYQGMAD 171

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H +S     T   +G D RRN    A    G+Y+T +F  EA  +I     + PLFL
Sbjct: 172 YYYHNIS-----TGIYHGWDFRRNNDVIAQKYAGQYSTHVFADEAQIIIMKHNPEVPLFL 226

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYIT--DPNRRTYAAMVKKLDDSVGTVISALQRK 328
           +L+  A H       L+ P    + ++ +   + +R+ YA MV  +D++VG V++AL+  
Sbjct: 227 FLSFQAIHLP-----LQVPSRYADMYKTLIPNNADRQKYAGMVTCMDEAVGRVVTALKHT 281

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+ ENS+++F +DNG               G N+P RG K +L+EGGV+    + SP + 
Sbjct: 282 GLWENSVLVFSTDNGGIL------------GGNWPLRGSKASLYEGGVRGVGFVTSPLLP 329

Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGD-TSRLPLNIDGLDQWSSLLLNTPSRRNS---N 442
              R  V+ +++H++DWLPTL   AGGD     PL  DG +QW ++   TPS R+     
Sbjct: 330 SRVRGTVNRELLHMTDWLPTLVRLAGGDLMGTKPL--DGYNQWETISQGTPSVRDEVLLT 387

Query: 443 IDGLDQWSSLLLNTPSRRN---SVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
           ID L   +    + P   N   +VLI        AA+R+  WKL+ G   NG+
Sbjct: 388 IDPLIAKAHHFKDDPYADNDSFNVLIR-------AAIRVGDWKLITGPPGNGS 433



 Score = 47.4 bits (111), Expect = 0.022,   Method: Compositional matrix adjust.
 Identities = 23/63 (36%), Positives = 34/63 (53%)

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           L+N+  DP E+N+ +  RPDI   L   + ++  T VP      +++ ADP R    WSP
Sbjct: 456 LYNVVADPNERNDYSHDRPDIVRFLLARIGHYYNTSVPVLDPSQEILLADPARHGGVWSP 515

Query: 629 WIY 631
           W Y
Sbjct: 516 WRY 518


>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  258 bits (660), Expect = 6e-66,   Method: Compositional matrix adjust.
 Identities = 177/569 (31%), Positives = 272/569 (47%), Gaps = 90/569 (15%)

Query: 77  AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
            A+ +S+   ++ V     GW+D+ +HGS  + TP++D LA  GI L N Y QP+CTP+R
Sbjct: 15  CAVARSSQPNIVFVLADELGWHDIGYHGS-RVQTPHLDKLASEGIKLENYYVQPMCTPTR 73

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           + LM+G+Y IHTG+Q   I   +   +PL E  + + L+E GYST  +GKWHLG + +  
Sbjct: 74  SQLMSGRYQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGC 133

Query: 193 TPLYRGFESHFGYLNGVISYYDH----ILSDQYSRTVELNGHDMRRN--LSTAWDTVGEY 246
            P  RGF+S FG+ N  + YY +        +    + + G D+ RN     A    G Y
Sbjct: 134 LPTKRGFDSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHY 193

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
            T +  KEA  +I      KPLFLYLA  A H       LE P+   + +  + D +RR 
Sbjct: 194 QTHVLAKEAEDVIRKHNPSKPLFLYLAFGAVHVP-----LEVPKVYEDMYADVKDNSRRI 248

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             AM   +DDS+  V  AL+  G+ +N+I IF +DNGA T            G ++P +G
Sbjct: 249 LLAMATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA---------YGGCSWPLKG 299

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
            K +L+EGG++    + SP +  + + +  +++MH+SDW PTL   AGG+ S     +DG
Sbjct: 300 GKTSLFEGGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGELSGTK-PLDG 358

Query: 425 LDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
            +QW ++ L  PS+R     NID L    +   +T        +++      AA+R+  W
Sbjct: 359 YNQWETISLGRPSQRTEILINIDPLIPCEACANHTKWDHKKFDVHVH-----AAIRMGEW 413

Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
           KL+ G   +                    N  V    Y ++    +NI+L          
Sbjct: 414 KLITGNPHH--------------------NYWVTPAEYPNM----KNIYL---------- 439

Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
                       P  M          +L+N+  DP EQ++++ + P I + L + L  + 
Sbjct: 440 ----------NEPGKMV---------WLYNIAADPNEQDDLSDTFPGIVNILLDRLAEYN 480

Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
            T VP      D   ADP R   +W PW+
Sbjct: 481 ETAVPVQFPDCD-TGADPARRGGSWGPWV 508



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 35/112 (31%), Positives = 47/112 (41%), Gaps = 16/112 (14%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y T +  KEA  +I      KPLFLYLA  A H       LE P+   + +  + D +
Sbjct: 191 GHYQTHVLAKEAEDVIRKHNPSKPLFLYLAFGAVHVP-----LEVPKVYEDMYADVKDNS 245

Query: 73  RRTYAALTKSTTLTLLIVYG-------WNDLSFHGSNEIPTPNIDALAYNGI 117
           RR   A+      ++  V G       W++  F  S    T N  A AY G 
Sbjct: 246 RRILLAMATCMDDSIADVTGALKDTGLWDNTIFIFS----TDNGAATAYGGC 293


>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 461

 Score =  257 bits (656), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 177/539 (32%), Positives = 258/539 (47%), Gaps = 84/539 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H S +I  PN+  LA +G+I N  Y  P CTPSRA++MTG YP  TG Q   +
Sbjct: 2   GWNDVHWHNS-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +   P GVPL  + LPE L+E+GYST  +GKWHLGF + EY P  RGF+SH+G     +S
Sbjct: 61  FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVS 120

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            YD  ++   S +    G+D R N+     +  +Y   +    A  ++       PLF  
Sbjct: 121 DYDK-MNGVLSPSA---GYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLF-- 173

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
              LA       KHLE P+E   ++  I D   R +   +  +DD +G V+ AL+ +GM 
Sbjct: 174 ---LAFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMY 230

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++S+IIF+ DNGA + +          GSNYP+RG+  TL+EG  +VP+I+    I++  
Sbjct: 231 DDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTG 281

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             S ++  I D   T+   AG +                            +DGL  W +
Sbjct: 282 YESNELYSIVDIHRTILDVAGAEAE------------------------PELDGLSMWPT 317

Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
                PS RN +L NID+   +  AA+R+  +KL+ G  +                 L  
Sbjct: 318 WSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPD----------------LLYP 361

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           +  I  +  Y +          P S    + ST QQ+       PAP           YL
Sbjct: 362 YRDISLTDNYYNYGD------APASG---LPSTDQQSHA-----PAPQNVE-------YL 400

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           FN+  DP E+N+I++  PDI   L   L  HR+ L+P  ++  +L  A    FND WSP
Sbjct: 401 FNIVVDPEERNDISADHPDIVQDLRARLDEHRKLLIPPVNQTTELAGA-AGNFNDVWSP 458


>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 516

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 172/545 (31%), Positives = 268/545 (49%), Gaps = 85/545 (15%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG++D+ +H S  I TPN+D LA  G+ L N Y QP+CTP+R+ LM+G+Y IHTG+Q   
Sbjct: 50  YGFHDIGYHDS-IIKTPNLDRLASEGVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGI 108

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P  +P+ E  +P+ L+E GY+T  +GKWHLG +++E  P  RGF++ FGYL G  
Sbjct: 109 IWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSE 168

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY H  S  Y +    +G D R N+        G+Y+T +F ++A  +I+      PLF
Sbjct: 169 DYYTHNRS--YDK---FHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLF 223

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL   A H       L+ P +    +  IT+  RRTYA MV  +D++VG +   L+  G
Sbjct: 224 LYLPFQAVHGP-----LQVPDQYEKPYANITNKQRRTYAGMVSCMDEAVGNITQTLKDTG 278

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +NS+IIF +DNG   +           G+N+P RG K +LWEGG+     + SP I  
Sbjct: 279 LYDNSVIIFSTDNGGQVIA---------GGNNWPLRGWKGSLWEGGMHGVGFVNSPLINH 329

Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             R   S +++H++DW PT+   AGG       +++G                  +DG +
Sbjct: 330 QARGTTSKELIHVTDWFPTIVHLAGG-------SVNG---------------TQPLDGYN 367

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
           QW ++     S R  +L N           +D   + L T  N  +       R++    
Sbjct: 368 QWETISNGKKSPRKELLHN-----------IDPNYIALQTSNNNMI------YRTDM--- 407

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP- 566
             FNA + +              L + +   +      ++       +  +P P T+ P 
Sbjct: 408 --FNASIRAA-------------LRVGDWKILTGDPGNSSWIPPPESSITSPHP-TDIPG 451

Query: 567 --CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
              +LFN+  DP E  +++  RPD  ++L E L+ + +T VP  +   D    DP + + 
Sbjct: 452 KIIWLFNIAEDPNEYKDLSLFRPDKVNELLEKLQQYYKTSVPVRYPDNDPA-CDPGKHSG 510

Query: 625 TWSPW 629
            W PW
Sbjct: 511 AWGPW 515



 Score = 43.1 bits (100), Expect = 0.46,   Method: Compositional matrix adjust.
 Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 5/68 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y+T +F ++A  +I+      PLFLYL   A H       L+ P +    +  IT+  
Sbjct: 198 GQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHGP-----LQVPDQYEKPYANITNKQ 252

Query: 73  RRTYAALT 80
           RRTYA + 
Sbjct: 253 RRTYAGMV 260


>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
           carolinensis]
          Length = 580

 Score =  256 bits (655), Expect = 2e-65,   Method: Compositional matrix adjust.
 Identities = 170/542 (31%), Positives = 274/542 (50%), Gaps = 79/542 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D  +HGS +I TP +D LA  G+ L N Y +P+CT SR+ L+TG+Y IHTG+Q   I
Sbjct: 69  GFHDXGYHGS-DIXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQIHTGLQHSII 127

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P  +  LP+ L+E GYST  +GKWHLGF+R+E  P  RGF++  G L G + 
Sbjct: 128 RPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 187

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+    + AW+  G+Y+T L+ +   +++      +P+F+Y
Sbjct: 188 YYTYDNCDGPG----VCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIY 243

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E I +++ + +  RR YAAMV  +D++V  +  AL++ G  
Sbjct: 244 IAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 298

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +NS+I+F +DNG  T          + GSN+P RG K T WEGGV+    + SP I+   
Sbjct: 299 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKHKR 349

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R+S  ++HI+DW PTL T A G+ S    ++DG + W ++     S R   +  +D    
Sbjct: 350 RISRALIHITDWYPTLVTLAQGNVSEAD-DLDGYNVWPAISEGKESPRTEILHNIDP--- 405

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   ++  S+     I      A+VR+  WKL+ G        GY     S+ +P   
Sbjct: 406 --LYNHAKYGSLESGFGIWNTAVQASVRVGDWKLLTGDP------GY-----SDWIPP-- 450

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M    +++                     +L
Sbjct: 451 ----------QTLATFPGSWW----NLERMTGGMRKSV--------------------WL 476

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ +++  RPD+   L   L  + +T +P  +  E P   +A P      W 
Sbjct: 477 FNITADPYERYDLSEQRPDVVRALLLRLVRYNQTAIPVRYPAENP---RAHPDFNGGAWG 533

Query: 628 PW 629
           PW
Sbjct: 534 PW 535



 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 21/75 (28%), Positives = 43/75 (57%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW+  G+Y+T L+ +   +++      +P+F+Y+A  A H       L++P+E I ++
Sbjct: 209 NVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIYIAFQAVHT-----PLQSPKEYIYRY 263

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 264 RSMGNVARRKYAAMV 278


>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  256 bits (653), Expect = 3e-65,   Method: Compositional matrix adjust.
 Identities = 175/539 (32%), Positives = 260/539 (48%), Gaps = 84/539 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I  PN+  LA +G+I N  Y  P CTPSRA++MTG YP  TG Q   +
Sbjct: 38  GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +   P GVPL  + LPE L+E+GYST  +GKWHLGF + EY P  RGF+SH+G     + 
Sbjct: 97  FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            YD  ++   S +    G+D R N+     + G Y   +    A  ++       PLF  
Sbjct: 157 DYDK-MNGVLSPSA---GYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLF-- 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
              LA       KHLE P+E   ++  I D   R +   +  +DD +G V+ AL+ +GM 
Sbjct: 210 ---LAFTLDIPAKHLEIPEEYEEKYSDIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMY 266

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++S+IIF+ DNGA + +          GSNYP+RG+  T++EG  +VP+I+    I++  
Sbjct: 267 DDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTMFEGATRVPSIVSGKGIKKTG 317

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             S ++  I D   T+   AG                        ++    +DGL  W +
Sbjct: 318 YESNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSMWPT 353

Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
              + PS RN +L NID+   +  AA+R+  +KL+ G  +                 L  
Sbjct: 354 WSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPDL----------------LYP 397

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           +  I  +  Y +          P S    + ST QQ+       PAP           YL
Sbjct: 398 YRDISLTDNYYNYGD------APASG---LPSTDQQSHA-----PAPQNVE-------YL 436

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           FN+  DP E+N+I++  PDI   L + L  HR+ L+P  ++  +L  A    FND WSP
Sbjct: 437 FNVVVDPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGA-AGNFNDVWSP 494


>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 571

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 174/540 (32%), Positives = 252/540 (46%), Gaps = 70/540 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS+ I TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 57  GYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P     LPE L E GY+T  +GKWHLGF R    P  RGF+S  G L G   
Sbjct: 116 RPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGFQSFLGTLTGSGD 175

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++ +    Q     E  G D+      AW+  G Y+T L+     Q+++      PLFLY
Sbjct: 176 HFSY----QSCDGAEACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLY 231

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+  AAH       L+ P   ++ +    +  RR YAAM+  LD  V  V+  L+ +G+ 
Sbjct: 232 LSLQAAHTP-----LQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVAQVVQELKTQGLY 286

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ENS++I+ SDNG   +         + GSN+P RG K T WEGG++    + SP +++  
Sbjct: 287 ENSVLIYSSDNGGQPL---------SGGSNWPLRGGKGTYWEGGIRAVGFVHSPLLKKKG 337

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            VS  ++H+SDW PTL   AG   S                       +  +DG D W S
Sbjct: 338 IVSRALIHVSDWYPTLLGLAGAQQS-----------------------HHGLDGHDVWGS 374

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD-GYYGQTRSNKVPLLNF 510
           +    P  R  +L NID   R A    D   ++ G    G  D       R+    LL  
Sbjct: 375 ISEGLPCPRTEILFNIDPVSRKAGEPYDKALILSGF---GIWDTAVRAALRAGDWKLLTG 431

Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
           N  V    +   Q L          ++K R+   ++                     +LF
Sbjct: 432 N--VGDGDWIPPQALPGGPER-WKKLEKRRNKLGKSV--------------------WLF 468

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           N+  DP E+ ++A +RP++   L   L  + +T V  +   PD   ADP+     WSPW+
Sbjct: 469 NVSADPYERYDLAETRPEVVKHLMTRLAEYNQTAV-MARNPPDDPMADPELHGGAWSPWL 527



 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 5/72 (6%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
           AW+  G Y+T L+     Q+++      PLFLYL+  AAH       L+ P   ++ +  
Sbjct: 199 AWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDS 253

Query: 68  ITDPNRRTYAAL 79
             +  RR YAA+
Sbjct: 254 HRNRLRRHYAAM 265


>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
          Length = 515

 Score =  256 bits (653), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 143/361 (39%), Positives = 204/361 (56%), Gaps = 23/361 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS EI TP +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I
Sbjct: 87  GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL    LP+ L+E+GYST  +GKWHLGF R+E  P  RGF++ FG L G   
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMPTRRGFDTFFGSLLGSGD 205

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q++      KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G 
Sbjct: 262 YTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316

Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             NSIII+ SDNG  PT            GSN+P RG K T WEGG++    + SP ++ 
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V  +++HI+DW PTL + A G      + +DG D W ++     S R   +  +D +
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETISEGLRSPRVDILHNIDPY 425

Query: 450 S 450
           +
Sbjct: 426 T 426



 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY A+ A H+      L+AP 
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYTAYQAVHS-----PLQAPG 276

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296


>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
          Length = 1941

 Score =  255 bits (652), Expect = 4e-65,   Method: Compositional matrix adjust.
 Identities = 164/517 (31%), Positives = 258/517 (49%), Gaps = 84/517 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++   +I TP +D LAY G+I N  Y QP+CTP+RA+LMTG YP   GMQ   +
Sbjct: 54  GWNDIGWNNL-QIKTPVLDKLAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMV 112

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P G+PL  + LP+ L++ GY    +GKWHLG+   EYTPL RGF+S +G  +  ++
Sbjct: 113 LPFQPSGLPLHLKILPQKLKQAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVN 172

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           + + I     S   +  G+D R N     +  G+  T LFT+  V +I +   D P+F+Y
Sbjct: 173 HNNKISQLPISDHSKYKGYDFRDNTGVVQND-GQPLTKLFTQRVVDIISNHHKDYPMFMY 231

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                       K  E  ++    +  +TD   R + A +  +DD+VG VI AL+ +GM 
Sbjct: 232 FP-----INQPSKGSEPDEKYTALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMW 286

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           E +++IF+SDNGA   ++ E       G+N P RG   TL+EGG++VPAI     I++  
Sbjct: 287 EETLLIFISDNGA--FDFME-------GTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTG 337

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            ++ ++ HI+DW  T+ + AG                         +   ++DG++ W +
Sbjct: 338 YINNELNHITDWHSTILSLAG------------------------IQPEPDLDGINIWGT 373

Query: 452 LLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           +     S RN ++ NIDE  + R AA+RL  WK++ G    G M  +        + L  
Sbjct: 374 VSEGKASARNEIVYNIDEIDECRGAALRLGDWKVITGNP--GLMCRF--------LDLNK 423

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
            +  ++S  +   Q ++   F  ISN+                               YL
Sbjct: 424 QDGWLQSGRHSDNQAITD--FQSISNVS------------------------------YL 451

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
           FNL  DP E++N+A   P++ S L + L  +R+ +VP
Sbjct: 452 FNLKEDPEERHNVAEKYPEVFSALLDRLNEYRKNVVP 488


>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
          Length = 1542

 Score =  254 bits (649), Expect = 9e-65,   Method: Compositional matrix adjust.
 Identities = 172/558 (30%), Positives = 271/558 (48%), Gaps = 109/558 (19%)

Query: 92   GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
            G+ D+ +HGS EI TP +D LA  G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q   I
Sbjct: 1038 GFRDVGYHGS-EIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSII 1096

Query: 152  WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               +P  +PL    LP+ L+  GYST  +GKWHLGF++R   P  RGF++ FG L G   
Sbjct: 1097 RPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFDTFFGSLLGSGD 1156

Query: 212  YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            YY H   D    +  L G+D+      AW+   G Y+T ++T++AV ++      +P+FL
Sbjct: 1157 YYSHYKCD----SPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHNPKRPIFL 1212

Query: 271  YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            YLA  A H+      L+ P   + +++ I + +RR YAAMV  LD++V  +  AL++ G 
Sbjct: 1213 YLAFQAVHSP-----LQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNLTLALKQYGY 1267

Query: 331  LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             +N ++++ SDNG   +           GSN+P RG K + WEGG++    + SP + + 
Sbjct: 1268 YDNMVMVYSSDNGGQPMA---------GGSNWPLRGSKGSYWEGGIRAVGFVHSPLLVKK 1318

Query: 391  PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
               S  ++HI+DW PTL     G           LD+            N N+DG D W 
Sbjct: 1319 GTKSKALIHITDWYPTLVMLGEGT----------LDE------------NQNLDGYDVWE 1356

Query: 451  SLLLNTPSRRNSVLINID------------------EKKRTAAVRLDSWKLVLGTQENGT 492
            ++    PS R  +L NID                       AA+R+  WKL+ G      
Sbjct: 1357 AISEGRPSPRLDILHNIDPIYTKAKNGSWKAGYGIWNTAVQAALRVGDWKLLTG------ 1410

Query: 493  MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
            + GY     S+ +P   F+ ++ +  + + +           + D+ +S           
Sbjct: 1411 VPGY-----SDWIPPQTFSTLLLANRWHNER----------VSWDRGKSV---------- 1445

Query: 553  NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ-LYELLKYHRRTLVPQSHEQ 611
                           +LFN+  DP E+ +++   P +  + L++LL+Y +   VP  +  
Sbjct: 1446 ---------------WLFNITADPYERVDLSRQYPHVVKKMLFKLLRYSKNA-VPCRYPP 1489

Query: 612  PDLVQADPKRFNDTWSPW 629
             D  +++P+     W PW
Sbjct: 1490 KDY-RSNPQYNGGVWGPW 1506



 Score = 45.4 bits (106), Expect = 0.095,   Method: Compositional matrix adjust.
 Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 12/95 (12%)

Query: 8    AWDT-VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
            AW+   G Y+T ++T++AV ++      +P+FLYLA  A H+      L+ P   + +++
Sbjct: 1180 AWEQDRGVYSTIMYTQKAVNILASHNPKRPIFLYLAFQAVHSP-----LQVPARYLERYK 1234

Query: 67   YITDPNRRTYAALTKS-----TTLTL-LIVYGWND 95
             I + +RR YAA+          LTL L  YG+ D
Sbjct: 1235 SIPNLHRRKYAAMVSCLDEAVRNLTLALKQYGYYD 1269


>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
          Length = 732

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 144/383 (37%), Positives = 210/383 (54%), Gaps = 21/383 (5%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ +H    I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +P     LPE LRE GY+T  +GKWHLGF+R+   P  +GF++ FG L G + 
Sbjct: 119 RPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFDTFFGSLTGSVD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
           YY +   D      +L G+D+  +   AW   G+Y+T LFT+ A +++E   P ++PLFL
Sbjct: 179 YYSYESCD----GKDLCGYDLHDDEGVAWGQEGKYSTTLFTQRARKILESHDPAERPLFL 234

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            L+  A H       L  P+  I  F+ + +  RR +A MV  +D++V  V  AL++ G 
Sbjct: 235 LLSFQAVHTP-----LNPPESYIYPFRDMVNVARRNFAGMVSTVDEAVRNVTYALRKYGY 289

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             NS+II+ +DNGA              G+N+P RG K T WEGGV+    + SP +++ 
Sbjct: 290 YRNSVIIYSTDNGAQPF---------TGGNNWPLRGRKGTYWEGGVRGVGFVHSPLLRRR 340

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            RVS  +MHI+DW PTL   A G+ S+    +DG D W ++     S R   +  +D   
Sbjct: 341 RRVSKALMHITDWFPTLVGLARGNISQ-SQGLDGFDVWPAISEGRESPRKEILHNIDPLH 399

Query: 451 SLLLNTPSRRNSVLINIDEKKRT 473
              +   ++      N   KKRT
Sbjct: 400 KSPIEKKNQGEENSGNYRGKKRT 422



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 25/63 (39%), Positives = 38/63 (60%), Gaps = 1/63 (1%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LFN+  DP E++++A  RPDI  QL   L Y+ +T VP  +  PD  +A+P +    W 
Sbjct: 621 WLFNITGDPYERHDLADQRPDIVQQLLARLAYYNQTAVP-VYFPPDDPRANPSQHRGAWV 679

Query: 628 PWI 630
           PW+
Sbjct: 680 PWV 682



 Score = 46.6 bits (109), Expect = 0.044,   Method: Compositional matrix adjust.
 Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 6/78 (7%)

Query: 7   TAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
            AW   G+Y+T LFT+ A +++E   P ++PLFL L+  A H       L  P+  I  F
Sbjct: 201 VAWGQEGKYSTTLFTQRARKILESHDPAERPLFLLLSFQAVHTP-----LNPPESYIYPF 255

Query: 66  QYITDPNRRTYAALTKST 83
           + + +  RR +A +  + 
Sbjct: 256 RDMVNVARRNFAGMVSTV 273


>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
 gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
          Length = 418

 Score =  253 bits (646), Expect = 2e-64,   Method: Compositional matrix adjust.
 Identities = 166/496 (33%), Positives = 238/496 (47%), Gaps = 88/496 (17%)

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           G+Q   I   E  G+PL    +P++ ++LGY+T  +GK  +G F    T   R  + H  
Sbjct: 1   GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRH-- 56

Query: 205 YLNGVISYYDHILSDQYSRTVELN--------GHDMRRNLSTAWDTVGEYATDLFTKEAV 256
                        S +  R  ++         G D+  N        G Y TD+ T +AV
Sbjct: 57  --------TQKSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAV 108

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            +I      KPLFLYL+H +AH+      L+AP++ I +F YI + NR  YA MV  +DD
Sbjct: 109 SIIASHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDD 168

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           + G ++ AL   G+L N+I++F SDNGA P +E      + N G N+P RG K TLWEG 
Sbjct: 169 AFGRLVEALSGAGVLNNTIVVFSSDNGAVPIIE------FPNRGFNWPLRGAKKTLWEGA 222

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           V+VPA +WSP ++ + RVS QMMHI DWLPT Y+ AGG+ S L                 
Sbjct: 223 VRVPAFIWSPLLESSGRVSDQMMHIVDWLPTFYSVAGGNLSHL----------------- 265

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
                   DG D W +L   T S R  +L+NID    T+++R  + K+VLGT  NG+ D 
Sbjct: 266 -----GEQDGFDMWKALSEGTKSPREEMLLNIDPVLNTSSLRYKNHKVVLGTYHNGSFDH 320

Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG-ANP 554
            +  T                                     +    R++AT+ CG  N 
Sbjct: 321 RFKTT------------------------------------GRRPGWRRRATVRCGECNL 344

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
           A     P +  P YLF++  DPCE NN+A   P+I  QL + L+ +  T V   +     
Sbjct: 345 AKDNVKPSS--PPYLFDVVADPCELNNLAPEHPEILRQLLKRLEDYATTAVDPENVYETD 402

Query: 615 VQADPKRFNDTWSPWI 630
            ++ P+R N TW+PW+
Sbjct: 403 ARSFPERHNGTWAPWL 418



 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 30/71 (42%), Positives = 42/71 (59%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
            G Y TD+ T +AV +I      KPLFLYL+H +AH+      L+AP++ I +F YI + 
Sbjct: 95  CGNYDTDMLTDKAVSIIASHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGER 154

Query: 72  NRRTYAALTKS 82
           NR  YA +  S
Sbjct: 155 NRTIYAGMVDS 165


>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
           purpuratus]
          Length = 784

 Score =  252 bits (644), Expect = 4e-64,   Method: Compositional matrix adjust.
 Identities = 134/367 (36%), Positives = 215/367 (58%), Gaps = 27/367 (7%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
           I++   + ++ LTK   + +LI   G+ND+ +HGS EI TPNID LA  G+ L N Y QP
Sbjct: 13  ISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVRLENYYVQP 71

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +CTP+R+ L++G+Y IHTG+Q   I  A+P  +P       + LRE GY+T A+GKWHLG
Sbjct: 72  ICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLG 131

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
           F+++E  P  RGF+S+FGYL G   Y+ H     + +   L    +R     AW+  G Y
Sbjct: 132 FYKKECLPTQRGFDSYFGYLTGGEDYWTH-----HRKRPXLA---LRHVDKVAWEYGGYY 183

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +  +FT++  Q++   PV++P  LYL   + H+      L+ P     +++ I + NRR 
Sbjct: 184 SAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERYKNIKNTNRRI 238

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA M+  LD++VG ++ +LQ+ G+ +N+++IF +DNG               G+N+P RG
Sbjct: 239 YAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVA---------AGGNNWPLRG 289

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
            K ++WEGG++    + SP +  + +  V+ Q++H+SDW PTL     G +S   + +DG
Sbjct: 290 WKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAG-SSLDNITLDG 348

Query: 425 LDQWSSL 431
            + W ++
Sbjct: 349 FNMWKTI 355



 Score = 44.7 bits (104), Expect = 0.15,   Method: Compositional matrix adjust.
 Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 5/74 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
             AW+  G Y+  +FT++  Q++   PV++P  LYL   + H+      L+ P     ++
Sbjct: 174 KVAWEYGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERY 228

Query: 66  QYITDPNRRTYAAL 79
           + I + NRR YA +
Sbjct: 229 KNIKNTNRRIYAGM 242


>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
 gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
          Length = 540

 Score =  251 bits (640), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 139/364 (38%), Positives = 207/364 (56%), Gaps = 26/364 (7%)

Query: 85  LTLLIVYGWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
             L+   GW+D+ +H  S+ + TPNID LA  G+ L + Y+QP+CTPSR +LMTGKYPIH
Sbjct: 39  FILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIH 98

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            GMQ   I    P G+P     +P+ LR LGY T  IGKWHLGFF  +YTPL RGF+S  
Sbjct: 99  LGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFL 158

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           G+  G   ++ H          ++   D RR+   A +  G+++TD+FT+EA+ +     
Sbjct: 159 GFFAGEQDHWRH---------SKMGFLDFRRDEEPANEYGGQHSTDVFTQEAINIAMRHN 209

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
             +PLFL L++ A H       L+A    +N+   ++D +R+ Y  M+   D S+G +I 
Sbjct: 210 ASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLID 264

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
             +R G+  N+++I+ SDNGA           +  G N+P RG K++L+EGGV+VPA + 
Sbjct: 265 VYKRNGLWNNTLMIWASDNGA--------QPGKGGGYNWPLRGYKSSLFEGGVRVPAFVH 316

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
              +Q+       + H++DW PTL   AGG+      +IDG+DQW +L    PS+R   +
Sbjct: 317 GEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEP---DIDGVDQWPTLSEGKPSKREEIL 373

Query: 444 DGLD 447
             +D
Sbjct: 374 HNID 377



 Score = 45.4 bits (106), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 28/99 (28%), Positives = 51/99 (51%), Gaps = 8/99 (8%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+   A +  G+++TD+FT+EA+ +       +PLFL L++ A H       L+A   
Sbjct: 178 FRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPN 232

Query: 61  TINQFQYITDPNRRTYAALTKSTTLT---LLIVYGWNDL 96
            +N+   ++D +R+ Y  +  +   +   L+ VY  N L
Sbjct: 233 DVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGL 271


>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  250 bits (639), Expect = 1e-63,   Method: Compositional matrix adjust.
 Identities = 172/551 (31%), Positives = 256/551 (46%), Gaps = 108/551 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I  PN+  LA +G+I N  Y  P CTPSRA++MTG YP  TG Q   +
Sbjct: 38  GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQML 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +   P G+PL  + LPE L+E+GYST  +GKWHLGF + EY P  RGF+SH+G     + 
Sbjct: 97  FNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVG 156

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            YD    +LS          G+D R N      + G Y   +    A  ++       PL
Sbjct: 157 DYDKMNGVLSPS-------KGYDFRDNTGVVQKSNG-YLALMLGDRAEHIVNTHYPGTPL 208

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           F     LA       KHL  P+E  N++  I D   R +   +  +DD +G V+ AL+ +
Sbjct: 209 F-----LAFTLDIPAKHLAIPEEYENKYADIEDSRTRHFYGKLTMMDDIIGGVVDALKNR 263

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+ ++S+IIF+ DNGA + +          GSNYP+RG+  TL+EG  +VP+I+    I+
Sbjct: 264 GIYDDSLIIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIK 314

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           +    S ++  I D   T+   AG                        ++    +DGL  
Sbjct: 315 KTGYESNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSM 350

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGT-------QENGTMDGY--Y 497
           W +     PS RN ++ NID+   +  AA+R+  +KL+ G        ++ G  D Y  Y
Sbjct: 351 WPTWSKGKPSPRNEIIYNIDDDPVSPGAAIRIGDYKLITGHPDLLYPYRDIGITDNYYNY 410

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
           G   ++ +P                                             A+ A M
Sbjct: 411 GDPLASGLPT--------------------------------------------ADQAAM 426

Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
            P P      YLFN+  DP E+N+I++  PDI   L + L  HR+ L+P  ++  +L  +
Sbjct: 427 APPPLDVE--YLFNVIVDPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGS 484

Query: 618 DPKRFNDTWSP 628
               FND WSP
Sbjct: 485 -AANFNDVWSP 494


>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
          Length = 1683

 Score =  250 bits (639), Expect = 2e-63,   Method: Compositional matrix adjust.
 Identities = 162/512 (31%), Positives = 249/512 (48%), Gaps = 72/512 (14%)

Query: 119  LNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK 178
            L + Y  P+CTP+R+ LM+G+Y IHTG+Q   I GA+P G+PLT+   PE  ++ GY T 
Sbjct: 1233 LQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTH 1292

Query: 179  AIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST 238
             +GKWHLGF+  +  P  RGFESH+GYL G   +Y+H    Q    V+    D   + ++
Sbjct: 1293 MVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHSQFMQGQHGVDF--RDGGASTNS 1350

Query: 239  AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
            +W   G+Y+ DLF K    L+E   V++ L++Y+     H       LEAPQ  ++QF +
Sbjct: 1351 SW---GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSW 1402

Query: 299  ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
            I D +RR YAAM K +DD+VG +I++ + KG+ E++I+ F +DNG  T+           
Sbjct: 1403 IKDRDRRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTLYG--------- 1453

Query: 359  GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
            G+N+P RG+KNTLWEGG++    +  P +  + R   Q+MH+ D  PTL           
Sbjct: 1454 GNNWPLRGLKNTLWEGGIRGIGSIKIPGVSADKRD--QLMHVVDMFPTL----------- 1500

Query: 419  PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
                  LD     L + P+ R    DG  Q   L+ N  S+ +S LIN+D  ++      
Sbjct: 1501 ------LD-----LTSCPNTRKVKFDGKSQAKMLVDNDESQSDSFLINVDPLRKNKGADN 1549

Query: 479  DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
             +W      +    +       R  +  LL       S  Y      + +I+ P    D 
Sbjct: 1550 RTWHSEFDVRVQAGL-------RWQQWKLL--TGAPSSDNYP-----TGHIYPPEWPQDG 1595

Query: 539  MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
             ++++                   +N    LF++ NDP E   I++   +I  ++  +L 
Sbjct: 1596 RKNSQSNFE--------------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLS 1641

Query: 599  YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
             +  T VP     PD     P +    W PWI
Sbjct: 1642 SYNSTAVPCRWPNPDPAGL-PDQNGGFWKPWI 1672



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Compositional matrix adjust.
 Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 13   GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
            G+Y+ DLF K    L+E   V++ L++Y+     H       LEAPQ  ++QF +I D +
Sbjct: 1353 GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRD 1407

Query: 73   RRTYAALTKSTTLTL 87
            RR YAA+TKS   T+
Sbjct: 1408 RRVYAAMTKSMDDTV 1422


>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 533

 Score =  249 bits (637), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 174/523 (33%), Positives = 243/523 (46%), Gaps = 87/523 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ FHGS +IPTP +DALA +G+ L N +  PVC+PSRAS+++G++ IH G+  P  
Sbjct: 67  GFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAIHHGIYMPFA 126

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G     + L    LPE LR LGY T A+GKWHLG   R   P  RGF+S  GY +G   
Sbjct: 127 QGTAYH-LSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSFLGYWSGAED 185

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  +  Y         D   N +TAW   G Y+   FT  AV ++       P FLY
Sbjct: 186 YFAHDCAGAY---------DFANNETTAWAYDGVYSAYSFTDRAVDVVAS--ASTPYFLY 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A    H       LE P   +  F ++    RR  + MV  LD++VG V  AL+ +   
Sbjct: 235 VAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNVTGALKARRAY 289

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++++++F+SDNG PT     T++     +N+P RG KNTL+EGG +V  +L +P + ++ 
Sbjct: 290 DDAVVVFVSDNGGPTNGDEGTAS-----NNWPLRGGKNTLYEGGTRVVGLLKAPGVAKS- 343

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             S   +H  DW PTLY  AGG                 L+LN P       DG+D +  
Sbjct: 344 SASRAKLHAVDWFPTLYRFAGG---------------GPLVLNPPLAPG---DGVDSYDV 385

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           L    P R        DE             L+L     G+ D  +G             
Sbjct: 386 LARGAPGR--------DE-------------LLLEAHAPGSNDTVHG------------- 411

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA----TIHCGANPAP----MTPSPCT 563
              E+      + L      P S     R   Q+A     + CGA P P     T   C 
Sbjct: 412 ---EALIVGDWKFLRIGTVRPDSEAGWTRPPGQEAHPRSVVDCGAAPEPADANATRRQCV 468

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
             PC LF++  DPCE+ + A   PD+ + L   L  +R T VP
Sbjct: 469 AAPC-LFDVAADPCERVDRAVEFPDVLAALDARLDAYRATAVP 510


>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
 gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
          Length = 480

 Score =  249 bits (636), Expect = 3e-63,   Method: Compositional matrix adjust.
 Identities = 178/551 (32%), Positives = 251/551 (45%), Gaps = 88/551 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+SFHGS +IPTPNID LA +G+IL+N YA P+CTPSRA+LMTG YPIHTGM    I
Sbjct: 5   GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A P G+ L  + +P++  +LGY    IGK H           + G  +     +G+  
Sbjct: 65  QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSH-----------HDGCHNFDSTKSGIDL 113

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV------- 264
            +  ++S    +      HD        ++     A     K  +Q +    +       
Sbjct: 114 LHTPLISSIPGQ------HDNTWMYGKTYNFYRTRAICRLQKRVIQGVVVYLIRFLLDLH 167

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD-DSVGTVIS 323
            +P F YL+H A H+       +AP   + +F YI + NR  YA   KK     +G  + 
Sbjct: 168 SQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWLGLALV 227

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           +L +     + +++  SDNG           + N G N+P RG K  LWEGGV+VPA +W
Sbjct: 228 SLVKTKNYVDILLMGFSDNGGSPFGL-----FSNRGFNWPLRGAKANLWEGGVRVPAFVW 282

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
           S +  + PRVS Q+MHISDWLPTLY+AAGGD S L                       ++
Sbjct: 283 SSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNL----------------------GSM 320

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY----GQ 499
           DG+D W  L L   S R  VL+ ID K   AA+R  ++KLV+G   N  +DG Y    G 
Sbjct: 321 DGIDMWRYLTLGFGSPRKEVLLTIDPKNNLAALRYKNYKLVVGEGFNEELDGRYSFPGGL 380

Query: 500 TRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
             +N VP L      +SK  + L+                +  R  + +    NP     
Sbjct: 381 RPTNDVPQLR----KDSKVARVLKSFYG------------KQNRNWSPLDWSKNPV---- 420

Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
             C            +    NN  + +P     +   L  +  T VP    QP   +A P
Sbjct: 421 VDC-----------RERWLTNNFVARQPPYFRFMMSKLAAYNATAVPPLF-QPQDPRAYP 468

Query: 620 KRFNDTWSPWI 630
           +     WSPW+
Sbjct: 469 EYHGGVWSPWL 479



 Score = 43.1 bits (100), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 28/54 (51%)

Query: 35  KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLL 88
           +P F YL+H A H+       +AP   + +F YI + NR  YA  TK T L  L
Sbjct: 169 QPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWL 222


>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
          Length = 514

 Score =  248 bits (634), Expect = 5e-63,   Method: Compositional matrix adjust.
 Identities = 166/525 (31%), Positives = 255/525 (48%), Gaps = 94/525 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S+H +  + +P ++ LA  G+IL+  YA P CTP+RA+LMTG+YP   G+Q   I
Sbjct: 36  GWNDVSWH-NERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL    L E L+  GY+T A GKWHLG+  +  TP  RGF++  G+  G   
Sbjct: 95  RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H LS   + +    G+D R      +   G Y T L     +  IE+   DKP+FLY
Sbjct: 155 YFSHTLSGGKT-SATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A HA      L+ P +         +P R+    M+  +D +VG ++  L+ K M 
Sbjct: 214 VAFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMW 268

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            N++++F+SDNG   +           G+N+P RG KNTL+EGG +VPA +  P I+   
Sbjct: 269 RNTLLVFISDNGGQIL---------YGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGG 319

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPL-NIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
           R S  ++H+ DW PTL +AAGG+++   L ++DG+DQW+++       R+  +DG     
Sbjct: 320 RNSSSIIHVVDWFPTLLSAAGGESAESQLKDMDGVDQWAAI-------RDGTLDGT---- 368

Query: 451 SLLLNTPSRRNSVLINIDEKKR--TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
                    R   + N+D KK     A+R    K +L   +  T  G+Y +         
Sbjct: 369 ---------RTEFVYNLDCKKGKLVGAIRDGDLKFILKPSQVRT--GWYEE--------- 408

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
                 E K  ++ ++L                         G NPA M           
Sbjct: 409 ------ELKGSENPEKL-------------------------GRNPARM----------- 426

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
           LFN+  DP E++++ +S    + +L   L+  +R L+  S   PD
Sbjct: 427 LFNISADPLEKDDLFASLKTEAQRLEAKLR--KRALLLVSSRNPD 469


>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 519

 Score =  248 bits (633), Expect = 6e-63,   Method: Compositional matrix adjust.
 Identities = 151/426 (35%), Positives = 218/426 (51%), Gaps = 69/426 (16%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GWND+ +H S+ I +PNI+AL  +G+ L N Y QP CTPSR+ LM+G+Y IHTG+Q   
Sbjct: 32  HGWNDIGYH-SHIIRSPNINALCNDGVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSV 90

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +P  +PL E  L E L+E+GY+T  +GKWHLGF+     P  RGF+S FGYL G  
Sbjct: 91  IRNDQPNCLPLDEVTLAEKLKEVGYATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQE 150

Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY HI            G+D++R+    +    G+Y T +FT EA  +I+      PLF
Sbjct: 151 DYYKHIHD---------GGYDLKRHETDVSKQYQGDYTTHVFTSEAQNIIKSHDPSTPLF 201

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           LY+++ + HA     +L+ P+   N +   I D +RR  A MV  +D+ +G +   L+  
Sbjct: 202 LYMSYQSVHA----NYLQVPEHYSNMYNGVIDDEDRRIVAGMVTCMDEGIGNITQTLKET 257

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            +  N+IIIF SDNG          +  + G+N+P RG K T WEG +     ++S  I 
Sbjct: 258 ELWNNTIIIFSSDNGG---------DPNDGGNNWPLRGEKGTHWEGAIHGLGFVYSSDIA 308

Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           ++ R  V+ +M+H+SDW PT+   AGG                SL    P      +DG 
Sbjct: 309 KDVRGTVNTEMIHVSDWFPTIVNLAGG----------------SLNGTKP------LDGF 346

Query: 447 DQWSSLLLNTPSRRNSVLINID--------------------EKKRTAAVRLDSWKLVLG 486
           DQW ++     S R  +LINID                          A+R+  WKL+ G
Sbjct: 347 DQWQTISQGAESPRTEILINIDPLVPPPSDMEHFFHGWNGTYNTSMCGAIRVGDWKLITG 406

Query: 487 TQENGT 492
               G+
Sbjct: 407 YPGEGS 412



 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
           G+Y T +FT EA  +I+      PLFLY+++ + HA     +L+ P+   N +   I D 
Sbjct: 176 GDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHA----NYLQVPEHYSNMYNGVIDDE 231

Query: 72  NRRTYAALT 80
           +RR  A + 
Sbjct: 232 DRRIVAGMV 240


>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  248 bits (633), Expect = 7e-63,   Method: Compositional matrix adjust.
 Identities = 172/546 (31%), Positives = 260/546 (47%), Gaps = 77/546 (14%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ND+ +  +  + TPN+D LA  GI L+N Y QP+CTPSRA LM+GKY IHTG+Q   
Sbjct: 69  YGFNDIGYR-NPAMRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGKYQIHTGLQHSI 127

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P  +PL    LP+ L+E GY+T   GKWHLGF+++E  P  RGF+S  G L G  
Sbjct: 128 IWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGFDSFLGILLGKG 187

Query: 211 SYYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            ++ H        Y  T    G D R  L +     G Y+T +  +    +IE    DKP
Sbjct: 188 DHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIYSTHVIAERVENIIEKHDKDKP 247

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           LFLY++  A H       L+ P+  +  F+  I D  RR YA M   +D++VG +   L+
Sbjct: 248 LFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRRIYAGMTYCMDEAVGNITKKLK 302

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           ++G+ ++++++F SDNG          N     SN+P RG K TLWEGGV+    + SP 
Sbjct: 303 KQGLWDDTVLVFSSDNGG---------NIDQGASNWPLRGSKTTLWEGGVRAVGFVTSPL 353

Query: 387 IQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
           + +  +  VS +++ ISDW PTL     G T      +DG + W +L    PS R   + 
Sbjct: 354 LSERMKGTVSRELIDISDWYPTLIEGVAGWTLS-GTKLDGYNIWETLRSGKPSARVELLH 412

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK 504
            +D   +     P+   +   N    +  AA+R   WK+V G       +G+Y    S+K
Sbjct: 413 NIDPLITPPSTWPNESIAAAHNSFSTRTYAALRYKDWKIVTGYXS--INNGWYSPAESSK 470

Query: 505 VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
                             Q ++  I LP  ++                            
Sbjct: 471 ------------------QSVASEI-LPGKSV---------------------------- 483

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL-VQADPKRFN 623
              +LFN+  DP E +++++  P I + L E L+ ++    P  +  PD+  +A+P    
Sbjct: 484 ---WLFNITRDPREFHDLSNQEPAIVNFLLERLESYQSGASPVLY--PDIDTKANPAGHG 538

Query: 624 DTWSPW 629
             W  W
Sbjct: 539 GIWDSW 544



 Score = 42.7 bits (99), Expect = 0.62,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 6/69 (8%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
           G Y+T +  +    +IE    DKPLFLY++  A H       L+ P+  +  F+  I D 
Sbjct: 224 GIYSTHVIAERVENIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDE 278

Query: 72  NRRTYAALT 80
            RR YA +T
Sbjct: 279 KRRIYAGMT 287


>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
 gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
          Length = 502

 Score =  247 bits (630), Expect = 1e-62,   Method: Compositional matrix adjust.
 Identities = 154/402 (38%), Positives = 209/402 (51%), Gaps = 50/402 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS +I TPNID LA  G  L   Y QP+CTP+RA+ MTG+YP   GMQ   I
Sbjct: 71  GWKDVGFHGS-DIKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRYPFRYGMQTAVI 129

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+ L +  LPE L+E GY+T A GKWHLG  +  + P  RGF+S +G L G I 
Sbjct: 130 PQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFDSFYGALLGEID 189

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD--LFTKEAVQLIEDQPVDKPLF 269
           ++ H       ++   N    R N     D + E   D  LF  EAV++I +    KPLF
Sbjct: 190 HFTH-------KSANGNADWYRNN-----DALEEEGFDNVLFADEAVRVINEHDQAKPLF 237

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYLA  + H        +AP+E + +  +I D +RR YAAM+  +DD +G V+ AL+ +G
Sbjct: 238 LYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRVVEALEMRG 292

Query: 330 MLENSIIIFMSDNGAPT-VEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
           M EN++I+F SDNG      +   SN +      N PYR  K T +EGG +V  ++ W  
Sbjct: 293 MRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRVVGLVNWPG 352

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           +I+  P  +  MMH+ D  PTL   AG +                        +N  +DG
Sbjct: 353 KIE--PGTANGMMHVVDMYPTLAALAGAELG----------------------KNKPLDG 388

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
           +D W  L    PS R  ++ NI+    TAAVR   WKLV  T
Sbjct: 389 MDMWPVLGEGKPSPRTEIVYNIE--PMTAAVRQGDWKLVWKT 428



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 5/61 (8%)

Query: 19  LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
           LF  EAV++I +    KPLFLYLA  + H        +AP+E + +  +I D +RR YAA
Sbjct: 218 LFADEAVRVINEHDQAKPLFLYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAA 272

Query: 79  L 79
           +
Sbjct: 273 M 273


>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
 gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
          Length = 495

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 163/447 (36%), Positives = 236/447 (52%), Gaps = 48/447 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS+ I TPN+D LA  G  L   Y+QP+CTPSRA+L+TG+YP   G+Q   I
Sbjct: 59  GWKDVGFHGSD-IRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVI 117

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A   G+P  E  LP+ L+E GY T  +GKWHLG    +Y P  RGF+  +G L G I 
Sbjct: 118 PSAGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEID 177

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H    +          D  RN     +    Y T L  ++AV+LIE      PLFLY
Sbjct: 178 YFTHSAHGKV---------DWYRNNQLIKEE--GYVTTLLGQDAVKLIEKHNPKTPLFLY 226

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LA  A HA       +APQ+ ++Q++ I DPNRR YAAM+  +DD +G V++AL+++GM 
Sbjct: 227 LAFTAPHA-----PYQAPQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMR 281

Query: 332 ENSIIIFMSDNGAP-TVEYR---ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
            N++I+F SDNG P + ++    +TS       N PYR  K +L+EGG +V A+  W  +
Sbjct: 282 NNTLIVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGK 341

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           IQ    V+   +HI D  PTL                      + L +    +N  +DGL
Sbjct: 342 IQPGTVVN-HPIHIVDMYPTL----------------------TGLASVSVGKNKPLDGL 378

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY-YGQTRSNKV 505
           + W +L    PS R+ V+ +I+  +  AA+  + WKLV        ++ +   Q  S + 
Sbjct: 379 NIWPALSEAKPSPRSQVVYDIEPFR--AALSQEDWKLVWKATLPSRLELFNLSQDVSEQT 436

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLP 532
            L   N  + S+  Q ++ LS++  LP
Sbjct: 437 NLAEQNPEIVSRLKQQIEVLSRDAVLP 463



 Score = 58.9 bits (141), Expect = 7e-06,   Method: Compositional matrix adjust.
 Identities = 31/65 (47%), Positives = 41/65 (63%), Gaps = 5/65 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T L  ++AV+LIE      PLFLYLA  A HA       +APQ+ ++Q++ I DPNRR
Sbjct: 201 YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPHA-----PYQAPQKYLDQYKTIADPNRR 255

Query: 75  TYAAL 79
            YAA+
Sbjct: 256 AYAAM 260


>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
 gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
          Length = 498

 Score =  247 bits (630), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 158/417 (37%), Positives = 219/417 (52%), Gaps = 54/417 (12%)

Query: 78  ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           AL +ST   ++ +     GW D+ FHGS +I TPNID LA  G  L   YAQP+CTP+RA
Sbjct: 49  ALAQSTKPNIVFIVADDLGWKDVGFHGS-DIKTPNIDQLAEKGGRLEQFYAQPMCTPTRA 107

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           +LMTG+YP   GMQ   I      G+ L +  LPE L+E GY+T A GKWHLG     + 
Sbjct: 108 ALMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFW 167

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD--LF 251
           P  RGF+S +G L G I ++ H       ++   N    R N     + + E   D  LF
Sbjct: 168 PRQRGFDSFYGALLGEIDHFTH-------KSANGNADWYRNN-----EAIEEAGFDNILF 215

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
             EAV++I +    KPLFLYLA  + H        +AP+E +++  +I D +RR YAAM+
Sbjct: 216 ATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADESRRAYAAMI 270

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT-VEYRETSNYRNW--GSNYPYRGVK 368
             +DD +G V+ AL+ +GM EN++I+F SDNG      +   S+ +      N PYR  K
Sbjct: 271 SVMDDGIGRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPADNGPYRQGK 330

Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
            T +EGG +V  ++ W  +I+  P  +  MMH+ D  PTL   AG               
Sbjct: 331 GTTYEGGTRVVGLINWPGKIE--PGSADGMMHVVDMYPTLAALAGA-------------- 374

Query: 428 WSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
                  TP  +N  +DG+D W  L    PS R  ++ N++    TAAVR   WKLV
Sbjct: 375 -------TPG-KNKPLDGMDMWPMLGEGKPSPRTEIVYNVE--PMTAAVRQGDWKLV 421



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 5/61 (8%)

Query: 19  LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
           LF  EAV++I +    KPLFLYLA  + H        +AP+E +++  +I D +RR YAA
Sbjct: 214 LFATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADESRRAYAA 268

Query: 79  L 79
           +
Sbjct: 269 M 269


>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 480

 Score =  246 bits (629), Expect = 2e-62,   Method: Compositional matrix adjust.
 Identities = 178/554 (32%), Positives = 267/554 (48%), Gaps = 88/554 (15%)

Query: 81  KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           KS    +++++    GWND+ +H + +I  PN+  LA +G+I N  Y  P CTPSRA++M
Sbjct: 6   KSDQPHIILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMM 64

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG YP  TG Q   ++   P GVPL  + LPE  +E+GYST  +GKWHLGF + EY P  
Sbjct: 65  TGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTN 124

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGF+SH+G     +  YD  ++   S +    G+D R N+     + G Y   +    A 
Sbjct: 125 RGFDSHYGIWTLGVGDYDK-MNGVLSPSA---GYDFRDNMGVVPKSNG-YLALMLGDRAE 179

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            ++ +     PLF     LA       KHLE P+E    +  I D   R +   +  +DD
Sbjct: 180 HIVNNHYPGTPLF-----LAFTLDIPAKHLEIPEEYEEPYADIEDDRTRQFYGKLTMMDD 234

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
            +G V+ AL+ +GM ++S+IIF+ DNGA + +          GSNYP+RG+  TL+EG  
Sbjct: 235 IIGGVVDALKNRGMYDDSLIIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGAT 285

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
           +VP+I+    I++    S ++  I D   T+   AG                        
Sbjct: 286 RVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVAG------------------------ 321

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMD 494
           ++    +DGL  W +   + PS RN +L NID+   +  AA+R+  +KL+ G  +     
Sbjct: 322 AKAEPELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPD----- 376

Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
                       L  +  I  +  Y +          P S    + ST Q+A       P
Sbjct: 377 -----------LLYPYRDISLTDNYYNYGD------PPASG---LPSTDQEANA-----P 411

Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
           AP           YLFN+  DP E+N+I++ RPDI  +L + L  HR  L+P  ++  +L
Sbjct: 412 APQNVE-------YLFNVVVDPEERNDISADRPDIVQELRDRLDEHRNLLIPPVNQTTEL 464

Query: 615 VQADPKRFNDTWSP 628
             A    FND WSP
Sbjct: 465 AGA-AGNFNDVWSP 477


>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
 gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
          Length = 501

 Score =  245 bits (626), Expect = 5e-62,   Method: Compositional matrix adjust.
 Identities = 153/402 (38%), Positives = 214/402 (53%), Gaps = 57/402 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D  F+GS +IPTPN+D LA  G  L   YA P+CTP+RA+LMTG+YP+  G+Q   I
Sbjct: 71  GFGDAGFNGS-DIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRYPLRYGLQVGVI 129

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A    +P+ E  LP+ L++ GY+T  +GKWHLG  + E+ P  RGF+  +G L G I 
Sbjct: 130 PAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFDYFYGALVGEID 189

Query: 212 YYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           ++ H    + D Y     LN         T +D        LF  EAV+++E      PL
Sbjct: 190 HFKHSSHGVKDWYRNNKPLN--------ETGFDNT------LFGNEAVRVVERHEGKSPL 235

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  A H        +APQ+ +++F+ I D NRR YAAM+  +DD VG V++AL++K
Sbjct: 236 FLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGKVVAALEKK 290

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWG-----SNYPYRGVKNTLWEGGVKVPAIL- 382
           GM +N+II+F SDNG   V+    +     G     SN PYR  K TL+EGG +V A+  
Sbjct: 291 GMRDNTIIVFHSDNGG--VKNSLFAGDSKVGGELPASNGPYRDGKGTLYEGGTRVAALAN 348

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W  +I+  P V   M+H+ D  PTL   A                      +    +N  
Sbjct: 349 WPGKIK--PGVVDSMVHVVDMYPTLAAVA----------------------DAKLEKNKP 384

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           +DG++ W +L    PS R+ V+ N+D      AVR   WKLV
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVD--PLAGAVRKGDWKLV 424



 Score = 52.8 bits (125), Expect = 6e-04,   Method: Compositional matrix adjust.
 Identities = 27/61 (44%), Positives = 37/61 (60%), Gaps = 5/61 (8%)

Query: 19  LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
           LF  EAV+++E      PLFLYLA  A H        +APQ+ +++F+ I D NRR YAA
Sbjct: 217 LFGNEAVRVVERHEGKSPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAA 271

Query: 79  L 79
           +
Sbjct: 272 M 272


>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
          Length = 484

 Score =  244 bits (624), Expect = 7e-62,   Method: Compositional matrix adjust.
 Identities = 142/399 (35%), Positives = 209/399 (52%), Gaps = 52/399 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H    + TPNID LA  G+ILN  Y QP+C+PSR + MTG YP   G+Q   I
Sbjct: 37  GWNDVGYHNP-AMKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P   PL  +FLP+ L++LGY+T  +GKWHLG    + TP YRGF+S FG+ +    
Sbjct: 96  MPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKAD 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI S +Y         D R N     +  G Y+T  FT  A  +I+     +PLFLY
Sbjct: 156 YYSHI-SYKYL--------DYRDNEKPVKNLNGTYSTFTFTSRAQDIIKKHNSSQPLFLY 206

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A    H     + L+ PQ+  + +       RR Y+ MV  LD+++G +   L+  GM 
Sbjct: 207 MAFPIPH-----EPLQVPQQYEDMYPNTMSKGRRQYSGMVSALDEAIGNITDQLRESGMY 261

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++++I+F SDNG   ++Y         G+NYP RG K T++EGG +V + +    +Q+  
Sbjct: 262 DDTLIVFTSDNGGSPMDY---------GNNYPLRGAKFTVYEGGTRVVSFVHGAGLQKTG 312

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
                MMH  DW  TL  AAGG+                     P  R+   DG+ QW++
Sbjct: 313 EAYESMMHAVDWSDTLVAAAGGE---------------------PDMRD---DGVSQWTA 348

Query: 452 LLLNTPSRRNSVLINID----EKKRTAAVRLDSWKLVLG 486
           +  N+P  R  ++ N D     ++  AA+R+  +KL+ G
Sbjct: 349 IAQNSPGPRTELVYNFDPDSAPEQGHAAIRVGDYKLIHG 387



 Score = 45.8 bits (107), Expect = 0.065,   Method: Compositional matrix adjust.
 Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           LF + +DP EQN++A S+PDI  +L + LL   +  L P     PD  ++DP  F++ WS
Sbjct: 422 LFYIKDDPTEQNDLAQSKPDIVKKLSQRLLMLTKNALPPNYPLVPD-PKSDPTHFDNVWS 480

Query: 628 P 628
           P
Sbjct: 481 P 481


>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
          Length = 621

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 140/377 (37%), Positives = 204/377 (54%), Gaps = 25/377 (6%)

Query: 109 IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPE 168
           +D LA  G+ L N Y QP+CTPSR+  +TGKY IHTG+Q   I   +P  +PL    LP+
Sbjct: 80  LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139

Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
            L+E+GYST  +GKWHLGF+R+E  P  RGF++ FG L G   YY H   D    +  + 
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMC 195

Query: 229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
           G+D+  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+
Sbjct: 196 GYDLYENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQ 250

Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PT 346
           AP      ++ I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG  PT
Sbjct: 251 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPT 310

Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
                       GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PT
Sbjct: 311 AG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPT 360

Query: 407 LYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN 466
           L + A G        +DG D W ++   +   R+  +D L     +     +   +    
Sbjct: 361 LISLAEGQIDE-DTQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYG 416

Query: 467 IDEKKRTAAVRLDSWKL 483
           I      +A+R+  WKL
Sbjct: 417 IWNTAIQSAIRVQHWKL 433



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 199 LYENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 253

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 254 RYFEHYRSIININRRRYAAM 273


>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
 gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
          Length = 487

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 143/405 (35%), Positives = 211/405 (52%), Gaps = 40/405 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + ++ TP +D LA+ G+ILN  Y   VCTPSR + MTG YP H G Q    
Sbjct: 40  GWNDVGWH-NPDVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYYPYHAGSQHLVF 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              + +G+P    FLPE L++LGY+T  +GKWHLGF   +YTP YRGF+S +GY N    
Sbjct: 99  LPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFDSFYGYYNADED 158

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H+++          G D+R +        G+Y T  FT   V +IE  P D PLFLY
Sbjct: 159 YYTHVVA---------GGLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKHPADTPLFLY 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L     H     + LE P+   N +  + + NRR    MV  LD++VG V  A++  G+ 
Sbjct: 210 LPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVTMAMKNAGLW 264

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+++IF +DNG   +           G+NYP RG K TLWEGG +  A +    +Q+  
Sbjct: 265 DNTLVIFTTDNGGWIIAS---------GNNYPLRGGKVTLWEGGTRGVAFVHGKMLQKTG 315

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             + +M+H  DW PT+  AA          +DG+ Q  +++   PS R   +  +D    
Sbjct: 316 YTNNEMIHAVDWFPTILAAA---GGTADDAMDGVSQLDTIMSGQPSPRTEFVYNIDD--- 369

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
                P+++ ++ I       T  +R+  +KL+ G    G  DG+
Sbjct: 370 ---VFPTKQGAIRIIC-----TLCLRVGDYKLIEGVA--GKPDGW 404



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 24/61 (39%), Positives = 37/61 (60%), Gaps = 1/61 (1%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LFNL +DP E +N+A S PD   ++   L  +R++LVP  +  PD  ++ PK +   WS
Sbjct: 425 WLFNLKDDPTEHHNLADSMPDKLKEMQAKLAEYRKSLVPAINPPPD-PKSFPKNWGGAWS 483

Query: 628 P 628
           P
Sbjct: 484 P 484



 Score = 45.8 bits (107), Expect = 0.061,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y T  FT   V +IE  P D PLFLYL     H     + LE P+   N +  + + N
Sbjct: 182 GQYGTYFFTDRMVDIIEKHPADTPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNEN 236

Query: 73  RRTYAALTKS 82
           RR    +  +
Sbjct: 237 RRKLLGMVSA 246


>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
          Length = 518

 Score =  244 bits (623), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 139/360 (38%), Positives = 202/360 (56%), Gaps = 33/360 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S+H S  +  PN+  LA  G+IL + YAQ  CTPSRA+ +TG+YPI+TGMQ   +
Sbjct: 41  GWNDVSWHNS-IVQMPNLQDLAERGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVV 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +  G+P+  + LP YL++ GY+T  IGKWH+G+    YTP  RGF+SH+G+ N  IS
Sbjct: 100 VATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGIS 159

Query: 212 YYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           Y        YS T   + G+D R +L+      G+Y T  FT +A  LI++     P+FL
Sbjct: 160 Y------SNYSSTEGTDVGYDYRDDLALNLAAEGKYTTTDFTDQAKTLIDNHDQTNPMFL 213

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYI-----TDPNRRTYAAMVKKLDDSVGTVISAL 325
           Y+A+ A H          P E    ++ I      D NR+TY  M+  LD+ VG ++  L
Sbjct: 214 YMAYNAPH---------TPFEVEESYRDIYDGNLRDGNRKTYLGMISALDEQVGQLVDKL 264

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +  GM  N++ +F SDNG    +  ++      G+N+P RG K +L+EGG +VPA L + 
Sbjct: 265 KEVGMWSNTVFVFYSDNGGTQPQSGQS------GNNFPLRGKKGSLFEGGYRVPAFLIAR 318

Query: 386 QIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
               N       S  + HISD   T    AGGD + +P +IDG++ W ++  NTPS R +
Sbjct: 319 TRAGNLELIASTSSTLFHISDMFATFIALAGGDAA-VPADIDGINAWPAISGNTPSPRTT 377


>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
          Length = 782

 Score =  244 bits (622), Expect = 1e-61,   Method: Compositional matrix adjust.
 Identities = 165/499 (33%), Positives = 241/499 (48%), Gaps = 82/499 (16%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   IW  +P  VPL E+ LPE L+E GY+T  +GKWHLG +R+E  P  RGF+S
Sbjct: 354 IHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHMVGKWHLGMYRKECLPTRRGFDS 413

Query: 202 HFGYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           +FGYL G   YY H       S  V     D R     A      Y+T++F K AV LI 
Sbjct: 414 YFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEVAVGYKNMYSTNVFAKRAVDLIA 473

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
           + P DKPLFLYLA  + H     + L+ P+E +  + +I +  RR YA MV  +D++VG 
Sbjct: 474 NHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSFIQNKKRRNYAGMVSIMDEAVGN 528

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V +AL+R G+ +N+++IF +DNG  T+           G+N+P RG K TLWEGGV+   
Sbjct: 529 VTAALKRHGLWDNTVLIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGIG 579

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
            + SP ++Q    + +++HISDWLPTL   AGG  +     +DG D W ++   +PS R 
Sbjct: 580 FVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVNGT-KPLDGFDVWKTISEGSPSPRV 638

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEK----------KRTAAVRLDSWKLVLGTQEN 490
             +  +D     +  +P +R+ V  + D               AA+R  +WKL+ G    
Sbjct: 639 ELLHNID--PMFVDPSPCQRSKVTPSKDTSFPQDYSTFNTSIHAAIRHGNWKLLTGYPGC 696

Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
           G                                    N F P S +          ++  
Sbjct: 697 G------------------------------------NWFPPPSLL----------SVSS 710

Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
             +P P T S       +LF++  DP E+++++   P I  +L   L+Y+ +  VP  + 
Sbjct: 711 IESPDPPTKS------LWLFDIDRDPEERHDLSREHPSIVKKLLARLRYYSQDSVPIRYP 764

Query: 611 QPDLVQADPKRFNDTWSPW 629
             D    DPK     W PW
Sbjct: 765 DED-PYCDPKS-TGVWGPW 781



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++F K AV LI + P DKPLFLYLA  + H     + L+ P+E +  + +I +  RR
Sbjct: 459 YSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSFIQNKKRR 513

Query: 75  TYAALT 80
            YA + 
Sbjct: 514 NYAGMV 519


>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 486

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 148/411 (36%), Positives = 214/411 (52%), Gaps = 48/411 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ F+G  +I TP++DALA  G      Y QP+CTP+RA+LMTG+YP   G+Q   I
Sbjct: 41  GWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALMTGRYPFRYGLQTAVI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E  LP+ L++ GY+T  IGKWHLG   +++ P  RGFE  +G + G + 
Sbjct: 101 PSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQRGFEYQYGAMIGELD 160

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H          E    D  R+     +    Y T+L   +AV+ +E Q  D+P +LY
Sbjct: 161 YYTH---------SEHGVLDWFRDNEPVHEE--GYTTNLLGADAVKYLEKQKADRPFYLY 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           LA  A H        +APQE I+++ +I DP RRTYA MV  LDD++G V++AL +KG+ 
Sbjct: 210 LAFNAPHT-----PYQAPQEYIDRYTHIADPTRRTYAGMVACLDDNIGRVVAALDKKGLR 264

Query: 332 ENSIIIFMSDNGA---PTVE--YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
           EN++I+F SDNG    P       + S  +    N PYR  K +L+EGGV+V A+  W  
Sbjct: 265 ENTLILFHSDNGGTHNPMFAGVMADVSKIKIPCDNSPYRDGKGSLFEGGVRVCALANWPG 324

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           +I+  P     ++H +D   T    AG  TSR                         +DG
Sbjct: 325 KIK--PATVDGLIHATDIFTTFTKLAGASTSRC----------------------KPLDG 360

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
           +D W ++    PS R  V+ N++  +  AA+R   WKL+  T    ++D Y
Sbjct: 361 VDAWPTMAEGRPSLREEVVYNVEPYR--AALRQGDWKLIWRTTLPSSVDLY 409



 Score = 57.4 bits (137), Expect = 2e-05,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T+L   +AV+ +E Q  D+P +LYLA  A H        +APQE I+++ +I DP RR
Sbjct: 184 YTTNLLGADAVKYLEKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTRR 238

Query: 75  TYAALT 80
           TYA + 
Sbjct: 239 TYAGMV 244


>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
          Length = 515

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 143/349 (40%), Positives = 204/349 (58%), Gaps = 25/349 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+S+H +  IPTPNID+   NGI LN  YA P C+P+RASL+TG +  + G+  P +
Sbjct: 29  GWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFM 87

Query: 152 W-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              AE  G+P   + +PEY +E GY T   GKWHLG  + EY P  RGF+S +G++ G I
Sbjct: 88  NPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGI 147

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            YYDH+       T  ++ H  R  +S   D    Y+T+L   EA+ +I+++  D+PLFL
Sbjct: 148 GYYDHV------HTNRMDWH--RDGVSLNEDG---YSTELIADEAINIIKNKDDDRPLFL 196

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A  A H       +EAP+E +N F YI D   R YAA + KLD  +G +I++++  G+
Sbjct: 197 YVAFNAPHT-----PIEAPEEDVNNFLYIEDELDRNYAANISKLDIEIGRIINSIKDLGI 251

Query: 331 LENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           LE +IIIF+SDNG      P V+       +  GS    RG K +  +GG++VPA +W  
Sbjct: 252 LEETIIIFLSDNGPVFDINPIVKTIAPGLTKAKGSTAGLRGSKTSALDGGIRVPAAIWWK 311

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
            + +  + S Q + I D LPTL TA G       L IDG+D+WS+L+ N
Sbjct: 312 GVLEKSK-SDQFIFIQDLLPTLLTATGIGYDE-NLLIDGVDRWSNLITN 358



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 5/64 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T+L   EA+ +I+++  D+PLFLY+A  A H       +EAP+E +N F YI D   R
Sbjct: 172 YSTELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNFLYIEDELDR 226

Query: 75  TYAA 78
            YAA
Sbjct: 227 NYAA 230


>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  243 bits (619), Expect = 3e-61,   Method: Compositional matrix adjust.
 Identities = 147/405 (36%), Positives = 213/405 (52%), Gaps = 52/405 (12%)

Query: 82  STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYP 141
           +  L L    GW D+ +H S EI TP++D LA +G  L   Y QPVC+P+RA+LMTG+YP
Sbjct: 38  NVVLILADDMGWGDVGWHDS-EIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGRYP 96

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           +  G+Q   +      G+PL ER LP+ L+E+GY T   GKWHLG F+ EY P +RGF+ 
Sbjct: 97  MRHGLQVGVVRPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGFDH 156

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            +G+ NG + Y+ HI            G D  R+     D    Y+T L  +EA ++I  
Sbjct: 157 QYGHYNGALDYFTHIRD---------GGFDWHRDDRVNRDE--GYSTHLIGREATRIIGH 205

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
               KPLFLY+   A HA +     + P++    +  + +P RRTYA M+  +D++VG +
Sbjct: 206 HDTSKPLFLYVPFNAVHAPH-----QVPEKYKEPYSKLKEP-RRTYAGMLAAMDEAVGQI 259

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
            SA+  KGM  N+I +F SDNG P              SN P RG K TL+EGGV+VPA 
Sbjct: 260 ASAIDGKGMRSNTIFLFSSDNGGPAPG--------QVTSNGPLRGQKGTLYEGGVRVPAF 311

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
           + W   ++    V+   +HI D  PTL T AG                       P  ++
Sbjct: 312 ISWEGHLRPGTVVNAP-LHIVDLFPTLLTLAG----------------------APLEQS 348

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
              DG D W+++  + PS  +++L+N      + A+R+  WKL+L
Sbjct: 349 LPTDGRDAWTAIAHHGPSPHDAILLNTTPD--SGAIRVGDWKLIL 391


>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 497

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 168/539 (31%), Positives = 256/539 (47%), Gaps = 84/539 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H   +I  PN+  LA +G+I    Y  P CTPSRA++MTG YP  TG Q   +
Sbjct: 38  GWNDVHWHNP-DIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +   P GVPL  + LPE L+E+GY+T  +GKWHLGF + EY P  RGF+SH+G     + 
Sbjct: 97  FNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            YD  L+   S +    G+D R NL     + G Y   +    A +++ +   + PLFL 
Sbjct: 157 DYDK-LNGVLSPSA---GYDFRDNLEVVPKSDG-YLALMLGDRAEEIVNNHSPETPLFLV 211

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
                       KHLE P+E    +  I D   R +   +  +DD +G V+ AL+ + M 
Sbjct: 212 FT-----LDIPAKHLEIPEEYEELYADIEDDRTRQFYGKLTMMDDIIGGVVDALKDRDMY 266

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           ++S+ IF+ DNGA + +          GSNYP+RG+  TL+EG  +VP+I+    I++  
Sbjct: 267 DDSLFIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIEKTG 317

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             S ++  I D   T+   AG                        ++    +DGL  W +
Sbjct: 318 YKSNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSMWPT 353

Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
              + PS RN ++ NID+   +  AA+R+  +KL+ G  +                 L  
Sbjct: 354 WSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLITGHPDL----------------LYP 397

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           +  I  +  Y +          P S    + +T Q+A       PAP           YL
Sbjct: 398 YRDIRLTDNYYNYGD------PPASG---LPTTDQEANA-----PAPQNVE-------YL 436

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           FN+ +DP E+++I++   DI   L + L  HR+ L P  ++  +L  +    FND WSP
Sbjct: 437 FNIVDDPEERHDISADNQDIVQDLRDRLDEHRKLLHPPVNQTTELAGS-AANFNDVWSP 494


>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 505

 Score =  242 bits (617), Expect = 5e-61,   Method: Compositional matrix adjust.
 Identities = 176/557 (31%), Positives = 259/557 (46%), Gaps = 109/557 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +   + + TPNID LA  G+ L   YAQ  C PSR+ LM G+Y IHTG     +
Sbjct: 37  GYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCL 96

Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            G   R     +   LP  LR+ GY+T  IGKWHLG  R+E  P ++GF++ FGY     
Sbjct: 97  DGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTHKGFDTFFGYHGAAE 156

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           +YY H      +       HD+ RN+ + A D  G+Y+T  +TKEA  +I++    KP F
Sbjct: 157 NYYTH------TALGRPRCHDLWRNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFF 210

Query: 270 LYLAHLAAHAGNAGKHLEAPQE-TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           +YL++ A H       LE P    +  +    D +R+ YAAM   +D+ +G +   L+  
Sbjct: 211 MYLSYQAVHG-----PLEVPSNYRMPYYTSGVDESRQIYAAMATCMDEGIGNITETLKAS 265

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           GML N+I++F SDNGA          +   G N+P RG K T+WEGGV+V A + SP ++
Sbjct: 266 GMLNNTILVFSSDNGA---------QFHTGGRNWPLRGGKGTMWEGGVRVLAFVNSPLLK 316

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           ++  V+ +M+H SDW PT    AGG    + L  DG+DQW ++    P++R         
Sbjct: 317 KSRTVNHEMIHTSDWFPTFVRLAGGTVDDVEL--DGVDQWDTISTGAPTKRQ-------- 366

Query: 449 WSSLLLNTPSRRNSVL-INID--------EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQ 499
              LL+    RR ++  I  D        +  R A++ +  WKLV             GQ
Sbjct: 367 --ELLIQISGRRGALEDIGKDPWITNDYFDITRAASIIVGRWKLVT------------GQ 412

Query: 500 TRSNKV----PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA 555
            RS K     P LN+                     P  +I                   
Sbjct: 413 YRSPKTWVPPPELNY---------------------PEDHI------------------- 432

Query: 556 PMTPSPCTNGP-CYLFNLGNDPCEQNNIASSRPDISSQLYELLK--YHRRTLVPQSHEQP 612
            M PS    G   +LF+L  D  E+N+++S RP+I  +L + L+  Y        S   P
Sbjct: 433 -MPPSTEILGKMVWLFDLEEDQNEKNDLSSCRPEIVIKLLKRLRDYYSHSVFCGSSSYSP 491

Query: 613 DLVQADPKRFNDTWSPW 629
           +  Q       + W P+
Sbjct: 492 NCAQR-----GNQWGPY 503



 Score = 42.0 bits (97), Expect = 0.91,   Method: Compositional matrix adjust.
 Identities = 27/79 (34%), Positives = 43/79 (54%), Gaps = 7/79 (8%)

Query: 3   RNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE- 60
           RN+ + A D  G+Y+T  +TKEA  +I++    KP F+YL++ A H       LE P   
Sbjct: 174 RNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFFMYLSYQAVHG-----PLEVPSNY 228

Query: 61  TINQFQYITDPNRRTYAAL 79
            +  +    D +R+ YAA+
Sbjct: 229 RMPYYTSGVDESRQIYAAM 247


>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
           caulinodans ORS 571]
 gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
           caulinodans ORS 571]
          Length = 490

 Score =  240 bits (613), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 149/401 (37%), Positives = 215/401 (53%), Gaps = 57/401 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ FHGS+ I TPN+D LA  G  L   Y QP+CTP+RA+LMTG+YP+  G+Q   I
Sbjct: 58  GFADVGFHGSD-IKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVI 116

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E  LP+ L+ +GYST  IGKWHLG  ++++ P  RGF+  +G L G I 
Sbjct: 117 PSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEID 176

Query: 212 YYDHILSDQYSRTVELNG-----HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           +Y H          E +G      D ++ +   +D      T+LF  +A +LI       
Sbjct: 177 HYKH----------EAHGVVDWYRDNKQVVEEGYD------TELFGTDAARLIGAHDPKT 220

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLYLA  A H        +APQ  ++++  ITDP RR YAAM+  +DD +G V++AL+
Sbjct: 221 PLFLYLAFTAPH-----TPFQAPQAYVDRYPNITDPARRLYAAMISAMDDQIGKVVAALE 275

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL- 382
           ++GM EN++I+F SDNG    +        +     SN PYR  K TL+EGG +V A+  
Sbjct: 276 QRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRDGKGTLYEGGTRVAALAN 335

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W  +I   P  +  +MH+ D +PTL   AG        N+D               R+  
Sbjct: 336 WPGRI--TPGAADGVMHVVDMMPTLARLAGA-------NLD---------------RSKP 371

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
           +DG+D WS+L     S R+ V+ N++  +   AVR  +WKL
Sbjct: 372 LDGVDVWSALSSARTSPRDGVVYNVEPTQ--GAVREGNWKL 410



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 5/68 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T+LF  +A +LI       PLFLYLA  A H        +APQ  ++++  ITDP RR
Sbjct: 200 YDTELFGTDAARLIGAHDPKTPLFLYLAFTAPH-----TPFQAPQAYVDRYPNITDPARR 254

Query: 75  TYAALTKS 82
            YAA+  +
Sbjct: 255 LYAAMISA 262


>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
 gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
          Length = 564

 Score =  240 bits (612), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 179/566 (31%), Positives = 257/566 (45%), Gaps = 113/566 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G +DL  H ++ I TP+ D LA +G+ L+  Y  P C+P+RASL++G+YP+HTG     +
Sbjct: 75  GSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRYPLHTGCH-TIV 133

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E +G+PL E  LP+ LR  GY   A+GKWH+G  R   TP +RGF+S FG+  G   
Sbjct: 134 NDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQSFFGFYLGAQD 193

Query: 212 YYDHILSDQYSRTVELNGHDMR----RNLSTAWDTVGEYATDLFTKEAVQLIEDQPV--D 265
           Y  HI   +     E++  D R    R+ S   D  G Y+T +FT+EA+++IE+ P    
Sbjct: 194 YNTHIKQGERGNAYEMH-WDARGKCGRDCSRLVDERGNYSTHVFTREAIRVIENHPQRPH 252

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVGTV 321
           +PLFLYLAH A H  +     + P+     ++  T  N    R+TYA M+   D+S+G V
Sbjct: 253 EPLFLYLAHQAVHWPD-----QVPETYRKFYEGATYSNWTDQRKTYAGMLSAADESIGNV 307

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
             ALQ  GM EN++++F +DNG PT       +     SNYP RG K T++EGG      
Sbjct: 308 TKALQDAGMWENTLVVFTTDNGGPTAVCAAQGS-----SNYPKRGGKCTVYEGGTTGDGF 362

Query: 382 LWSPQIQQNPR-----------VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           +  P   +  R           +  ++ H+ DWLPTL    G   +  PL  DG++QW S
Sbjct: 363 VSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATPNGKPL--DGVNQWDS 420

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN---IDEKKRTAAVRLDSWKLVLGT 487
           +L   PS                   P  R  V +       +    A+R   WKL+ G 
Sbjct: 421 MLQREPS------------------APPPREEVFVGYAYFGNQWYGPAIRYKHWKLIQG- 461

Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
                                      +S   ++   L    FLP               
Sbjct: 462 ---------------------------QSGGPETSHDLPPGSFLPAP------------- 481

Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVP 606
              G  P          G   L++L +DP E  NIASS P I   L  +L++YH   + P
Sbjct: 482 ---GGAP----------GEYQLYDLQSDPSETQNIASSYPLIVQILQGKLIEYHASFVPP 528

Query: 607 QSHEQ--PDLVQADPKRFNDTWSPWI 630
            S++   P     +   F  TW PW 
Sbjct: 529 ISNDPTCPFTGTTNTSTFGPTWLPWC 554



 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 2/47 (4%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPV--DKPLFLYLAHLAAH 47
           R+ S   D  G Y+T +FT+EA+++IE+ P    +PLFLYLAH A H
Sbjct: 219 RDCSRLVDERGNYSTHVFTREAIRVIENHPQRPHEPLFLYLAHQAVH 265


>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 473

 Score =  239 bits (611), Expect = 2e-60,   Method: Compositional matrix adjust.
 Identities = 172/546 (31%), Positives = 255/546 (46%), Gaps = 114/546 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +H S EI TPNID LA  G+ L N Y  P+CTPSRA LMTG+Y IH+GMQ   +
Sbjct: 35  GWHDVGYHDS-EIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVL 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               PR +P  E  LP+ L++ GYST  +GKWHLGF + + TP +RGF++ FG+ N    
Sbjct: 94  VAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGFDTFFGWYNAA-- 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
               ++S   S      G + R + +T                            P+F+Y
Sbjct: 152 ----VISPFISCRRRSAGCEFRVSHTTV---------------------------PMFIY 180

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           L   A HA      L+ P +  + ++  I D +RRTYA M   +D+++G +   L+ KG+
Sbjct: 181 LPFQAVHAP-----LQVPDKYSDMYKDTIHDESRRTYAGMTSCMDEAIGNITRTLKDKGI 235

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
             N++I+F +DNG            R +G SNYP RG K + +EGG++ PA + SP +  
Sbjct: 236 WNNTVIVFSTDNGGA----------RTFGASNYPLRGQKASNFEGGIRGPAFVSSPLLDP 285

Query: 390 NPRVSL--QMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             R ++  ++M+I DW PT    AGG      PL  DG++QW ++    PS+R   +  +
Sbjct: 286 LVRGTINNELMYIGDWFPTFMNLAGGHVIGSKPL--DGINQWETISRGVPSKRVEIVYNI 343

Query: 447 DQ-WSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK 504
           D  W        S R++ + N   +   +AA+R+  WKL+ G            +TR   
Sbjct: 344 DPLWPGC---KSSTRDTWIGNPYFDICVSAAIRVGHWKLLTGHTGCSNW-----ETRPEI 395

Query: 505 VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
            P        +S+T +S                                          +
Sbjct: 396 AP--------KSRTVESSD----------------------------------------D 407

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
           G   L+++  D  E+ N+   RPDI   L + L YH  T VP        V+ADP R   
Sbjct: 408 GMVRLYHIKEDAEEKVNLVHYRPDIVFMLLQRLHYHNTTAVPPGEVDKFSVRADPARHGM 467

Query: 625 TWSPWI 630
            W+PWI
Sbjct: 468 AWNPWI 473


>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
          Length = 402

 Score =  239 bits (609), Expect = 4e-60,   Method: Compositional matrix adjust.
 Identities = 155/424 (36%), Positives = 223/424 (52%), Gaps = 48/424 (11%)

Query: 227 LNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG 283
           ++G D RR    A + +    YATD+FT+EA ++IE    +   +FL ++HLA H+GN+G
Sbjct: 1   MSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSG 60

Query: 284 -KHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
             HLE   +T N   F YI + NRR YA MV  LDDSVG +I +L    MLENSI++FMS
Sbjct: 61  PNHLEVLNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFMS 120

Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHI 400
           DNGAPT +      + N+GSN+P RG K ++ EGGV+  A +WSP +++   VS  + HI
Sbjct: 121 DNGAPTDD--PLWGHDNFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHI 178

Query: 401 SDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRR 460
           +DWLPTLYTAAGG+   L                        IDG++QW SL   +   R
Sbjct: 179 TDWLPTLYTAAGGNFEDL----------------------GKIDGVNQWKSLTDTSKQVR 216

Query: 461 NSVLINIDEKKRTAAVRLDSWKLVLGTQ-ENGTMDGYYGQTRSNKVPLLNFNAIVESKTY 519
           + +LINIDE +   A+  + WK+V   +  N   +  Y     N  P  N + +V S   
Sbjct: 217 SKILINIDETRGEEALIFNQWKVVKSNRTSNMVFNLKYAGDPGNAGPKYNMSGVVASTVG 276

Query: 520 QSLQQ---LSQNIFLPISNI-DKMRSTRQQATIH---CG---ANPAPMTPSPCTNGPCYL 569
           + L +   L+Q+  +  S   D    +R  A +    C    +NP P     C +G C L
Sbjct: 277 KRLSKINCLTQDSCMNASTARDVFYVSRSLAKVDGQTCPERVSNPNPDQQYECFDGYC-L 335

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV---QADPKRFNDTW 626
           F++  DPCE  N+A   P        +L+  ++ +  Q+   PD +    ADP+RF   W
Sbjct: 336 FDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKEMTMQA---PDPIIDPDADPRRFAGYW 392

Query: 627 SPWI 630
            PW+
Sbjct: 393 EPWL 396



 Score = 60.5 bits (145), Expect = 2e-06,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 4/72 (5%)

Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITD 70
          YATD+FT+EA ++IE    +   +FL ++HLA H+GN+G  HLE   +T N   F YI +
Sbjct: 22 YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIEN 81

Query: 71 PNRRTYAALTKS 82
           NRR YA +  S
Sbjct: 82 ENRRLYAGMVTS 93


>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 739

 Score =  239 bits (609), Expect = 5e-60,   Method: Compositional matrix adjust.
 Identities = 156/508 (30%), Positives = 239/508 (47%), Gaps = 107/508 (21%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   I  A+P  +PL E  LP+ L+E GY+T  +GKWH+GF+    TP  RGF+S
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377

Query: 202 HFGYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
            FG+L G   +Y H     + +     T  L+GHD+  N +  +D  G+Y+T LFT + +
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
             IE     KPLF+YLA+ A H       L+ P   ++ ++ I D NRRTYA MV  +D+
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHLP-----LQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
            +G V  AL+  G+ +N+IIIF +DNG P          ++  +N+P RG K  +WEGG+
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGPI---------QSGANNWPLRGSKGNMWEGGI 543

Query: 377 KVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
                + SP +  + +  V+  +MH+SDWLPT+     G        ++G          
Sbjct: 544 HGVGFVHSPLLPTSVKGTVNHDLMHVSDWLPTIVAGIAGGA------LNG---------- 587

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT-----------AAVRLDSWKL 483
                   +DG D W+S+     S R  +L NID KK+            AA+R+  WK+
Sbjct: 588 -----TKPLDGFDVWNSISGTASSPRTEILHNIDLKKKQPANEKFDTSIRAALRVGDWKI 642

Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL--PISNIDKMRS 541
             G   + T                                 + N+++  P S +D +  
Sbjct: 643 KTGADYDVT---------------------------------THNVWMPPPESGLDTIYP 669

Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
           T +                   N   +LFN+  DP EQ +++   PD+ ++L + L+ + 
Sbjct: 670 TEK------------------PNQKVWLFNITADPLEQTDLSEKYPDVVNELVQKLEEYY 711

Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
              VP  +   DL +ADP     +W PW
Sbjct: 712 EGSVPMRYPDEDL-EADPALRGGSWGPW 738



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 5/80 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +  N +  +D  G+Y+T LFT + +  IE     KPLF+YLA+ A H       L+ P  
Sbjct: 413 LHANKTPVFDYEGQYSTHLFTNKTIDAIERHDKTKPLFMYLAYQAVHLP-----LQVPDR 467

Query: 61  TINQFQYITDPNRRTYAALT 80
            ++ ++ I D NRRTYA + 
Sbjct: 468 YMDPYKNIADENRRTYAGMV 487


>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 496

 Score =  238 bits (607), Expect = 8e-60,   Method: Compositional matrix adjust.
 Identities = 168/551 (30%), Positives = 256/551 (46%), Gaps = 103/551 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H S  I TP ID LA +G+ LNN Y    C PSR  L++G++ I  G+Q   I
Sbjct: 36  GWNDVGYHNS-YIKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G  PRG+PL E  + + L+E+GY+T  IGKWH G +     P  RGF++ FGYL     
Sbjct: 95  -GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLGSSDD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           +Y HI+       + LN      +    +   G+Y+T ++  EA  +I     +KPLFL 
Sbjct: 154 HYTHIIMSNGLADLRLN------DECVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLM 207

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           LA  A H     K ++ P   +  ++  I D  RRTYA M   +D+++  +  AL+ KGM
Sbjct: 208 LAFSAVH-----KPIQVPDVYMQYYKSTIHDNERRTYAGMASCVDEAIANITEALENKGM 262

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+NS+I+  +DNG  +V           G+N+P RG KN+ WEGGV+  A ++S  +  +
Sbjct: 263 LKNSVIVLTTDNGGGSV-----------GNNWPLRGRKNSHWEGGVRGVAFVYSDLLPMD 311

Query: 391 PRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
            R +   ++MHI+DW PTL    GG+                     P  R  N+ G+DQ
Sbjct: 312 VRGTENSELMHITDWFPTLVKLGGGN---------------------PCER-KNLYGVDQ 349

Query: 449 WSSLLLNTPSRRNSVLINID------EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           W  +    PS R  VLI +D      E+ RT   + D + +             +   + 
Sbjct: 350 WEMIRGMEPSARGEVLIALDTTRKASEEYRTGPFKYDKFDITT-----------FAAIKM 398

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPI--SNIDKMRSTRQQATIHCGANPAPMTPS 560
            K  LL  ++     TY++   +    +  I   + D  R  R                 
Sbjct: 399 GKWKLLTGDS-----TYRNFWDMPDPEYGVIDDEDFDDTRMVR----------------- 436

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ-LYELLKYHRRTLVPQSHEQPDLVQADP 619
                   L++L +DP E+ +++  R DI  + L +L  Y +  + PQ   +P L     
Sbjct: 437 --------LYDLVDDPSERTDVSEERQDIVHEMLAKLDGYEKAAVPPQMQTKPRL----D 484

Query: 620 KRFNDTWSPWI 630
               +   PWI
Sbjct: 485 NHVLECMGPWI 495



 Score = 40.8 bits (94), Expect = 2.4,   Method: Compositional matrix adjust.
 Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 6/79 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
              +   G+Y+T ++  EA  +I     +KPLFL LA  A H     K ++ P   +  +
Sbjct: 173 CVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYY 227

Query: 66  QY-ITDPNRRTYAALTKST 83
           +  I D  RRTYA +    
Sbjct: 228 KSTIHDNERRTYAGMASCV 246


>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
          Length = 518

 Score =  238 bits (606), Expect = 9e-60,   Method: Compositional matrix adjust.
 Identities = 166/577 (28%), Positives = 268/577 (46%), Gaps = 98/577 (16%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
            L +S  +  L++  +GW D   HGS ++ TPN+DA+A +GI+L   Y Q VC+P+R+SL
Sbjct: 13  CLVQSKNVVYLMIDDFGWGDFGVHGS-KLETPNLDAIARDGILLEKYYTQQVCSPTRSSL 71

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG+YPI  GMQ   I   +  G+P     LP+ L+  G++T  +GKWH G       P 
Sbjct: 72  LTGRYPIRYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPF 131

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
            RGF+S++GYL G   +Y  I      +  E  G D          T G Y T++++ + 
Sbjct: 132 NRGFDSYYGYLQGAEDHYSRI----QCQAKEWCGVDFCTENGPTNSTWGTYGTEIYSAQV 187

Query: 256 VQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
            Q+++     +KP FLY A    H       L+AP+    +F +I DP+RRTYAAMV  +
Sbjct: 188 AQVLDKVSKEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIM 242

Query: 315 D-DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           D ++VG  ++ L  + + ++++II  +DNG  T         R+ G+N+P R  K +L+E
Sbjct: 243 DEETVGDFMTMLHERNLYDDTLIIITADNGGET---------RSGGNNFPLRSQKWSLYE 293

Query: 374 GGVKVPAILWSPQIQQNPRVSLQ-MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           GGV V A+L    I+   + +   ++H+SDW  T+  A          ++DG+ QW ++L
Sbjct: 294 GGVHVNAMLSGGLIKDQSKSTFDGLVHVSDWRETILEAMECKPPADKSDLDGISQWQAIL 353

Query: 433 LNTPSRRNSNIDGLD-----------QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
            N    R S +  +D           QW   L    S       NI+ +   +A+R   W
Sbjct: 354 QNASPPRQSIVHNIDPIKTVKGTDERQWVQKLAADAS------FNINSQ---SAIRYGRW 404

Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
           KL+ G    G  DG        + P+  +  +++   ++S + L++ +            
Sbjct: 405 KLLTGDPAFGVPDGNIPPPEDAR-PV--WQKLLDDSMFESSETLTRLV------------ 449

Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
                                      L+++  DP E+++I+ + PDI     +LL  H 
Sbjct: 450 --------------------------QLYDIEEDPFEKHDISDNHPDIVKIGLDLLSRHN 483

Query: 602 RTLVPQSHEQPDLVQADPKR--------FNDTWSPWI 630
            T V      P+    DP          F   W PW+
Sbjct: 484 ETAV-----WPNFPHTDPNSNPETRSDGFAGFWWPWV 515



 Score = 47.8 bits (112), Expect = 0.020,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 6/71 (8%)

Query: 11  TVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           T G Y T++++ +  Q+++     +KP FLY A    H       L+AP+    +F +I 
Sbjct: 174 TWGTYGTEIYSAQVAQVLDKVSKEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIE 228

Query: 70  DPNRRTYAALT 80
           DP+RRTYAA+ 
Sbjct: 229 DPDRRTYAAMV 239


>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 433

 Score =  236 bits (601), Expect = 4e-59,   Method: Compositional matrix adjust.
 Identities = 169/493 (34%), Positives = 233/493 (47%), Gaps = 73/493 (14%)

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           +TGMQ   +   EPRG+PL  + LPE+L +LGY      K  L    +E   +Y      
Sbjct: 1   YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTL--LHKELRFVYA----- 53

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           F YL                      G D RR L    +  G Y T+L T+EA+++IE  
Sbjct: 54  FPYLR--------------------RGFDFRRGLLGGREFDGRYYTELMTEEAIRIIETH 93

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
           P++KPLFLYL+HLA H+GN    L+ P++  + +  I   NR  YA MV  LD+SVG V+
Sbjct: 94  PLEKPLFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVV 153

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL R+GML +++++F SDNGA            N+ S +P++G K T WEGGV+ PAI+
Sbjct: 154 EALGRRGMLGDTVLVFSSDNGA-----NSRGENPNYASAWPFKGQKITPWEGGVRAPAII 208

Query: 383 WSPQIQ-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           WSP +     R    + HISDWLPTLY  AGGD S L                       
Sbjct: 209 WSPSLSGTQGRDYSNIFHISDWLPTLYQLAGGDPSDL----------------------G 246

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
           +IDG+ Q  +L   T + R  VLINID      A+    +KLV GT + G +D ++ Q  
Sbjct: 247 DIDGISQLEALRHGTEAPRKEVLINIDPIDNVTAIIEGHFKLVSGTVKGGALDDWF-QIP 305

Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA---TIHCGANPAPMT 558
            N        A    +T    +       L  S  D    + Q +    + CG      T
Sbjct: 306 GNV-------AWDTDRTRHECETSVAAQVLRNSGHDVTCGSGQGSYAIPVKCGRRD---T 355

Query: 559 PSPCTN--GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
             PCT    PC LF+L  DPCE NN+A    ++  +L   L  ++ T     +  P+  +
Sbjct: 356 SKPCTPTLAPC-LFDLSEDPCEYNNVAEQHSEVVRRLLRKLAVYKATSWSPGN-VPEDPR 413

Query: 617 ADPKRFNDTWSPW 629
           A P   N  W  W
Sbjct: 414 AHPALHNGAWVSW 426



 Score = 70.5 bits (171), Expect = 3e-09,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 50/82 (60%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR L    +  G Y T+L T+EA+++IE  P++KPLFLYL+HLA H+GN    L+ P++
Sbjct: 63  FRRGLLGGREFDGRYYTELMTEEAIRIIETHPLEKPLFLYLSHLAPHSGNLRDPLQVPKK 122

Query: 61  TINQFQYITDPNRRTYAALTKS 82
             + +  I   NR  YA +  +
Sbjct: 123 FSDMYHDIGSRNRTIYAGMVSA 144


>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
          Length = 545

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 162/516 (31%), Positives = 243/516 (47%), Gaps = 106/516 (20%)

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           RASL    Y IHTG+Q   I  A+P  +P+ E  LP+ L+E GY+T  +GKWH+GF+   
Sbjct: 117 RASLC---YWIHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDA 173

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQY-----SRTVELNGHDMRRNLSTAWDTVGEY 246
            TP  RGF+S FG+L G   +Y  +LS +        T  L+GHD+  N +  +D  G+Y
Sbjct: 174 CTPTERGFDSFFGFLIGAEDHYTRMLSCEAYNANDGSTKTLSGHDLHANKTPVFDYEGQY 233

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           +T LFT + V +IE     KPLF+YLA+ A HA      L+ P   ++ ++ I D  RRT
Sbjct: 234 STHLFTNKTVDVIERHDKTKPLFIYLAYQAVHAP-----LQVPDSYMDPYKNIADKTRRT 288

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YA MV  +D+ +G V  AL+  G+ +N+IIIF +DNG          +    G+N+P RG
Sbjct: 289 YAGMVSCMDEGIGNVTQALKDAGLYDNTIIIFSTDNGG---------HIDRGGNNWPLRG 339

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
            K ++WEGG+     + SP +    +  V+ ++MH+SDWLPT+     G           
Sbjct: 340 SKGSMWEGGIHGVGFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAGIAG----------- 388

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR-----------T 473
                +L    P      +DG + WSS+     S +  +L NI+ KK+            
Sbjct: 389 ----GALNGTKP------LDGYNVWSSIRGTASSPQTELLHNINPKKKRPENEKFDTSMK 438

Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
           AA+R+  WK+  G   + T+                FN  +                 P 
Sbjct: 439 AALRVGDWKIKTGANYDVTV----------------FNEWMPP---------------PE 467

Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
           S +D +  T +                   N   +LFN+  DP EQ ++A   P +  +L
Sbjct: 468 SGLDTIYPTEKP------------------NQKVWLFNITADPLEQTDLAEKYPGVVDEL 509

Query: 594 YELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
              L+ + +  VP  +   DL +ADP     +W PW
Sbjct: 510 VHKLEEYYKGSVPMRYPDEDL-EADPALHGGSWGPW 544



 Score = 66.6 bits (161), Expect = 4e-08,   Method: Compositional matrix adjust.
 Identities = 40/99 (40%), Positives = 56/99 (56%), Gaps = 4/99 (4%)

Query: 108 NIDALAYNGIILNNMYAQPVCTPSRASLMT---GKYPIHTGMQGPPIWGAEPRGVPLTER 164
           NI A A   +  + + AQ  C  + A + T   G   IHTG+Q   I  A+P  +P+ E 
Sbjct: 2   NIPAKAVLHLYADKIKAQ-RCILAAAIVGTPDFGHNMIHTGLQHGVIRHAQPNCLPMDEV 60

Query: 165 FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            LP+ L+E GY+T  +GKWH+GF+    TP  RGF+S F
Sbjct: 61  TLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDSFF 99



 Score = 58.2 bits (139), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 5/82 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +  N +  +D  G+Y+T LFT + V +IE     KPLF+YLA+ A HA      L+ P  
Sbjct: 219 LHANKTPVFDYEGQYSTHLFTNKTVDVIERHDKTKPLFIYLAYQAVHAP-----LQVPDS 273

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            ++ ++ I D  RRTYA +   
Sbjct: 274 YMDPYKNIADKTRRTYAGMVSC 295


>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 508

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 162/526 (30%), Positives = 257/526 (48%), Gaps = 77/526 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +   + + TPNID LA  G+ L   YAQP C PSR+ LM G+Y IHTG      
Sbjct: 37  GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCK 96

Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            G   +     +   +P  L+E GY+T  IGKWHLG  R E  P  +GF++ FGYL+ + 
Sbjct: 97  DGKRSQLCMHPDTITMPMKLKENGYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIE 156

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            Y+ H            N HD  RN    A D  G+Y+T LFTKEA  +I++  +++P+F
Sbjct: 157 DYFTHYTP------AGANCHDFWRNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMF 210

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQY--ITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           +YL++ A H       L+ P    + +         R+TYAAM   +D+ +G V S L++
Sbjct: 211 MYLSYQAVHGP-----LQVPPHYYDMYNDGPRVSTTRQTYAAMATCMDEGIGNVTSTLKQ 265

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           +GM +N+++IF+SDNGA          +   G N+P RG K +++EGG++V + + SP I
Sbjct: 266 RGMWDNTVLIFVSDNGA---------QFHTGGRNWPLRGGKGSVFEGGIRVVSFVTSPLI 316

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           ++  R+S  ++H+ DW PTL   AGG    + L                       DG++
Sbjct: 317 ERPRRISEDIVHLMDWFPTLTHLAGGTVDDVML-----------------------DGVN 353

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVR-LDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
            W ++   T S R  +L++I         R LD W        NG  D     TR++ + 
Sbjct: 354 VWDTISKGTLSPRRELLVHISATDGVTNYRGLDPW------YNNGMFD----ITRTSAII 403

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
             ++  +   K Y S   +   +    ++I KM                   P   T+  
Sbjct: 404 AGDWKLVTSQKRYDSSWAVPPEL---ANSIGKMSINE---------------PKSDTDKL 445

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQ 611
             L++L  D  EQ+++++ RPDI  +L E L KY + ++  ++ ++
Sbjct: 446 VLLYHLTEDKREQHDLSNLRPDIVFKLLEKLRKYQKHSVYCETSKR 491



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 7/80 (8%)

Query: 2   RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           R +   A D  G+Y+T LFTKEA  +I++  +++P+F+YL++ A H       L+ P   
Sbjct: 174 RNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHGP-----LQVPPHY 228

Query: 62  INQFQY--ITDPNRRTYAAL 79
            + +         R+TYAA+
Sbjct: 229 YDMYNDGPRVSTTRQTYAAM 248


>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 500

 Score =  235 bits (600), Expect = 5e-59,   Method: Compositional matrix adjust.
 Identities = 144/401 (35%), Positives = 217/401 (54%), Gaps = 33/401 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++      TP ID LA +G+ L N Y    C PSR  LMTG++ I  G+Q    
Sbjct: 34  GWNDVGYNNP-VFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQHDD- 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  PR +PL E  + E L+ +GYST  +GKWH GF+     P  RGF++ FG++   I 
Sbjct: 92  YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRGFDTFFGFVGAGID 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y H  SD ++       H++R+N    A   +G+Y+T +F  E   +I     +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL+  A HA      LE P   + Q++  I D +RRTYAAM   +D+++  +  AL+ K 
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSII+F SDNG   V         + G+N+P RG K+T WEGGV+    ++   + +
Sbjct: 260 MLKNSIIVFTSDNGGALVN--------SAGNNWPLRGGKHTSWEGGVRAIGFVYGDLLPR 311

Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL- 446
           + R +    +MHI+DW PTL T AGG+   +   + G+DQW+ +     S R+  +  L 
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMITGKDKSGRDDVLITLQ 370

Query: 447 -DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            D  +     T   +N+      +    AA+++  WKL+ G
Sbjct: 371 EDYKAGAHYRTGIYKNNAF----DITTHAAIKMGKWKLLTG 407



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
           +G+Y+T +F  E   +I     +KPLFLYL+  A HA      LE P   + Q++  I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231

Query: 71  PNRRTYAALTKST 83
            +RRTYAA+T   
Sbjct: 232 EDRRTYAAMTSCV 244


>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  235 bits (599), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 166/580 (28%), Positives = 262/580 (45%), Gaps = 116/580 (20%)

Query: 67  YITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           YI+ P       + K+    +++++    GWND+ +H   +I  P +  LA +G+I N  
Sbjct: 15  YISKP------CIAKAEQPHIVLIFVDDLGWNDVEWHNP-DIKMPVLSKLAADGVIFNQS 67

Query: 123 YAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK 182
           Y  P CTPSRA++MTG YP  TG Q   I+   P GVPL  + LPE L+E+GY T  +GK
Sbjct: 68  YTHPTCTPSRAAMMTGMYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGYFTHMVGK 127

Query: 183 WHLGFFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTA 239
           WHLGF + EY P  RGF+SH+G     + +YD    +LS          G+D R N+   
Sbjct: 128 WHLGFCKEEYLPTSRGFDSHYGLWTLGVGHYDKMNGVLSPS-------EGYDFRDNIGVV 180

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
             +  EY T +  + A  +I       PLFL             KHLE P      +  I
Sbjct: 181 PKS-DEYLTLMLAERAEHIINGHYNKHPLFLQFT-----MDIPAKHLEIPDTFEALYCDI 234

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            D   R +   +  +D  +G V+ AL+ + M ++++++F+ DNGA   +          G
Sbjct: 235 EDERTRKFYGKLSLMDHVIGRVVDALKAREMWDDTVLMFIGDNGALASQA---------G 285

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SNYP+RG+  TL+EG  +VP+++    +++   V+ ++ ++ D   T+   AG ++  + 
Sbjct: 286 SNYPFRGIAGTLFEGATRVPSLIHGSMLRETGYVNNELWNLVDLHRTILALAGAESGSM- 344

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVR 477
                                  +DG++ W +      S R   + NID+   +  AA+R
Sbjct: 345 ---------------------CELDGMNMWDTFSKGKKSPREEHVYNIDDDPISPGAAIR 383

Query: 478 LDSWKLVLGT-------QENGTMDGY--YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQN 528
           +  +KL+ G        ++    DG+  YG   ++ +P                      
Sbjct: 384 IGDYKLITGHPDLLYPYRDISLTDGWYNYGDPPASGLP---------------------- 421

Query: 529 IFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD 588
                       ST Q+A       P P+          YLFN+  DP E+NNIA   PD
Sbjct: 422 ------------STDQEANA-----PPPLNVE-------YLFNVVVDPSEKNNIADQHPD 457

Query: 589 ISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           I   L + L  HR+ L+P  ++  +L   D   +   WSP
Sbjct: 458 IVQNLRDRLDEHRKLLIPPVNQTTELA-GDAANYCGVWSP 496


>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
          Length = 431

 Score =  235 bits (599), Expect = 6e-59,   Method: Compositional matrix adjust.
 Identities = 164/504 (32%), Positives = 241/504 (47%), Gaps = 90/504 (17%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   IW  +P  +PL E+ LPE L+E GY T  +GKWHLG +++E  P +RGF++
Sbjct: 1   IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60

Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +FGYL G   YY H   +L    ++ +     D R     A      Y+T+LFT+ A+ L
Sbjct: 61  YFGYLLGSEDYYSHDRCVLIK--AKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDL 118

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I     +KPLFLYLA  + H     + LE P+E +  +  I D  RR YA MV  +D++V
Sbjct: 119 IAHHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAV 173

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G +  AL+R G+  N++++F +DNG  T+           G+N+P RG K TLWEGGV+ 
Sbjct: 174 GNLTDALKRYGLWNNTVLVFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRG 224

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++Q    S +++HISDWLPTL   AGG T+     +DG D W ++    PS 
Sbjct: 225 VGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWKAISEGRPSP 283

Query: 439 RNS---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
           R     NID          G+D  +SL  +     +  L NI      AA+R   WKL  
Sbjct: 284 RVELLHNIDPMFVDDSPCLGIDNRTSLPQSNSFPWDDTLFNI---SMHAAIRHGKWKL-- 338

Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQ 545
                  + GY G +     P L   + ++S                             
Sbjct: 339 -------LTGYPGCSHWFPPPSLFSESKIQS----------------------------- 362

Query: 546 ATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
                        P P T    +LF++ +DP E+N+++   P +  +L   L+Y+ +  V
Sbjct: 363 -------------PDPPTKR-LWLFDIVHDPEEKNDLSEKYPHVVKKLLSRLQYYYKRSV 408

Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
           P  +   D    DP+     W PW
Sbjct: 409 PVFYPDED-PSCDPEA-TGAWGPW 430



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T+LFT+ A+ LI     +KPLFLYLA  + H     + LE P+E +  +  I D  RR
Sbjct: 106 YSTNLFTERAIDLIAHHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRR 160

Query: 75  TYAALT 80
            YA + 
Sbjct: 161 HYAGMV 166


>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 843

 Score =  235 bits (599), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 148/401 (36%), Positives = 218/401 (54%), Gaps = 33/401 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++      TP ID LA +G+ L N Y    C PSR  LMTG++ I  G+Q    
Sbjct: 377 GWNDVGYNNP-VFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQRHG- 434

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  PR +PL E  + + L+++GYST  IGKWH GF+     P  RGF++ FG++   I 
Sbjct: 435 FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIE 494

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y H  SD ++       H++R+N    A   +G+Y+T ++  E   +I     +KP FL
Sbjct: 495 HYTH--SDHFNHM-----HNLRKNDDCIAKQYIGKYSTTIYANEGKNIINAHDQNKPFFL 547

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL+  A H       LE P   + Q++  I D +RRTYAAM   +D+++  +  AL+ K 
Sbjct: 548 YLSFSAVHTP-----LEVPSSYLKQYESTIYDEDRRTYAAMTSCVDEAIANITEALEEKD 602

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSII+F SDNG   VE       R+ GSN+P RG K+T+WEGGVK    ++   +  
Sbjct: 603 MLKNSIIVFTSDNGG-AVE-------RSAGSNWPLRGGKHTIWEGGVKAVGFVYGELLPH 654

Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL- 446
           + R +    +MHI+DW PTL T AG +   +   + G+DQW  +     SRR+  +  L 
Sbjct: 655 HSRGTENTGLMHITDWFPTLVTLAGAN-PHVGKKLYGVDQWKMVTGEGKSRRDDVLIELV 713

Query: 447 -DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            D  +S    T   +N      D   R AA+++  WKL+ G
Sbjct: 714 KDNKASTRYRTGIFKNHPF---DITTR-AAMKMGKWKLLTG 750



 Score = 48.9 bits (115), Expect = 0.008,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 6/73 (8%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
           +G+Y+T ++  E   +I     +KP FLYL+  A H       LE P   + Q++  I D
Sbjct: 520 IGKYSTTIYANEGKNIINAHDQNKPFFLYLSFSAVHTP-----LEVPSSYLKQYESTIYD 574

Query: 71  PNRRTYAALTKST 83
            +RRTYAA+T   
Sbjct: 575 EDRRTYAAMTSCV 587


>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
          Length = 339

 Score =  235 bits (599), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 130/316 (41%), Positives = 182/316 (57%), Gaps = 21/316 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+S HGS +IPTP+IDA+A++G+ L N + QPVCTP+R++ ++G++ IHTG+  P  
Sbjct: 44  GWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSGRHVIHTGIYMPFA 103

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G   R + L+   LP YL++LGY T A+GKWHLG    +  P  RGF+ + GY +G   
Sbjct: 104 QGTALR-LNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRGFDEYLGYWSGAED 162

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY H   D +       G+D +     A      Y+T +F + AV  I +   ++PLFLY
Sbjct: 163 YYTH---DTHG------GYDFQDGTECAIKYNNTYSTYIFAERAVNTILEADPEQPLFLY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            A    H       LEAP E + +F +I +  R+  AAM   LDD+VG +  AL+R  + 
Sbjct: 214 TAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNITDALKRSRIA 268

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+I+IF SDNG P  +    SN      NYP RG KNTLW GG +V  ++    I+   
Sbjct: 269 DNTILIFTSDNGGPVHDENTESN------NYPLRGGKNTLWNGGTQVVGMIAGKGIENPG 322

Query: 392 RVSLQMMHISDWLPTL 407
                MMH SDWLP+L
Sbjct: 323 TDCHGMMHASDWLPSL 338



 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 36/67 (53%), Gaps = 5/67 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T +F + AV  I +   ++PLFLY A    H       LEAP E + +F +I +  R+
Sbjct: 188 YSTYIFAERAVNTILEADPEQPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQ 242

Query: 75  TYAALTK 81
             AA+T 
Sbjct: 243 YVAAMTS 249


>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 539

 Score =  234 bits (598), Expect = 7e-59,   Method: Compositional matrix adjust.
 Identities = 164/527 (31%), Positives = 260/527 (49%), Gaps = 78/527 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +   + + TPNID LA  G+ L   YAQP C PSR+ LM G+Y IHTG      
Sbjct: 72  GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFNYKCT 131

Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            G+  +     +   +P  L+E GY+T  +GKWHLG  R E  P  +GF++ FGY     
Sbjct: 132 DGSGSQLCMHPDTITIPMKLKENGYATHMVGKWHLGNIRWECLPNAKGFDTFFGYHGASE 191

Query: 211 SYYDHIL-SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY H   + +  R +  N  D+      A +  G+Y+T +FT EA+ +IE+  V KP+F
Sbjct: 192 DYYTHFSPAGRECRDLWRNRDDV------AQEYYGQYSTHIFTNEALNIIENHDVSKPMF 245

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL + A H       L+ P++  N ++   D  RR YAAM   +D+ +G V +AL+++G
Sbjct: 246 LYLPYQAVHGP-----LQVPEKYYNMYERPLDDIRRKYAAMATCMDEGIGNVTNALKQRG 300

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           M +N++ IF+SDNGA          +   G N+P RG K +++EGG++V + + SP I++
Sbjct: 301 MWDNTVFIFVSDNGA---------QFPTGGRNWPLRGGKGSVFEGGIRVVSFVTSPLIER 351

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R S +M+H++DWLPTL   AGG    + L  DG++ W ++   + + R   +  +   
Sbjct: 352 PQRSSNEMIHLTDWLPTLTHLAGGTVDDVEL--DGVNMWDTISKGSATPRKELLVQISPP 409

Query: 450 SSL--LLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
             +  + + P  +N +     +    +A+ + +WKLV       T +GYYG    + VP 
Sbjct: 410 HGVRDIGHEPWYKNGMF----DVTTNSAIIVGNWKLV-------TNEGYYG---GHVVP- 454

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
                           +L   I                     GA   P  P        
Sbjct: 455 ---------------PELESQI---------------------GAERIP-PPESKRGRLV 477

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
           +LF++G D  E++ +++ RPDI  +L E L+ ++   V   H    L
Sbjct: 478 WLFHIGEDTKEKHEMSNERPDIVFRLLERLRGYQSHAVFCRHSYASL 524



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 5/79 (6%)

Query: 2   RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           R     A +  G+Y+T +FT EA+ +IE+  V KP+FLYL + A H       L+ P++ 
Sbjct: 209 RNRDDVAQEYYGQYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHGP-----LQVPEKY 263

Query: 62  INQFQYITDPNRRTYAALT 80
            N ++   D  RR YAA+ 
Sbjct: 264 YNMYERPLDDIRRKYAAMA 282


>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
          Length = 471

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 158/498 (31%), Positives = 242/498 (48%), Gaps = 73/498 (14%)

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L++ +Y IHTG+Q   I   +P  +PL    LP+ L+E GYST  +GKWHLGF+R+E  P
Sbjct: 3   LVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMP 62

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTK 253
             RGF++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T+
Sbjct: 63  TQRGFDTFFGSLLGSGDYYTHYKCD----SPRMCGYDLYENDNAAWDHDNGIYSTQMYTQ 118

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           +  Q++      KP+FLY+A+ A H+      L+AP     +++ I + NRR YAAM+  
Sbjct: 119 KVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLAC 173

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           LD+++  V  AL+R G   NS+II+ SDNG  PT            G+N+P RG K T W
Sbjct: 174 LDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG----------GNNWPLRGSKGTYW 223

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG++    + SP ++    V  +++HI+DW PTL T A G      L +DG D W ++ 
Sbjct: 224 EGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTL-LDGYDVWETI- 281

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
             + SRR+  +D L     +     +   +    I      +A+R++ WKL+ G      
Sbjct: 282 --SESRRSPRMDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVNHWKLLTGNP---- 335

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
             GY     S+ VP  +F                       SN+   R   ++ +   G 
Sbjct: 336 --GY-----SDWVPPQSF-----------------------SNVGPSRWHNERVSWTAGK 365

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
                          +LFN+  DP E+ +++S  PD+  QL   L    +T VP  +   
Sbjct: 366 T-------------VWLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPK 412

Query: 613 DLVQADPKRFNDTWSPWI 630
           D  +++P      W PW 
Sbjct: 413 D-PRSNPNFNGGVWGPWF 429



 Score = 48.9 bits (115), Expect = 0.007,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 96  LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 150

Query: 60  ETINQFQYITDPNRRTYAAL 79
               +++ I + NRR YAA+
Sbjct: 151 TYYERYRSINNINRRRYAAM 170


>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
          Length = 551

 Score =  234 bits (597), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 135/358 (37%), Positives = 200/358 (55%), Gaps = 32/358 (8%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           I TG+Q   IW  +P  VPL E+ LP++L+E GY+T  +GKWHLG +R+E  P  RGF++
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182

Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +FGYL G   YY H    L D  +  V     D R     A      Y+T++FTK A  L
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALN--VTRCALDFRDGEEVATGYKNMYSTNIFTKRAATL 240

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++V
Sbjct: 241 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 295

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+ +G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+ 
Sbjct: 296 GNVTAALKSRGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 346

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++Q    + +++HISDWLPTL   AGG T+     +DG D W ++   +PS 
Sbjct: 347 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 405

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
           R   +  +D   + + ++P  RNS+ +  D+              AA+R  +WKL+ G
Sbjct: 406 RVELLHNID--PNFVDSSPCPRNSMALAKDDSSLPEHSAFNTSVHAAIRHGNWKLLTG 461



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 228 YSTNIFTKRAATLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 282

Query: 75  TYAALT 80
            YA + 
Sbjct: 283 HYAGMV 288


>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
 gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
          Length = 357

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 141/405 (34%), Positives = 204/405 (50%), Gaps = 52/405 (12%)

Query: 77  AALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           AA T +    LLIV    GW+D+S++  N +  PN+  LA  G+I N  Y+Q +CTPSR 
Sbjct: 1   AAQTTTKPHILLIVADDLGWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYSQRLCTPSRT 59

Query: 134 SLMTGKYPIHTGMQ-GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           +L+TGK+P   GMQ    ++     G+PL E  LP+ L++LGY+T  +GKWHLG  + EY
Sbjct: 60  ALLTGKFPYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEY 119

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           TP  RGF+S +G+ +G   YY H+            G D     ++  D  G Y+T+ F 
Sbjct: 120 TPTERGFDSFYGFHHGGEDYYTHMSE---------RGLDFWDGRTSVSDRNGVYSTESFA 170

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           + A  +I     + PLFLYL   + H  +     + P   +  F  I D NR++   M  
Sbjct: 171 RRAENIISQHDPNTPLFLYLPFQSVHTPH-----QVPSSYLQTFSTIQDDNRKSILGMAT 225

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            LDD++  V  AL+ KG+ +N++ IFMSDNG          NY    SN+P RG K TLW
Sbjct: 226 ALDDAIKRVTDALKGKGLWDNTLTIFMSDNGG---------NYVQGQSNWPLRGAKGTLW 276

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG +VPA +    +++       MMH  D LPTL + AGG                   
Sbjct: 277 EGGTRVPAFVHGNMLERTGYTYHGMMHGVDILPTLVSVAGG------------------- 317

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
                  ++ +DG + W S+     S R   + +ID +    A+R
Sbjct: 318 -----TEDAGLDGKNMWQSISTGADSPRTEFVYDIDSRGERYAIR 357


>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
           obscuriglobus UQM 2246]
          Length = 459

 Score =  234 bits (596), Expect = 1e-58,   Method: Compositional matrix adjust.
 Identities = 147/422 (34%), Positives = 212/422 (50%), Gaps = 54/422 (12%)

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           R  A    S    L    G  D  F G  EI TPNID +A  G  L+  YAQPVC+P+RA
Sbjct: 18  RAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATLDAFYAQPVCSPTRA 77

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           + MTG+YP+  G+Q   +      G+PL ER + + L++ GY+T  IGKWHLG F  EY 
Sbjct: 78  AFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAVIGKWHLGHFAPEYL 137

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           P  RGF+  +G+ NG + Y+ HI            G D  R+     D    Y+T L  K
Sbjct: 138 PTKRGFDHQYGHYNGALDYFTHIRD---------GGFDWHRDDKVNSDE--GYSTHLVAK 186

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           +AVQ ++     KP FLY+   A HA +     + P+E +  + ++    R+ YA M+  
Sbjct: 187 DAVQFVQTHAGKKPFFLYVPFNAVHAPH-----QVPEEYLKPYGHLKG-ERQKYAGMLAA 240

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           +D++VG +++A+++ G+ +N++ +F+SDNG P              SN  +R  K TL+E
Sbjct: 241 MDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGVVT--------SNGKFRAGKGTLYE 292

Query: 374 GGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           GGV+V A   +   I+    ++ + +H+ DW PTL    GG T    L +DGLD W  L 
Sbjct: 293 GGVRVAACAAFDGHIRPGTTIT-EPLHVVDWYPTLLK-LGGPTGEQQLPVDGLDIWPVLT 350

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
              PS             ++LLNT                T AVR   WKLV+   +NGT
Sbjct: 351 GGKPSPHG----------AILLNT-------------TPNTGAVRAGDWKLVV---KNGT 384

Query: 493 MD 494
            D
Sbjct: 385 DD 386


>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
 gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
          Length = 491

 Score =  234 bits (596), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 147/400 (36%), Positives = 210/400 (52%), Gaps = 54/400 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ FHGS+ I TPN+D LA  G  L   Y QP CTP+RA+ +TG+YP+H G+Q   I
Sbjct: 60  GFADVGFHGSD-IKTPNLDHLAAQGARLGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAI 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E  LP+ L+++GY T  +GKWHLG   +++ P  RGF+S +G L G I 
Sbjct: 119 PSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGPLVGEID 178

Query: 212 YYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           ++ H    ++D Y        HD  +     +D      T+LF KEAV+LI       PL
Sbjct: 179 HFKHEAHGVTDWY--------HDNTQVKEEGYD------TELFGKEAVRLIAAHDPKTPL 224

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  A H        +APQ  ++Q+ +I  P RR YAAM+  +DD +G V++AL  +
Sbjct: 225 FLYLAFTAPH-----TPFQAPQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSR 279

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL-WS 384
           GM EN++I+F SDNG    +              SN PYR  K +L+EGG +V A+  W 
Sbjct: 280 GMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWP 339

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
            +I   P  +  +MH+ D LPTL   AG                +SL  + P      +D
Sbjct: 340 GRIA--PGAAEGVMHVVDMLPTLAKLAG----------------ASLAKSKP------LD 375

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           G+D W +L     + R  ++ N++  +   AVR   WKLV
Sbjct: 376 GVDVWPALAAGQ-AGRAGIVYNVEPTQ--GAVRDGRWKLV 412



 Score = 56.2 bits (134), Expect = 5e-05,   Method: Compositional matrix adjust.
 Identities = 30/68 (44%), Positives = 39/68 (57%), Gaps = 5/68 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T+LF KEAV+LI       PLFLYLA  A H        +APQ  ++Q+ +I  P RR
Sbjct: 202 YDTELFGKEAVRLIAAHDPKTPLFLYLAFTAPH-----TPFQAPQSYLDQYAHIAAPQRR 256

Query: 75  TYAALTKS 82
            YAA+  +
Sbjct: 257 AYAAMITA 264


>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
          Length = 571

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 165/540 (30%), Positives = 256/540 (47%), Gaps = 79/540 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+S+H   ++ TPN+  +A NG+IL   Y+Q  CTPSRAS MTG YP   G+Q   +
Sbjct: 43  GYNDVSWHNP-QVLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGYYPFRIGVQNSVV 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                  VPL   FLP+ L+E GY +  +GKWHLG  RR+ TP  RGF++  G LNG   
Sbjct: 102 REGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGFDTFLGLLNGYND 161

Query: 212 YYDHILS-----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           YY   +      + +        +D   N +       +Y TD+FT  A++LI+ Q  D 
Sbjct: 162 YYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTDIFTNRAIELIQ-QSKDT 220

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           P FL L + A H     KH   P  +   +  + D NRR Y + +  +D  +G V+ AL+
Sbjct: 221 PFFLALHYTAPH-WPLQKH---PGFSDKDYPDVKDLNRRIYLSSITAMDTGIGKVVDALK 276

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           +  +L+N++I+F SDNG  T             +N P R  K TL+EGGVKVP++ + P 
Sbjct: 277 QHNLLDNTLIVFQSDNGGDT---------HFSANNGPLRDRKTTLYEGGVKVPSMAYGPG 327

Query: 387 IQQN-PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           +  N PR S  + HISDW  T+ +AAG +      N+DG+DQW+  LL   S+       
Sbjct: 328 LLTNTPRTSDDLFHISDWFTTILSAAGLEAG----NVDGIDQWN--LLRYGSK------- 374

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
                       S R + + NI      AA+R   +KL+ G  +      +  + R   V
Sbjct: 375 ------------SPRQTFIYNI--HGDNAAIRAGRYKLIEGNPK-----AFQARRRRRSV 415

Query: 506 PLLNFN---------------AIVESKTYQSLQQLSQNIFLPISNIDKMRSTR------Q 544
            +L+ +                 V+ + Y+S +      F  ++ + K+   R       
Sbjct: 416 DVLDQDLAVHGSALTRHKRDEEFVDRRPYESDEDKVMKAFSSLTGLKKLEQIRPTGDWTM 475

Query: 545 QATIHCGANPAP---MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
            + +       P      + C    C L+NL  DP E+ ++   +P +   ++ LL+ ++
Sbjct: 476 DSILSVLEKIRPDQVKVQADCHK--CQLYNLTADPSEKKDLVKEKPAVVLFMHTLLELYK 533


>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
          Length = 486

 Score =  233 bits (595), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 172/545 (31%), Positives = 252/545 (46%), Gaps = 76/545 (13%)

Query: 98  FHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPR 157
           +  S+ I TPNIDA++  G+ L N Y QP+CTPSR+ L++G+Y IHTG+Q   IW   P 
Sbjct: 5   YKSSDIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPS 64

Query: 158 GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHIL 217
            +PL    LPE +R  GY T A GKWHLG+ +   TP  RGF +  GYL G   YY    
Sbjct: 65  ALPLDTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTR 124

Query: 218 S-DQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAH 274
             D +   ++ N  G      +  A     EY+   + ++A   I+ +  DKP FLYL  
Sbjct: 125 CIDHHKCGIDQNTDGEIFGERVYNA--DASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPM 182

Query: 275 LAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
            + HA      LEAP E I ++   I D  R+ YA MV  +D+ +G + S L+  G+  N
Sbjct: 183 QSVHA-----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKESGLWNN 237

Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK----VPAILWSPQIQQ 389
           +I IF +DNG               G+N P RG K   WEGG+     V    +  + + 
Sbjct: 238 TIFIFSTDNGGQAYV---------GGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRKG 288

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           +  V+ +++HISDW PT+  A   + ++   ++DG+ QW  LL    S R+  +  +D+ 
Sbjct: 289 SELVNNELIHISDWYPTIMEATSCEKAQNAPDLDGMSQWKMLLDLEKSPRDEILHNIDE- 347

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
            ++   + +R      NI  +   AA+R  +WKL+ G        GY            +
Sbjct: 348 MTVAKGSDNRTFKSDFNISVQ---AAIRWKNWKLLTGDP------GY-----------PD 387

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
           F   +  +  + L      IF P + +                 P P  P P T     L
Sbjct: 388 FPITIPPQGNKKL------IFTPENGLQY---------------PIPPRPEPLTR-LVRL 425

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND----T 625
           F++  DP E N I+    DI   L + L Y   T VP      +    DPK F +     
Sbjct: 426 FDVIKDPTESNEISDYHEDIVEFLLDRLSYWNSTQVPI-----NFPAVDPKSFPEYHQGF 480

Query: 626 WSPWI 630
           W PW+
Sbjct: 481 WKPWL 485


>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
 gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
          Length = 353

 Score =  233 bits (594), Expect = 2e-58,   Method: Compositional matrix adjust.
 Identities = 143/394 (36%), Positives = 198/394 (50%), Gaps = 52/394 (13%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LLIV    GW+D+S++  N +  PN+  LA  G+I N  Y Q +CTPSR +L+TGK+P  
Sbjct: 4   LLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKFPYR 62

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            GMQ  PI   +  G+PL E  LP+ L++LGY+T  IGKWHLG  + EYTP  RGF+S +
Sbjct: 63  LGMQ-RPIRHKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFDSFY 121

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           GY  G   YY H +SD         G D     +   D  G Y+T+ F   A  +I    
Sbjct: 122 GYHRGSQDYYTH-MSD--------GGLDFWEGKTAISDQNGVYSTESFATRAENIISQHD 172

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            + PLFLYL     H  +     + P   +  F  I D NR++   MV  LDD++  V  
Sbjct: 173 PNTPLFLYLPLQPVHTPH-----QVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTD 227

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           ALQ KG+  N++ IFMSDNG          +Y +  SN+P RG K T+WEGG +VPA   
Sbjct: 228 ALQGKGLWNNTLTIFMSDNGG---------DYLDGQSNWPLRGAKGTVWEGGTRVPAFAH 278

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
              +++       MMH  D LPTL + AGG                          ++ +
Sbjct: 279 GNMLERTGYTYHGMMHGVDILPTLVSVAGG------------------------TEDAGL 314

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
           DG + W S+     S R   + +ID ++    +R
Sbjct: 315 DGKNMWQSISTGAESPRTEFVYDIDSRQDRYGIR 348


>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 495

 Score =  233 bits (594), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 142/399 (35%), Positives = 214/399 (53%), Gaps = 29/399 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++      TP ID LA NG+ L N Y    C PSR  LMTG++ I  G+     
Sbjct: 34  GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPNDG- 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  PR +PL E  + E L+  GYS   +GKWH G++     P  RGF++ FG++   I 
Sbjct: 92  FGYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGID 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y H  SD ++       H++R+N    A   +G+Y+T +F  E   +I     +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL+  A HA      LE P   + Q++  I D +RRTYAAM   +D+++  +  AL+ K 
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSII+F SDNG   V         + G+N+P RG K+T WEGGV+    ++   + +
Sbjct: 260 MLKNSIIVFTSDNGGALV--------NSAGNNWPLRGGKHTSWEGGVRAVGFVYGDLLPR 311

Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           + R +    +MHI+DW PTL T AGG+   +   + G+DQW+ +     S R+  +  L+
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMITGEDKSGRDDVLITLE 370

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
           + +      P R      N  +    AA+++  WKL+ G
Sbjct: 371 EDNK--AGAPYRTGIYKNNAFDITTHAAIKMGKWKLLTG 407



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
           +G+Y+T +F  E   +I     +KPLFLYL+  A HA      LE P   + Q++  I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231

Query: 71  PNRRTYAALTKST 83
            +RRTYAA+T   
Sbjct: 232 EDRRTYAAMTSCV 244


>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 490

 Score =  233 bits (593), Expect = 3e-58,   Method: Compositional matrix adjust.
 Identities = 144/421 (34%), Positives = 218/421 (51%), Gaps = 57/421 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +   + I TPNID LA  G+ L   YAQP C PSRA LM G+Y IHTG +   +
Sbjct: 23  GYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRYQIHTGYRDECM 82

Query: 152 WGAE-PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 R +      LP  +++ GY T  IGKWHLG    +  P  +GF+++FGY     
Sbjct: 83  NDTRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKGFDTYFGYNAAAE 142

Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY H+LS +       N  D+ R+ +  A   +G+Y+T +FT+EAV +IE+  + +P+F
Sbjct: 143 DYYTHMLSGRQ------NCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQPMF 196

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           +YLAH A HA      L+ PQE  + ++   +  RR YAAMV  +D+ +G V  AL+  G
Sbjct: 197 MYLAHQAVHAP-----LQVPQEYYSLYETTLNDVRRKYAAMVTCMDEGIGNVTEALKESG 251

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           M +N++++F+SDNG        ++  RNW    P RG K +++EGG++  + + S  I++
Sbjct: 252 MWDNTVLVFVSDNGGEL-----SAGGRNW----PLRGGKGSVFEGGIRTVSFVTSSLIER 302

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
             R+S  M+HI+DW PTL   AGG        ID                +  +DG++ W
Sbjct: 303 PQRISNAMIHITDWFPTLTHLAGG-------TID----------------DEMVDGINMW 339

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK------------LVLGTQENGTMDGYY 497
            ++   +   R  +LI I    R   V +D W             L +G  +  T + YY
Sbjct: 340 DTISTGSVEPRTELLIKISPPARITEVGMDPWHKYDVFDAWLNSALFVGNWKLVTNEAYY 399

Query: 498 G 498
           G
Sbjct: 400 G 400



 Score = 62.0 bits (149), Expect = 1e-06,   Method: Compositional matrix adjust.
 Identities = 30/79 (37%), Positives = 48/79 (60%), Gaps = 5/79 (6%)

Query: 2   RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
           R  +  A   +G+Y+T +FT+EAV +IE+  + +P+F+YLAH A HA      L+ PQE 
Sbjct: 160 RDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQPMFMYLAHQAVHAP-----LQVPQEY 214

Query: 62  INQFQYITDPNRRTYAALT 80
            + ++   +  RR YAA+ 
Sbjct: 215 YSLYETTLNDVRRKYAAMV 233


>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 469

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 148/411 (36%), Positives = 220/411 (53%), Gaps = 61/411 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPN+D LA +G+ L   Y + VCTP+RA+ +TG++P   GM    I
Sbjct: 46  GWNDVGFHGS-EIKTPNLDKLAASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI 104

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +  G+PL E+ + E L+E GY T  +GKWHLG ++  + P  RGF+ H+G+  G I 
Sbjct: 105 KPWDKVGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGID 164

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KPLFL 270
           Y+ H  +D +     L+ H  R +L         YATDL  +EAV+LI +   + +PLFL
Sbjct: 165 YFTH--NDDFLGA--LDWHRNRIHLKEEG-----YATDLIGQEAVKLINNHNYEQQPLFL 215

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A  A H       L A  E I  +  I D  RR +AA V+ +D+++G +I +L+ + +
Sbjct: 216 YIAFNAPHTP-----LHAKTEDIEDYLTIDDEKRRVFAAQVQSMDEAIGKIIQSLENQQV 270

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQ 389
            +N+ + F+SDNG   +         N G N P RG KN+L+EGGV+VPAI+ + P++  
Sbjct: 271 CDNTFVFFVSDNGGSVMR-------ANRGDNRPLRGGKNSLYEGGVRVPAIVSYPPKLSA 323

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
           N  ++ Q+  I D  PTL   AG                        S     +DGLD +
Sbjct: 324 NQEIN-QIFSIVDLYPTLAKLAG----------------------VESVEGKQLDGLDIF 360

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
           +       SRR+ +LI       +AA+  D++KLVL        +G YG++
Sbjct: 361 AK-----HSRRDELLIQY-RSSTSAAIIKDNYKLVL--------NGGYGES 397



 Score = 45.1 bits (105), Expect = 0.10,   Method: Compositional matrix adjust.
 Identities = 29/69 (42%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           YATDL  +EAV+LI +   + +PLFLY+A  A H       L A  E I  +  I D  R
Sbjct: 190 YATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHTP-----LHAKTEDIEDYLTIDDEKR 244

Query: 74  RTYAALTKS 82
           R +AA  +S
Sbjct: 245 RVFAAQVQS 253


>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 517

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 141/363 (38%), Positives = 203/363 (55%), Gaps = 31/363 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP-- 149
           GW D+S+HG + IPTPNIDALA NG+ LN  YA PVC+P+RASL+TG +  + G+  P  
Sbjct: 30  GWGDVSYHGGH-IPTPNIDALAKNGVELNRFYASPVCSPTRASLLTGLHIFNHGIIRPLA 88

Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            P   AE  G+P+  + +P++ +E GY T   GKWHLG    EY P  RGF+  +G++ G
Sbjct: 89  NPT--AEQYGLPVDLKIMPQFFKEAGYQTALSGKWHLGMHLEEYWPTNRGFDQSYGHMLG 146

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            I Y+DH+ S +          D  RN    ++    Y+T+L   EAV++IE +  ++PL
Sbjct: 147 GIGYFDHVHSSRL---------DWHRNEEPLFED--GYSTELIANEAVRIIETKDPNRPL 195

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY+A  A H       ++AP + I  F YI DP  R YAA V  LD  +G +I A++ +
Sbjct: 196 FLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDRAYAANVNALDAEIGKIIYAIEDQ 250

Query: 329 GMLENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-IL 382
           G+LE +II+F SDNG      P +           GS     G K +  EGG++VPA I+
Sbjct: 251 GILEETIIVFFSDNGPVFDINPIIAVLAPGLVEARGSTAGLPGSKTSAREGGIRVPASIM 310

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W   +  +   S Q + + D LPTL +AA  + S L +  DG D+W +L+       N++
Sbjct: 311 WKGVLDNSK--SDQYIFVQDLLPTLLSAAEIEVSDLKV-FDGTDKWQNLVTGALQAPNND 367

Query: 443 IDG 445
             G
Sbjct: 368 FVG 370



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/64 (43%), Positives = 39/64 (60%), Gaps = 5/64 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T+L   EAV++IE +  ++PLFLY+A  A H       ++AP + I  F YI DP  R
Sbjct: 173 YSTELIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDR 227

Query: 75  TYAA 78
            YAA
Sbjct: 228 AYAA 231


>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
          Length = 467

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 155/493 (31%), Positives = 238/493 (48%), Gaps = 71/493 (14%)

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+RRE  P  RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
           F++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q
Sbjct: 61  FDTFFGSLLGSGDYYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQ 116

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           ++      KP+FLY+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD++
Sbjct: 117 ILASHNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEA 171

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           +  V  AL++ G  +NSIII+ SDNG   +           GSN+P RG K T WEGG++
Sbjct: 172 INNVTLALKKYGYYDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIR 222

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
               + SP ++    V  +++HI+DW PTL T A G      + +DG D W ++   +  
Sbjct: 223 AVGFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEG 278

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
           RR+  +D L     +     +   +    I      +A+R++ WKL+ G        GY 
Sbjct: 279 RRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY- 331

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
               S+ VP   F                       SN    R   ++ +   G      
Sbjct: 332 ----SDWVPPQAF-----------------------SNAGPNRWHNERVSWTAGKT---- 360

Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
                     +LFN+  DP E+ ++++  PD+  QL   L    +T VP  +   D  ++
Sbjct: 361 ---------VWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRS 410

Query: 618 DPKRFNDTWSPWI 630
           +PK     W PW 
Sbjct: 411 NPKLNGGVWGPWF 423



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 90  LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPSKPIFLYIAYQAVHS-----PLQAPG 144

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 145 KYFEHYRSINNINRRRYAAM 164


>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
          Length = 431

 Score =  232 bits (592), Expect = 4e-58,   Method: Compositional matrix adjust.
 Identities = 134/360 (37%), Positives = 200/360 (55%), Gaps = 32/360 (8%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           + IHTG+Q   IW  +P  +PL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF
Sbjct: 1   FKIHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 60

Query: 200 ESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           +++FGYL G   YY H    L D  +  V     D R     A      Y+T++FT+ A 
Sbjct: 61  DTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERAT 118

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR YA M   +D+
Sbjct: 119 TLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDE 173

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG V +AL+R+G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV
Sbjct: 174 AVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGV 224

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
           +    + SP +++    + +++HISDWLPTL   AGG T+     +DG D W+++   +P
Sbjct: 225 RGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSP 283

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
           S R   +  +D   + +   P   NS+ +  DE              AA+R  +WKL+ G
Sbjct: 284 SPRMELLHNID--PNFVDTAPCPGNSMSLAKDESSLLEYSAFNISIHAAIRHQNWKLLTG 341



 Score = 55.5 bits (132), Expect = 8e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 108 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 162

Query: 75  TYAALT 80
            YA + 
Sbjct: 163 YYAGMA 168


>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
          Length = 609

 Score =  232 bits (591), Expect = 6e-58,   Method: Compositional matrix adjust.
 Identities = 163/551 (29%), Positives = 256/551 (46%), Gaps = 94/551 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +H + EI TP +D LA  G+ L   Y QPVCTP+R  LMTG+Y I  GMQ   +
Sbjct: 109 GYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQHGVV 167

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P GVPL E+ LPE LR+ GY+T+ IGKWHLG F  +Y P  RGF+   G+  G   
Sbjct: 168 RPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYTGSQD 227

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLS-----TAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           +Y H  +  +S    + G+D R   +       WD    Y+T +F  E  + +      +
Sbjct: 228 FYSH--NKCFS---GMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLSKMNPSE 282

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           P F YL+  A H+      L+AP++ I+Q++ + +  RR Y AMV  +D ++  V+ A +
Sbjct: 283 PSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDTAIYKVVKAYK 337

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           +    +++I+IF SDNG          N +   SN+P RG K +L+EGGV+    + SP 
Sbjct: 338 KFKFWDDTILIFSSDNGG---------NLKGGASNWPLRGAKGSLFEGGVRTIGFVHSP- 387

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR---RNSN 442
                     ++ ++DW PTL   AG  TS      +DG+ Q   L + +      R+  
Sbjct: 388 ----------LLPVTDWFPTLLQLAGCKTSDYGGKPLDGVSQVRQLWMGSNGNDHARDEI 437

Query: 443 IDGLDQWSSLLLNTPSRRNSVLIN----IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
           + GLD  + +      R+  ++ N    +D +    A+R   WKL+ G Q          
Sbjct: 438 LHGLDPLNKVKNTVDPRQFPIIKNRTFSVDVE---GALRWKQWKLITGKQ---------- 484

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
                                               + D+ R +    T++ GA      
Sbjct: 485 ------------------------------------SPDQNRWSTPPTTMYGGAKTIFDK 508

Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
             P  +    L+N+ +DP E+N+I+   P + + +   L  +    VP  +   DL +AD
Sbjct: 509 VEPKKSKLVRLYNVVDDPFERNDISDEHPQVVNVMLTKLADYYDASVPPRYPAYDL-RAD 567

Query: 619 PKRFNDTWSPW 629
           P   +  W PW
Sbjct: 568 PALGDGVWRPW 578



 Score = 40.8 bits (94), Expect = 2.0,   Method: Compositional matrix adjust.
 Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 6/91 (6%)

Query: 9   WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
           WD    Y+T +F  E  + +      +P F YL+  A H+      L+AP++ I+Q++ +
Sbjct: 256 WDLNNTYSTGVFADELEKRLSKMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNL 310

Query: 69  TDPNRRTY-AALTKSTTLTLLIVYGWNDLSF 98
            +  RR Y A +T   T    +V  +    F
Sbjct: 311 KNRKRRNYNAMVTAMDTAIYKVVKAYKKFKF 341


>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
          Length = 488

 Score =  231 bits (590), Expect = 7e-58,   Method: Compositional matrix adjust.
 Identities = 156/492 (31%), Positives = 236/492 (47%), Gaps = 71/492 (14%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+RRE  P  RGF
Sbjct: 23  YQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 82

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
           +S FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q+
Sbjct: 83  DSFFGSLLGSGDYYTHFKCD----SPGICGYDLYENNNAAWDHDNGIYSTQMYTQKVQQI 138

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      KP+FLY+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++
Sbjct: 139 LASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAI 193

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             V  AL++ G  ENSIII+ SDNG   +           GSN+P RG K T WEGG++ 
Sbjct: 194 NNVTLALRKYGYYENSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRA 244

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++    V  +++HI+DW PTL T A G      + +DG D W ++   +  R
Sbjct: 245 VGFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDVWETI---SEGR 300

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
           R+  +D L     +     +   +    I      +A+R++ WKL+ G        GY  
Sbjct: 301 RSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGN------PGY-- 352

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
              S+ VP   F                       SN    R   ++ +   G       
Sbjct: 353 ---SDWVPPQAF-----------------------SNAGPNRWHNERVSWSAGKT----- 381

Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
                    +LFN+  DP E+ ++++  P++  QL   L    +T VP  +   D  +++
Sbjct: 382 --------VWLFNITADPYERVDLSARHPEVVKQLLRRLSQFNKTAVPVRYPPKD-PRSN 432

Query: 619 PKRFNDTWSPWI 630
           PK     W PW 
Sbjct: 433 PKLNGGVWGPWF 444



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 111 LYENNNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 165

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 166 RYFEHYRSINNINRRRYAAM 185


>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 504

 Score =  231 bits (589), Expect = 9e-58,   Method: Compositional matrix adjust.
 Identities = 168/548 (30%), Positives = 260/548 (47%), Gaps = 99/548 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H    I TP ID LA NG+ LNN YA  +C PSR  LM+G++ +H  + G   
Sbjct: 39  GWNDVGYHNL-YIKTPTIDRLANNGVKLNNYYAANLCVPSRNMLMSGRH-VHGVIMG--- 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               PRG+PL E  +   L+E GYST  +GKW+ GF+ +E+ P  RGF++ FG+++    
Sbjct: 94  ---YPRGLPLNETTIANKLKEAGYSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKED 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y H++ D           D+RRN L  A    G Y+T ++  E   +I++   +KPLFL
Sbjct: 151 HYTHMVHDI---------SDLRRNDLCVADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFL 201

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY---ITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           +++  A H     + L+ P  ++ + +Y   I D +RR YA MV  +D+++  +I  L+ 
Sbjct: 202 FMSFSAVH-----EPLQVP--SVYEKEYIPTIDDTDRRIYAGMVSCVDEAIANIIETLEN 254

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           K ML+NSII+F SDNG   +           G+N+P RG K + WEGGV+    ++S  +
Sbjct: 255 KDMLKNSIIVFTSDNGGDPLYS---------GNNWPLRGWKASNWEGGVRALGFVYSEIL 305

Query: 388 QQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
             N R +   ++MHI+DW PTL   +GG+     L + G+ QW  +     S R   +  
Sbjct: 306 PTNARGTDNNELMHITDWFPTLVDISGGNIHTEEL-LYGVSQWGVIRGLAKSPREDVLIA 364

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
           LD     +  +  R      N  +    AA+++  WKL+ G  +        G  R  + 
Sbjct: 365 LDTTRKAV--SKYRTGIYGTNDFDITTLAALKMGKWKLLTGQND--------GDNRWFQP 414

Query: 506 PLLN--FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT 563
           P +N    +I++                  ++ D+ R  R                    
Sbjct: 415 PEINPPVKSIID------------------TDFDRKRMVR-------------------- 436

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDIS-SQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
                L+NL  DP E  +++  R DI    L  L +Y   T+  Q    P   + D K  
Sbjct: 437 -----LYNLDEDPTETTDLSLEREDIVLEMLARLTRYASSTVPAQLQTVP---RVDRKTM 488

Query: 623 NDTWSPWI 630
            D   PWI
Sbjct: 489 TDCLGPWI 496


>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 530

 Score =  231 bits (588), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 147/435 (33%), Positives = 219/435 (50%), Gaps = 43/435 (9%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQP 126
           + +P     A  T      L    GWND+ +  ++    TP +  LA NG+ L   Y Q 
Sbjct: 14  LPEPEEVASAPETPHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQS 73

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            CT SRA+L++G  P+H G+    I    P G+PL  + LP+YL+E GY T  +GKW +G
Sbjct: 74  TCTVSRAALLSGVLPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIG 133

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
            F  EY P  RGF+  FG+    I+Y+ HI S  Y         D+R    T  +    Y
Sbjct: 134 HFNEEYLPHNRGFDHFFGFYGADITYFSHISSRGYCANPNCFP-DLRNEDETMANASMRY 192

Query: 247 ATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYI--- 299
            TDLF + AV  +E    +    PLFLYL+  A H   +     APQE + N+ + +   
Sbjct: 193 TTDLFRERAVGFVEGHAANHATDPLFLYLSFNAPHYPTS-----APQEFMRNEAELLAPF 247

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRN 357
           T+  RR +AA+V   D SVG V+ AL+  G   +S+I+F SDNGA   T      ++  N
Sbjct: 248 TNRERRVFAAVVNYADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESN 307

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHISDWLPTLYTAAGGD 414
            GSN+P RG+K T WEGG +VPA + +P+    + +  +   ++HI+DW+PT+       
Sbjct: 308 TGSNWPLRGMKATYWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI------- 360

Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE----K 470
                  + G+ + SS LL       + + G D W S+       R  +L  +D      
Sbjct: 361 -------VRGVLRSSSTLL------GAEVSGADHWDSIRKLATPPRTELLYGVDHCVAGG 407

Query: 471 KRTAAVRLDSWKLVL 485
           + T A+R+D+WK ++
Sbjct: 408 QLTGAIRVDNWKFII 422


>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
          Length = 659

 Score =  230 bits (587), Expect = 1e-57,   Method: Compositional matrix adjust.
 Identities = 183/550 (33%), Positives = 259/550 (47%), Gaps = 86/550 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H SN I TP++DALA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 185 GWNDVGYHDSN-IFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 243

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  +PL E+ LPE L+E GY T  +GKWHLG FR+E  P  RGF++ FGYL G   
Sbjct: 244 WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 303

Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           YY H       +  V     D R     A      Y+T++FT++A+ LI   P  K    
Sbjct: 304 YYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLIAKHPAQKGTED 363

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            +  L +        L + +E ++ F      +  T   ++       G++ +    +  
Sbjct: 364 LIVILDSRGVPT---LVSSEEQVS-FLCKDQSSDHTSLQIIH-----FGSIKNVQTPRAS 414

Query: 331 LENSIIIF-MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
            ENS   F   DNG  T+           G+N+P RG K TLWEGG++    + SP ++Q
Sbjct: 415 SENSSSAFPFPDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRGVGFVASPLLKQ 465

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNS---NID- 444
               S +++HISDWLPTL   AGG T S  PL  DG D W ++    PS R     NID 
Sbjct: 466 TGVQSRELIHISDWLPTLVNLAGGHTNSTKPL--DGFDVWKTISEGNPSPRMELLHNIDP 523

Query: 445 GLDQWSSLLLNT--PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
           G    S    +T  PS+ +S          +  AA+R  +WKL         + GY G  
Sbjct: 524 GFVDISPCPESTRPPSKSDSFSTTHSTFNVSIHAAIRYGNWKL---------LTGYPGCD 574

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
                P L                         SNI K               P+P +P 
Sbjct: 575 YWFPPPSL-------------------------SNITK--------------QPSPDSP- 594

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
              N   +LF++  DP E+ +++S  P I  +L + L+Y++++ VP  +   D    DPK
Sbjct: 595 ---NKNLWLFDIQQDPEERYDLSSKHPSIVKKLLDRLQYYQKSSVPVFYPDED-PHCDPK 650

Query: 621 RFNDTWSPWI 630
                W PW+
Sbjct: 651 A-TGVWGPWM 659


>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
          Length = 504

 Score =  229 bits (584), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 154/508 (30%), Positives = 239/508 (47%), Gaps = 73/508 (14%)

Query: 124 AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
           A P C  +  S    KY IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKW
Sbjct: 20  ALPGCPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKW 79

Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-T 242
           HLGF+R+E  P  RGF++ FG L G   YY H   D    +  + G+D+  N   AW+  
Sbjct: 80  HLGFYRKECMPTQRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDDAAWEYD 135

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
            G Y+T ++T+   Q++      +PLFLY+A+ A H+      L+AP      ++ I + 
Sbjct: 136 NGIYSTQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIINI 190

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSN 361
           NRR YAAM+  LD+++  V  AL+R G  +NSI+I+ SDNG  PT            GSN
Sbjct: 191 NRRRYAAMLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG----------GSN 240

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
           +P RG K T WEGG++    + SP ++    V  +++HI+DW PTL T A G      + 
Sbjct: 241 WPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLAEGQLDE-GVR 299

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
           +DG D W ++   +  RR+  +D L     +     +   +    I      +A+R+  W
Sbjct: 300 LDGYDVWETI---SEGRRSPRVDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVRHW 356

Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
           KL+ G                       F+  V  + +              SN+   R 
Sbjct: 357 KLLTGNP--------------------GFSEWVPPQAF--------------SNLGPNRW 382

Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
             ++ +   G +              +LFN+  DP E+ ++++  P +  QL   L    
Sbjct: 383 HNERISWSAGKS-------------VWLFNITADPYERVDLSARYPGVVKQLLRRLSQFN 429

Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           +T VP  +  P   +++P+     W PW
Sbjct: 430 KTAVPVRYP-PRDPRSNPRLNGGVWGPW 456



 Score = 47.4 bits (111), Expect = 0.021,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 6/77 (7%)

Query: 4   NLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
           N   AW+   G Y+T ++T+   Q++      +PLFLY+A+ A H+      L+AP    
Sbjct: 127 NDDAAWEYDNGIYSTQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYF 181

Query: 63  NQFQYITDPNRRTYAAL 79
             ++ I + NRR YAA+
Sbjct: 182 EHYRSIININRRRYAAM 198


>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
 gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
          Length = 497

 Score =  229 bits (583), Expect = 4e-57,   Method: Compositional matrix adjust.
 Identities = 150/398 (37%), Positives = 208/398 (52%), Gaps = 48/398 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ +HGS +I TPNID LA  G  L   Y QP+CTPSRA+ MTG+YP   G+Q   I
Sbjct: 66  GWKDVGYHGS-DIRTPNIDRLAAEGARLEQFYVQPMCTPSRAAFMTGRYPFRYGLQTAVI 124

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             +   G+ L E  LP+ L++ GY T   GKWHLG  +  Y P  RGF+S +G L G I 
Sbjct: 125 PQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPRQRGFDSFYGALLGEID 184

Query: 212 YYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           ++ H  +         NG+ D  RN + A    G Y   L   EAV++I      KPLFL
Sbjct: 185 HFTHKAA---------NGNPDWYRN-NKALKEEG-YDNILIGAEAVRVINKHDQQKPLFL 233

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  A H        +AP+E +++  +I D +RR YAAM+  +DD +G V++AL+++GM
Sbjct: 234 YLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVMDDEIGKVVAALEKRGM 288

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
            +N++I+F SDNG     +    +         N PYR  K T +EGG +V  ++ W  +
Sbjct: 289 RDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTTYEGGTRVAGLVNWPGK 348

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           I+  P      MH+ D  PTL   AG                   L+     +N  +DG+
Sbjct: 349 IK--PGTIDGPMHVVDMYPTL---AG-------------------LVGATLGKNKPLDGM 384

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           D W +L    PS R  ++ NI+    TAAVR   WKLV
Sbjct: 385 DMWPTLAEGKPSPRTEIVYNIE--PMTAAVRQGDWKLV 420



 Score = 45.4 bits (106), Expect = 0.081,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y   L   EAV++I      KPLFLYLA  A H        +AP+E +++  +I D +RR
Sbjct: 209 YDNILIGAEAVRVINKHDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRR 263

Query: 75  TYAAL 79
            YAA+
Sbjct: 264 KYAAM 268


>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
 gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
          Length = 455

 Score =  228 bits (581), Expect = 8e-57,   Method: Compositional matrix adjust.
 Identities = 123/269 (45%), Positives = 165/269 (61%), Gaps = 10/269 (3%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YGWND+ +H S  I TPN+D LA  G+ L N Y QP+C+PSR  LMTG+Y IH G+Q   
Sbjct: 11  YGWNDIGYHNSF-IRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSV 69

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    P G+PL E  LP+ L+E GYST  +GKWHLGFFR+EY PL RGFE  FGYL G  
Sbjct: 70  IMCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFERFFGYLTGGE 129

Query: 211 SYYDHILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            Y+ H    Q+S+   E +G D+R       D  G Y+T LF ++A+++I +    KP+F
Sbjct: 130 DYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEMILNHDQSKPMF 189

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LYL   A H       LEAP+E    ++ I +   RTYAAMV  +D++VG V  AL+ +G
Sbjct: 190 LYLPFQAVHGP-----LEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAVGNVTGALKHRG 244

Query: 330 MLENSIIIFMSDNGAPT---VEYRETSNY 355
           M  N+I+IF + + A T   ++Y  T +Y
Sbjct: 245 MWRNTILIFSTASSAFTQSQIKYDGTISY 273



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/71 (38%), Positives = 40/71 (56%), Gaps = 5/71 (7%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y+T LF ++A+++I +    KP+FLYL   A H       LEAP+E    ++ I 
Sbjct: 161 DQNGTYSTHLFARKAIEMILNHDQSKPMFLYLPFQAVHGP-----LEAPEEYKRIYEDID 215

Query: 70  DPNRRTYAALT 80
           +   RTYAA+ 
Sbjct: 216 NSLVRTYAAMV 226


>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
 gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
          Length = 538

 Score =  228 bits (580), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 134/350 (38%), Positives = 194/350 (55%), Gaps = 25/350 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H S EI TPN+D LA +   L+  Y    CTP+RA+LMTG++P   GM    I
Sbjct: 54  GWNDVGYHNS-EIKTPNLDKLAESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVI 112

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +  G+PL E+ + + L+E GY T  +GKWHLG ++ EY P  RGF+ H+G+  G I 
Sbjct: 113 WPWDKVGLPLEEKTIAQTLKEAGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSID 172

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
           Y+ H L           G D  RN     +    YATDL  +EAV+LI D   +K PLFL
Sbjct: 173 YFTHQLDAGIQ-----GGLDWHRNEQPVEEE--GYATDLLAQEAVKLIRDCDYNKSPLFL 225

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y++  A HA      L+A ++ I  +  I D  RR +AA V+ +D++VG +++AL+ K +
Sbjct: 226 YVSFNAPHA-----PLQAKEKDIKNYANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQV 280

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
            +N+++ F SDNG          + + W  G N P RG K TL++GGV+VP I+  P   
Sbjct: 281 WDNTLLFFTSDNGG--------GSDQPWTRGDNRPLRGQKGTLYDGGVRVPTIISFPAQL 332

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           +  +V  Q+  + D  PTL   AG   S     +DG+D   S+   +  R
Sbjct: 333 KGGQVIEQVFSVVDLYPTLAKLAGVKESSQN-QLDGVDILESIATGSTRR 381



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 6/69 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           YATDL  +EAV+LI D   +K PLFLY++  A HA      L+A ++ I  +  I D  R
Sbjct: 200 YATDLLAQEAVKLIRDCDYNKSPLFLYVSFNAPHA-----PLQAKEKDIKNYANIQDEGR 254

Query: 74  RTYAALTKS 82
           R +AA  +S
Sbjct: 255 RIFAAQVQS 263


>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
 gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
           IMCC14465]
          Length = 526

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 138/388 (35%), Positives = 215/388 (55%), Gaps = 37/388 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ +HGS +I TP+ID LA  G  LN  YA P C+P+RA+LMTG+ P+  G+    +
Sbjct: 62  GWGDVGYHGS-DIQTPHIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVL 120

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E  GV L E FLP+  +  GY+T  +GKWHLG    ++TP  RGF+  +G+++  +S
Sbjct: 121 MPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVS 180

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
           Y+DH ++         NGHD + N     D  GEYATD+   +A + I D +   KP  L
Sbjct: 181 YFDHQIA---------NGHDFQEN-GKPVDHNGEYATDVHGAQAARFITDLRDKTKPFLL 230

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD---PNRRTYAAMVKKLDDSVGTVISALQR 327
           Y+  LA H+      +EAP+   +++    D     R+TYAAMV  +D ++G V++AL  
Sbjct: 231 YVPFLAPHS-----PMEAPEALKDKYSSRLDLPGSTRKTYAAMVDSMDQAIGKVLTALDE 285

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSP 385
           +G+ +N+I+ F SDNG           + N+GS N PYRG K  ++EGG++V A++ W  
Sbjct: 286 EGVADNTIVFFFSDNGG----------FENYGSDNGPYRGGKLEVYEGGIRVTAVMRWPE 335

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI-- 443
           ++     V  +++ + D LPTL  AAG +       IDG+++WS++     S+R   +  
Sbjct: 336 KLAAGSEVD-EIVSVMDLLPTLTHAAGVENGT-EKKIDGVNRWSTITGEKVSKRKGALYF 393

Query: 444 -DGLDQWSSLLLNTPSRRNSVLINIDEK 470
              +  ++   L     +  ++ NID K
Sbjct: 394 ASNIPVYNKFQLGVIDGKWKLVQNIDHK 421



 Score = 40.0 bits (92), Expect = 3.6,   Method: Compositional matrix adjust.
 Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 9/77 (11%)

Query: 10  DTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
           D  GEYATD+   +A + I D +   KP  LY+  LA H+      +EAP+   +++   
Sbjct: 200 DHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEALKDKYSSR 254

Query: 69  TD---PNRRTYAALTKS 82
            D     R+TYAA+  S
Sbjct: 255 LDLPGSTRKTYAAMVDS 271


>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 499

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 158/540 (29%), Positives = 251/540 (46%), Gaps = 88/540 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H   E+  P ++ LA +G+I N  Y QP CTP+RA+LM+G YP  TG Q   +
Sbjct: 42  GWNDVGWHNP-EVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGYYPFKTGNQHQLL 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+PL  + LP+ L+++GY T  +GKWHLGF +  + P  RGF+S +G L    S
Sbjct: 101 LNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGFDSFYGGLTLGTS 160

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++  +     +  +   G+D   N      T  +Y   +    AV++I     + PLF+Y
Sbjct: 161 HFSKM-----NGILSTPGYDFYDNSGVVPQT-NDYLAFMLADRAVKIINGHYQEYPLFMY 214

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            +          K  E P E    +  ITD   R +   +  +DD+VG+V+ AL+++ M 
Sbjct: 215 FS-----MDVPAKSPEVPPEYEALYANITDDRTRRFYGKLSVMDDAVGSVVDALKKRDMW 269

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +++++IF+ DNGA            + GSN+P RG+  TL+EG  +VPA +    ++Q  
Sbjct: 270 DDTLLIFIGDNGALA---------SSSGSNWPLRGIAATLFEGATRVPAFIHGNMLKQTG 320

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             + ++ HI D   T+ + AG ++S     IDG+D W                  D +S 
Sbjct: 321 YENNELWHIVDLHKTILSLAGAESS---CEIDGMDMW------------------DTFSK 359

Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
              N  S RN ++ NID+   +  AA+R+  +KL+ G                       
Sbjct: 360 ---NESSPRNEIVYNIDDDPISPGAAIRVGDYKLIAG----------------------- 393

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA-PMTPSPCTNGPCY 568
                           + ++F P+  I++         +     P  P    P      Y
Sbjct: 394 ----------------NPDLFYPLRLINQSDGWFNYGDVPTSGLPVLPTGDEPPPPNVTY 437

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           LFN+ +DP E+NNIA   P+I   L + L  HR   +P  ++  D    DP  F+  +SP
Sbjct: 438 LFNVIDDPEERNNIADDHPEIVQALRQRLDEHRIYYMPAVNQVFD-EAGDPSNFDGVFSP 496


>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
          Length = 496

 Score =  227 bits (579), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 155/492 (31%), Positives = 230/492 (46%), Gaps = 73/492 (14%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y IHTG+Q   I  A+P  +PL    LP+ LRE GY+T  +GKWHLGF+RRE  P +RGF
Sbjct: 31  YQIHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGF 90

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
           ++ FG L G   YY H   D         G+D+  N   AWD   G Y+T ++T+   Q+
Sbjct: 91  DTFFGSLLGSGDYYTHYKCDGPGAC----GYDLYENDRAAWDHDDGVYSTHMYTRRVQQI 146

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      KPLFLY+A+ A H+      L+AP      ++ I + +RR YAAM+  LD++V
Sbjct: 147 LAAHDPGKPLFLYIAYQAVHSP-----LQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAV 201

Query: 319 GTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           G V  AL+  G+  NSI+I+ SDNG  P+            GSN+P RG K T WEGG++
Sbjct: 202 GNVTLALRAHGLYNNSILIYSSDNGGQPSAG----------GSNWPLRGSKGTYWEGGIR 251

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
               + SP ++Q   V  +++HI+DW PTL   A G       ++DG D W ++   +  
Sbjct: 252 AVGFVHSPLLRQRGTVCRELVHITDWYPTLVALAEGHIEE-DAHLDGYDVWETI---SEG 307

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
            R+  +D L     +         +    I      +A+R+  WKL+ G    G      
Sbjct: 308 LRSPRVDILHNIDPIYTKAKHGSWAAGYGIWNTAVQSAIRVQHWKLLTGNPGYG------ 361

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
                + VP   F                       SN+   R   ++ T+  G +    
Sbjct: 362 -----DWVPPQAF-----------------------SNLGPSRWHSERVTLATGQS---- 389

Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
                     +LFN+  DP E+ +++   P +  QL   L    RT VP  +  P   ++
Sbjct: 390 ---------VWLFNITADPYERVDLSGRYPGVVRQLLRRLSQFNRTAVPVRYP-PGDPRS 439

Query: 618 DPKRFNDTWSPW 629
           +P+     W PW
Sbjct: 440 NPRLNGGVWGPW 451



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N   AWD   G Y+T ++T+   Q++      KPLFLY+A+ A H+      L+AP 
Sbjct: 119 LYENDRAAWDHDDGVYSTHMYTRRVQQILAAHDPGKPLFLYIAYQAVHSP-----LQAPS 173

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + +RR YAA+
Sbjct: 174 RYFRHYRSIANTHRRRYAAM 193


>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
          Length = 547

 Score =  227 bits (578), Expect = 1e-56,   Method: Compositional matrix adjust.
 Identities = 167/542 (30%), Positives = 250/542 (46%), Gaps = 105/542 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +       R                         RGF++  G L G + 
Sbjct: 119 RPRQPNCLPLDQSXXXXXXR-------------------------RGFDTFLGSLTGNVD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    S AW   G+Y+T L+ +    ++      +PLFLY
Sbjct: 154 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 209

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 210 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 264

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
            NS+IIF SDNG  T          + GSN+P RG K T WEGGV+    + SP +++  
Sbjct: 265 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 315

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
           R S  ++HI+DW PTL   AGG T+     +DG D W ++     S R   +  +D    
Sbjct: 316 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 371

Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L   +R  S+     I      AA+R+  WKL+ G       D  YG    + +P   
Sbjct: 372 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 416

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
                     Q+L     + +    N+++M S RQ                       +L
Sbjct: 417 ----------QTLAAFPGSWW----NLERMASARQA---------------------VWL 441

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
           FN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P   +A P      W 
Sbjct: 442 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 498

Query: 628 PW 629
           PW
Sbjct: 499 PW 500



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 175 SVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 229

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 230 RTMGNVARRKYAAMV 244


>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
           [Verrucomicrobium spinosum DSM 4136]
          Length = 496

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 152/439 (34%), Positives = 220/439 (50%), Gaps = 66/439 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G +D+ + GS EI TP++D LA  G  L+  Y QPVC+P+RA+L+TG+YP   G Q   +
Sbjct: 64  GSHDVGWRGS-EIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRYPFRYGFQTGVV 122

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+PL ER LP+ L+E GY T   GKWHLG F+  Y P  RGF+  +G+ NG++ 
Sbjct: 123 RPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFDHQYGHYNGMLD 182

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY HI            G D  RN     D    Y+T+L  KEA + + ++   +PLFLY
Sbjct: 183 YYTHIRH---------GGFDWHRNDQENHDE--GYSTELVGKEAARRVRERDKSRPLFLY 231

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +     H+ +     + P E    +    +  R+TYA MV  LD +VG ++ A++ + ++
Sbjct: 232 VPFNGVHSPH-----QVPVEYETPYTQF-EGKRKTYAGMVTALDAAVGRIVQAVRDEKLV 285

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQN 390
           EN++ IF SDNG P        N +   SN P R  K T++EGGV+V A   W  +I+  
Sbjct: 286 ENTLFIFSSDNGGP--------NPKQLTSNGPLRAGKGTVYEGGVRVCAFATWPGKIK-- 335

Query: 391 PRVSLQM-MHISDWLPTLYTAAGGDTSR-LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
           P  S+Q  +HI+DW PTL   AG    + LPL                       DG D 
Sbjct: 336 PGTSVQAPIHIADWFPTLLGLAGASLKQPLPL-----------------------DGRDV 372

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG----TQENGTMDGYYGQTRSNK 504
           W+++     S  +++LIN        A+R   WKLV+G      E+G  D   G+ +  +
Sbjct: 373 WATVAEGAASPHDTILINTTPS--GGAIRSGDWKLVIGGANRDDEDGPGDAAKGKGKKGR 430

Query: 505 ------VPLLNFNAIVESK 517
                 V L N  + V  K
Sbjct: 431 EQPAQGVELFNLASDVGEK 449


>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
          Length = 476

 Score =  226 bits (577), Expect = 2e-56,   Method: Compositional matrix adjust.
 Identities = 159/495 (32%), Positives = 234/495 (47%), Gaps = 79/495 (15%)

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +Y IHTG+Q   I   +P  +PL +  LP+ L+E GYST  +GKWHLGF+R+E  P  RG
Sbjct: 8   RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           F++  G L G + YY +   D       + G D+    S AW   G+Y+T L+ +    +
Sbjct: 68  FDTFLGSLTGNVDYYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHI 123

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      +PLFLY+A  A H       L++P+E + +++ + +  RR YAAMV  +D++V
Sbjct: 124 LASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 178

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             +  AL+R G   NS+IIF SDNG  T          + GSN+P RG K T WEGGV+ 
Sbjct: 179 RNITWALKRYGFYNNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRG 229

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP +++  R S  ++HI+DW PTL   AGG T+     +DG D W ++     S 
Sbjct: 230 LGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAGG-TASATDGLDGYDVWPAISEGRASP 288

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
           R   +  +D      L   +R  S+     I      AA+R+  WKL+ G       D  
Sbjct: 289 RTEILHNIDP-----LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPG 336

Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
           YG    + +P     A   S                  N+++M S RQ            
Sbjct: 337 YG----DWIPPQTLAAFPGS----------------WWNLERMASARQA----------- 365

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDL 614
                      +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +  E P  
Sbjct: 366 ----------VWLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP-- 413

Query: 615 VQADPKRFNDTWSPW 629
            +A P      W PW
Sbjct: 414 -RAHPDFNGGAWGPW 427



 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW   G+Y+T L+ +    ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 102 SVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 156

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 157 RTMGNVARRKYAAMV 171


>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
          Length = 430

 Score =  226 bits (576), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 134/358 (37%), Positives = 196/358 (54%), Gaps = 32/358 (8%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF++
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +FGYL G   YY H    L D  +  V     D R     A      Y+T++FTK A  L
Sbjct: 62  YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATAL 119

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++V
Sbjct: 120 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 174

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+ 
Sbjct: 175 GNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 225

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++Q    + +++HISDWLPTL   AGG T+     +DG D W ++   +PS 
Sbjct: 226 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 284

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
           R   +  +D   + + ++P   NS+    D+              AA+R  +WKL+ G
Sbjct: 285 RMELLHNID--PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 340



 Score = 56.2 bits (134), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 107 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 161

Query: 75  TYAALT 80
            YA + 
Sbjct: 162 HYAGMV 167


>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
          Length = 460

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 159/501 (31%), Positives = 239/501 (47%), Gaps = 84/501 (16%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   I   +P  VPL E+ LP+ L+E GY+T  +GKWHLG  ++E  P  RGF++
Sbjct: 32  IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91

Query: 202 HFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           +FGYL G  +YY H + +      V     D R     A      Y+T +F++ A  LI 
Sbjct: 92  YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
           + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR YA MV  +D++VG 
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V +AL+  G+  N++ +F +DNG  T+         + G+N+P RG K TLWEGG++   
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQTL---------SGGNNWPLRGRKWTLWEGGIRGVG 257

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRR 439
            + SP ++Q    S +++HISDWLPTL   AGG T    PL  DG D W ++   TPS R
Sbjct: 258 FVASPLLKQKGVKSRELIHISDWLPTLVNLAGGSTYGTKPL--DGFDVWRTISEGTPSPR 315

Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEK----------KRTAAVRLDSWKLVLGTQE 489
              +  +DQ      ++P    S+    ++              A +R  +WKL+ G   
Sbjct: 316 VELLHNIDQ--DFFDSSPCPGKSMAEAKNDSFPLKYSAFNTSIHAGIRYKNWKLLTGH-- 371

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
                           P   F     S+                SNI ++RS    +   
Sbjct: 372 ----------------PGCGFWFPPPSQ----------------SNISEIRSLDSSSKT- 398

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
                             +LF++  DP E+++++   P I  +L   L+Y+    VP S+
Sbjct: 399 -----------------LWLFDINQDPEERHDLSEKHPHIVLKLLSRLQYYHEHSVP-SY 440

Query: 610 EQPDLVQADPKRFNDTWSPWI 630
             P   + DPK     WSPW+
Sbjct: 441 FPPMDPRCDPKD-TGVWSPWM 460



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T +F++ A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 137 YSTHIFSQRATTLIANHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRR 191

Query: 75  TYAALT 80
            YA + 
Sbjct: 192 IYAGMV 197


>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
           partial [Saccoglossus kowalevskii]
          Length = 355

 Score =  226 bits (575), Expect = 3e-56,   Method: Compositional matrix adjust.
 Identities = 131/344 (38%), Positives = 193/344 (56%), Gaps = 27/344 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ ++      TP ID LA NG+ L N Y    C PSR  LMTG++ I  G+     
Sbjct: 34  GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPQDG- 91

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  PR +PL E  + E L+  GYST  +GKWH G++     P  RGF++ FG++   I 
Sbjct: 92  FGYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGID 151

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y H  SD ++       H++R+N    A   +G+Y+T +F  E   +I     +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL+  A HA      LE P   + Q++  I D +RRTYAAM   +D+++  +  AL+ K 
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML+NSII+F SDNG   V         + G+N+P RG K+T WEGGV+    ++   + +
Sbjct: 260 MLKNSIIVFTSDNGGALVN--------SAGNNWPLRGGKHTSWEGGVRAVGFVYGDLLPR 311

Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           + R +    +MHI+DW PTL T AGG+   +   + G+DQW+ +
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMI 354



 Score = 53.5 bits (127), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
           +G+Y+T +F  E   +I     +KPLFLYL+  A HA      LE P   + Q++  I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231

Query: 71  PNRRTYAALTKST 83
            +RRTYAA+T   
Sbjct: 232 EDRRTYAAMTSCV 244


>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
          Length = 430

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 134/358 (37%), Positives = 196/358 (54%), Gaps = 32/358 (8%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           I TG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF++
Sbjct: 2   IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61

Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +FGYL G   YY H    L D  +  V     D R     A      Y+T++FTK A  L
Sbjct: 62  YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATAL 119

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++V
Sbjct: 120 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 174

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +LWEGGV+ 
Sbjct: 175 GNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 225

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++Q    + +++HISDWLPTL   AGG T+     +DG D W ++   +PS 
Sbjct: 226 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 284

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
           R   +  +D   + + ++P   NS+    D+              AA+R  +WKL+ G
Sbjct: 285 RVELLHNID--PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 340



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 107 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 161

Query: 75  TYAALT 80
            YA + 
Sbjct: 162 HYAGMV 167


>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
 gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
          Length = 507

 Score =  226 bits (575), Expect = 4e-56,   Method: Compositional matrix adjust.
 Identities = 139/390 (35%), Positives = 208/390 (53%), Gaps = 56/390 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +HGS +I TP+ID LA  G+ L+  YAQ  C+P+RA+L++G+     G+  P +
Sbjct: 52  GWNDVGYHGS-DIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGIYSP-L 109

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+ L ++ +P Y R+ GY T  +GKWHLGF+  EY PL RGF+  +G L G + 
Sbjct: 110 SKLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVG 169

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H         V   G D +RN  T       Y+T L + E  +LI+ +  +KPLFLY
Sbjct: 170 YWNH---------VHGGGLDWQRNGKTLRQE--GYSTHLQSAEITRLIQQRDPEKPLFLY 218

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            A  A H  N     EAP +T+ ++ +I +PNRR +AAMV +LD ++G ++  L  +GML
Sbjct: 219 AAFNAPHLPN-----EAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLMETLSTEGML 273

Query: 332 ENSIIIFMSDNGA---------------------------PTVEYRETSNYRNWGSNYPY 364
           EN++I FMSDNG                             T+E+  T+       N P+
Sbjct: 274 ENTLIWFMSDNGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNALDGGSDNSPH 333

Query: 365 RGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           R  K +++EGG +VP+ + W  ++  +P    QM+ + D LPTL +A   D + LP    
Sbjct: 334 RKGKQSIYEGGARVPSFVYWKGRL--SPERITQMVTVKDVLPTLLSATDIDANGLPTATT 391

Query: 424 ----GLDQWSSL----LLNTPSRRNSNIDG 445
               G+DQW  L     +  P    + +DG
Sbjct: 392 TESAGVDQWPGLTRGEFIQAPDYLINGMDG 421



 Score = 50.8 bits (120), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T L + E  +LI+ +  +KPLFLY A  A H  N     EAP +T+ ++ +I +PNRR
Sbjct: 193 YSTHLQSAEITRLIQQRDPEKPLFLYAAFNAPHLPN-----EAPADTLARYAHIENPNRR 247

Query: 75  TYAALT 80
            +AA+ 
Sbjct: 248 IHAAMV 253


>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 489

 Score =  225 bits (574), Expect = 5e-56,   Method: Compositional matrix adjust.
 Identities = 163/538 (30%), Positives = 245/538 (45%), Gaps = 86/538 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I  P ++ LA +G+I N  Y QP CTPSRA+L+TG YP     Q   +
Sbjct: 34  GWNDVGWHNA-DIKMPILNQLAADGVIFNQSYVQPACTPSRAALLTGYYPFKIQRQHQML 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E  G+ L  + LPE L+++GY T  +GKWHLGF + EY P  RGF+S +G+L    +
Sbjct: 93  LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDSFYGWLTLGTT 152

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            Y           +   G+D R N     ++   Y   +  + AV ++     + PLFL 
Sbjct: 153 LYS-------KENIIAPGYDFRDNTGVVQES-DTYLPFMLAERAVDIVMGHYKEYPLFLE 204

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            +        +GK LE PQE  + +  I D  +R +   +  +D+S+G ++ AL+ +GM 
Sbjct: 205 FS-----MALSGKFLEVPQEYEDLYSDIEDDRQRKFYGKLSVMDNSIGLLVDALKTRGMW 259

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           E+++ IF SDNGA   E          GSN+P RG   TL+EG  +VP       ++   
Sbjct: 260 EDTLFIFTSDNGALASES---------GSNWPLRGNIATLFEGATRVPTFFHGSMLKNTG 310

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            V+ ++MHI D   T+   AG  T     +IDG+D W ++    PS R   +  +D    
Sbjct: 311 YVNNELMHIVDLHKTIIELAGAKTE---CDIDGMDLWKTISKGKPSPRTEFVYNIDD--- 364

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
                           +E    AA+R+  +KL+ G  +                 L  F 
Sbjct: 365 ----------------NELSPGAAIRVGDYKLITGHPD----------------LLYPFR 392

Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS-PCTNGPCYLF 570
            +  S  + +          PIS I                 P P T + P  +   YLF
Sbjct: 393 MLNRSDDWYNYGD------EPISGI-----------------PIPPTGNEPPPSNVTYLF 429

Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           N+ +DP E+NN+A   P+   +L   L  +R  LVP  +   D    DP  F   WSP
Sbjct: 430 NIIDDPEERNNLAEEYPNKVEELRHRLDEYREHLVPPVNRVFD-PAGDPVNFGGVWSP 486


>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
          Length = 466

 Score =  225 bits (573), Expect = 7e-56,   Method: Compositional matrix adjust.
 Identities = 154/493 (31%), Positives = 236/493 (47%), Gaps = 73/493 (14%)

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RG
Sbjct: 1   RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
           F++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q
Sbjct: 61  FDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 116

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           ++      KP+FLY+A+ A H+      L+AP      ++ I + NRR YAAM+  LD++
Sbjct: 117 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 171

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +  V  AL+  G   NSIII+ SDNG  PT            GSN+P RG K T WEGG+
Sbjct: 172 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGI 221

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
           +    + SP ++    V  +++HI+DW PTL + A G      + +DG D W ++   + 
Sbjct: 222 RAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SE 277

Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
             R+  +D L     +     +   +    I      +A+R+  WKL+ G        GY
Sbjct: 278 GLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVKHWKLLTGNP------GY 331

Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
                S+ VP  +F                       SN+   R   ++ T+  G +   
Sbjct: 332 -----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGKS--- 360

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
                      +LFN+  DP E+ ++++  P I  QL   L    +T VP  +   D  +
Sbjct: 361 ----------VWLFNITADPYERVDLSNKYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PR 409

Query: 617 ADPKRFNDTWSPW 629
           ++P+     W PW
Sbjct: 410 SNPRLNGGVWGPW 422



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 90  LYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 144

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 145 RYFEHYRSIVNINRRRYAAM 164


>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 513

 Score =  224 bits (571), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 137/408 (33%), Positives = 221/408 (54%), Gaps = 31/408 (7%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GW+D+ +HGS  + +P +D LA  G+ L N Y QP+C+P+RA LM+G+Y I  G+Q   
Sbjct: 36  FGWHDIGYHGSI-VRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRYQIRYGLQHLV 94

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +   +P  E  + + ++E GY+T  +GKWHLGF+R+E  P+ RGF++ FG+LN +I
Sbjct: 95  IQPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFDTFFGFLNCLI 154

Query: 211 SYYDHILS-----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
            +Y +        +  ++T+       R +   A +  GEY+T LF +EA ++I +   +
Sbjct: 155 YHYTYDFGYWHTPESGNKTIMFGWDLFRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQE 214

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISA 324
            P+FLYL   A H       L+ P+E    ++  I D  RR  AAMV  +D+++  +  A
Sbjct: 215 TPMFLYLPFAAVH-----NPLDVPREYEAMYEGIIEDEQRRKKAAMVTCMDEAIWNITKA 269

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L   GM +N+++IF +DNG  T            G+N+P RG K +++EGG++    + S
Sbjct: 270 LMDTGMWDNTVLIFSTDNGGAT---------SVGGNNWPLRGGKRSMFEGGIRGVGFVTS 320

Query: 385 PQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS- 441
           P + +  R +   Q++H++DW PT    A G+ S     +DG +QW ++     S RN  
Sbjct: 321 PLLDEAVRGTENNQLIHVTDWFPTFVHLAEGNLSGTK-PLDGFNQWDTISQGMFSPRNEF 379

Query: 442 --NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
             NID ++  ++ L     R N      D   R   +R+  WKLV G 
Sbjct: 380 LINIDPMEICANPLATNQFRPNPYF---DVCVR-GGIRVGKWKLVTGV 423



 Score = 41.2 bits (95), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            R +   A +  GEY+T LF +EA ++I +   + P+FLYL   A H       L+ P+E
Sbjct: 181 FRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQETPMFLYLPFAAVH-----NPLDVPRE 235

Query: 61  TINQFQ-YITDPNRRTYAALT 80
               ++  I D  RR  AA+ 
Sbjct: 236 YEAMYEGIIEDEQRRKKAAMV 256


>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
 gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
          Length = 476

 Score =  224 bits (570), Expect = 1e-55,   Method: Compositional matrix adjust.
 Identities = 132/356 (37%), Positives = 194/356 (54%), Gaps = 26/356 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI T N+D LA +G+ L   Y + +CTP+RA+ +TG++P   GM    +
Sbjct: 46  GWNDVGFHGS-EIKTTNLDKLAVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINV 104

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+PL E+ + E L+E GY T  +GKWHLG ++  Y P  RGF+ H+G+    I 
Sbjct: 105 TPWSETGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGID 164

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
           Y+ H   D         G D  RN +  +  +  Y+TDL  ++AVQLI +    K PLFL
Sbjct: 165 YFTHKSGD---------GLDWHRNNNPVY--IEGYSTDLIAQDAVQLINNHDYHKNPLFL 213

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+A  A H       L+A  E +  +  I D  RR +AA V+ +D+++G++I +LQ K +
Sbjct: 214 YIAFNAPHIP-----LQAKAEDLEDYLTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQV 268

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
             N++++F SDNG   V       +   G N P RG K  L+EGGV+VP I+  P    +
Sbjct: 269 WNNTLLVFTSDNGGEIVA--NDIRFTGRGDNRPLRGGKRNLYEGGVRVPTIISYPGHLSS 326

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
            +   QM  I D  PT    AG   ++    IDG+D     +L + + +N+N D L
Sbjct: 327 GKTVEQMFSIVDLYPTFAKLAGLKINQ-EQQIDGVD-----ILESIAEKNTNRDEL 376



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           Y+TDL  ++AVQLI +    K PLFLY+A  A H       L+A  E +  +  I D  R
Sbjct: 188 YSTDLIAQDAVQLINNHDYHKNPLFLYIAFNAPHIP-----LQAKAEDLEDYLTIEDEQR 242

Query: 74  RTYAALTKS 82
           R +AA  +S
Sbjct: 243 RLFAAQVQS 251


>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 726

 Score =  223 bits (569), Expect = 2e-55,   Method: Compositional matrix adjust.
 Identities = 182/597 (30%), Positives = 271/597 (45%), Gaps = 143/597 (23%)

Query: 92  GWNDLSFHGSNEI-PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GWND+ +H ++    TPN+D L+ +G+ ++  Y+  +CTP+RA+LMTG+YP+  G+Q   
Sbjct: 123 GWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGRYPVRYGLQYNV 182

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    P G+PLTE+ LPEY+ E GY +  +GKWHLG +   +TP  RGFE++FGYLN   
Sbjct: 183 IQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGFETYFGYLNDEE 242

Query: 211 SYYDHILSDQYSRTVELNG--------------HDMRRNL-------------------S 237
            Y+ H      + T  +NG              +D+                       S
Sbjct: 243 MYWTH-----QTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGPTSSVYS 297

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQ-PVDK-PLFLYLAHLAAH--AGNAGKHLEAPQE-- 291
           ++ +  G+Y+T++FT  A++++  + P D+ PLFLYL+H A H   G       +P+E  
Sbjct: 298 SSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLGVPPADAFSPEEWA 357

Query: 292 --TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVE 348
             T  + +      R  +A ++  LD S+G ++  L+ +G LENSII+  SDN G P+  
Sbjct: 358 VLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSIIVVASDNGGCPSTG 417

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ---MMHISDWLP 405
                     GSNYP RG+K++ WEGGVKVPA ++SP          +   +MH++DWLP
Sbjct: 418 ----------GSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWLP 467

Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLL---------------------LNTPSRRNSNID 444
           TL  AA  +      ++DG+  W  ++                      N   R  +   
Sbjct: 468 TLADAANFELLGSHGDLDGVSHWKHIVPAPGDDTATVSESPEAAAAVEDNRLKRARTATA 527

Query: 445 GLD----------QWSSLLLNTPS--RRNSVLINID-------------EKKRT---AAV 476
            +D            SS  L   S   R  VL N D             E   T    A 
Sbjct: 528 AVDGGGGGGGGGGGGSSYGLGEVSYGARREVLYNFDPYVLGANITDDIGEADFTEVQGAF 587

Query: 477 RLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNI 536
           R   WK +     N    GYY     NK  LL  +   E  T Q L + SQ         
Sbjct: 588 RQGKWKFMF----NSWCSGYYA--FDNKT-LLADDLTDEDNTCQELGECSQ--------- 631

Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
                        CGAN   ++     +   +LF+L +DP E NN+A+  P++   L
Sbjct: 632 -------------CGANCGSLS----IDYTDWLFDLEDDPRELNNLANEHPEVVESL 671


>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
          Length = 539

 Score =  223 bits (568), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 152/492 (30%), Positives = 235/492 (47%), Gaps = 73/492 (14%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF
Sbjct: 75  YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 134

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
           ++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T+   Q+
Sbjct: 135 DTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQI 190

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      +P+FLY+A+ A H+      L+AP      ++ I + NRR YAAM+  LD+++
Sbjct: 191 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 245

Query: 319 GTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
             V  AL+  G   NSIII+ SDNG  PT            GSN+P RG K T WEGG++
Sbjct: 246 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIR 295

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
               + SP ++    V  +++HI+DW PTL + A G      + +DG D W ++   +  
Sbjct: 296 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVWETI---SEG 351

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
            R+  +D L     +     +   +    I      +A+R+  WKL+ G        GY 
Sbjct: 352 LRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY- 404

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
               S+ VP  +F                       SN+   R   ++ T+  G +    
Sbjct: 405 ----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGKS---- 433

Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
                     +LFN+  DP E+ ++++  P +  QL   L    +T VP  +  P   ++
Sbjct: 434 ---------VWLFNITADPYERVDLSNRYPGVVKQLLRRLSQFNKTAVPVRYP-PKDPRS 483

Query: 618 DPKRFNDTWSPW 629
           +P+     W PW
Sbjct: 484 NPRLNGGVWGPW 495



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      +P+FLY+A+ A H+      L+AP 
Sbjct: 163 LYENENAAWDYDNGIYSTQMYTQRVQQILASHDPKRPIFLYIAYQAVHS-----PLQAPG 217

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 218 RYFEHYRSIININRRRYAAM 237


>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
 gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
          Length = 455

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 146/414 (35%), Positives = 212/414 (51%), Gaps = 54/414 (13%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           T+A       +  L+    G  D+S+ GS  I TP +DALA +G  L   Y QPVC+P+R
Sbjct: 21  TFATDAPRPNIVFLLADDLGGADVSWRGS-PIKTPQLDALANSGAKLEQFYVQPVCSPTR 79

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           ++L+TG+YP+  G+Q   +      G+PL ER L E L++ GY T  +GKWHLG     Y
Sbjct: 80  SALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAY 139

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
            P+ RGF+  +G+ NG + Y+ H             GHD  ++     D    YAT L  
Sbjct: 140 LPMARGFDHQYGHYNGALDYFTHDRD---------GGHDWHKDDHVNRDE--GYATHLIA 188

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           +EAV++I+D+   KPLFLY+   A H+      L+ P+     +  +    R+ YA MV 
Sbjct: 189 QEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAAPYGDMKK-RRQAYAGMVA 242

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            LD++VG ++  +QR+ ML+N++ IF SDNG P     E     +   N P RG K+TL+
Sbjct: 243 ALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGP-----EPGKLTD---NGPLRGGKHTLY 294

Query: 373 EGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           EGGV+V A   W  +I    +V    +HI DW PTL   AGG                SL
Sbjct: 295 EGGVRVCAFASWKGRIAPGSKVEAP-LHIVDWYPTLIELAGG----------------SL 337

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
               P      +DG + W S+    PS  + ++ NI   +   A+R+  WKLV+
Sbjct: 338 QQAKP------LDGRNIWPSITTGEPSPHDVIVCNITPTE--GAIRVGDWKLVV 383


>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
          Length = 466

 Score =  223 bits (567), Expect = 3e-55,   Method: Compositional matrix adjust.
 Identities = 152/491 (30%), Positives = 236/491 (48%), Gaps = 71/491 (14%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+R+E  P  RGF
Sbjct: 1   YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 60

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
           ++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T ++T++  Q+
Sbjct: 61  DTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQI 116

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      KP+FLY+A+ A H+      L+AP +    ++ I + NRR YAAM+  LD+++
Sbjct: 117 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 171

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             V  AL+R G  +NSIII+ SDNG   +           GSN+P RG K T WEGG++ 
Sbjct: 172 NNVTLALKRYGFYDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRA 222

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              + SP ++    +  +++HI+DW PTL T A G      + +DG D W ++   +   
Sbjct: 223 VGFVHSPLLKNKGTICKELVHITDWYPTLITLAEGWIDE-DIQLDGYDIWETI---SEGL 278

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
           R+  +D L     +     +   +    I      +A+R+  WKL+ G        GY  
Sbjct: 279 RSPRVDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVKHWKLLTGNP------GY-- 330

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
              S+ VP  +F                       SN    R   ++ +   G +     
Sbjct: 331 ---SDWVPPQSF-----------------------SNAGPNRWHNERISWSAGKS----- 359

Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
                    +LFN+  DP E+ +++S  P I  +L   L    +T VP  +   D  +++
Sbjct: 360 --------VWLFNITADPYERVDLSSKYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSN 410

Query: 619 PKRFNDTWSPW 629
           P+     W PW
Sbjct: 411 PRLNGGVWGPW 421



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T++  Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 89  LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 143

Query: 60  ETINQFQYITDPNRRTYAAL 79
           +    ++ I + NRR YAA+
Sbjct: 144 KYFEHYRSIININRRRYAAM 163


>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
          Length = 316

 Score =  222 bits (566), Expect = 4e-55,   Method: Compositional matrix adjust.
 Identities = 113/250 (45%), Positives = 156/250 (62%), Gaps = 11/250 (4%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS  I TP++DALA  G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q   I
Sbjct: 57  GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           W  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           YY H    L D  +  V     D R     A      Y+T++FTK A  LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++VG V +AL+  
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288

Query: 329 GMLENSIIIF 338
           G+  N++ IF
Sbjct: 289 GLWNNTVFIF 298



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265

Query: 75  TYAALT 80
            YA + 
Sbjct: 266 HYAGMV 271


>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 490

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 149/412 (36%), Positives = 212/412 (51%), Gaps = 64/412 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G  DL + GS +I TP+IDALA  G+ L + Y  P+CTP+RA+LMTG+YP+  G+Q   I
Sbjct: 37  GNADLGYRGS-KIKTPHIDALAKGGVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           + +   G+P  E+ LP+ L+E+GY T  +GKWHLG   +++ P  RGF+  +G + G + 
Sbjct: 96  FPSHRYGLPTDEKTLPQALKEVGYHTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVD 155

Query: 212 YYDHILSDQYSRTV--ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           Y+    + +    V  + NG  +R +          Y  DL   EAV+LI      KPLF
Sbjct: 156 YF----TRERGGVVDWQRNGEFLRED---------GYYVDLIGTEAVKLIAGHDKAKPLF 202

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-RRTYAAMVKKLDDSVGTVISALQRK 328
           LY A LA HA       +AP+  I+ +  I D    RTYA M+  LD  VG V++AL+  
Sbjct: 203 LYFASLAPHA-----PYQAPKADIDAYNDIFDNEMHRTYAGMISNLDRQVGRVVAALETN 257

Query: 329 GMLENSIIIFMSDNGAPTV----------EYRETSNYRNWG-----SNYPYRGVKNTLWE 373
           GM EN+II F +DNG  T           E RE S     G     SN P+ G K +L +
Sbjct: 258 GMRENTIIFFTTDNGGATSALFATGARAPEEREASGGVALGSKPPCSNLPFSGGKGSLKD 317

Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           GGV +PAI+ W  +++  P V  + +H  D +PTL   AGG                   
Sbjct: 318 GGVHLPAIVNWPGRLK--PSVVNEPLHHVDLMPTLLALAGGK------------------ 357

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
              PS+     DG D W +L    PS    +LIN++  +   ++R   WKL+
Sbjct: 358 -GDPSK---PFDGKDAWGTLAEGKPSPHEEILINVEAFR--GSIRKGRWKLI 403



 Score = 43.5 bits (101), Expect = 0.31,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 34/66 (51%), Gaps = 6/66 (9%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-R 73
           Y  DL   EAV+LI      KPLFLY A LA HA       +AP+  I+ +  I D    
Sbjct: 179 YYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYNDIFDNEMH 233

Query: 74  RTYAAL 79
           RTYA +
Sbjct: 234 RTYAGM 239


>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
 gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
          Length = 519

 Score =  222 bits (565), Expect = 5e-55,   Method: Compositional matrix adjust.
 Identities = 133/351 (37%), Positives = 197/351 (56%), Gaps = 30/351 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP-- 149
           GW D+S++G   I TPNID LA +G+ +N  Y+ P C+P+RA+L TG   +  G+  P  
Sbjct: 31  GWGDVSYNGG-PINTPNIDKLADDGLQMNRFYSAPTCSPTRAALFTGINSLKNGIIRPLN 89

Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            P   AE  G+PL  + LPEYL+E+GY T   GKWHLG F  EY P  RGFES +G+L G
Sbjct: 90  NPT--AERYGLPLKHKILPEYLKEIGYQTALSGKWHLGMFSDEYLPRNRGFESTYGHLGG 147

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            I Y+DH LS +          D  RN    ++    Y+T L   EA+++IE++  + PL
Sbjct: 148 GIGYFDHALSGRL---------DWHRNGEILYED--GYSTTLIADEAIRIIENKNNETPL 196

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY+A  A H       ++A ++ IN    I+D   R YAA +  LD  +G +I A++ K
Sbjct: 197 FLYVAFNAPHT-----PIQAEEKIINNLSDISDKKERVYAANIITLDREIGKIIDAIESK 251

Query: 329 GMLENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           G+L  +IIIF SDNG      P           + G+    +G K + +EGG++VPA+++
Sbjct: 252 GILNETIIIFFSDNGPVFDIDPIAATIAPDILDSKGNTLGLKGSKGSAYEGGIRVPAVIY 311

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
              I +   +S Q   + D LPTL+ A   + ++   NI G+++W+ L+ N
Sbjct: 312 YKGILEK-SLSNQFFFVDDILPTLFGALNINPNQ--NNITGVNRWNDLIKN 359



 Score = 45.8 bits (107), Expect = 0.072,   Method: Compositional matrix adjust.
 Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 5/64 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T L   EA+++IE++  + PLFLY+A  A H       ++A ++ IN    I+D   R
Sbjct: 174 YSTTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISDKKER 228

Query: 75  TYAA 78
            YAA
Sbjct: 229 VYAA 232


>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
          Length = 431

 Score =  222 bits (565), Expect = 6e-55,   Method: Compositional matrix adjust.
 Identities = 134/367 (36%), Positives = 197/367 (53%), Gaps = 34/367 (9%)

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           + IHTG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF
Sbjct: 1   FKIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 60

Query: 200 ESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           +++FGYL G   YY H    L D  +  V     D R     A      Y+T++FT+ A 
Sbjct: 61  DTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERAT 118

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
            LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D+
Sbjct: 119 ALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDE 173

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +LWEGG+
Sbjct: 174 AVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGI 224

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLLLNT 435
           +    + SP ++Q    + +++HISDWLPTL   A G T+   PL  DG D W ++   +
Sbjct: 225 RGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPL--DGFDVWETISEGS 282

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVL 485
           PS R   +  +D   + +  +P    S+    D+              AA+R  +WKL+ 
Sbjct: 283 PSPRMELLHNID--PNFVDISPCPGKSLAPAKDDSSHPEYFSFNTSLHAAIRHRNWKLLT 340

Query: 486 GTQENGT 492
           G    G 
Sbjct: 341 GYPGCGC 347



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 108 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRH 162

Query: 75  TYAALT 80
            YA + 
Sbjct: 163 YYAGMV 168


>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
          Length = 519

 Score =  221 bits (564), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 134/365 (36%), Positives = 196/365 (53%), Gaps = 34/365 (9%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF++
Sbjct: 91  IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150

Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +FGYL G   YY H    L D  +  V     D R     A      Y+T++FT+ A  L
Sbjct: 151 YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 208

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV  +D++V
Sbjct: 209 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAV 263

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+  G+  N++ IF +DNG  T+           G+N+P RG K +LWEGG++ 
Sbjct: 264 GNVTAALKSHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRG 314

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLLLNTPS 437
              + SP ++Q    + +++HISDWLPTL   A G T+   PL  DG D W ++   +PS
Sbjct: 315 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPL--DGFDVWETISEGSPS 372

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGT 487
            R   +  +D   + +  +P    S+    D+              AA+R  +WKL+ G 
Sbjct: 373 PRMELLHNID--PNFVDISPCPGKSLAPAKDDSSHPEYFSFNTSLHAAIRHRNWKLLTGY 430

Query: 488 QENGT 492
              G 
Sbjct: 431 PGCGC 435



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 196 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRH 250

Query: 75  TYAALT 80
            YA + 
Sbjct: 251 YYAGMV 256


>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
          Length = 501

 Score =  221 bits (564), Expect = 7e-55,   Method: Compositional matrix adjust.
 Identities = 160/513 (31%), Positives = 251/513 (48%), Gaps = 59/513 (11%)

Query: 94  NDLSFHGSN----EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
           +DL +H       ++ TP +D LA  G+   N Y    C+PSR S M+G+YP  + MQ  
Sbjct: 30  DDLGYHDIGLRNPDLHTPTLDKLATKGVQFKNNYVMHACSPSRHSFMSGRYPFTSQMQKD 89

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
            I+   P   PL  +FLPEYL+ELGY T A+GKWHLG+ R +  P  RGF+S +G L+G 
Sbjct: 90  VIFPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCREDCMPTSRGFDSFYGTLDGE 149

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
             Y  H+ +  Y         D   N +      G+++ DL T     +IE Q  +KP F
Sbjct: 150 GDYLTHMSAGFY---------DWHTNGTVDRSKSGQHSQDLHTAALADIIERQTEEKPFF 200

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LY A     A N    L+A ++ + ++  Q ++D  ++ Y  ++  LD  V T+   L++
Sbjct: 201 LYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTLKK-YYGLITGLDGMVKTIKDTLEK 254

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           K +L N+ IIF++DNGA              GSN P+RG K+TLWEGGVK  + ++SP +
Sbjct: 255 KNLLNNTYIIFLADNGADV----------GIGSNEPFRGTKSTLWEGGVKSLSFIYSPLL 304

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           ++    +  ++HI+DW+PTL + AGG        IDG++Q + +     S R   +  LD
Sbjct: 305 KKTGYENDGLIHITDWMPTLVSLAGGSVPESE-GIDGINQVAMVTDARKSERKDMVYNLD 363

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
             S   L +  +               AVR + +KL+ G    G  DG YG T +    +
Sbjct: 364 MESQFGLPSSGQ--------------IAVRDERYKLLWGFP--GQNDG-YGLTTTYLAKV 406

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
             F  I +S T  + ++ +     P  +   M     + T+        +       G  
Sbjct: 407 EEFKVIYKSLT--TREKRAGRFTYPEDDQKVMNRLLAELTV-------SVDDIERGTGFM 457

Query: 568 YLFNLGNDPCEQNNIASS-RPDISSQLYELLKY 599
            LF+L  DP E  ++++S  P+    L++L+ +
Sbjct: 458 RLFDLIKDPYETYDLSTSDEPEHQEALHKLITF 490


>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
          Length = 418

 Score =  221 bits (564), Expect = 8e-55,   Method: Compositional matrix adjust.
 Identities = 154/491 (31%), Positives = 234/491 (47%), Gaps = 86/491 (17%)

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG FR+E  P  RGF+++FGYL G  
Sbjct: 1   IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60

Query: 211 SYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            YY H    L D  +  V     D R     A      Y+T++FT+ A  LI + P +KP
Sbjct: 61  DYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKP 118

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFLYLA  + H     + L+ P+E +  +++I D NR  YA MV  +D++VG V +AL+ 
Sbjct: 119 LFLYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKS 173

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+  N++ IF +DNG  T+           G+N+P RG K +LWEGG++    + SP +
Sbjct: 174 HGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLL 224

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           +Q    + +++HISDWLPTL T AGG T+     +DG D W ++   +PS R   +  +D
Sbjct: 225 KQKGIKNRELIHISDWLPTLVTLAGGSTNGTK-PLDGFDVWKTISEGSPSPRMELLHNID 283

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYY 497
              + +  +P    S+    D+              AA+R  +WKL+ G    G      
Sbjct: 284 --PNFVDISPCPGMSLAPPKDDSSGPEYFAFNTSLHAAIRHRNWKLLTGYPGCGCW---- 337

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
                                           F P S        +  A++   ++P   
Sbjct: 338 --------------------------------FPPPS--------QNNASVIPSSDPPTK 357

Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
           T         +LF++  DP E+++++   P I  QL   L+++ +  VP      D  + 
Sbjct: 358 T--------VWLFDIDQDPEERHDLSRDHPHIVKQLLSRLQFYHKHSVPVHFPAQD-PRC 408

Query: 618 DPKRFNDTWSP 628
           DPK     W P
Sbjct: 409 DPKD-TGAWGP 418



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  +++I D NR 
Sbjct: 97  YSTNIFTERATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRH 151

Query: 75  TYAALT 80
            YA + 
Sbjct: 152 HYAGMV 157


>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 336

 Score =  221 bits (562), Expect = 1e-54,   Method: Compositional matrix adjust.
 Identities = 136/403 (33%), Positives = 207/403 (51%), Gaps = 73/403 (18%)

Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 290
           M +N    ++  G Y+T +++ EA+  I +    +P+FLYLA+ A H+ N  ++ L+APQ
Sbjct: 1   MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
           E +++F +I    R+ YAAM+  +D  +G + S L ++ ML++SIIIF SDNG P     
Sbjct: 61  EWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLN 120

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
                 NW +NYP RG K T++EGGV+  A +WS ++ +NPRVS  +MHI+DWLPTL +A
Sbjct: 121 N-----NWATNYPLRGTKTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSA 175

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE- 469
           AG DT+ LP N+DG                     L+ W +L+   PS R  VLINID  
Sbjct: 176 AGVDTTLLP-NLDG---------------------LNLWDTLINQVPSPRTEVLINIDPL 213

Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
               AA+R+  WK+V   ++N   DG+Y                          ++S  I
Sbjct: 214 LYMNAALRVGDWKIV---KQNNFYDGWYPPP-----------------------EISNEI 247

Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPC--TNGPCYLFNLGNDPCEQNNIASSRP 587
                         ++  +HC   P P+ P  C   +G   LFN+  DPCE  ++++S P
Sbjct: 248 --------------EEYDVHCS--PPPLLPRSCDKKDGKLCLFNIKYDPCEYIDLSASNP 291

Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
            I   + E L ++ +++V           ADPK +N  W  W+
Sbjct: 292 HIYQFMLERLNFYNKSMVESRRNTFRDPLADPKLYNGVWKSWV 334



 Score = 59.7 bits (143), Expect = 5e-06,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 50/80 (62%), Gaps = 1/80 (1%)

Query: 1  MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 59
          M +N    ++  G Y+T +++ EA+  I +    +P+FLYLA+ A H+ N  ++ L+APQ
Sbjct: 1  MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60

Query: 60 ETINQFQYITDPNRRTYAAL 79
          E +++F +I    R+ YAA+
Sbjct: 61 EWVDKFSHIKHKGRQNYAAM 80


>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 173/559 (30%), Positives = 253/559 (45%), Gaps = 98/559 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--P 149
           G++D+ F  S++I TPNID                     +AS+++G+Y +H G+    P
Sbjct: 46  GFDDVGFK-SHQIKTPNID---------------------QASILSGRYAMHHGIVNWIP 83

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
           P    +  G+PL    LP+ L+  GY T AIGKWHLGF++ +YTP +RGF S  GY +G 
Sbjct: 84  P---KDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTFRGFNSFLGYYSGG 140

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            +Y+ H     Y    +      +     A+D  G+Y+T +F+ EAV++I+D    KPLF
Sbjct: 141 ENYFTHKNGPAYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFSDEAVRVIDDHIGPKPLF 200

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           LYLA+ A H     +  +APQ  I+ +   I D  RRT+A M+  LD+ +G V  AL+R 
Sbjct: 201 LYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGMLSCLDEGIGNVTRALKRN 255

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-- 386
           G   N++I+F +DNG P +             NYP RG K+ ++EGGV+  A +  PQ  
Sbjct: 256 GFYNNTLIVFTTDNGGPILGGDSVG-----ARNYPMRGGKHAVYEGGVRGIAFVHPPQNT 310

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
             +       +MH +DWLPTL  AAG +      ++DG+  W ++  NT           
Sbjct: 311 FVETGAGYHNLMHAADWLPTLCDAAGVE-CHTQQSLDGVSHWDAMTTNT----------- 358

Query: 447 DQWSSLLLNTPSRRNSVLIN----IDEKKRTAAVRLDSWKLVLGT------------QEN 490
                   N+  R + VL N     D      A R   WK + GT            Q  
Sbjct: 359 --------NSTLRGDVVLGNATDLCDATNCGFAYRKGDWKFIRGTGGMPFSWSTPMNQSG 410

Query: 491 GTMDGYYGQ----------TRSNKVPLLNFNAIVESKTYQSL--QQLSQNIFLPISNIDK 538
           G  +  YGQ          T +               TY +    Q S N  L  S    
Sbjct: 411 GNCNLAYGQCVDGASLGRVTAATAADCCYTCTNTTKCTYSTFLANQTSNNCLLFAS---- 466

Query: 539 MRSTRQQATIHC--GANPAPMTPSPCTNGPCY-LFNLGNDPCEQNNIASSRPDISSQLYE 595
             ST      +C  GA+     P    N   Y LFN+  DP E  ++ASS  D++ Q+  
Sbjct: 467 --STASTLDTNCISGAHDPKQVPYTTLNPLGYQLFNIAQDPNEHTDVASSHADVAKQMNS 524

Query: 596 LLKYHRRTLVPQSHEQPDL 614
            L    ++ V Q+   P+ 
Sbjct: 525 ELDSILQSYV-QAQNDPNC 542



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 6/73 (8%)

Query: 8   AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-Q 66
           A+D  G+Y+T +F+ EAV++I+D    KPLFLYLA+ A H     +  +APQ  I+ +  
Sbjct: 170 AFDLQGQYSTTIFSDEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTD 224

Query: 67  YITDPNRRTYAAL 79
            I D  RRT+A +
Sbjct: 225 LIPDAQRRTFAGM 237


>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
          Length = 370

 Score =  220 bits (561), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 132/366 (36%), Positives = 197/366 (53%), Gaps = 26/366 (7%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
           L  + + G++DL F   + I TPNIDALA  G+I  N Y Q VCTPSR +L+TG+YP  +
Sbjct: 3   LKPVFIPGYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRS 61

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
            MQ   I   + R   L  +FLPEYL++LGYST A+GKWH+G+ R E  P +RGF+S FG
Sbjct: 62  AMQNLVIMSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFG 121

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
             +G   Y++H      + TV     D   +     +  G ++ DL  +    +++ Q  
Sbjct: 122 LYDGDGYYWNH------TSTVIPGAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNA 175

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQ-ETINQFQYITDPN-RRTYAAMVKKLDDSVGTVI 322
            +P FLY +    H  +       PQ E +N +     P+ RR Y  +V  LD  VG ++
Sbjct: 176 KEPFFLYFSPQNPHTPS------QPQAEFLNLYPENRYPDIRRKYLGLVSGLDRMVGGIV 229

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             L+R G++ N+ IIF+SDNGA   E          G N P+RG K +L+EGG K  + +
Sbjct: 230 DGLKRNGLMNNTYIIFVSDNGADPEE----------GLNDPFRGGKGSLFEGGTKSASFI 279

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           +SP + +    +  +MHI+DW+PT+   AGG        +DG+DQ   +     S+R   
Sbjct: 280 YSPLLNKAGYENNGLMHITDWMPTILKLAGGSIPEGE-QLDGIDQSDMVKNGKASQRKKM 338

Query: 443 IDGLDQ 448
           +  +D+
Sbjct: 339 VYSIDR 344


>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 522

 Score =  220 bits (560), Expect = 2e-54,   Method: Compositional matrix adjust.
 Identities = 149/435 (34%), Positives = 215/435 (49%), Gaps = 70/435 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +HGS  + TP IDALA  G+ L N Y   +CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 39  GWDDVGYHGS-VMKTPYIDALAAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTI 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL E  LP+ L+E GY T  +GKWHLGF+R+E  P  RGF+S  G+   +  
Sbjct: 98  LMMQPLCLPLDEITLPQKLKEEGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGD 157

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQ------------- 257
           ++ H +S   +     NG D RRN     D   G+Y+T +F    +              
Sbjct: 158 HFYHNIS---ASPGHFNGFDFRRNNDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVK 214

Query: 258 -------LIEDQPV------DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT--DP 302
                  L    P+       +PLFLYL+  A H       L+ P      +  +   D 
Sbjct: 215 GVYFRGYLSSFTPITSSYNPQQPLFLYLSFQAVHT-----PLQVPSRYAELYNDLIPNDE 269

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
           +RR YA MV  +D++VG V + L++  + ENS++IF +                      
Sbjct: 270 DRRIYAGMVTCMDEAVGYVTTVLKQSALWENSVVIFSTX--------------------- 308

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPL 420
             RG K TL+EGGV+    + SP + +  R  ++ +++HISDW PTL   AGGD      
Sbjct: 309 -LRGGKGTLYEGGVRGVGFVTSPLLHEQVRGTINHELIHISDWFPTLTRLAGGDLMG-GG 366

Query: 421 NIDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
            +DG +QW ++   TPS R     NID L      L + P    S   N+  +   AA+R
Sbjct: 367 RLDGYNQWETISQGTPSLRTEILLNIDPLYTAVRNLRDDP-WAESTCFNVSMR---AAIR 422

Query: 478 LDSWKLVLGTQENGT 492
           +  WKL+ G   NG+
Sbjct: 423 VGDWKLITGGAGNGS 437



 Score = 52.8 bits (125), Expect = 5e-04,   Method: Compositional matrix adjust.
 Identities = 23/61 (37%), Positives = 37/61 (60%)

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           L+N+  DP E+ + +  RPDI+ +L + L ++ RT VP +  Q ++  ADP  +   WSP
Sbjct: 461 LYNVHVDPYERMDYSHERPDIAEELLQRLGHYYRTSVPPNFPQQEVRSADPANYGGVWSP 520

Query: 629 W 629
           W
Sbjct: 521 W 521


>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
          Length = 543

 Score =  219 bits (559), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 155/500 (31%), Positives = 238/500 (47%), Gaps = 74/500 (14%)

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           R  L++  Y IHTG+Q   I   +P  +PL    LP+ L+E+GYST  +GKWHLGF+++E
Sbjct: 71  RCGLLS-LYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYKKE 129

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDL 250
             P  RGF++ FG L G   YY H   D    +  + G+D+  N + AWD   G Y+T +
Sbjct: 130 CMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQM 185

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           +T+   Q++      KP+FLY+A+ A H+      L+AP      ++ I + NRR YAAM
Sbjct: 186 YTQRVQQILASHNPTKPIFLYIAYQAVHSP-----LQAPGRYFEHYRSIININRRRYAAM 240

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKN 369
           +  LD+++  V  AL+  G   NSIII+ SDNG  PT            GSN+P RG K 
Sbjct: 241 LSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKG 290

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           T WEGG++    + SP ++    V  +++HI+DW PTL + A G        +DG D W 
Sbjct: 291 TYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEH-TQLDGYDVWE 349

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
           ++   +   R+  +D L     +     +   +    I      +A+R+  WKL+ G   
Sbjct: 350 TI---SEGFRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN-- 404

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
                GY     S+ VP   F                       SN+   R   ++ T+ 
Sbjct: 405 ----PGY-----SDWVPPQAF-----------------------SNLGPNRWHNERITLS 432

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
            G +              +LFN+  DP E+ +++S  P I  +L + L    +T VP  +
Sbjct: 433 TGKS-------------IWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRY 479

Query: 610 EQPDLVQADPKRFNDTWSPW 629
              D  +++P+     W PW
Sbjct: 480 PPKD-PRSNPRLNGGVWGPW 498



 Score = 49.3 bits (116), Expect = 0.006,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 166 LYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHSP-----LQAPG 220

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 221 RYFEHYRSIININRRRYAAM 240


>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
 gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
          Length = 485

 Score =  219 bits (558), Expect = 3e-54,   Method: Compositional matrix adjust.
 Identities = 152/539 (28%), Positives = 241/539 (44%), Gaps = 97/539 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + ++ TP +D LA  G+ILN  Y   VCTPSR + MTG +P H G Q    
Sbjct: 39  GWNDVGWH-NPDVKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGYFPYHVGSQHTVF 97

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  + +G+P    FLPE L++LGY+T  +GK                             
Sbjct: 98  FPQQAQGIPSKFPFLPEKLKQLGYATHMVGK----------------------------- 128

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
                           +  D+ +  S      G+Y T  FT   V ++E  P D PLF+Y
Sbjct: 129 ----------------DDKDVVKTRS------GQYGTYFFTDRVVDIVEKHPADTPLFMY 166

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L +   H     + L+ PQ   + +  + D NRR    MV  LD++VG V  A+++ G+ 
Sbjct: 167 LPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVTMAMKKAGLW 221

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+++IF +DNG     +   S     G+NYP RG K TLWEGG +  A      +Q+  
Sbjct: 222 DNTLVIFTTDNGG----WISAS-----GNNYPLRGGKVTLWEGGTRGVAFAHGKMLQKTG 272

Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
             + +M+H  DW PT+  AAGG                          +  IDG+ Q  +
Sbjct: 273 YTNNEMIHAVDWFPTILAAAGGTA------------------------DEAIDGVSQLDT 308

Query: 452 LLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
           ++   PS R   + NID++   +  A+R+  +KL+ G    G  DG+          L +
Sbjct: 309 IMSGQPSPRTEFVYNIDDEFPTKQGAIRVGDYKLIEGYA--GKPDGWILPPNLTGSQLDS 366

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
            + ++ +  +         I   + N+   +   +   +       P +  P      +L
Sbjct: 367 GDPVIGTWLFNLKVSGDPVIGTWLFNLKAGQRRPRHRDLALQPESQPDSGDPVIG--TWL 424

Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           FNL +DP E +N+A S PD   ++   L  +R++LVP     PD  +++P  +   WSP
Sbjct: 425 FNLKDDPTEHHNLADSMPDKLREMQAKLAEYRKSLVPAIFPAPD-PKSNPDNWGGAWSP 482



 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 22/70 (31%), Positives = 34/70 (48%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G+Y T  FT   V ++E  P D PLF+YL +   H     + L+ PQ   + +  + D N
Sbjct: 139 GQYGTYFFTDRVVDIVEKHPADTPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDEN 193

Query: 73  RRTYAALTKS 82
           RR    +  +
Sbjct: 194 RRKLLGMVSA 203


>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 461

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 141/417 (33%), Positives = 203/417 (48%), Gaps = 50/417 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+SFHGS +IPTPNID LA +G+ L++ Y Q     +R++LMTG YPIHTG++    
Sbjct: 20  GWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATRSALMTGLYPIHTGIKHASH 79

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                R +PL+   +  ++         +   H+GF     +       + + + NG  S
Sbjct: 80  ICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHISKGAHAVNASW-WTNGYWS 138

Query: 212 Y-YDHILSDQYSRTVEL----NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
             + H    Q     E+    N    R   S  W     +   +                
Sbjct: 139 KRFIHAFFLQLFMNAEIFFFFNQWPCREMRSPKWKIAPLHNCSV---------------- 182

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PL LYLAH A H GNA   LEAP   + +F+YI D  RR +A MV  LD SVG V  ALQ
Sbjct: 183 PLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAALDQSVGDVFRALQ 242

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           RKG+LE++I++  SD+G       +       GSN+P RG K TLWEGGV+   +LWS +
Sbjct: 243 RKGILEDTIVLLTSDSGGAAGGVEQGV-----GSNWPLRGTKKTLWEGGVRAVGLLWSAK 297

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           + +  RV+ Q++H S + P     + GD + L                       +IDG 
Sbjct: 298 LARG-RVASQLIHRSAYFPLTNADSCGDITDL----------------------GSIDGK 334

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
           D W+ L+ + PS R  +L NID + + AA+R+  +KLVLG   +   D +Y    S+
Sbjct: 335 DLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLVLGASSSDQFDDWYEPIESD 391



 Score = 52.0 bits (123), Expect = 0.001,   Method: Compositional matrix adjust.
 Identities = 23/47 (48%), Positives = 29/47 (61%)

Query: 36  PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
           PL LYLAH A H GNA   LEAP   + +F+YI D  RR +A +  +
Sbjct: 183 PLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAA 229


>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
          Length = 512

 Score =  217 bits (552), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 174/571 (30%), Positives = 262/571 (45%), Gaps = 98/571 (17%)

Query: 59  QETINQFQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNE---IPTPNIDALA 113
           + TI+ F  I      T A  TK   + L++    G+ND+ + G N      TP +D+LA
Sbjct: 7   KSTISSFLLICVVG--TVAEATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLA 64

Query: 114 YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLREL 173
            NG+ L N Y   VC+P+R +LMTG+  I  G+    I   +  G+PL    LPE L   
Sbjct: 65  ENGVRLENYYTHSVCSPTRGALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNC 124

Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR 233
           GY+T+ IGKWHLGF   +Y P  RGF   +G+L G  +Y+   L    +R   + G D  
Sbjct: 125 GYNTQMIGKWHLGFSSSKYAPWNRGFHGFYGFLAGSENYWSKWLP--MARHSNIGGVDFT 182

Query: 234 RNLSTAW-DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 292
            + +    +T G+Y+  ++   A  +I+     KPLFLYL     H       L AP   
Sbjct: 183 DSTTGPTNETWGQYSAHVYASRARYVIQHHDQSKPLFLYLPLQTPHT-----PLGAPSHY 237

Query: 293 INQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET 352
              F+ I D +R  Y +MV  LD++V  V + L+  GM E++++IF +DNG         
Sbjct: 238 YEPFKDIEDDDRMKYLSMVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEV------ 291

Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP---QIQQNPRVSLQMMHISDWLPTLYT 409
              ++ G+N+PYRG KNTL+EGGVK    +       +    RV+ +++H++DW PT+  
Sbjct: 292 ---KHGGNNWPYRGTKNTLYEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMA 348

Query: 410 AAGGDTSRLPLN----IDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNS 462
           AA  D   +  N    + GL+QW +L     S+R      ID +++ S +    P  R  
Sbjct: 349 AA--DCPYVSGNSRNKLYGLNQWGALRGERGSQRTEIVHTIDPMNRTSEI----PDMRFG 402

Query: 463 VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
             IN+      AA+ +  WKL+ G   +G   G+Y              A  ESK     
Sbjct: 403 FNINVK-----AAICVRKWKLITGNPGHG---GWY--------------APPESK----- 435

Query: 523 QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI 582
                       +I     TR    I                    LF+L  DP E+N I
Sbjct: 436 ------------DIFTKSGTRGSKNIQ-------------------LFDLSVDPLEENEI 464

Query: 583 ASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
           ++  P + S+L E L  +  +  P  ++  D
Sbjct: 465 SNEHPTVVSELLERLAKYSLSAPPAINQNTD 495


>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
          Length = 511

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 160/573 (27%), Positives = 261/573 (45%), Gaps = 93/573 (16%)

Query: 62  INQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN 121
           + +  YI   +   +A    +  + +    G +D+S HG+ +I TPN+D LA +G++LNN
Sbjct: 1   MKKLLYIFGLSANIFAEQQPNIVVIVADDLGKHDVSMHGA-DIYTPNLDMLARDGVLLNN 59

Query: 122 MYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
            Y QPVC+P+R SLMTG+YP   G+Q   + G  P G+PL E  +P+Y++E GY T  +G
Sbjct: 60  YYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
           KW LGFF+  Y P  RGF+  FG L G   YY      +      L G+D+R       D
Sbjct: 120 KWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRD 179

Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YI 299
           T  +Y   L+  +A +         PL++Y+A  + H     + L+APQ+ ++ +     
Sbjct: 180 TSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQ 234

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               +R +A MV  LD +VG ++  L+  G+  N++I F +DNG            R   
Sbjct: 235 MSNGQRVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG------------REDD 282

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSR 417
           S++  +G KN  W+GG+        P I +  R      + H+SD LPT+ +AA   T  
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHT-- 340

Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
           LP  +                    IDG D WS L  ++        +  DEK++   ++
Sbjct: 341 LPPKV--------------------IDGHDLWSRLQGDS--------LGQDEKRKEIIMQ 372

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPL-LNFNAIVESKTYQSLQQLSQNIFLPISNI 536
           ++   L+   +E+         TR+  V +  ++  IV S +                 +
Sbjct: 373 MNP--LLRLWEEDTRFQNSTWDTRTQGVIIDGDYKLIVGSHS-----------------V 413

Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYEL 596
            + +++  Q                      +LFN+ NDP E N+++ S+P+++ +L   
Sbjct: 414 LRRKASDYQTV--------------------WLFNIKNDPQEANDLSGSKPNLTHKLLHR 453

Query: 597 LKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
           L      +    H  PD   +DP    D W P+
Sbjct: 454 LYKEMDNVRDPWHPLPDPA-SDPALHGDFWGPY 485


>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
          Length = 551

 Score =  216 bits (551), Expect = 2e-53,   Method: Compositional matrix adjust.
 Identities = 164/544 (30%), Positives = 254/544 (46%), Gaps = 90/544 (16%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ND+ F   N I +PN+DALA +GIIL N Y  P C+PSR S M+G+Y   + MQ   
Sbjct: 36  YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRYSYKSAMQHGV 94

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +P+ + L    LP YL+ELGY T A GKWHLG+ R E TP +RGF+S  G  +G  
Sbjct: 95  ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 154

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            +Y+H        T     +D        +D +G+++ DL      + +++     PLF+
Sbjct: 155 EFYEH-------TTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 207

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISAL 325
           Y+A    H         +P +   +F  + D      +R+ Y  +V  +D  +G ++  +
Sbjct: 208 YVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDKI 258

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +  G+ EN+ I+F SDNG    E          G N P+RG K+TLWEGG K  +   SP
Sbjct: 259 KEIGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSP 308

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
            +      +  +MH++DWLPT+   AGG    +P + DGL                  DG
Sbjct: 309 LLGATGVENNGLMHVTDWLPTIVDLAGG---SVPAS-DGL------------------DG 346

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYG 498
           + Q   ++ N  S R +++ NID +           A R   +KL+ G    G  DGY  
Sbjct: 347 VVQTDMVINNGDSARTTMIYNIDRESDIGQPKFGEMACRNLQYKLIWGFP--GQQDGYGA 404

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR------STRQQATIHCGA 552
             +           I +   Y   Q+ +Q       + DK R      +  Q + ++   
Sbjct: 405 DAK----------YIYDIPFY---QEAAQAYIDAEGSTDKKRGGPYVLTADQTSVLNNYM 451

Query: 553 NPAPMTPSPCTN--GPCYLFNLGNDPCEQNNIA-SSRPDISSQLYELLKYHRRTL----- 604
           +   ++     +  G   LF+L NDP E  +++ S+ PD+ +   EL+K  +  +     
Sbjct: 452 SSVHVSDQELADGTGSMLLFDLINDPNETTDLSESTDPDVVAAKAELIKMIQDAVNDGGY 511

Query: 605 VPQS 608
           VPQS
Sbjct: 512 VPQS 515


>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
          Length = 475

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 132/391 (33%), Positives = 202/391 (51%), Gaps = 46/391 (11%)

Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFL 166
           PN+ +L YN                ++      + IHTG+Q   IW  +P  +PL E+ L
Sbjct: 30  PNLGSLVYN-------------VEEQSCPFFRNFQIHTGLQHQIIWPCQPSCIPLDEKLL 76

Query: 167 PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY-SRTV 225
           P+ L+E GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   YY H       +  V
Sbjct: 77  PQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTVIDALNV 136

Query: 226 ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
                D R     A      Y+T++FT+ A  LI   P +KPLFLYLA  + H     + 
Sbjct: 137 TRCALDFRDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EP 191

Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
           L+ P+E +  + +I D NRR YA M   +D++VG V +AL+ +G+  N++ IF +DNG  
Sbjct: 192 LQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQ 251

Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
           T+           G+N+P RG K +LWEGGV+    + SP +++    + +++HISDWLP
Sbjct: 252 TLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLP 302

Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI 465
            L    G D ++    +DG D W+++   +PS R   +  +D     +   P   NS+ +
Sbjct: 303 PL---XGTDGTK---PLDGFDVWNTISEGSPSPRMELLHNIDPDFVDISLCPG--NSMAL 354

Query: 466 NIDEKKRT----------AAVRLDSWKLVLG 486
             D+              AA+R  +WKL+ G
Sbjct: 355 AKDDSSLLEYSAFNTSIHAAIRHQNWKLLTG 385



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI   P +KPLFLYLA  + H     + L+ P+E +  + +I D NRR
Sbjct: 157 YSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRR 211

Query: 75  TYAALT 80
            YA + 
Sbjct: 212 HYAGMA 217


>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
          Length = 529

 Score =  215 bits (548), Expect = 5e-53,   Method: Compositional matrix adjust.
 Identities = 164/544 (30%), Positives = 254/544 (46%), Gaps = 90/544 (16%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ND+ F   N I +PN+DALA +GIIL N Y  P C+PSR S ++G+Y   + MQ   
Sbjct: 14  YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGV 72

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +P+ + L    LP YL+ELGY T A GKWHLG+ R E TP +RGF+S  G  +G  
Sbjct: 73  ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 132

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            +Y+H        T     +D        +D +G+++ DL      + +++     PLF+
Sbjct: 133 EFYEH-------TTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 185

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISAL 325
           Y+A    H         +P +   +F  + D      +R+ Y  +V  +D  +G ++  +
Sbjct: 186 YVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDKI 236

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +  G+ EN+ I+F SDNG    E          G N P+RG K+TLWEGG K  +   SP
Sbjct: 237 KEIGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSP 286

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
            +      +  +MH++DWLPT+   AGG    +P + DGL                  DG
Sbjct: 287 LLGATGVENNGLMHVTDWLPTIVDLAGG---SVPAS-DGL------------------DG 324

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYG 498
           + Q   ++ N  S R +++ NID +           A R   +KL+ G    G  DGY  
Sbjct: 325 VVQTDMVINNGDSARTTMIYNIDRESDIGQPKFGEMACRNLQYKLIWGFP--GQQDGYGA 382

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR------STRQQATIHCGA 552
             +           I +   Y   Q+ +Q       + DK R      +  Q + ++   
Sbjct: 383 DAK----------YIYDIPFY---QEAAQAYIDAEGSTDKKRGGPYVLTADQTSVLNNYM 429

Query: 553 NPAPMTPSPCTN--GPCYLFNLGNDPCEQNNIA-SSRPDISSQLYELLKYHRRTL----- 604
           +   ++     +  G   LF+L NDP E  +++ S+ PDI +   EL+K  +  +     
Sbjct: 430 SSVHVSDQELADGTGSMLLFDLINDPNETTDLSESTDPDIVAAKAELIKMIQDAVNDGGY 489

Query: 605 VPQS 608
           VPQS
Sbjct: 490 VPQS 493


>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
          Length = 873

 Score =  215 bits (548), Expect = 6e-53,   Method: Compositional matrix adjust.
 Identities = 154/498 (30%), Positives = 234/498 (46%), Gaps = 71/498 (14%)

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           S+    Y IHTG+Q   I   +P  +PL    LP+ LR+ GYST  +GKWHLGF++R   
Sbjct: 415 SVFHSGYQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCL 474

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFT 252
           P  RGF+S FG L G   YY H   +       + G+D+      AW+   G Y+T +FT
Sbjct: 475 PTQRGFDSFFGSLLGSGDYYSHYKCE----VPGMCGYDLYEGEEAAWEQDRGLYSTVMFT 530

Query: 253 KEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           ++AV ++    P   PLFLYLA+ A H+      L+ P   + +++ I++ +RR YAAMV
Sbjct: 531 RKAVSILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMV 585

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             LD+++  +  AL+R G  +N++I++ SDNG   +           GSN+P RG K T 
Sbjct: 586 SCLDEAIHNLTLALKRYGYYDNTVIVYSSDNGGQPLAG---------GSNWPLRGSKATY 636

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGG++    + SP + +       ++HI+DW PTL T   G T    LN+DG D W ++
Sbjct: 637 WEGGIRAVGFVHSPLLMKKGTKCRSLIHITDWFPTLVTLGEG-TLDEDLNLDGYDVWEAI 695

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
               PS R   +D L     + +   +        +      AA+R+  WKL+ G     
Sbjct: 696 SEGRPSPR---LDILHNIDPIYIKAKNGSWKAGFGLWNTAVQAAIRVGHWKLLTG----- 747

Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG 551
            + GY     S+ VP   F          S Q+L+          D  +S          
Sbjct: 748 -VPGY-----SDWVPPQTF----------SKQRLTNRWHNERVRWDGGKS---------- 781

Query: 552 ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
                           +LFN+  DP E+ +++   P I   +   L  + RT VP  +  
Sbjct: 782 ---------------LWLFNITADPYERVDLSQHYPHIVKTMLLRLAQYNRTAVPVRYPA 826

Query: 612 PDLVQADPKRFNDTWSPW 629
            DL +++P+     W PW
Sbjct: 827 KDL-RSNPQYNGGVWGPW 843



 Score = 47.0 bits (110), Expect = 0.033,   Method: Compositional matrix adjust.
 Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 13/96 (13%)

Query: 8   AWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           AW+   G Y+T +FT++AV ++    P   PLFLYLA+ A H+      L+ P   + ++
Sbjct: 516 AWEQDRGLYSTVMFTRKAVSILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERY 570

Query: 66  QYITDPNRRTYAALTKST-----TLTL-LIVYGWND 95
           + I++ +RR YAA+          LTL L  YG+ D
Sbjct: 571 KGISNLSRRKYAAMVSCLDEAIHNLTLALKRYGYYD 606


>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
 gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
          Length = 434

 Score =  214 bits (546), Expect = 9e-53,   Method: Compositional matrix adjust.
 Identities = 151/426 (35%), Positives = 222/426 (52%), Gaps = 55/426 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +GS+ + TP++D LA  GI   N Y+  PVC+PSRASL+TGKYP   G+    
Sbjct: 15  GYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVT--S 72

Query: 151 IWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           I G +   +G+ L +  L   L+E GY T   GKWHLG    EY P   GF+  +G+  G
Sbjct: 73  ILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGA-SAEYGPNAHGFDQFYGFRAG 131

Query: 209 VISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            I YY HI    + +   +N  HD+ RN +  W+  GEY T+  T+EA   I+  P D+P
Sbjct: 132 CIDYYSHIF--YWGQGGGVNPVHDLWRNETEVWEN-GEYMTEAITREATSYIDAAPDDEP 188

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            F+Y+A+ A H       + AP+  +++F  +  P+RR  AAM+  +DD VG ++ AL++
Sbjct: 189 YFMYVAYNAPH-----YPMHAPKAYLDRFPDLP-PDRRIMAAMIAAVDDGVGEIVKALKQ 242

Query: 328 KGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           KG  E++II F SDNG P+ E R     T +    GS   +RG K +L+EGG++ PAIL 
Sbjct: 243 KGAYEDTIIFFSSDNG-PSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILS 301

Query: 384 SPQ--IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
            P    +Q  ++S +M  + D  PT+                       L L+       
Sbjct: 302 YPAGLAEQQGQISDEMFAMMDIFPTM-----------------------LELSGIGTEGY 338

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
           ++DG   + +L  N  S R  +     E +   AVR   WKLVL    NG +D  + +T 
Sbjct: 339 SLDGHSVFDALSGNALSPRKQLFW---EYEGQLAVREGKWKLVL----NGKLD--FSRTE 389

Query: 502 SNKVPL 507
           ++ V L
Sbjct: 390 ADAVHL 395



 Score = 46.6 bits (109), Expect = 0.046,   Method: Compositional matrix adjust.
 Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 7/81 (8%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
           RN +  W+  GEY T+  T+EA   I+  P D+P F+Y+A+ A H       + AP+  +
Sbjct: 156 RNETEVWEN-GEYMTEAITREATSYIDAAPDDEPYFMYVAYNAPH-----YPMHAPKAYL 209

Query: 63  NQFQYITDPNRRTYAALTKST 83
           ++F  +  P+RR  AA+  + 
Sbjct: 210 DRFPDLP-PDRRIMAAMIAAV 229


>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
          Length = 427

 Score =  213 bits (541), Expect = 3e-52,   Method: Compositional matrix adjust.
 Identities = 127/374 (33%), Positives = 192/374 (51%), Gaps = 23/374 (6%)

Query: 62  INQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN 121
           + +  YI   +   +A    +  + +    G +D+S HG+ +I TPN+D LA +G++LNN
Sbjct: 1   MKKLLYIFGLSANIFAEQQPNIVVIVADDLGKHDVSMHGA-DIYTPNLDMLARDGVLLNN 59

Query: 122 MYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
            Y QPVC+P+R SLMTG+YP   G+Q   + G  P G+PL E  +P+Y++E GY T  +G
Sbjct: 60  YYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
           KW LGFF+  Y P  RGF+  FG L G   YY      +      L G+D+R       D
Sbjct: 120 KWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRD 179

Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYI 299
           T  +Y   L+  +A +         PL++Y+A  + H     + L+APQ+ ++ +     
Sbjct: 180 TSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQ 234

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               +R +A MV  LD +VG ++  L+  G+  N++I F +DNG            R   
Sbjct: 235 MSNGQRVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG------------REDD 282

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSR 417
           S++  +G KN  W+GG+        P I +  R      + H+SD LPT+ +AA   T  
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342

Query: 418 LPLNIDGLDQWSSL 431
            P  IDG D WS L
Sbjct: 343 -PKVIDGHDLWSRL 355


>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
 gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
          Length = 500

 Score =  212 bits (540), Expect = 4e-52,   Method: Compositional matrix adjust.
 Identities = 133/416 (31%), Positives = 202/416 (48%), Gaps = 58/416 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
           G+NDL +  S  + TPN+  LA+  N  +L N Y   +CTP+R++ MTG YP   G Q  
Sbjct: 42  GFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
                EP GVP    FL E +R+L YST  +GKWHLG+ ++E+ P  RGF+  +G+    
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160

Query: 210 ISYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQ 262
             Y++H  +DQY R ++  + G D+   +        +   G Y+TDLFT  A+ ++++ 
Sbjct: 161 TGYFNHS-ADQYHRELKRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNH 219

Query: 263 PVDKPLFLYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
              KP F++L++ A H                 H E P+        + D  RR Y  M+
Sbjct: 220 NTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSFDKYCSRVKDRYRRIYCGML 279

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             +D ++G ++  L+   + EN++I+F SDNG        TSN+    SN P RG K+T+
Sbjct: 280 TAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG-------TSNF--GASNAPLRGEKDTI 330

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           WEGG K    + SP   +       M H+ DW  T+ +  G       L ID        
Sbjct: 331 WEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITG-------LEID-------- 375

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
                    S  DG++QW  +    P  RR   + NID     +A+R   +KL++G
Sbjct: 376 ---------SYGDGINQWEYIKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 420



 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 11/78 (14%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-----------AGNAGKHLEAPQET 61
           G Y+TDLFT  A+ ++++    KP F++L++ A H                 H E P+  
Sbjct: 201 GVYSTDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSF 260

Query: 62  INQFQYITDPNRRTYAAL 79
                 + D  RR Y  +
Sbjct: 261 DKYCSRVKDRYRRIYCGM 278


>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 263

 Score =  212 bits (540), Expect = 5e-52,   Method: Compositional matrix adjust.
 Identities = 110/269 (40%), Positives = 159/269 (59%), Gaps = 26/269 (9%)

Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
           G+D+R N++  +   G Y T++FT +A Q+I+      PLFLYLAHLA H+GN    LEA
Sbjct: 2   GYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEA 61

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
           P++ +++F +I D NR+T+A MV  LD+SVG +   L+  GM  N+IIIF +DNG P   
Sbjct: 62  PEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG 121

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
           +       N  SN+P RG+K TLWEGGV+    + SP +++  RVS  M+H+ DWLPT+Y
Sbjct: 122 FNN-----NHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIY 176

Query: 409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINID 468
             AGG+ S LP N+DG + W +L     SR                 + S R  +L NID
Sbjct: 177 HIAGGNVSSLPTNLDGFNVWDTL-----SR----------------GSVSPRTEILHNID 215

Query: 469 EKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
              + AA+R+  +K+++G    G  DG+Y
Sbjct: 216 PILQIAALRVGKYKIIMGECYRGNWDGWY 244



 Score = 77.0 bits (188), Expect = 3e-11,   Method: Compositional matrix adjust.
 Identities = 34/82 (41%), Positives = 54/82 (65%)

Query: 1  MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
          +R N++  +   G Y T++FT +A Q+I+      PLFLYLAHLA H+GN    LEAP++
Sbjct: 5  LRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEAPEK 64

Query: 61 TINQFQYITDPNRRTYAALTKS 82
           +++F +I D NR+T+A +  +
Sbjct: 65 YMSRFPHIQDKNRKTFAGMVSA 86


>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  212 bits (539), Expect = 6e-52,   Method: Compositional matrix adjust.
 Identities = 169/580 (29%), Positives = 263/580 (45%), Gaps = 99/580 (17%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
            YA   K   + +   +GW D+SFH +  I TPNID L   G+ L N Y Q +CTP+R++
Sbjct: 17  VYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSA 76

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK-AIGKWHLGFFRREYT 193
           L+TG+YPIHTG+Q   I  ++  G+   E  LPEYL       +  +GKWH+G      T
Sbjct: 77  LLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMT 136

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS----TAWDTVGEYATD 249
           P  RGFE+  GYL G   +Y      + +    ++  +   +LS    ++W   G+Y+ D
Sbjct: 137 PWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSW---GKYSGD 193

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           L+ ++  ++++     K  F+Y A    H       L+A +  + ++ +I D NRR YAA
Sbjct: 194 LYLQKMSEIVKSIDPTKDSFIYFAPQHVHYP-----LQALERHLVKYSWIKDTNRRNYAA 248

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV  LD+ VG VI    +  +L+N+I  F++DNG  T         R+ G+N+P  G K 
Sbjct: 249 MVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGET---------RDGGNNWPLSGQKW 299

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           TL EGGVK    ++   +       L   H+SD LPT+  A                   
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL--FHVSDILPTVMEA------------------- 338

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--------RT-------- 473
               N P ++   IDG  QW +++ +    R+ +L NID  K        RT        
Sbjct: 339 ---FNCPIKK--KIDGYAQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVR 393

Query: 474 --AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL 531
             AA+R   WKL+ G        G+     S++VP      +   + +Q     ++ +  
Sbjct: 394 VQAALRKGPWKLITGDP------GF----TSDEVPEF---PVSPPEWHQQGPYKTRGLIS 440

Query: 532 PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
            I+     +S+      H              N    L+ +  DP E  +I+   P I  
Sbjct: 441 QITG----KSSDYYFDNH-------------RNKLVRLYKIDEDPTESRDISDQNPSIVE 483

Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQ-ADPKRFNDTWSPWI 630
           ++  +L  +  T+VP   + PDL   ++PK     W PWI
Sbjct: 484 EMLNILIKYNSTMVPP--QLPDLDPFSNPKLHEGFWFPWI 521


>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
          Length = 898

 Score =  211 bits (538), Expect = 8e-52,   Method: Compositional matrix adjust.
 Identities = 138/467 (29%), Positives = 221/467 (47%), Gaps = 110/467 (23%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +GKWH+GF+++EY P  RGF+S+FGYL G   YY H     +    ++ G D+R N   A
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTH-----HRCYGKMCGMDLRENAVPA 580

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
            +  G Y+T LF ++A  +++    D+PLFLYL   + H+      L+ P++ I  + +I
Sbjct: 581 -NYSGIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHSP-----LQVPEKYIQPYLHI 634

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            D  RR YA MV  +D+++G +    ++KG+  N++++F +DNG   +E          G
Sbjct: 635 KDKKRRKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQILE---------GG 685

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           +NYP RG K +LWEGG+     +    +++   VS  +MH++DW PTL + AGG      
Sbjct: 686 NNYPLRGWKASLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGG------ 739

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLL-NTPSRRNSVLINID---------- 468
            N++G                  +DG+DQW ++   +T ++R+S+L NID          
Sbjct: 740 -NLNG---------------TKPLDGVDQWKTISEGSTVNQRSSILHNIDPLFQPVGHRL 783

Query: 469 -----EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQ 523
                + +  AAVR   WKL+ G+  NG+                     V     +SL 
Sbjct: 784 PKSPFDNRIRAAVRFGDWKLITGSPGNGSW--------------------VPPPHMKSLH 823

Query: 524 QLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
           Q+  N   P  NI                               +LFN+  DP E  ++A
Sbjct: 824 QMEDNSEPPEKNI-------------------------------WLFNITADPYEHKDVA 852

Query: 584 SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
              P +  Q+ +LL  + +T VP  + + D  +A+PK     W PW+
Sbjct: 853 HINPKVVMQMLDLLAQYNQTAVPCRYPKMD-EKANPKYLGGYWGPWL 898



 Score = 68.6 bits (166), Expect = 9e-09,   Method: Compositional matrix adjust.
 Identities = 33/72 (45%), Positives = 49/72 (68%), Gaps = 5/72 (6%)

Query: 75  TYAALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           T+    KS+   +L V    +G+ND+ +HGS EI TPN+D LA +GI L N Y QP+CTP
Sbjct: 17  TFPVQGKSSQPHILFVLADDFGYNDIGYHGS-EIKTPNLDKLAGDGIRLENYYVQPICTP 75

Query: 131 SRASLMTGKYPI 142
           +R+ L++G+Y +
Sbjct: 76  TRSQLLSGRYQV 87



 Score = 48.1 bits (113), Expect = 0.013,   Method: Compositional matrix adjust.
 Identities = 23/70 (32%), Positives = 39/70 (55%), Gaps = 5/70 (7%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           G Y+T LF ++A  +++    D+PLFLYL   + H+      L+ P++ I  + +I D  
Sbjct: 584 GIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHSP-----LQVPEKYIQPYLHIKDKK 638

Query: 73  RRTYAALTKS 82
           RR YA +  +
Sbjct: 639 RRKYAGMVAA 648


>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 486

 Score =  211 bits (537), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 138/411 (33%), Positives = 202/411 (49%), Gaps = 48/411 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ F+G  EI TPNIDALA  G   +  Y Q +CTP+RA LMTG++P   G+Q   I
Sbjct: 40  GWKDVGFNGCTEIKTPNIDALAKGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A   G+  +E  +P+ L + GY T  IGKWHLG   ++Y P  RGF+  +G + G + 
Sbjct: 100 PTAAGYGLDTSEYLMPQCLGDAGYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGELD 159

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H   D++         D  R+     +    Y T L   +AV+ I  Q   KP +LY
Sbjct: 160 YFTH---DEHGVL------DWFRDNKPVHEQ--GYTTTLIGDDAVKYIHGQDGKKPFYLY 208

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L   A H        +AP+E I ++  I +P RRTYAAMV  LD+++G V++AL +KG+ 
Sbjct: 209 LTFNAPHT-----PYQAPKEYITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLR 263

Query: 332 ENSIIIFMSDNGAP-----TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
           EN++I F SDNG         +  + S       N PYR  K +L+EGG +V A+  W  
Sbjct: 264 ENTLIFFHSDNGGTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPG 323

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           +I+   +    M+H  D  PT    AG   ++                         +DG
Sbjct: 324 KIKA--QTVDGMIHAVDLYPTFAALAGASIAKC----------------------KPLDG 359

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
            + W ++    PS R     +I+  +  A +R   WKL+  T    ++D Y
Sbjct: 360 TNVWDTIAEGKPSPRTEFFYSIEPFR--AGLRQGDWKLIWRTMLPSSVDLY 408



 Score = 46.6 bits (109), Expect = 0.039,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T L   +AV+ I  Q   KP +LYL   A H        +AP+E I ++  I +P RR
Sbjct: 183 YTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKYLNIAEPTRR 237

Query: 75  TYAALT 80
           TYAA+ 
Sbjct: 238 TYAAMV 243


>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 653

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 164/573 (28%), Positives = 263/573 (45%), Gaps = 83/573 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FH +    TPN+DA+   G+ L+  Y  P CTPSRA LMTG+Y    GMQ   +
Sbjct: 48  GWKDVGFHDTT-FSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EPRGVPLTE F+ E L+  GYST A+GKWHLG    ++ P+ RGF+  +G L G   
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTGGGG 166

Query: 212 YYDHI-LSDQYS------RTVELNGHDMRRNLSTAWDT--VGEYATDLFTKEAVQLIEDQ 262
           +Y H+ +S +++      +T   +G ++      + D      ++T+L+TK+A + +E  
Sbjct: 167 HYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQDTRTHSTELYTKKAAEYVEAM 226

Query: 263 PV--DKPLFLYLAHLAAHA----------GNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
               D+P FLYL++ A H           G + + + A   +I +       NRR    M
Sbjct: 227 SAKEDQPWFLYLSYQAIHDPIETDDSWINGRSCESISAENSSIGETDETDYDNRRIACGM 286

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKN 369
           V ++D+ +G +   L+  G  +N+++IF SDNG             + GS N P+RG K 
Sbjct: 287 VAQMDNGLGALRGLLENLGEWDNTVVIFFSDNGG----------LPSHGSVNRPFRGGKG 336

Query: 370 TLWEGGVKVPAILWSPQIQQ-------NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
             WEGGV VPA +    +          P     + H++D   T+   AG     +P   
Sbjct: 337 DYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATILGLAG-----VPGAD 391

Query: 423 DGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDS 480
           DG                + +DG+D W SL+      R +V+IN++      + AVR  +
Sbjct: 392 DG------------GNGEAELDGIDLWDSLVETKAPARQAVVINLNSPNFASSGAVRWSN 439

Query: 481 WKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL--PISNIDK 538
           +KL+   +   T+   Y + +S  V       + ES    ++  +   I L  P   +  
Sbjct: 440 YKLIRNPEPKETV--IYSRVQSKLVE--EEGVVSESTLASAISTVQAQIHLSEPKMYMFD 495

Query: 539 MRSTRQQATIH-CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
           + +   ++ +  CG + A      C N    L+++   P  +        D   +L  ++
Sbjct: 496 LEANPSESVVEGCGTDRALGPVESCAN----LYSI---PAFR--------DARKKLEGII 540

Query: 598 -KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
            K  R ++ P    + D   ADP  +   W PW
Sbjct: 541 RKADRESVAPTLRWEDDGPLADPSNWGG-WVPW 572


>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
          Length = 532

 Score =  211 bits (536), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 168/580 (28%), Positives = 263/580 (45%), Gaps = 99/580 (17%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
            YA   K   + +   +GW D+SFH +  I TPNID L   G+ L N Y Q +CTP+R++
Sbjct: 17  VYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSA 76

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK-AIGKWHLGFFRREYT 193
           L+TG+YPIHTG+Q   I  ++  G+   E  LPEYL       +  +GKWH+G      T
Sbjct: 77  LLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMT 136

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS----TAWDTVGEYATD 249
           P  RGFE+  GYL G   +Y      + +    ++  +   +LS    ++W   G+Y+ D
Sbjct: 137 PWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSW---GKYSGD 193

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           L+ ++  ++++     K  F+Y A    H       L+A +  + ++ +I D NRR YAA
Sbjct: 194 LYLQKMSEIVKSIDPTKDSFIYFAPQHVHYP-----LQALERHLVKYSWIKDTNRRNYAA 248

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV  LD+ VG V+    +  +L+N+I  F++DNG  T         R+ G+N+P  G K 
Sbjct: 249 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGET---------RDGGNNWPLSGQKW 299

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           TL EGGVK    ++   +       L   H+SD LPT+  A                   
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL--FHVSDILPTVMEA------------------- 338

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--------RT-------- 473
               N P ++   IDG  QW +++ +    R+ +L NID  K        RT        
Sbjct: 339 ---FNCPIKK--KIDGYAQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVR 393

Query: 474 --AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL 531
             AA+R   WKL+ G        G+     S++VP      +   + +Q     ++ +  
Sbjct: 394 VQAALRKGPWKLITGDP------GF----TSDEVPEF---PVSPPEWHQQGPYKTRGLIS 440

Query: 532 PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
            I+     +S+      H              N    L+ +  DP E  +I+   P I  
Sbjct: 441 QITG----KSSDYYFDNH-------------RNKLVRLYKIDEDPTESRDISDQNPSIVE 483

Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQ-ADPKRFNDTWSPWI 630
           ++  +L  +  T+VP   + PDL   ++PK     W PWI
Sbjct: 484 EMLNILIKYNSTMVPP--QLPDLDPFSNPKLHEGFWFPWI 521


>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
          Length = 523

 Score =  210 bits (535), Expect = 1e-51,   Method: Compositional matrix adjust.
 Identities = 148/468 (31%), Positives = 220/468 (47%), Gaps = 84/468 (17%)

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH---- 230
           Y+T  +GKWHLG +R+E  P  RGF+++FGYL G   YY H   +  +    LNG     
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCAL 184

Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
           D+R     A +    Y+T++FTK A  LI + P +KPLFLYLA  + H       L+ P+
Sbjct: 185 DLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPE 239

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
           E +  + +I D +RR YA MV  LD++VG V  AL+ +G+  N+++IF +DNG  T    
Sbjct: 240 EYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---- 295

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
                R+ G+N+P RG K TLWEGG++    + SP ++Q    S ++MHI+DWLPTL   
Sbjct: 296 -----RSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNL 350

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLI 465
           AGG T      +DG D W ++   +PS R     N + D  D       NT   +N    
Sbjct: 351 AGGSTHGT-KPLDGFDVWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFP 409

Query: 466 NIDEKKRT---AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
                  T   A +R  +WKL+ G    G       Q+  ++VP              S+
Sbjct: 410 LEHSAFNTSIHAGIRYKNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SV 455

Query: 523 QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI 582
              ++ ++L                                      F++  DP E++++
Sbjct: 456 DSPTKTLWL--------------------------------------FDINRDPEERHDV 477

Query: 583 ASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           +   P I   L   L+Y+    VP S+  P   + DPK     WSPW+
Sbjct: 478 SREHPHIVQNLLSRLQYYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 523



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  LI + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 200 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 254

Query: 75  TYAALT 80
            YA + 
Sbjct: 255 IYAGMV 260


>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 506

 Score =  209 bits (532), Expect = 4e-51,   Method: Compositional matrix adjust.
 Identities = 159/555 (28%), Positives = 246/555 (44%), Gaps = 103/555 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H   ++  P ++ LA +G+I N  Y QP CTPSR++L TG YP     Q   +
Sbjct: 34  GWNDVGWHNP-DLKMPILNQLAADGVIFNQSYVQPACTPSRSALFTGYYPFKIKRQHQML 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E  G+ L  + LPE L+++GY T  +GKWHLGF + EY P  RGF+S +G+L     
Sbjct: 93  LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDSFYGWLT---- 148

Query: 212 YYDHILSDQYSR-TVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
               + +D Y++  V   G+D R N     ++   Y   +  + AV ++     + PLFL
Sbjct: 149 ----LGTDLYTKENVLAPGYDFRDNTGVVQES-DTYLPFMLAERAVDIVMGHYKEYPLFL 203

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM-----------------VKK 313
             +         GK LE PQ+  + +  I D   R +  +                 +  
Sbjct: 204 EFS-----MALPGKFLEVPQDYEDLYSDIDDDRTRKFYGICNLNIVVSKNTKLPNRKLSA 258

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           +D+S+G ++ AL+ +GM ++++ IF SDNGA   E          GSN+P RG   TL+E
Sbjct: 259 MDNSIGLLVEALKTRGMWDDTLFIFTSDNGAFATES---------GSNWPLRGNIATLFE 309

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG +VP   +   +++   V+ +++HI D   T+   AG  T     +IDG+D W ++  
Sbjct: 310 GGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGATTES---DIDGMDMWKTISK 366

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
             PS R   +  +D                    +E    AA+R+  +KL+ G  +   M
Sbjct: 367 GEPSPRTEFVYNIDD-------------------NELSPGAAIRVGDYKLITGHPD--LM 405

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
             +    RS+               Y    +    I LP +                G  
Sbjct: 406 YPFRLLNRSD-------------DWYNYGDEPISGILLPPT----------------GNE 436

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
           P P   +       YLFN+ +DP E+NN+A   P+   +L   L  +R  LVP      D
Sbjct: 437 PPPANVT-------YLFNIIDDPEERNNLADVYPNKVEELRHRLDEYREHLVPPESRIFD 489

Query: 614 LVQADPKRFNDTWSP 628
               D   F   WSP
Sbjct: 490 -PSGDAVNFGGVWSP 503


>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
           sphaeroides KD131]
 gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
           KD131]
          Length = 509

 Score =  207 bits (528), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 145/459 (31%), Positives = 218/459 (47%), Gaps = 69/459 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS+ + TPN+D LA  G  L   Y QP+CTP+RA+LMTG+YP+  G+Q   I
Sbjct: 75  GYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRYPMRYGLQTGVI 133

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E  LP+ L+E GY T  +GKWHLG   ++Y P  RG +  +G L G I 
Sbjct: 134 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVDYFYGPLVGEID 193

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++ H   + +  T     ++M +           Y T+LF  +A++LIE+     PL++Y
Sbjct: 194 HFKH---EAHGITDWYRDNEMVKEPG--------YDTELFGADAIRLIEEHDSATPLYMY 242

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+  A H        +AP +  + +  I D  R+ YAAM+  +DD VG V+ AL+R+GM 
Sbjct: 243 LSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQALERRGMR 297

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGS--------NYPYRGVKNTLWEGGVKVPAIL- 382
           E++++IF SDNG        +  +   G+        N P R  K TL+EGG +V A+  
Sbjct: 298 EDTLVIFHSDNGG-----TRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALAN 352

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W  +I      +  MMH+ D LPTL   A  + +                          
Sbjct: 353 WPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAH----------------------AGQ 388

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           +DG+D W ++     S R  V+ NI+  +   A+R   WKL            Y+     
Sbjct: 389 LDGMDVWQAISAGKASPREEVVYNIEPTQ--GALRDGKWKL------------YWQPILP 434

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
            KV L +  A     T  S ++  Q   +    ID  RS
Sbjct: 435 PKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARS 473



 Score = 41.2 bits (95), Expect = 1.8,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 5/65 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T+LF  +A++LIE+     PL++YL+  A H        +AP +  + +  I D  R+
Sbjct: 217 YDTELFGADAIRLIEEHDSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRK 271

Query: 75  TYAAL 79
            YAA+
Sbjct: 272 AYAAM 276


>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 706

 Score =  207 bits (527), Expect = 1e-50,   Method: Compositional matrix adjust.
 Identities = 167/559 (29%), Positives = 253/559 (45%), Gaps = 105/559 (18%)

Query: 92  GWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GWND+ +   + +  TP++D LA  G+ + N Y   +CTP+RASLMTG+Y +  G+Q   
Sbjct: 142 GWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTGRYVMRYGLQYSV 201

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--- 207
           I    P G+PLTE+  PEY+++ GY T  IGKWH+G +   + P  RGF+++ GYLN   
Sbjct: 202 IQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRGFDTYLGYLNDEE 261

Query: 208 -----------------------GVISYYDHI---LSDQYSRTVELNGHDMRRNLSTAWD 241
                                      YYD I   + DQ       +G D    LS    
Sbjct: 262 MYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGTISDGDDDETMLSATSS 321

Query: 242 TV-------GEYATDLFTKEAVQLIEDQPV--DKPLFLYLAHLAAH---AGNAGKHLEAP 289
                    G Y+T +F   A+++I ++    D+PL++YLAH A H        +H    
Sbjct: 322 YSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQAVHDPLGLPPAEHFTPD 381

Query: 290 QETINQFQYI-------TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
           Q  ++  + I       T+  R  +A ++  LD ++G ++  L+  G +ENSIII  SDN
Sbjct: 382 Q--LDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLETNGWMENSIIIVASDN 439

Query: 343 GA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ---MM 398
           G  P+            GSNYP RG+K++ WEGG KVP+ ++S       R   +   +M
Sbjct: 440 GGCPSCG----------GSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEERWGTEYDGLM 489

Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
           H++DWLPT+  AA  D        DG+DQW  L+ +  +                    S
Sbjct: 490 HVTDWLPTIAGAAEIDLDGGAGPFDGVDQW-PLITSVAAEAEFG---------------S 533

Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKT 518
            R+ ++ N D       + +DS    +G  E     G +   RS K   L         T
Sbjct: 534 ARDEMMYNFDPY----ILWVDS-SDEIGDPEFNLAQGAF---RSGKYKYLANEWCTGWYT 585

Query: 519 YQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCYLFNLGNDP 576
           +     L++    P++N+          TI C  +P       CT  N   YLF+L  DP
Sbjct: 586 FDK-HILAEE---PLTNL----------TIACDGSPCSDCGQGCTSYNYTDYLFDLEADP 631

Query: 577 CEQNNIASSRPDISSQLYE 595
            E+ N+    PD++  + E
Sbjct: 632 REETNLIYDLPDVAHNMRE 650


>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
          Length = 274

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 103/218 (47%), Positives = 139/218 (63%), Gaps = 5/218 (2%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+SFHGS +IPTPN+DALA +GIILN  YAQ +CTPSRA+LMTG+YPI+TGMQ   I
Sbjct: 59  GWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRYPIYTGMQHFVI 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL  R +PE+  +LGY T  +GKWHLG F++++ P+ RGF+S +G+ N    
Sbjct: 119 QPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFDSFYGFYNADQD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY+  L++      E  G+D   N          Y+T  +T+ AV LI      +PLFLY
Sbjct: 179 YYNKTLTEG-----EHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSHNPSQPLFLY 233

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           L++     G     LEAP+E +N+F YI + NR  YA 
Sbjct: 234 LSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/64 (42%), Positives = 37/64 (57%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T  +T+ AV LI      +PLFLYL++     G     LEAP+E +N+F YI + NR 
Sbjct: 208 YSTHHYTERAVSLIRSHNPSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRT 267

Query: 75  TYAA 78
            YA 
Sbjct: 268 IYAG 271


>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
 gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
           AKP1]
          Length = 498

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 146/459 (31%), Positives = 217/459 (47%), Gaps = 69/459 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+ +HGS+ + TPN+D LA  G  L   Y QP+CTP+RA+LMTG YP+  G+Q   I
Sbjct: 64  GYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCYPMRYGLQTGVI 122

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E  LP+ L+E GY T  +GKWHLG   ++Y P  RGF+  +G L G I 
Sbjct: 123 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFDYFYGPLVGEID 182

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++ H   + +  T     ++M +           Y T+LF  +A++LIE+     PL++Y
Sbjct: 183 HFKH---EAHGITDWYRDNEMVKEPG--------YDTELFGADAIRLIEEHDSATPLYMY 231

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+  A H        +AP +  + +  I D  R  YAAM+  +DD VG V+ AL+R+GM 
Sbjct: 232 LSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVLQALERRGMR 286

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGS--------NYPYRGVKNTLWEGGVKVPAIL- 382
           E++++IF SDNG        +  +   G+        N P R  K TL+EGG +V A+  
Sbjct: 287 EDTLVIFHSDNGG-----TRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALAN 341

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W  +I      +  MMH+ D LPTL   A  + +                          
Sbjct: 342 WPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAH----------------------AGQ 377

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           +DG+D W ++     S R  V+ NI+  +   A+R   WKL            Y+     
Sbjct: 378 LDGMDVWQAISAGKASPREEVVYNIEPTQ--GALRDGKWKL------------YWQPILP 423

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
            KV L +  A     T  S ++  Q   +    ID  RS
Sbjct: 424 PKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARS 462



 Score = 39.7 bits (91), Expect = 4.5,   Method: Compositional matrix adjust.
 Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 5/65 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T+LF  +A++LIE+     PL++YL+  A H        +AP +  + +  I D  R 
Sbjct: 206 YDTELFGADAIRLIEEHDSATPLYMYLSFTAPH-----TPYQAPDKYKDLYPDIADEGRN 260

Query: 75  TYAAL 79
            YAA+
Sbjct: 261 AYAAM 265


>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
          Length = 318

 Score =  207 bits (526), Expect = 2e-50,   Method: Compositional matrix adjust.
 Identities = 118/323 (36%), Positives = 169/323 (52%), Gaps = 24/323 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+     + I TPN+DALA  G+IL N Y Q +C+PSR +LMTG+YP  + MQ   +
Sbjct: 14  GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTGRYPYKSAMQSFVV 72

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E +   L  + LP+YL+ELGY    IGKWHLG+ R E  P  RGF+S +G L+G   
Sbjct: 73  LPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFDSFYGLLDGAGD 132

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H  S  Y         D   N     +  G ++ DL      +L  +     PLFLY
Sbjct: 133 YWEHTTSGVY---------DWHLNDEVFHEAYGNHSQDLELDRLDKLFAEHDNKDPLFLY 183

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
            A    H  +          + + F  +    RR Y A+   LD  VG ++  L+   M+
Sbjct: 184 FAPQNPHVPSQPTEEYLSMYSADDFSDV----RRRYLALTSALDSMVGKIVQHLKDNCMM 239

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +N+ +IFMSDNGA   E          G N P+RG K +L+EGG K  + ++S  +++  
Sbjct: 240 DNTYLIFMSDNGADPGE----------GQNTPFRGGKTSLFEGGTKSNSFIYSQLLKKTE 289

Query: 392 RVSLQMMHISDWLPTLYTAAGGD 414
             +  +MHI+DWLPTL   AGGD
Sbjct: 290 YENDGLMHITDWLPTLVKLAGGD 312


>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
          Length = 418

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 147/438 (33%), Positives = 207/438 (47%), Gaps = 54/438 (12%)

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
           Y P  RGF++ +GY N    Y+   L+        + G D   N +   D  G Y T L 
Sbjct: 1   YLPTRRGFDTFYGYYNTAKYYFTQKLN-----FYNICGRDYWDNETLVTDKDGTYDTHLI 55

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           T  AV +I     DKPLF+Y+A LAAH       ++APQ  + +F YI D NR   A  V
Sbjct: 56  TDRAVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAV 115

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             LD+SVG +++AL ++ ML NS+++F SDNG          ++ N GSN+P RG K T+
Sbjct: 116 DALDESVGRIVAALHQRRMLANSVLVFTSDNGG-----IPWGDFSNTGSNWPLRGTKGTM 170

Query: 372 WEGGVKVPAILWSPQIQQNPRVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           WEGGV++PA++WS  +    RV +   MH+ DWLPTLY+AAGG    L  NIDG+D W  
Sbjct: 171 WEGGVRLPAVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAGGHIGDLG-NIDGIDLWPV 229

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL----- 485
           L     S +++       W          R   LIN D     +A R   +KLV      
Sbjct: 230 LSSAKTSSKST------SWP---------RTEFLINYDTDTGLSAYRDGDYKLVAVTTPK 274

Query: 486 -----------GTQENGTMDGYYGQTRSNKVPLLNFNAIVESK-TYQSLQQLSQNIFLPI 533
                      G Q N    G     R  K  +   N ++ S   +++LQ      F   
Sbjct: 275 SKDARYPFGDPGLQRNVPTPGGIPPDRCRKGAIKRLNTLMASSLAWRTLQG-----FYGD 329

Query: 534 SNIDKMRST----RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
             I +  S     R QA++ C      +          YLF++  DPCE NN+A S P +
Sbjct: 330 GEIGRAESLAPEWRVQASVRCNPRKETVPGLEQLREGNYLFDIKRDPCELNNLAHSLPSV 389

Query: 590 SSQLYELLK-YHRRTLVP 606
            ++L + L+ Y   T+ P
Sbjct: 390 VAKLSKKLQSYVAITMAP 407



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 37/69 (53%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D  G Y T L T  AV +I     DKPLF+Y+A LAAH       ++APQ  + +F YI 
Sbjct: 45  DKDGTYDTHLITDRAVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIG 104

Query: 70  DPNRRTYAA 78
           D NR   A 
Sbjct: 105 DHNRTLLAG 113


>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
 gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
          Length = 468

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 121/320 (37%), Positives = 180/320 (56%), Gaps = 32/320 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G++DL  HG  +IPTP++DALA +G+   + Y   P C+P+RA L+TG+Y    G +  P
Sbjct: 41  GYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 100

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
             G  P G   +PL+E  L + L+++GY T  +GKWHLG   + + PL RGF+  FG+L 
Sbjct: 101 --GPTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDEKRH-PLSRGFDEFFGFLG 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G  +Y+     +  + T  L G ++            EY TD F +EAV  I D+    P
Sbjct: 158 GARTYF-ATPGNASAGTKLLRGREVVDE--------KEYLTDAFAREAVAYI-DRSKASP 207

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            FLYL   A H       +EA Q+ +++F  ++DP R+ Y AM+  +DD+VG V++ L+R
Sbjct: 208 FFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLER 262

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
           + +LEN++I F+SDNG PT          N G N P RG K T WEGG++VP  + W  +
Sbjct: 263 EKLLENTLIFFVSDNGGPTAA--------NTGDNTPLRGFKATTWEGGIRVPYFVSWKGK 314

Query: 387 IQQNPRVSLQMMHISDWLPT 406
           I         ++ I D++PT
Sbjct: 315 IPAGKTYDQPVIQI-DFVPT 333



 Score = 48.1 bits (113), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY TD F +EAV  I D+    P FLYL   A H       +EA Q+ +++F  ++DP R
Sbjct: 186 EYLTDAFAREAVAYI-DRSKASPFFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKR 239

Query: 74  RTYAALTKS 82
           + Y A+  +
Sbjct: 240 QKYCAMMSA 248


>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 497

 Score =  206 bits (523), Expect = 4e-50,   Method: Compositional matrix adjust.
 Identities = 105/251 (41%), Positives = 151/251 (60%), Gaps = 9/251 (3%)

Query: 91  YGWNDLSFHGSNE---IPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           +G+ND+ +HG      + TPN+D LA  G+ L   Y QP+C+P+R+ L++G+Y IHTG+Q
Sbjct: 45  FGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLLSGRYQIHTGLQ 104

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              I  A+P  +PL E  LP+ L+E  Y+T  +GKWH+GF++   TP  RGF+S+FGYL+
Sbjct: 105 YGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGYLS 164

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G   YY H  S Q      L G D+  N + A+   G+Y+T LFT +A+ +I +    KP
Sbjct: 165 GAEDYYSHSRSFQIGSKT-LKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSKP 223

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFLYLA+ A H+      L+ P +    +  IT   RR YA MV  +D+ +G V  AL  
Sbjct: 224 LFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEGIGNVTRALVD 278

Query: 328 KGMLENSIIIF 338
            G+  N+IIIF
Sbjct: 279 AGLYNNTIIIF 289



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 1/62 (1%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LFN+  DP E+N+++ + PD+ +QL   L+Y+ +  VP  +  PD V ADP      W 
Sbjct: 436 WLFNITADPLEENDLSDTYPDVVAQLLHRLQYYYKDSVPPRYPPPD-VNADPALHGGAWG 494

Query: 628 PW 629
           PW
Sbjct: 495 PW 496



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           +  N + A+   G+Y+T LFT +A+ +I +    KPLFLYLA+ A H+      L+ P +
Sbjct: 188 LMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSKPLFLYLAYQAVHS-----PLQVPSK 242

Query: 61  TINQFQYITDPNRRTYAALT 80
               +  IT   RR YA + 
Sbjct: 243 YEEPYANITSSARRAYAGMV 262


>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
          Length = 370

 Score =  205 bits (522), Expect = 5e-50,   Method: Compositional matrix adjust.
 Identities = 131/358 (36%), Positives = 188/358 (52%), Gaps = 25/358 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ F    +I TPNID LA  G+++ N Y+  VCTPSR +LMTG+YP  TGMQ   I
Sbjct: 22  GYNDVGFRNP-DIITPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGRYPYKTGMQNFVI 80

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G  P    L  +FLP+YL+ LGY+T A+GKWHLG  R E  P  RGF+S +G L G   
Sbjct: 81  PGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGFDSFYGLLLGGGG 140

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H  +           +D   N        G ++ DL       +      ++P+FLY
Sbjct: 141 YWNHTYT-------LFGAYDWFNNKDLDLSANGTHSQDLMVDRLSAVFASHNREEPMFLY 193

Query: 272 LAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
            A    H  +     EA +  +  +   +    RR Y  +V  LD  VG ++  +    M
Sbjct: 194 FAPQNPHTPS-----EATENFLRHYPSSMYTEIRRQYLGLVSGLDWMVGELLELVIANNM 248

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
           L+N+ IIF SDNG   VE            N PYRG K +L+EGG KV   ++SP +++ 
Sbjct: 249 LDNTYIIFQSDNGGDAVE----------ALNGPYRGGKGSLFEGGSKVVNFIYSPLLKKV 298

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
              +   MHI+DW+PT+ + AGG+   L   IDG++Q S +L    S+R S +  +D+
Sbjct: 299 GYENNGWMHITDWMPTIISLAGGEVP-LVEGIDGINQMSMVLKGAESKRTSMVYNIDR 355


>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1517

 Score =  205 bits (521), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 128/361 (35%), Positives = 188/361 (52%), Gaps = 44/361 (12%)

Query: 77  AALTKSTTLTLLIVYGWNDLSF-HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           AA        L+   G++D+ + +    + TP+IDALA  G+ L+  Y+   CTP+R +L
Sbjct: 67  AASYPHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGAL 126

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG  P   G+Q   ++  +P G+P     LP++L +LGY +  +GKWHLG F  E  P 
Sbjct: 127 LTGLSPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPT 186

Query: 196 YRGFESHFGYLNGVISYYDHILS-------DQYSRTVELNGH--------------DMRR 234
            RGF+S FG L+G   Y  HI +       D   R  E+  +              D+R 
Sbjct: 187 ARGFDSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRE 246

Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQ-----PVDKPLFLYLAHLAAHAGNAGKHLEAP 289
           N     D  G Y+T LF ++A +LI+         +KPLFL L+  A HA      + AP
Sbjct: 247 NNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PVWAP 301

Query: 290 QETIN---QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
           ++T         +T+ NRR +AA ++ +DD+VG V++AL+   M  NSI+IF SDNGA  
Sbjct: 302 EDTYETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIFASDNGA-- 359

Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
                  N  + GSN P RG K  L+EGGV+VPA + +P+            H++DW+PT
Sbjct: 360 -------NPEHGGSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPT 412

Query: 407 L 407
           L
Sbjct: 413 L 413



 Score = 39.7 bits (91), Expect = 5.1,   Method: Compositional matrix adjust.
 Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 13/89 (14%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-----PVDKPLFLYLAHLAAHAGNAGKHL 55
           +R N     D  G Y+T LF ++A +LI+         +KPLFL L+  A HA      +
Sbjct: 244 LRENNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PV 298

Query: 56  EAPQETIN---QFQYITDPNRRTYAALTK 81
            AP++T         +T+ NRR +AA  +
Sbjct: 299 WAPEDTYETHPDLLNVTNGNRRKFAAALR 327


>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
 gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
          Length = 224

 Score =  205 bits (521), Expect = 7e-50,   Method: Compositional matrix adjust.
 Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 6/218 (2%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+SF G  +IPTPN+D LA  GIILNN Y   +CTPSR +LM+G YPIHTG+Q    
Sbjct: 13  GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLYPIHTGLQHYVQ 72

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             AEP G+PL    +PE+L+ LGY+T  IGKW+LG+++  YTP  RGF+S +G+LNG   
Sbjct: 73  LPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFDSFYGFLNGGED 132

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YYDH +    +     +G    RN S        Y+TDLFTK+A+ LI+D    KP+FLY
Sbjct: 133 YYDHTILFVSTGLDFWDGTTPVRNQSH------HYSTDLFTKKALALIKDHDQAKPMFLY 186

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
            +H A H+G+    LEAP   I  F YI + NR   A 
Sbjct: 187 FSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224



 Score = 65.9 bits (159), Expect = 7e-08,   Method: Compositional matrix adjust.
 Identities = 31/65 (47%), Positives = 40/65 (61%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
            Y+TDLFTK+A+ LI+D    KP+FLY +H A H+G+    LEAP   I  F YI + NR
Sbjct: 160 HYSTDLFTKKALALIKDHDQAKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNR 219

Query: 74  RTYAA 78
              A 
Sbjct: 220 SIIAG 224


>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
          Length = 316

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/291 (37%), Positives = 168/291 (57%), Gaps = 20/291 (6%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   I   +P  +P  +  LP+ L+ELGYST  +GKWHLGF+++E  P  RGF++
Sbjct: 2   IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
           +FG L G ++YY +   D       + G D+    S AW   G+Y+T L+T+   +++  
Sbjct: 62  YFGSLTGSVNYYTYDSCD----GPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILAT 117

Query: 262 Q-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
             P  +PLF++L+  A H       L+ P+E I  ++ + +  RR YAAMV  +D++V  
Sbjct: 118 HDPRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRN 172

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           +   L++ G  ENSI+IF +DNG   +         + GSN+P RG K T WEGGV+   
Sbjct: 173 ITYGLRKYGYYENSIMIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLG 223

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
            + SP +++  RVS  ++HI+DW PTL   AGG  S +   +DG D W ++
Sbjct: 224 FIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE-GVDGYDVWEAI 273



 Score = 46.6 bits (109), Expect = 0.036,   Method: Compositional matrix adjust.
 Identities = 27/96 (28%), Positives = 49/96 (51%), Gaps = 7/96 (7%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
           S AW   G+Y+T L+T+   +++    P  +PLF++L+  A H       L+ P+E I  
Sbjct: 93  SVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYP 147

Query: 65  FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFH 99
           ++ + +  RR YAA+  +    +  I YG     ++
Sbjct: 148 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYY 183


>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
          Length = 382

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 123/365 (33%), Positives = 190/365 (52%), Gaps = 27/365 (7%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ND+ F   N I +PN+DALA +GIIL N Y  P C+PSR S ++G+Y   + MQ   
Sbjct: 31  YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGV 89

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +P+ + L    LP YL+ELGY T A GKWHLG+ R E TP +RGF+S  G  +G  
Sbjct: 90  ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 149

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            +Y+H        T     +D        +D +G+++ DL      + +++     PLF+
Sbjct: 150 EFYEHT-------TATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 202

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPNRRTYAAMVKKLDDSVGTVISALQR 327
           Y+A    H+      L+   E +  +  +   TD +   ++ +V  +D  +G ++  ++ 
Sbjct: 203 YVAFHNVHS-----PLDPKPEFLALYDDLDASTDYSGFFFSGLVSGMDYIIGGIVDKIKE 257

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+ EN+ I+F SDNG    E          G N P+RG K+TLWEGG K  +   SP +
Sbjct: 258 IGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSPLL 307

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
             +   +  +MH++DWLPT+   AGG        +DG  Q   ++ N  S R + I  +D
Sbjct: 308 GASGVENNGLMHVTDWLPTIVDLAGGSVPASD-GLDGFVQTDMVINNGDSARTTMIYNID 366

Query: 448 QWSSL 452
           + S +
Sbjct: 367 RESDI 371


>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
 gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 463

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 126/351 (35%), Positives = 190/351 (54%), Gaps = 34/351 (9%)

Query: 75  TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
            +++L +     +++V     GW D+ F+G+ +IPTPN+D LA  GI+ +N Y + P C+
Sbjct: 16  AFSSLAQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEGIVFDNAYVSHPYCS 75

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAE---PRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           PSRA L+TG+Y    G      + +E     G PL+E+ +PE L+E GY T AIGKWHLG
Sbjct: 76  PSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKEHGYRTSAIGKWHLG 135

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
                  P+++GF+  FG+  G ++Y+   + D   +T+  NG  + +N          Y
Sbjct: 136 D-HPSLHPIHQGFDHWFGFAGGGMNYWG--IPDGPIKTIVRNGEPVPQNEL-------RY 185

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
            TD FT EA+  I  +  DKP F+YLA+ A HA +     +A ++ + Q ++I    R  
Sbjct: 186 LTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHAPD-----QATKQYLEQTKHIEYGGRSI 239

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           YAAMV  +D +VG + S L   GM EN+I++F+SDNG                 N PYRG
Sbjct: 240 YAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGRGEH----------ADNRPYRG 289

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
            K  L+EGG+KVP  +  P+  +  +   +M+   D  PT   AAGG   +
Sbjct: 290 HKGMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAGGKVKK 340



 Score = 42.4 bits (98), Expect = 0.75,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD FT EA+  I  +  DKP F+YLA+ A HA +     +A ++ + Q ++I    R 
Sbjct: 185 YLTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHAPD-----QATKQYLEQTKHIEYGGRS 238

Query: 75  TYAALTKST 83
            YAA+  + 
Sbjct: 239 IYAAMVNAV 247


>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
          Length = 560

 Score =  204 bits (519), Expect = 1e-49,   Method: Compositional matrix adjust.
 Identities = 110/272 (40%), Positives = 160/272 (58%), Gaps = 19/272 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 113 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 171

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGFFR+E  P  RGF++  G L G + 
Sbjct: 172 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNVD 231

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++       PLFLY
Sbjct: 232 YYTYDNCD----GPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 287

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 288 VAFQAVH-----TPLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 342

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
            NS+IIF SDNG  T          + GSN+P
Sbjct: 343 NNSVIIFSSDNGGQTF---------SGGSNWP 365



 Score = 46.6 bits (109), Expect = 0.040,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++       PLFLY+A  A H       L++P+E + ++
Sbjct: 253 TVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLYVAFQAVH-----TPLQSPREYLYRY 307

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 308 RTMGNVARRKYAAMV 322



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W 
Sbjct: 453 WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 511

Query: 628 PW 629
           PW
Sbjct: 512 PW 513


>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 635

 Score =  203 bits (517), Expect = 2e-49,   Method: Compositional matrix adjust.
 Identities = 160/531 (30%), Positives = 242/531 (45%), Gaps = 86/531 (16%)

Query: 92  GWNDLSF----HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           G+ND+ +    H +N++ TP +D LA  G+ L   Y Q  C+PSR +L+TG YP  TG+ 
Sbjct: 129 GYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALLTGLYPASTGLY 188

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              I      G+PL    +P++L    Y + AIGKW +G +   + P  RGF S+ G+  
Sbjct: 189 HGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTGRGFHSYVGFYG 247

Query: 208 GVISYYDHIL-----SDQYSRTVELNGHDMRRNLSTAWDTVGE-----YATDLFTKEAVQ 257
             I YY H +     S   S  ++   +D   +L+      G+     Y+TD+FT  AV+
Sbjct: 248 TDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNRYSTDIFTDRAVE 307

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----YITDPNRRTYAAMVKK 313
           L+  +    PLFLY+A  A H       LEA   TI++F     Y     R T+AA    
Sbjct: 308 LLRTESARNPLFLYVAFNAVHGP-----LEADDATIDRFNASFLYEARTERATFAAAALL 362

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           +D +V +++  +   G  +NSI++F SDNGA   +          GSN+P RG K T +E
Sbjct: 363 MDRAVESIVKTMHAVGAYDNSIVLFASDNGATLAQ-------TGGGSNWPLRGSKFTPYE 415

Query: 374 GGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           GGV+VPA L SP +    R      + H++D LPTL  AAGG     P +IDG++Q+ +L
Sbjct: 416 GGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGSP---PSDIDGVNQYHAL 472

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
                        G D  +S      S R  VL+++D          ++  L  G    G
Sbjct: 473 F------------GSDATAS------SPRADVLVHVD----VYGYETEAMGLATGAYIEG 510

Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQ--QATIH 549
            +      T +            +S+TY+   +      LP +  DK+   +    A  H
Sbjct: 511 DLKVVVNATEATWCA-------PDSETYRYADECWD---LPATYCDKVAENKSGYAAYFH 560

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH 600
                             YLF+L  DP EQ+++  + P    +LY   + H
Sbjct: 561 ----------------KTYLFDLSVDPEEQHDLKDAMPAKFRKLYAAFQAH 595



 Score = 43.9 bits (102), Expect = 0.30,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 9/68 (13%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----YITD 70
           Y+TD+FT  AV+L+  +    PLFLY+A  A H       LEA   TI++F     Y   
Sbjct: 296 YSTDIFTDRAVELLRTESARNPLFLYVAFNAVHGP-----LEADDATIDRFNASFLYEAR 350

Query: 71  PNRRTYAA 78
             R T+AA
Sbjct: 351 TERATFAA 358


>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 588

 Score =  203 bits (516), Expect = 3e-49,   Method: Compositional matrix adjust.
 Identities = 177/619 (28%), Positives = 272/619 (43%), Gaps = 126/619 (20%)

Query: 92  GWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-GP 149
           GW+D+ +   + +  TP +D LA  G+ + N Y    CTP+RASLMTG+Y +  GMQ   
Sbjct: 6   GWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYGMQYNV 65

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
            I   EP GVPL+E+ LPEY +E GY T  +GKWHLG    E+ P  RGF+++ GY+ G 
Sbjct: 66  AINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMGYVGGF 125

Query: 210 ISYYDH----ILSD-------------------QYSRTVELNGHDMRRNLSTAW----DT 242
            +Y+ H    ++SD                       + ++ G D R +   A     D 
Sbjct: 126 EAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEENRDDG 185

Query: 243 VGE---------------------------YATDLFTKEAVQLIEDQ-PVD-KPLFLYLA 273
            GE                           Y+T +F + A++++E + P D +P++++LA
Sbjct: 186 DGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPIYMHLA 245

Query: 274 HLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
           H A H           + +    L+  +  I   Q      RR    ++  LD SVG +I
Sbjct: 246 HQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQ--TRRHLTEVLMFLDKSVGHLI 303

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             L+ +G +ENSI++  SDNG  + E          GSNYP RG K + WEGG KVPA +
Sbjct: 304 HYLEDEGWMENSIVVVASDNGGDSNE---------GGSNYPLRGKKASYWEGGSKVPAFV 354

Query: 383 WS----PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           +S    P+ ++       +MH++DWLPTL   AG + S     +DG++QW  ++ +  + 
Sbjct: 355 YSTSHIPEARRGSEYD-GLMHVTDWLPTLAAGAGLEMSGSAGPLDGVNQWEHIVGSAAA- 412

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
               +DG    + LL N    R   L+            L  +++     + G   G + 
Sbjct: 413 ----VDGGRPRTELLYNYDPYR---LLPNSNHCYCCWCSLCHYRVY----DTGYAQGAF- 460

Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG--ANPAP 556
             RS K  ++ FN             L ++    +S +             CG   N   
Sbjct: 461 --RSGKYKMM-FNVSCHGWYTFDGDMLEEDHLTDLSKV-------------CGGTCNDGG 504

Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL----YELLKYHRRTLVPQSHEQP 612
                 T+   YLF+L  DP E NN+    PD++ ++    YE+     R       E  
Sbjct: 505 TCSDEATSD--YLFDLDQDPRESNNLIDELPDVAQEMRDRWYEVT---FREYTDTVFEPL 559

Query: 613 DLVQADPKRFNDTWS-PWI 630
           DL   D    N+ W  PW+
Sbjct: 560 DLKAYDVWAENNWWMIPWL 578


>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
          Length = 345

 Score =  202 bits (515), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 116/327 (35%), Positives = 172/327 (52%), Gaps = 34/327 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+     + I TPN+DALA  G+IL N Y Q +CTPSR +LM+G+YP  + MQ   I
Sbjct: 37  GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMSGRYPSASAMQSMVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E R   L  +FLP+YL+ELGY    +GKWHLG+ R E  P  RGF++ +G   G   
Sbjct: 96  QPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGD 155

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H +  +Y         D   N    ++  G ++ DL  +   ++ ++     PLFLY
Sbjct: 156 YWSHEIFGKY---------DWHINGDIHFEANGTHSQDLEMEGLDKVFDEHDSKDPLFLY 206

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----TDPNRRTYAAMVKKLDDSVGTVISALQ 326
            A    H          P +  ++F  +      D  RR Y  +   LD  VG ++  L+
Sbjct: 207 FAPQNPH---------TPSQPTDEFLNLYPEDKFDDIRRKYLGLTSGLDAMVGKIVEKLE 257

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
             G+++N+ +IF+SDNGA   E          G N P+RG K +L+EGG K  + ++SP 
Sbjct: 258 NNGLMDNTYVIFVSDNGADPPE----------GLNTPFRGGKGSLFEGGTKSNSFVYSPL 307

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
           + +    +  +MHI+DWLPTL   AGG
Sbjct: 308 LNKTGYENDGLMHITDWLPTLVKLAGG 334


>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
 gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
 gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
          Length = 473

 Score =  202 bits (515), Expect = 4e-49,   Method: Compositional matrix adjust.
 Identities = 125/348 (35%), Positives = 186/348 (53%), Gaps = 34/348 (9%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
             LL   G+ D+ + GS +I TPNID LA  G+ L + YA P+C+P+RA+LMTGK P++ 
Sbjct: 47  FILLDDLGYGDVGYLGS-QIQTPNIDNLASQGVTLKHGYAYPICSPTRAALMTGKNPLNF 105

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           G+ GP    A    +P     +PE  +E GY T  +GKWHLG  +R   P  RGF+  +G
Sbjct: 106 GIDGPMENDAM---LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAMPHNRGFDDFYG 162

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +L G + YY H         V   G D + N  T+    G + T+L T +A+  I     
Sbjct: 163 FLGGFVDYYTH---------VYFGGLDWQNN-DTSLREEG-FVTELLTAKAIDKITHFKG 211

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           DKP F+YL++ A H       L+     IN +      +R+ +A M   +D  +G V++A
Sbjct: 212 DKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQDIGEVLAA 266

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
           L ++G+ EN+I++FMSDNG          N     +N   +G K++++EGGV+VP ++ W
Sbjct: 267 LDKRGIAENTIVVFMSDNGG---------NLEAGANNGSLQGGKSSVYEGGVRVPVLIRW 317

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
             Q+Q   +   Q M   DW PTL  AA  D    P + DG+ Q+ +L
Sbjct: 318 PAQLQS--KTMTQPMFTQDWAPTLLEAANIDYQ--PNDFDGVSQFVNL 361


>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
          Length = 328

 Score =  202 bits (514), Expect = 5e-49,   Method: Compositional matrix adjust.
 Identities = 120/327 (36%), Positives = 174/327 (53%), Gaps = 22/327 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ F    ++ TPN+D LA  G+IL N Y Q VCTPSR +LMTG+YP  + MQ   I
Sbjct: 12  GYNDIGFRNP-DVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRYPYRSAMQNFVI 70

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              + +   L  +FLP+YL+ELGY    IGKW+LG+ R E  P  RGF+S FG L+G   
Sbjct: 71  NPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFDSFFGLLDGAGD 130

Query: 212 YYDHILSDQYSRTVE-LNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           Y++H     Y  T + L G     +  + +S     +     DL      ++  +    +
Sbjct: 131 YWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVIC--FQDLELDRLDKIFTEHDNKQ 188

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLY A    H  +        + + + F+ +    RR Y A+   LD  VG V+  L 
Sbjct: 189 PLFLYFAPQNPHTPSQPTDEFLNKYSADDFEDV----RRRYLALTSGLDSMVGKVVELLN 244

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
              M++N+ +IF+SDNGA   E          G N P+RG K +L+EGG K  + ++SP 
Sbjct: 245 DNHMMDNTYLIFLSDNGADPAE----------GLNAPFRGGKGSLFEGGTKSSSFIYSPL 294

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
           +++    +  +MHI+DWLPTL   AGG
Sbjct: 295 LKKTEYENDGLMHITDWLPTLVKLAGG 321


>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
          Length = 598

 Score =  201 bits (512), Expect = 7e-49,   Method: Compositional matrix adjust.
 Identities = 102/251 (40%), Positives = 157/251 (62%), Gaps = 10/251 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS+ I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 225 GYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 283

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E+GYST  +GKWHLGF+++   P  RGF++  G L G + 
Sbjct: 284 RPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNVD 343

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G+D+    S AW+  G+Y+T L+ + A Q++      +PLFLY
Sbjct: 344 YYTYDNCDGPG----VCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLY 399

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 400 VAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWALKRYGYY 454

Query: 332 ENSIIIFMSDN 342
           +NS+IIF +DN
Sbjct: 455 DNSVIIFSTDN 465



 Score = 50.1 bits (118), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           S AW+  G+Y+T L+ + A Q++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 365 SVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 419

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 420 RSMGNVARRKYAAMV 434



 Score = 47.0 bits (110), Expect = 0.029,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 5/70 (7%)

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPK 620
           T    +LFN+  DP E+ ++A+ RPDI   L   L Y+ RT +P  +  E P   +A P 
Sbjct: 484 TRQSVWLFNISADPYEREDLAAQRPDIVRSLLARLAYYNRTAIPVRYPAENP---RAHPD 540

Query: 621 RFNDTWSPWI 630
                W PW+
Sbjct: 541 FNGGAWGPWV 550


>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 307

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 119/324 (36%), Positives = 173/324 (53%), Gaps = 26/324 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I  PN+  LA +G+I N  Y  P CTPSRA++MTG YP  TG Q    
Sbjct: 2   GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMA 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG-YLNGVI 210
           +   P GVPL  + LPE L+E+GYST  +GKWHLGF + EY P  RGF+SH+G +  GV 
Sbjct: 61  FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 120

Query: 211 SY--YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            Y   D +LS          G+D R N+     +  +Y   +    A  ++       PL
Sbjct: 121 DYDKMDGVLSPS-------AGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPL 172

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           F     LA       KHLE P+E   ++  I D   R +   +  +DD +G V+ A++ +
Sbjct: 173 F-----LAFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNR 227

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           GM ++S+IIF+ DN   + +          GSNYP+RG+  TL+EG  +VP+I+    I+
Sbjct: 228 GMYDDSLIIFIGDNEGLSSQS---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIK 278

Query: 389 QNPRVSLQMMHISDWLPTLYTAAG 412
           +    S ++  I D   T+   AG
Sbjct: 279 KTGYESNELYSIVDIHRTILDVAG 302


>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
 gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
          Length = 446

 Score =  201 bits (511), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 143/412 (34%), Positives = 214/412 (51%), Gaps = 47/412 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +GS+ + TPN+D+LA  GI   N Y+  PVC+PSRASL+TGKYP   G+    
Sbjct: 22  GYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVG--E 79

Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           I GA+    G+P TE  L + L+  GY T   GKWHLG    E +P   GF+  FG+  G
Sbjct: 80  ILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGV-SEETSPNAHGFDEFFGFKAG 138

Query: 209 VISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE-DQPVDK 266
            I +Y HI    + +   +N  HD+  N +  W+  G Y T+L T+ +V  I+  +  + 
Sbjct: 139 CIDFYSHIF--YWGQGHGVNPLHDLWENETEVWEN-GRYMTELITERSVDFIKRSREQED 195

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           P FL++++ A H       + AP+E +++F ++   +R+  AAM+  +DD VG ++ AL+
Sbjct: 196 PFFLFVSYNAPH-----YPMHAPKEYMDRFAHLP-WDRQVMAAMIAAVDDGVGEIVKALK 249

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             G  EN++I F SDNG P+ E R     T +    G+   +RG K +L+EGG++ PAIL
Sbjct: 250 AAGCYENTVIFFSSDNG-PSSESRNWLDGTEDVYYGGTAGIFRGHKASLFEGGIREPAIL 308

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
             P   +  +V  ++  + D +PT    AG D +  PL                    + 
Sbjct: 309 SWPNGLEGGQVRDEVGVMMDIVPTFLDLAGVDPATRPL------------------EGAE 350

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
           +DG      LL+  PS    +     E +   AVR   WKLVL    NG +D
Sbjct: 351 LDGSSLKEMLLMGEPSPHKQLFW---EYQGQLAVREGDWKLVL----NGKLD 395


>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 286

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 114/299 (38%), Positives = 166/299 (55%), Gaps = 22/299 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +H + +I  PN+  LA +G+I N  Y  P CTPSRA++MTG YP  TG Q   +
Sbjct: 2   GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 60

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +   P GVPL  + LPE L+E+GY+T  +GKWHLGF + EY P  RGF+SH+G     + 
Sbjct: 61  FNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 120

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL- 270
            YD  L+   S +    G+D R N+     + G Y   +    A  ++       PLFL 
Sbjct: 121 DYDK-LNGVLSPSA---GYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLT 175

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +   + A      KHLE P+E    +  I D   R +   +  +DD +G V+ AL+ +GM
Sbjct: 176 FTLDIPA------KHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGM 229

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
            ++S+IIF+ DNGA + +          GSNYP+RG+  TL+EG   VP+I+    I++
Sbjct: 230 YDDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTLFEGATCVPSIVSGKGIKK 279


>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
          Length = 558

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 153/513 (29%), Positives = 236/513 (46%), Gaps = 77/513 (15%)

Query: 88  LIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           L + G+ D  +  S  I TPNID L  +GI   N Y+   C+PSR+S ++G+YP  +GMQ
Sbjct: 78  LALPGFQDAGYRNS-AIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSFLSGRYPYKSGMQ 136

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              I    P  + L  +FL +YL++L Y+T A+GKWHLG+  ++ TP YRGF++  G  +
Sbjct: 137 HGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTPTYRGFDTFSGGYS 196

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G   YY+H        T     +D      T     G ++ DL   + ++ + D+    P
Sbjct: 197 GEGDYYEH--------TTHGGWYDWHNGTETDLTAAGVHSQDLVEADMMKNL-DENDGTP 247

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            F Y A    H+    K    P+           P+R+ Y  +V  +D  +G V++ L+ 
Sbjct: 248 FFYYTAFHNTHSPLQPK----PEHKALYNDMDATPSRKKYLGLVSGMDAVIGNVVAKLKA 303

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           KGM +N+ I F SDNG             N G N P+RG K++L+EGG    +   SP I
Sbjct: 304 KGMFDNTYIFFSSDNGGDV----------NEGDNSPWRGAKSSLFEGGCHAHSWASSPLI 353

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
           ++    +  ++H++DWLPT+ T AGG    +P  IDG++Q   L+ N          GL 
Sbjct: 354 RKKGVEADGLLHMTDWLPTILTLAGG---TVPDGIDGIEQ-VDLITNA---------GL- 399

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYGQT 500
                     S+R +++ NID +  T        A R   +KL+ G +  G  DGY    
Sbjct: 400 ----------SKRTNMIYNIDMELETGPPNFGCMAARDLRYKLIWGFE--GRSDGY---- 443

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR-----STRQQATI--HCGAN 553
             +   + N + +V+         L +     + N+DK       S RQ      +    
Sbjct: 444 GVDADFIYNHDRVVD---------LVEEEEFIVENVDKRGGPYPLSHRQHRIFSEYVSLT 494

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR 586
              M       GP  LF+L  DP E  ++  S+
Sbjct: 495 TVTMADVQAGTGPMMLFDLQEDPNETTDLIESQ 527


>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
          Length = 426

 Score =  201 bits (510), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 142/454 (31%), Positives = 212/454 (46%), Gaps = 79/454 (17%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +GKWHLGF+R+E  P  RGF++  G L G + YY +   D       + G D+    + A
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPG----VCGFDLHEGENVA 57

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           W   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + +++ +
Sbjct: 58  WGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T            G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN+P RG K T WEGGV+    + SP +++  R S  +MHI+DW PTL   AGG TS   
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
             +DG D W ++     S R   +  +D      L   ++  S+     I      AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           +  WKL+ G       D  YG    + +P             Q+L     + +    N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
           +M S RQ                       +LFN+  DP E+ ++A  RPD+   L   L
Sbjct: 311 RMASVRQA---------------------VWLFNISADPYEREDLAGQRPDVVRTLLARL 349

Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
             + RT +P  +  E P   +A P      W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 55  NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 110 RTMGNVARRKYAAMV 124


>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
          Length = 426

 Score =  200 bits (509), Expect = 1e-48,   Method: Compositional matrix adjust.
 Identities = 142/454 (31%), Positives = 212/454 (46%), Gaps = 79/454 (17%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +GKWHLGF+R+E  P  RGF++  G L G + YY +   D       + G D+    + A
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCD----GPGVCGFDLHEGENVA 57

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           W   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + +++ +
Sbjct: 58  WGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T            G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN+P RG K T WEGGV+    + SP +++  R S  +MHI+DW PTL   AGG TS   
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
             +DG D W ++     S R   +  +D      L   ++  S+     I      AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           +  WKL+ G       D  YG    + +P             Q+L     + +    N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
           +M S RQ                       +LFN+  DP E+ ++A  RPD+   L   L
Sbjct: 311 RMASVRQAV---------------------WLFNISADPYEREDLAGQRPDVVRTLLARL 349

Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
             + RT +P  +  E P   +A P      W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380



 Score = 48.1 bits (113), Expect = 0.014,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 55  NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 110 RTMGNVARRKYAAMV 124


>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
          Length = 484

 Score =  199 bits (507), Expect = 3e-48,   Method: Compositional matrix adjust.
 Identities = 104/251 (41%), Positives = 152/251 (60%), Gaps = 10/251 (3%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+ +HGS +I TP +D LA  G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q   I
Sbjct: 60  GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL +  LP+ L+E GYST  +GKWHLGFFR+E  P  RGF++  G L G + 
Sbjct: 119 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNVD 178

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           YY +   D       + G D+    + AW   G+Y+T L+ + A  ++       PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 234

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           +A  A H       L++P+E + +++ + +  RR YAAMV  +D++V  +  AL+R G  
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 289

Query: 332 ENSIIIFMSDN 342
            NS+IIF SDN
Sbjct: 290 NNSVIIFSSDN 300



 Score = 46.6 bits (109), Expect = 0.043,   Method: Compositional matrix adjust.
 Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 15/116 (12%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T L+ + A  ++       PLFLY+A  A H       L++P+E + ++
Sbjct: 200 TVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 254

Query: 66  QYITDPNRRTYAALTKSTTLTL------LIVYGWNDLS---FHGSNEIPTPNIDAL 112
           + + +  RR YAA+       +      L  YG+ + S   F   NEI   NID L
Sbjct: 255 RTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNEI-LHNIDPL 309



 Score = 40.4 bits (93), Expect = 3.0,   Method: Compositional matrix adjust.
 Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 1/62 (1%)

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
           +LFN+  DP E+ ++A  RPD+   L   L  + RT +P  +   +  +A P      W 
Sbjct: 377 WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 435

Query: 628 PW 629
           PW
Sbjct: 436 PW 437


>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
          Length = 330

 Score =  199 bits (506), Expect = 4e-48,   Method: Compositional matrix adjust.
 Identities = 115/327 (35%), Positives = 168/327 (51%), Gaps = 25/327 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+NDL     + I TPN+DALA  G+IL N Y Q VC+PSR +LMTG+YP  + MQ   I
Sbjct: 14  GYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 72

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E +   L  +FLP+YL++LGY    IGKWHLG+ R E  P  RGF++ +G       
Sbjct: 73  QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 132

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y++H +   Y    E       R       +   +  DL  +   ++ ++     PLFLY
Sbjct: 133 YWEHGIMGMYDWHTEAGVDFEARGTHAQVGSRYWHIYDLEMERLDKVFDEHDNKDPLFLY 192

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----RRTYAAMVKKLDDSVGTVISALQ 326
            A   +H          P +  ++F  +   +     RR Y  +   LD  VG ++  L 
Sbjct: 193 FAPQNSH---------TPHQPTDEFINLYSEDKFSDIRRKYLGLTSGLDAMVGKIVDKLT 243

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
              M+ N+ +IF+SDNGA  +E          G N P+RG K +L+EGG K  + ++SP 
Sbjct: 244 ENDMMNNTYVIFVSDNGADPIE----------GRNAPFRGGKGSLFEGGTKSSSFIYSPL 293

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
           +++    +  +MHI+DWLPTL   AGG
Sbjct: 294 LKKTEYENDGLMHITDWLPTLVKLAGG 320


>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 781

 Score =  199 bits (505), Expect = 5e-48,   Method: Compositional matrix adjust.
 Identities = 165/565 (29%), Positives = 258/565 (45%), Gaps = 118/565 (20%)

Query: 87  LLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
           L+   G+ D+ +  ++  EI TPN+DALA  G+ L+N Y   +CTP+RAS+MTG+YP+  
Sbjct: 223 LIDDMGFGDMGYQSTDLSEI-TPNLDALAAGGVKLSNYYTMTLCTPARASIMTGRYPVRY 281

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           GMQ   I    P G+P +E+ LPEY+ E GY +  +GKWHLG +R E  P  RGF++  G
Sbjct: 282 GMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPSQRGFKTFLG 341

Query: 205 YLNGVISYYDH-----ILSDQY-----------SRTVELNGHDMR-----RNLSTAWDTV 243
           YLNG+ +YY H      +  QY              V L  HD        +    W  V
Sbjct: 342 YLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCTDGGPRWGDV 401

Query: 244 ------------GEYATDLFTKEAVQLIEDQ-PVDK-PLFLYLAHLAAHA--GNAGKHLE 287
                       G Y+TD F   A Q+++ + P D+ PLF+Y+AH + H+  G A     
Sbjct: 402 MENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHSPTGPAPYEEF 461

Query: 288 APQE--TINQFQYITDPNRRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
           +P+E   +   +  T+  +RT +A ++  LD  +G  +  L+++G LENS+I+  SDNGA
Sbjct: 462 SPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLEQEGWLENSVIVVSSDNGA 521

Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV-----PAILWSPQIQ----QNPRVSL 395
                       + GSNYP RG K +++EGG +      PA+L   Q       +PR   
Sbjct: 522 CP---------DDGGSNYPLRGSKQSVFEGGSRTGYQPSPALLGKRQPAGLNTSSPRYLC 572

Query: 396 QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN-------IDGLDQ 448
              ++ D       +AG         +DG+DQW  L  N      S         + L  
Sbjct: 573 NPCNLPD-------SAGA--------LDGVDQWECLKSNGVRGAGSGNTSKPPRTELLYN 617

Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
           W S LL++        +  D +    A R   WKL++    N    GY+           
Sbjct: 618 WDSYLLSSADG-----LKEDLEFSQGAFRSGDWKLLV----NVFCSGYFSHDL------- 661

Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
              A++ES        L  ++   + N++  +    +  ++ G  P              
Sbjct: 662 ---AVIESDDL-----LDSDVACGVGNME-CQDCCDRCQLYSGDYPVSDR---------- 702

Query: 569 LFNLGNDPCEQNNIASSRPDISSQL 593
           L+N+  DP E++++    P+ + +L
Sbjct: 703 LYNVREDPREEHDLYDQYPEFAKEL 727


>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
 gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
          Length = 439

 Score =  199 bits (505), Expect = 6e-48,   Method: Compositional matrix adjust.
 Identities = 142/420 (33%), Positives = 213/420 (50%), Gaps = 49/420 (11%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
            +IVY    G+ DL  +GS+ + TP++D LA  G+   N Y+  PVC+PSRASL+TGKYP
Sbjct: 6   FVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGKYP 65

Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           +  G+    I GA+    G+P  E  L + L+  GY T   GKWHLG   +E +P   GF
Sbjct: 66  VRAGVG--EILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGL-SKETSPNAHGF 122

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
           +  FG+  G + +Y HI        V    HD+  N +  W+  G Y T+L T+ +V  I
Sbjct: 123 DEFFGFKAGCVDFYSHIFYWGQGHGVNPL-HDLWENETEVWEN-GRYMTELITERSVDFI 180

Query: 260 E-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +  +  + P FL+ ++ A H       + APQE +++F ++   +R+  AAM+  +DD V
Sbjct: 181 KRSREQEAPFFLFASYNAPH-----YPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDGV 234

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEG 374
           G ++ AL+  G  E+++I F SDNG P+ E R     T +    GS   +RG K +L+EG
Sbjct: 235 GEIVEALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDIYYGGSAGIFRGHKASLFEG 293

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G++ PAIL  P   +  +V  ++  + D +PT    AG D +  PL              
Sbjct: 294 GIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAGVDLAAGPL-------------- 339

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
               +   +DG      L    PS    +     E +   AVR  +WKLVL    NG +D
Sbjct: 340 ----QGITLDGSSLKEMLFRGKPSPHKQLFW---EYQGQLAVREGNWKLVL----NGKLD 388


>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 614

 Score =  198 bits (503), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 136/427 (31%), Positives = 224/427 (52%), Gaps = 50/427 (11%)

Query: 83  TTLTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
               +L   G+ND+ +  S+  E+ TP +D L  +G+ ++ +Y Q VCTPSRA+++TGK 
Sbjct: 65  VVFVMLDDVGFNDVGYASSDLGEM-TPFLDGLMADGVRVDRLYGQQVCTPSRAAMLTGKL 123

Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
           PIH  +Q   +  +EP G+P  E  L +YL+ LGYST  +GKWHLG +    TPL RGF+
Sbjct: 124 PIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNASTPLNRGFD 183

Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           S +G+ +G + Y  H  S  Y      +  D  R ++   D  G++ T L  + A+ ++E
Sbjct: 184 SFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVT---DAHGQHQTSLMNERAIAVLE 240

Query: 261 DQPVDK---PLFLYLAHLAAHAGNAGKHLEAPQETINQ----FQYITDPNRRTYAAMVKK 313
              V+K   P+F Y+++  AH       L+ P E + +       I + +R+ +AA++  
Sbjct: 241 RHAVEKKSEPVFAYVSYPNAHL-----PLQPPTELLERRNATLLDIPNHDRKNFAALMMN 295

Query: 314 LDDSVGTVIS-ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            D+S+  + +  L+  G+ + ++I+  SDNG          +    GSNYP RG K  L+
Sbjct: 296 ADESLANLTNRGLKATGLYDTAVIVIASDNGGQV-------SAMGGGSNYPLRGEKKYLF 348

Query: 373 EGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           EGGV+  A++ SP + +  R S   ++MH+SDWLPTL   A G ++         D+ + 
Sbjct: 349 EGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGALGASA---------DELAG 399

Query: 431 LLLNTPSRRN-SNIDGLDQWSSLLLNTPSR----RNSVLINIDEKKRT--------AAVR 477
           L L      +   IDG+D W+ ++    +     R+ +L+NID    +        AA+ 
Sbjct: 400 LKLPRGVVNDYGAIDGVDHWAHVVGAVDAAAAAPRDELLLNIDYLDGSGDYLGYFRAALI 459

Query: 478 LDSWKLV 484
           +  WK +
Sbjct: 460 IGDWKFI 466


>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 494

 Score =  198 bits (503), Expect = 8e-48,   Method: Compositional matrix adjust.
 Identities = 143/440 (32%), Positives = 215/440 (48%), Gaps = 63/440 (14%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L    G++D+ F+GS +I TP +D LA  G I  + Y A P C PSRA+L+TGKYP  
Sbjct: 46  LVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGKYPHT 105

Query: 144 TGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
            G Q   P    +  +G+   E+F+ + L+E GY T AIGKWHLG    E+ P  RGF  
Sbjct: 106 IGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGA-TEEFHPNQRGFTD 164

Query: 202 HFGYLNGVISYY---------------DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
            +G+L G  +Y+                 I+ D Y   +E NG +++           EY
Sbjct: 165 FYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRD-YILPLEHNGKEVKET---------EY 214

Query: 247 ATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
            TD F++EA + +++     KP FLYLA+ A H       LEA +E + +F  I D +RR
Sbjct: 215 LTDAFSREASRFVKEASNKKKPFFLYLAYNAPHVP-----LEAKEEDLEKFSVIKDKDRR 269

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
           TYAAMV  +D  +  +++ L+    LEN++IIF+SDNG          N  +  +N P  
Sbjct: 270 TYAAMVYAVDRGIKQIVNTLKETNQLENTLIIFLSDNGG---------NTDHGATNNPLH 320

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
           G K   WEGG +VP  ++ P+       +   +   D  PTL   A    ++LP N  + 
Sbjct: 321 GRKGDTWEGGYRVPMFMYCPKNIAKGHTNNFPITSLDLYPTL---AHIGKAKLPDNKVLH 377

Query: 424 GLDQWSSLLLNTPSRRNSNIDGL-------------DQWSSLLLNT-PSRRNSVLINIDE 469
           G + W S+L N    +   +  +             D+W +L ++  P +   +  +I E
Sbjct: 378 GKNVWESILNNKNIHKKETLFAMRHREGYTDVAARKDEWKALKVSKQPWKLFKITEDIGE 437

Query: 470 KKRTAAVRLDSWKLVLGTQE 489
           KK  +A   D  K ++   E
Sbjct: 438 KKDLSATHPDILKGLVNDAE 457



 Score = 53.9 bits (128), Expect = 3e-04,   Method: Compositional matrix adjust.
 Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 6/67 (8%)

Query: 14  EYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           EY TD F++EA + +++     KP FLYLA+ A H       LEA +E + +F  I D +
Sbjct: 213 EYLTDAFSREASRFVKEASNKKKPFFLYLAYNAPHVP-----LEAKEEDLEKFSVIKDKD 267

Query: 73  RRTYAAL 79
           RRTYAA+
Sbjct: 268 RRTYAAM 274


>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
          Length = 614

 Score =  198 bits (503), Expect = 9e-48,   Method: Compositional matrix adjust.
 Identities = 166/557 (29%), Positives = 249/557 (44%), Gaps = 94/557 (16%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
           LT++  +G  D+S++ +  + TP +  ++ +G +L+N YA P CTPSRA LMTG+Y I  
Sbjct: 48  LTVVDDWGSGDVSYNDA-ALHTPELQRMSEHGFVLDNFYAAPTCTPSRAMLMTGRYNIRN 106

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           GMQ   I   EPRGVPL ERFL + L + GY T AIGKWHLG  R  YTPL RGF+  +G
Sbjct: 107 GMQDSVIHSTEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMHRDAYTPLKRGFDLFYG 166

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
            L G  S+  H     +S+  +  G D +          G +     + +A         
Sbjct: 167 ILTGGGSHTGH-----FSKGDDKKGDDKKDAKKDDAKKGGAHRRLAASGKA--------- 212

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
             P F+YL++ A H              +++   I+  NR T   M+ ++DD + T+  +
Sbjct: 213 --PWFVYLSYQAVH----DPVTVGDDRYVSETAAISTTNRPTLCGMIAEIDDGLKTLRLS 266

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAILW 383
           L+       +I + +SDNG            R  GS NYP RG K   +EGGVKVPA+L 
Sbjct: 267 LENAQAWAQTIFVVLSDNG----------GVRAHGSANYPLRGEKAQYYEGGVKVPAVLA 316

Query: 384 SPQIQQNPR--------VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
              +++  R         S  + HI+D   T  + AG +       +DG+  +S L+   
Sbjct: 317 GGYVEEALRRAGGAERLRSASLAHITDVHATFLSLAGYEGGDDDKPLDGVSLFSHLV--- 373

Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTM 493
                       Q +S +      R+ VLINI+      + A+R+   KL++  + N + 
Sbjct: 374 ------------QSASGVAGAAPPRSDVLININSALFAGSGAIRVGDHKLMVNPEPNES- 420

Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
              Y + R     L      V + T+QS+                       A  H  A 
Sbjct: 421 -SIYSKVRK---ALAAQKGAVPTDTFQSI----------------------VAQAHKDAL 454

Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE-QP 612
            AP            LF+L  +P E+ + A+S P  S  LY++  Y  +   P +   + 
Sbjct: 455 GAP---------SISLFDLKKNPYERTDCATSDPAESCNLYDVAAYADQEAQPSTFAWED 505

Query: 613 DLVQADPKRFNDTWSPW 629
           D   ADP  F   W+PW
Sbjct: 506 DGPLADPALFGSVWAPW 522


>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 536

 Score =  197 bits (502), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 127/364 (34%), Positives = 186/364 (51%), Gaps = 31/364 (8%)

Query: 92  GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           G+ D+S++G    +N + TP +D LA +GI L   Y+Q  CTP+RA+L+TG+YP +TGMQ
Sbjct: 44  GYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALLTGRYPSNTGMQ 103

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              +       +P     LP  L E GY   AIGKW +G  R   TP  RGF+SH GY  
Sbjct: 104 HEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTARGFDSHLGYYG 162

Query: 208 GVISYYDHILSDQYSRTVELNG--HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             I+Y +H      S     NG   DM  + +T   T   Y+T LF   A+ L++ +  +
Sbjct: 163 AEITYDEHAALRSCS-----NGTIRDMNHDGATLAATEDRYSTHLFADHAMALVDREADE 217

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----TDPNRRTYAAMVKKLDDSVGTV 321
             LFLYL   A H     + L A    + +F       +D  R T+AA+  +LD +V   
Sbjct: 218 YKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALELDRAVERF 272

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +   + +G  ++++  FMSDNGA   +          GSN+P RG K T +EGGV+VPA 
Sbjct: 273 VEFTKARGAYDDAVFFFMSDNGATLAQ-------SGGGSNWPLRGSKFTAYEGGVRVPAF 325

Query: 382 LWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
           L S ++    R +    + H+ D LPTL  A      R+P  +DG+DQ  ++L    + R
Sbjct: 326 LHSARVPPARRGAAHAGLFHVVDVLPTL-AAFAESADRVPAGVDGVDQRDAVLHGAAAPR 384

Query: 440 NSNI 443
              +
Sbjct: 385 TEAL 388


>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
 gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
          Length = 443

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 141/421 (33%), Positives = 214/421 (50%), Gaps = 51/421 (12%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
            +++Y    G+ DL  +GS+ + TP++D LA  GI   N Y+  PVC+PSRASL+TGKYP
Sbjct: 10  FIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYP 69

Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
              G+    I GA+    G+P  E  L + L+  GY T   GKWHLG    E +P   GF
Sbjct: 70  ARAGVG--EILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGL-SEETSPNAHGF 126

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +  FG+  G + +Y HI    + +   +N  HD+  N +  W+  G Y T+L T+ +V  
Sbjct: 127 DEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWEN-GRYMTELITERSVDF 183

Query: 259 IE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           I+  +  + P FL+ ++ A H       + APQE +++F ++   +R+  AAM+  +DD 
Sbjct: 184 IQRSREQEAPFFLFASYNAPH-----YPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 237

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWE 373
           VG ++ AL+  G  E+++I F SDNG P+ E R     T +    GS   +RG K +L+E
Sbjct: 238 VGEIVKALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 296

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG++ PAIL  P   +  +V  +++ + D  PT    AG D +  PL             
Sbjct: 297 GGIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAGVDPAAGPL------------- 343

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
                +   +DG      L +  PS    +     E +   AVR   WKLVL    NG +
Sbjct: 344 -----QGVALDGSSLKEMLQMRMPSPHKQLFW---EYQGQLAVREGDWKLVL----NGKL 391

Query: 494 D 494
           D
Sbjct: 392 D 392


>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 453

 Score =  197 bits (501), Expect = 1e-47,   Method: Compositional matrix adjust.
 Identities = 125/343 (36%), Positives = 184/343 (53%), Gaps = 39/343 (11%)

Query: 76  YAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPS 131
           +AA   S    +LIV    G+ DL F G  +IPTP ID LA +G+  ++ Y   P C+P+
Sbjct: 25  WAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVTCSSGYVSGPYCSPT 84

Query: 132 RASLMTGKYPIHTGMQGPPIWGA---EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
           RA L+TG+Y    G +  P   A   E  G+ L ER LP+ L++ GY+T  +GKWHLGF 
Sbjct: 85  RAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAGYATGIVGKWHLGF- 143

Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-MRRNLSTAWDTVGEYA 247
             ++ PL RGF+  FG+L G   Y+            + N +D +RR      ++  EY 
Sbjct: 144 APQFQPLERGFDEFFGFLGGAHPYFP-----------DANSNDPIRRGREAVVES--EYL 190

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           TD F +E+V  I D+  ++P FLYLA  A H         A  E + +F+ I D  RRTY
Sbjct: 191 TDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFKSIADERRRTY 244

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           A M+  +DD+VG V+  L  + + + ++++F+SDNG P           N  +N P  G 
Sbjct: 245 AGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPPA---------NGSTNTPLNGQ 295

Query: 368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYT 409
           K T WEGGV+VP ++ W  Q+    +    ++ + D LPT+ T
Sbjct: 296 KATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQL-DLLPTILT 337



 Score = 45.8 bits (107), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 27/66 (40%), Positives = 36/66 (54%), Gaps = 6/66 (9%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY TD F +E+V  I D+  ++P FLYLA  A H         A  E + +F+ I D  R
Sbjct: 188 EYLTDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFKSIADERR 241

Query: 74  RTYAAL 79
           RTYA +
Sbjct: 242 RTYAGM 247


>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
 gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
 gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
 gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
          Length = 458

 Score =  197 bits (501), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 127/342 (37%), Positives = 189/342 (55%), Gaps = 35/342 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP ++ALA  G++L+  Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTGRYPNRFGIRTTVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  L + L   GYS +A IGKWHLG  R+ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y+DH+      R  EL+ H+   +  T +D    Y+T+L T+EAV+ I     + P  L
Sbjct: 156 DYFDHM------REGELDWHN---DWETCYDK--GYSTELITQEAVRCINTYEKEGPFLL 204

Query: 271 YLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVISA 324
           Y+A+ A H       L+A ++ I    + F  +T  +  R TY AMV  +D  +GT++ A
Sbjct: 205 YVAYNAPHT-----PLQAQEKDIELYCDDFGSLTPKEQKRVTYQAMVSCMDRGIGTIVDA 259

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L++KG+++N+ +IF SDNG   V            S+   RG K   W+GGV+VPA+ + 
Sbjct: 260 LKKKGIMDNTFLIFFSDNGPAGVP---------GSSSGKLRGRKFDEWDGGVRVPAVFYW 310

Query: 385 PQIQQNPR-VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            + + N + +S Q+    D +PTL     GD +R     DG+
Sbjct: 311 KRAESNYKNLSSQVTGFVDIVPTLKELV-GDKNRPERAYDGI 351


>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 500

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 123/369 (33%), Positives = 183/369 (49%), Gaps = 43/369 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND+ F+GS +I TPN+D LA NG+  +  Y A P C PSRA++MTG+YP   G Q   
Sbjct: 51  GYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNL 110

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  GV   E F+ + ++  GY T A+GKWHLG    EY P   GF+  +G+L G  
Sbjct: 111 PEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGE-ASEYHPNKHGFDEFYGFLGGGH 169

Query: 211 SYYDHILSDQYSRTV--------------ELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           +Y+       Y++ V              E NG ++R           EY TD  ++EAV
Sbjct: 170 NYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRET---------EYITDGLSREAV 220

Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
             ++     K P FLYLA+ A H       L+A +E +  F  I D  RRTYA MV  +D
Sbjct: 221 NFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKKRRTYAGMVYAVD 275

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
             VG ++  L++ G  +N++I+F SDNG    +           +NYP +  K ++ EGG
Sbjct: 276 RGVGRIVEQLKKNGQFDNTVIVFTSDNGGKLGQ---------GANNYPLKEGKGSVQEGG 326

Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
            + P ++ W   ++   R S  ++ + D  PT +   GG        +DG D W+ +  N
Sbjct: 327 FRTPMLVHWPKHMKAGSRFSHPVLAL-DLYPT-FAGLGGAVLPEDKKLDGKDIWADIQAN 384

Query: 435 TPSRRNSNI 443
           T   ++  I
Sbjct: 385 TAPHKDEFI 393



 Score = 48.5 bits (114), Expect = 0.011,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 6/68 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           EY TD  ++EAV  ++     K P FLYLA+ A H       L+A +E +  F  I D  
Sbjct: 209 EYITDGLSREAVNFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKK 263

Query: 73  RRTYAALT 80
           RRTYA + 
Sbjct: 264 RRTYAGMV 271


>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 484

 Score =  197 bits (500), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 193/368 (52%), Gaps = 28/368 (7%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L    G+ND+ F+GS +I TPN+D LA +G I  + Y A P C PSRA+L+TG+YP  
Sbjct: 35  LILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTIFTSAYVAHPFCGPSRAALLTGRYPHT 94

Query: 144 TGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
            G Q   P    +  +G+ + E+F+   +++ GY T AIGKWHLG    EY P  RGF  
Sbjct: 95  LGSQFNLPANGASTGKGISVEEKFMGVPMQKAGYYTGAIGKWHLGE-TAEYHPNKRGFND 153

Query: 202 HFGYLNGVISYYDHILSDQYSRTVEL-----NGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
            +G+L G   Y+      QY    E+     N + +    + A     +Y TD+ ++E +
Sbjct: 154 FYGFLGGGHKYFPEEYKLQYKHQKEMGTKNINDYVLPLEHNGAIVEENDYLTDVLSREGI 213

Query: 257 QLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +   E     KP FLYLA+ A H       LEA ++ + +F+ I D +RRTYAAMV  +D
Sbjct: 214 RFTKEAHDKKKPFFLYLAYNAPHVP-----LEAKEKDLEKFKDIEDIDRRTYAAMVYAVD 268

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
             VG ++++L++ G  +N++IIF+SDNG  T          +  +NYP  G K   WEGG
Sbjct: 269 RGVGEIVASLKKTGQFDNTLIIFLSDNGGHT---------GHGANNYPLTGRKGDTWEGG 319

Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-L 433
            +VP    W  +I++  +    +  + D  PT+            + +DG + W +++ L
Sbjct: 320 FRVPMFFHWPKKIKKGQKFDYPVSAL-DLYPTIAHIGQAKIGSDKI-LDGKNIWEAIISL 377

Query: 434 NTPSRRNS 441
             P  + +
Sbjct: 378 TNPHDKET 385



 Score = 47.8 bits (112), Expect = 0.018,   Method: Compositional matrix adjust.
 Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 6/68 (8%)

Query: 14  EYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           +Y TD+ ++E ++   E     KP FLYLA+ A H       LEA ++ + +F+ I D +
Sbjct: 202 DYLTDVLSREGIRFTKEAHDKKKPFFLYLAYNAPHVP-----LEAKEKDLEKFKDIEDID 256

Query: 73  RRTYAALT 80
           RRTYAA+ 
Sbjct: 257 RRTYAAMV 264


>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
 gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 511

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 143/448 (31%), Positives = 220/448 (49%), Gaps = 57/448 (12%)

Query: 78  ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSR 132
            L + T   +L+V     G+ D+ F+GS +I TP +D LA NG I  + Y A P C PSR
Sbjct: 41  GLAQETRPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSR 100

Query: 133 ASLMTGKYPIHTG-----MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           ++++TG+YP  TG              +  GVP+ E ++ + L+  GY T AIGKWHLG 
Sbjct: 101 SAILTGRYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGA 160

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR-------RNLSTAW 240
             + + P  RGF+  +G+L G   Y+       Y    +    ++R        N   A 
Sbjct: 161 APK-FHPNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPAN 219

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           +T  EY TD F++EA++ I+     K P F+YLA+ A H       L+A  E + +F +I
Sbjct: 220 ET--EYITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHVP-----LQAKAEDVAKFAHI 272

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            D +RRTYAAMV  +D  VG ++  L+     +N++I+F+SDNG          N+ +  
Sbjct: 273 KDKDRRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG---------NFNHGA 323

Query: 360 SNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
           +NYP +G K   WEGG +VP    W  +I+++ R    +  + D  PT     G   ++L
Sbjct: 324 NNYPLKGTKGDTWEGGYRVPMFFHWPKKIKKDQRFDFPVSSL-DLYPTF---TGLAEAKL 379

Query: 419 PLN--IDGLDQWSSLLLNTPSRRNSNIDGL--------------DQWSSLLLNTPSRRNS 462
           P    +DG +    +L NT   ++  I  L              D   + + N P R ++
Sbjct: 380 PKGKQLDGKNIMDDVLKNTEPYKDEMIYSLRYREGYNDVGARMGDWKITRMGNEPWRLHN 439

Query: 463 VLINIDEKKRTAAVRLDSWK-LVLGTQE 489
           +  +I EKK  A    D  K ++  TQE
Sbjct: 440 ITQDIGEKKNLAGRYPDRLKEMIAKTQE 467



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 24/111 (21%)

Query: 14  EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           EY TD F++EA++ I+     K P F+YLA+ A H       L+A  E + +F +I D +
Sbjct: 222 EYITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHVP-----LQAKAEDVAKFAHIKDKD 276

Query: 73  RRTYAAL-----------------TKSTTLTLLIVYGWNDLSF-HGSNEIP 105
           RRTYAA+                 TK    TL++    N  +F HG+N  P
Sbjct: 277 RRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGGNFNHGANNYP 327


>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
           [Strongylocentrotus purpuratus]
          Length = 401

 Score =  196 bits (499), Expect = 2e-47,   Method: Compositional matrix adjust.
 Identities = 149/484 (30%), Positives = 220/484 (45%), Gaps = 103/484 (21%)

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
           + + LR+ GY+T A+GKWH+G ++    P  RGF+S FG L G   +Y H     +   V
Sbjct: 1   MADKLRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLV 60

Query: 226 E---LNGHDMRRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN 281
           +    +G+D+R NL   A D  G Y+T LFT+++  +I      KPLFLY +  A H   
Sbjct: 61  DSKNWSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHYP- 119

Query: 282 AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSD 341
               LE P   +  F YI D  RRTYA MVK +D++VG +   L++ G+  N+IIIF SD
Sbjct: 120 ----LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSD 175

Query: 342 NGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMH 399
           NGA         N+   GSN+P RG+K +LWEGG+K    + SP + +  R  V+  + H
Sbjct: 176 NGA---------NFNYGGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFH 226

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
           ++DW PTL       + +   ++DG + W  LL    +R      G D W          
Sbjct: 227 VTDWFPTLVRGVARGSLK-GTHLDGHNLWKHLLNKKKTR------GKDSWP--------- 270

Query: 460 RNSVLINID-------EKKR------TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
           R  +L NID       E  R       AA+R ++WKL+                      
Sbjct: 271 RKEILHNIDPYSPIFSELPRWFSINFPAAIRYENWKLI---------------------- 308

Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
                            Q+    ++P            + TIH    P            
Sbjct: 309 ---------------TSQIGNRSWVPPPE-------SGENTIHSTDTPEKR--------- 337

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
            +LF++  DP E N+++  R DI+  L   L  + R  VP  +  PD ++A+P   N ++
Sbjct: 338 VWLFDISEDPKEMNDLSLERTDITYFLLRRLGSYLRGSVPIRYPGPD-IRANPSLHNGSF 396

Query: 627 SPWI 630
           S WI
Sbjct: 397 SDWI 400



 Score = 54.7 bits (130), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 33/82 (40%), Positives = 42/82 (51%), Gaps = 6/82 (7%)

Query: 1   MRRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +R NL   A D  G Y+T LFT+++  +I      KPLFLY +  A H       LE P 
Sbjct: 70  LRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHYP-----LEVPP 124

Query: 60  ETINQFQYITDPNRRTYAALTK 81
             +  F YI D  RRTYA + K
Sbjct: 125 RYMEDFNYIADERRRTYAGMVK 146


>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
          Length = 426

 Score =  196 bits (499), Expect = 3e-47,   Method: Compositional matrix adjust.
 Identities = 140/454 (30%), Positives = 210/454 (46%), Gaps = 79/454 (17%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +GKWHLGF+R+E  P  RGF++  G   G + YY +   D       + G D+    + A
Sbjct: 2   VGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCD----GPGVCGFDLHEGENVA 57

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           W   G+Y+T  + + A  ++      +PLFLY+A  A H       L++P+E + +++ +
Sbjct: 58  WGLSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            +  RR YAAMV  +D++V  +  AL+R G   NS+IIF SDNG  T            G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN+P RG K T WEGGV+    + SP +++  R S  +MHI+DW PTL   AGG TS   
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
             +DG D W ++     S R   +  +D      L   ++  S+     I      AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           +  WKL+ G       D  YG    + +P             Q+L     + +    N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
           +M S RQ                       +LFN+  DP E+ ++A  RPD+   L   L
Sbjct: 311 RMASVRQA---------------------VWLFNISADPYEREDLAGQRPDVVRTLLARL 349

Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
             + RT +P  +  E P   +A P      W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW   G+Y+T  + + A  ++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 55  NVAWGLSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 110 RTMGNVARRKYAAMV 124


>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
          Length = 467

 Score =  196 bits (497), Expect = 4e-47,   Method: Compositional matrix adjust.
 Identities = 130/357 (36%), Positives = 182/357 (50%), Gaps = 39/357 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ +HGS  IPTPNIDALA  GI +N  Y  PV +P+RA LMTG+YP   G++   I
Sbjct: 43  GWGDVGYHGSV-IPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRYPSRFGIRKTVI 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E+ L + L   GY+ +AI GKWHLG  RR Y PL RGF   +G LNG +
Sbjct: 102 PPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGFTHFYGCLNGAL 161

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+ H       R  EL+ H+        W++  +  Y+TDL   EAV+ I     + P 
Sbjct: 162 DYFTH------EREGELDWHN-------DWESCRDEGYSTDLIADEAVRCIGGYASEGPF 208

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI------NQFQYIT--DPNRRTYAAMVKKLDDSVGT 320
           FLY+A  A H        +AP++ I       +F  +T  + +  TY AMV ++D  VG 
Sbjct: 209 FLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVTRMDKGVGR 263

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           ++  L+R G++EN++++FMSDNG          N     S  P RG K   W+GG++VPA
Sbjct: 264 ILEKLERSGLVENTLVLFMSDNGG-------IPNLPGGSSCAPLRGHKGREWDGGLRVPA 316

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           +L           S Q+    D  PT+    G      P   DG++   +L   T S
Sbjct: 317 VLCRKGRFAEGVRSEQLAAFVDMAPTIAALVG--AGEPPRPYDGVNLLPALTGETAS 371


>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
 gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
          Length = 452

 Score =  196 bits (497), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 131/408 (32%), Positives = 202/408 (49%), Gaps = 61/408 (14%)

Query: 93  WNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           W D + H      TPN+  LA+  N  +L+N Y   +CTP+R++ MTG YP   G Q   
Sbjct: 12  WKDSTLH------TPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGV 65

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
               EP GVP    FL E +R+L YST  +GKWHLG+ ++E+ P  RGF+  +G+     
Sbjct: 66  FLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQT 125

Query: 211 SYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQP 263
            Y++H  +DQY R ++  + G D+   + +      +   G Y+TDLFT  A+ ++++  
Sbjct: 126 GYFNHS-ADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 184

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAA--MVKKLDDSVG 319
             KP F++L++ A H       L+  Q+  TI Q +  T   R    +  M+  +D ++G
Sbjct: 185 NSKPFFMFLSYQAVHPP-----LQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIG 239

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            ++  L+   + EN++I+F SDNG        T+N+    SN P RG K+T+WEGG K  
Sbjct: 240 RLVEYLKASNLYENTVIVFTSDNGG-------TANF--GASNAPLRGEKDTIWEGGTKTT 290

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
             + SP   +       M H+ DW  T+ +  G       L ID                
Sbjct: 291 TFVHSPMYIEEGGTRDMMFHVVDWHATILSITG-------LEID---------------- 327

Query: 440 NSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            S  DG++QW  L    P  RR   + NID     +A+R   +KL++G
Sbjct: 328 -SYGDGINQWEYLKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 372


>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
 gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
          Length = 488

 Score =  195 bits (496), Expect = 5e-47,   Method: Compositional matrix adjust.
 Identities = 131/408 (32%), Positives = 202/408 (49%), Gaps = 61/408 (14%)

Query: 93  WNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           W D + H      TPN+  LA+  N  +L+N Y   +CTP+R++ MTG YP   G Q   
Sbjct: 48  WKDSTLH------TPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGV 101

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
               EP GVP    FL E +R+L YST  +GKWHLG+ ++E+ P  RGF+  +G+     
Sbjct: 102 FLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQT 161

Query: 211 SYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQP 263
            Y++H  +DQY R ++  + G D+   + +      +   G Y+TDLFT  A+ ++++  
Sbjct: 162 GYFNHS-ADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAA--MVKKLDDSVG 319
             KP F++L++ A H       L+  Q+  TI Q +  T   R    +  M+  +D ++G
Sbjct: 221 NSKPFFMFLSYQAVHPP-----LQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIG 275

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            ++  L+   + EN++I+F SDNG        T+N+    SN P RG K+T+WEGG K  
Sbjct: 276 RLVEYLKASNLYENTVIVFTSDNGG-------TANF--GASNAPLRGEKDTIWEGGTKTT 326

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
             + SP   +       M H+ DW  T+ +  G       L ID                
Sbjct: 327 TFVHSPMYIEEGGTRDMMFHVVDWHATILSITG-------LEID---------------- 363

Query: 440 NSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            S  DG++QW  L    P  RR   + NID     +A+R   +KL++G
Sbjct: 364 -SYGDGINQWEYLKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 408


>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 658

 Score =  195 bits (495), Expect = 6e-47,   Method: Compositional matrix adjust.
 Identities = 137/416 (32%), Positives = 205/416 (49%), Gaps = 56/416 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +HGS  I TPNID LA  G+ L N Y    C PSR +L+TG+Y I TG+     
Sbjct: 215 GWHDVGYHGSI-IDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRYRIRTGL----- 268

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL-GFFRREYTPLYRGFESHFGYLNGVI 210
           +G     + + E  L + L E GY T  +GKWHL GF  RE  P +RGF++  G+  G  
Sbjct: 269 YGDVCDFMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGFQTFLGFHGGSQ 328

Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           +Y+ H      S       +D   N  S   +  G Y+T ++ +EA ++I     ++PLF
Sbjct: 329 NYFTHRRGGSNSE------YDFWANDTSIGREYDGRYSTMVYAEEAQRIIRHHRTEQPLF 382

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           LYL+  A H+      L  P    ++++  I D  RR YAAM   +D+++G V   L+  
Sbjct: 383 LYLSFQAVHSP-----LLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAIGNVTDTLKET 437

Query: 329 GMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           G+   ++ IF SDNG  PT            GSN+P RG K   +EGG++    + S  +
Sbjct: 438 GLWNKTVFIFSSDNGGTPTFG----------GSNWPLRGRKGLWYEGGIRAVGFVNSYLL 487

Query: 388 QQNPRVSL--QMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNID 444
             +   S   +++HI+DW PTL   A  D  RL PL  DG+DQW ++     + R     
Sbjct: 488 SNHVTGSTNHELIHITDWFPTLLRLAKTDLHRLKPL--DGIDQWETISQGKKTNR----- 540

Query: 445 GLDQWSSLLLN----------TPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
                  +LLN            +R      +  +    AA+R+  WKL+ G +++
Sbjct: 541 -----KEVLLNFLPGNISGNINENREGEATADYFDTMPFAAIRVGDWKLLTGNKDS 591



 Score = 42.0 bits (97), Expect = 1.1,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 6/69 (8%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDP 71
           G Y+T ++ +EA ++I     ++PLFLYL+  A H+      L  P    ++++  I D 
Sbjct: 357 GRYSTMVYAEEAQRIIRHHRTEQPLFLYLSFQAVHSP-----LLVPSAYEDKYRTGIEDD 411

Query: 72  NRRTYAALT 80
            RR YAA+T
Sbjct: 412 KRRVYAAMT 420


>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
          Length = 252

 Score =  194 bits (494), Expect = 8e-47,   Method: Compositional matrix adjust.
 Identities = 107/268 (39%), Positives = 154/268 (57%), Gaps = 22/268 (8%)

Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH--- 230
           GY+T  +GKWHLG +R+E  P  RGF+++FGYL G   YY H   +  +    LNG    
Sbjct: 2   GYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCA 58

Query: 231 -DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
            D+R     A +    Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P
Sbjct: 59  LDLRDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVP 113

Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
           +E +  + +I D +RR YA MV  +D++VG V  AL+  G+  N++ IF +DNG  T   
Sbjct: 114 EEYMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT--- 170

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                 R+ G+N+P RG K TLWEGG++    + SP ++Q    S ++MHISDWLPTL  
Sbjct: 171 ------RSGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVD 224

Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
            AGG T+     +DG + W ++    PS
Sbjct: 225 LAGGSTNGTK-PLDGFNMWKTISEGHPS 251



 Score = 55.8 bits (133), Expect = 7e-05,   Method: Compositional matrix adjust.
 Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FTK A  +I + P +KPLFLYLA  + H       L+ P+E +  + +I D +RR
Sbjct: 75  YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 129

Query: 75  TYAALT 80
            YA + 
Sbjct: 130 IYAGMV 135


>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
 gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
          Length = 452

 Score =  194 bits (494), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 140/421 (33%), Positives = 213/421 (50%), Gaps = 51/421 (12%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
            +++Y    G+ DL  +GS+ + TP++D LA  GI   N Y+  PVC+PSRASL+TGKYP
Sbjct: 19  FIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYP 78

Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
              G+    I GA+    G+P  E  L + L+  GY T   GKWHLG    E +P   GF
Sbjct: 79  ARAGVG--EILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGL-SEETSPNAHGF 135

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           +  FG+  G + +Y HI    + +   +N  HD+  N +  W+  G Y T+L T+ +V  
Sbjct: 136 DEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWEN-GRYMTELITERSVDF 192

Query: 259 IE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           I+  +  + P FL+ ++ A H       + APQ+ +++F ++   +R+  AAM+  +DD 
Sbjct: 193 IQRSREQEAPFFLFASYNAPH-----YPMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDDG 246

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWE 373
           VG ++ AL+  G  E+++I F SDNG P+ E R     T +    GS   +RG K +L+E
Sbjct: 247 VGKIVKALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 305

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG++ PAIL  P   +  +V  ++  + D  PT    AG D +  PL             
Sbjct: 306 GGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAAGPL------------- 352

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
                +   +DG      L +  PS    +     E +   AVR   WKLVL    NG +
Sbjct: 353 -----QGVALDGSSLKEMLQMREPSPHQQLFW---EYQGQLAVREGDWKLVL----NGKL 400

Query: 494 D 494
           D
Sbjct: 401 D 401


>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
 gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
          Length = 458

 Score =  194 bits (493), Expect = 1e-46,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    N F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTNNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score =  194 bits (492), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 122/331 (36%), Positives = 178/331 (53%), Gaps = 38/331 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ +HG  EIPTPNID +A  G+  +  Y+   +C P+RA+LM+G Y    G +G  
Sbjct: 31  GYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEG-- 88

Query: 151 IWGAEPR------GVPLTERFLPEYLRELGYSTKAIGKWHLG---FFRREYTPLYRGFES 201
           I G          G+P   + L +Y +E GY+T   GKWHLG    F +   P  RGF+ 
Sbjct: 89  ICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDE 148

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            FG L G   Y D         TV      +R++  T  D  GEY TD   +EAV  I  
Sbjct: 149 FFGILEGASLYDD---------TVNRERKYIRQD--TVIDYEGEYFTDAIGREAVSFITR 197

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           +  DKP FLYL   A HA      ++A ++ + +F +I DPNRR +AAM+  +DD++G V
Sbjct: 198 KG-DKPFFLYLPFTAVHA-----PMQASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRV 251

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
             AL+ +G+L+N++I+F SDNG       +  N  N+  N+P +G K   +EGG++VPA 
Sbjct: 252 FDALEHQGILDNTLIVFWSDNGG------KPDN--NYSLNHPLKGQKTQFYEGGIRVPAC 303

Query: 382 LWSPQIQQNPRVSL-QMMHISDWLPTLYTAA 411
           +  P+ Q     +L Q + + D  P+   AA
Sbjct: 304 VRWPKGQIPAGKTLDQPVFLMDIFPSALEAA 334



 Score = 57.0 bits (136), Expect = 3e-05,   Method: Compositional matrix adjust.
 Identities = 38/116 (32%), Positives = 55/116 (47%), Gaps = 24/116 (20%)

Query: 7   TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
           T  D  GEY TD   +EAV  I  +  DKP FLYL   A HA      ++A ++ + +F 
Sbjct: 174 TVIDYEGEYFTDAIGREAVSFITRKG-DKPFFLYLPFTAVHA-----PMQASEKYMQRFA 227

Query: 67  YITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           +I DPNRR +AA+  +    +  V+                  DAL + GI+ N +
Sbjct: 228 HIADPNRRVFAAMLSAMDDNIGRVF------------------DALEHQGILDNTL 265


>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 472

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 123/332 (37%), Positives = 177/332 (53%), Gaps = 18/332 (5%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F G+ +I TPN+D LA NG      YA    C PSRA+L++G+Y    G +  P
Sbjct: 37  GYADTGFTGATDIQTPNLDNLAKNGAFFKQGYANHAYCGPSRAALLSGRYQHRFGFETNP 96

Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            +  A P  G+ + E+  P+ L+E GY T AIGKWHLG    E+ PL RGF+  +G+L G
Sbjct: 97  AYDPANPHMGIDVGEKLFPKRLQEAGYKTGAIGKWHLGA-AAEFHPLNRGFDYFYGFLGG 155

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
               +D+   D   +  E     + RN     D    Y T   + +A Q ++D   + P 
Sbjct: 156 G---HDYFRIDGTKKVWEAYLQPLVRNKRA--DNFEGYLTTALSNDAAQFVKDNK-ENPF 209

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FLY+A+ A H       L+AP+E I ++ +I D  RR YAAMV  +D  +GTVI +L+  
Sbjct: 210 FLYVAYNAPHM-----PLQAPKEDIARYAHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDS 264

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
           G+ EN++I F+SDNG P  +   T  + N   N P+RG K  L++GGV VP    W  +I
Sbjct: 265 GIYENTLIFFLSDNGGPQSKGGSTKGW-NGSDNQPFRGGKGNLYDGGVHVPFFASWPNKI 323

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           +    V+  +  + D   T    AGGD +  P
Sbjct: 324 KPGTVVNYPVNSL-DIGRTAVALAGGDATAAP 354



 Score = 45.8 bits (107), Expect = 0.070,   Method: Compositional matrix adjust.
 Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 6/71 (8%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           D    Y T   + +A Q ++D   + P FLY+A+ A H       L+AP+E I ++ +I 
Sbjct: 183 DNFEGYLTTALSNDAAQFVKDNK-ENPFFLYVAYNAPHM-----PLQAPKEDIARYAHIK 236

Query: 70  DPNRRTYAALT 80
           D  RR YAA+ 
Sbjct: 237 DNKRRVYAAMV 247


>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 328

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 184/365 (50%), Gaps = 41/365 (11%)

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           +FLY +H A H G+    LEAP   +  F YI   NR  +A  V +LD SVG V+ AL +
Sbjct: 1   MFLYFSHQAVHCGDYKVKLEAPASAVAHFPYIKSQNRSIHAGAVYELDKSVGLVVEALNK 60

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           +GML NSI++F +DNG         S Y     N+P RG K TLWEGGV+  A +WSP +
Sbjct: 61  RGMLSNSIVVFSTDNGGLPWGIDPNSGY-----NWPLRGAKATLWEGGVRGAAFVWSPLL 115

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
            ++ R+S Q+MHI+DWLPTLY+AAGG+ S L                       + DG D
Sbjct: 116 FKSGRLSNQLMHITDWLPTLYSAAGGNLSTL----------------------GDTDGKD 153

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
            W +L  +  S R  +LINID    T+A+ +   K+VLGT + G  D      +  K P 
Sbjct: 154 MWEALSEDLESPRQEILINIDPTGNTSALIVGRRKVVLGTFQGGQHD------QRMKAP- 206

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR---STRQQATIHCGANPAPMTPSPCTN 564
              +  V+      L   +  +     N+ ++R   + R +  + C    AP   +    
Sbjct: 207 -GGSRTVDGLDQMMLSSRTGKVLKDFYNVRQLRVRPNWRNETVVRCDQY-APRD-NFVAA 263

Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
            P Y F+L +DPCE NN+A+S      +L + ++ + + +VP  ++  D  +  P+  + 
Sbjct: 264 SPPYYFDLEHDPCELNNLAASNVTDLKELIKKIEEYGKGMVPPGNKPLD-PRGLPQNNHG 322

Query: 625 TWSPW 629
            W PW
Sbjct: 323 LWGPW 327


>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 474

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 127/340 (37%), Positives = 179/340 (52%), Gaps = 35/340 (10%)

Query: 85  LTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
           + LL+    G+ DL F G+ +IPTP++DALA  G+   + Y   P C+P+RA L+TG+Y 
Sbjct: 39  VVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTGRYQ 98

Query: 142 IHTGMQGPPIWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
              G +  P  G         G+P+TE  L + L+  GY+T  +GKWHLG    ++ P  
Sbjct: 99  QRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGS-EAKFHPQK 157

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           RGF+  FG+L G  +Y+     D Y R  E+   +              Y TD F++EA+
Sbjct: 158 RGFDEFFGFLGGQHTYFASKSGDVY-RGTEVVKEEA-------------YLTDAFSREAL 203

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
             I D+  D P FL L+  A H       ++A ++ + +F  I DP RRTYAAM+  LD+
Sbjct: 204 SFI-DRHKDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSALDE 257

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           + G VI  L+  G+ E ++IIF SDNG PT+    T    N   N P RG K T  EGGV
Sbjct: 258 AAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTT----NGSKNDPLRGSKRTTLEGGV 313

Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
           +VP +L W   +    +V  Q +   D LPT   AAG D 
Sbjct: 314 RVPFVLSWKGTLPAG-KVYDQPIIQLDILPTALAAAGVDA 352



 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 6/65 (9%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD F++EA+  I D+  D P FL L+  A H       ++A ++ + +F  I DP RR
Sbjct: 193 YLTDAFSREALSFI-DRHKDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRR 246

Query: 75  TYAAL 79
           TYAA+
Sbjct: 247 TYAAM 251


>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
          Length = 581

 Score =  193 bits (491), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 155/573 (27%), Positives = 247/573 (43%), Gaps = 98/573 (17%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
            L K   +  ++V   G++DL +  + ++ +PNIDALA + + L   Y QP CTPSRA+ 
Sbjct: 19  VLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNIDALAKDALHLKKHYVQPSCTPSRAAF 77

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG+Y I  GMQ   I   EP G+PL E  L E  ++ GY T   GKWHLGF+  ++ P 
Sbjct: 78  LTGRYNIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQ 137

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
            RGF+  +G+  G   +Y H       +    NG      +     T G Y+T LF  + 
Sbjct: 138 IRGFDRFYGFYLGSQDFYFH--DSGRLKAYPGNGDVENDTILDDLHTNGTYSTKLFVDDF 195

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY-------A 308
           +  +       PLF Y++    H          P +T+N+F+   +    T+       +
Sbjct: 196 INDLAKHDPAVPLFNYVSFQDVH---------GPLQTVNKFKKHYENKTDTWTHERILIS 246

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             +  LD  +G +++AL+ K    N++++F SDNG    E           SN+P RG K
Sbjct: 247 TKITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNGGQPKE---------GASNWPLRGSK 297

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQ----MMHISDWLPTLYTAAGGDTSRLPLN-ID 423
            T+++GG+K  A + SP +Q   R+  Q    + H+SDW PTL T +G D  R     +D
Sbjct: 298 GTIYDGGLKSRAFIASPLLQN--RMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLD 355

Query: 424 GLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
           G  +   L +  P + RN  +  LD  S      P     +          A+ R   WK
Sbjct: 356 G--KIHELFVYKPKKARNKMLHYLDPLSQFDYGDPRHFEVLDDRAFNTTIKASYRQGRWK 413

Query: 483 LVLGTQENGTMDGYYGQTR-----SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           ++ G +  G   GY    R      NK+ L++                     +P     
Sbjct: 414 IITG-KPCGNGCGYIKNERPWNYAKNKMKLID---------------------MPKEK-- 449

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-EL 596
                                     N   +L+++  D  E+ + ++ +P + ++L  +L
Sbjct: 450 --------------------------NKNVWLYDMHKDKSEKFDRSAQKPWVVNRLLTKL 483

Query: 597 LKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
              +   ++PQ  E  D   A+PK     W PW
Sbjct: 484 ADLYDEQVMPQRREASD--AANPKHHGGVWMPW 514


>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1818

 Score =  193 bits (490), Expect = 2e-46,   Method: Compositional matrix adjust.
 Identities = 133/395 (33%), Positives = 197/395 (49%), Gaps = 55/395 (13%)

Query: 92  GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           G+ D+ ++G    +N + TP IDALA  G+ L+  Y QP CTPSRA+L++GKYP  TG  
Sbjct: 55  GFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAALLSGKYPATTGTY 114

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--- 204
              +      G+PL    LPE L    Y + A+GKW +G    +  P  RGF+S  G   
Sbjct: 115 HGVLNPQSTWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPEARGFDSFLGFYL 173

Query: 205 -YLNGVISYYDHILSD-------------------QYSRTVELNGHDMRRNLSTAWDTVG 244
            +   +I Y  H + D                   + S   + +G   +R      D  G
Sbjct: 174 CFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLGGQRR-----DYDG 228

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----T 300
            Y TD+F   AV LIE +  D PLFLY+A  A H       LEA +  +  F+      +
Sbjct: 229 MYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAFEATFLPES 283

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
           D  R T+AA+   +D ++G V+ ALQ  G  EN++++F SDNGA              GS
Sbjct: 284 DSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGA-------IPGQMGGGS 336

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRL 418
           N+P RG K + WEGGV+VPA + SP +  + R ++   + H++DWLPT+  AAG      
Sbjct: 337 NWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTIAAAAGAAVD-- 394

Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL 453
             + DG+  + +L  + P+ R+  +  +D +   L
Sbjct: 395 --SGDGVSHYRALFADGPAPRSELLVHVDVYGVEL 427



 Score =  185 bits (469), Expect = 7e-44,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 187/373 (50%), Gaps = 42/373 (11%)

Query: 92  GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
           G++D+ ++     +N++ TP +D+LA  G+ L   Y QP CTPSRA+L++G YP  +GM 
Sbjct: 630 GFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPSRAALLSGMYPASSGMY 689

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              I      G+ L    +P+ L    Y + A+GKW +G +   + P +RGF S  G+ +
Sbjct: 690 HKMITAQSNWGLDLDLELIPQRL-PAAYRSHAVGKWDVGHYTWSHVPQFRGFRSFLGFYS 748

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV---------------GEYATDLFT 252
            +I YY H   D      E+   +    L++   +                G Y+TD+F 
Sbjct: 749 PIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNFDGDPLPLADGTYSTDVFA 808

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPN--RRTYA 308
             A  LI  +    PLFLY+A  A HA      LEA ++ I  F   ++ +    R ++A
Sbjct: 809 ARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEADEDVIESFASSFLEEARDARASFA 863

Query: 309 AMVKKLDDSVGTVISALQR-KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           A+   +D ++  V+  L+   G  ENS+I+F SDNG               GSN+P RG 
Sbjct: 864 AVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGG-------LPGQIAGGSNWPLRGQ 916

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           K + +EGGV+VPA + SP +    R      + H++DWLPTL   AGG     P N+DG+
Sbjct: 917 KFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGAP---PRNVDGV 973

Query: 426 DQWSSLLLNTPSR 438
           DQ+ ++    P+ 
Sbjct: 974 DQYEAIFGEHPAE 986



 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 9/75 (12%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI- 68
           D  G Y TD+F   AV LIE +  D PLFLY+A  A H       LEA +  +  F+   
Sbjct: 225 DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAFEATF 279

Query: 69  ---TDPNRRTYAALT 80
              +D  R T+AA+ 
Sbjct: 280 LPESDSKRATFAAVA 294


>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
          Length = 534

 Score =  193 bits (490), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 143/423 (33%), Positives = 206/423 (48%), Gaps = 67/423 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +H S  I TPNID LA  G+ L N Y    C PSR +LMTG+Y I TG+     
Sbjct: 80  GWHDVGYHDS-VIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRYRIRTGL----- 133

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL-GFFRREYTPLYRGFESHFGYLNGVI 210
           +G     + + E  L + L E GY T  +GKWHL GF   E  P +RGF++  GY  G  
Sbjct: 134 YGDVCDFMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGFQTFLGYHGGSQ 193

Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           +Y+ H     ++       +D   N  S A    G+Y+T +F  EA ++I      +PLF
Sbjct: 194 NYFTHRRGGPHAP------YDFWANDTSIAVKYEGQYSTMIFADEAQRIIRQHNTKQPLF 247

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           LYL+  A H       L  P    +Q++  I D  RR YAAM   +D+++G V + L+ K
Sbjct: 248 LYLSFQAVHVP-----LLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIGNVTATLKEK 302

Query: 329 GMLENSIIIFMS-----------DNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           G+  N++ IF S           DNGA PT          + GSN+P RG K   +EGG+
Sbjct: 303 GLWNNTVFIFSSARLNFKHDFCLDNGATPT----------DGGSNWPLRGRKGLWYEGGI 352

Query: 377 KVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLL 433
           +    + S  +  +     + +++H++DW PTL   A  D   L PL  DG+DQW ++  
Sbjct: 353 RAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLAKADLHGLKPL--DGIDQWETISR 410

Query: 434 NTPSRRNSNIDGLDQWSSLLLN-TPSRRNSVLINIDEKKRTA---------AVRLDSWKL 483
              + R            +LLN  P   N V+    E + TA         A+R+  WKL
Sbjct: 411 GNVTNR----------KEVLLNFIPGDTNGVVNENHEGEATADYFDTMPFSAIRVGDWKL 460

Query: 484 VLG 486
           + G
Sbjct: 461 LTG 463



 Score = 42.0 bits (97), Expect = 0.88,   Method: Compositional matrix adjust.
 Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 13/94 (13%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
           G+Y+T +F  EA ++I      +PLFLYL+  A H       L  P    +Q++  I D 
Sbjct: 222 GQYSTMIFADEAQRIIRQHNTKQPLFLYLSFQAVHVP-----LLVPPSYEDQYRSLIEDD 276

Query: 72  NRRTYAALTK-------STTLTLLIVYGWNDLSF 98
            RR YAA+         + T TL     WN+  F
Sbjct: 277 KRRVYAAMASCMDEAIGNVTATLKEKGLWNNTVF 310


>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
 gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
          Length = 458

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 183/344 (53%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYCDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG+  V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGSAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 CWKKAEKQYKNLSSQVTGFVDLVPTLKELV-GDHSRPKREYDGI 351


>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
 gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
          Length = 458

 Score =  192 bits (489), Expect = 3e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 458

 Score =  192 bits (489), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
 gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
           IMCC14465]
          Length = 496

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 121/323 (37%), Positives = 172/323 (53%), Gaps = 29/323 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ DL F GS +I TPN+D LA  GI+LN  Y+ P+CTP+R++LMT + PI  G     +
Sbjct: 34  GYADLGFRGS-DIQTPNLDRLAAEGIVLNRFYSLPICTPTRSALMTARDPIKLGTAYAGL 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              E  GV   E F+PE  ++ GY T  IGKWH+G       P +RGF+  FG+LN  + 
Sbjct: 93  QPWENGGVSPDEHFMPESFKKAGYQTAMIGKWHIGRQYESLVPHHRGFDHFFGHLNTQVD 152

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ-LIEDQPVDKPLFL 270
           YY H             GHD++ N  +       YATD+   E+V+ L E +   KP  L
Sbjct: 153 YYTH---------ASAGGHDLQENGKSLKRDA--YATDIHGDESVRYLKEIRDPSKPFLL 201

Query: 271 YLAHLAAHA-GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           Y+  LA H+   A + LEA  ++   F       +RTYAAMV  +D ++G ++  L R+G
Sbjct: 202 YVPFLAPHSPMQAPEALEAKYKSRLNFPV----PKRTYAAMVDSMDQAIGRIMDELDRQG 257

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           + +N+II+F SDNG           +  +G  N PYRG K   +EGG++V AI+  P   
Sbjct: 258 VSDNTIILFFSDNGG----------FAGFGGVNKPYRGGKLETFEGGIRVNAIMRWPGAI 307

Query: 389 QNPRVSLQMMHISDWLPTLYTAA 411
                + Q + + D  PTL  AA
Sbjct: 308 PAGSKTAQAISVMDVFPTLAAAA 330


>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
          Length = 545

 Score =  192 bits (488), Expect = 4e-46,   Method: Compositional matrix adjust.
 Identities = 161/571 (28%), Positives = 248/571 (43%), Gaps = 109/571 (19%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT------ 144
           +G+++       E  TPN+D LA +G++L+  Y+   CTPSR+S ++G+ P+H       
Sbjct: 51  WGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRLPLHVFHSNRV 110

Query: 145 -----GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
                  Q P   G    G+P     +P ++R+ GY T  +GKW  G    +++PL RGF
Sbjct: 111 SSASWDSQHPDTAGV---GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIATPQHSPLGRGF 167

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWDTVGEYATDL 250
           +S   Y N   +YY +  +D  S    +            D+  +   A   +G Y   +
Sbjct: 168 DSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPADAPIGTYEEHV 227

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           F   A+ +I       PLFLY A   AHA      L+ PQ  +++FQ+I DP RRTY AM
Sbjct: 228 FRDHALDVISKHDASTPLFLYYASHIAHAP-----LQVPQAYLDRFQHIPDPIRRTYQAM 282

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG--SNYPYRGVK 368
           V   DD VG + +AL+ KGM EN++++  SDNG    +  +  +    G  SN+P RG K
Sbjct: 283 VLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGGASNWPLRGGK 342

Query: 369 NTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            + ++GG++V A +    +  + R  V   ++ ++DW+ T    AG              
Sbjct: 343 LSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAG-------------- 388

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWS--SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
               +    P  + +N+  +D  S  SLL   N+ S R    I  D+   +  +    WK
Sbjct: 389 ----VACEDPIAKKANLPAIDSQSLWSLLSGANSTSPRVEYPITFDQ---SPLIVRPPWK 441

Query: 483 LVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
           L++G          YGQ    K P                                    
Sbjct: 442 LIVGDNLQ------YGQWTGPKYP------------------------------------ 459

Query: 543 RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY--- 599
              AT   G  PA      CT G C LFN+  DP E +++A+  PD+ + L + L Y   
Sbjct: 460 --NATTTGGGVPAGAV--NCTAG-C-LFNIEQDPTEHHDVAAQHPDVVADLKQRLAYWQS 513

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
           H  T  P     P   +A   R+N  + P+I
Sbjct: 514 HTLTWKP-GWPIPQACEAYIHRYNGYYGPFI 543



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 37/69 (53%), Gaps = 5/69 (7%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
           +G Y   +F   A+ +I       PLFLY A   AHA      L+ PQ  +++FQ+I DP
Sbjct: 220 IGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHAP-----LQVPQAYLDRFQHIPDP 274

Query: 72  NRRTYAALT 80
            RRTY A+ 
Sbjct: 275 IRRTYQAMV 283


>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
 gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
           3_8_47FAA]
          Length = 458

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 193/348 (55%), Gaps = 32/348 (9%)

Query: 77  AALTKSTTLTLLIV-----YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMY-AQPVCT 129
           A+L+ S     +IV      G+ D+S+HG+  E  TP+ID++A +G    N Y A PVC 
Sbjct: 14  ASLSASAAKPNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYSAAPVCG 73

Query: 130 PSRASLMTGKYPIHTGMQ---GPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHL 185
           PSRA L++G+Y    G     GP     +   G+PL+++ +PE L + GY+T  +GKWH 
Sbjct: 74  PSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVGKWHD 133

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G  + ++ P  RGF+  +G+ NG I+ +  +L  + + TV+  G   R N     +  GE
Sbjct: 134 GD-QHKFWPYNRGFQEFYGFNNGAINNW--VLKGE-NHTVDEWGAVHRENKRV--ENSGE 187

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
           Y T+ F +EAV+ I D+   +P FLYL+  A H       L+AP+   NQF++I   NR 
Sbjct: 188 YMTEAFGREAVEFI-DRHKTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIKPENRA 241

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
              AM+K +DD++G V+  L+++G+ EN+II F SDNG             N+  N  YR
Sbjct: 242 LCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLK--------GNYSFNGKYR 293

Query: 366 GVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           G KNT+++GG+ VP A+ W  QI    +     +H  D   T++ AAG
Sbjct: 294 GEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAG 341



 Score = 50.4 bits (119), Expect = 0.003,   Method: Compositional matrix adjust.
 Identities = 30/81 (37%), Positives = 44/81 (54%), Gaps = 6/81 (7%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           +  GEY T+ F +EAV+ I D+   +P FLYL+  A H       L+AP+   NQF++I 
Sbjct: 183 ENSGEYMTEAFGREAVEFI-DRHKTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIK 236

Query: 70  DPNRRTYAALTKSTTLTLLIV 90
             NR    A+ KS    + +V
Sbjct: 237 PENRALCLAMLKSMDDNIGLV 257


>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
 gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
          Length = 458

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
 gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
           CL02T12C04]
          Length = 458

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 223

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 97/229 (42%), Positives = 143/229 (62%), Gaps = 7/229 (3%)

Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           MQ   I+ A   GV L E+FLP+YL+ +GY T AIGKWHLGFF +EYTP YRGF+S +GY
Sbjct: 1   MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             G   Y+DH L+      ++L+ +D   +    ++  G Y+T +++ EA+  I +    
Sbjct: 61  YGGQADYWDHSLASNGWWGLDLH-YDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNST 119

Query: 266 KPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           +P+FLYLA+ A H+ N  ++ L+APQE +++F +I    R+ YAAM+  +D  +G + S 
Sbjct: 120 QPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSE 179

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           L ++ ML++SIIIF SDNG P           NW +NYP RG K T++E
Sbjct: 180 LLKQNMLDDSIIIFTSDNGGPANGLN-----NNWATNYPLRGTKTTVYE 223



 Score = 58.5 bits (140), Expect = 1e-05,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 49/78 (62%), Gaps = 4/78 (5%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQET 61
           +N+   W   G Y+T +++ EA+  I +    +P+FLYLA+ A H+ N  ++ L+APQE 
Sbjct: 91  KNIFNQW---GNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQEW 147

Query: 62  INQFQYITDPNRRTYAAL 79
           +++F +I    R+ YAA+
Sbjct: 148 VDKFSHIKHKGRQNYAAM 165


>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
          Length = 602

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 161/595 (27%), Positives = 255/595 (42%), Gaps = 92/595 (15%)

Query: 54  HLEAPQETINQFQYITDPNRRTYA--ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNI 109
           HLE  Q  ++ F +       T++   L K   +  ++V   G++DL +  + ++ +PNI
Sbjct: 14  HLEKEQGAMSSFFFYLFLFCSTFSQKVLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNI 72

Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
           DALA + + L   Y QP CTPSRA+ +TG+Y I  GMQ   I   EP G+PL E  L E 
Sbjct: 73  DALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVIRAPEPEGIPLRETLLSEA 132

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
            ++ GY T   GKWHLGF+  ++ P  RGF+  +G+  G   +Y H            NG
Sbjct: 133 FKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLGSQDFYFH--DSGRLEAYPGNG 190

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
                 +   + T G Y+T LF  + +  +       PLF Y++    H          P
Sbjct: 191 DVENDTILDDFHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVSFQDVH---------GP 241

Query: 290 QETINQFQYITDPNRRTY-------AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
            +T+N+F+   +    T+       +  +  LD  +G +++AL+ K    N++++F SDN
Sbjct: 242 LQTVNKFKKHYENKTDTWTHERILISTKITTLDHHIGRMVTALKEKNYWNNTVLVFTSDN 301

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ----MM 398
           G    E           SN+P RG K T+++GG+K  A + SP +Q   R+  Q    + 
Sbjct: 302 GGQPNE---------GASNWPLRGSKGTIYDGGLKSRAFVASPLLQN--RMKGQNYNYLF 350

Query: 399 HISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNT 456
           H+SDW PTL T +G D  R     +DG  +   L +  P + RN  +  LD  S    + 
Sbjct: 351 HVSDWFPTLLTLSGCDVPRKEGRPLDG--KIHELFVYKPKKARNEMLHYLDPLSQFDYDD 408

Query: 457 PSRRNSVLINIDEKKRTAAVRLDSWKLVLGT-QENGTMDGYYGQTRSNKVPLLNFNAIVE 515
           P     +          A+ R   WK++ G    NG   GY  + R       N   I  
Sbjct: 409 PRHFEVLDDRAFNTTIKASYRQGRWKIITGKPCANGC--GYIKKERPWNYAKNNMKLIDM 466

Query: 516 SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGND 575
            K                                              N   +L+++  D
Sbjct: 467 PKE--------------------------------------------QNKNVWLYDMHKD 482

Query: 576 PCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
             E+ + ++ +P + ++L  +L   +   ++PQ  E  D   A+PK     W PW
Sbjct: 483 KSEKFDRSAQKPWVVNRLLTKLADLYDEQVMPQRREASD--AANPKHHGGVWMPW 535


>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 478

 Score =  192 bits (488), Expect = 5e-46,   Method: Compositional matrix adjust.
 Identities = 125/332 (37%), Positives = 177/332 (53%), Gaps = 28/332 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ-GP 149
           G+ D  F G+ +I TPN+D LA +G+I N  Y     C PSRA  + G+Y    G +   
Sbjct: 33  GYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAGRYQHRFGFEHNT 92

Query: 150 PIWGAEP-RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P   A P  G+ + E   P  L+++GY+T  IGKWHLG     + PL RGF+  +G+L G
Sbjct: 93  PYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGA-SSPFYPLNRGFDYFYGFLTG 151

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              Y++ I   Q  ++  L G  + RN   A      Y T   +++AVQ + D   + P 
Sbjct: 152 GHDYFE-IDVTQPVKSAYLQG--LFRNKRVA--NFEGYLTTALSRDAVQFVNDNK-ENPF 205

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL+L++ A H     + L+APQE I ++ +I D  RR YAAMV  +D  +G V++AL R 
Sbjct: 206 FLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRRVYAAMVDVMDRGIGEVVTALDRN 260

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
           G+ EN++I F+SDNG P V  ++     N  SN P+RG K   +EGGV VP I  W  +I
Sbjct: 261 GLRENTLIFFLSDNGGP-VSSKKHPGKGNGSSNGPFRGGKTDFYEGGVHVPFIASWPAKI 319

Query: 388 Q-----QNPRVSLQMMHISDWLPTLYTAAGGD 414
           Q       P +SL +        T    AGGD
Sbjct: 320 QPGQVYDKPVISLDIGR------TAVELAGGD 345



 Score = 48.5 bits (114), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 6/66 (9%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T   +++AVQ + D   + P FL+L++ A H     + L+APQE I ++ +I D  RR
Sbjct: 184 YLTTALSRDAVQFVNDNK-ENPFFLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRR 237

Query: 75  TYAALT 80
            YAA+ 
Sbjct: 238 VYAAMV 243


>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
 gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
          Length = 333

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 111/330 (33%), Positives = 172/330 (52%), Gaps = 29/330 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++D+     + I TPN+DALA  G+IL N Y Q +CTPSR +LMTG+YP  + MQ   I
Sbjct: 14  GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTGRYPSASAMQTSVI 72

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                +   L  + LP+YL++LGY    +GKWHLG+ R E  P  RGF++ +G   G   
Sbjct: 73  LPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGTGD 132

Query: 212 YYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           Y+ H    +Y    +  ++   +     + +++  +     DL  +   ++ ++     P
Sbjct: 133 YWSHTFFGKYDWHTNADIDFEANSTHSQVRSSY--MNFVLQDLEMERLDKVFDEHDSKDP 190

Query: 268 LFLYLA----HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
           LFLY A    H  +   +   +L  P++  N         RR Y  +   LD  VG ++ 
Sbjct: 191 LFLYFAPQNPHTPSQPTDEFLNL-YPEDKFNDI-------RRKYLGLTSGLDAMVGKIVE 242

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L   GM+ N+ +IF+SDNGA   E          G N P+RG K++L+EGG K  + ++
Sbjct: 243 KLIENGMMNNTYVIFVSDNGADPPE----------GLNTPFRGGKSSLFEGGTKSNSFIY 292

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
           SP +++    +  +MHI+DWLPTL   AGG
Sbjct: 293 SPLLKKTEYENDGLMHITDWLPTLVRLAGG 322


>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 434

 Score =  192 bits (487), Expect = 6e-46,   Method: Compositional matrix adjust.
 Identities = 138/407 (33%), Positives = 204/407 (50%), Gaps = 54/407 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GP 149
           GW D+ F+G+ +IPTPN+D LA  G+I +N Y + P C+PSRA L+TG+Y    G     
Sbjct: 8   GWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCNM 67

Query: 150 PIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           P  G      G PL+E+ + E L+E GY T AIGKWHLG     Y P  +GF+  FG+  
Sbjct: 68  PYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFDHWFGFPG 126

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G ++Y+    S    +T+        RN     +    Y TD FT EA++ I  Q  +KP
Sbjct: 127 GGMNYWGE--SKNEIQTI-------YRNRKVVPEEELTYLTDDFTTEAIRFIT-QKDEKP 176

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            F+YLA+ A HA +     +A +E + + ++I    R  YAAMV  +D +VG + S L  
Sbjct: 177 FFMYLAYNAPHAPD-----QATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDSTLIA 231

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQ 386
            G+ EN+I++F+SDNG   +E+ +         N PYRG K  L+EGG+KVP  I W  +
Sbjct: 232 NGLKENTILVFLSDNGG-RIEHAD---------NRPYRGHKGMLFEGGIKVPFFITWPTK 281

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           I++    +  +  + D  PT   AAGG                       +++   +DG+
Sbjct: 282 IKEKYAYNEPISSL-DLFPTFLNAAGGK----------------------AKKERQLDGV 318

Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
           D    +L  T    +  L          AVR  ++KL     +N T+
Sbjct: 319 DLLPFILQKTQETPHETLFWRSSGNFEYAVRKGNYKLYKSAYKNKTL 365



 Score = 42.4 bits (98), Expect = 0.72,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 6/69 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD FT EA++ I  Q  +KP F+YLA+ A HA +     +A +E + + ++I    R 
Sbjct: 156 YLTDDFTTEAIRFIT-QKDEKPFFMYLAYNAPHAPD-----QATKEYLEKTKHIEYAGRS 209

Query: 75  TYAALTKST 83
            YAA+  + 
Sbjct: 210 VYAAMVNAV 218


>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
 gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
           CL03T12C18]
          Length = 458

 Score =  191 bits (486), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 181/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP +DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETVADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L TKEA+  I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIHCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
 gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
          Length = 461

 Score =  191 bits (486), Expect = 8e-46,   Method: Compositional matrix adjust.
 Identities = 125/359 (34%), Positives = 193/359 (53%), Gaps = 43/359 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND SFHGS EI TPN+D LA  G+ L+  Y  P C+P+RASL+TG+     G+  P I
Sbjct: 50  GWNDFSFHGS-EIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRPASRMGIVAP-I 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G     +P +   LP+ L +L Y T  +GKWHLG  + E  P   GF+  +G+L+G + 
Sbjct: 108 SGRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFDFSYGFLHGQL- 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY------ATDLFTKEAVQLIEDQPVD 265
                  DQY+ T         +N  + W   G++       TDL T+ AV  I+    D
Sbjct: 166 -------DQYAHTY--------KNGDSTWYRNGKFISEKGHVTDLLTQSAVHYIDTLQTD 210

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +  +L +A+ A H       L+ PQE + ++  I D +RR YAA +  +D  +G ++  L
Sbjct: 211 QNFYLQVAYSAPHIP-----LQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEILQKL 265

Query: 326 QRKGMLENSIIIFMSDNGA-----PTVEY--RETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           + K + +N++++F+SDNGA     P  +Y  +   NY + GSN P R  K + +EG ++V
Sbjct: 266 KDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNY-SLGSNLPLRDFKTSNYEGALRV 324

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           PAI+  P+   N   S   ++++DW+PT    A  +   LP  ++G++  +  LL T S
Sbjct: 325 PAIISWPE-NLNSGTSTNYINVTDWMPTFLNWANAE--ELPSTVEGVN--AQHLLKTAS 378



 Score = 40.4 bits (93), Expect = 2.6,   Method: Compositional matrix adjust.
 Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 5/64 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           + TDL T+ AV  I+    D+  +L +A+ A H       L+ PQE + ++  I D +RR
Sbjct: 191 HVTDLLTQSAVHYIDTLQTDQNFYLQVAYSAPHIP-----LQEPQEWLEKYTGIKDSSRR 245

Query: 75  TYAA 78
            YAA
Sbjct: 246 AYAA 249


>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 460

 Score =  191 bits (485), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 185/346 (53%), Gaps = 33/346 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW D+ F+G+ +IPTPN+D +A  G+I +N Y + P C+PSRA L+TG+Y    G     
Sbjct: 37  GWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCNM 96

Query: 151 IWGAE---PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
            +  E     G PL+E+ + E L+E GY T AIGKWH+G     + P  +GF+  FG+  
Sbjct: 97  PYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIGDHPNLHPPA-QGFDHWFGFPG 155

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G ++Y+    S   +         + RN     +    Y TD FT EA+  I  +  + P
Sbjct: 156 GSMNYWGKATSKIQT---------IYRNTKPVAEEELTYLTDDFTNEAINFINKKDKN-P 205

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            F+YLA+ A HA +      A ++ + Q ++I    R  YAAMV  +D ++G + S L  
Sbjct: 206 FFIYLAYNAPHAPD-----HATKKYLEQTKHIEYAGRSVYAAMVNAVDANIGKIDSTLVA 260

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
            G+ +N+I++F+SDNG                 N PYRG K  L+EGG+KVP  + W  +
Sbjct: 261 NGIKDNTILVFLSDNGGRIQH----------ADNRPYRGHKGMLFEGGIKVPFFISWPSK 310

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           I+++ + +  +  + D  PTL  AAGG+ + +   +DG+D    LL
Sbjct: 311 IKEDSKYTKPISSL-DLFPTLLNAAGGNAA-IEKQLDGVDLMPYLL 354


>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
          Length = 427

 Score =  191 bits (485), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 135/452 (29%), Positives = 211/452 (46%), Gaps = 73/452 (16%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +GKWHLGF+R+E  P  RGF++ FG L G   YY H   D    +  + G+D+  N + A
Sbjct: 2   VGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAA 57

Query: 240 WD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
           WD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP      ++ 
Sbjct: 58  WDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRS 112

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRN 357
           I + NRR YAAM+  LD+++  V  AL+  G   NSIII+ SDNG  PT           
Sbjct: 113 IININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------- 163

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
            GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW PTL + A G    
Sbjct: 164 -GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE 222

Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
             + +DG D W ++   +   R+  +D L     +     +   +    I      +A+R
Sbjct: 223 -DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIR 278

Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
           +  WKL+ G        GY     S+ VP  +F                       SN+ 
Sbjct: 279 VQHWKLLTGNP------GY-----SDWVPPQSF-----------------------SNLG 304

Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
             R   ++ T+  G +              +LFN+  DP E+ ++++  P I  +L   L
Sbjct: 305 PNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSNRYPGIVKKLLRRL 351

Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
               +T VP  +   D  +++P+     W PW
Sbjct: 352 SQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 382



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)

Query: 1   MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
           +  N + AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 50  LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 104

Query: 60  ETINQFQYITDPNRRTYAAL 79
                ++ I + NRR YAA+
Sbjct: 105 RYFEHYRSIININRRRYAAM 124


>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 551

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 130/426 (30%), Positives = 199/426 (46%), Gaps = 72/426 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ +HG N I TP++D LA  G+ LN  Y  P+C+P+RA+LMTG+ P+  G+    I
Sbjct: 47  GWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVI 105

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +  GV   E F+P+  +  GY T  +GKWHLG  +  Y P  RGFE  +G+L+  + 
Sbjct: 106 LPWDNIGVNPAEHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLHTEVG 165

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           +Y             + G D + N   + D  G Y T L   E  + I D+  +KP F+Y
Sbjct: 166 FYPPF--------ANVGGKDFQEN-GVSIDDEG-YETYLLADEVSRYIRDRDEEKPFFIY 215

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----------TDPNRR------------TYA 308
           +  +A H       L+APQE   +++ I           TD  RR             YA
Sbjct: 216 MPFIAPHT-----PLDAPQELQEKYKDIETDLAPARSNQTDSTRRMAKLMMQPSARPMYA 270

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           A+V  +D ++G V+  L  +G+ +N+I++F SDNG     Y           N P RG K
Sbjct: 271 AVVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSY-------GGADNAPLRGGK 323

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
              +EGG++V +++  P + +  +   Q+M + D  PTL  A G      PLN       
Sbjct: 324 GETFEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTLAEATG----VTPLN------- 372

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL----DSWKLV 484
                      N   DG   WSS+       R+ +++   E     + +L    + WKLV
Sbjct: 373 -----------NFPFDGSSLWSSIKEGEIHERDELVMFASEIPIYGSFKLTAFDEEWKLV 421

Query: 485 LGTQEN 490
              +++
Sbjct: 422 QEMEQD 427


>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 487

 Score =  191 bits (484), Expect = 1e-45,   Method: Compositional matrix adjust.
 Identities = 118/320 (36%), Positives = 181/320 (56%), Gaps = 38/320 (11%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPI 142
           LLIV    G+ D+ FHG  +IPTPN+DALA +G+   + Y   P C+P+RA L+TG+Y  
Sbjct: 40  LLIVGDDMGYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQ 99

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
             G +  P  GA   G+PLTE  + + L+++GY+T  +GKWHLG  +    P  RGFE  
Sbjct: 100 RFGHEFNPS-GAN-TGLPLTEVTIADRLKQVGYTTGLVGKWHLGS-QPAMHPQERGFEEF 156

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
            G+L G  S++D       ++ + L GH+  + +        +Y TDLF +EAV  IE  
Sbjct: 157 IGFLGGAHSFFD-------AQGI-LRGHEPVKTI--------DYTTDLFGREAVSFIEKH 200

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             DKP FLYL+  A H       + A ++ + +   I+D  RRTYAAM+  +D+++G V+
Sbjct: 201 R-DKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQERRTYAAMMLAMDEAIGKVL 254

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
           + L+  G  + ++++F+SDNG PT+         N   N P RG K T  EGG++VP ++
Sbjct: 255 TQLETTGQKQKTLVMFISDNGGPTMPGVTI----NGSINTPLRGSKRTTLEGGIRVPFVV 310

Query: 383 -W----SPQIQQNPRVSLQM 397
            W    +P +  +P + L +
Sbjct: 311 SWPGKIAPAVFDSPVIQLDL 330



 Score = 49.7 bits (117), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 6/66 (9%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y TDLF +EAV  IE    DKP FLYL+  A H       + A ++ + +   I+D  R
Sbjct: 183 DYTTDLFGREAVSFIEKH-RDKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQER 236

Query: 74  RTYAAL 79
           RTYAA+
Sbjct: 237 RTYAAM 242


>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
          Length = 441

 Score =  190 bits (483), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 120/323 (37%), Positives = 173/323 (53%), Gaps = 30/323 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ FHG  +IPTP+ID+LA +G   ++ Y   P C+P+RA L+TG+Y    G +  P
Sbjct: 40  GYADVGFHGGKDIPTPHIDSLAASGTRFSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 99

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             G   +G+PLTE  + + L+  GY+T  +GKWHLG    ++ PL RGF   FG+L G  
Sbjct: 100 --GGANKGLPLTETTIADRLQAAGYATGLVGKWHLGT-DPKFHPLKRGFGEFFGFLAGHH 156

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +Y+D   +D    T ++              T   Y TD F +EAV  IE    + P FL
Sbjct: 157 TYFDKQEADIQRGTTKV--------------TEPGYLTDAFGREAVSFIERHQ-NHPFFL 201

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLA  A H       LEA  + +N F  I    RRTYAAM+  +DD++G V+++L+    
Sbjct: 202 YLAFNAVHT-----PLEADDDRLNAFSSIDATPRRTYAAMLSAMDDAIGKVLASLRETAQ 256

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQ 389
            EN++I F+SDNG PT+         N   N P RG K T  EGG++VP ++ W   +  
Sbjct: 257 DENTLIFFISDNGGPTMP----GTTINSSINAPLRGSKRTTLEGGIRVPFVVRWPGHVPA 312

Query: 390 NPRVSLQMMHISDWLPTLYTAAG 412
               +  ++ + D  PT   AAG
Sbjct: 313 GAVYAHPVIQL-DIQPTALAAAG 334



 Score = 45.4 bits (106), Expect = 0.091,   Method: Compositional matrix adjust.
 Identities = 29/65 (44%), Positives = 34/65 (52%), Gaps = 6/65 (9%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD F +EAV  IE    + P FLYLA  A H       LEA  + +N F  I    RR
Sbjct: 178 YLTDAFGREAVSFIERHQ-NHPFFLYLAFNAVHT-----PLEADDDRLNAFSSIDATPRR 231

Query: 75  TYAAL 79
           TYAA+
Sbjct: 232 TYAAM 236


>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
          Length = 511

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 115/333 (34%), Positives = 173/333 (51%), Gaps = 28/333 (8%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           +GWND  F G+  + TP ID L   GI LN  Y Q VC+P+RA+LMTG+YP   G+Q P 
Sbjct: 24  HGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRYPHRYGLQFP- 81

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             G     +   E  LP+Y++  GY+T+A+GKWHLGF   ++TP +RGF+S +G+ +   
Sbjct: 82  FCGGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFDSFYGFYSCAE 141

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL---------STAWDTVG----EYATDLFTKEAVQ 257
            Y+ H L  + S    +   D   +            A++ VG     Y+T LF    V 
Sbjct: 142 DYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYSTTLFAGRIVD 201

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDD 316
           +++     +PLFLY A    H  N G   E PQ   +++   + D  RR  AA +  +D 
Sbjct: 202 IVDGHDPSQPLFLYFASQDTH--NPG---EVPQCYKDRYNTTVPDQVRRKLAAKLTTVDS 256

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAP-TVEYRETSNYRN---WGSNYPYRGVKNTLW 372
           ++  +  A +R GM +N +I++ +DNG P TV+       +      SNYP RG K+  +
Sbjct: 257 ALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNYPLRGGKHNAY 316

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQM---MHISD 402
           EGGV+  A L  P ++   R+  +    +HI D
Sbjct: 317 EGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349


>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 469

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 121/331 (36%), Positives = 177/331 (53%), Gaps = 20/331 (6%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL F GS EI TPNIDALA NG    N Y   P C PSR  L+TG+Y    GM+   
Sbjct: 28  GYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYCGPSRVGLLTGRYQARLGMENNV 87

Query: 151 IWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            +  + +  G+PL+E      L+++GY T   GKWHLG     + P  RGF+  +G+L+G
Sbjct: 88  SYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLGG-APHFQPNKRGFDYFYGFLDG 146

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
             +Y    + DQ   TV  +G+ +    +T      EY T   +++AV+ I  Q  + P 
Sbjct: 147 GHNY----MPDQV--TVGGDGYSLPLMRNTQVTEFDEYLTTALSRDAVKYIHRQQ-ESPF 199

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           F+YL++ A H       L+AP E I ++++I D +RR YAAM+  +D+ +G V+ AL+  
Sbjct: 200 FMYLSYNAPHTP-----LQAPAEYIEKYKHIEDEDRRVYAAMIDVMDEGIGMVVDALKDI 254

Query: 329 GMLENSIIIFMSDNGAPTVE-YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             L+N++IIF+SDNG    E ++ TS   +W  N P+R  K  L EGG+ VP I   P +
Sbjct: 255 NQLDNTLIIFLSDNGGVFPESWQPTS---DWADNSPFRRGKAALLEGGIHVPFIAHWPGV 311

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
                    ++   D   T    AG D  +L
Sbjct: 312 IPKGEEFTGLVSSLDIAATSVALAGADNKKL 342



 Score = 48.1 bits (113), Expect = 0.012,   Method: Compositional matrix adjust.
 Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 9/79 (11%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR    T +D   EY T   +++AV+ I  Q  + P F+YL++ A H       L+AP E
Sbjct: 167 MRNTQVTEFD---EYLTTALSRDAVKYIHRQQ-ESPFFMYLSYNAPHTP-----LQAPAE 217

Query: 61  TINQFQYITDPNRRTYAAL 79
            I ++++I D +RR YAA+
Sbjct: 218 YIEKYKHIEDEDRRVYAAM 236


>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
 gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
          Length = 498

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP++DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 77  GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 135

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 136 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 195

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L T+EA++ I+    + P 
Sbjct: 196 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 242

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 243 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 297

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GG   PA+L
Sbjct: 298 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 348

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 349 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 391


>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
           XB1A]
          Length = 458

 Score =  190 bits (482), Expect = 2e-45,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP++DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L T+EA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GGV  PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +P L     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPMLKDLV-GDHSRPKREYDGI 351


>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 469

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 117/345 (33%), Positives = 170/345 (49%), Gaps = 66/345 (19%)

Query: 92  GWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND+ +H   + I T NIDALA  G+ L + Y  P+CTPSR+  ++GKY IH GMQ   
Sbjct: 40  GYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQFLSGKYLIHNGMQHLV 99

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I    PR +PL +  +   L + GY+T  +GKWHLGF+++E  PL RGF+S FG L G  
Sbjct: 100 IDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFGMLLG-- 157

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
                                                         Q   DQ   KP+F+
Sbjct: 158 ----------------------------------------------QSTVDQGGTKPMFM 171

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           YL++ A H        E P E    ++  I + NRRTYA MV  LD+S+G +   L+ +G
Sbjct: 172 YLSYQAPHL-----PFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESIGKLTDTLKEEG 226

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           +  +++ IF +DNG            +N G+N+P RGVK   +EGG++    +  P +  
Sbjct: 227 LWNDTVFIFSTDNGG--------VGKKNAGNNWPLRGVKGNYFEGGIRGVGFVAGPLLST 278

Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSL 431
           N +  +S  +MHISDW PTL       T +   L +DG++ W  +
Sbjct: 279 NVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVI 323


>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
           variabilis ATCC 29413]
 gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
           ATCC 29413]
          Length = 457

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 122/348 (35%), Positives = 186/348 (53%), Gaps = 30/348 (8%)

Query: 72  NRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
           +R T  +   +    L+   GW DLS +G  +  TPN+D LA  G+   N YA Q VCTP
Sbjct: 33  SRATAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTP 92

Query: 131 SRASLMTGKYP--IHTGMQGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWH 184
           +R + +TG+Y   +  G++ P    ++P     G+P  +  +   L+  GY T  +GKWH
Sbjct: 93  TRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWH 152

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
            G+    + PL +GF+ +FG+L+G I Y+ H  +D   R ++L  +D+    S       
Sbjct: 153 AGY-PPNFGPLQKGFDEYFGHLSGGIEYFTHTGTD---RILDLYENDVPVQRS------- 201

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
            Y TDLFT  AV+ I+ +P  +P +L L + A H    G + +A         Y    ++
Sbjct: 202 GYVTDLFTDRAVEFIQ-RPHSRPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQ 260

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
            TYAAMVK LDD VG V+ AL+  G  +N+++IF SDNG           + N+G   P+
Sbjct: 261 ATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGG--------ERFSNFG---PF 309

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           RG K +L+EGG++VPAI+  P + Q  +VS Q++   D   T+  A G
Sbjct: 310 RGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATG 357


>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  189 bits (481), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 126/399 (31%), Positives = 200/399 (50%), Gaps = 28/399 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++DL +  ++++ +PNID LA N + + N Y QP CTPSRA+ MTG+Y I  GMQ   I
Sbjct: 35  GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  +PL+E  LP+  ++ GY+T   GKWHLGF+  ++ P  RGF+  FG+  G   
Sbjct: 94  KPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  S       E NG +  R      D  G Y+T    ++ +  +++   + PLF +
Sbjct: 154 YFYHD-SGNNCYLYEPNGTEKVR-----LDLNGTYSTKAIAEDFIAKLDEYDPETPLFEF 207

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
           L+    H    G     P E +          R+  +  +  +D  +G V+  L+ KG  
Sbjct: 208 LSFQEVH----GPLQWLPAEVLKMAGGWEGWERKMLSRKIVIVDHFIGEVVKKLKEKGFW 263

Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
           +++I++  SDNG  T         R   SN+P RG K  ++EGG++  A + SP++  + 
Sbjct: 264 KDTILVITSDNGGQT---------REGASNWPLRGRKGDVFEGGIRSRAFIHSPKLPDSL 314

Query: 392 RVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS-RRNSNIDGLDQ 448
           + S    + H++DW PTL   +G    +   N+DG  Q   +  N P  +R   ++ LD 
Sbjct: 315 KGSSFQHVFHVTDWFPTLLRFSG--CQQPDSNLDGKAQ--DIFHNKPEVKRTHLLNFLDP 370

Query: 449 WSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLG 486
           +         R  SVL N   E    + +R ++WKL+ G
Sbjct: 371 FKVAKRKLDDRMFSVLQNRTFEVTVKSVIRTENWKLITG 409


>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 465

 Score =  189 bits (480), Expect = 3e-45,   Method: Compositional matrix adjust.
 Identities = 116/319 (36%), Positives = 174/319 (54%), Gaps = 28/319 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND+ F+G  EIPTP ID++A NG+   N Y    VC PSRA  +TG+Y    G +  P
Sbjct: 32  GYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNP 91

Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            W   +P   +P +E  + E L ++GY    IGKWHLG       P  RGF+  FG+L G
Sbjct: 92  QWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGG 150

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT---VGEYATDLFTKEAVQLIEDQPVD 265
              +    L  Q++  V+ N  D  R+  T  DT     +Y T+ F+ EAV  I+     
Sbjct: 151 GHRFMPEDLVIQHTEEVK-NELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFIKRNH-Q 208

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           KP FL+L++ A H       L+A ++ + +F +I DP R+TYAAMV  +DD V  V+ +L
Sbjct: 209 KPFFLFLSYNAPHLP-----LQATEKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSL 263

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV------P 379
           +   + +N+I+ F+SDNG P+        ++N   N+P +G K+ +WEGG +V      P
Sbjct: 264 KETNIADNTIVFFLSDNGGPS--------HKNKSDNFPLKGQKSDVWEGGFRVPFAMQYP 315

Query: 380 AILWSPQIQQNPRVSLQMM 398
           A + + Q+  +P  SL + 
Sbjct: 316 AAIQAKQVYDHPVSSLDIF 334



 Score = 45.1 bits (105), Expect = 0.11,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 6/70 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y T+ F+ EAV  I+     KP FL+L++ A H       L+A ++ + +F +I DP R
Sbjct: 189 KYLTEEFSDEAVSFIKRNH-QKPFFLFLSYNAPHLP-----LQATEKYLARFPHIKDPKR 242

Query: 74  RTYAALTKST 83
           +TYAA+  + 
Sbjct: 243 KTYAAMVSAV 252


>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
 gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
          Length = 458

 Score =  189 bits (480), Expect = 4e-45,   Method: Compositional matrix adjust.
 Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP++DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RGF   +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L T+EA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GG   PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
           impatiens]
          Length = 413

 Score =  189 bits (479), Expect = 6e-45,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 175/346 (50%), Gaps = 55/346 (15%)

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           +V KLD SVG V+ AL+ +GMLENSI++FMSDNGAPT          N GSNYP RG+KN
Sbjct: 1   VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPT-----NGILSNQGSNYPLRGIKN 55

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           + WEGG +  A +WSP I+++ RVS QMM +SDWLPTL +AAG D  +L           
Sbjct: 56  SPWEGGTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQL----------- 104

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
                       NIDG D W SL+    S R+ ++INID+    AA+R   +K ++G  E
Sbjct: 105 -----------GNIDGFDLWPSLVSGKISPRSEIVINIDDLADYAAIRRGDFKYIIGRTE 153

Query: 490 NGTM-DGYYGQTRSNKVPLLNFNAIVESKTYQSL-------------QQLSQNIFL---- 531
            G    G  G       P  + + ++ SKT  ++             ++  +NI +    
Sbjct: 154 TGIAWLGASGDPSEGVSPRYDPHKVLYSKTGVAISGIITAKQAMELNKKKKENIRIVYDT 213

Query: 532 --------PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
                    I   + +   R++A I C        P      PC LFN+  DPCE  N+A
Sbjct: 214 SSKTNFQEKILTSEDIIEMRKKAQIKCNVTEKDKIPCEPMIAPC-LFNIEQDPCEMVNLA 272

Query: 584 SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
              P I +    +L  HR T++P S+   D  +++P  +N+TW+ W
Sbjct: 273 ERNPVILTIFETILIRHRLTVIPPSNLDGD-PRSNPSLWNNTWTSW 317


>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
 gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
          Length = 488

 Score =  188 bits (478), Expect = 7e-45,   Method: Compositional matrix adjust.
 Identities = 117/318 (36%), Positives = 173/318 (54%), Gaps = 30/318 (9%)

Query: 75  TYAALTKSTTLTLLIV-----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
            YA  TK+ +   +IV      G+ DL F GS EI TPNIDALA+ G++ +N Y   P C
Sbjct: 24  AYAGQTKADSRPNIIVILADDLGYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYC 83

Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            PSRA L+TG+Y    GM+       +    G+P+ E    + +++ GY T  IGKWH+G
Sbjct: 84  GPSRAGLLTGRYQARFGMEINAAHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMG 143

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTV 243
                + P  RGF+  +G+L G   Y+     + +++YS  +  NG   + N        
Sbjct: 144 S-HPNFHPNNRGFDYFYGFLGGGHDYFPESVKVSNEEYSIPLSRNGKPAQLN-------- 194

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
            EY T   +KEA +        +P  +Y+A+ A H     + LEA Q+ + ++Q+I D N
Sbjct: 195 -EYLTTAISKEAAEFA--MTTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDIN 246

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           RRTYAAMV  +D  +G +++ L++ G  EN++I F+SDNG    E    ++  NW SN P
Sbjct: 247 RRTYAAMVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNS--NWASNKP 304

Query: 364 YRGVKNTLWEGGVKVPAI 381
           +R  K +L EGGV VP I
Sbjct: 305 FRRGKVSLTEGGVHVPFI 322



 Score = 45.4 bits (106), Expect = 0.085,   Method: Compositional matrix adjust.
 Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 7/71 (9%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
           + EY T   +KEA +        +P  +Y+A+ A H     + LEA Q+ + ++Q+I D 
Sbjct: 193 LNEYLTTAISKEAAEFA--MTTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDI 245

Query: 72  NRRTYAALTKS 82
           NRRTYAA+  S
Sbjct: 246 NRRTYAAMVDS 256


>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
 gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
          Length = 474

 Score =  188 bits (477), Expect = 8e-45,   Method: Compositional matrix adjust.
 Identities = 122/347 (35%), Positives = 179/347 (51%), Gaps = 37/347 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS EIPTPN+D LA  G++    Y +  VC PSRA L+TGKYP   G +   
Sbjct: 47  GYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENN 106

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+ L +  +  YL E GY T  IGKWH G   R + PL RGF+  
Sbjct: 107 VPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQGNEDR-FHPLKRGFDEF 165

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---YATDLFTKEAVQLI 259
           FG+  G  SY+    +   SR  +     + RN    ++  GE   Y TD    E ++ I
Sbjct: 166 FGFRGGARSYFPFTQAHPSSRREDF----LERN----FNNYGESPLYLTDALANETIEFI 217

Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
           +     +P F +L+  A H       +EA +  ++QF ++T   R+ YAAM+  +D ++G
Sbjct: 218 KRNK-HQPFFTFLSLSAPH-----YPMEAEKSDLDQFPHLTG-ERKIYAAMMLNMDRAIG 270

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            V+  LQ++G+ EN+++IF +DNG P+          N   N P  G K  L EGG++VP
Sbjct: 271 RVLQTLQQEGLSENTLVIFTNDNGGPSDH--------NGSINLPLSGTKANLLEGGIRVP 322

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            I+  P + +       M+   D LPT Y AAGGD  ++  + DG+D
Sbjct: 323 MIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAGGDVRQV-TDWDGVD 368


>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 1605

 Score =  188 bits (477), Expect = 9e-45,   Method: Compositional matrix adjust.
 Identities = 153/547 (27%), Positives = 240/547 (43%), Gaps = 69/547 (12%)

Query: 85  LTLLIVYGWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
             L+   G ND+ +   + +  TP ID LA  G+ L   Y+  +CTP+RA+LM+G YP+ 
Sbjct: 96  FVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALMSGHYPMR 155

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            GMQ   I    P G+P     + E L+ LGY+T  +GKW LG  +  + P+ RGF++ +
Sbjct: 156 IGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVNRGFDTWY 215

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRR-------------NLSTAWDTVGEYATDL 250
           GYL+  I YY H     +  TVE     M               +L    D  G ++++L
Sbjct: 216 GYLSDEIDYYSHEYPAPFE-TVEDGATVMASFTDYVFMERSRPYDLQYMPDLNGTHSSEL 274

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQY------ITDPN 303
           +T+   Q+++     + PLF+Y A    H    G     P+   +  ++      I    
Sbjct: 275 YTQRVQQIVKSANASREPLFVYYASQMTH----GPLDVPPRSAFSDAEWARIGADIKGTV 330

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           RR +A+++  LD SVG V+  ++R  +  +++ +F SDNG           +   G+NYP
Sbjct: 331 RRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGG---------CWAQGGTNYP 381

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
            RG K+ LWEGGVKVPA ++S  I  +   R    + H++DW PTL        S L   
Sbjct: 382 LRGGKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSNSVLAST 441

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
            D  D+ ++ L      +  ++DG  QW +L+      R  V IN+     T        
Sbjct: 442 NDDDDETTTAL-----EKVDDLDGFFQWEALMGLADPPRVEVPINVQSYAFTC------- 489

Query: 482 KLVLGTQENGTM---DGYYGQTRSNKVP----LLNFNAIVESKTYQSLQQLSQNIF---- 530
                T  NGT+       G       P    +   NA   +  Y   + L         
Sbjct: 490 -----TGPNGTLLQDADSLGAVSDCADPWHILVRQGNATRAALRYGDWKLLVNEFAEPWY 544

Query: 531 -LPISNIDKMRSTRQQATIHCGANPAPMT---PSPCTNGPCYLFNLGNDPCEQNNIASSR 586
            +P    D +       +   G +  PM     +  T    YLF+L NDP E  +++  +
Sbjct: 545 GVPAEGADALGLDLADGSADLGDDEGPMDNCGTAVGTEAHTYLFDLKNDPSETMDLSGEK 604

Query: 587 PDISSQL 593
           P++   L
Sbjct: 605 PEVRDAL 611


>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
          Length = 365

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 127/366 (34%), Positives = 182/366 (49%), Gaps = 39/366 (10%)

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           + L+HLA H GN    ++AP+E + +F +I DP RRTYA M+  LD SV   I AL+  G
Sbjct: 1   MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKDNG 60

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           ML NSII+  SDNGAPT+       + N GSNYPYRG K + WEGG++    LWSP +++
Sbjct: 61  MLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKE 115

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              VS Q +H  DWLPTL  AAG                 SL  + P      +DG++ W
Sbjct: 116 RGYVSNQAIHAVDWLPTLAGAAG----------------VSLPQDLP------LDGINLW 153

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L  N   +  +++  +DE    ++   D+ K V G+   G  D + G+  +N+   L 
Sbjct: 154 PMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG 213

Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----GANP--APMTPSPCT 563
             +  +      +Q L  N  L     D++R  R +AT  C    G NP  +     P  
Sbjct: 214 -ESYEQHVLASDVQSLLGNRGL---TKDRIRQMRSEATETCPPIEGQNPLESHFKCEPL- 268

Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
             PC+ F+L  DPCE+ N+A   P    QL + L+  R+T +P +       +A+P   N
Sbjct: 269 KAPCF-FDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHN 327

Query: 624 DTWSPW 629
             W  W
Sbjct: 328 GNWEWW 333



 Score = 49.7 bits (117), Expect = 0.005,   Method: Compositional matrix adjust.
 Identities = 20/44 (45%), Positives = 29/44 (65%)

Query: 39 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
          + L+HLA H GN    ++AP+E + +F +I DP RRTYA +  S
Sbjct: 1  MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISS 44


>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
 gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
          Length = 452

 Score =  187 bits (476), Expect = 1e-44,   Method: Compositional matrix adjust.
 Identities = 127/348 (36%), Positives = 183/348 (52%), Gaps = 29/348 (8%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
           +LI+Y    G  D++ +G+ ++ TPNID LA  G++ +  YA  PVC+PSRASL+TG+YP
Sbjct: 30  VLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGRYP 89

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
               +           G+P ++  + E  ++ GY+T  IGKWH+G+   E  P  +GF+ 
Sbjct: 90  QRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIGY-SPETMPNQQGFDY 148

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE- 260
            FG++ G I  Y H     Y      N HD+ RN    W+  G++  DL  +E    +E 
Sbjct: 149 SFGFMGGCIDNYSH-----YFYWAGPNRHDLWRNGQEIWED-GKFFADLTVQEVNGFLEK 202

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
           ++  DKP FLY A    H    G      QE   Q+       RR YAA V  +D+ +G 
Sbjct: 203 NKRADKPFFLYWAINMPHYPLQG------QEKWRQYYKDLPAPRRMYAAAVSTMDEKIGQ 256

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V+  L R G+ EN+I++F SD G  T    E  ++   G   PYRG K +L+EGG++VPA
Sbjct: 257 VLQQLDRLGLAENTIVVFQSDQGHST----EDRSFGGGGFTGPYRGAKFSLFEGGIRVPA 312

Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLD 426
           I+ W+  + +N  V  Q+    DW PTL   AG     LP   IDG D
Sbjct: 313 IIRWTGHLPKN-EVRDQLCVNIDWYPTL---AGLCKVALPQRKIDGKD 356


>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
 gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
          Length = 484

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 125/344 (36%), Positives = 185/344 (53%), Gaps = 29/344 (8%)

Query: 92  GWNDLSF-HGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGP 149
           G+ND+ F +G  EI TP +DALA  G++  N Y   P C PSRA L+TG+Y    GM+  
Sbjct: 39  GYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITGRYQARFGMENN 98

Query: 150 PIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
             +  + +  G+PLTE+  P  L+E+GY T   GKWHLG     + P  RGF+  +G+L+
Sbjct: 99  VTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLGG-APHFQPNERGFDYFYGFLD 157

Query: 208 GVISYYD---HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           G  +Y     H+ +  Y   +      MR      +D   EY T   +++A + IE    
Sbjct: 158 GGHNYMPGEVHLGAGGYLLPI------MRNKGVAEFD---EYLTTALSRDAARYIERTSK 208

Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
           ++ P F+Y+++ A HA      L+APQ  + ++ +I D  RRTYAAMV  +D+ +G +I 
Sbjct: 209 EQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEKRRTYAAMVDAMDEGIGMLID 263

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
           AL + G L+N++I F+SDNG     Y    +Y  W  N P+R  K  L EGGV VP I  
Sbjct: 264 ALDKSGELDNTLIFFLSDNGG-LYPYSWWPDY-TWADNSPFRFGKVALTEGGVHVPFIAH 321

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           W  +I +  R++  ++   D   T    AG +T    L  DG+D
Sbjct: 322 WPKKITKGKRLN-GLVSALDIAATSLALAGANTKDAKL--DGVD 362



 Score = 45.1 bits (105), Expect = 0.12,   Method: Compositional matrix adjust.
 Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 6/70 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           EY T   +++A + IE    ++ P F+Y+++ A HA      L+APQ  + ++ +I D  
Sbjct: 189 EYLTTALSRDAARYIERTSKEQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEK 243

Query: 73  RRTYAALTKS 82
           RRTYAA+  +
Sbjct: 244 RRTYAAMVDA 253


>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 485

 Score =  187 bits (475), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 139/387 (35%), Positives = 197/387 (50%), Gaps = 65/387 (16%)

Query: 80  TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV-CTPSRASLM 136
           T    + LL+    G+ +L   G+ +IPTP+ID+LA NG+     Y     C+PSRA L+
Sbjct: 39  TDRPNIVLLLADDLGYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLL 98

Query: 137 TGKYPIHTGMQGPPIWG--AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           TG+Y    G +  P      +P  G+PL E+ L   L + GY+T  +GKWHLG   R + 
Sbjct: 99  TGRYQSRFGYEQNPTGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLGGTAR-FH 157

Query: 194 PLYRGFESHFGYLNG----VISYYDHILSDQYSRTVELNGH-----DMRRNLST------ 238
           PL RGF+  FG+L+     V   Y+ + S    R    NG      D R  LST      
Sbjct: 158 PLRRGFDEFFGFLHEGHFFVPPPYEGV-STFLRRRALPNGKTGRWGDERLMLSTHMGHDE 216

Query: 239 -AWDT------------VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
            A+D                Y TD FT+EA   I  +  D+P FLYLA+ A H+      
Sbjct: 217 PAYDANNPILRGGQPVEEAAYLTDAFTREACDFIA-RNQDRPFFLYLAYNAVHS-----P 270

Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
           L+     + QF +I D  RR +AAM+  LDDSVG V+  L++  +  N+I+IF+SDNG P
Sbjct: 271 LQGADAYMQQFAHIADQQRRIFAAMLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGP 330

Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI-----QQNPRVSLQMMH 399
           T   RE ++     SN P RG K  ++EGG++VP ++ W  QI     ++ P +SL    
Sbjct: 331 T---RELTS-----SNAPLRGEKGDMYEGGIRVPVLMQWPGQIPAGRVEERPIISL---- 378

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLD 426
             D  PT   AA    + LP  +DG++
Sbjct: 379 --DLFPTCLAAA---EAELPTELDGVN 400



 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 6/65 (9%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD FT+EA   I  +  D+P FLYLA+ A H+      L+     + QF +I D  RR
Sbjct: 237 YLTDAFTREACDFIA-RNQDRPFFLYLAYNAVHS-----PLQGADAYMQQFAHIADQQRR 290

Query: 75  TYAAL 79
            +AA+
Sbjct: 291 IFAAM 295


>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
 gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
 gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
          Length = 458

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 121/344 (35%), Positives = 181/344 (52%), Gaps = 39/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHGS EI TP++DAL   G+ L   Y  P+ TP+RA LMTG+YP   G++   I
Sbjct: 37  GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                 G+   E  + + L   GY  +A IGKWHLG  ++ + P+ RG    +G+LNG I
Sbjct: 96  PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAI 155

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+D       +R  EL+ H+        W+T  +  Y+T+L T+EA++ I+    + P 
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
            LY+A+ A H       L+A ++ I    + F  +T  +  + TY+AMV  +D  +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++KG+++N+  IF SDNG   V            S+ P RG K   W+GG   PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 308

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W    +Q   +S Q+    D +PTL     GD SR     DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351


>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 196

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 100/233 (42%), Positives = 132/233 (56%), Gaps = 52/233 (22%)

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           +G WHLG+++  YTP  RGFES +GY N    YY+H L                      
Sbjct: 11  LGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYYNHTL---------------------- 48

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
                                    D PLFLYL+HLA HAG     ++AP+E + +F YI
Sbjct: 49  -------------------------DLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYI 83

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
              NR  YA  V  LD SVG V  AL++K ML+N++++F+SDNGA  + + E SN    G
Sbjct: 84  GVKNRSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGA--IPWGEWSNA---G 138

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           SN+P RG K TLWEGGV+VPA +WSP +++  RVS Q+MH+SDW+PTLY+ AG
Sbjct: 139 SNWPLRGAKFTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191



 Score = 54.3 bits (129), Expect = 2e-04,   Method: Compositional matrix adjust.
 Identities = 25/62 (40%), Positives = 37/62 (59%)

Query: 30  DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLI 89
           +  +D PLFLYL+HLA HAG     ++AP+E + +F YI   NR  YA    +   ++ I
Sbjct: 45  NHTLDLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVKNRSLYAGAVDALDQSVGI 104

Query: 90  VY 91
           V+
Sbjct: 105 VF 106


>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 441

 Score =  187 bits (474), Expect = 2e-44,   Method: Compositional matrix adjust.
 Identities = 131/361 (36%), Positives = 193/361 (53%), Gaps = 42/361 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTG-MQGP 149
           G +D S +GS ++ TP+ID++A+NGI     Y A  VC+PSRA L+TG+Y    G +   
Sbjct: 32  GSSDFSCYGSKQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANI 91

Query: 150 P-----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           P         E  G+P+TE  L + L+ELGYST  IGKWHLG     + P  RGF++ +G
Sbjct: 92  PHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLGE-ADHFHPNARGFDNFYG 150

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMR--RNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +L+G  +Y+            EL G   R  RN   A  + G Y T++FT+EA+++I+++
Sbjct: 151 FLSGARTYF---------LGGELRGDMDRIMRNKEFAEPSSG-YTTEVFTQEAIRIIQEE 200

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             DKP F+YL+H A H       ++A  E I  + +  +P R+ Y+ ++K LDD  G ++
Sbjct: 201 Q-DKPFFIYLSHNAVHGP-----MDAKDEDIMSYDF-KNPLRKKYSGLMKNLDDQTGLLL 253

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL+     EN++I FMSDNG PT          N  SN+P RG K + +EGG + P +L
Sbjct: 254 QALKDSKQYENTLIFFMSDNGGPTT--------HNGSSNWPLRGFKGSEFEGGNRTPFLL 305

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLDQWSSLLLNTPSRRN 440
             P+       S + +   D   T   AAGG+  T R    ID L      ++N P   N
Sbjct: 306 QWPEKISAGLSSDKPIIAYDVFATCIQAAGGELVTDRTYHGIDLLP-----VINKPQETN 360

Query: 441 S 441
           +
Sbjct: 361 A 361



 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 28/82 (34%), Positives = 49/82 (59%), Gaps = 8/82 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           + RN   A  + G Y T++FT+EA+++I+++  DKP F+YL+H A H       ++A  E
Sbjct: 171 IMRNKEFAEPSSG-YTTEVFTQEAIRIIQEEQ-DKPFFIYLSHNAVHGP-----MDAKDE 223

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            I  + +  +P R+ Y+ L K+
Sbjct: 224 DIMSYDF-KNPLRKKYSGLMKN 244


>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
 gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
           spiritivorum ATCC 33300]
          Length = 466

 Score =  186 bits (472), Expect = 3e-44,   Method: Compositional matrix adjust.
 Identities = 118/377 (31%), Positives = 198/377 (52%), Gaps = 35/377 (9%)

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNG 116
           T+NQ  ++      T  +++      ++++     G+ D   +GS +IPTP+IDALA  G
Sbjct: 3   TLNQIVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGG 62

Query: 117 IILNNMYAQP-VCTPSRASLMTGKYPIHTGMQ----GPPIWGAEPRGVPLTE--RFLPEY 169
           I   N Y    VC PSRA L+ G+Y   +G +      P  G + + + L++  R + + 
Sbjct: 63  IRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQ 122

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           ++  GY T AIGKWH G    ++ PL++GF   FG++ G  S++         RT     
Sbjct: 123 MQSNGYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFFP-------IRTAIKQE 174

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
             +  + +   +    Y TD+FT +A+  +  Q  DKP F+YL++ A H       +EA 
Sbjct: 175 EKILNDYTEVDEKDVYYLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEAT 228

Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
            + + QF  + + +RR+YA ++  +DD +G ++  L++ G LEN+++IF++DNGA T   
Sbjct: 229 PQKLAQFARLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT--- 285

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                  N   N P RG+K + WEGG++VP I+  P+  Q  +    ++   D +PT   
Sbjct: 286 ------NNGADNGPLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVA 339

Query: 410 AAGGDTSRLPLNIDGLD 426
           AA G T R  + +DG++
Sbjct: 340 AAQG-TVRSDMQLDGVN 355



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD+FT +A+  +  Q  DKP F+YL++ A H       +EA  + + QF  + + +RR
Sbjct: 191 YLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEATPQKLAQFARLKNAHRR 244

Query: 75  TYAALTKS 82
           +YA L  +
Sbjct: 245 SYAGLMSA 252


>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
 gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
          Length = 483

 Score =  186 bits (472), Expect = 4e-44,   Method: Compositional matrix adjust.
 Identities = 123/347 (35%), Positives = 187/347 (53%), Gaps = 27/347 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DLS  GS ++ TPN+D LA  G    + YA   VC+PSRA L+TG+YP  TG++   
Sbjct: 28  GYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRACLLTGRYPYMTGVRSIL 87

Query: 149 ---PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
                  G  P G+P         L++LGY+T  +GKWHLG    E  P + GF+   G+
Sbjct: 88  GGIKTTTGLNP-GIPT----FASALKDLGYTTGMVGKWHLGAVP-ECRPTHMGFDYFCGF 141

Query: 206 LNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-P 263
           L+GV  Y+ HI   + +    +N  HD+  N        GEY+T+LF ++ ++ I +Q  
Sbjct: 142 LSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLEFIREQVE 201

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            D P  LY A  A H       + AP + + +F+++ + +R+  AAM+  +DD VG +++
Sbjct: 202 KDMPFALYCAFNAPH-----YPMHAPYKYLERFKHLPE-DRQIMAAMLSAVDDGVGEIMN 255

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYR----ETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            L+R+G+  ++II F SDNG P+ E R    E  +Y   GS    +G K +L++GG++VP
Sbjct: 256 YLKRRGIFNDTIIYFQSDNG-PSKESRNWLDERKDYYYGGSTGGLKGHKFSLFDGGIRVP 314

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           AI   P +    +V  +    +D  PT   AAGG+ S     I G D
Sbjct: 315 AIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASD--YEISGCD 359


>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
 gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
           ATCC 33861]
          Length = 466

 Score =  186 bits (471), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 117/377 (31%), Positives = 197/377 (52%), Gaps = 35/377 (9%)

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNG 116
           T+NQ  ++      T  +++      ++++     G+ D   +GS +IPTP+IDALA  G
Sbjct: 3   TLNQIVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGG 62

Query: 117 IILNNMYAQP-VCTPSRASLMTGKYPIHTGMQ----GPPIWGAEPRGVPLTE--RFLPEY 169
           +   N Y    VC PSRA L+ G+Y   +G +      P  G + + + L++  R + + 
Sbjct: 63  VRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQ 122

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           ++  GY T AIGKWH G    ++ PL++GF   FG++ G  S++         RT     
Sbjct: 123 MQSNGYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFFP-------IRTAIKQE 174

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
             +  + +   +    Y TD+FT +A+  +  Q  DKP F+YL++ A H       +EA 
Sbjct: 175 EKILNDYTEVDEKDVYYLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEAT 228

Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
            + + QF ++ D  RR+YA ++  +DD +G ++  L++ G LEN+++IF++DNGA T   
Sbjct: 229 PQKLAQFAHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT--- 285

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                  N   N P RG+K + WEGG++VP I+  P+     +    ++   D +PT   
Sbjct: 286 ------NNGADNGPLRGLKGSKWEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVA 339

Query: 410 AAGGDTSRLPLNIDGLD 426
           AA G T R  + +DG++
Sbjct: 340 AAQG-TVRSDMQLDGVN 355



 Score = 47.4 bits (111), Expect = 0.024,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD+FT +A+  +  Q  DKP F+YL++ A H       +EA  + + QF ++ D  RR
Sbjct: 191 YLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEATPQKLAQFAHLKDAQRR 244

Query: 75  TYAALTKS 82
           +YA L  +
Sbjct: 245 SYAGLMSA 252


>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
          Length = 548

 Score =  185 bits (470), Expect = 5e-44,   Method: Compositional matrix adjust.
 Identities = 149/535 (27%), Positives = 239/535 (44%), Gaps = 88/535 (16%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG++D  +  S +I TPNID L  +GI   N Y+   C+PSR+S ++G+Y   +GMQ   
Sbjct: 39  YGYHDAGYRNS-DIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGV 97

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I       + L   FL +YL+EL Y+T A GKWHLG+  +E TP YRGF++  G  +G  
Sbjct: 98  ISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEG 157

Query: 211 SYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY+H      Y      NG  + R+ S      G ++ DL   + +  + D+    PLF
Sbjct: 158 KYYEHTTFQGMYDWH---NGTKLDRSAS------GIHSQDLIEADLLNQL-DENDGTPLF 207

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISA 324
            Y A    H         +P +   + + + D       R+ Y  +V  +D  +G V++ 
Sbjct: 208 YYAAFHNTH---------SPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 258

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L+ K M +N+ I F SDNG    E          G N P RG K++L++GG    +   S
Sbjct: 259 LKEKDMFDNTYIFFSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASS 308

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
           P +++    +  ++HI+DWLPT+   AGG+                         +  ID
Sbjct: 309 PLLKKKGVEADGLLHITDWLPTIVKLAGGEVP----------------------ESDGID 346

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYY 497
           G++Q   +    PS+R  ++ NID +           A R   +KL+ G +  G  DGY 
Sbjct: 347 GIEQVDLITNAGPSKRTDMIYNIDMELEIGTPDFGAMAARDLRYKLIWGFE--GRSDGYG 404

Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKM-----RSTRQQATIHCGA 552
                     +    I E      + + +  +   + N+DK       +T+Q   +    
Sbjct: 405 ----------MGAQFIYEVDRVLDVVKEADAV---VENVDKRGGPYPMNTQQHHILRTYM 451

Query: 553 NPAPMTPS--PCTNGPCYLFNLGNDPCEQNNI-ASSRPDISSQLYELLKYHRRTL 604
               +T        GP  LF+L  DP E  ++ AS+ P+  +   +L++  R  +
Sbjct: 452 QTKLVTLDDIEAGTGPMMLFDLQEDPNETTDLSASTDPEHQAAKLKLIEMLRSAI 506


>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
          Length = 327

 Score =  185 bits (470), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 124/382 (32%), Positives = 196/382 (51%), Gaps = 68/382 (17%)

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYA 308
           ++T +A+  I +    +PLFLYLA  A H+ N+    L+APQE I++F YI    RR YA
Sbjct: 1   MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           AM+  +D  +G V  AL  K ML+NS+IIF +DNG P+  +          +N+P RGVK
Sbjct: 61  AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWA-----NNFPLRGVK 115

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            TL+EGGV+    ++S  I+++ RVS  ++H++DWLPTL + AGG+ S +   +DG DQW
Sbjct: 116 ATLFEGGVRGVGFVYSKLIEKS-RVSHDLIHVTDWLPTLVSLAGGNVSDISF-LDGFDQW 173

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK-RTAAVRLDSWKLVLGT 487
           ++L       + S                S+RN +L+NIDE   +  A+R+ SWKL+   
Sbjct: 174 ATL-----QNKQS----------------SQRNEILLNIDEHVWKNEALRVGSWKLIKEG 212

Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
            +N   DG+Y                                  P S  + M  +   + 
Sbjct: 213 LKN--WDGWYP---------------------------------PPSYNESMDESLNSSI 237

Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
           + CG +  P+  + C +  C LF++  DPCE  +++   PD+ + +   L  +++++VP 
Sbjct: 238 LKCGHD-FPVVINHCDDY-C-LFHVVEDPCEIVDLSKKFPDVLATMMYRLNIYKQSMVPP 294

Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
            +      ++DPK  N  W PW
Sbjct: 295 RNNMTIDSRSDPKLHNGVWQPW 316



 Score = 55.8 bits (133), Expect = 6e-05,   Method: Compositional matrix adjust.
 Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 1/62 (1%)

Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYA 77
          ++T +A+  I +    +PLFLYLA  A H+ N+    L+APQE I++F YI    RR YA
Sbjct: 1  MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60

Query: 78 AL 79
          A+
Sbjct: 61 AM 62


>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 810

 Score =  185 bits (470), Expect = 6e-44,   Method: Compositional matrix adjust.
 Identities = 122/338 (36%), Positives = 177/338 (52%), Gaps = 42/338 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G++D+ F+G  EIPTP +D LA  G++  N YA  P C+PSRA L+TG+Y    G +G  
Sbjct: 40  GYSDVGFNGCKEIPTPRLDELAGEGVVFTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNP 99

Query: 149 --PPIW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
              P W G +  G+PL+E  L + L+E GY T AIGKWHLG   + + P  RGF+  FG+
Sbjct: 100 EPDPQWHGDDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 158

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             G +SY+  +        V      + R   T       Y TD F+ EAV+ I+    D
Sbjct: 159 SGGGLSYWGDLGRMDPLLGVHRGDEPVDRKSLT-------YLTDDFSTEAVKFIQRHETD 211

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
            P FLYLA+ A HA +     +A +  + +  +I    R  Y AMV  +D+ +G V+  +
Sbjct: 212 -PFFLYLAYNAPHAPD-----QATRAHLEKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQI 265

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +  G+ +N++I+F SDNG            R    N PYRG K  L+EGG++VP ++  P
Sbjct: 266 RESGLDDNTLIVFYSDNGG----------RREHAMNLPYRGHKGMLFEGGIRVPFLVSWP 315

Query: 386 QI------QQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
                   +Q+P  +L      D  PT+  AAG D S+
Sbjct: 316 GTLPGGVREQSPVTAL------DVFPTVLAAAGLDPSK 347


>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
 gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
          Length = 703

 Score =  185 bits (469), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 141/424 (33%), Positives = 213/424 (50%), Gaps = 62/424 (14%)

Query: 86  TLLIVY---GWNDLSFHGSNEIP----TPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
            L+IV    G  D+ ++ S E P    TP +D+LA   + L   Y  PVCTP+RA+L+TG
Sbjct: 64  VLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAALLTG 123

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR-ELGYSTKAIGKWHLGFFRREYTPLYR 197
           +Y ++ GM  P I G    G+  +   LPE L+ E  YST  +GKWHLG  + +  PL R
Sbjct: 124 RYAVNVGMPFPLI-GDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRPLAR 182

Query: 198 GFESHFGYLNGVISYYDHILS---DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
           GF+S +G L     +Y   +    D +    E+   ++            E+AT LF++E
Sbjct: 183 GFDSFYGLLGASFDHYTKKMGEVRDLWKNEAEVPAKEVDEK---------EHATTLFSRE 233

Query: 255 AVQLIEDQPVD----------KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
           AV++IE+               PLFLYLA+ A HA      L+A ++ +     + + +R
Sbjct: 234 AVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQADEKFMKLCSDVPNRHR 288

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
           RT+ AM+ +LD+ +G V ++L+  GM E+++++F SDNG           +   G NY Y
Sbjct: 289 RTFCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNGG---------YFLVGGFNYNY 339

Query: 365 RGVKNTLWEGGVKVPAILWSPQ-IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           RG K+  WEGGV+VPA + +P+     PR    M H++DW+PTL   A G         D
Sbjct: 340 RGGKSGGWEGGVRVPAFIRAPKRYNFAPRDFRGMAHVADWMPTLMGMATGREEGRERMGD 399

Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
           G D   +        R     G ++         S RN  L+ +D  + T+A R   +KL
Sbjct: 400 GYDLRKAF-------RGDEEGGKEK---------SPRNEALLQLDIFQNTSAYRSGCFKL 443

Query: 484 VLGT 487
           +LG 
Sbjct: 444 LLGC 447



 Score = 41.6 bits (96), Expect = 1.2,   Method: Compositional matrix adjust.
 Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 15/76 (19%)

Query: 14  EYATDLFTKEAVQLIEDQPVD----------KPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
           E+AT LF++EAV++IE+               PLFLYLA+ A HA      L+A ++ + 
Sbjct: 224 EHATTLFSREAVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQADEKFMK 278

Query: 64  QFQYITDPNRRTYAAL 79
               + + +RRT+ A+
Sbjct: 279 LCSDVPNRHRRTFCAM 294


>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 294

 Score =  185 bits (469), Expect = 8e-44,   Method: Compositional matrix adjust.
 Identities = 123/328 (37%), Positives = 163/328 (49%), Gaps = 42/328 (12%)

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLYLAH A H+GN G+ LE P+    Q++ I   NR  YA MV  LD+SVG V  AL 
Sbjct: 1   PLFLYLAHQAPHSGNFGR-LEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVFEALD 59

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           ++GML NSI++F SDNG    E  ET N    GS +P RG K +L EGGV+ PA+LWSP 
Sbjct: 60  KRGMLSNSIVVFSSDNGGAPDE--ETYNS---GSPWPLRGQKMSLSEGGVRAPALLWSPL 114

Query: 387 IQQNP-RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
           +      +   + HI+DWLPT Y  AGG  S L                        IDG
Sbjct: 115 LSATEGSIFDGLFHITDWLPTFYELAGGSPSSL----------------------GAIDG 152

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
           + Q  +L   T + R  +LINID ++  +A+   ++KLVLG  E G  + +Y        
Sbjct: 153 VSQLGALTRQTTAPRKEILINIDPEENCSAIIEGNFKLVLGVAEQGLYENWY-------- 204

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA---TIHCGANPAPMTPSPC 562
           PL         K  Q  ++      L    +  +  +R  A    + CGA        P 
Sbjct: 205 PLSGHVDWESDKPRQECEKSVVARVLEHHGVRPVCGSRTGAYATPVQCGARDPSRECVP- 263

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDIS 590
           T   C LFNL  DPCE NN+A   P+++
Sbjct: 264 TKSLC-LFNLSKDPCEYNNVADQHPEVT 290



 Score = 46.2 bits (108), Expect = 0.059,   Method: Compositional matrix adjust.
 Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)

Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
          PLFLYLAH A H+GN G+ LE P+    Q++ I   NR  YA +  +   ++  V+
Sbjct: 1  PLFLYLAHQAPHSGNFGR-LEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVF 55


>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
          Length = 617

 Score =  184 bits (468), Expect = 9e-44,   Method: Compositional matrix adjust.
 Identities = 137/421 (32%), Positives = 194/421 (46%), Gaps = 57/421 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ ++ S E  TPNI+ LA NGIIL++ Y+QPVC+PSR+  +TG+Y    GMQ   I
Sbjct: 169 GYNDIGYN-SIEAFTPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 227

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P GVPLTE+ LPE  +E GYST   GKWH G F   Y P  RGF+   G  +G   
Sbjct: 228 LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSGSSQ 287

Query: 212 YYDH---ILSDQYSRTVELNGHDMRRN--LSTAWDTVGEYATDLFTKEAVQLI------E 260
           +  H     S Q        G+DMR    +    +  G   +D   +E    I       
Sbjct: 288 HSTHEKCFNSPQRDSPFSFCGYDMREAEVVEARQNLTG---SDFLLREICIRITVHGPLT 344

Query: 261 DQPVDKPLF--LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
             P D+ ++  LY  H                ETI +        R    AMV  LD ++
Sbjct: 345 TDPRDEKIYEELYERHR--------------NETIYK--------RTKMLAMVTGLDRAI 382

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G +I + ++ G  +N+I  F SDNG  ++ YR         +N P+RG KN L EGGV+ 
Sbjct: 383 GRIIESFKKYGKYKNTIFAFASDNGG-SLTYR--------ANNMPFRGGKNDLLEGGVRT 433

Query: 379 PAILWS---PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LN 434
            + L S   P  +   R S  +  I+DW PTL  A       LP  +DG   + +     
Sbjct: 434 LSFLHSSLIPNDKIGSRYS-NLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFFSAG 492

Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRLDSWKLVLGTQENG 491
             S R+  +  LD            R  V +     D + +T A+R+  +KL+ G +   
Sbjct: 493 NASVRDEIVHHLDPLKKADAEVEDPRPFVKVYGFKYDIRMKT-AIRVGKYKLITGAEATK 551

Query: 492 T 492
           T
Sbjct: 552 T 552


>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 889

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 159/587 (27%), Positives = 251/587 (42%), Gaps = 118/587 (20%)

Query: 109 IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPE 168
           ++ LA  G+ L + YA P CT SRA+L TGKYP+H G+Q   I  +EPRGVPL E FL +
Sbjct: 1   MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60

Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
            L++ GY T  +GKWHLGF +  YTP  RGF+  FG L G   +  H  ++ ++    + 
Sbjct: 61  KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFA----MR 116

Query: 229 GHDMRRNLSTAWDT--------VGE--------YATDLFTKEAVQLIED--QPVDKPLFL 270
           G D +  ++T   +        VGE        ++T L+T ++   +    +  D PLFL
Sbjct: 117 GPDYKSKITTMTGSNLWHNGRPVGEDEEGVRDTHSTHLYTAKSRAYVAKYAEMADVPLFL 176

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------RRTYAAMVKKLDDSV 318
            +++ A HA      ++ P E ++   Y                 RR    MV  +D+S+
Sbjct: 177 TVSYQAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESL 231

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
            ++   L+ + M EN+++ F+SDNG            R+  SN   RG K + +EGG++V
Sbjct: 232 ASIHDDLKAEAMWENTLLFFLSDNGGI---------LRHGSSNLDLRGEKGSYYEGGIRV 282

Query: 379 PAIL---WSPQIQQNPRVSLQMMHISDWLPTLYTAAG------------------GDTSR 417
           PA +   +  ++    RV   ++HI+D   T    AG                  G   R
Sbjct: 283 PAFVSGGYLAKLLSAGRVFEGLVHITDLHATALRVAGVSSKEDKRAAPAVAAAKVGHLRR 342

Query: 418 LPLNIDGLDQWSSLLLNTPSRRN--SNIDGLDQWSSLLLN---TPSRRNSVLINIDEKKR 472
           L           + L +  S  +   ++DG+DQW + +       S+R  V  NI     
Sbjct: 343 LEPGQQPQSYAETALDDVLSHEHLTDSLDGVDQWDAFVSGASAAASQRTHVAHNI----- 397

Query: 473 TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLP 532
                            N  + G  G  R  +  L+    + ES+ Y+  Q + Q+    
Sbjct: 398 -----------------NSELFGAAGALRVGEYKLIVEARVTESEIYEYGQHMLQDDNWD 440

Query: 533 ISNIDK------MRSTRQQATIHCGANPAPMTPSPCTNG-PCY-LFNLGNDPCEQNNIAS 584
            S + +      +R+    +     ANP  +    C +   C  L++L       +  A 
Sbjct: 441 QSELSQVIHQKLLRTPGTTSLFDVVANPNELDEEDCDDARSCRNLYDL-------DEFAD 493

Query: 585 SRPDISSQLYELLKYHRRTLVPQSHE--QPDLVQADPKRFNDTWSPW 629
            +  I  +  EL     R   P+S E    D   ADP  F   W+PW
Sbjct: 494 VKATILEKWAEL-----RDSTPESTEIWLDDGPLADPALFGGVWTPW 535


>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 458

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 152/500 (30%), Positives = 234/500 (46%), Gaps = 67/500 (13%)

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES-HF 203
           GMQ   I+  EPRG+ L  + LP++L + GY+T  IGK  +    + Y+  Y   +  H 
Sbjct: 1   GMQHSIIFNEEPRGLSLRFKLLPQWLADRGYTTHMIGKVAMHLSLQHYS--YEQIKHLHC 58

Query: 204 GYLNGVISYYDHILSDQYSRTVEL---------NGHDMRRNLSTAWDTVGEYATDLFTKE 254
            +LN         L++ + R   L          G D RR+LS + +  G Y T L T+E
Sbjct: 59  RHLNSK-------LAEVFFRPQTLVLLLQPGGDTGLDFRRDLSISPNESGTYITQLLTRE 111

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN-QFQYITDPNRRTYAAMVKK 313
           A  +I + P+DK +  YLA     +  A   +    ++I   F     P       MV  
Sbjct: 112 AEAVIANHPIDK-VCHYLASDRETSVFALDTVVICTKSIAISFHKCLLP------GMVSA 164

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD+SVG+V  +L  +GML+N++ +F SDNGA T      S   N  S++P++G K T WE
Sbjct: 165 LDESVGSVFESLNHRGMLDNTVFVFSSDNGADT-----DSANANTASSWPFKGQKYTPWE 219

Query: 374 GGVKVPAILWSPQIQQN-PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           GGV+ PA++WSP        V   M HISDWLPTLY  AGG    +              
Sbjct: 220 GGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAGGLAEGMA------------- 266

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
                    +IDGL    SL   T   R+ +LINID  +  +A+    +KLV G      
Sbjct: 267 --------DDIDGLSHLHSLTGGTEPPRHELLINIDPVENYSAIIEGHFKLVKGRASGSA 318

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQS--LQQLSQNIFL-PISNIDKMRSTRQQATIH 549
            D +Y    +     + +++ +  K  +S  + ++ +NI L P+    +   TR    + 
Sbjct: 319 FDKWYSIPGN-----VTWDSDMPRKQCESSVVVRVLRNIGLSPVCGAGR---TRYAVPVD 370

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
           CG    P   S        LF+L  DPCE N+++   P++  +L + L  ++ T +  ++
Sbjct: 371 CGQR-DPREKSCAPERSACLFDLSKDPCEYNDVSGQHPEVVERLLKKLAGYKATALKPAN 429

Query: 610 EQPDLVQADPKRFNDTWSPW 629
             PD  +++P   N+ W  W
Sbjct: 430 -LPDDPRSNPGLHNNVWVAW 448


>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 471

 Score =  184 bits (468), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 115/310 (37%), Positives = 165/310 (53%), Gaps = 35/310 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW+D+ F+G  EIPTP++DALA +G+  +  YA  P C+PSRA L+TG+Y    G +  P
Sbjct: 39  GWSDVGFNGCKEIPTPHLDALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNP 98

Query: 151 IWGA-------EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             GA       E  G+PL+E  L    R  GY T AIGKWHLG    ++ P  RGFE  F
Sbjct: 99  --GAHGNDDAIEMEGLPLSETLLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWF 155

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           G+  G +SY+  +        V  NG  + ++  T       Y TD F+ EAV+ +E+  
Sbjct: 156 GFSGGGLSYWGDLGKKPPLHGVLRNGDVVPKDELT-------YLTDDFSTEAVKFVEENR 208

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
             +P FLYLA+ A HA +     +  +E + +  +I    R  Y AMV  +D+ +G V+ 
Sbjct: 209 A-RPFFLYLAYNAPHAPD-----QVTREHLKKTSHIEYGGRAVYGAMVAAMDEGIGRVVQ 262

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
           AL  + + + ++I F SDNG      R          N+PYRG K  L+EGG++VP  + 
Sbjct: 263 ALDDQNLRDETVIFFYSDNGGRATLAR----------NFPYRGHKGMLFEGGIRVPFCVS 312

Query: 383 WSPQIQQNPR 392
           W  QI+   R
Sbjct: 313 WPEQIEGGRR 322


>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
          Length = 481

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 120/404 (29%), Positives = 202/404 (50%), Gaps = 38/404 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G++DL +  ++++ +PNID LA N + + N Y QP CTPSRA+ MTG+Y I  GMQ   I
Sbjct: 35  GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP  +PL+E  LP+ L++ GY+T   GKWHLGF+  ++ P  RGF+  FG+  G   
Sbjct: 94  KPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H   +            +R +L+      G Y+T    ++ +  +++   + PLF +
Sbjct: 154 YFYHDSGNNCYLYEPCYREKVRLDLN------GTYSTKAIAEDFIAKLDEYDPETPLFEF 207

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----RRTYAAMVKKLDDSVGTVISALQ 326
           L+    H          P + ++++  +         R+  +  +  +D  +G V+  L+
Sbjct: 208 LSFQEVH---------GPLQWLSEYSTLFKEKDWTHYRKMLSRKIVIVDHFIGEVVKKLK 258

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
            KG  +++I++  SDNG  T         R   SN+P RG K  ++EGG++  A + SP+
Sbjct: 259 EKGFWKDTILVITSDNGGQT---------REGASNWPLRGRKGDVFEGGIRSRAFIHSPK 309

Query: 387 IQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS-RRNSNI 443
           +  + + S    + H++DW PTL   +G    +   N+DG  Q   +    P  +R   +
Sbjct: 310 LPNSLKGSSFPHVFHVTDWFPTLLRFSG--CQQPDSNLDGEAQ--DIFHQKPEVKRTHLL 365

Query: 444 DGLDQWSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLG 486
           + LD +         R  SVL N   E    + +R ++WKL+ G
Sbjct: 366 NFLDPFKVAKRKLDDRMFSVLQNRTFEVTVKSVIRTENWKLITG 409


>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
 gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
          Length = 472

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 125/367 (34%), Positives = 187/367 (50%), Gaps = 38/367 (10%)

Query: 81  KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
           K   L L++    G+ DL F GS +I TP++D LA NG+     Y +  VC+PSRA  +T
Sbjct: 33  KHYNLVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFIT 92

Query: 138 GKYPIHTGMQGPPIWGAEP------RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           G   +  G     + G EP       G+PL+++ + ++L +LGY    IGKWHLG    +
Sbjct: 93  GINQVEFGHDNN-LAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLGK-EPQ 150

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--EYATD 249
           + PL RGF+  +GY  G   Y++ +           NG   +  L + + T     Y TD
Sbjct: 151 FHPLKRGFDEFWGYTGGGHDYFESLP----------NGKGYKEPLESNFKTPDPITYITD 200

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
               E+V  IE    D+P FL+ A  A H       ++A +E +  +Q+I D  RRTYAA
Sbjct: 201 DVGNESVDFIERHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYAA 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV +LD +VG ++++L+ +G+ EN++++F SDNG PT          N   N PYRG K 
Sbjct: 255 MVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPTDS--------NASLNAPYRGQKG 306

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
            L EGG+ VP ++  P +     +  + +   D +PT + A  GDT        G+D   
Sbjct: 307 ILLEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPT-FLALAGDTETSMDMFSGVDLIP 365

Query: 430 SLLLNTP 436
            L   TP
Sbjct: 366 HLTGKTP 372



 Score = 42.0 bits (97), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 6/73 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD    E+V  IE    D+P FL+ A  A H       ++A +E +  +Q+I D  RR
Sbjct: 197 YITDDVGNESVDFIERHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRR 250

Query: 75  TYAALTKSTTLTL 87
           TYAA+     L +
Sbjct: 251 TYAAMVHRLDLNV 263


>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
 gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           WH47]
          Length = 826

 Score =  184 bits (467), Expect = 1e-43,   Method: Compositional matrix adjust.
 Identities = 124/341 (36%), Positives = 181/341 (53%), Gaps = 35/341 (10%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
           +LIV    G++D+ F+G  EIPTP++D LA +G++  N YA  P C+PSRA L+TG++  
Sbjct: 48  VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107

Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
             G +  P     W G +  G+PLTE  L + L+E GY T AIGKWHLG   + + P  R
Sbjct: 108 RFGHESNPEPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRR 166

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           GF+  FG+  G  SY+     D   +   L  H     +     T   + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGKKDPLLGVHRGDEPVDPKTLT---HLTDDFSTEAVK 219

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
            I+    + P FLYLA+ A HA +      A +  + +  +I    R  Y AMV  +D+ 
Sbjct: 220 FIQRHETE-PFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           +G V+  ++  G+ EN++IIF SDNG            R    N+PYRG K  L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323

Query: 378 VPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           VP ++ W   ++   +  L +  + D  PT   AAG D SR
Sbjct: 324 VPFLVSWPGTVRSGMKEELPITAL-DLFPTALAAAGMDPSR 363


>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 471

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 121/347 (34%), Positives = 178/347 (51%), Gaps = 37/347 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS E+ TPN+D LA +G+     Y     C PSRA L+TGKY    G +   
Sbjct: 33  GFADFGFQGSTEMKTPNLDKLANSGVKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 92

Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           + G         A+  G+PL +  + +YL++LGY+T   GKWHLG   R + P+ RGF+ 
Sbjct: 93  VPGYMSENSKFLADDMGLPLDQLTIGDYLKKLGYNTAMYGKWHLGNADR-FHPMNRGFDE 151

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            +G+  G  SY+ + ++       +     M R     ++   EY TD    EA+  IE 
Sbjct: 152 FYGFRGGARSYFGYDVASSAHHDTK-----MERGFGN-FEEPQEYVTDALADEAISFIEK 205

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              + P F+YLA  A H       +EA +E +NQF  +T   R+  AAM   LD + G V
Sbjct: 206 NKKN-PFFIYLAFNAVHT-----PMEATKEDLNQFPNLTG-KRKELAAMTLALDRACGRV 258

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +  L++ G+ +N+II+F +DNG PT         +N   N P  G K+   EGG++VP +
Sbjct: 259 LDKLKQLGLDKNTIIVFSNDNGGPTD--------KNASLNLPLSGTKSNHLEGGIRVPFL 310

Query: 382 L-WSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           + W   IQ  P+ +    +   D LPT Y A GG+T  L   IDG++
Sbjct: 311 MSWPKHIQ--PKTTYTFPISTMDLLPTFYAAGGGNTKDLQ-EIDGIN 354



 Score = 41.2 bits (95), Expect = 1.9,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 35/67 (52%), Gaps = 7/67 (10%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY TD    EA+  IE    + P F+YLA  A H       +EA +E +NQF  +T   R
Sbjct: 189 EYVTDALADEAISFIEKNKKN-PFFIYLAFNAVHT-----PMEATKEDLNQFPNLTG-KR 241

Query: 74  RTYAALT 80
           +  AA+T
Sbjct: 242 KELAAMT 248


>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 234

 Score =  183 bits (465), Expect = 2e-43,   Method: Compositional matrix adjust.
 Identities = 95/234 (40%), Positives = 139/234 (59%), Gaps = 14/234 (5%)

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAW 240
           +WHLG++  +Y P+ RGF++  G+ +  + YY    + D ++      GHD        W
Sbjct: 12  QWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNFTFDNHT------GHDFWNGDKIYW 65

Query: 241 --DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
                 +Y+T  +T++ VQLI+      PLFL L+H A H       L+ P++ I  F Y
Sbjct: 66  VDKEKKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSY 125

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           I + NR  +A MV  LD SVG+V+ ALQ+  ML ++I++F +DNGA    ++      N 
Sbjct: 126 IQEENRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKS-----NR 180

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           GSN+P RG K TL+EGGV+  A +WSP +  + RVS Q+MHI+DWLPTLY+AAG
Sbjct: 181 GSNWPLRGTKFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234



 Score = 52.4 bits (124), Expect = 7e-04,   Method: Compositional matrix adjust.
 Identities = 24/69 (34%), Positives = 38/69 (55%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y+T  +T++ VQLI+      PLFL L+H A H       L+ P++ I  F YI + NR
Sbjct: 72  QYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQEENR 131

Query: 74  RTYAALTKS 82
             +A +  +
Sbjct: 132 TLFAGMVDA 140


>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
          Length = 330

 Score =  183 bits (464), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 114/342 (33%), Positives = 167/342 (48%), Gaps = 28/342 (8%)

Query: 50  NAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNI 109
           NA +  E  +E    F +I   +     A             G+NDL     + I TPN+
Sbjct: 8   NAHRSHEGRKEKKPNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNM 66

Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
           DALA  G+IL N Y Q VC+PSR +LMTG+YP  + MQ   I   E +   L  +FLP+Y
Sbjct: 67  DALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQY 126

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           L++LGY    IGKWHLG+ R E  P  RGF++ +G       Y++H +   Y        
Sbjct: 127 LKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMY-------- 178

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
            D        ++  G +A DL  +   ++ ++     PLFLY A   +H       L+  
Sbjct: 179 -DWHTEAGVDFEARGTHAQDLEIERLDKVFDEHDNKDPLFLYFAPQNSHT-----PLQPT 232

Query: 290 QETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
            E IN +     +D  RR Y  +   LD  VG ++  L    M+ N+ +IF+SDNGA  +
Sbjct: 233 DEFINLYPEDKFSDI-RRKYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI 291

Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           E          G N P+RG K +L+EGG K  + ++SP +++
Sbjct: 292 E----------GRNAPFRGGKGSLFEGGTKSSSFIYSPLLKK 323


>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 474

 Score =  182 bits (463), Expect = 3e-43,   Method: Compositional matrix adjust.
 Identities = 122/359 (33%), Positives = 188/359 (52%), Gaps = 40/359 (11%)

Query: 81  KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
           KS    +LI+     GW D+ F+G+ +IPTPN++ALA +G+I +  Y+  P C+PSRA L
Sbjct: 23  KSKQPNILIILSDDQGWGDVGFNGATDIPTPNLNALAKDGVIFSQGYSSHPYCSPSRAGL 82

Query: 136 MTGKYPIHTGMQGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           +TG+Y    G +  P    +      G+PL E  + E L++  Y T AIGKWHLG   + 
Sbjct: 83  LTGRYQQKFGHENNPENEKQNEDTVIGLPLNELMISEVLQQNNYHTCAIGKWHLGNAHK- 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
           + P  RGF+  FG+  G  +Y+           V  NG  +  N  T       Y TD F
Sbjct: 142 FLPNQRGFKDWFGFSGGGFNYWGKTTPKNKELGVMKNGKPVPENTLT-------YLTDDF 194

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           + +A+  I+     ++P F+YLA+ A HA      ++A +E  N   +I +  R  YAAM
Sbjct: 195 SNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEYTNLVTHIENGERAAYAAM 249

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V  +D  +G VI  L+  G+  N++I+F SDNG           + +  S+ PYRG K  
Sbjct: 250 VAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG----------HSHGASSAPYRGHKGM 299

Query: 371 LWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL--NIDGLD 426
           L+EGG++ P ++ W  +I+ N R   + +   D  PT+  A     +++P   NIDG++
Sbjct: 300 LFEGGIREPFVISWPNKIKGN-RTYKEPIIALDIFPTILAAT---NTQVPASKNIDGVN 354



 Score = 41.2 bits (95), Expect = 1.6,   Method: Compositional matrix adjust.
 Identities = 33/122 (27%), Positives = 54/122 (44%), Gaps = 23/122 (18%)

Query: 15  YATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           Y TD F+ +A+  I+     ++P F+YLA+ A HA      ++A +E  N   +I +  R
Sbjct: 189 YLTDDFSNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEYTNLVTHIENGER 243

Query: 74  RTYAALT--------------KSTTL---TLLIVYGWNDLSFHGSNEIPTPNIDALAYNG 116
             YAA+               K T +   TL++ Y  N    HG++  P      + + G
Sbjct: 244 AAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGGHSHGASSAPYRGHKGMLFEG 303

Query: 117 II 118
            I
Sbjct: 304 GI 305


>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 465

 Score =  182 bits (463), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 117/346 (33%), Positives = 181/346 (52%), Gaps = 38/346 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS  + TPN+D LA +G+     Y + P C PSRA +MTGKY    G +   
Sbjct: 34  GYMDFGFQGSKVMKTPNLDKLAKSGVTFTQGYVSDPTCGPSRAGMMTGKYQARFGYEEIN 93

Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           + G          +  G+PL ++ +  YL++LGY T   GKWHLG   R + PL RGF+ 
Sbjct: 94  VPGYMSSHSALKGDEMGLPLDQKLMSNYLKDLGYKTAVYGKWHLGNADR-FHPLNRGFDE 152

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            +G+  G  SY+ +  +    R +E N           ++    YATD+F ++AV  IE 
Sbjct: 153 FYGFRGGARSYFAYK-NPTGDRKMETN--------FGQYEEPNHYATDVFAEKAVHFIER 203

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              + P F+YL+  A H       +EA +  + QF  +T   R+  AAM   LD + G V
Sbjct: 204 NK-EHPFFIYLSFNAVHT-----PMEATEADLAQFPNLTG-KRQQLAAMTLALDRACGKV 256

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           ++ L+  G+  N+I++F +DNG P+         +N  +N P  G K+   EGG++VP +
Sbjct: 257 LNKLKELGIDRNTIVVFSNDNGGPSD--------KNASNNKPLSGTKSNHLEGGIRVPFL 308

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           + W  +I++  R +  ++   D LPT Y AAGGD   L  +I+G++
Sbjct: 309 MSWPKEIKKKSRFNYPVVTF-DLLPTFYAAAGGDLKDLK-DINGVN 352


>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
 gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
          Length = 510

 Score =  182 bits (463), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 112/318 (35%), Positives = 173/318 (54%), Gaps = 28/318 (8%)

Query: 73  RRTYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
           ++T+  + K     ++I+    G+ DL F GS EI TPNIDALA NG++  N Y   P C
Sbjct: 46  KKTHKLVAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYC 105

Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            PSRA L+TG+Y    GM+       +    G+P+ E    + +++ GY T  +GKWH+G
Sbjct: 106 GPSRAGLLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMG 165

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTV 243
                + P  RGF+  FG+L G   Y+     + S +YS  +  NG   + N        
Sbjct: 166 S-HPNFHPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALSRNGKPAQLN-------- 216

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
            EY T   +KEA + +     ++P  +Y+A+ A H+      L+A ++ + ++Q+I D +
Sbjct: 217 -EYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHSP-----LQATEQDLAKYQHIADLD 268

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           RRTYAAM+  +D  VG +++AL++   L N++I F+SDNG   V   E     +W +N P
Sbjct: 269 RRTYAAMIDSMDQGVGRIVAALKQSEKLHNTLIFFLSDNGG--VYPEEWMPDSDWANNAP 326

Query: 364 YRGVKNTLWEGGVKVPAI 381
           +R  K +L EGGV VP I
Sbjct: 327 FRRGKVSLTEGGVHVPFI 344



 Score = 43.5 bits (101), Expect = 0.38,   Method: Compositional matrix adjust.
 Identities = 23/71 (32%), Positives = 42/71 (59%), Gaps = 7/71 (9%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
           + EY T   +KEA + +     ++P  +Y+A+ A H+      L+A ++ + ++Q+I D 
Sbjct: 215 LNEYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHSP-----LQATEQDLAKYQHIADL 267

Query: 72  NRRTYAALTKS 82
           +RRTYAA+  S
Sbjct: 268 DRRTYAAMIDS 278


>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 826

 Score =  182 bits (462), Expect = 4e-43,   Method: Compositional matrix adjust.
 Identities = 124/352 (35%), Positives = 181/352 (51%), Gaps = 33/352 (9%)

Query: 75  TYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
           T  A  K     +LIV    G++D+ F+G  EIPTP++D LA +G++  N YA  P C+P
Sbjct: 36  TSVAGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSP 95

Query: 131 SRASLMTGKYPIHTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           SRA L+TG++    G +  P     W G +  G+PL+E  L + L+E GY T AIGKWHL
Sbjct: 96  SRAGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHL 155

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G   + + P +RGF+  FG+  G  SY+     D   +   L  H   R           
Sbjct: 156 GD-AKPFWPNHRGFDEWFGFSGGGFSYW----GDLGKKDPLLGVH---RGDEPVEPKTLT 207

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
           + TD F+ EAV+ I+    + P FLYLA+ A HA +      A +  + +  +I    R 
Sbjct: 208 HLTDDFSTEAVKFIQRNETE-PFFLYLAYNAPHAPD-----HATRAHLQKTTHIEYGGRA 261

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y AMV  +D+ +G V+  ++  G+ EN++IIF SDNG            R    N+PYR
Sbjct: 262 VYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYR 311

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           G K  L+EGG++VP ++  P   ++       +   D  PT   AAG D S+
Sbjct: 312 GHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363


>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 470

 Score =  182 bits (462), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 117/341 (34%), Positives = 181/341 (53%), Gaps = 30/341 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTG----- 145
           G+ DLSF GS ++ TP+ID LA +G+     Y +  VC+PSRA L+TG+  +  G     
Sbjct: 46  GYGDLSFTGSTQVKTPHIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNNL 105

Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
               P +     G+P+  + + ++L++LGY T  +GKWHLG+   +++PL RGF+  +GY
Sbjct: 106 ANSQPGFDPAFLGLPVNVKTVGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGY 164

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
           L G   Y++   + +  R  +     ++ N  T  +    Y TD    E +  I+    D
Sbjct: 165 LGGGHDYFE---ASEAKRGYKAK---IKCNYKTPQEIT--YITDDKGDECINFIQRHK-D 215

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +P FLY +  A H       ++A  E +  +Q+I D  RRTYAAMV +LD +VG ++ A+
Sbjct: 216 EPFFLYASFNAPHT-----PMQATAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAV 270

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           +   +LEN+I +F+SDNG P +         N   N PY G K TL EGG++VP  +  P
Sbjct: 271 EEADLLENTIFVFISDNGGPVM--------TNGSINAPYNGKKGTLLEGGIRVPFFISWP 322

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
              +  +V    +   D  PT    AGG+ S+     DGL+
Sbjct: 323 GHLEPGKVFEHPVSSLDLTPTFVALAGGE-SKEEDQFDGLN 362


>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 480

 Score =  182 bits (462), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 73/427 (17%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           +LIV    G+ D+ F+GS +I TPNID LA +G   ++ Y A P C PSRA+LMTG+YP 
Sbjct: 33  ILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMTGRYPH 92

Query: 143 HTGMQ-GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
             G Q   P  G+   GVP   +F+ + L E  Y T A+GKWH+G    +Y P  RGF+ 
Sbjct: 93  KIGSQFNLPTRGSN-VGVPTDAKFISKLLNENNYFTGALGKWHMGD-APQYHPNKRGFDE 150

Query: 202 HFGYLNGVISYYDHILSD--------------QYSRTVELNGHDMRRNLSTAWDTVGEYA 247
           ++G+L G  +Y+                    +Y   +E NG +++           +Y 
Sbjct: 151 YYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKET---------QYI 201

Query: 248 TDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
           TD  ++EAV  + D+ V+K  P FLYLA+ A H       L+A  E +  F  I + +R+
Sbjct: 202 TDALSREAVNFV-DKAVNKKNPFFLYLAYNAPHVP-----LQAKDEDMAMFPNIKNKDRK 255

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
           TYA MV  +D  VG ++  L++    +N++I+FMSDNG    +           +N+P +
Sbjct: 256 TYAGMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNGGKLSK---------GANNFPLK 306

Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
             K +  EGG +VP +  W   +    R S  +  + D  PT    AG            
Sbjct: 307 AGKGSTQEGGFRVPMLFHWPKHVPAGKRFSHPVSAL-DLYPTFAALAGAKV--------- 356

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--RTAAVRLDSWK 482
                          N ++DG + W + + N    ++  +  +  +K    AA+R++ WK
Sbjct: 357 -------------EENQHLDGTNMWPAFIKNENPHKDEPIYALRHRKGYSDAAIRMNQWK 403

Query: 483 LVLGTQE 489
            +   Q+
Sbjct: 404 ALKVNQQ 410



 Score = 43.1 bits (100), Expect = 0.41,   Method: Compositional matrix adjust.
 Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 8/69 (11%)

Query: 14  EYATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
           +Y TD  ++EAV  + D+ V+K  P FLYLA+ A H       L+A  E +  F  I + 
Sbjct: 199 QYITDALSREAVNFV-DKAVNKKNPFFLYLAYNAPHVP-----LQAKDEDMAMFPNIKNK 252

Query: 72  NRRTYAALT 80
           +R+TYA + 
Sbjct: 253 DRKTYAGMV 261


>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 479

 Score =  182 bits (462), Expect = 5e-43,   Method: Compositional matrix adjust.
 Identities = 135/367 (36%), Positives = 191/367 (52%), Gaps = 43/367 (11%)

Query: 80  TKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
           +KS    +L++     G+ DL   G  EIPTP++D LA +GI   N Y + P C+PSRA 
Sbjct: 33  SKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAG 92

Query: 135 LMTGKYPIHTGMQGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            +TGKY    G +  P  G E + G+PL E  +   L+  GY T  IGKWH GF  +++ 
Sbjct: 93  FLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGF-SKDHH 151

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           P  RGF+  FG+L G  +Y  H   +  +R    + HDM           G YATDLFT 
Sbjct: 152 PQSRGFDEFFGFLVGGHNYLLH--KEVKARFGTAHSHDMIYRGREVEPQEG-YATDLFTN 208

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE-APQETINQFQYITDPNRRTYAAMVK 312
           EA++ +   P +KP FLYL++ A H       LE AP       + +  P RR Y +++ 
Sbjct: 209 EALRWMSGPP-NKPWFLYLSYNAVHT-----PLEIAPHLQKRIPESVKLPARRGYLSLLA 262

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNG----APTVEYRETSNYRNWGSNYPYRGVK 368
            LDDS+G +   L + G+ E ++IIF+SDNG    AP + Y       N G N+P RG K
Sbjct: 263 GLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAY-------NSGLNHPLRGDK 315

Query: 369 NTLWEGGVKVPAIL-WSPQ-----IQQNPRVSLQMMHISDWLPTLYTAAGGDTSR---LP 419
               EGG++VP  + W  Q     I + P +SL      D LPT+   A  + ++   LP
Sbjct: 316 GQTLEGGIRVPFFVSWPGQLPARTIYEQPIISL------DLLPTVCQLAANNPAKPQPLP 369

Query: 420 LNIDGLD 426
             IDG++
Sbjct: 370 QGIDGVN 376


>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
          Length = 697

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 126/402 (31%), Positives = 192/402 (47%), Gaps = 47/402 (11%)

Query: 101 SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--PPIWGAEPRG 158
           ++E  TP+ID LA NG +L   Y    C+P+R+SL++G+ PIH   +   PP+ G    G
Sbjct: 94  NDEPLTPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGRLPIHVNQENRPPPVPGG---G 150

Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS 218
           +      L E L+  GY+T  +GKWH G    +  P++RGF S FGYL+G   +Y  I  
Sbjct: 151 IDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPVHRGFNSSFGYLSGAEHHYSQIRQ 210

Query: 219 DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
                       D+ R+   A+   G Y T ++ +EA ++I       PLFLYLA    H
Sbjct: 211 GYV---------DLWRDTQPAYGENGTYGTFMYAREAERIIHAHDPTTPLFLYLAFQNVH 261

Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
           +      LE P   ++++  +    R+   AMV  +D+++  V  AL+R G+ ++++IIF
Sbjct: 262 S-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEAINNVTLALRRAGLFDDTLIIF 316

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS--LQ 396
            SDNG P           +  +NYP+RG K   +EGGV+  A +    + +N   +    
Sbjct: 317 SSDNGGP----------HDHANNYPFRGAKTADFEGGVRAVAFVSGGVVPKNMVGTNVHG 366

Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL-LN 455
           +MH+ DW  TL   AG D +             +   N P      IDG D W  L   N
Sbjct: 367 LMHVCDWYSTLARLAGEDPT----------DHRAARSNLPP-----IDGFDMWPMLTGAN 411

Query: 456 TPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
           + S R  V ++       +A+    +KLV G Q  G   G +
Sbjct: 412 STSPRVDVPLSGGIPVAASALISGQYKLVRGKQPYGFYPGVH 453


>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
 gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
          Length = 470

 Score =  182 bits (461), Expect = 6e-43,   Method: Compositional matrix adjust.
 Identities = 112/330 (33%), Positives = 177/330 (53%), Gaps = 28/330 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP- 149
           G+ D   +G  EIPTPNIDALA NG +  + Y +  VC PSRA ++TG Y    G +   
Sbjct: 38  GYADFGCYGGKEIPTPNIDALAKNGTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNI 97

Query: 150 ---PI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
              P+  +     G+    + + + ++  GY T AIGKWH G    +Y PL RGF+  +G
Sbjct: 98  SELPVKPYTLNDVGMDPKIKTIGDQMKHNGYRTIAIGKWHQGDLP-QYFPLKRGFDEFYG 156

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           ++ G  S++ +      S  + L  +D         +T+G Y TD+FT +A+  +++   
Sbjct: 157 FVGGHRSFFGYPGGKAPSHELALFDNDK----IVPENTIG-YLTDMFTDKAISFVKENK- 210

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
            KP F+YLA+ A H       + A +E +++F  ITDP R+ YAAM+  LDD VG ++  
Sbjct: 211 SKPFFMYLAYNAVHV-----PMNAKKELMDRFPNITDPGRKAYAAMMVSLDDGVGELVKT 265

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L+ + + +N++I+F++DNGA T          N+  N   RG+K + WEGG++V  I+  
Sbjct: 266 LKEQNLYDNTLIVFVNDNGAAT---------GNYADNGKLRGLKGSKWEGGIRVAYIMQY 316

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
           P      +V  +M+   D +PT   A  G+
Sbjct: 317 PDQIPAGKVYSKMVSGLDIMPTAVGAGKGE 346



 Score = 53.1 bits (126), Expect = 4e-04,   Method: Compositional matrix adjust.
 Identities = 28/73 (38%), Positives = 45/73 (61%), Gaps = 7/73 (9%)

Query: 10  DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
           +T+G Y TD+FT +A+  +++    KP F+YLA+ A H       + A +E +++F  IT
Sbjct: 189 NTIG-YLTDMFTDKAISFVKENK-SKPFFMYLAYNAVHV-----PMNAKKELMDRFPNIT 241

Query: 70  DPNRRTYAALTKS 82
           DP R+ YAA+  S
Sbjct: 242 DPGRKAYAAMMVS 254


>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 467

 Score =  181 bits (458), Expect = 1e-42,   Method: Compositional matrix adjust.
 Identities = 140/447 (31%), Positives = 206/447 (46%), Gaps = 64/447 (14%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           A L  +    L    GW D+ FH  N +PTPN+D LA  G+ L   Y  PVC+P+R + +
Sbjct: 37  APLRPNFIFILADDLGWGDVGFHHGN-VPTPNLDHLAGEGLELMQHYVYPVCSPTRCAFL 95

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           +G+Y     +  P      PR        L   L+ +GY T   GKWHLG  + E+ P  
Sbjct: 96  SGRYASRFSVTTP----QNPRAFRWDTVTLARALKSVGYDTALCGKWHLGS-KPEWGPQK 150

Query: 197 RGFESHFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
            GF+  +G L G +  +DH     ++++T   +G  +             + TDL TKEA
Sbjct: 151 FGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQ---------GHVTDLITKEA 201

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKL 314
           V+ +E +  DKP FLY+   A H       +  P E + +    IT P+ R Y A V  L
Sbjct: 202 VEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSLRHYGANVMHL 255

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRE----TSNY---RNWGSNYPYRG 366
           DDSVG ++ AL++ G   N+++IF SDNGA P VE  +      +Y      GSN P  G
Sbjct: 256 DDSVGKILVALEKTGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLHG 315

Query: 367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           +K  ++EGG+   A+  W  Q++  P   L + HI+DW+PT    AG             
Sbjct: 316 MKGEVYEGGIHTAAVARWPGQLK--PGKFLGLAHITDWMPTFCALAG------------- 360

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
                        ++   DG + W  L    P +  ++ +      R+ A+R   WKLVL
Sbjct: 361 ---------YKPEKDLKWDGQNIWPQLTGAEPVKPRTIYV-AGPGFRSKALRDGDWKLVL 410

Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNA 512
            +Q  G+      +    KV L N  A
Sbjct: 411 -SQTKGS------KNSPPKVELFNIGA 430



 Score = 39.7 bits (91), Expect = 5.2,   Method: Compositional matrix adjust.
 Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 7/65 (10%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNR 73
           + TDL TKEAV+ +E +  DKP FLY+   A H       +  P E + +    IT P+ 
Sbjct: 192 HVTDLITKEAVEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSL 245

Query: 74  RTYAA 78
           R Y A
Sbjct: 246 RHYGA 250


>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
 gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
          Length = 539

 Score =  181 bits (458), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 123/396 (31%), Positives = 184/396 (46%), Gaps = 61/396 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ FHG+  I TP++D +A  G  LN  Y  P+C+P+RA+LMTG+ PI  G+    I
Sbjct: 37  GWADVGFHGNQIIETPSLDRIAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTI 96

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+   E FLPE     GY T  +GKWHLG  ++ Y P  RGFE  +G+L+  + 
Sbjct: 97  MPWHNNGIHPEETFLPELFAGAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVG 156

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           ++    S        L G D +RN   + D  G Y + L   E  + I ++   KP F+Y
Sbjct: 157 FFPPFAS--------LGGKDFQRN-GVSIDDQG-YESYLLADEVSRYIRERDAAKPFFIY 206

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----------------------RRTYA 308
           +  +A H       L+AP++   ++  + D                         R  YA
Sbjct: 207 MPFIAPHT-----PLDAPEDLKAKYADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYA 261

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           A+V  +D S+G V+  L  +G+ +N+I++F SDNG                 N P RG K
Sbjct: 262 AVVDGMDQSIGRVLDTLDAEGLTDNTIVLFFSDNGGAVYAI-------GGADNAPLRGGK 314

Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR---------L 418
              +EGG++V A + W  +I    +V   +M + D  PTL  AAG + +          L
Sbjct: 315 GDTFEGGIRVVATMRWPEKIAPGGKVD-SIMSVMDVFPTLLAAAGIEPATRHRLDGRNLL 373

Query: 419 PLNIDGLDQWSSLLL----NTPSRRNSNIDGL-DQW 449
           P  +DG D     LL     +P +   N+    D+W
Sbjct: 374 PAIVDGEDVAREDLLFFIAESPFKNTVNVTAFNDEW 409


>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 461

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 124/351 (35%), Positives = 182/351 (51%), Gaps = 55/351 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DLS  GS ++ TP+ID+L   G+   N YA   VC+PSRA+L+TG+YP   G+ G  
Sbjct: 47  GYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVPGVI 106

Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                  WG     +  +   LP YLR+ GY +  IGKWHLG       P  RGF+  +G
Sbjct: 107 RDEVQDSWGY----LASSATLLPTYLRKQGYHSANIGKWHLGL-ESPNLPNERGFQEFYG 161

Query: 205 YLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
            L G++  Y           V+L +G +  R+     D  G +ATD+FT  AV+ + D+ 
Sbjct: 162 LLEGMMDDY----------VVKLRHGQNFLRHNGQVIDPPG-HATDVFTDAAVRYLNDRK 210

Query: 264 VDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPNRRTYAAMVKKLDDSVG 319
             K P FLYLA+ A H       L+ P   +N+ Q      DP R    A+++ LDD++G
Sbjct: 211 AKKDPFFLYLAYTAPH-----DPLQPPVAYLNRVQKRQPGIDPQRAKLVALIEHLDDNIG 265

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG----SNYPYRGVKNTLWEGG 375
            V++ L+  G L N++++F SDNG              WG    +N  +RGVK   +EGG
Sbjct: 266 RVLATLRANGQLANTLVVFTSDNGG-------------WGPGKANNGQFRGVKGQFYEGG 312

Query: 376 VKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           +++P   +WS  I+     S  ++ + DW PTL  AAG   + +P  I+GL
Sbjct: 313 IRIPTGAMWSGHIEAGSE-SGALLQLMDWFPTLLQAAG---ASVPDGINGL 359


>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 490

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 118/358 (32%), Positives = 181/358 (50%), Gaps = 35/358 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL F+GS  + TP +D LA+NG+I  N Y   P C PSRA L+TG+Y    GM+   
Sbjct: 34  GYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGLITGRYQARFGMEINL 93

Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            +      +G+PL E+   E +R  GY T  IGKWHLG     + P  RGF+  +G+L+G
Sbjct: 94  TYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLGA-SEPFHPNSRGFDYFYGFLSG 152

Query: 209 VISYYDHILSDQYSRTVELNG-----------HDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
              Y+  ++ + + + +  NG             + RN + A     EY T   +K+A +
Sbjct: 153 GHDYFPEMV-NTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAA--EFNEYLTTALSKDAAR 209

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAMVKKLDD 316
            + D   +KP  LYLA+ A H       L AP+E + ++  +    RR TY AM+  +D 
Sbjct: 210 FVSDS--EKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKRRATYLAMIDSMDQ 262

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
            +G ++ AL++ G L+N++I FMSDNG   V+ +    Y  W  N P+R  K ++ EGG 
Sbjct: 263 GIGLIVEALEQSGKLDNTLIFFMSDNGG--VQSKPGHEYETWADNGPFRNGKGSMREGGS 320

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-------IDGLDQ 427
            VP I   P+           +   D   T    +GGD +  PL+       ++G+D+
Sbjct: 321 HVPFIAHWPKGFPQGITYPHPVSSLDLTATAVELSGGDATGKPLDGVNLTPFVNGIDK 378



 Score = 38.9 bits (89), Expect = 8.9,   Method: Compositional matrix adjust.
 Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY T   +K+A + + D   +KP  LYLA+ A H       L AP+E + ++  +    R
Sbjct: 197 EYLTTALSKDAARFVSDS--EKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKR 249

Query: 74  R-TYAALTKSTTLTLLIV 90
           R TY A+  S    + ++
Sbjct: 250 RATYLAMIDSMDQGIGLI 267


>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 826

 Score =  180 bits (457), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 120/340 (35%), Positives = 179/340 (52%), Gaps = 33/340 (9%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
           +LIV    G++D+ F+G  EIPTP++D LA +G++  N YA  P C+PSRA L+TG++  
Sbjct: 48  VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107

Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
             G +  P     W G +  G+PL+E  L + L+E GY T AIGKWHLG   + + P +R
Sbjct: 108 RFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHR 166

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           GF+  FG+  G  SY+     D   +   L  H     +     T   + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGKKDPLLGVHRGDEPVDPKTLT---HLTDDFSTEAVK 219

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
            I+     +P FLYLA+ A HA +      A +  + +  +I    R  Y AMV  +D+ 
Sbjct: 220 FIQRHE-SEPFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           +G V+  ++  G+ EN+++IF SDNG            R    N+PYRG K  L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMVIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           VP ++  P   ++       +   D  PT   AAG D S+
Sbjct: 324 VPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363


>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 491

 Score =  180 bits (456), Expect = 2e-42,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 191/364 (52%), Gaps = 34/364 (9%)

Query: 77  AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
           A L+      +LI+     G++DL + GS EI +P ID LA NG+I  N Y   P C PS
Sbjct: 18  AVLSADERPNILIILADDLGYSDLGYTGSTEIESPVIDKLANNGVIFANGYVTHPYCGPS 77

Query: 132 RASLMTGKYPIHTGMQGPPIWGA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           RA L+TG++    GM+    +    +  G+P+ E    + L+  GY T  IGKWHLG   
Sbjct: 78  RAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFAKRLQPAGYRTGIIGKWHLGA-A 136

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-----------MRRNLST 238
            ++ P  RGF+  +G+L+G   Y+   + + +   V  NG             + RN + 
Sbjct: 137 PQFHPNNRGFDYFYGFLSGGHDYFPESV-NTHLELVLPNGKPNYGANEGTLLPLLRNKNA 195

Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
           A     +Y T   +K+A + +     ++P  LYLA+ A H       L+AP+ETI ++ +
Sbjct: 196 A--EFDDYLTTALSKDAARFVTSS--EQPFCLYLAYNAPHT-----PLQAPKETIAKYSH 246

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           I DP RR YAAM+ ++D  +G V+ AL++ G L+N++I F+SDNG    +  ++  Y ++
Sbjct: 247 IKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFFLSDNGGCVPQAWDS--YSDF 304

Query: 359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
             N P++  K + +EGG  VP I  W  +++  P+    ++   D   T    AGGD S 
Sbjct: 305 ADNGPFKDGKTSFYEGGTHVPFIAHWPAKLK--PQTYDNIISSLDLAATAVALAGGDDSG 362

Query: 418 LPLN 421
            PL+
Sbjct: 363 KPLD 366



 Score = 47.0 bits (110), Expect = 0.030,   Method: Compositional matrix adjust.
 Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 7/77 (9%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y T   +K+A + +     ++P  LYLA+ A H       L+AP+ETI ++ +I DP R
Sbjct: 200 DYLTTALSKDAARFVTSS--EQPFCLYLAYNAPHT-----PLQAPKETIAKYSHIKDPKR 252

Query: 74  RTYAALTKSTTLTLLIV 90
           R YAA+       + +V
Sbjct: 253 RIYAAMIDEMDAGIGLV 269


>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
          Length = 176

 Score =  180 bits (456), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 81/126 (64%), Positives = 98/126 (77%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTPNIDALAY GIILN  Y  P+CTPSRASLM+GK+PI+ GMQ   I
Sbjct: 41  GWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSGKHPINVGMQHHVI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+ L ++ +PEY RE GY T+ +GKWHLGFFR+ YTP  RGF+SHFGY+   I 
Sbjct: 101 ESNEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTRRGFDSHFGYIGPYID 160

Query: 212 YYDHIL 217
           Y+DH L
Sbjct: 161 YWDHSL 166


>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 157/569 (27%), Positives = 252/569 (44%), Gaps = 110/569 (19%)

Query: 84  TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
            L L+  +GW ++ +H    + E+ TPNID L   G+ L+  YA   C+PSR+ LM+G+ 
Sbjct: 26  VLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAYKFCSPSRSCLMSGRL 85

Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           PIH               P+ G     +P     + E ++E GY+T  +GKW  G    +
Sbjct: 86  PIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIAEKMKEAGYATHQVGKWDAGMATPD 143

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRT--VEL--NGHDMRRNLSTAWDTVGEYA 247
           +TP  RGF++ FGY +    YY  ++  Q S +  V+L    H       T  D   +Y 
Sbjct: 144 HTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIVDLWDTDHPAHGINGTGPD---KYE 200

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
             LF +  + ++     + PLFLY A   AH       L+ P + +N+F +I D +R+ Y
Sbjct: 201 EGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDRKYY 255

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
            AMV  LDD VG +++AL++KGM +N + +  SDNG P         Y   G+ NYP +G
Sbjct: 256 HAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGPI--------YAGGGANNYPLKG 307

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
            K T W+GG++V A +    + +  R   +   +H++DW  T    AG         +D 
Sbjct: 308 GKLTDWQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAG---------VDP 358

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRTAAVRLDSWKL 483
            D+ ++     P      +D  + W  +   T  S R  +  +I+      A+    +K+
Sbjct: 359 TDEKAAKAKLPP------VDSYNMWPFISGETDTSPRTDIPASIN------ALISGDYKI 406

Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
           ++GT +     G+ G    N+                           P   I       
Sbjct: 407 LIGTVDQA---GWTGPQYPNQTN-------------------------PAGGI------- 431

Query: 544 QQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRT 603
            +AT  CG        S C      L+N+ +DP E++N+ASS P+    + E LK +  T
Sbjct: 432 -KATERCGE-------SGC------LYNIKDDPEERHNLASSMPEQLKTMQEKLKKYLAT 477

Query: 604 LV--PQSHEQPDLVQADPKRFNDTWSPWI 630
                + H  P   QA   ++   W P++
Sbjct: 478 YFNPDRGHVSPLACQAAVNKYGGFWGPFV 506



 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 5/67 (7%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y   LF +  + ++     + PLFLY A   AH       L+ P + +N+F +I D +R
Sbjct: 198 KYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDR 252

Query: 74  RTYAALT 80
           + Y A+ 
Sbjct: 253 KYYHAMV 259


>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 460

 Score =  179 bits (455), Expect = 3e-42,   Method: Compositional matrix adjust.
 Identities = 129/378 (34%), Positives = 191/378 (50%), Gaps = 60/378 (15%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
           +LI++    G ND+  +GS EIPTP+ID LA  G++    Y A  +CTPSR  ++TG+ P
Sbjct: 30  ILIIFTDDQGINDVGCYGS-EIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNP 88

Query: 142 IHTGMQGPPIWGA--------EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
             +  Q   + GA        + RG+   E  + + L++ GY T  +GKWHLG     + 
Sbjct: 89  TRSQDQ---LLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFL 145

Query: 194 PLYRGFESHFGYLNGVISYYDHI---LSDQY--SRTVELNGHDMRRNLSTAWDTVGEYAT 248
           P   GF+   G+  G I Y+      + D Y   R V  NG                YAT
Sbjct: 146 PTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSENG----------------YAT 189

Query: 249 DLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNA---GKH-----LEAPQETINQFQYI 299
           DL T+EA   ++DQ   DKP FL+L++ A H G     G       ++A  + + +   I
Sbjct: 190 DLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTI 249

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            D  RR +AAM   LDD +G V+S+L+  G+ +N+++IFM+D+G          +Y   G
Sbjct: 250 KDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGG---------DYVYGG 300

Query: 360 SNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
           +N P+RG K TL+EGG++VP I+ W  +I+     + ++    D  PT+   A  DT  L
Sbjct: 301 NNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETN-EVAWALDLFPTICHFANVDTDGL 359

Query: 419 PLNIDGLDQWSSLLLNTP 436
            L  DG D    L   TP
Sbjct: 360 TL--DGKDISGLLTRQTP 375



 Score = 46.6 bits (109), Expect = 0.041,   Method: Compositional matrix adjust.
 Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 9/77 (11%)

Query: 15  YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNA---GKH-----LEAPQETINQF 65
           YATDL T+EA   ++DQ   DKP FL+L++ A H G     G       ++A  + + + 
Sbjct: 187 YATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRV 246

Query: 66  QYITDPNRRTYAALTKS 82
             I D  RR +AA+T S
Sbjct: 247 GTIKDKVRREFAAMTVS 263


>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 472

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 146/465 (31%), Positives = 215/465 (46%), Gaps = 98/465 (21%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPNID LA   + L+  Y  P+C+P+RA ++TG YP   G  G  +
Sbjct: 40  GWNDVGFHGS-EIRTPNIDRLANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVV 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              +  G+P      PE+L +LGY  +A+ GKWHLG     + PL  G    +G+ NG I
Sbjct: 99  SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAI 158

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+         R  +L+ H   RN    +D+V E  Y+T+L     V  I+      P+
Sbjct: 159 DYFSR------ERFGQLDWH---RN----FDSVHEEGYSTELVGNAVVDFIDRNANAGPV 205

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
           + Y+A  A H+      L+A +  ++++ +  DPN                         
Sbjct: 206 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLAPNTDRKIAKREKALDYGKRGK 258

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               R+T+AAM   +D  +G ++ A+ R GM EN++++F SDNGA         + ++ G
Sbjct: 259 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 309

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN P RG K T WEGGV+V A++  P           +    D LPT+  AAGG     P
Sbjct: 310 SNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPTMVAAAGGPP---P 366

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
              DG++     LL   S + S               PS R ++L++ +       V  D
Sbjct: 367 EETDGIN-----LLPFLSGKAS---------------PSER-TILLDAE------TVVSD 399

Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQ 524
            WKL  G   + T D       +   PL   N  V S+  Q+LQ+
Sbjct: 400 RWKLKTGESFDLTND------PNETTPLEVANQNVRSRLEQALQR 438


>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
 gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
          Length = 260

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 100/257 (38%), Positives = 144/257 (56%), Gaps = 15/257 (5%)

Query: 74  RTYAALTKSTTLTLLIVYGWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
           RT A         L+   GW+D+ +H  S+ + TPNID LA  G+ L + Y+QP+CTPSR
Sbjct: 18  RTSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSR 77

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            +LMTGKYPIH GMQ   I    P G+P     +P+ LR LGY T  IGKWHLGFF  +Y
Sbjct: 78  GALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDY 137

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           TPL RGF+S  G+  G   ++ H          ++   D RR+   A +  G+++TD+FT
Sbjct: 138 TPLRRGFDSFLGFFAGEQDHWRH---------SKMGFLDFRRDEEPANEYGGQHSTDVFT 188

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           +EA+ +       +PLFL L++ A H       L+A    +N+   ++D +R+ Y  M+ 
Sbjct: 189 QEAINIAMRHNASQPLFLLLSYAAVHTP-----LQAHPNDVNKIGGVSDKDRQNYLGMMG 243

Query: 313 KLDDSVGTVISALQRKG 329
             D S+G +I   +R G
Sbjct: 244 AADWSIGRLIDVYKRNG 260



 Score = 43.9 bits (102), Expect = 0.26,   Method: Compositional matrix adjust.
 Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 5/87 (5%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
            RR+   A +  G+++TD+FT+EA+ +       +PLFL L++ A H       L+A   
Sbjct: 168 FRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHTP-----LQAHPN 222

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTL 87
            +N+   ++D +R+ Y  +  +   ++
Sbjct: 223 DVNKIGGVSDKDRQNYLGMMGAADWSI 249


>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
 gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
          Length = 545

 Score =  179 bits (454), Expect = 4e-42,   Method: Compositional matrix adjust.
 Identities = 118/363 (32%), Positives = 183/363 (50%), Gaps = 47/363 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ +HG  +I TP++D LA  G+ LN  Y  P+C+P+RA+LMTG+ PI  G+    I
Sbjct: 44  GWADVGYHGG-DIDTPSLDRLAQQGVRLNRFYTTPICSPTRAALMTGRDPIRLGVTYGVI 102

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +  +  GV   E F+PE  +  GY T  IGKWHLG  +  Y P  RGFE  +G+L+  + 
Sbjct: 103 FPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLHTEVG 162

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           +Y    S+Q        G D +RN   + D  G Y T L   E  + I ++  D+P  +Y
Sbjct: 163 FYPP-FSNQ-------GGKDFQRN-GVSIDDQG-YETYLLADEVSRYIRERDRDRPFLVY 212

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----------TDPNRRT------------YA 308
           +  +A H       L+AP E  ++++ I           TD  R              YA
Sbjct: 213 MPFIAPHT-----PLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYA 267

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           A+V  +D ++G V+  L ++G+ +N+I++F SDNG     Y          +N P RG K
Sbjct: 268 AVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGAAYSY-------GGANNAPLRGGK 320

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
              +EGG++V +++  P + +  ++  Q+M + D  PTL  AA          +DG   W
Sbjct: 321 GETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGN-NFALDGRSMW 379

Query: 429 SSL 431
           ++L
Sbjct: 380 TAL 382


>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 494

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 123/346 (35%), Positives = 170/346 (49%), Gaps = 33/346 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ-GP 149
           G+ D+ F GS EI TP +D LA  G+I NN Y     C PSRA L+TG+Y    G++   
Sbjct: 34  GYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGRYQARFGVEVNF 93

Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P    +P  G+P  E+     L++ GY T  IGKWHLG     Y P  RGF+  +G+L G
Sbjct: 94  PYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLGA-AYPYHPNNRGFDYFYGFLGG 152

Query: 209 VISYYDHILSDQYSRTVELNGHD------------MRRNLSTAWDTVGEYATDLFTKEAV 256
              Y     S     T+E NG              MR N++  +D   EY T   +++A 
Sbjct: 153 AHDYMPENTSTTVPLTLE-NGKVNHMANAGSYLPLMRNNVNAEFD---EYLTTALSRDAA 208

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
           + IE    + P  +YL++ A H       L+AP+  I ++ +I    RRTYAAM+  +D 
Sbjct: 209 RFIEK--TEGPFCVYLSYNAPHT-----PLQAPKALIEKYAHIESQKRRTYAAMIDSMDQ 261

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY--RNWGSNYPYRGVKNTLWEG 374
            +G V+ AL++ G LEN++I F SDNG P    R    Y   ++  N PYRG K  + EG
Sbjct: 262 GIGRVVDALEKSGKLENTLIFFFSDNGGP----RPKPGYEGEDFADNTPYRGGKGAMLEG 317

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           G  VP I + P+           +   D   T      GDTS  PL
Sbjct: 318 GTHVPFIAYWPKGIPAGSTYDYPVSALDVAATFVELGEGDTSGHPL 363



 Score = 44.7 bits (104), Expect = 0.16,   Method: Compositional matrix adjust.
 Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 10/82 (12%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           MR N++  +D   EY T   +++A + IE    + P  +YL++ A H       L+AP+ 
Sbjct: 187 MRNNVNAEFD---EYLTTALSRDAARFIEK--TEGPFCVYLSYNAPHT-----PLQAPKA 236

Query: 61  TINQFQYITDPNRRTYAALTKS 82
            I ++ +I    RRTYAA+  S
Sbjct: 237 LIEKYAHIESQKRRTYAAMIDS 258


>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 467

 Score =  177 bits (450), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 121/366 (33%), Positives = 196/366 (53%), Gaps = 41/366 (11%)

Query: 76  YAALTKSTTLTLLIVY-----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
           Y    ++TT   +IV      G+ D   +G+ EIPTPNI+ LA  G + +  Y +  VC 
Sbjct: 18  YTTQAQTTTKPNIIVILSDDAGYADFECYGNKEIPTPNINRLAKEGTLFSKAYVSASVCA 77

Query: 130 PSRASLMTGKYPIHTGMQGPPI------WGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
           PSRA L+TG+Y    G +  P       +  E  G+ L+E+ + + ++E GY T A+GKW
Sbjct: 78  PSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSEKTIGDRMKEEGYRTLAVGKW 137

Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
           HLG    ++ PL RGF+  +G+  G   ++    S +  R  +    D  + +     T 
Sbjct: 138 HLGN-DAKFFPLKRGFDEFYGFQEGHRDFF----SFKKKRAEKYALWDNDKIIPEEEIT- 191

Query: 244 GEYATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
             Y TD+FT +A++ I D+  DK  P F+YLA+ A H       L+A +  +++F  +  
Sbjct: 192 --YLTDMFTDKALKFI-DENADKKQPFFIYLAYNAVHT-----PLQAKKNDLDKFAQVGS 243

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
             R+TY AM+  +D ++G V+  L+ KG+ +N+++IF++DNG  T          N+  N
Sbjct: 244 EGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT---------NYSDN 294

Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
              RG+K ++WEGGV+V  I+ W+ +I  N  V  + +   D LPT   AAG   +R   
Sbjct: 295 GQLRGMKGSVWEGGVRVGYIMRWNGKIPANV-VYDKAVSSLDILPTSLAAAG--NTRKDK 351

Query: 421 NIDGLD 426
           ++DG++
Sbjct: 352 HLDGVN 357



 Score = 42.4 bits (98), Expect = 0.70,   Method: Compositional matrix adjust.
 Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 8/67 (11%)

Query: 15  YATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           Y TD+FT +A++ I D+  DK  P F+YLA+ A H       L+A +  +++F  +    
Sbjct: 192 YLTDMFTDKALKFI-DENADKKQPFFIYLAYNAVHT-----PLQAKKNDLDKFAQVGSEG 245

Query: 73  RRTYAAL 79
           R+TY A+
Sbjct: 246 RQTYDAM 252


>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
 gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
           SH 1]
          Length = 543

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 121/340 (35%), Positives = 176/340 (51%), Gaps = 33/340 (9%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
           +LIV    G++D+ F+G  EIPTP++D LA +G++  N YA  P C+PSRA L+TG++  
Sbjct: 48  VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107

Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
             G    P     W G +  G+PL+E  L + L+E GY T AIGKWHLG   + + P  R
Sbjct: 108 RFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD-AKPFWPNRR 166

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           GF+  FG+  G  SY+     D   +   L  H   R           + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGMKDPLLGVH---RGDEPVDPKTLTHLTDDFSTEAVK 219

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
            I+    + P FLYLA+ A HA +      A +  + +  +I    R  Y AMV  +D+ 
Sbjct: 220 FIQRHETE-PFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           +G V+  ++  G+ EN++IIF SDNG            R    N+PYRG K  L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           VP ++  P   ++       +   D  PT   AAG D S+
Sbjct: 324 VPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363


>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 470

 Score =  177 bits (449), Expect = 1e-41,   Method: Compositional matrix adjust.
 Identities = 136/412 (33%), Positives = 200/412 (48%), Gaps = 65/412 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW +    G+ +IPTP+ID++A NG+       A   C+PSRA L+TG+YP   G +   
Sbjct: 52  GWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNR 111

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I  A   G+ L E  L + L  LGY T  +GKWHLG    EY P  RGF+  FG L    
Sbjct: 112 I--ANVSGLDLQETTLADRLHGLGYKTACVGKWHLGD-GPEYRPTKRGFDEFFGTLANTP 168

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            ++     D  SR           N          Y TD + K +V+ I  Q    P FL
Sbjct: 169 FFHPTKFVD--SRVSNDVAEVSDENF---------YTTDEYAKRSVEWIGQQQ-QSPWFL 216

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YL   A HA      L+APQ+ +++F+ I DP R+ +AAM+  +DD++G V+  ++  G 
Sbjct: 217 YLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVLGKVRELGQ 271

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQ- 388
            EN+++ F+SDNG PT   + T++      N P RG K T +EGG +VP ++ W  ++  
Sbjct: 272 EENTLVFFISDNGGPT---QGTTS-----QNGPLRGFKMTTFEGGTRVPFLVQWKGKLPA 323

Query: 389 ----QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
                NP ++L      D LPT+ TAAG         ID            P+ +   +D
Sbjct: 324 GKTYDNPVINL------DVLPTVLTAAGS-------KID------------PAWKLDGVD 358

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
            +  ++S + N P    ++     E+    AVR   WKLV+    +G  + Y
Sbjct: 359 LVPYFTSSIANKP--HETLYWRFGEQ---WAVRQGDWKLVVARGGSGQPELY 405



 Score = 46.6 bits (109), Expect = 0.037,   Method: Compositional matrix adjust.
 Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 6/78 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD + K +V+ I  Q    P FLYL   A HA      L+APQ+ +++F+ I DP R+
Sbjct: 193 YTTDEYAKRSVEWIGQQQ-QSPWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRK 246

Query: 75  TYAALTKSTTLTLLIVYG 92
            +AA+  +    +  V G
Sbjct: 247 LFAAMMSAMDDAIGQVLG 264


>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 473

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 116/346 (33%), Positives = 177/346 (51%), Gaps = 31/346 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS ++ TPN+D LA +G+     Y +   C PSRA LMTGKY    G +   
Sbjct: 40  GFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMTGKYQQRFGYEEIN 99

Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           + G             G+PL ++ + +YL+E GY T   GKWHLG   R + PL RGF++
Sbjct: 100 VPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLGDADR-FHPLKRGFDT 158

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
             G+  G  SY+++   +  +         + R+    ++   EY TD+  KEA + IE 
Sbjct: 159 FLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGN-YEEPKEYLTDVLGKEAAKYIE- 216

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           Q  D+P F+YLA  A H       LE+  + + +F  +T   R+  AAM   LD + G V
Sbjct: 217 QNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG-KRKELAAMTLGLDRASGYV 270

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +  L+  G+ +N+I++F +DNG P+         +N  +N P  G K+   EGG++VP +
Sbjct: 271 LDKLKELGLDDNTIVVFSNDNGGPSD--------KNASNNAPLAGTKSNQLEGGIRVPFL 322

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           + W   I+        +  + D LPT Y+AA G    L  +IDG+D
Sbjct: 323 ISWPKHIKPGSTYDYPVSTL-DLLPTFYSAAKGKA--LGSDIDGVD 365


>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
 gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
           8797]
          Length = 472

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 127/368 (34%), Positives = 193/368 (52%), Gaps = 57/368 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ +L   G+ +IPTP+ID+LA +GI     Y   P C+PSRA L+TG+ P   G +  P
Sbjct: 37  GYGELGCQGNPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEFNP 96

Query: 151 IWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
           I GA       G+P  E+ + E L + GY+T  IGKWHLG    +Y P   GF+  FG++
Sbjct: 97  I-GARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGG-TADYHPFRHGFDEFFGFM 154

Query: 207 N-----------GVISYYDH--ILSDQYSRTVELN-------GHD---------MRRNLS 237
           +           GV +      +   Q  R +  N       G+D         + R   
Sbjct: 155 HEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDYDANNPIIRGGQ 214

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
              +T  EY TD FT+EAV  I ++  DKP FLYLA+ A H+   GK     ++ I  F 
Sbjct: 215 PVNET--EYLTDAFTREAVSFI-NRHQDKPFFLYLAYNAVHSPLQGK-----KKDIQHFT 266

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
            I D +R+ +AAM+  +D S+G ++  +Q+ G+ E ++I+F+SDNG PT   RE ++   
Sbjct: 267 QIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPT---RELTS--- 320

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
             SN P RG K +++EGG++VP ++ W+  +     + + +  + D  PT    AG   +
Sbjct: 321 --SNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSL-DIFPTSVALAG---A 374

Query: 417 RLPLNIDG 424
            LP N+DG
Sbjct: 375 SLPQNLDG 382



 Score = 51.2 bits (121), Expect = 0.002,   Method: Compositional matrix adjust.
 Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 6/69 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY TD FT+EAV  I ++  DKP FLYLA+ A H+   GK     ++ I  F  I D +R
Sbjct: 220 EYLTDAFTREAVSFI-NRHQDKPFFLYLAYNAVHSPLQGK-----KKDIQHFTQIEDIHR 273

Query: 74  RTYAALTKS 82
           + +AA+  S
Sbjct: 274 QIFAAMLSS 282


>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
 gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
          Length = 484

 Score =  177 bits (449), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 120/347 (34%), Positives = 177/347 (51%), Gaps = 37/347 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS  + TPN+D LA +G      Y     C PSRA L+TGKY    G +   
Sbjct: 34  GYADFGFQGSKIMKTPNLDKLAKSGAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 93

Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           + G         A+  G+PL +  + ++L++LGY T   GKWHLG   R Y P  RGF+ 
Sbjct: 94  VPGYMSANSKFLADDMGLPLDQLTIADHLKKLGYKTAMYGKWHLGDADR-YHPTKRGFDE 152

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
            +G+  G  +Y+       Y+   + N  + M R     +    EY TD   KEAV  IE
Sbjct: 153 FYGFRGGARNYFG------YNDVSKANLDNRMERGFGN-YQEPTEYVTDALAKEAVSFIE 205

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
               + P F+YLA  A H       ++A +E +N+F  +T   R+  AAM   LD + GT
Sbjct: 206 KNKGN-PFFIYLAFNAVHT-----PMQATKEDLNKFPNLTG-KRKELAAMTLALDRACGT 258

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V++ L+  G+ +N+I++F +DNG PT         +N   N P  G K+   EGG++VP 
Sbjct: 259 VLNKLKELGLDKNTIVVFSNDNGGPTD--------KNASLNLPLSGTKSNHLEGGIRVPF 310

Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           ++ W  QI+     +  +  + D LPT Y A GG+ + L  +IDG+D
Sbjct: 311 LISWPKQIKSKTVYNFPVSTL-DLLPTFYAAGGGNVADLK-DIDGVD 355



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 7/67 (10%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           EY TD   KEAV  IE    + P F+YLA  A H       ++A +E +N+F  +T   R
Sbjct: 190 EYVTDALAKEAVSFIEKNKGN-PFFIYLAFNAVHT-----PMQATKEDLNKFPNLTG-KR 242

Query: 74  RTYAALT 80
           +  AA+T
Sbjct: 243 KELAAMT 249


>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 489

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 179/361 (49%), Gaps = 53/361 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS+ I TPNID LA   ++L+  Y  P+C+P+RA ++TG YP   G+ G  +
Sbjct: 57  GWNDVGFHGSD-IRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              +  G+P      PE+L +LGY  +A+ GKWHLG     + PL  G    +G+ NG I
Sbjct: 116 SPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAI 175

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+         R  +L+ H   RN     D+V E  Y+T+L     V  I+     +PL
Sbjct: 176 DYFSR------ERFGQLDWH---RN----HDSVHEEGYSTELVGNAVVDFIDRHAGQQPL 222

Query: 269 FLYLAHLAAH---------------------AGNAGKHLEAPQETINQFQYITDPN--RR 305
           + Y+A  A H                     A N  + + A +E   ++      N  R+
Sbjct: 223 YAYVAFNAPHSPLQALRSDLDDYGFDPTGKLAPNTDRKI-AKRENALEYGKRGKGNSIRQ 281

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
           T+AAM   +D  +G ++ A+ R GM EN++++F SDNGA         + ++ GSN P R
Sbjct: 282 TFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGGSNEPLR 332

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           G K T WEGGV+V A +  P+          +    D LPT+  AAGG     P   DG+
Sbjct: 333 GNKFTTWEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAGGQP---PERTDGV 389

Query: 426 D 426
           +
Sbjct: 390 N 390


>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 472

 Score =  177 bits (448), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 119/367 (32%), Positives = 180/367 (49%), Gaps = 65/367 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPNID LA   + L+  Y  P+C+P+RA ++TG YP   G+ G  +
Sbjct: 40  GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 98

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              +  G+P      PE+L +LGY  +A+ GKWHLG     + PL+ G    +G+ NG I
Sbjct: 99  SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 158

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+         R  +L+ H         +D+V E  Y+T+L     V  I+      P+
Sbjct: 159 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGPV 205

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
           + Y+A  A H+      L+A +  ++++ +  DP                          
Sbjct: 206 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPTGKLAPNTDRKIAKREKALDYGKRGK 258

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               R+T+AAM   +D  +G ++ A+ R GM EN++++F SDNGA         + ++ G
Sbjct: 259 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 309

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN P RG K T WEGGV+V A++  P+          +    D LPT+  AAGG     P
Sbjct: 310 SNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIAYDSVTSYVDLLPTMVAAAGGPP---P 366

Query: 420 LNIDGLD 426
              DG++
Sbjct: 367 EETDGIN 373


>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 166

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 89/171 (52%), Positives = 120/171 (70%), Gaps = 6/171 (3%)

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           PLFLYL+HLA HAG     ++AP+E + +F YI   NR  YA  V  LD SVG V  AL+
Sbjct: 1   PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVFQALE 60

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
           +K ML+N++++F+SDNGA  + + E SN    GSN+P RG K TLWEGGV+VPA +W P 
Sbjct: 61  KKDMLQNTMLVFISDNGA--IPWGEWSNA---GSNWPLRGAKFTLWEGGVRVPAFVWGPL 115

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           +++  RVS Q+MH+SDW+PTLY+ AGG+ + L + IDGL+ W +L  N PS
Sbjct: 116 LRKPQRVSNQLMHVSDWVPTLYSLAGGNIADLGV-IDGLNMWQALSRNLPS 165



 Score = 52.4 bits (124), Expect = 8e-04,   Method: Compositional matrix adjust.
 Identities = 24/56 (42%), Positives = 34/56 (60%)

Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
          PLFLYL+HLA HAG     ++AP+E + +F YI   NR  YA    +   ++ IV+
Sbjct: 1  PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVF 56


>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
 gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
          Length = 452

 Score =  176 bits (447), Expect = 2e-41,   Method: Compositional matrix adjust.
 Identities = 118/344 (34%), Positives = 172/344 (50%), Gaps = 31/344 (9%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRA 133
           +A T++  +  ++    GW DLS +G  +  TP++DALA  G+     YA   VC+ +R 
Sbjct: 10  SASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRF 69

Query: 134 SLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           +L+TG+Y   +  G++ P +  A   G+P     LP  LR+ GY T  IGKWHLG     
Sbjct: 70  ALITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSLP-T 128

Query: 192 YTPLYRGFESHFGYLNGVISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
           + PL  G++  FG   G I Y+ H   + D  +R +      + R        VG Y T 
Sbjct: 129 FGPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDLYEGEVPVER--------VGYY-TQ 179

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           L   EA + I ++   KP FL L   A H    G   E     +    +    N  TYA 
Sbjct: 180 LLADEATRWIGERSAAKPFFLSLHFTAPHWPWVGPEDEEISLGLKDLFHYDGGNLATYAR 239

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV+ LD +VG V+ AL+ +G+ +N+I++F SDNG              +   +P+ G K 
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGG-----------ERFSKTWPFTGQKT 288

Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            L EGG++VP +L W  +I   P+VS Q+    DWLPTL  AAG
Sbjct: 289 ELLEGGIRVPTLLRWPARIA--PQVSDQVTATMDWLPTLLAAAG 330


>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 458

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 114/321 (35%), Positives = 177/321 (55%), Gaps = 27/321 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ F+G  +IPTP++D++A NG+  ++   + PVC PSRA L+TG+Y    G    P
Sbjct: 31  GYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHVSYPVCGPSRAGLLTGRYQDRFGFTTNP 90

Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
               E    G+PL E+ + E L+E+GYS+  IGKWH+G     + PL RGF+  FG+L+G
Sbjct: 91  TVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIGKWHMGTHPIHH-PLNRGFDHFFGFLSG 149

Query: 209 VISYY--DHILSD--QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
              Y+   + L D  +  R  +     + R+      + G Y TD+ T  AV  I+ +  
Sbjct: 150 GHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRERIQVSEG-YLTDILTDAAVDFIDKKAS 208

Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
           +K P  LYL++ A H       L+A ++ + +F +I D  RRTYAAMV  +DD VG V++
Sbjct: 209 EKKPFMLYLSYNAPHTP-----LQASEKYLKRFTHIKDSKRRTYAAMVSAVDDGVGRVMA 263

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
            L++  + E++++ F+SDNG P            +  N P+   K +L EGG +VP A+ 
Sbjct: 264 RLKKHKIEEDTLVFFLSDNGGP------LKGPMPFTDNGPFSKGKGSLHEGGTRVPFAVQ 317

Query: 383 W-----SPQIQQNPRVSLQMM 398
           W     S QI + P  SL ++
Sbjct: 318 WKGTIPSGQIYKKPMSSLDIL 338



 Score = 45.8 bits (107), Expect = 0.077,   Method: Compositional matrix adjust.
 Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           Y TD+ T  AV  I+ +  +K P  LYL++ A H       L+A ++ + +F +I D  R
Sbjct: 190 YLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHTP-----LQASEKYLKRFTHIKDSKR 244

Query: 74  RTYAALTKST 83
           RTYAA+  + 
Sbjct: 245 RTYAAMVSAV 254


>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 484

 Score =  176 bits (446), Expect = 3e-41,   Method: Compositional matrix adjust.
 Identities = 131/401 (32%), Positives = 198/401 (49%), Gaps = 70/401 (17%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           LLIV    G+NDL    S+ I TP++D LA  G  L + Y A P CTPSRASL+TG+YP 
Sbjct: 31  LLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGRYPQ 89

Query: 143 HTGM-----QGPPIWGAEPR------------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
             G+        P +G + +            G+   E+ LPEYL++LGY++   GKW L
Sbjct: 90  RNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGKWDL 149

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G  +R + P  RGF+  +G++N  I Y+ H   ++Y          M R  S   +  GE
Sbjct: 150 GSLKR-FLPTNRGFDEFYGFVNTGIDYFTH---ERYGVP------SMFRQTSLTEEDRGE 199

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQE--------- 291
           YAT+LF +EA+  ++     +P  LYL   A H  ++        ++AP+E         
Sbjct: 200 YATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRIRSTVQAPEEYKAMYPAVD 259

Query: 292 ----TINQFQY-----ITDP--NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
               T+N+++Y     +  P   +R Y A V  +D ++G V+  L+ KG+L+N+I+IF S
Sbjct: 260 PEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVLGKLEDKGVLDNTIVIFFS 319

Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
           DNG                 N P RG K  +WEGG++VP ++ W          +   + 
Sbjct: 320 DNGG-----------SGGADNSPLRGHKAQVWEGGIRVPCLVRWPDGGVPAGATNDSFLS 368

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
             + +P+L  AAGG T    + +DG + W +L     S RN
Sbjct: 369 SLELVPSLVKAAGG-TIPEDIVMDGRNWWPALTEAASSPRN 408


>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
          Length = 1489

 Score =  176 bits (445), Expect = 4e-41,   Method: Compositional matrix adjust.
 Identities = 125/395 (31%), Positives = 198/395 (50%), Gaps = 74/395 (18%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TP +D LA +GI L+  Y+Q +CTPSR++L++G YPIHTGMQ   I      G+P   + 
Sbjct: 136 TPFMDLLADDGIKLDKYYSQQLCTPSRSALLSGYYPIHTGMQHDVIQPESMFGMPAGHKL 195

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
           +P +L++          W+           Y     H    +G   YYD           
Sbjct: 196 MPSFLKDDQL-------WY-----------YNHKSPH--ACDGSNCYYD----------- 224

Query: 226 ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
                 M+RN + A D++G Y+T LF      ++E++    PLF+YL+    HA      
Sbjct: 225 ------MQRNGAIAEDSIGVYSTFLFVDAFTDVLENEDASTPLFMYLSWQNVHA-----P 273

Query: 286 LEAP------QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
           L+ P      +  ++    I DP+RRT+A++   LD+++  V+ A+++ G  +NSI++  
Sbjct: 274 LDPPPASFYTKTELSLVNSIEDPHRRTFASITIVLDNAMKKVVGAMKKSGFYDNSILVVA 333

Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR-VSLQ-M 397
           SDNG  +          + G NYPYRG K  L+EGG+ V A + SP I ++ R  S Q +
Sbjct: 334 SDNGGCS---------GSGGYNYPYRGGKQYLYEGGIHVNAFIHSPLIPKSARGTSYQGL 384

Query: 398 MHISDWLPTLYTAA-GGDTSRLPLNIDGLDQWSSLL---LNTPSR--RNSNIDGLDQWSS 451
            H+SDWLPTL T A   + + LP ++DG+DQW ++L      P+   R   +  +D WS 
Sbjct: 385 FHVSDWLPTLVTGAMKTEPTMLPADLDGVDQWHAILGSGRGAPAAYPRTEILHNIDLWSL 444

Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
            +    ++  + +          A+R+   KLV+G
Sbjct: 445 EIYGNVTKLRTPI---------QAIRVGDMKLVMG 470



 Score = 47.4 bits (111), Expect = 0.023,   Method: Compositional matrix adjust.
 Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 11/86 (12%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP-- 58
           M+RN + A D++G Y+T LF      ++E++    PLF+YL+    HA      L+ P  
Sbjct: 225 MQRNGAIAEDSIGVYSTFLFVDAFTDVLENEDASTPLFMYLSWQNVHA-----PLDPPPA 279

Query: 59  ----QETINQFQYITDPNRRTYAALT 80
               +  ++    I DP+RRT+A++T
Sbjct: 280 SFYTKTELSLVNSIEDPHRRTFASIT 305


>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
 gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
          Length = 465

 Score =  176 bits (445), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 120/346 (34%), Positives = 180/346 (52%), Gaps = 38/346 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL   GS +I TP++D LA  G+  +  Y   P+C+PSR  L+TG++P   G+   P
Sbjct: 40  GYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNP 99

Query: 151 IWGAE--PR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
               +  P    G+P TE+ +PEYL   GY +   GKWHLG   + YTP  RGF   +G+
Sbjct: 100 NIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLGH-TKGYTPPERGFTHWWGF 158

Query: 206 LNGVISYYDHILSDQYSRTVE-LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           L G   Y+         +  E LN   +  N +   D    Y TD  T  AV+ +++   
Sbjct: 159 LGGSRHYF------PVKKEAEGLNPSMIVSNFTDKTDIT--YLTDDITDRAVEFLQEAGK 210

Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
           DK P F+++++ A H  N     EA  E I +F+ + +  RR Y AMV  +D  +G ++ 
Sbjct: 211 DKKPFFMFVSYNAPHWPN-----EAKPEDIAKFRNVQNGERRVYCAMVYAMDRGIGRILD 265

Query: 324 ALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
           AL+  G+ +++I++F+SDN GAP     E S+      N P+RG K   +EGGV+VP I+
Sbjct: 266 ALKADGLEKDTIVVFLSDNGGAP-----EASS-----CNAPFRGAKRQHFEGGVRVPFII 315

Query: 383 WSPQIQQ--NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
             P  ++     V  Q +   D LP L  A G     +P  +DG+D
Sbjct: 316 RYPADKRLVPGSVCRQPVSSVDLLPALLKANG---RHIPRKLDGMD 358


>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
 gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
          Length = 460

 Score =  175 bits (444), Expect = 5e-41,   Method: Compositional matrix adjust.
 Identities = 129/374 (34%), Positives = 193/374 (51%), Gaps = 47/374 (12%)

Query: 79  LTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
           LT+   + L+I    GWND+ +HGS  I TPNID LA NG+ LN  YA P C+PSRASL 
Sbjct: 31  LTEKPNIILIIADDAGWNDVEYHGS-VIQTPNIDFLAKNGVELNRFYANPTCSPSRASLF 89

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG+     G+   PI       +P +   LP+ L +  Y T  IGKWHLG  ++   P  
Sbjct: 90  TGRPASRMGIVA-PISDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLG-LQQSSGPKA 147

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY------ATDL 250
            GF+  +G+L+G I  Y H+                 +N   +W   GE+      ATDL
Sbjct: 148 YGFDYSYGFLHGQIDQYTHLY----------------KNGDKSWYRNGEFIDEQGHATDL 191

Query: 251 FTKEAVQ-LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
            T EA+  L E +  +K  FL +A+ A H       L+  Q+  + +   I D +RR +A
Sbjct: 192 ITNEAIHWLSEKRDSNKNFFLEVAYSAPHFP-----LQEEQKWKDPYMNSIKDSSRRDFA 246

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN--------WGS 360
           A +  LD S+G ++  L+++ + +N+++IFMSDNGA  +E  ++ N  N         G 
Sbjct: 247 AAMSHLDYSIGILLEKLKQENLDKNTLVIFMSDNGA--MENWDSRNEYNGVHPSNTVLGD 304

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           N P R  K + +EGG++VP I +     +N + S   + I D LP++   AG     LP 
Sbjct: 305 NSPLRDWKTSNYEGGIRVPCIFYWKDHLKNYK-SESYISIIDLLPSILFLAG--DKNLPQ 361

Query: 421 NIDGLDQWSSLLLN 434
           +I+G + W ++  N
Sbjct: 362 SIEGKNVWPAIAEN 375


>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
 gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
          Length = 579

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 118/367 (32%), Positives = 181/367 (49%), Gaps = 65/367 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPNID LA   + L+  Y  P+C+P+RA ++TG YP   G+ G  +
Sbjct: 147 GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 205

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             ++  G+P      PE+L +LGY  +A+ GKWHLG     + PL+ G    +G+ NG I
Sbjct: 206 SPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 265

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+         R  +L+ H         +D+V E  Y+T+L     V  I+      P+
Sbjct: 266 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGPV 312

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
           + Y+A  A H+      L+A +  ++++ +  DPN                         
Sbjct: 313 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLAPNTDRKIAKREKALDYGKRGK 365

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               R+T+AAM   +D  +G ++ A+ R GM EN++++F SDNGA         + ++ G
Sbjct: 366 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 416

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           +N P RG K T WEGGV+V A++  P           +    D LP++  AAG   S  P
Sbjct: 417 NNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAG---SPPP 473

Query: 420 LNIDGLD 426
              DG++
Sbjct: 474 EETDGIN 480


>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 160

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 80/144 (55%), Positives = 102/144 (70%), Gaps = 6/144 (4%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPNID LA NG++L+N Y  P+CTPSR+++MTG+YPIHTGMQ   I
Sbjct: 21  GWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 80

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +G  P GV L E+FLP+YL++ GY T  +GKWHLGFF +EYTP YRGF+S++G   G   
Sbjct: 81  YGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFDSYYGSYLGKGD 140

Query: 212 YYDHILSDQYSRTVELNGHDMRRN 235
           Y++H      S T    G D+  N
Sbjct: 141 YWNH------SNTETFTGLDLHDN 158


>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 512

 Score =  175 bits (444), Expect = 6e-41,   Method: Compositional matrix adjust.
 Identities = 126/384 (32%), Positives = 190/384 (49%), Gaps = 40/384 (10%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
           T+A    +  L      G++D+ +HG+  I TPNID++A  G+  +  Y +  VC PSRA
Sbjct: 15  TFADKQPNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRA 74

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            L+TG Y    G    P     P        G+P ++  + E L+ LGY+   IGKWH+G
Sbjct: 75  GLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMG 134

Query: 187 FFRREYTPLYRGFESHFGYLNGVISY--YDHILSDQYSRTVELNGHDMR-RNLSTAWDTV 243
           F      P  RG++  +G++NG   Y  +    +   SR       +M   N +   D  
Sbjct: 135 F-DMSLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVF 193

Query: 244 GE---------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 294
            E         Y TDLFT EAV  I D+  DKP FLYLA+ A H        +  Q  ++
Sbjct: 194 KEKGVKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALD 247

Query: 295 QFQYITD-PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT---VEYR 350
           +  ++ D  N   +A+MV  +D+ +G V+  L+ K + +N+IIIF+SDNG+P    +E+ 
Sbjct: 248 KTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHS 307

Query: 351 ETSNYRNWG-----SNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWL 404
                R+ G     S   +RG K   +EGG++VP  I W  QIQ+  +  + +  + D  
Sbjct: 308 PKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISAL-DLQ 366

Query: 405 PTLYTAAGGDTSR--LPLNIDGLD 426
           PTL  AAGG+  +       DG+D
Sbjct: 367 PTLVKAAGGNDKKPQKGFAYDGVD 390



 Score = 42.4 bits (98), Expect = 0.81,   Method: Compositional matrix adjust.
 Identities = 21/33 (63%), Positives = 23/33 (69%), Gaps = 1/33 (3%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
           Y TDLFT EAV  I D+  DKP FLYLA+ A H
Sbjct: 205 YLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH 236


>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 489

 Score =  175 bits (443), Expect = 7e-41,   Method: Compositional matrix adjust.
 Identities = 118/367 (32%), Positives = 178/367 (48%), Gaps = 65/367 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+ FHGS EI TPNID LA   + L+  Y  P+C+P+RA ++TG YP   G+ G  +
Sbjct: 57  GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              +  G+P      PE+L +LGY  +A+ GKWHLG     + PL+ G    +G+ NG I
Sbjct: 116 SPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 175

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
            Y+         R  +L+ H         +D+V E  Y+T+L     V  I+       +
Sbjct: 176 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGAV 222

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
           + Y+A  A H+      L+A +  ++++ +  DPN                         
Sbjct: 223 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLSPNTDRKIAKREKALDYGKRGK 275

Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               R+T+AAM   +D  +G ++ A+ R GM EN++++F SDNGA         + ++ G
Sbjct: 276 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 326

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN P RG K T WEGGV+V A++  P           +    D LP +  AAGG     P
Sbjct: 327 SNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAAGGPP---P 383

Query: 420 LNIDGLD 426
              DG++
Sbjct: 384 EETDGIN 390


>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
 gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
          Length = 471

 Score =  174 bits (442), Expect = 1e-40,   Method: Compositional matrix adjust.
 Identities = 120/377 (31%), Positives = 186/377 (49%), Gaps = 44/377 (11%)

Query: 74  RTYAALTKSTTLTLL-------IVY------GWNDLSFHGSNEIPTPNIDALAYNGIILN 120
           R  AAL+ S   T L       IV+      G+ D  F GS  + TPN+D LA  G+   
Sbjct: 7   RVVAALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASEGVRFT 66

Query: 121 NMY-AQPVCTPSRASLMTGKYPIHTGMQG----------PPIWGAEPRGVPLTERFLPEY 169
             Y +   C PSRA +MTG+Y    G +             I GAE  G+PL E  + +Y
Sbjct: 67  QGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAE-MGIPLDEVTMGDY 125

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
           ++ LGY T   GKWHLG    E  P++RGF+  +G+  G  SY+ + ++    ++     
Sbjct: 126 MKSLGYRTAFYGKWHLGG-TDELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTD 184

Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
             +   +    +  G Y TD+  ++A Q IE  P DKP F++L+  A H       +EA 
Sbjct: 185 KKLEHGIDQFQEHEG-YLTDVLAEKANQFIEKAP-DKPFFIFLSFNAVHT-----PMEAT 237

Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
            E + +F  +    R+  AAM   LD + G V++ L+  G+ ++++++F +DNG PT   
Sbjct: 238 PEDLAKFPQLKG-KRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPTD-- 294

Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
                 +N  SNYP  G K+   EGG++VP ++  P      +V  + +   D LPT + 
Sbjct: 295 ------KNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFK 348

Query: 410 AAGGDTSRLPLNIDGLD 426
           A GG+   +   +DG+D
Sbjct: 349 AGGGE--EVMSELDGVD 363


>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
          Length = 470

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 115/337 (34%), Positives = 176/337 (52%), Gaps = 29/337 (8%)

Query: 95  DLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
           D +  G+ ++ TP+ID LA  G+ L   YA  PVC+PSRA L+TG+YP+  G+ G     
Sbjct: 61  DANCFGAKDLETPHIDGLAARGVRLTQFYAAAPVCSPSRAGLLTGRYPVRGGVPGNVSST 120

Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
           A   G+P T+  + E  +  GY+T  IGKWHLG +  +  PL +GF++ FG++ G I  +
Sbjct: 121 AGVAGMPTTQVTMAETFKAAGYATAHIGKWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNF 179

Query: 214 DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLA 273
            H    Q       N HD+ ++        G +  DL  +EA Q I     ++P F+Y A
Sbjct: 180 SHFFYWQGP-----NRHDLHKDGREV-QRPGRFFPDLMVEEASQFIGTN-RERPFFVYFA 232

Query: 274 HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
               H    G         + +++ +  P R  YAA V  +D+ +G ++  +   G+ E 
Sbjct: 233 LNGPHYPYQGD-----PRWLERYKDLPYP-RNLYAAFVSSIDERIGGLLKRVDELGLREK 286

Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPR 392
           +I++F SD+G  T    E   +   GS   YRG K +L+EGG++VPAI+ W  ++ Q   
Sbjct: 287 TIVVFQSDHGHST----EERAHLGGGSAGIYRGAKFSLFEGGIRVPAIISWPGRLPQGEA 342

Query: 393 VSLQMMHISDWLPTLYTAAGGDTSRLPL---NIDGLD 426
            S Q+ H  DWLPTL   A      +PL   +IDG++
Sbjct: 343 RS-QLAHGCDWLPTLAALAA-----VPLVEPDIDGVN 373


>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
 gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
          Length = 462

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 125/375 (33%), Positives = 185/375 (49%), Gaps = 39/375 (10%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQP 126
           +T P+  + A    +         G+NDLS +G+ +I TP ID+L   GI   + Y A P
Sbjct: 22  LTPPSAASSAEKPPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASP 81

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           VC+PSRA+L+TG+YPI  G+ G   W     G+   E  + E L+E GY T  +GKWHLG
Sbjct: 82  VCSPSRAALLTGRYPIRQGITG-VFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLG 140

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
             ++ + PL  GF S+FG     I Y     S+     V + G+D+      +++    Y
Sbjct: 141 HHQK-HLPLQNGFHSYFG-----IPY-----SNDMDMVVYMRGNDVE-----SYEVDQHY 184

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
            T  +T+EAVQ IE Q  D+P FLYLAH   H                   ++    R  
Sbjct: 185 TTRRYTEEAVQFIE-QNKDQPFFLYLAHSMPHV-----------PIYASENFVGTSKRGL 232

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y  ++++LD SV  ++  L +  + EN++++F SDNG P    +        GS  P R 
Sbjct: 233 YGDVIQELDWSVAQILDTLDKHQLSENTLVVFTSDNG-PWTALKHLG-----GSAAPLRE 286

Query: 367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            K   ++GG++VP ++ W  QI    + S  M ++ DW PT    A  DT +   +IDGL
Sbjct: 287 GKMFTFDGGMRVPCLVRWPAQIPAG-QTSHAMANMMDWFPTFSRIANLDTPK-SRSIDGL 344

Query: 426 DQWSSLLLNTPSRRN 440
           D    L  + P   N
Sbjct: 345 DITDVLTGSGPRADN 359



 Score = 40.4 bits (93), Expect = 2.7,   Method: Compositional matrix adjust.
 Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 1/47 (2%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
           MR N   +++    Y T  +T+EAVQ IE Q  D+P FLYLAH   H
Sbjct: 170 MRGNDVESYEVDQHYTTRRYTEEAVQFIE-QNKDQPFFLYLAHSMPH 215


>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 468

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 182/363 (50%), Gaps = 53/363 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM---- 146
           G+ DLS  G N   TP +D LA +G  L + Y   P CTPSRASLMTG+YP   G     
Sbjct: 43  GFADLSCIGDNGCRTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMI 102

Query: 147 --QGP-----------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
             + P            +      G  L E FL + L++ GY +   GKW  G  +R Y 
Sbjct: 103 RNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YL 161

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           PL RGF+ ++G+ N  + Y+ H   ++Y          M R+     +  G Y TDLF +
Sbjct: 162 PLQRGFDQYYGFANTGVDYFTH---ERYGVP------SMFRDNQPTEEDKGTYLTDLFER 212

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHL----EAPQETINQF---QYITDPNRR 305
           EA++ I D+  D+P FLYL   A H A N  + +    +APQE ++ F   +   +  R+
Sbjct: 213 EAIRFI-DENHDRPFFLYLPFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQEKRRQ 271

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y A V+++D+++G V+  LQ+  + +N++IIF+SDNG                 N P R
Sbjct: 272 AYLAAVERMDEAIGKVVDQLQQHQIADNTLIIFLSDNGGGGG-----------ADNSPLR 320

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI--D 423
           G K  ++EGG +VP I+  P      +VS Q +   +  PT+  A GG   +LP ++  D
Sbjct: 321 GGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGG---KLPDDVIYD 377

Query: 424 GLD 426
           G D
Sbjct: 378 GFD 380



 Score = 42.7 bits (99), Expect = 0.60,   Method: Compositional matrix adjust.
 Identities = 29/70 (41%), Positives = 39/70 (55%), Gaps = 7/70 (10%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHL---- 55
            R N  T  D  G Y TDLF +EA++ I D+  D+P FLYL   A H A N  + +    
Sbjct: 192 FRDNQPTEEDK-GTYLTDLFEREAIRFI-DENHDRPFFLYLPFNAPHSASNLDRSIRGFA 249

Query: 56  EAPQETINQF 65
           +APQE ++ F
Sbjct: 250 QAPQEYLDHF 259


>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 466

 Score =  174 bits (440), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 129/367 (35%), Positives = 181/367 (49%), Gaps = 53/367 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---- 146
           G+ DL  +G+  + TP +D LA  G+ L + Y A P CT SRA+L+TG+YP   G+    
Sbjct: 46  GYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQL 105

Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
                +G    G+  +E  +PEYL++ GY T   GKW++GF      P  RGF+  FG+ 
Sbjct: 106 SADENYGD---GLRKSEVLIPEYLKQQGYRTACFGKWNVGF-SPGSRPTERGFDEFFGFA 161

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
            G I YY H  + +         HD+ R L   +  V  Y+TDLF   A Q I  +  D+
Sbjct: 162 AGNIDYYHHYYAGR---------HDLWRGLKEVF--VEGYSTDLFADAACQYISAE-SDQ 209

Query: 267 PLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQY---ITDPNRRTYAAMVKKLDD 316
           P F+YL   A H           G   +AP     ++ Y     +P  R Y A+V  LD 
Sbjct: 210 PFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQER-YRAVVTALDS 268

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           ++G V+  L   G+ + +I+I+ SDNGA  ++ R         SN P R    TLWEGG+
Sbjct: 269 AIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEV----ASNKPLRDGGVTLWEGGI 324

Query: 377 KVPAILWSP------QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQW 428
           +VPAI+  P       + Q+P +SL      D LPTL T AGG    LP    +DG D  
Sbjct: 325 RVPAIIRYPGHLKAGTVNQSPLISL------DILPTLITLAGGP---LPAERILDGQDML 375

Query: 429 SSLLLNT 435
            +L   T
Sbjct: 376 PALAAQT 382


>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 455

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 119/345 (34%), Positives = 175/345 (50%), Gaps = 25/345 (7%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
           +L++Y    G  D   +GS+++ TP ID LA  GI    MYA   +C+ SRA L+TG+ P
Sbjct: 34  VLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICSASRAGLLTGRLP 93

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           +  G+           G+P +E  L +  ++ GY T  +GKWHLG+      PL +GF+S
Sbjct: 94  VRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLGY-TPATMPLGQGFDS 152

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            FG++ G I  Y H            N HD+ R+    +   G++  DL   E  +L+ +
Sbjct: 153 SFGHMGGCIDNYSHFFYWNGP-----NRHDLWRDGKEVFHD-GQFFPDLLVDEMQRLLTE 206

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           +  ++PLF+Y A    H    G       +  N +  +  P RR YAA V  LD+ +G V
Sbjct: 207 KS-EQPLFVYWAINVPHYPYQGT-----DKWRNYYADLETP-RREYAAFVSTLDEKLGEV 259

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           ++ L+  G  EN+I++F SD+G  T    E   +   GS  PYRG K  L+EGG++VP++
Sbjct: 260 LATLEETGQRENTIVVFQSDHGHST----EERAFFGGGSAGPYRGAKGCLFEGGIRVPSV 315

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           +  P       V  QM    DWLPTL      D   LPL +DG D
Sbjct: 316 ISWPAGLPQGEVRDQMAVGCDWLPTLVELTKVDVD-LPL-LDGKD 358


>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
 gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
          Length = 405

 Score =  173 bits (439), Expect = 2e-40,   Method: Compositional matrix adjust.
 Identities = 110/322 (34%), Positives = 164/322 (50%), Gaps = 25/322 (7%)

Query: 95  DLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
           DL+ +G+ ++ TP++D++A  GI     YA  PVC+PSRA ++TG++P   G+ G     
Sbjct: 23  DLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPSRAGMLTGRFPARAGVPGNVSSH 82

Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
               G+P  +  + E +++ GY T  IGKWHLG+   E  P  +GFE+ FG++ G I   
Sbjct: 83  HGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGY-TPETMPHGQGFETSFGHMGGCI--- 138

Query: 214 DHILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
                D YS     NG   HD+  N    W   G +  DL  ++    I  +  DKP FL
Sbjct: 139 -----DNYSHFFYWNGPNRHDLWENGKEVWRD-GAFFPDLMVEQCQDYIR-KAGDKPFFL 191

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y A    H       L+  ++    + +++ P R  YAA V  +DD +G V++ L    +
Sbjct: 192 YWAINVPH-----YPLQGKEKWRKTYAHLSSP-RDKYAAFVSTMDDCIGEVLATLDACQL 245

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            E +IIIF SD+G       E   +   GS  PYRG K +L+EGG++VPA++  P     
Sbjct: 246 REKTIIIFQSDHG----HSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAE 301

Query: 391 PRVSLQMMHISDWLPTLYTAAG 412
             V  Q+    DWLPT+    G
Sbjct: 302 GEVRDQLATGCDWLPTISALTG 323


>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 479

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 117/326 (35%), Positives = 165/326 (50%), Gaps = 40/326 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ND+ F+GS +I TPN+D LA NG+I+   Y A P C PSR S+MTGKY    G Q   
Sbjct: 39  GYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGKYAHTMGAQFNI 98

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P        G+PL  +F+ + L+E GY T A GKWHLG     + P  RGF+  +G+L G
Sbjct: 99  PSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLGA-DTPFHPNKRGFDEFYGFLGG 157

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNL----------STAWDTVGEYATDLFTKEAVQL 258
              Y    + +QY    E       +N+           T  D   EY TD  ++EAV  
Sbjct: 158 GHDY----IPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEK-EYITDGLSREAVNF 212

Query: 259 IEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           +      K P F+YLA+ A H       L+A +E +  F+ I D  RRTYAAMV  +D  
Sbjct: 213 VYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEKRRTYAAMVYAVDRG 267

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           VG ++ AL+    LEN++++F SDNG    +           +N+P    K + +EGG +
Sbjct: 268 VGKLVEALKANQQLENTLLVFFSDNGGKLGK---------GANNFPLTEGKGSAYEGGHR 318

Query: 378 VPAIL-WSPQIQ-----QNPRVSLQM 397
           VP    W  +I+      NP  +L +
Sbjct: 319 VPMFFHWPNRIKAGQKFHNPVSALDL 344



 Score = 46.2 bits (108), Expect = 0.050,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 6/68 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
           EY TD  ++EAV  +      K P F+YLA+ A H       L+A +E +  F+ I D  
Sbjct: 199 EYITDGLSREAVNFVYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEK 253

Query: 73  RRTYAALT 80
           RRTYAA+ 
Sbjct: 254 RRTYAAMV 261


>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
          Length = 266

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 93/235 (39%), Positives = 134/235 (57%), Gaps = 32/235 (13%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ D+ +HG+ E  TPN+D LA  G+ L N Y QP+C+P+R+ LMTG+Y IHTG+Q   
Sbjct: 38  YGYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTGRYQIHTGLQHDI 96

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           IW ++P G+PL    + + L+ +GYST AIGKWHLG +++EYTPLYRGF+S++GYL G  
Sbjct: 97  IWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGE 156

Query: 211 SYYDHILSDQY-SRTVELN------------------------GHDMRRNLSTAWDTVGE 245
            YY +   D + +RT   +                        G+D+R       D  G 
Sbjct: 157 DYYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDLRDMNEPVTDMNGT 216

Query: 246 YATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           Y+T L+TK+A+ +I       KP  LYLA+ A H+      +E P E    + +I
Sbjct: 217 YSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHSP-----MEVPAEYTKPYTFI 266


>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
          Length = 507

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 112/363 (30%), Positives = 173/363 (47%), Gaps = 54/363 (14%)

Query: 79  LTKSTTLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           L     + L+  +GW ++ +H    + E+ TPNID L   G+ LN  YA   C+PSR+SL
Sbjct: 22  LKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELNQHYAYRCCSPSRSSL 81

Query: 136 MTGKYPIHTGMQG---------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           ++G+ PIH   Q           PI G     +P     + E ++E GY+T  +GKW  G
Sbjct: 82  ISGRLPIHVSDQNIAPTNYNPNDPISGFS--AIPRNMTGIAEKMKEAGYATHQVGKWDAG 139

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHIL-------------SDQYSRTVELNGHDMR 233
               ++TP  RGF++ FGY +    YY  ++             +DQ +  +   G D  
Sbjct: 140 MATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGVVDLWNTDQPAHGINGTGPD-- 197

Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
                      +Y   LF +  + ++       PLFLY A    H       L+ P E +
Sbjct: 198 -----------KYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT-----PLQVPDEYL 241

Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
           N+F +I D +R  Y AMV  LDD VG ++SAL++KGM +N + +  SDNG P   +    
Sbjct: 242 NKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPVYPF---- 297

Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAA 411
                 +NYP +G K + W+GG++V A +    + +  R   +   +H++DW  T    A
Sbjct: 298 ---GGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCAIA 354

Query: 412 GGD 414
           G D
Sbjct: 355 GVD 357



 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 3/71 (4%)

Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV--PQSHEQPDLVQADP 619
           C +  C L+N+ +DP E+ N+ASS PD   ++ E  K +  T     + H  P   QA  
Sbjct: 437 CGDSGC-LYNIKDDPEERKNLASSMPDKLKEMQEKSKKYLATYFNPDRGHASPLACQAAV 495

Query: 620 KRFNDTWSPWI 630
            ++   W P++
Sbjct: 496 NKYGGFWGPFV 506


>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
           siliculosus]
          Length = 597

 Score =  173 bits (438), Expect = 3e-40,   Method: Compositional matrix adjust.
 Identities = 141/429 (32%), Positives = 207/429 (48%), Gaps = 56/429 (13%)

Query: 92  GWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G ND+ +  ++    TP ++ LA  G+IL++ Y+  +CTPSRASLMTG+    TGMQ   
Sbjct: 85  GTNDVGYESTDLWQLTPFMNTLAAEGVILDDYYSNEICTPSRASLMTGRDSFRTGMQFGV 144

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           +  +   G+P+ E  L E  +  GYST   GKWHLG +     P  RGF++  GY  G  
Sbjct: 145 VEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANYPFSRGFDTFLGYTGGGE 204

Query: 211 SYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDT-------VGEYATDLFTKEAV 256
            YY H           QY+   +    D    ++   +T        G+Y+T + T  A+
Sbjct: 205 GYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRKGPEMTGKYSTTVITDRAI 264

Query: 257 QL----IEDQPVDKPLFLYLAHLAAH----AGNAGKHLEAPQETINQFQYITDPNRRTYA 308
           ++    +E  P D PLFLY+AH A H    +   G   +   E +    Y +   RR  A
Sbjct: 265 EVAREHVEKSPSD-PLFLYVAHQAVHEPLDSPPPGSFTDEEVELLEGLDY-SHRERRALA 322

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           ++V  LD  V  + +AL   G+L+N++I+F SDNGA              GSNYP RG K
Sbjct: 323 SIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGACP---------SKGGSNYPLRGFK 373

Query: 369 NTLWEGGVKVPAILWS------PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN- 421
           +T ++GGV+VP  ++S      P+  +  R S  MMH  DW PTL +   G    LPL+ 
Sbjct: 374 HTTFQGGVRVPGFVYSKSADLIPEEARGTRYS-GMMHSMDWTPTLGSVVPG----LPLDE 428

Query: 422 ----IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-IDEKKRTAAV 476
               + G DQW +++      R S  +G  +   +L  T    +S     + E     A 
Sbjct: 429 MGKELSGFDQWDAIV-----GRRSAEEGPVRTELVLGRTSYDFDSDAGEMVKEPLPKGAF 483

Query: 477 RLDSWKLVL 485
             D WKL+L
Sbjct: 484 IHDGWKLIL 492


>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
          Length = 482

 Score =  172 bits (437), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 110/356 (30%), Positives = 172/356 (48%), Gaps = 33/356 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ND+ ++    I  PN+ +LA NG IL+N Y QPVCTPSR++LMT +YPI  G+Q   I
Sbjct: 94  GFNDMPWNNP-AIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGLQTDVI 152

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              +P  +PL E  +    +  GY+T  +GKWHLG +  +  P  RGF++  GYL G   
Sbjct: 153 TAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYLTGAED 212

Query: 212 YYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL----------FTKEAV 256
           YY       ++ +Q       + +D    +  A  T   Y   +          F   + 
Sbjct: 213 YYKKTFCIPLVPNQRPAACGFDFYDNENRMPKANGTYSTYQVLIYLFIHTIILKFADASR 272

Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           ++I+     K P FLYL   + H       +  P+   + +    +  R+ Y  MV  +D
Sbjct: 273 EVIKSHEGSKTPFFLYLPFQSVHYP-----VMVPKNYSDLYPNEKNHTRKEYMGMVTAMD 327

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           ++VG ++  L+  G  EN++  + +DNG               G N P+RG K TLW+GG
Sbjct: 328 EAVGHIVDELKAAGAYENTLFFYATDNGGLVGA---------GGRNAPFRGQKATLWQGG 378

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           ++VPA +  P +         MMHI D   T+  A G   +    ++DG+  WS++
Sbjct: 379 LRVPAFISGPGVPSASNYK-NMMHIVDVQMTVLDAIGHQRTG-EKHVDGVSHWSAI 432


>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
 gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
          Length = 471

 Score =  172 bits (436), Expect = 4e-40,   Method: Compositional matrix adjust.
 Identities = 114/326 (34%), Positives = 159/326 (48%), Gaps = 36/326 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           G  DL  +G  +I TPN+D LA  GI     YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 44  GHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALITGRYQYRLRGGLEE 103

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P    ++  G+P T   LP  L++ GY T  IGKWHLG+    + PL  G++S FG   G
Sbjct: 104 PIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGYLP-TFGPLKSGYDSFFGNYGG 162

Query: 209 VISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            I Y+ H       +  D Y   V ++              +G Y TDL    AV  ++ 
Sbjct: 163 AIDYFTHKPGVGPQVKEDLYEGEVPVH-------------QIGYY-TDLLGARAVDFVQK 208

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           Q   KP  L L + A H    G   EA    I+   +    +  TY  MV+ LD S+G V
Sbjct: 209 QQAGKPFLLSLHYTAPHWPWEGPGDEAISRDIDNIFHYDGGDLATYGRMVESLDASIGRV 268

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+ KG+ EN+I+IF SDNG              +   +P  G K  L EGG++VPAI
Sbjct: 269 LKALEDKGLAENTIVIFTSDNGG-----------ERFSKTWPLTGQKTELLEGGIRVPAI 317

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTL 407
           +  P      +VS Q+    DWLPTL
Sbjct: 318 VRWPAKIAAGQVSEQVAISMDWLPTL 343


>gi|381205411|ref|ZP_09912482.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
          Length = 448

 Score =  172 bits (435), Expect = 6e-40,   Method: Compositional matrix adjust.
 Identities = 132/427 (30%), Positives = 206/427 (48%), Gaps = 60/427 (14%)

Query: 79  LTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMT 137
           + K+  + +    G+ DLS  G+ +  TPN+D LA  G      Y+  PVC+PSRA+L+T
Sbjct: 1   MKKNVIVIMTDDQGYGDLSCMGATDFKTPNLDRLANEGARFKCWYSNSPVCSPSRAALLT 60

Query: 138 GKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRE-LGYSTKAIGKWHLGFFRREYTPLY 196
           G+YP + G++       +  G+P     LP+ LR+ LGY T   GKWHLG       P  
Sbjct: 61  GRYPANAGVRAILDGHRKATGLPKETLTLPKILRDRLGYHTAMSGKWHLG-LAEGCRPQD 119

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNG----HDMRRNLSTAWDTVGEYATDLFT 252
            GFE  FG+L G + ++ HI     +R  E  G    HD+  N    W   GEY T++ T
Sbjct: 120 HGFEQWFGFLAGCVDFFSHIFYWGMNR--EGPGMNPTHDLWVNNKEVWKN-GEYLTEVIT 176

Query: 253 KEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
           + ++Q +     D K  FLY+ + A H       + APQ+ +++F  +   +R+  AAMV
Sbjct: 177 EYSIQALRRAKKDSKSFFLYIPYNAPHY-----PMHAPQKYMDRFGGLP-WDRQVMAAMV 230

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW----------GSN 361
             +DD +G + + L R   +E++I  F SDNG      RE+   RNW          GS+
Sbjct: 231 SAVDDGIGEIFNELARLDFIEDTITFFTSDNGPS----RES---RNWLDGTLDPYYGGSS 283

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             ++G K +L+EGGV+VP I+  P +    ++  +     D LPT+          L L+
Sbjct: 284 GGFKGHKFSLFEGGVRVPGIVHYPGVVPASKIVEEPFCSFDMLPTI-------CEWLHLD 336

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
           I  ++                +DG+ Q   L    P+    +   +D++    ++R   W
Sbjct: 337 ISDME----------------LDGISQAGFLEGKQPALERDLFWEMDDQ---TSIRRGDW 377

Query: 482 KLVLGTQ 488
           KLVL  Q
Sbjct: 378 KLVLNGQ 384


>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
          Length = 562

 Score =  172 bits (435), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 152/579 (26%), Positives = 249/579 (43%), Gaps = 110/579 (18%)

Query: 85  LTLLIVYGWNDLSFHGS----NEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
           + L+   GW D+ +H S    ++I TP ID L   GI L   Y   VC+P+RAS  +G+ 
Sbjct: 57  MVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRASFQSGRL 116

Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
           P+H G+ G  +  A   G+P     + ++L + GY++  +GKW +G     +TP  RG+ 
Sbjct: 117 PVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTPHGRGYN 175

Query: 201 S---HFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAWDTVGE--------YAT 248
           +   +FG+ N + +  +   S +  S        D  ++    WDT           Y  
Sbjct: 176 TSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDF---WDTDRPAHNLSGTLYEE 232

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            LF +    +IE     +PLFL  A   AH       L+AP E   QF  I  P+RR Y 
Sbjct: 233 QLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEPPSRRVYH 287

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS------------NYR 356
           AMVK LDD++  + + +Q+KGM  N++++F SDNG P + +                N  
Sbjct: 288 AMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCYNGE 347

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAGGD 414
              +NYP RG K + +EGG++V A +    I  + R S    +MHI+DW  T    AG  
Sbjct: 348 AGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWYATFCALAG-- 405

Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRT 473
                  +D  D  ++   N P      ID  + W  L      S R++VL++ D     
Sbjct: 406 -------VDPTDHRAA-AANLPP-----IDSHNMWPYLAGQARFSPRDTVLVDRD----- 447

Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
             +    +KL+LG                   P+ N++         +   +S       
Sbjct: 448 -VLVFGDYKLLLGASN----------------PMANWSGPAYPNATTATSPIS------- 483

Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
                      + T++C             + P  LFN+  DP E++++A  +PDI   +
Sbjct: 484 -----------RYTLNC-------------SDPGCLFNVVEDPEERHDLAEEKPDILKLM 519

Query: 594 YELLKYHRRTLVPQSHE--QPDLVQADPKRFNDTWSPWI 630
              L    +++ P +     P  ++    +++  + PW+
Sbjct: 520 KFTLTELAKSIFPYNPGPVDPQCMKTAETKYDGFYGPWL 558



 Score = 42.0 bits (97), Expect = 0.94,   Method: Compositional matrix adjust.
 Identities = 39/129 (30%), Positives = 52/129 (40%), Gaps = 25/129 (19%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y   LF +    +IE     +PLFL  A   AH       L+AP E   QF  I  P+RR
Sbjct: 230 YEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEPPSRR 284

Query: 75  TYAALTKS--------TTL---------TLLIVYGWND---LSFHGSNEIPTPNIDALAY 114
            Y A+ K         TTL         TL++    N    ++F G  +   P+   + Y
Sbjct: 285 VYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCY 344

Query: 115 NGIILNNMY 123
           NG    N Y
Sbjct: 345 NGEAGANNY 353


>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
           Formylglycine-dependent sulfatase, N-terminal
           [Ectocarpus siliculosus]
          Length = 610

 Score =  172 bits (435), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 144/445 (32%), Positives = 214/445 (48%), Gaps = 73/445 (16%)

Query: 85  LTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
             L+   G ND+ +  ++  E+ TP +D+LA +G++L+N Y   +CTPSRASLMTG+   
Sbjct: 74  FILIDDVGTNDMGYRSTDLWEL-TPFLDSLASSGVVLDNYYTNQLCTPSRASLMTGRDSF 132

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
            TGMQ   +  + P G+P  E  L +  +  GYST   GKWHLG F     P  RGF++ 
Sbjct: 133 RTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYPFSRGFDTF 192

Query: 203 FGYLNGVISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGE-------YAT 248
            GY  G   YY+H           +YS   +    D    +    +T  E       Y+T
Sbjct: 193 LGYTGGGEGYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGPAMVDNYST 252

Query: 249 DLFTKEAVQLIEDQ----PVDKPLFLYLAHLAAHAGNAGKHLEAP-------QETINQFQ 297
            + T  A+ +  +       D PLFLY+A+ AAH       L+AP       +E     +
Sbjct: 253 TIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTFTEEEEETLAR 307

Query: 298 YITD-PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
            ++D P+R  +A ++  LD  V  +  AL+  G+L+N++I+  SDNGA +          
Sbjct: 308 VVSDEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGACS---------S 358

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL-----QMMHISDWLPTLYTAA 411
           + GSN+P RG KNT ++GGV+VPA ++S   +  P  +       MMH +DW  TL    
Sbjct: 359 SGGSNHPLRGFKNTTFQGGVRVPAFVYSKSTELIPEEARGTTYSGMMHSTDWTQTLGRVV 418

Query: 412 GGDTSRLPL-----NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN 466
            G    LPL      + G DQW +++      R S  +G  +   +L      RNS   +
Sbjct: 419 PG----LPLAGMGKELSGFDQWDAIV-----GRRSAEEGPVRTDMVL-----GRNSFTFD 464

Query: 467 IDEK---KRTA---AVRLDSWKLVL 485
            D     KR +   A   D WKL+L
Sbjct: 465 TDTAEMVKRASPWGAYIHDGWKLIL 489


>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
 gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
          Length = 468

 Score =  171 bits (434), Expect = 7e-40,   Method: Compositional matrix adjust.
 Identities = 116/341 (34%), Positives = 172/341 (50%), Gaps = 28/341 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYP--IHTGMQG 148
           G  DL  +GS +I TP IDA+A  G+   N YA   VC+P+R +L+TG+Y      G++ 
Sbjct: 49  GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P  +  +   +P   R LP  LR+LGY+T  +GKWH+G      +PL  G++  FG  +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASG 167

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTVGEYATDLFTKEAVQLIEDQPV-D 265
              Y+ H           +NGH+M +     T     G Y TDL   +A+  ++     D
Sbjct: 168 GTDYFAH--------ATTINGHEMGKLFENRTEIQRPG-YLTDLLGAKAIDRMQQAARQD 218

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +P F+ L   A H    G    A        + +   N R YAAM++ +D +VG V++ L
Sbjct: 219 RPFFISLHFTAPHWPWQGPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAEL 278

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
              G+ E++I+IF SDNG              +  ++P  G+K  L EGG++VP I+  P
Sbjct: 279 AALGLEEDTIVIFTSDNGG-----------ERFSDSWPLTGMKGELLEGGIRVPLIVSWP 327

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
                 R S Q+    D LPTL +AAGGD  R+   +DG+D
Sbjct: 328 AHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVE-GLDGID 367


>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 474

 Score =  171 bits (434), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 123/370 (33%), Positives = 188/370 (50%), Gaps = 54/370 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ +   +G  +IPTPNID L  +G+  ++ Y + P C  SRA+LMTG+Y    G +  P
Sbjct: 39  GYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNP 98

Query: 151 IWG--AEP-RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           I    A+P  G+P+ E+ + + LR++GY+T  +GKWHLG     + P  RGF+  FG+L+
Sbjct: 99  IGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLG-GTAPFHPQRRGFDEFFGFLH 157

Query: 208 -----------GVISYYDH--ILSDQYSRTVELNGH-----DMRRNLSTAWDT------- 242
                      G  ++     +      R    +GH     D+  N   A+D        
Sbjct: 158 EGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLHEN-EPAYDADNPLLRN 216

Query: 243 ---VGEYA--TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
              V E A  TD FT+EA   I D+   +P FLYLA+ A H+      L+     + +F 
Sbjct: 217 SQPVEEKANLTDAFTREACSFI-DRHQAQPWFLYLAYNAVHS-----PLQGEDTYMEKFS 270

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
           +I D  RR +AA++  LD+ +G V + L+  G+ EN++++F+SDNG PT E         
Sbjct: 271 HIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKELTS------ 324

Query: 358 WGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
             SN P RG K  LW+GG+++P A+ W  QI     +    + + D   T    AG +T 
Sbjct: 325 --SNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISM-DLTATALKLAGAETE 381

Query: 417 RLPLNIDGLD 426
           +  L  DG+D
Sbjct: 382 QAKL--DGVD 389


>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 454

 Score =  171 bits (434), Expect = 8e-40,   Method: Compositional matrix adjust.
 Identities = 117/353 (33%), Positives = 183/353 (51%), Gaps = 48/353 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW D+ F+G  E  TPN+D LA  G      Y A  VC PSRA+LMTG+Y IH G+    
Sbjct: 51  GWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMTGRYTIHDGVS--- 107

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR---REYT-PLYRGFESHFGYL 206
                   +P  E  L E  +  GY T   GKWH G  R   + Y  P+ +GF+  FG+ 
Sbjct: 108 ---RNNDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVHPMDQGFDEFFGFT 164

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE---DQP 263
           +   ++      ++Y   +  +G +++         V  Y+ D+F   A+  ++   ++P
Sbjct: 165 DAKHAW------EKYPEQL-WHGRELK--------PVSGYSDDMFADHAIDFLKRHKEKP 209

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETI----NQFQYI--TDPNRRTYAAMVKKLDDS 317
              P FLY+  +  H      ++EAP E +     +F  +  + P R TYAAMV +LD +
Sbjct: 210 T--PFFLYVPFINTHF-----NIEAPAEEVALHQGKFTEVDPSKPIRATYAAMVTQLDKN 262

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE--TSNYRNWGSNYPYRGVKNTLWEGG 375
           VG +++AL+  G+  +++++F+SD+GA T E      S+Y +  SN P+RG K TLWEGG
Sbjct: 263 VGRIMTALEGLGLAADTLVVFVSDHGA-TFESGNLGASDYHD--SNRPFRGQKRTLWEGG 319

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
           ++VP +   P       +S +++H++D  PTL  A G        ++DG D W
Sbjct: 320 IRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATGAH-PEAAWHVDGADLW 371


>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
          Length = 512

 Score =  171 bits (434), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 154/548 (28%), Positives = 231/548 (42%), Gaps = 78/548 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWN+  ++   ++ TP+ DAL   G+ L N Y    C+P+R+S ++G+ PIH   +    
Sbjct: 32  GWNNGFYN--EDVLTPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGRLPIHVNQENSAT 89

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
             A   GVPL    + E +R  GY T  +GKWH G     +TP  RGF+S  G+ N    
Sbjct: 90  EQAG-AGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGFDSSIGFFNFGED 148

Query: 212 YYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           +Y  +   Q       + + +G D+ R+ + A    G Y    FT+ AV  +E     KP
Sbjct: 149 HYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAAVAAVEANDT-KP 207

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LFL+ A    H       LE P E +   +Y  +  R T   M   LDDSVG V  AL++
Sbjct: 208 LFLFAAFQNLH-----PPLEVPPEYVA--RYANETLRTTINGMAAFLDDSVGNVTKALKK 260

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
           + M ++S+I++  DNG    +          G + P+RG K + ++GGV+V A +    +
Sbjct: 261 QNMYDSSLIVYTPDNGGYLTQ---------GGDDAPFRGGKFSDFQGGVRVAAFISGGLV 311

Query: 388 QQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
               R   S  ++H+ DW+ T     GG    +P +       +   L  P       D 
Sbjct: 312 PAGLRGTASAALVHVCDWMATFAALGGG---AIPAD-------AVAGLPAP-------DS 354

Query: 446 LDQWSSLLLNTPSRRNSVLINI---DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           LD W +L    PS R  V ++I   D   R A V  D     +G +         G    
Sbjct: 355 LDVWPALSRGRPSPRTEVPLSILPADGAAR-AFVDFDGVGYFVGGE---------GLIVG 404

Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
           +   LL          YQ            I   DK+ +          A   P  P  C
Sbjct: 405 DMKLLLG---------YQH-----------IGPFDKVTNATCAGVAGPWAAGRPGVPCDC 444

Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL-VPQSHEQPDLVQADPKR 621
               C L+N+  DP E +++A++RP   + L   L+  R T+  P    + D   A    
Sbjct: 445 ATSGC-LYNVTADPTESDDLAAARPGAVAALRTRLEALRATVYAPDRGAKDDAACAQIAT 503

Query: 622 FNDTWSPW 629
               W PW
Sbjct: 504 NGGFWGPW 511


>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 480

 Score =  171 bits (434), Expect = 9e-40,   Method: Compositional matrix adjust.
 Identities = 117/337 (34%), Positives = 175/337 (51%), Gaps = 32/337 (9%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L    GW DLS++G  +  TP++D LA  G   +  Y A  VC PSRA+L+TGK PI
Sbjct: 39  VLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDRFYTAGVVCAPSRAALLTGKNPI 98

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE----YTPLYRG 198
           H G      W  +   +P  E  + E L+  GY T   GKWH G   RE      PL +G
Sbjct: 99  HCGT-----WRND-HDLPAEEVTIAEALKAKGYHTALYGKWHQGRPTREGGTRTHPLDQG 152

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           F+  FG+L+   ++      +++   +  NG D R  +       G YA D FT + V+ 
Sbjct: 153 FDETFGFLSATHAW------EKFPNHL-FNGRD-RVEVPQ-----GRYADDWFTDKGVEF 199

Query: 259 IEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDD 316
           I     D  P FLYL   + H      H+EAP++ I + +   +DP +  YAA+V ++D 
Sbjct: 200 IARHKDDPTPFFLYLPLTSTHF-----HIEAPEDEIAKHRSTFSDPIQTAYAAIVTRMDA 254

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
            VG ++ A++  G+ ++++I+F SD+GA T E        +  SN+P+RG K T++EGG+
Sbjct: 255 QVGRIVQAVRDAGLEDSTLIVFTSDHGA-TFEGGSKEASSSLDSNHPFRGQKRTVYEGGI 313

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
           +VP I+  P +    R+    +   D LPT    AGG
Sbjct: 314 RVPGIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGG 350


>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 445

 Score =  171 bits (433), Expect = 1e-39,   Method: Compositional matrix adjust.
 Identities = 119/353 (33%), Positives = 181/353 (51%), Gaps = 42/353 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPN+D LA  G++  + ++   VC+P+RA+LMTGKY   TG++G  
Sbjct: 39  GYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVV 98

Query: 151 IWGAE-PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
              +    G+ L E  L E L++LGY+T   GKWHLG+  + + P  +GF+   G+++G 
Sbjct: 99  TAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLGY-DKAFNPTLQGFDEFVGFVSGN 157

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG-----EYATDLFTKEAVQLIEDQ-P 263
           + Y+ HI  + Y                  WD V       Y TDL ++  V+ I++  P
Sbjct: 158 VDYHGHIDQEGYL---------------DWWDGVKIKNEKGYTTDLISEYGVKFIQEHNP 202

Query: 264 VDK--PLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAAMVKKLDDSVG 319
             K  P FLYL H A H+    +  +  +E  T    + I D     Y  MV+ +D+ VG
Sbjct: 203 EVKRAPFFLYLPHEAPHSPYQRRIDKVLREIGTAGTQEVIQDSISSIYKEMVEVMDEGVG 262

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            ++ +L+  G  EN+I+IF+SDNGA            ++G N   RG K   +EGG +VP
Sbjct: 263 RIMQSLKETGQYENTIVIFISDNGA-----------NHYGDNGGLRGFKAGPYEGGSRVP 311

Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           AI   P+  +   V+ Q +   D LPTL    G   S    ++DG+    +LL
Sbjct: 312 AIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIGRKPST---SVDGISIKENLL 361


>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
 gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
          Length = 458

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 117/347 (33%), Positives = 175/347 (50%), Gaps = 27/347 (7%)

Query: 74  RTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
           R  AA  +   +  ++    G+ DLSF G  E  TP+ID++A  G   ++ YA   VC+P
Sbjct: 19  RVEAAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDSIAEQGCFFSDAYANSSVCSP 78

Query: 131 SRASLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
           +R +L TG+Y   +  G+  P   G EP G+P     +   LRE GY T  +GKWHLG  
Sbjct: 79  TRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIASRLREAGYRTVLVGKWHLGSP 138

Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
            R ++PL  G++  FG   G   Y+ H      S++++  G     + S+  D  G Y T
Sbjct: 139 PR-FSPLLSGYDRFFGIHGGGADYFTH------SQSMQGGGSSDLYDGSSVTDRHG-YLT 190

Query: 249 DLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           D+ T+ AV  I+      KPLFL L + A H    G   EA    I         +  T+
Sbjct: 191 DILTERAVAEIDLAARTRKPLFLSLHYTAPHWPWEGPEDEALAAQIKSLLNYDGGSDSTF 250

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           A MV+ LD  VG V++AL+ +GM +++I++F SDNG              +   +P+ G+
Sbjct: 251 AKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGG-----------ERFSKVWPFDGM 299

Query: 368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
           K  L EGG++VP ++ W  +I    R S Q+    D++PT    AGG
Sbjct: 300 KGDLLEGGIRVPMMMRWPGKIPAGSR-SAQIAMTMDYMPTFLATAGG 345


>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
           domestica]
          Length = 520

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 162/569 (28%), Positives = 243/569 (42%), Gaps = 111/569 (19%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
            A ++   +  L++   GW DL   G     TP++D +A  G++  N Y A P+C+PSRA
Sbjct: 22  GATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGMLFPNFYTANPLCSPSRA 81

Query: 134 SLMTGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +L+TG+ PI  G     G       P+    G+P +E  LPE L++ GY  K +GKWHLG
Sbjct: 82  ALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG 141

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
             R ++ PL  GF+  FG  N     YD+  + +    R  E+ G     +      T  
Sbjct: 142 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPVYRNWEMVGR-FYEDFPINHKTGE 199

Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
              T ++ KEAV  I+ Q    +P FLY A  A HA                  ++    
Sbjct: 200 ANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHA-----------PVYASKSFLGTSQ 248

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           R  Y   V+++DDS+G ++  LQ  G+ EN+ + F SDNGA  +     S  +  GSN P
Sbjct: 249 RGRYGDAVREIDDSIGKILKLLQDLGIKENTFVFFTSDNGAALI-----SAPKEGGSNGP 303

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           +   K T +EGG++ PAI W P      +VS Q+ ++ D   T  + AG           
Sbjct: 304 FLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVMDLFTTGLSLAG----------- 352

Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
                    L  PS R   IDG+D   +++      R       +E     AVR+  +K 
Sbjct: 353 ---------LKPPSDR--EIDGIDLLPAIVQGKVIDRPIFYYRGNE---MMAVRVGQYKA 398

Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDKMRST 542
              T  N                              S ++ SQ I F P  N+  + + 
Sbjct: 399 HYWTWSN------------------------------SWEEFSQGIDFCPGQNVSGVTTH 428

Query: 543 RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR---PDISSQLYELLKY 599
            Q+                 TN P  LF+LG DP E+  I  S     D   +++ +++ 
Sbjct: 429 SQEEH---------------TNLPL-LFHLGRDPGERFPIRVSSLEYKDALLRIFPIVEE 472

Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           H+RTL+P    +P L   +P   N  W+P
Sbjct: 473 HKRTLIPG---EPQLNVCNPAVMN--WAP 496


>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
 gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
          Length = 468

 Score =  170 bits (431), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 118/343 (34%), Positives = 173/343 (50%), Gaps = 32/343 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYP--IHTGMQG 148
           G  DL  +GS +I TP IDA+A  G+   N YA   VC+P+R +L+TG+Y      G++ 
Sbjct: 49  GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P  +  +   +P   R LP  LR+LGY+T  +GKWH+G      +PL  G++  FG  +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASG 167

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTVGEYATDLFTKEAV---QLIEDQP 263
              Y+ H           +NGH+M +     T     G Y TDL   +A+   +L   Q 
Sbjct: 168 GTDYFAH--------ATTINGHEMGKLFENRTEIQRPG-YLTDLLGAKAIDRMRLAARQ- 217

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            D+P F+ L   A H    G    A        + +   N R YAAM++ +D +VG V++
Sbjct: 218 -DRPFFISLHFTAPHWPWQGPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLA 276

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L   G+ E++I+IF SDNG              +  ++P  G+K  L EGG++VP I+ 
Sbjct: 277 ELAALGLQEDTIVIFTSDNGG-----------ERFSDSWPLTGMKGELLEGGIRVPLIVS 325

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            P      R S Q+    D LPTL +AAGGD  R+   +DG+D
Sbjct: 326 WPAHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVE-GLDGID 367


>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
 gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
           3_1_57FAA_CT1]
          Length = 460

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 148/512 (28%), Positives = 224/512 (43%), Gaps = 95/512 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ D          TPN+D L   G  +++ YA  PVC P+RA+L+TG+YP  TG     
Sbjct: 19  GYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGRYPHRTGAVDTY 78

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             I G     + L E  L +  R  GY T  IGKWHLG   +EY P  RGF++  G+  G
Sbjct: 79  EAIGGDR---MALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRGFDTFIGFRGG 135

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              YY + L               R  +  A D  G Y TD+ T+E+++ I +   ++P 
Sbjct: 136 WSDYYQYKLD--------------RNGILEASD--GTYMTDVITEESIRFIREN-REQPF 178

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL+ A+ A H        + P+E    F+   +P   T   M+  +D+ +G ++  L+  
Sbjct: 179 FLHAAYNAPHFP-----FQCPEEYAAPFRKRFNPTLATLYGMIACMDEGIGKLLDVLEET 233

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
           G+ +N+I+IF SDNG P + Y +T+ Y     N    G K   +EGG++VPA++ W   I
Sbjct: 234 GLSDNTIVIFASDNG-PQL-YGDTNRY-----NCYLNGQKGEAFEGGIRVPAVIRWPGHI 286

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDGLDQWSSLLLNTPSRRNSNIDG 445
             + R      H  DW PTL +A       LP  +++DG ++   +LL + S        
Sbjct: 287 APDSRCH-AFFHGVDWFPTLLSAC---RLELPDGVSLDGKNRLDEILLGSSSPSEPY--- 339

Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
             QW+     TP             K  AAVR   WKLV    E              +V
Sbjct: 340 YWQWNRF---TPV-----------SKCNAAVRCGKWKLVWPPVEEAM-----------QV 374

Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
           P      +VE       ++   +   P    D  R             PAP  P      
Sbjct: 375 PPY----VVEIDERIKQEETVPDTLYPFD--DSYRRI-----------PAPHPPR----- 412

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
              L+NL +DP E+++++   P+I+  L ++ 
Sbjct: 413 ---LYNLEDDPLERHDVSIDYPEITESLSQVF 441


>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
 gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 489

 Score =  170 bits (430), Expect = 2e-39,   Method: Compositional matrix adjust.
 Identities = 138/467 (29%), Positives = 213/467 (45%), Gaps = 85/467 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI------HT 144
           G+ DL F GS+   TPN+D LA   +  +  Y+  P C PSR S+MTGKYP       H 
Sbjct: 50  GFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHG 109

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
            + G    G    G+P+TE  + E L++ GY+T  IGKWH+G  + E  P  RGF+    
Sbjct: 110 KLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVD-- 165

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD----TVGEYATDLFTKEAVQLIE 260
                I+  +      Y    E N ++ +R  S   D      G++ TD   +EAV+ I 
Sbjct: 166 -----IASNEFCCPGSYMYPFESN-NEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIH 219

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMVKKLDD 316
               +KP FL ++  A H       + A  E + +++ +  P+ R    TYA +V+ LDD
Sbjct: 220 STD-EKPFFLNMSFYAVHTP-----ITAIPEKVEKYKRLIGPDARQKNPTYAGLVEHLDD 273

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG ++ AL+ KG+++N+II+F SDNG   +             N+P R  K + +EGG 
Sbjct: 274 AVGAILKALEEKGIIDNTIIVFTSDNGGEIL--------HGITDNFPLRDGKGSSYEGGT 325

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN-- 434
           +VP ++  P + Q   VS + +   D+ PT  + AG +      +IDG D +S LL N  
Sbjct: 326 RVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMAGANPDL--HDIDGRD-FSPLLKNPK 382

Query: 435 --------------------TPSRRNSNI------------------DGLDQWSSL--LL 454
                                P   N                     DG  Q   L  L 
Sbjct: 383 EKLEERDLHWLKYLSLIHYRIPIADNKRCFETIVSGDWKLHEYFQMPDGYKQHFELYNLK 442

Query: 455 NTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
           + PS +N++     EK +   ++++SWK  +G      M+ +YG  +
Sbjct: 443 DDPSEKNNLANKFPEKVQELKLKMESWKKDIGAPVY-DMEKFYGHVK 488


>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
 gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
          Length = 474

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 112/345 (32%), Positives = 175/345 (50%), Gaps = 33/345 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS+E  TP++D LA   I  +  Y +  VC PSRA ++TGKY    G +   
Sbjct: 47  GYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 106

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+PL ++ LPEYL+E GY T   GKWH+G   + + P  RGF++ 
Sbjct: 107 VPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKWHMGNADK-FHPTKRGFDTF 165

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +G+  G  SYY+   +++ +R  +     + R     ++    Y TD   +     IE  
Sbjct: 166 YGFRGGARSYYEFNENNKNNRQED----RLERGFGN-FEESKLYLTDALAEATTDFIEKN 220

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
              +P F+YL+  A H       +EA  + + QF  +    R+T AAM   LD + G V+
Sbjct: 221 Q-KQPFFVYLSFNAVHT-----PMEARPDDLKQFPNLKG-KRKTLAAMTLGLDRACGNVL 273

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
           + L   G+ EN++I+F +DNG P+          N  SN P  G K    EGG++VP ++
Sbjct: 274 AKLDELGVRENTLIVFTNDNGGPSGT--------NASSNKPLGGTKANHLEGGIRVPFLM 325

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            W  Q+++N      +  + D LPT Y  AGG+   +  +IDG++
Sbjct: 326 SWPKQLKENTVYDYPISTL-DLLPTFYQIAGGNEKNIN-DIDGVN 368


>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
 gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
          Length = 498

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 128/412 (31%), Positives = 200/412 (48%), Gaps = 49/412 (11%)

Query: 65  FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           F + T+ +R   AA +    + +++    G+ D+   GS  + TPN+D LA +G++ +  
Sbjct: 19  FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75

Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
           Y A  VC+PSRA L+TG+ P   G +G        +   P   G+P +E+ L ++L   G
Sbjct: 76  YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTLADHLGAAG 135

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
           Y+T  IGKWHLG     + P  RGF+   G L G   Y+   ++           H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMN-----------HVIER 183

Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           N     +   EY TD FT E ++ I+       D+P F++ ++ A H       + A + 
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHATEA 238

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
            + +F  I +  RRTYAAM+  LD  VG +   L+  G  EN++++F SDNG  T     
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 293

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
             N  +W  N P RGVK ++ EGG++VP I  W  +     R    +  + D LPT  +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVISSL-DLLPTFCSA 348

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
           AG +   L   +   D  +   +N   R +   DG+D    L+  +  P+RR
Sbjct: 349 AGAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLVDGSEPPNRR 397


>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 462

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 105/301 (34%), Positives = 162/301 (53%), Gaps = 22/301 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ F+G  +IPTP ID++A NG+  ++ Y    VC PSRA  +TG+Y    G +  P
Sbjct: 34  GYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPSRAGFITGRYQQRFGFERNP 93

Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            W   +P   +P +E  + E L+++GY    IGKWHLG       P  RGF   FG+L G
Sbjct: 94  QWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLGA-EPSLRPNQRGFNEFFGHLGG 152

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT---VGEYATDLFTKEAVQLIEDQPVD 265
             +Y+   L    +  V+ N  D   +  T  DT     +Y TD F+ EA++ +E    +
Sbjct: 153 GHAYFPEKLRIIKTEDVK-NEMDSYASYITRNDTPVKTTKYLTDEFSDEAIRFVEKN-YE 210

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +P FL+L++ A H       L+A Q+ +++F +I D NRRTY AMV  +DD +  ++  L
Sbjct: 211 QPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNRRTYCAMVSAMDDGIVRLMKKL 265

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
               + +++I+ F+SDNG PT         +N   N   RG K  ++EGG +VP  +  P
Sbjct: 266 DALKIKDDTIVFFLSDNGGPTT--------KNNSDNGDLRGKKGDVFEGGFRVPFAMQYP 317

Query: 386 Q 386
           +
Sbjct: 318 R 318



 Score = 50.1 bits (118), Expect = 0.004,   Method: Compositional matrix adjust.
 Identities = 25/69 (36%), Positives = 43/69 (62%), Gaps = 6/69 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y TD F+ EA++ +E    ++P FL+L++ A H       L+A Q+ +++F +I D NR
Sbjct: 191 KYLTDEFSDEAIRFVEKN-YEQPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNR 244

Query: 74  RTYAALTKS 82
           RTY A+  +
Sbjct: 245 RTYCAMVSA 253


>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 540

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 184/386 (47%), Gaps = 55/386 (14%)

Query: 75  TYAALT----KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC 128
           T+ AL     K   + + +V   GW D+S  G+  I TP ID+L   G+ L+  Y  P+C
Sbjct: 12  TFCALANAEEKRPNIIIFLVDDLGWADISLRGA-PIDTPAIDSLFSEGLTLDRFYTTPIC 70

Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
           +P+RA+LMTG+ P+  G+    +      GV   E F+PE  +  GY T  +GKWHLG  
Sbjct: 71  SPTRAALMTGRDPLRLGISYSVVMPWMNNGVHPDEHFMPESFKAAGYQTAMVGKWHLGHS 130

Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
           +  + P  RGF+  +G+L+  + Y+    +          G D +RN  T  D    Y T
Sbjct: 131 QEIFHPNARGFDDFYGHLHTEVGYFLPFANQ--------GGVDFQRNGVTIADE--GYET 180

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT-- 306
            L   EA + I+ +  DKP FLY+  +A H+      LEAP + + +++ + D    T  
Sbjct: 181 FLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTRELTRS 235

Query: 307 -------------------YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
                              YAA+V  LD ++G V+  L ++G+ E +II+F SDNG    
Sbjct: 236 AAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNGGAAY 295

Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPT 406
                        N+P RG K   +EGG++V A + W   +         +M + D  PT
Sbjct: 296 AGGGA-------DNFPLRGGKGDTYEGGIRVVAAMRWKGTLDAGKSFE-SIMTVMDVFPT 347

Query: 407 LYTAAGGDTSRL-PLNIDGLDQWSSL 431
           L +AAG   S L    I G D WS++
Sbjct: 348 LASAAG--VSMLNTKEIWGRDLWSAI 371



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 6/72 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y T L   EA + I+ +  DKP FLY+  +A H+      LEAP + + +++ + D    
Sbjct: 178 YETFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTREL 232

Query: 75  TY-AALTKSTTL 85
           T  AA+ ++ TL
Sbjct: 233 TRSAAIDRTRTL 244


>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 193

 Score =  169 bits (429), Expect = 3e-39,   Method: Compositional matrix adjust.
 Identities = 77/130 (59%), Positives = 98/130 (75%), Gaps = 3/130 (2%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM---QG 148
           G+ND+SFHGS +IPTPNID +A  G+ILNN Y  P+CTPSR+++MTG+YPIHTG+     
Sbjct: 46  GFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYHTY 105

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             I+ A   GV L E+FLP+YL+ +GY T AIGKWHLGFF +EYTP YRGF+S +GY  G
Sbjct: 106 DTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYYGG 165

Query: 209 VISYYDHILS 218
              Y+DH L+
Sbjct: 166 QADYWDHSLA 175


>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
          Length = 522

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 178/362 (49%), Gaps = 36/362 (9%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           +G     G       P+    G+P +E  LP  L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 94  SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 152

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 208

Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA++ I+  Q   +P FLY A  A HA                  ++    R  Y   
Sbjct: 209 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDA 257

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           +++LDDSVG ++  LQ  G+ EN+ + F SDNGA  +     S  R  GSN P+   K T
Sbjct: 258 IRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALI-----SAPRQGGSNGPFLCGKQT 312

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P      +VS Q+  I D   T  + AG +  R    IDGLD   +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 371

Query: 431 LL 432
           +L
Sbjct: 372 ML 373


>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
 gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
          Length = 498

 Score =  169 bits (427), Expect = 5e-39,   Method: Compositional matrix adjust.
 Identities = 123/383 (32%), Positives = 186/383 (48%), Gaps = 44/383 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ D+   GS  + TPN+D LA +G++ +  Y A  VC+PSRA L+TG+ P   G +G  
Sbjct: 45  GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104

Query: 149 ---PPIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                 +   P   G+P++E+ L ++L   GY+T  IGKWHLG     + P  RGF+   
Sbjct: 105 NASDERYATRPELLGLPVSEKTLGDHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFC 163

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           G L G   Y+   ++           H + RN     D   EY TD FT E ++ I+   
Sbjct: 164 GMLTGSHHYFPTTMN-----------HVIERNGQRVEDFSNEYLTDFFTDEGLRFIDQHE 212

Query: 264 V---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
               D+P F++ ++ A H       + A +  + +F  I +  RRTYAAM+  LD  VG 
Sbjct: 213 AAKPDQPWFVFYSYNAPHT-----PMHATEADLARFANIQNKKRRTYAAMMFALDRGVGR 267

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           +   L+  G  +N++++F SDNG  T       N  +W  N P RGVK ++ EGG++VP 
Sbjct: 268 IRKHLEETGQWKNTLLVFFSDNGGAT-------NNGSW--NGPLRGVKGSMREGGIRVPM 318

Query: 381 I-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
           I  W  +     R    +  + D LPT  +AAG     L   +   D  +   +N   R 
Sbjct: 319 IWTWPAKFPAGKRYDGVISSL-DLLPTFCSAAGAQPLALAEPMSHEDASNRKRMN---RL 374

Query: 440 NSNIDGLDQWSSLLLNT--PSRR 460
               DG++    L+  +  PSRR
Sbjct: 375 AGTHDGIEMTPHLVDGSEPPSRR 397



 Score = 42.0 bits (97), Expect = 0.98,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
           + RN     D   EY TD FT E ++ I+       D+P F++ ++ A H       + A
Sbjct: 181 IERNGQRVEDFSNEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNAPHT-----PMHA 235

Query: 58  PQETINQFQYITDPNRRTYAAL 79
            +  + +F  I +  RRTYAA+
Sbjct: 236 TEADLARFANIQNKKRRTYAAM 257


>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
 gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
          Length = 622

 Score =  168 bits (426), Expect = 7e-39,   Method: Compositional matrix adjust.
 Identities = 147/544 (27%), Positives = 236/544 (43%), Gaps = 98/544 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW D+ ++      TP +D L  NG     MY+   C+PSRA  +TG+Y   +GM   PI
Sbjct: 40  GWADVGWNNKGLESTPFMDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
               P G+   ++ LPEYL+E+GY T A+GKWHLG     Y P  RGF++ +G+ +G + 
Sbjct: 100 AREVPFGMNTQDKTLPEYLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVD 159

Query: 212 YYDHILS-------DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           Y  H +        D +  T+E +  D+  +        G++ TDLF    + ++++   
Sbjct: 160 YRGHFIKRSKNFYHDFFDNTIEQHKLDLESD--------GQWTTDLFRDRTIDILKEAKR 211

Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVG 319
            K P ++YLA  A H     +   AP + I +      PN    R  + A V ++D +V 
Sbjct: 212 SKTPAYVYLAFNAPH-----EPTRAPADLIARI-LEKHPNLPYTRAEHLAAVNQIDTAVR 265

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG------------SNYPYRGV 367
            +     +K +   ++I+FMSDNG    ++    +  N G             NYP++G 
Sbjct: 266 QIYQQ-AKKDVKRETLIVFMSDNGGAVKKFGGGRSPDNTGHEYDDEESLPRACNYPFKGF 324

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLP-LNIDGL 425
           KN+L+EGG+  P+ L S   ++  R       HI DW PT    AG     +P  + DG+
Sbjct: 325 KNSLFEGGILSPSFLIS-TARKFSRTRFNFPFHIIDWAPTFLDFAG----IVPDADFDGI 379

Query: 426 DQWSSLLLN-TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----AAVRLDS 480
            + S L      S+R + + G                 VL   +EKK       AVR   
Sbjct: 380 SRKSELESRWIKSQRTNFVYG-----------------VLDFYNEKKNIWEDQIAVRFGK 422

Query: 481 WKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR 540
           WK                         +NFN  V +      +     +   + +   + 
Sbjct: 423 WK------------------------FMNFNMAVNTFKCDEHENQPVTVSRNMQDFLNVP 458

Query: 541 STRQQATIHCGANPA-PMTPSPCTNGPCYLFNLGNDPCEQNNI---ASSRPDISSQLY-E 595
           ++  +   H     A P T     +G   L+NL NDP E +N+    S  P+  +++  E
Sbjct: 459 TSYWERNFHNSITVAEPYTKKTLYDGFS-LYNLENDPFELDNLIDPISGNPEAYNEIIEE 517

Query: 596 LLKY 599
           ++K+
Sbjct: 518 IMKF 521


>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
          Length = 503

 Score =  168 bits (426), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 127/370 (34%), Positives = 183/370 (49%), Gaps = 42/370 (11%)

Query: 81  KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
           K   L LL++   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+T
Sbjct: 10  KPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLT 69

Query: 138 GKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           G+ PI  G     G       P+    G+P  ER LPE L+  GY++K +GKWHLG  R 
Sbjct: 70  GRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG-HRP 128

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGE 245
           ++ PL RGF+  FG  N     YD+         R  E+ G    +   NL T    +  
Sbjct: 129 QFHPLKRGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL-- 186

Query: 246 YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
             T L+T+EA+  ++ Q    +P FLY A  A HA                  ++    R
Sbjct: 187 --TQLYTQEALDFVQRQHAARRPFFLYWAIDATHA-----------PVYASRPFLGTSRR 233

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
             Y   V+++DDSVG ++S L+   + E++ + F SDNGA  +     S  +  GSN P+
Sbjct: 234 GRYGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALI-----SAPKQGGSNGPF 288

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNI 422
              K T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   I
Sbjct: 289 LCGKQTTFEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAGLAPPSDRV---I 345

Query: 423 DGLDQWSSLL 432
           DGLD   ++L
Sbjct: 346 DGLDLLPAML 355


>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
 gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
          Length = 462

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 126/403 (31%), Positives = 189/403 (46%), Gaps = 56/403 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  FHGS E  TP +D  A N +  +  Y +  VC PSRA L+TGKY    G +   
Sbjct: 37  GYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENN 96

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+PL ++ + +YL+E GY T   GKWH G   R + P  RGF+  
Sbjct: 97  VPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQGNADR-FHPTKRGFDEF 155

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +G+  G  SY      ++ +R    N   + R      +  G Y T+    EA+  IE  
Sbjct: 156 YGFRGGARSYMPFGADNELTR----NEDRLERGFGGFQEHEG-YLTEELADEAIAFIERN 210

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             + P F+YLA  A H       +EA +E + + + ++   R+T AAM   +D + G V+
Sbjct: 211 QKN-PFFVYLAFNAVHT-----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDRACGKVL 263

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             ++  G+ +N++I+F +DNG P+          N  SN P  G K    EGG++VP ++
Sbjct: 264 DRIKALGLDKNTLIVFTNDNGGPSDA--------NESSNLPLSGTKANHLEGGIRVPFLM 315

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
            W  +++ N  V    +   D LPT  TAAGGD +     IDGL                
Sbjct: 316 SWPGKLKPNT-VYENPISTLDLLPTFVTAAGGDVN----AIDGL---------------- 354

Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
             DG+D    L      R +  L    +K+   A+R + WKLV
Sbjct: 355 --DGVDLLPYLQGKKQGRPHETLYW--KKEARGAIRHNDWKLV 393


>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
 gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
           meyeri serovar Hardjo str. Went 5]
          Length = 583

 Score =  168 bits (425), Expect = 8e-39,   Method: Compositional matrix adjust.
 Identities = 130/384 (33%), Positives = 196/384 (51%), Gaps = 70/384 (18%)

Query: 92  GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G+ND++ +G       +PTPNID++A +G+  ++ Y+   VCT SRA+LMTG+YP   G+
Sbjct: 108 GFNDVTTYGGGYADLGVPTPNIDSIAKDGVRFDSGYSGSAVCTVSRAALMTGRYPSRFGV 167

Query: 147 QGPPIWGAEPR-------------------------------GVPLTERFLPEYLRELGY 175
           +  P  GA  R                               G+P +E  + E L+E GY
Sbjct: 168 EFTPTPGALARVGADLYADPNRLYPVVIDKEKAEKSKSFNELGMPGSEITIAEVLKERGY 227

Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD---- 231
            +  IGKWHLG    E  P  +GF+      +G+    D    + Y+   + +  D    
Sbjct: 228 HSVHIGKWHLGS-TEEMRPNKQGFDESLFMESGLYLPVDD--PNVYNAKQDFDPIDRFLW 284

Query: 232 --MRRNLS---TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHL 286
             MR  +S     W     Y TD FT EAV++I+     +P FL+LAH A H       L
Sbjct: 285 PNMRFGVSYNGGKWFEPSRYLTDYFTDEAVKVIKTNK-HRPFFLFLAHWAVHT-----PL 338

Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
           +A +E  +   +I D  +R Y +M++ LD SVG ++++L+ +G+ EN+I+IF SDNGAP 
Sbjct: 339 QASKEDYDALSHIKDHRKRVYLSMIRSLDRSVGKILASLKEEGLDENTIVIFTSDNGAP- 397

Query: 347 VEYRETSNYRNW-GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWL 404
                  NY      N PYRG K TL++GG++VP +  W   I+   +    + +I D L
Sbjct: 398 -------NYIGLPDVNSPYRGWKLTLFQGGIRVPYVAKWPGHIKPGTKYQNAITNI-DIL 449

Query: 405 PTLYTAAGGDTSRLPLN--IDGLD 426
           PT+  AAG    +LPL+  IDG++
Sbjct: 450 PTVANAAG---IKLPLDREIDGVN 470



 Score = 47.4 bits (111), Expect = 0.025,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 6/74 (8%)

Query: 9   WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
           W     Y TD FT EAV++I+     +P FL+LAH A H       L+A +E  +   +I
Sbjct: 298 WFEPSRYLTDYFTDEAVKVIKTNK-HRPFFLFLAHWAVHT-----PLQASKEDYDALSHI 351

Query: 69  TDPNRRTYAALTKS 82
            D  +R Y ++ +S
Sbjct: 352 KDHRKRVYLSMIRS 365


>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
           griseus]
          Length = 493

 Score =  168 bits (425), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 124/359 (34%), Positives = 178/359 (49%), Gaps = 40/359 (11%)

Query: 90  VYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM-- 146
           V GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G   
Sbjct: 10  VMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYT 69

Query: 147 -QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
             G       P+    G+P +E  LPE L++ GY+ K +GKWHLG  R ++ PL  GF+ 
Sbjct: 70  SNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDE 128

Query: 202 HFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAV 256
            FG  N     YD+ +       R  E+ G    +   NL T    +    T L+ +EA+
Sbjct: 129 WFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQLYLQEAL 184

Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
             I  Q   + P FLY A  A HA                  ++   +R  Y   V+++D
Sbjct: 185 DFIRTQHARQSPFFLYWAIDATHA-----------PVYASRPFLGTSHRGRYGDAVREID 233

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DSVG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+   K T +EGG
Sbjct: 234 DSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLCGKQTTFEGG 288

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
           ++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct: 289 MREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRI---IDGLDLLPTML 344


>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 472

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 107/303 (35%), Positives = 159/303 (52%), Gaps = 26/303 (8%)

Query: 93  WNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GPP 150
           W D+S +G  ++PTPNID +A  G+  ++ Y A  VC  SRA LMTG+ P   G      
Sbjct: 39  WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             G    G+P+ ++ + + L+ LGY T A GKWHLG   R++ P  RGF+  FG+L G  
Sbjct: 99  DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGA-DRQFYPTNRGFDEFFGFLAGET 157

Query: 211 SYYD----HILS-----DQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLI- 259
           +Y D     I++     D+Y        H M     +   D   +Y T+  T  AV  I 
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217

Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
                 +P F Y+A+ A H       L+ PQ   ++F  + DP RRTY AM+  +DD VG
Sbjct: 218 RSADAKQPFFSYVAYNAPHW-----PLQVPQAYYDRFANVKDPVRRTYIAMIAAMDDGVG 272

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            ++  L+ +G+ EN+I++F+SDNG P     E   + N  +N+P+   K T  EGG +VP
Sbjct: 273 QILDTLEERGLRENTIVVFLSDNGCP-----ERFGFCN--TNHPWGSGKFTYLEGGTRVP 325

Query: 380 AIL 382
            ++
Sbjct: 326 FMM 328


>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
 gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
          Length = 498

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 129/412 (31%), Positives = 197/412 (47%), Gaps = 49/412 (11%)

Query: 65  FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           F + T+ +R   AA +    + +++    G+ D+   GS  + TPN+D LA +G++ +  
Sbjct: 19  FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75

Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
           Y A  VC+PSRA L+TG+ P   G +G        +   P   G+P +E+ L ++L   G
Sbjct: 76  YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 135

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
           Y+T  IGKWHLG     + P  RGF+   G L G   Y+   +            H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMK-----------HVIER 183

Query: 235 NLSTAWDTVGEYATDLFTKEAVQLI---EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           N         EY TD FT E ++ I   E    D+P F++ ++ A H       + A + 
Sbjct: 184 NGKRVDGFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PMHATEA 238

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
            + +F  I +  RRTYAAM+  LD  VG +   L+  G  EN++++F SDNG  T     
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 293

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
             N  +W  N P RGVK ++ EGG++VP I  W  +     R    +  + D LPT  +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVISSL-DLLPTFCSA 348

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
           AG +   L   +   D  +   +N   R +   DG+D    L   +  P+RR
Sbjct: 349 AGAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 397


>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 443

 Score =  167 bits (424), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 123/377 (32%), Positives = 188/377 (49%), Gaps = 40/377 (10%)

Query: 73  RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPS 131
           R   AA   +  + +L   G +DL + G+ ++ TP+IDALA  G+   N Y+  PVC P+
Sbjct: 19  RAAPAADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNWYSNAPVCAPA 78

Query: 132 RASLMTGKYPIHTGM--QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           R++++TG++P   G+   GP    A   G+P     L   L+  GY T   GKWHLG   
Sbjct: 79  RSAILTGRFPASAGVPDNGP----ALAHGIPT----LASVLKGSGYQTGCFGKWHLGS-T 129

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL--SDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
            E  P   GF+S +G+ +G + YY H     D Y        HD+  N +  ++  G Y 
Sbjct: 130 DETAPTGHGFDSFYGFHSGCVDYYSHRFYWGDNY--------HDLWHNRTEIFED-GRYL 180

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           T+    EA   I     ++P   Y+A  A H       + AP +   +F  +  P R+TY
Sbjct: 181 TERIADEAAGFIGR---NRPFLGYVAFNAPH-----YPMHAPAQYKARFPNLA-PERQTY 231

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN--YRNWGSNYPYR 365
           AAM+  +DD +G +  AL+  G  EN+++ F+ DNGA T E R   N  +   G N  ++
Sbjct: 232 AAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGA-TTEKRAGLNGDFATAGDNGVFK 290

Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           G K +L++GG+ VP  + W   I++        M + D LPT+  A G   + LP  +DG
Sbjct: 291 GYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSM-DILPTICRATG---APLPPRVDG 346

Query: 425 LDQWSSLLLNTPSRRNS 441
            D  +++  N PS   S
Sbjct: 347 SDLLNTIASNAPSPHKS 363


>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
           anophagefferens]
          Length = 351

 Score =  167 bits (423), Expect = 1e-38,   Method: Compositional matrix adjust.
 Identities = 122/350 (34%), Positives = 169/350 (48%), Gaps = 36/350 (10%)

Query: 88  LIVY-----GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           LIV+     G ND+ +     G   I +P IDALA   ++L+  YAQP+CTP+RA+LMTG
Sbjct: 13  LIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAALMTG 72

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           ++   TG+    +   +  G+P  E  + E LR+ GYST  IGKWHLGF ++   P  RG
Sbjct: 73  RHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPTSRG 132

Query: 199 FESHFGYLNGVISYYDH--------ILSDQYSRTVELN-----GHDM---RRNLSTAWDT 242
           F+  FGY  G   Y+ H        + S   +   E       GHD+      L     T
Sbjct: 133 FDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPRTPKT 192

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
              ++ DLF   A +     P DKPLFLY A  A HA   G                 DP
Sbjct: 193 ENVHSADLFAARATETFATAPRDKPLFLYYASQAPHAHFVGAPPRHVAAAAAMGFAAGDP 252

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R   AA+V  LD+ VG V+  ++     EN+++ FM DNG                S++
Sbjct: 253 -RADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNG---------PEPGTGASSW 302

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           P RG K TL+EGGV+ P+ +++P  +  P  S   +H+ D  PTL   AG
Sbjct: 303 PLRGAKRTLFEGGVREPSFVYAPG-RVAPGASSAPVHVIDVAPTLLGLAG 351


>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
          Length = 540

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 117/365 (32%), Positives = 178/365 (48%), Gaps = 44/365 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--- 148
           GWN  SFH +NEI TP +  L  NG+ L + Y    C+P+RAS +TG++P    M     
Sbjct: 41  GWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRFPYKHEMTNTNL 99

Query: 149 -PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
            PP       G+ L+   L + L++  YST  IGKWHLG +++EYTP YRGF++ FG+L 
Sbjct: 100 LPP---TRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGFDTTFGFLT 156

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK- 266
           G  ++Y           V     D+    + A+   G +   +FT  A+ +I +    + 
Sbjct: 157 GGENHYTQ------RAFVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQLRN 210

Query: 267 ------PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN--RRTYAAMVKKLDDSV 318
                 PLF+Y A    HA       ++P      + +  D    + T+  M+  +D++V
Sbjct: 211 ATGDAPPLFIYFALHDVHAPT-----QSPVRLAQLYNFTGDDAGLKSTFYGMISGVDEAV 265

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPT----------VEYRETSNYRNWGSNYPYRGVK 368
             V  AL+  GM  ++++++ SDNG+PT          V           GSNYP RG K
Sbjct: 266 RNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGSNYPLRGSK 325

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNI---D 423
           +T +EGGV+ P  +  P +  + R      +MH+ DW  TL + AG D +  P  I   D
Sbjct: 326 HTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAGVDPAD-PGGIAPLD 384

Query: 424 GLDQW 428
            +D W
Sbjct: 385 AIDMW 389


>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
          Length = 181

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 71/124 (57%), Positives = 93/124 (75%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GWND+SFHGS++IPTP+IDALA  G+IL N Y  P+CTP+R+++MTGK+PIHTG+Q   I
Sbjct: 40  GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+   E  +P+YLR LGY T  +GKWHLGFF+   TP +RGFES++GY  G+  
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGYYGGMQD 159

Query: 212 YYDH 215
           Y+ H
Sbjct: 160 YFTH 163


>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
          Length = 264

 Score =  166 bits (421), Expect = 2e-38,   Method: Compositional matrix adjust.
 Identities = 93/249 (37%), Positives = 144/249 (57%), Gaps = 20/249 (8%)

Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +Y IHTG+Q   I   +P  +PL +  LP+ L+E GYST  +GKWHLGF+++E  P  RG
Sbjct: 6   RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 65

Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           F++  G L G + YY +   D       + G+D+    + AW+  G+Y+T L+ + A ++
Sbjct: 66  FDTFLGSLTGNVDYYTYDNCD----GPGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKI 121

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           +      +PLFLY+A  A H       L++P+E + +++ + +  RR YAAMV  +D++V
Sbjct: 122 LASHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAV 176

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
             +  AL++ G  +NS+I+F +DNG  T          + GSN+P RG K T WEGG   
Sbjct: 177 RNITRALKKYGYYDNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGTP- 226

Query: 379 PAILWSPQI 387
           PA L SP +
Sbjct: 227 PATL-SPSL 234



 Score = 47.8 bits (112), Expect = 0.017,   Method: Compositional matrix adjust.
 Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 5/75 (6%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
           + AW+  G+Y+T L+ + A +++      +PLFLY+A  A H       L++P+E + ++
Sbjct: 100 NVAWEQSGKYSTLLYAQRAGKILASHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 154

Query: 66  QYITDPNRRTYAALT 80
           + + +  RR YAA+ 
Sbjct: 155 RAMGNVARRKYAAMV 169


>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
          Length = 446

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 115/333 (34%), Positives = 158/333 (47%), Gaps = 40/333 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DLS +G  +  TP++DALA  G+     YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 20  GWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALITGRYQYRLRGGLEE 79

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P +  A   G+P     LP  LR  GY T  IGKWHLG     + PL  G+   FG   G
Sbjct: 80  PLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSLP-TFGPLKSGYNRFFGNYGG 138

Query: 209 VISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
            I Y+ H       +  D Y   V +             + VG Y T L   EA   +  
Sbjct: 139 AIDYFTHKPGVGEAVPRDLYEGEVPV-------------ERVG-YYTQLLADEASGWLRG 184

Query: 262 QPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
           +    KP FL L   A H    G   E     +    +    N  TY  MV+ LD +VG 
Sbjct: 185 RSAGSKPFFLSLHFTAPHWPWVGPEDEEVSLGLKDLFHYDGGNLHTYGKMVRSLDGAVGQ 244

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V+ ALQ +G+ +N+I++F SDNG              +   +P+ G K  L EGG++VP 
Sbjct: 245 VLHALQEQGLADNTIVVFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPT 293

Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +L W  +I   P+VS Q+    DWLPTL  AAG
Sbjct: 294 LLRWPARIA--PQVSDQVTATMDWLPTLLAAAG 324


>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
          Length = 483

 Score =  166 bits (421), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 123/357 (34%), Positives = 177/357 (49%), Gaps = 40/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---Q 147
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTSN 61

Query: 148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           G       P+    G+P +E  LPE L++ GY+ K +GKWHLG  R ++ PL  GF+  F
Sbjct: 62  GHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDEWF 120

Query: 204 GYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           G  N     YD+ +       R  E+ G    +   NL T    +    T L+ +EA+  
Sbjct: 121 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQLYLQEALDF 176

Query: 259 IEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           I  Q   + P FLY A  A HA                  ++   +R  Y   V+++DDS
Sbjct: 177 IRTQHARQSPFFLYWAIDATHA-----------PVYASRPFLGTSHRGRYGDAVREIDDS 225

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           VG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+   K T +EGG++
Sbjct: 226 VGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLCGKQTTFEGGMR 280

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
            PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD   ++L
Sbjct: 281 EPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRI---IDGLDLLPTML 334


>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 490

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/301 (36%), Positives = 156/301 (51%), Gaps = 36/301 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D SF GS +I TPN+DALA +G+     Y   PVC+PSRA LMTG+Y    G     
Sbjct: 50  GYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNI 109

Query: 151 IW-GAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           +   A P   +P  E  LP+ L + GY T  +GKWHLG  +    P  RGF+  FG + G
Sbjct: 110 VAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYERGFDEFFGIITG 168

Query: 209 VISYY-----DHILSDQ-YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
              Y+     +  + DQ Y   +E NG           + V  Y TD F  +AV++I + 
Sbjct: 169 GHDYFVNHPEERAVGDQSYKARIERNG--------PVGEAVPGYLTDAFGADAVRIIRES 220

Query: 263 PV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSV 318
                D+PLFLYLA  A H        +AP++ ++     +   +RRTYAA +  +D SV
Sbjct: 221 HTKRPDQPLFLYLAFNAPHTPT-----QAPKDLVDTMPATLESKDRRTYAAQITSMDASV 275

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V +AL+  GM +++ I+F SDNG     Y +         N P R  K +L+EGG++V
Sbjct: 276 GKVRAALKENGMEKDTFIVFFSDNGGANHPYYD---------NTPLRDHKGSLYEGGIRV 326

Query: 379 P 379
           P
Sbjct: 327 P 327


>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
 gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
          Length = 520

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 127/411 (30%), Positives = 196/411 (47%), Gaps = 47/411 (11%)

Query: 65  FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           F + T+ +R   AA +    + +++    G+ D+   GS  + TPN+D LA +G++ +  
Sbjct: 41  FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 97

Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
           Y A  VC+PSRA L+T + P   G +G        +   P   G+P +E+ L ++L   G
Sbjct: 98  YVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 157

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
           Y+T  IGKWHLG     + P  RGF+   G L G   Y+   +            H + R
Sbjct: 158 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPATMK-----------HVIER 205

Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           N     D   EY TD FT E ++ I+       D+P F++ ++ A H       + A + 
Sbjct: 206 NGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHATEA 260

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
            + +F  I +  RRTYAAM+  LD  VG +   L+  G  EN++++F SDNG  T     
Sbjct: 261 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 315

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
             N  +W  N P RGVK ++ EGG++VP I   P       +   ++   D LPT  +AA
Sbjct: 316 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAA 371

Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
           G +   L   +   D  +   +N   R +   DG+D    L   +  P+RR
Sbjct: 372 GAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 419



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
           + RN     D   EY TD FT E ++ I+       D+P F++ ++ A H       + A
Sbjct: 203 IERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHA 257

Query: 58  PQETINQFQYITDPNRRTYAAL 79
            +  + +F  I +  RRTYAA+
Sbjct: 258 TEADLARFANIQNQKRRTYAAM 279


>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
          Length = 491

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 121/355 (34%), Positives = 174/355 (49%), Gaps = 36/355 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---Q 147
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 10  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTS 69

Query: 148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
           G       P+    G+P +ER LPE L+E GY +K +GKWHLG  R ++ PL  GF+  F
Sbjct: 70  GHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 128

Query: 204 GYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
           G  N     YD+         R  E+ G    +   NL T    +    T ++ +EA+  
Sbjct: 129 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEALDF 184

Query: 259 I-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
           I   Q   +P FLY A  A HA                  ++    R  Y   V+++DDS
Sbjct: 185 IRRQQAARRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREIDDS 233

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           VG ++ +LQ  G+ E++ + F SDNGA  +     S  +  GSN P+   K T +EGG++
Sbjct: 234 VGKILHSLQDLGIAESTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQTTFEGGMR 288

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
            PA+ W P      +VS Q+  I D   T  + AG +  R    IDGL    ++L
Sbjct: 289 EPAVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLSLLPAML 342


>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
 gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
          Length = 467

 Score =  166 bits (420), Expect = 3e-38,   Method: Compositional matrix adjust.
 Identities = 111/345 (32%), Positives = 172/345 (49%), Gaps = 33/345 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS E  TP +D LA   I  +  Y +  VC PSRA ++TGKY    G +   
Sbjct: 38  GYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSAAVCGPSRAGILTGKYQQKFGFEENN 97

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+PL +  +  YL++LGY T   GKWH G   R + P  RGF+  
Sbjct: 98  VPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKTALFGKWHQGNADR-FHPTKRGFDEF 156

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +G+  G  SY  +  S+  S+    N   + R      +  G Y TD    EA+  I ++
Sbjct: 157 YGFRGGARSYMPYDDSNPLSK----NEDRLERGFGNFLEHEG-YLTDELAHEAISFI-NR 210

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
               P F+YL+  A H       +EA  E + QF ++    R+T AAM   +D ++GTV+
Sbjct: 211 NKKHPFFIYLSFNAVHTP-----MEATAEDLEQFPHLKG-KRKTLAAMTLAMDRAIGTVL 264

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
             L+ +G+  N++I+F +DNG P+          N   N P  G K    EGG++VP ++
Sbjct: 265 KELETQGLSNNTLIVFTNDNGGPSDA--------NDSDNSPLSGTKANHLEGGIRVPFLM 316

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            W  Q+Q+N   +  +  + D  PT  +AA G++S +   +DG++
Sbjct: 317 SWPKQLQKNKNYNYPISTL-DLFPTFLSAASGNSSAME-ELDGVN 359


>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  166 bits (420), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 117/353 (33%), Positives = 177/353 (50%), Gaps = 30/353 (8%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
           AA   +  L L    G+ DL F G+ +I TP+IDALA +G+     Y +  VC PSRA L
Sbjct: 18  AAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGL 77

Query: 136 MTGKYPIHTGM-QGPPIWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           +TG+Y    G  + PP  G   +     G+PL E+ + + L+   Y+T  IGKWH+G   
Sbjct: 78  LTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMGL-S 136

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
            E  P  R  + ++G+LNG  SY +  +     +   +     R N    +     Y T+
Sbjct: 137 HEQRPTQRSVDYYYGFLNGAHSYREAKMD---MKGAPMTWPIFRNNEPVPFSG---YTTE 190

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +F  E V  I+ +  DKP FLY+++ + H        EA  + + +  +I    RR Y+A
Sbjct: 191 VFNDEGVNFIK-RNKDKPFFLYMSYNSVHG-----PWEAQPKDLQRSDHIKKKWRRIYSA 244

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           M+  +DD VG +I  L+ +G+ EN+++IFMSDNGAP   +          SN   RG K 
Sbjct: 245 MLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAERAGDYLASNGSLRGRKG 304

Query: 370 TLWEGGVKVPAILWSPQI------QQNPRVSLQ----MMHISDWLPTLYTAAG 412
             +EGG++VP I+  PQ+       Q+P   L     ++HIS   P     +G
Sbjct: 305 DTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHISQAAPAKKELSG 357


>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
          Length = 502

 Score =  166 bits (419), Expect = 4e-38,   Method: Compositional matrix adjust.
 Identities = 127/375 (33%), Positives = 179/375 (47%), Gaps = 62/375 (16%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI 
Sbjct: 14  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 73

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+P +ER LP+ L+E GY+TK +GKWHLG  R ++ PL 
Sbjct: 74  NGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQFHPLK 132

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEY-------- 246
            GF+  FG  N     YD               +  R N+     WD VG +        
Sbjct: 133 HGFDEWFGSPNCHFGPYD---------------NKARPNIPVYRNWDMVGRFYEEFPINV 177

Query: 247 ------ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
                  T ++ +EA+  I + Q    P FLY A  A H         AP      F  I
Sbjct: 178 KTGESNLTQIYLQEALDFIRQQQAAQHPFFLYWAVDATH---------APVYASRPFLGI 228

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
           +   R  Y   V+++DDSVG ++S L+   + ENS + F SDNGA  +     S     G
Sbjct: 229 S--QRGRYGDAVREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALI-----SAPSQGG 281

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSR 417
           SN P+   K T +EGG++ PAI W P+     +VS Q+  I D   T    AG    + R
Sbjct: 282 SNGPFLCGKQTTFEGGMREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAGLKPPSDR 341

Query: 418 LPLNIDGLDQWSSLL 432
           +   IDGL+   ++L
Sbjct: 342 V---IDGLNLLPTML 353


>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 548

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 136/391 (34%), Positives = 190/391 (48%), Gaps = 71/391 (18%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
            L L    G ND+S  G   I TPNID LA  G +  N Y+    C PSRA+L+TG+   
Sbjct: 66  VLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGRDAT 125

Query: 143 HTGMQGPPI-----------------------WGAEP---------RGVPLTERFLPEYL 170
            TG    PI                       + AE          RG+P +E  +PE L
Sbjct: 126 RTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLPGSEILIPEIL 185

Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
           +E GY T  IGKWHLG    E  P  +GF+      +G+    DH   +  +  VE +G 
Sbjct: 186 KESGYHTMHIGKWHLGR-SPEMMPNAQGFDESLMMDSGLYLPVDH--PESVNAPVESSGL 242

Query: 231 D------MRRNLSTAWDTVGE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
           D      MR   S  W+  GE      Y TD FT+EA + IE    ++P FLYLAH   H
Sbjct: 243 DRFIWATMR--YSVNWNG-GEIFKPNGYLTDYFTEEAEKAIEAN-ANRPFFLYLAHWGPH 298

Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
             N  +   A  E +   Q     N+R YAAM++ +D SV  V++ L+++G+ +N+I+I 
Sbjct: 299 --NPVQAKRADYEAVGDIQ---PHNKRVYAAMLRSIDRSVERVMAKLEKQGIADNTIVIL 353

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQM 397
            SDNG    +Y   ++      N PYRG KNT +EGG++VP ++ W   I ++  +   +
Sbjct: 354 SSDNGG--ADYVAINDL-----NKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEPV 406

Query: 398 MHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
            HI D +PT+   A  D   LP +  IDG+D
Sbjct: 407 NHI-DLMPTIINMANAD---LPQDREIDGVD 433



 Score = 41.6 bits (96), Expect = 1.3,   Method: Compositional matrix adjust.
 Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD FT+EA + IE    ++P FLYLAH   H  N  +   A  E +   Q     N+R
Sbjct: 267 YLTDYFTEEAEKAIEAN-ANRPFFLYLAHWGPH--NPVQAKRADYEAVGDIQ---PHNKR 320

Query: 75  TYAALTKS 82
            YAA+ +S
Sbjct: 321 VYAAMLRS 328


>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
          Length = 311

 Score =  166 bits (419), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 102/327 (31%), Positives = 161/327 (49%), Gaps = 32/327 (9%)

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SR S ++G+Y   + MQ   I   +P+ + L    LP YL+ELGY T A GKWHLG+ R 
Sbjct: 1   SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
           E TP +RGF+S  G  +G   +Y+H        T     +D        +D +G+++ DL
Sbjct: 61  ECTPTHRGFDSFSGGFSGEGEFYEHT-------TATGGYYDWHLGTEVDYDAIGKHSEDL 113

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRR 305
                 + +++     PLF+Y+A    H         +P +   +F  + D      +R+
Sbjct: 114 IGYYVNKTLDEYDQSSPLFMYVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRK 164

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y  +V  +D  +G ++  ++  G+ EN+ I+F SDNG    E          G N P+R
Sbjct: 165 KYLGLVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDVGE----------GDNSPHR 214

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           G K+TLWEGG K  +   SP +  +   +  +MH++DWLPT+   AGG        +DG 
Sbjct: 215 GGKSTLWEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPASD-GLDGF 273

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSL 452
            Q   ++ N  S R + I  +D+ S +
Sbjct: 274 VQTDMVINNGDSARTTMIYNIDRESDI 300


>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
 gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
           norvegicus]
          Length = 524

 Score =  165 bits (418), Expect = 5e-38,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35  VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 95  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
            PL  GF+  FG  N     YD+ +       R  E+ G    +   NL T    +    
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206

Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           T L+ +EA+  I  Q   + P FLY A  A HA                 Q++    R  
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+  
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
            K T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367

Query: 425 LDQWSSLL 432
           LD   ++L
Sbjct: 368 LDLLPTML 375


>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 455

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 120/350 (34%), Positives = 181/350 (51%), Gaps = 47/350 (13%)

Query: 76  YAALTKSTTLTLLIV---YGWNDLSFHGSNE--IPTPNIDALAYNGIILNNMY-AQPVCT 129
           + AL  S    ++I+    G+ D+S++  ++  I TP+ DALA +G+I +  Y +  VC+
Sbjct: 17  FVALADSKPNIIVILSDDQGYADVSYNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCS 76

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
            +R+ LMTG+Y    G+      G    G  L  +F+P YL+E GY + A GKWHLG   
Sbjct: 77  TTRSGLMTGRYQQRYGIYTA---GEGGTGTDLNAKFIPNYLKEAGYKSMAFGKWHLG-HE 132

Query: 190 REYTPLYRGFESHFGYLN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
            +Y PL+RGF+  +G++  G   ++   L  +Y       G  + R L    D    Y T
Sbjct: 133 MKYHPLHRGFDDFYGFMGRGAHDFFR--LEKEYDGKF---GGPIYRGLEPIDDK--GYLT 185

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
              T+E V+ IE+   DKP F Y+A+ A H        +AP E I       D  R    
Sbjct: 186 TRITEETVKFIEENK-DKPFFAYVAYNAVHT-----PAQAPAEDIKAVS--GDETRDILV 237

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           AM+K LD  VG ++  L++  + EN+III++SDNG                +N P RGVK
Sbjct: 238 AMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKSMV---------ANNKPLRGVK 288

Query: 369 NTLWEGGVKVPAIL-WSPQIQ-----QNPRVSLQMMHISDWLPTLYTAAG 412
           + +++GG++VP ++ W  QI+     Q+P +SL      D LPTL  AAG
Sbjct: 289 HDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISL------DILPTLLDAAG 332


>gi|406661802|ref|ZP_11069914.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554347|gb|EKB49446.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 486

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 134/386 (34%), Positives = 190/386 (49%), Gaps = 53/386 (13%)

Query: 77  AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRAS 134
           + +TK   + +L    G+ DL F G   I TPN+D LA  GI  +N YA   VC PSR++
Sbjct: 29  STITKPNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGIFFSNHYAGATVCAPSRST 88

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           LMTG +  HT ++G   +  +P G   +P T   +P   +E GY+T A GKW LGF   E
Sbjct: 89  LMTGLHTGHTPVRGN--FEIQPEGQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTE 146

Query: 192 YTPLYRGFESHFGYLNGVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
             P ++GF+  FGY    I+  YY   L D  +  V L G+D  + ++        YA D
Sbjct: 147 GDPNHQGFDQFFGYNCQRIAHRYYPEYLWDN-AEKVFLEGNDWTQKIT--------YAAD 197

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHA-----------------GNAGKHLEAPQ-- 290
           +  KE+++ I D   + P F+++  +  HA                 G    H+ AP+  
Sbjct: 198 IIHKESLKFIRDN-ANNPFFMFVTSVMPHAELAVPDGELLAYYREKIGPEEPHI-APRGS 255

Query: 291 ----ETINQFQYITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
               E IN   Y ++P  + +YAAMV  LD  VG ++  L   G+ E +IIIF SDNG P
Sbjct: 256 DYGDEPINIPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNG-P 314

Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
             E     ++ N  SN  +RG K  L+EGG++VP IL  P + Q    S  +    D+LP
Sbjct: 315 HQEGGNDPDFFN--SNGIFRGYKRDLYEGGIRVPMILKWPGVIQGGAQSGHVSAFWDFLP 372

Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSL 431
           T     G        N+D LD  S L
Sbjct: 373 TFADLLGEK------NLDKLDGVSFL 392


>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 980

 Score =  165 bits (418), Expect = 6e-38,   Method: Compositional matrix adjust.
 Identities = 106/303 (34%), Positives = 167/303 (55%), Gaps = 23/303 (7%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP 149
           +G+     +G     TP++DALA  G+   N Y +  VCTPSRA ++TG+Y    G++  
Sbjct: 40  FGFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVE-- 97

Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            ++G  P  G+P +E  + + LR+ GY T AIGKWHLG    E+ P  RGF+  +G L G
Sbjct: 98  TVYGRIPEAGLPASEITMADTLRKAGYRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTG 157

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD--- 265
             +++ +  ++  S+ ++ NG  +   L         Y TDL  ++ V  I+D   +   
Sbjct: 158 ARTFFPYRGNNPGSK-LQRNGVFLPEPLDQP------YFTDLLARQTVAYIDDHVANHAN 210

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            P FLYLA  A H       LEA  + ++  + Q I+ P R+T AAM   +DD+VGTV++
Sbjct: 211 APFFLYLAFTAVHT-----PLEADPKRLDDRRIQDISPPQRKTLAAMTIAMDDAVGTVMA 265

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L+   + EN+I++F+SDNG P  E   +    NW  N   +G K+  +EGG++VP ++ 
Sbjct: 266 KLRENNLTENTIVVFLSDNGGP--EDNRSLRAPNWSDNGALQGNKSQNFEGGIRVPFVIH 323

Query: 384 SPQ 386
            P+
Sbjct: 324 WPR 326


>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
 gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
          Length = 498

 Score =  165 bits (418), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 128/412 (31%), Positives = 197/412 (47%), Gaps = 49/412 (11%)

Query: 65  FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
           F + T+ +R   AA +    + +++    G+ D+   GS  + TPN+D LA +G++ +  
Sbjct: 19  FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75

Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
           Y A  VC+PSRA L+TG+ P   G +G        +   P   G+P +E+ L ++L   G
Sbjct: 76  YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 135

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
           Y+T  IGKWHLG     + P  RGF+   G L G   Y+   ++           H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGGHHYFPTTMN-----------HVIER 183

Query: 235 NLSTAWDTVGEYATDLFTKEAVQLI---EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           N     +   EY TD FT E ++ I   E    D+P F++ ++ A H       + A + 
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PMHATEA 238

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
            + +F  I    RRTYAAM+  LD  VG +   L+  G   N++++F SDNG  T     
Sbjct: 239 DLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGAT----- 293

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
             N  +W  N P RGVK ++ EGG++VP I  W  +     R    +  + D LPT  +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVVSSL-DLLPTFCSA 348

Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
           AG +   L   +   D  +   +N   R +   DG+D    L   +  P+RR
Sbjct: 349 AGAEPLALANPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 397


>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 508

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 150/569 (26%), Positives = 243/569 (42%), Gaps = 111/569 (19%)

Query: 84  TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
            + L+  +GW ++ +H    + E+ TPNID+L   G+ L+  Y   VC+PSR+SLM+G+ 
Sbjct: 28  VMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLVRQGLELDQHYVFNVCSPSRSSLMSGRL 87

Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           PIH               P+ G     +P     + + ++  GY T  +GKW  G     
Sbjct: 88  PIHVNDLNIEPDYYNPDDPVSGFS--AIPRNMTGIAQKMKLGGYDTHQVGKWDAGMATHT 145

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG----EYA 247
           +TP  RGF+S FGY +    +Y  I     ++T  +   D+       +   G     Y 
Sbjct: 146 HTPKGRGFDSSFGYFHHANDFYTEIDGKPCNKTKIV---DIWVTDKPGYGLNGTGPDNYE 202

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
             LF ++ ++++ +    KPLFLY A    HA      L+ PQ   ++F +I D +R+ Y
Sbjct: 203 EGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----PLQVPQRYQDKFSFIDDHDRQIY 257

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
            AMV  LDD VG ++ AL++KG+ EN + +  SDNG P         Y   G+ NYP +G
Sbjct: 258 HAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGPV--------YPGGGANNYPLKG 309

Query: 367 VKNTLWEGGVKVPAIL---WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
            K + W+GG++  A +   + P+  +  ++    +H++DW  T    A         N+D
Sbjct: 310 GKASDWQGGIRSNAFVSGGYLPEKMRGKKLD-GYIHLADWYATFCALA---------NVD 359

Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
             D+ ++     P      ID L+ W  L+         V I I  K   +      +K+
Sbjct: 360 PTDEAAAKADLPP------IDSLNMW-PLISGETMTSPRVDIPISYKTLISG----EYKI 408

Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
           + GT       GY G    N+    N N  +             NIF             
Sbjct: 409 LTGTNNQA---GYTGPQYPNQT---NPNGGI-------------NIF------------- 436

Query: 544 QQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRR 602
           QQ    C                  L+N+  DP E+ ++A+  PD   ++ + L +Y + 
Sbjct: 437 QQCGDGC------------------LYNIMTDPEERVDLAAKMPDKLKEMQQKLAQYQKT 478

Query: 603 TLVP-QSHEQPDLVQADPKRFNDTWSPWI 630
              P +    P    A    +N  W P++
Sbjct: 479 HFAPDRGSPWPGACDAALNTYNGFWGPFV 507



 Score = 42.0 bits (97), Expect = 1.0,   Method: Compositional matrix adjust.
 Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 5/66 (7%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y   LF ++ ++++ +    KPLFLY A    HA      L+ PQ   ++F +I D +R+
Sbjct: 201 YEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----PLQVPQRYQDKFSFIDDHDRQ 255

Query: 75  TYAALT 80
            Y A+ 
Sbjct: 256 IYHAMV 261


>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
           musculus]
 gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
 gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
           musculus]
          Length = 520

 Score =  165 bits (417), Expect = 7e-38,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 91  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
           T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD 
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366

Query: 428 WSSLL 432
             ++L
Sbjct: 367 LPTML 371


>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
 gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
          Length = 522

 Score =  164 bits (416), Expect = 9e-38,   Method: Compositional matrix adjust.
 Identities = 122/362 (33%), Positives = 177/362 (48%), Gaps = 36/362 (9%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           +G     G       P+    G+P +E  LP  L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 94  SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 152

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 208

Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA++ I+  Q   +P FLY A  A HA                  ++    R  Y   
Sbjct: 209 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PIYASKPFLGTSQRGRYGDA 257

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           +++LDDSVG ++  L+   + EN+ + F SDNGA  +     S  R  GSN P+   K T
Sbjct: 258 IRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALI-----SAPRQGGSNGPFLCGKQT 312

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P      +VS Q+  I D   T  + AG +  R    IDGLD   +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 371

Query: 431 LL 432
           +L
Sbjct: 372 ML 373


>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
          Length = 466

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35  VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 95  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
            PL  GF+  FG  N     YD+ +       R  E+ G    +   NL T    +    
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206

Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           T L+ +EA+  I  Q   + P FLY A  A HA                 Q++    R  
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S  +  GSN P+  
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDG 424
            K T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367

Query: 425 LDQWSSLL 432
           LD   ++L
Sbjct: 368 LDLLPTML 375


>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
          Length = 492

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 122/364 (33%), Positives = 169/364 (46%), Gaps = 41/364 (11%)

Query: 79  LTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
           LT     T  +  GW DL   G     TP++D +A  G++  N Y A P+C+PSRA+L+T
Sbjct: 21  LTCLRVATFSLQMGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLT 80

Query: 138 GKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           G+ PI  G        A  R          G+P +E  LPE L++ GY  K +GKWHLG 
Sbjct: 81  GRLPIRNGFY---TTNAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLGH 137

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDT 242
            R ++ PL  GF+  FG  N     YD+         R  E+ G    D   NL T    
Sbjct: 138 -RPQFHPLKHGFDEWFGAPNCHFGPYDNKARPNIPVYRNWEMVGRFFEDFPINLKTGEAN 196

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
           +    T ++ +EAV  I+ Q   +P FLY A  A HA                  ++   
Sbjct: 197 L----TQIYLQEAVDFIKQQAHQQPFFLYWAVDATHA-----------PVYASKSFLGTS 241

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  Y   V+++DD VG ++  LQ  G+ EN+ + F SDNGA  +     S     GSN 
Sbjct: 242 QRGRYGDAVREIDDCVGKILKLLQDLGIQENTFVFFTSDNGAALI-----SAPIQGGSNG 296

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
           P+   K T +EGG++ PAI W P      RVS Q+ ++ D   T  + AG         I
Sbjct: 297 PFLCGKQTTFEGGMREPAIAWWPGHIPAGRVSHQLGNVMDLFTTSLSLAGLKPPS-DREI 355

Query: 423 DGLD 426
           DG+D
Sbjct: 356 DGID 359


>gi|384098332|ref|ZP_09999449.1| chondroitinsulfatase [Imtechella halotolerans K1]
 gi|383835828|gb|EID75248.1| chondroitinsulfatase [Imtechella halotolerans K1]
          Length = 464

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 104/310 (33%), Positives = 167/310 (53%), Gaps = 30/310 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  D+  +G + I TP+ID LA  G+  NN YA   VC PSRA+L+TG++  HT ++G  
Sbjct: 36  GIGDIGVYGQSIIQTPHIDRLAEEGMKFNNFYAGSTVCAPSRAALLTGQHTGHTQVRGNG 95

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            +  +P      ++ +PE L+  GY+    GKW +G      TP  RGF+   G+++ V 
Sbjct: 96  EFPLDP-----AKKIIPEVLKNAGYTNAIFGKWGMGLNGSGSTPDKRGFDVFAGHIHHVS 150

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           ++Y      Q   +++   H   + +    DT   Y  + FT++A+Q I +QP DKP FL
Sbjct: 151 AHY------QKPDSIDAIVHGDLKRIGLPKDT---YVNEYFTEQALQFINNQPKDKPFFL 201

Query: 271 YLAHLAAHA------GNAGKHLEAPQETINQFQ--------YITDP-NRRTYAAMVKKLD 315
           ++++   HA          KHL + +++ +  +        Y   P  +  YAA+V+ LD
Sbjct: 202 FMSYTIPHAELVVPPKYLQKHLLSAEQSKHDSEKEWPAGRHYGPQPFPKAAYAALVESLD 261

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           D  G +++AL+ KG+ EN+I+IF SDNG  T   R   +   +GS+  Y+GVK  L+ GG
Sbjct: 262 DYTGQILAALKAKGIDENTIVIFTSDNGTHTEGGRTQEDVDYFGSSAHYQGVKRDLYTGG 321

Query: 376 VKVPAILWSP 385
           +K P ++  P
Sbjct: 322 IKEPFLIRWP 331



 Score = 40.4 bits (93), Expect = 3.3,   Method: Compositional matrix adjust.
 Identities = 15/34 (44%), Positives = 24/34 (70%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 48
           Y  + FT++A+Q I +QP DKP FL++++   HA
Sbjct: 177 YVNEYFTEQALQFINNQPKDKPFFLFMSYTIPHA 210


>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
          Length = 537

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 48  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 107

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 108 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 163

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 164 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 222

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 223 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 271

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 272 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 326

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
           T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD 
Sbjct: 327 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 383

Query: 428 WSSLL 432
             ++L
Sbjct: 384 LPTML 388


>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
          Length = 492

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 141/481 (29%), Positives = 221/481 (45%), Gaps = 92/481 (19%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
            L K   +  ++V   G++DL +  + ++ +PNIDALA + + L   Y QP CTPSRA+ 
Sbjct: 19  VLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNIDALAKDALHLKKHYVQPSCTPSRAAF 77

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG+Y I  GMQ   I  +EP G+PL E  L E  ++ GY T   GKWHLGF+  ++ P 
Sbjct: 78  LTGRYNIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQ 137

Query: 196 YRGFESHFGYLNGVISYYDH----ILSDQYSRTVELNGHDMRR----------------- 234
            RGF+  +G+  G   +Y H    + S ++S   E++G + +R                 
Sbjct: 138 NRGFDRFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGYDFREA 197

Query: 235 ---NLSTAWDTV-------GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
              N     DT+       G Y+T LF  + +  +        LF Y++    H      
Sbjct: 198 YPANGDVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVH------ 251

Query: 285 HLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
               P +T+N+F+  Y    +  T+  ++      + T I+          S+++F SDN
Sbjct: 252 ---GPLQTVNKFKKHYKNKTDTWTHERIL------ISTKIT----------SLLVFTSDN 292

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ----MM 398
           G    E           SN+P RG K T+++GG+K  A + SP +Q   R+  Q    + 
Sbjct: 293 GGRPKE---------GASNWPLRGSKGTIYDGGLKSRAFVASPLLQN--RMKGQNYNYLF 341

Query: 399 HISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNT 456
           H+SDW PTL T +G D  R     +DG  +   L +  P + RN  +  LD  +      
Sbjct: 342 HVSDWFPTLLTLSGCDVPRKEGRPLDG--KIHELFVYKPKKARNKMLHYLDPLTQFDYGD 399

Query: 457 PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGT-QENGTMDGYYGQTR-----SNKVPLL 508
           P  RN  +++      T  A+ R   WK++ G    NG   GY  + R      NK+ L+
Sbjct: 400 P--RNFEVLDDRAFNTTIKASYRQGRWKIITGKPCANGC--GYIKKERPWNYAKNKMKLI 455

Query: 509 N 509
           +
Sbjct: 456 D 456


>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
          Length = 530

 Score =  164 bits (416), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 114/380 (30%), Positives = 180/380 (47%), Gaps = 42/380 (11%)

Query: 80  TKSTTLTLLIVYGWNDLSFHGSNEIPT-------PNIDALAYNGIILNNMYAQPVCTPSR 132
           +K   +  ++V   +DL F+ +  + T       P++ ALA   + +   Y   +CTP+R
Sbjct: 17  SKQPHIAFMLV---DDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTR 73

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            + MTG+ P   G+    I G +  G+P  E  + + L+  GY+T  +GKWHLG F    
Sbjct: 74  GAFMTGRLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDAS 133

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTV------- 243
            P  RGF++ +GY NG          D ++R ++  G     +L+   ++ TV       
Sbjct: 134 EPTRRGFDASYGYQNGE--------EDHFTRILQGLGDFQASSLADGASYYTVDDLLKIN 185

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA---GNAGKHLEAPQETINQFQYIT 300
           G + + LF   AV ++       PLFLYLA    HA     AG   +A   +I   ++  
Sbjct: 186 GTHNSFLFVDRAVAVVGAHDASIPLFLYLALQDTHAPIQAPAGYADDAACASIGSSRHYP 245

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
           D  RR +  MV+  D+S+  V +A+       NS+I+   DNG            +  G+
Sbjct: 246 DGARREFCGMVRAADESIANVTAAIDANFGKRNSVIVIAGDNGGIV---------KGGGN 296

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           N+P RG K   WEGGV+  A++ +P           M+H+ D   TL   AGG ++    
Sbjct: 297 NWPLRGQKGQPWEGGVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAGGASA---Y 353

Query: 421 NIDGLDQWSSLLLNTPSRRN 440
            IDGLD W++L   +PS R+
Sbjct: 354 EIDGLDVWAALRDGSPSPRS 373


>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
 gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 124/365 (33%), Positives = 174/365 (47%), Gaps = 40/365 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 91  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF   FG  N     YD+         R  E+ G            T     T L
Sbjct: 147 HPLKHGFNEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
           T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD 
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366

Query: 428 WSSLL 432
             ++L
Sbjct: 367 LPTML 371


>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
 gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
          Length = 477

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 190/364 (52%), Gaps = 40/364 (10%)

Query: 69  TDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
           ++P      ++  +  L      G++D+  +G+++I TP++D LA  G+   N Y AQ V
Sbjct: 19  SNPQGNNETSIIPNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAV 78

Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           C+ SRA+L+TG YP   G+ G  +  +   G+   E  + + L+ LGY+T   GKWHLG 
Sbjct: 79  CSASRAALLTGTYPNRLGIHGA-LDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLGH 137

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
              E+ P  +GF+ +FG       + +H  +  Y   + +  +D  + + T W+    + 
Sbjct: 138 -HPEFLPTNQGFDEYFGIPYSNDMWPNHPQTKDYYPPLPIYQND--KVVDTIWNDQSMFT 194

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
           T  FT++++  IE    D+P FLYLAH   H          P    ++F+     NR  Y
Sbjct: 195 T-WFTEKSIDFIERNK-DRPFFLYLAHPMPHV---------PLFVSDKFK--GKSNRGLY 241

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
             ++ ++D SVG ++  L++  + EN+IIIF+SDNG P + Y E S     GS YP R  
Sbjct: 242 GDVIMEIDWSVGQIMQTLKQNDLDENTIIIFLSDNG-PWLSYSEHS-----GSAYPLREG 295

Query: 368 KNTLWEGGVKVPAIL-W-----SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL- 420
           K T W+GGVKVPAI+ W     S  +Q+NP +++      D LP L    G   + LP+ 
Sbjct: 296 KGTSWDGGVKVPAIMRWKGTWPSGLVQKNPAMTI------DILPMLAHLTG---AELPVK 346

Query: 421 NIDG 424
            IDG
Sbjct: 347 KIDG 350


>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 507

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 112/347 (32%), Positives = 165/347 (47%), Gaps = 33/347 (9%)

Query: 84  TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
            + L+  +GW ++ +H    + E+ TPNID L   G+ LN  YA   C+PSR+SL++G+ 
Sbjct: 28  VMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVKEGLELNQHYAFRFCSPSRSSLISGRL 87

Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           PIH               P+ G     +P     + E +R  GY+T  +GKW  G    +
Sbjct: 88  PIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIAEKMRGAGYATHQVGKWDAGMATSD 145

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL--STAWDTVGEYATD 249
           +TP  RGF++ FGY +    YY  I        V+L   D   +    T  D    Y   
Sbjct: 146 HTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIVDLWETDKPAHGINGTGPDN---YEEG 202

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           LF +  + ++ +     PLFLY A    HA       E P   +N+F +I +  R+ Y A
Sbjct: 203 LFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PYEVPDRYLNKFSFIDNHLRQYYHA 257

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV  LDD VG +  AL+ KGM +N + +  SDNG P  E           +NYP +G K 
Sbjct: 258 MVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPVHE-------GTGANNYPLKGGKT 310

Query: 370 TLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD 414
           T W+GGV+V A +    + +  R   +   +H++DW  T    AG D
Sbjct: 311 TDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAGVD 357


>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
 gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
          Length = 440

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 160/328 (48%), Gaps = 30/328 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +G+ +  TP++D LA  G+  N  YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 15  GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P        G+P     LP  L++ GY T  IGKWHLG     Y P   G++  FG  +G
Sbjct: 75  PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQLSGYDYFFGNHSG 133

Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
            I Y+ H   + +Q+S  +      + R           Y T +   EA + + E +  D
Sbjct: 134 AIDYFTHKPGVGEQFSSDLWQGNEPVERT---------GYYTYILGDEATRYVHERKGQD 184

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           KP FL L   A H    G   E    +I    +    + + Y  +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
              G  +N+I+IF SDNG              +   +P+ G K  L EGG++VP +L WS
Sbjct: 245 DDSGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            +IQ  P+V  Q+    DWLPTL  AAG
Sbjct: 294 ARIQ--PQVQEQITASFDWLPTLLAAAG 319


>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 443

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 106/313 (33%), Positives = 159/313 (50%), Gaps = 32/313 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L    G+ D+ F GS++I TP+ID LA +G+I +  Y +  VC PSRA LMTGK  +
Sbjct: 12  VLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQV 71

Query: 143 HTGMQGP-----PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
             G         P +  E  G+P++E+ L   L E GY    +GKWHLG  + +Y PL R
Sbjct: 72  RFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLGD-KEQYHPLKR 130

Query: 198 GFESHFGYLNGVISYYDHILSDQ-YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           GF   +GYL G   Y+    + + Y   +E N +   + +S   D  G+   D   +   
Sbjct: 131 GFHEFWGYLGGGHHYFRSKPNGKGYDCPIECN-YKTPQPISYITDDKGDECVDFIRRHK- 188

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
                   D+P FL+ +  A HA      + A +E +  + +I    RR Y AM   LD 
Sbjct: 189 --------DEPFFLFASFNAPHA-----PMHAKEEDLKLYSHIEGEKRRAYCAMTHNLDL 235

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG +++ L+ + + +N++++F+SDNG PT          N   N P RG K  L EGG+
Sbjct: 236 NVGKIVAELKAQNIYDNTVVVFLSDNGGPTPN--------NASLNAPLRGGKGILLEGGI 287

Query: 377 KVP-AILWSPQIQ 388
           +VP  + W  +IQ
Sbjct: 288 RVPFTMTWPAKIQ 300


>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
          Length = 526

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 125/373 (33%), Positives = 176/373 (47%), Gaps = 42/373 (11%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
           A  + +  L L+   GW DL  +G     TPN+D +A  G +  N Y A P+C+PSRA+L
Sbjct: 30  APQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAAL 89

Query: 136 MTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
           +TG+ PI  G        A  R          G+P +E  LPE L+E GY +K +GKWHL
Sbjct: 90  LTGRLPIRNGFY---TTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHL 146

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAW 240
           G  R ++ PL  GF+  FG  N     YD+         R  E+ G    +   NL T  
Sbjct: 147 G-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGE 205

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
             +    T ++ +EA+  I+ Q   + P FLY A  A HA                  ++
Sbjct: 206 SNL----TQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA-----------PVYASKAFL 250

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               R  Y   V+++DDSVG ++  L+   + EN+ + F SDNGA  +     S  R  G
Sbjct: 251 GTSQRGRYGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALI-----SAPRQGG 305

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           SN P+   K T +EGG++ PAI W P      +VS Q+  I D   T  + AG +     
Sbjct: 306 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAGLEPPS-D 364

Query: 420 LNIDGLDQWSSLL 432
             IDGLD   +LL
Sbjct: 365 RTIDGLDLLPTLL 377


>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 481

 Score =  164 bits (415), Expect = 1e-37,   Method: Compositional matrix adjust.
 Identities = 138/419 (32%), Positives = 198/419 (47%), Gaps = 64/419 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ F+G   I TPNID LA  G+I N  YA   VC PSR+SL+TG++  HT ++G  
Sbjct: 41  GYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRGNK 100

Query: 151 IWGAEPRGV-PLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
             G EP G  P+ +    L E L++ GY T A GKW LG    E  P  +GF+  +GY  
Sbjct: 101 --GVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGSEGDPNKQGFDRFYGYNC 158

Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             ++  YY   L D   + +      +  N         EYA DL  K+A+  +  Q   
Sbjct: 159 QSLAHRYYPEHLWDNSKKILLEGNKGLIHNK--------EYAPDLIQKKALSFVNAQDGK 210

Query: 266 KPLFLYLAHLAAHAG-----------NAGKHLEAPQETI------NQFQYIT-DPNRRTY 307
           +P FL+L ++  HA              GK  E P +        N   Y + D    T+
Sbjct: 211 QPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQDFPHATF 270

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           AAMV +LD  VG V++AL++KG+ +N+++IF SDNG P VE      + N G+   +RGV
Sbjct: 271 AAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNG-PHVEGGADPRFFNSGAG--FRGV 327

Query: 368 KNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           K  L+EGG++ P A  W   I+   + S  +    D LPT                    
Sbjct: 328 KRDLYEGGIREPFAARWPAAIKPGSK-SDYIGAFWDILPTF------------------- 367

Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI-NIDEKKRTAAVRLDSWKLV 484
              + L N P+ R  NIDG+    +L      +++  L     E+    AVR  +WK V
Sbjct: 368 ---AELANAPAPR--NIDGISFTDALKGKAIQKKHDYLYWEFHEQGGRQAVRQGNWKAV 421


>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
 gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
          Length = 440

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 160/328 (48%), Gaps = 30/328 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +G+ +  TP++D LA  G+  N  YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 15  GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P        G+P     LP  L++ GY T  IGKWHLG     Y P   G++  FG  +G
Sbjct: 75  PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQLSGYDYFFGNHSG 133

Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
            I Y+ H   + +Q+S  +      + R           Y T +   EA + + E +  D
Sbjct: 134 AIDYFTHKPGVGEQFSSDLWQGNEPVERT---------GYYTYILGDEATRYVHERKGQD 184

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           KP FL L   A H    G   E    +I    +    + + Y  +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
              G  +N+I+IF SDNG              +   +P+ G K  L EGG++VP +L WS
Sbjct: 245 DDSGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            +IQ  P+V  Q+    DWLPTL  AAG
Sbjct: 294 ARIQ--PQVQEQVTASFDWLPTLLAAAG 319


>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
 gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
          Length = 472

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 107/323 (33%), Positives = 164/323 (50%), Gaps = 32/323 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ D   +G  +IPTPNIDA+A  G    + Y +  VC PSRA ++TG+Y    G +   
Sbjct: 40  GYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEHNT 99

Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                P +     G+  +E+ +   ++  GY T AIGKWH G   + + PL RGF   +G
Sbjct: 100 SNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGDEPKHF-PLNRGFNEFYG 158

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +  G   ++ +    +  RT   N H +  N     +    Y TD+FT +A   I     
Sbjct: 159 FTGGHRDFFAY----KGKRT---NEHALYNNKEIVPENEITYLTDMFTDKATSFITANK- 210

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           DKP F+YL++ A H       + A ++ + ++  I D  RR YAAM+  LDD +G V++ 
Sbjct: 211 DKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMTSLDDGIGKVMAT 265

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
           L+   + +N++IIF++DNG  TV         N   N P RG+K + WEGG++V  ++ W
Sbjct: 266 LKANQLDKNTLIIFINDNGGATV---------NSSDNGPLRGMKGSKWEGGIRVAMMMKW 316

Query: 384 SPQIQQNPRVSLQMMHISDWLPT 406
              I  N   S  +  + D LPT
Sbjct: 317 PGHIAANKTDSRPVSSL-DILPT 338



 Score = 45.1 bits (105), Expect = 0.13,   Method: Compositional matrix adjust.
 Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD+FT +A   I     DKP F+YL++ A H       + A ++ + ++  I D  RR
Sbjct: 193 YLTDMFTDKATSFITANK-DKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRR 246

Query: 75  TYAALTKS 82
            YAA+  S
Sbjct: 247 AYAAMMTS 254


>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
          Length = 504

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 130/436 (29%), Positives = 201/436 (46%), Gaps = 73/436 (16%)

Query: 76  YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV------ 127
           + A  K   +  ++V  +G+ D+ F     I +PN D LA  G++LN  Y          
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYLAKTGLVLNR 75

Query: 128 ------CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
                 C+PSRAS +TG++P H      P   ++  G  L    +P  L+   YST  +G
Sbjct: 76  HYVFKYCSPSRASFLTGRWPHHAHQWNLP--QSDMAGTNLNMTMIPAKLKAAKYSTHMVG 133

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
           KWH GFF   Y P+ RGF++  G+L G     DH ++ +    ++   +D     +  +D
Sbjct: 134 KWHQGFFDPRYLPINRGFDTSSGFLGGG---EDH-MTQEAECAIDYWKNDAPDPRNGTYD 189

Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-IT 300
               Y  DL       ++ +  V+ PLF+YL     HA      L+AP E +N +    T
Sbjct: 190 AY-NYGDDL-----TDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNST 238

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
             NRRTY AMV   D+  G ++  L++KGM +N+III  +DNG               GS
Sbjct: 239 CENRRTYQAMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG----------GS 288

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD---- 414
           NYP +G K + +EGGV+  A +    + ++ R   +   +HI+DW  T    AG D    
Sbjct: 289 NYPLKGCKGSFFEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAGVDPDDS 348

Query: 415 -TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR- 472
            T + P  +DGLD W              I G D+         ++   +++  D   R 
Sbjct: 349 GTGKFP--VDGLDVWPI------------ITGEDE--------KTQHEDIILGYDFFSRH 386

Query: 473 TAAVRLDSWKLVLGTQ 488
           T A+ +  +KL++G Q
Sbjct: 387 TGAIIMGDYKLLVGPQ 402


>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
 gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
          Length = 502

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 112/365 (30%), Positives = 176/365 (48%), Gaps = 31/365 (8%)

Query: 69  TDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
            +PN R       +  L L    G+ D  F GS++I TPN+D LA    +    Y +  V
Sbjct: 48  ANPNPRGSNKDAPNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAV 107

Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWG--------AEPRGVPLTERFLPEYLRELGYSTKA 179
           C PSRA ++TGKY    G +   + G         +  G+PL +R + +YL   GYST  
Sbjct: 108 CGPSRAGILTGKYQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTAL 167

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
           IGKWH G   + + P  RGFE  +G+  G  SY++   ++  S   +     + R  +  
Sbjct: 168 IGKWHQGNADK-FHPTKRGFEHFYGFRGGARSYFEFGPNNPVSYPED----RLERGFAH- 221

Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
           +     Y T     EA++ I+ Q   +P FL+L+  A H       ++A +E + QF  +
Sbjct: 222 YKESPHYLTQALADEAIKFIK-QNQREPFFLFLSFNAVHT-----PMDANKEDLAQFPQL 275

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
           +   +R  AAM   +D  +G V+++L   G+ EN++++F +DNG P+          N  
Sbjct: 276 SGKRQRV-AAMTLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPSDT--------NAS 326

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
            N P  G K    EGG++VP ++  P     P    + +   D LPT + AAGG  S + 
Sbjct: 327 DNTPLSGTKANHLEGGIRVPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKSAVA 386

Query: 420 LNIDG 424
            ++DG
Sbjct: 387 -DLDG 390


>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
          Length = 534

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 160/585 (27%), Positives = 245/585 (41%), Gaps = 116/585 (19%)

Query: 75  TYAALTKS--TTLTLLIVYGWNDLSF----HGSNEIPTPNIDALAYNGIILNNMYAQPVC 128
           T  AL ++  T   LL V G +D+ F    + S ++ +P +D LA  G+ L   YA   C
Sbjct: 15  TLGALARAADTKPHLLFVLG-DDVGFGDVGYSSPDVISPTLDRLAAEGLKLGRHYAYMWC 73

Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRE-LGYSTKAIGKWHLGF 187
            PSRA+L+TG YP  TG+      GA+   +PL    LP  LR+  GY T A+GKWHLGF
Sbjct: 74  APSRAALLTGYYPSTTGVYS--TSGAQ-NALPLEFALLPGLLRDRAGYRTAAVGKWHLGF 130

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD--TVGE 245
                 P  RGF+  FG+L+G   +Y  + +         +  D RR      D    G 
Sbjct: 131 MSEADLPERRGFDGFFGFLDGGEDHYSRVGAGAPGCDRVFDLWDSRRRGPATDDPSAFGR 190

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA-------------------------- 279
           Y+ +L+ + A   I      +PLFLY A   AH+                          
Sbjct: 191 YSAELYGEAAADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKG 250

Query: 280 -GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
              A    +AP    N   +    +R   AAMV  LD +V  + +AL  K M  + ++++
Sbjct: 251 FACARPGADAPYRAPN---HDCGCDRLVVAAMVTALDAAVANMTTALASKNM-ADFVLVY 306

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMM 398
             DNG P +E        +W  N   RG K T +EGGV+  A + SP + +    +   +
Sbjct: 307 SGDNGGPEIE-------GHW--NGGLRGGKWTWFEGGVRPAAFVHSPLLTRRGWHN-GTL 356

Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
           H+ DWLPT    AG   +R     DG DQW+SL       R + + G             
Sbjct: 357 HLVDWLPTFLALAG--AARDDDAFDGADQWASLAFGAAPAREATLIG------------- 401

Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN--AIVES 516
               VL+                                G  +   VP   F   ++  +
Sbjct: 402 --PGVLVQ-------------------------------GNYKLYAVPPAGFGECSVFNT 428

Query: 517 KTYQ-SLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP------APMTPSPCT-NGPCY 568
            T+  +L +  Q +  P +++    +  +   + C A P      A +  +PC+ + PC 
Sbjct: 429 ATWDCTLGRRGQPLAAP-ADVKADGNASRCPRLQCSAGPFDDPADAWLCSAPCSLDKPC- 486

Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
           L+++ +DP E  ++AS  PD+ + +   L+ H  T+ P     PD
Sbjct: 487 LYDVVSDPAEARDVASRFPDVVAAMAAALRNH--TIRPPYPIVPD 529


>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
          Length = 981

 Score =  164 bits (414), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 132/467 (28%), Positives = 204/467 (43%), Gaps = 71/467 (15%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTP-NIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
            L  L   G NDL    S+++P P NI  L   G+ L   Y Q +C+P+RA+L++GK+  
Sbjct: 78  VLVTLDDVGMNDL--WQSSDLPVPENIATLVAEGVELTAYYGQSMCSPARAALLSGKFVH 135

Query: 143 HTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
             G      WG +          VPL    +PE L+  GY T  IGKW++G     Y P 
Sbjct: 136 KIGFSDK--WGPKREVTAFSNYSVPLGHVLMPEALKRNGYGTHGIGKWNIGHCNEAYLPW 193

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM-------RRNLSTAWDTVGEYAT 248
            RGF++  GYL   I Y DH+     S   + +  D+       R          G Y T
Sbjct: 194 MRGFDTFVGYLTDGIGYTDHVADGPSSYLYDNDALDLYDFVSHERGVTKNGSAYAGAYTT 253

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP---------QETINQFQYI 299
           ++F   A  ++ ++P D PLFL+LAH   H  NAG    A                   +
Sbjct: 254 EIFNARAETILREEPSDAPLFLWLAHHGVH-DNAGVADAATCVDGDDGDDTAIFALAASM 312

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
           T P  R +   +K +D  VG V  AL  +      +++  SDNG  T        + N G
Sbjct: 313 TRPRFR-FGCALKAVDRGVGAVRRALDDRA--REYVLVVHSDNGGNTC-----GTHCN-G 363

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           +NYP RG K   +EGG++VP +++SP +   ++       +MH  DWL T     GGD +
Sbjct: 364 NNYPRRGAKFFEFEGGLRVPGVVYSPTLIPARRRGAAYAGLMHHVDWLATFVALGGGDAA 423

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAV 476
            L  + D +D W+ +                  +  +   P  R++++        TA +
Sbjct: 424 DLDPSYDSVDHWAHI------------------AGRVSGAP--RDTIVFA--ASGDTATI 461

Query: 477 RLDSWKLV--------LGTQENGTMDGYYGQTRSNKVPLLNFNAIVE 515
           R  ++KL+          T  NGT + Y  +  SN++  L ++ + +
Sbjct: 462 RSGNYKLIHHALNATYFHTWFNGTANSYPWKCNSNEMVSLFYDVVAD 508



 Score = 39.7 bits (91), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 1/41 (2%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAG 52
            G Y T++F   A  ++ ++P D PLFL+LAH   H  NAG
Sbjct: 248 AGAYTTEIFNARAETILREEPSDAPLFLWLAHHGVH-DNAG 287


>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 287

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 168/321 (52%), Gaps = 38/321 (11%)

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V ++D S+G VI AL ++ ML NSI+IF +DNG        T    N+G N+P RG K T
Sbjct: 2   VYEMDQSIGLVIEALHKRNMLGNSIVIFSTDNGG-----LPTGLGSNYGFNWPLRGFKAT 56

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           LWEGGV+  A +WSP ++++ R+S QMMHI+DWLPTLY+AAGGD SRL + IDG+D    
Sbjct: 57  LWEGGVRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAGGDVSRLGV-IDGID---- 111

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
                             W +L  + PS R+ +L+NID    ++A+ + + K+ L +   
Sbjct: 112 -----------------MWEALCEDLPSPRHEILLNIDSAANSSALIVGNRKVFLTSSFP 154

Query: 491 GTMDGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
             +  +  +      P+   +  +++S+T   L+      F  +  +    + R++  ++
Sbjct: 155 DELRYHRSEVPGGTRPVEGLDEMMLKSRTGTVLRD-----FYKVEKLSIRPNWRKEVVVN 209

Query: 550 CGA-NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
           C   NP     +   N   Y F++  DPCE  N+A+       +L E L  +  T++P +
Sbjct: 210 CSQYNPG---SNFVANSSPYYFDIQRDPCELENLAAVNATEVDELMEKLAAYAATMIPPA 266

Query: 609 HEQPDLVQADPKRFNDTWSPW 629
           ++  D  +  PK  +  W+PW
Sbjct: 267 NKPMD-PRGLPKYNHGLWAPW 286


>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
 gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
          Length = 469

 Score =  163 bits (413), Expect = 2e-37,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 180/365 (49%), Gaps = 48/365 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  D+  +G   IPTPN+D +A  G+  N+ YA   VC PSRASLMTG++  HT ++G  
Sbjct: 42  GIGDIGPYGQTIIPTPNLDKMASTGMKFNDFYAGSTVCAPSRASLMTGQHTGHTLVRGNG 101

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            +  E        + LPE L+E GY+    GKW LG      +P   G++   G+L+ V 
Sbjct: 102 EYPLES-----GRKILPERLKEAGYTNAMFGKWGLGLANTHSSPEKVGWDHFVGHLHHVS 156

Query: 211 SYY------DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           ++Y      D I+  + SR     G               +YA DLF + +++ IE Q  
Sbjct: 157 AHYQKPDSLDCIIEGEISRMPVPKG---------------KYANDLFVENSIEFIETQTD 201

Query: 265 DKPLFLYLAHLAAHA-------------GNAGKHLEAPQETIN--QFQYITDPN-RRTYA 308
           ++P FLYL+    HA               +GK +  P E  +     Y   P  +  YA
Sbjct: 202 EQPFFLYLSLTVPHAELLVPERYFQPFLDESGKSI-FPSEKAHPEGLHYGAQPYPKAAYA 260

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           A+V  +DD VG V+SAL++ G+ EN+++IF SDNG      R   +   + S+  +RGVK
Sbjct: 261 ALVSSIDDYVGRVMSALEKIGIAENTLVIFTSDNGTHVEGGRRMEDVDYFKSSAHFRGVK 320

Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
             L++GG+KVP ++ W  +I++      +     D  PTL   AG +  R   + DG+  
Sbjct: 321 RDLFDGGIKVPLMVNWPGKIKEGSTSDFRGA-FWDLFPTLSHLAGIEVDR--DDTDGISF 377

Query: 428 WSSLL 432
             +LL
Sbjct: 378 LPTLL 382



 Score = 38.9 bits (89), Expect = 7.6,   Method: Compositional matrix adjust.
 Identities = 16/36 (44%), Positives = 24/36 (66%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 48
           G+YA DLF + +++ IE Q  ++P FLYL+    HA
Sbjct: 181 GKYANDLFVENSIEFIETQTDEQPFFLYLSLTVPHA 216


>gi|346992020|ref|ZP_08860092.1| arylsulfatase [Ruegeria sp. TW15]
          Length = 522

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 139/469 (29%), Positives = 213/469 (45%), Gaps = 72/469 (15%)

Query: 78  ALTKSTTLTLLIVY----GWNDL-SFH-GSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
           A+ +S    +L++     GW  L S+H G   I TPN+D LA  G  L + YAQP CT  
Sbjct: 22  AVAQSDKPNILVIMADDVGWASLGSYHQGVKSIQTPNLDQLAAEGARLTDYYAQPSCTAG 81

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           R++ +TG++P+ TGM    + G +P G+   +  L   L+ +GY+T   GK HLG   + 
Sbjct: 82  RSAFITGQFPVRTGMHTVGLPG-DPVGLSDEDPTLANILKSMGYTTGQFGKNHLGDLNK- 139

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL--STAWDTVGEYATD 249
           + P  +GF+ ++G+L   ++  ++     +    E       RN+  S A DTV E    
Sbjct: 140 FLPTAKGFDEYWGWLYH-LNAMEYTTDPDWPDDPEFAAQFGPRNIIHSFATDTVDETVDP 198

Query: 250 LFTKEAVQ-LIEDQPV--------------------------DKPLFLYLAHLAAHAGNA 282
            +     Q +I+D P                           + P F+++A   AH    
Sbjct: 199 RWGPVGKQRIIDDGPAPPERQKTLDDEVTAHTIDFINRAVDSETPFFVWMAPARAHVWT- 257

Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
             HL        +++ +    R     ++K+LDD+VG V + L+  G+ +N+I+IF SDN
Sbjct: 258 --HLSP------EYEAMLGDGRGLQDVVMKELDDNVGKVTAHLEELGIADNTIVIFTSDN 309

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
           G  T+ + +       G   P+ G K T WEGG +VPAI+  P      +V   +    D
Sbjct: 310 GPETMTWPD-------GGTTPFHGEKGTTWEGGFRVPAIIRWPGKVAEGQVLNGIFDGMD 362

Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
           W+PTL  AAGG +  LP +          LL        ++DG +Q S L   TPS R  
Sbjct: 363 WMPTLVAAAGG-SQNLPAD----------LLEGFEGYQVHLDGYNQLSYLTGETPSNRKE 411

Query: 463 VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
           ++    E     A+R + WK     QE     G+ G       PLL FN
Sbjct: 412 IVYY--EGTDLQAIRYNDWKAHFTIQEK----GWAGPKEELNAPLL-FN 453


>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
          Length = 515

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 182/386 (47%), Gaps = 45/386 (11%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGS--------NEIPTPNIDALAYNGIILNNMYAQP 126
            A T    + L+I    GW +   H +        +++ TP +  L   G+ L   YA  
Sbjct: 22  VARTSQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYK 81

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            C+P+R++LM+G+ PIH     P     +  GV L    LPE L  +GY+T  +GKWHLG
Sbjct: 82  FCSPTRSALMSGRLPIHVNEVNPTY--IKGGGVSLNFTMLPEKLSRVGYATHHVGKWHLG 139

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV---ELNGHDMRRNLSTAWDTV 243
                  P  RGF+S  GYL G          D + +TV      G D+  N   A+   
Sbjct: 140 MSSHSRVPTGRGFDSSLGYLLGA--------EDHWKQTVTQGNATGVDLWLNEGPAYLRN 191

Query: 244 GEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
            EY   LF + AV++I D      D PLFLY+     H+       E P+E  +++  I 
Sbjct: 192 NEYGDLLFGRRAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGI- 245

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
             +R     M   +DD++G + +AL    M E+++++++SDNG        +    N G+
Sbjct: 246 QVDRAIVNGMSSVMDDAIGKITTALWETTMWEDTLVVYISDNGG------ASGINANNGN 299

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAGG----D 414
           NYP RG K+T +EGGV+  A++    +  + R S +   MH  DW  TL   AG     D
Sbjct: 300 NYPLRGGKHTDFEGGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADD 359

Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRN 440
              LP  +DG+D W  +  N  S R+
Sbjct: 360 VEGLP-GVDGMDMWPMIANNEKSPRH 384


>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
          Length = 764

 Score =  163 bits (412), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 174/364 (47%), Gaps = 37/364 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           T T L   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI
Sbjct: 65  TPTALFFMGWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAALLTGRLPI 124

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            TG        A  R          G+P +E  LPE L+  GY +K +GKWHLG  R ++
Sbjct: 125 RTGFY---TTNAHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG-HRPQF 180

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G         +  T     T +
Sbjct: 181 HPLRHGFDEWFGAPNCHFGPYDNKARPNIPVYRDWEMVGR-FYEEFPISLKTGEANLTQI 239

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           + +EA+  I+ Q   +P FL+ A  A HA      + A Q  +   Q      R  Y   
Sbjct: 240 YLQEALDFIKRQAGRRPFFLHWAIDATHA-----PVYASQPFLGTSQ------RGRYGDA 288

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG +++ L+  G+  ++ + F SDNGA  +           GSN P+   K T
Sbjct: 289 VREIDDSVGKILALLRALGIGNSTFVFFTSDNGAALISAPRQGKC---GSNGPFLCGKQT 345

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
            +EGG++ PAI W P      +VS Q+  + D   T  + AG    + R    IDGLD  
Sbjct: 346 TFEGGMREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLKPPSDR---EIDGLDLL 402

Query: 429 SSLL 432
            ++L
Sbjct: 403 PTIL 406


>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
          Length = 637

 Score =  162 bits (411), Expect = 3e-37,   Method: Compositional matrix adjust.
 Identities = 102/270 (37%), Positives = 146/270 (54%), Gaps = 22/270 (8%)

Query: 121 NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAI 180
            + + P C  S  +L   +Y IHTG+Q   I   +P  +PL    LP+ LRE GY+T  +
Sbjct: 94  GVRSSPECQGSPDTL---QYQIHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMV 150

Query: 181 GKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
           GKWHLGF+R E  P  RGF+S FG L G   YY H   D    +  + G+D+  N   AW
Sbjct: 151 GKWHLGFYRTECLPTRRGFDSFFGSLLGSGDYYTHYKCD----SPGVCGYDLHENDRAAW 206

Query: 241 D-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
               G Y+T ++T+   +++      KPLFLY+A+ A H+      L+AP   +  ++ I
Sbjct: 207 GRGGGAYSTQMYTQRVQRILASHDPGKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSI 261

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            +  RR YAAM+  LD++V  V  AL+  G  +NS+I++ SDNG               G
Sbjct: 262 ANARRRRYAAMLSCLDEAVANVTLALKAHGFYDNSVIVYSSDNGGQPAAG---------G 312

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           SN+P RG K T WEGGV+    + SP ++Q
Sbjct: 313 SNWPLRGRKGTYWEGGVRAVGFVHSPLLRQ 342


>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
 gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 525

 Score =  162 bits (410), Expect = 4e-37,   Method: Compositional matrix adjust.
 Identities = 117/355 (32%), Positives = 182/355 (51%), Gaps = 43/355 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM-QGP 149
           GW D+ + G+ ++ TPNIDALA  G+  +  Y    +  PSRA ++TG Y    G     
Sbjct: 43  GWGDVGYQGAVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFYNNL 102

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
             W   P G    +  L E +R+ GY+T  +GKWH+     E +P  RGF+  +G+ +  
Sbjct: 103 HPWAKIPEG----QSTLGEMVRDCGYATGFVGKWHMAD-SPEQSPNRRGFDQFYGFWSDT 157

Query: 210 ISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
             YY   D    + Y         +++  L  +    GEY TD FT+EAV+ I D+    
Sbjct: 158 HDYYRSTDKPGVELYDFCPLYRNGEIQPPLHES----GEYITDCFTREAVEFI-DKHASS 212

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISA 324
           P  L L++ A H+       + P+  +N+ +       +R+ +AAMV  LDD +G V+ +
Sbjct: 213 PFLLCLSYNAVHS-----PWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVMES 267

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETS---NYRNWG-----SNYPYRGVKNTLWEGGV 376
           L++ G+ EN++ I +SDNG+P  +  E S    Y++ G     S  P+RG K   +EGG+
Sbjct: 268 LRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGI 327

Query: 377 KVPAIL-WSPQIQQ-----NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           +VP I+ W  ++ Q     NP +SL      D  PT+  A GG TSR   ++DG+
Sbjct: 328 RVPYIMSWPSELPQGMVYDNPVISL------DIFPTVMQAVGG-TSRQKYSLDGV 375


>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_a [Homo
           sapiens]
          Length = 708

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 35  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 95  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369

Query: 429 SSLL 432
            +LL
Sbjct: 370 PTLL 373


>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 167

 Score =  162 bits (410), Expect = 5e-37,   Method: Composition-based stats.
 Identities = 81/163 (49%), Positives = 105/163 (64%), Gaps = 5/163 (3%)

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           LFTK+A+ LI+D    KP+FLY +H A H G+    LEAP   I  F YI + NR  +A 
Sbjct: 1   LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V +LD SVG V+ AL ++GML+NSII+F +DNG         S Y     N+P RG K 
Sbjct: 61  AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGY-----NWPLRGSKF 115

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           TLWEGGV+  A +WSP + +  R+S Q+MHI+DWLPT Y+AAG
Sbjct: 116 TLWEGGVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAG 158



 Score = 60.1 bits (144), Expect = 4e-06,   Method: Composition-based stats.
 Identities = 32/73 (43%), Positives = 41/73 (56%), Gaps = 4/73 (5%)

Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
          LFTK+A+ LI+D    KP+FLY +H A H G+    LEAP   I  F YI + NR  +A 
Sbjct: 1  LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60

Query: 79 ----LTKSTTLTL 87
              L KS  L +
Sbjct: 61 AVFELDKSVGLVM 73


>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
          Length = 522

 Score =  162 bits (410), Expect = 5e-37,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 35  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRAALLTGRLPIR 94

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 95  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369

Query: 429 SSLL 432
            +LL
Sbjct: 370 PTLL 373


>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
          Length = 482

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 120/358 (33%), Positives = 172/358 (48%), Gaps = 43/358 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L++G+ PI +G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFY--- 58

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +ER LP  L+E GY+TK +GKWHLG  R ++ PL  GF+
Sbjct: 59  TTNAHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFD 117

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         +  E+ G    +   N+ T    +    T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKAKPNIPVYKDWEMVGRFYEEFPINVKTGESNL----TQIYLQEA 173

Query: 256 VQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           +  I+  Q   +P FLY A  A HA                  ++    R  Y   V+++
Sbjct: 174 LDFIKRQQAARRPFFLYWAVDATHA-----------PVYASRPFLGSSQRGRYGDAVREI 222

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           DDSVG ++S LQ  G+ E++ + F SDNGA  V           GSN P+   K T +EG
Sbjct: 223 DDSVGKMLSLLQVLGLAESTFVFFTSDNGAALVSAPSQ------GSNGPFLCGKQTTFEG 276

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           G++ PAI W P+     +VS Q+  I D   T    AG    R  + IDG     +LL
Sbjct: 277 GMREPAIAWWPRHIPPGQVSHQLGSIMDLFTTSLALAGLKPPRDRV-IDGFSLLPALL 333


>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
          Length = 468

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 107/349 (30%), Positives = 170/349 (48%), Gaps = 41/349 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS   PTPN+D LA  G++    Y    VC PSRA L+TG+Y    G +   
Sbjct: 37  GYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTTAAVCGPSRAGLLTGRYQQRFGFEENN 96

Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           + G          +  G+PL E+ + +YL +LGY +  +GKWH+G   R Y PL RGF  
Sbjct: 97  VPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGYQSIVLGKWHMGNADR-YHPLKRGFTE 155

Query: 202 HFGYLNGVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
            +G+  G  S+Y       +D+    +E+            +    +Y T      A   
Sbjct: 156 FYGFRGGARSFYPLTQKQAADKPEDRLEIG--------YKKYQEPKKYLTYDLADAACDF 207

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I D+   KP F+Y++  A H+      ++A QE + + + +T   R+  AAM   LD + 
Sbjct: 208 I-DRNKKKPFFMYVSFNAVHS-----PMQATQEDLKKIKGLTG-KRKVLAAMTLALDKAC 260

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G ++  L+ +G+ ++++I+F +DNG P               NYP  G K+   EGG++V
Sbjct: 261 GQILEKLKEEGLEDDTLIVFSNDNGGPD---------GTKTCNYPLSGCKSNNLEGGIRV 311

Query: 379 PAILWSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           P ++  P++ Q P  S +  +   D LPT    AGG   ++   +DG+D
Sbjct: 312 PFVMKLPKVIQ-PGSSYEYPVSTLDLLPTFVNIAGGKADKIA-QLDGVD 358


>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
 gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
 gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
 gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
 gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
           mucopolysaccharidosis type IVA), isoform CRA_c [Homo
           sapiens]
          Length = 522

 Score =  162 bits (409), Expect = 6e-37,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 35  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 95  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369

Query: 429 SSLL 432
            +LL
Sbjct: 370 PTLL 373


>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
          Length = 525

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 175/362 (48%), Gaps = 37/362 (10%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 38  LLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAALLTGRLPIR 97

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           +G     G       P+    G+P +E  LP  L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 98  SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 156

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 157 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 212

Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA++ I+  Q   +P FLY A  A HA                  ++    R  Y   
Sbjct: 213 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDA 261

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           +++LDDSVG ++  L+   + EN+ + F SDNGA  +           GSN P+   K T
Sbjct: 262 IRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALISAPRQ------GSNGPFLCGKQT 315

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P      +VS Q+  I D   T  + AG +  R    IDGLD   +
Sbjct: 316 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 374

Query: 431 LL 432
           +L
Sbjct: 375 ML 376


>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
 gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
          Length = 440

 Score =  161 bits (408), Expect = 9e-37,   Method: Compositional matrix adjust.
 Identities = 110/328 (33%), Positives = 159/328 (48%), Gaps = 30/328 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +G+ +  TP++D LA  G+  N  YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 15  GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P        G+P     LP  L++ GY T  IGKWHLG     Y P   G++  FG  +G
Sbjct: 75  PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPSYGPQRSGYDYFFGNHSG 133

Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
            I Y+ H   + +Q+S  +      + R           Y T +   EA + + E +  D
Sbjct: 134 AIDYFTHKPGVGEQFSPDLWQGSEPVERT---------GYYTYILGDEATRYVHERKGQD 184

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           KP FL L   A H    G   E     I    +    + + Y  +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
              G  +N+I+IF SDNG              +   +P+ G K  L EGG++VP +L WS
Sbjct: 245 DDTGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            +IQ  P+V  Q+    DWLPTL  AAG
Sbjct: 294 ARIQ--PQVQEQVTVSFDWLPTLLAAAG 319


>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 128/432 (29%), Positives = 200/432 (46%), Gaps = 65/432 (15%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
            + A  K   +  +++   G+ D+ F     I +PN D LA  G++LN  Y    C+PSR
Sbjct: 15  CFVATAKLPNIVFVLIDDLGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCSPSR 73

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           ASL+TG++P H     P +      G  +    LP  L+   YST  +GKWHLGFF   Y
Sbjct: 74  ASLLTGRWPHHAHQWNPLM--DSTIGTNINMTMLPAKLKAANYSTHMVGKWHLGFFDPRY 131

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATD 249
            P+ RGF++  G+        DH ++++   +++    N  D R     A++    Y  D
Sbjct: 132 LPINRGFDTSTGFFGCC---EDH-MNEKSGCSIDYWKNNAPDPRNGTYDAYN----YRDD 183

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
           L       ++ +   + PLFLYL     HA       +AP+E ++ +    T   R T  
Sbjct: 184 L-----TDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANSTCEKRHTLQ 233

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGV 367
           AM+   D+  G ++  L++KG+ +N+I++  +DN GAP             GSNYP +G 
Sbjct: 234 AMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPCA-----------GSNYPLKGC 282

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPL 420
           K T +EGGV+  A +    + ++ R   +   +HI+DW PT    AG D     T + P 
Sbjct: 283 KTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAGVDPDDSGTGKFP- 341

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
            +DGLD W  +       ++  I        L  N   RR              A+ + +
Sbjct: 342 -VDGLDVWPIITGENEKTQHEEI-------VLGYNFDFRRPG----------QGAIIVGN 383

Query: 481 WKLVLGTQENGT 492
           +KLV+G QE G 
Sbjct: 384 YKLVVGLQEIGC 395


>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
 gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
          Length = 522

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 35  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 94

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 95  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 256 DAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 311 QTTFEGGMREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369

Query: 429 SSLL 432
            +LL
Sbjct: 370 PTLL 373


>gi|432452061|ref|ZP_19694315.1| arylsulfatase [Escherichia coli KTE193]
 gi|433035723|ref|ZP_20223410.1| arylsulfatase [Escherichia coli KTE112]
 gi|430977211|gb|ELC94062.1| arylsulfatase [Escherichia coli KTE193]
 gi|431545828|gb|ELI20473.1| arylsulfatase [Escherichia coli KTE112]
          Length = 551

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 140/445 (31%), Positives = 214/445 (48%), Gaps = 69/445 (15%)

Query: 82  STTLTLLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
           +  + LL   GW D+ F+G       PTP+IDA+A  G+IL + Y+QP  +P+RA+++TG
Sbjct: 87  NVVIFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTG 146

Query: 139 KYPIHTGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
           +Y IH G+  PP++G +P G+  LT   LP+ L + GY T+AIGKWH+G   +E  P   
Sbjct: 147 QYSIHHGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202

Query: 198 GFESHFGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-W 240
           GF+   G+ N V   Y      H+     LS   S  ++       + H MR     A  
Sbjct: 203 GFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAMRGGEQQAIA 261

Query: 241 DTVGEYATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
           D   +Y  DL   + +  V+ ++     DKP FLY      H  N            N  
Sbjct: 262 DITPKYMEDLDQRWMEYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YPNAK 311

Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
              + P R +Y   + +++D    +  AL++ G L+N++I+F SDNG P  E        
Sbjct: 312 YAGSSPARTSYGDCMVEMNDVFANLYKALEKNGQLDNTLIVFTSDNG-PEAEVPPH---- 366

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
                 P+RG K + WEGGV+VP  + W   IQ  PR S  ++ ++D  PT    AG   
Sbjct: 367 ---GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPG 421

Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
           +++          ++L+  T     + IDG+DQ +S  L T  + N    +     + AA
Sbjct: 422 AKV----------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAA 465

Query: 476 VRLDSWKLVLGTQE--NGTMDGYYG 498
           VR+D +K  +  Q+    T  GY G
Sbjct: 466 VRMDEFKYHVLIQQPYAYTQSGYQG 490


>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 178/367 (48%), Gaps = 46/367 (12%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
            +AA  K   +  ++V  +G+ D+ F     I +PN D LA  G++LN  Y    C+PSR
Sbjct: 15  CFAATAKVPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCSPSR 73

Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           AS +TG++P H     P +      G  L    LP  L+   Y+T  +GKWHLGFF   Y
Sbjct: 74  ASFLTGRWPHHAHQWNPLM--DNMIGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRY 131

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATD 249
            P+ RGF++  G+L G     DH ++++   +++    N  D R     A++    Y  D
Sbjct: 132 LPINRGFDTSTGFLGGG---EDH-MNEKSGCSIDYWKNNAPDPRNGTYDAYN----YRDD 183

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
           L       ++ +   D PLF YL     H       L+AP+E ++ +    T   R    
Sbjct: 184 L-----TDVMNNHNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQ 233

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
           AMV   D+  G ++  L++KGM +N+I++  +DNG               GSNYP +G K
Sbjct: 234 AMVSVADNVTGHLVELLKKKGMWDNTIMVISADNGGAPCS----------GSNYPLKGCK 283

Query: 369 NTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLN 421
           +T +EGG++  A +    + ++ R   +   +HI+DW  T    AG D     T + P  
Sbjct: 284 STFFEGGIRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFP-- 341

Query: 422 IDGLDQW 428
           +DGLD W
Sbjct: 342 VDGLDVW 348


>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
 gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
          Length = 157

 Score =  161 bits (407), Expect = 1e-36,   Method: Composition-based stats.
 Identities = 66/124 (53%), Positives = 90/124 (72%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+ +H   ++ TPNID LA  G++L N Y QP+CTP+R + ++G+YPIHTG+Q   I
Sbjct: 33  GWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFLSGRYPIHTGLQHSNI 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
              EP G+PL    LP+ L++ GYST A+GKWHLGFF +EYTPLYRGF++ FGY +G   
Sbjct: 93  HETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLYRGFDTFFGYYSGSED 152

Query: 212 YYDH 215
           +Y H
Sbjct: 153 HYTH 156


>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
 gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
           1100]
          Length = 443

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 117/347 (33%), Positives = 163/347 (46%), Gaps = 43/347 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT--GMQG 148
           G+ DLS +G  +  TPNID LA  GI   N Y A P+CTP+R + MTG+YP  T  G+  
Sbjct: 42  GYGDLSGYGRKDFLTPNIDKLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPARTPVGLME 101

Query: 149 PPIWGAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
           P         V LT  F  +   +R  GY T  IGKWHLGF  +  +P+  GF+  FG  
Sbjct: 102 PLTPSKRDSTVGLTAAFPSVATLMRASGYETALIGKWHLGFLPQN-SPVKNGFDYFFGIH 160

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
           +G   Y  H       R      HD+  N    +     Y TDLFT++AV  ++ Q  +K
Sbjct: 161 SGAADYISHKTGPAGRRI-----HDLYENDQAVYPE--GYLTDLFTQKAVTFLK-QKHNK 212

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR------TYAAMVKKLDDSVGT 320
           P FL L + AAH          P +  N   Y+   + R       YAAM+ +LD+ +G 
Sbjct: 213 PFFLTLTYNAAHW---------PWQGPNDKPYVDSVDLRIGGSAAVYAAMMNRLDEGIGE 263

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           +I  L  + +   +I+IF +DNG              +  N      K  LWEGG++VPA
Sbjct: 264 IIKTLDEEQLSNQTIVIFTNDNGG-----------EKYSDNGGLAKAKGALWEGGIRVPA 312

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR-LPLNIDGLD 426
           I+  P        + Q+    DW  T+ +A G    +  PL  DG+D
Sbjct: 313 IVKWPGKITAGSSTQQVAITMDWTKTILSAGGAKMDKNFPL--DGID 357


>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
           boliviensis]
          Length = 482

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 120/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G +  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L+E GY TK +GKWHLG  R ++ PL  GF+
Sbjct: 59  TTNAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFD 117

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q   +P FLY A  A HA                  ++    R  Y   V+++D
Sbjct: 174 LDFIKRQARRRPFFLYWAVDATHA-----------PVYASKPFLGTSRRGRYGDAVREMD 222

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DS+G ++  LQ   + +++ + F SDNGA  +   E       GSN P+   K T +EGG
Sbjct: 223 DSIGKMLQLLQDLRVADSTFVFFTSDNGAALISAPEQG-----GSNSPFLCGKQTTFEGG 277

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+   +LL
Sbjct: 278 MREPALAWWPGHITPGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPALL 333


>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
            queenslandica]
          Length = 2318

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 109/351 (31%), Positives = 164/351 (46%), Gaps = 34/351 (9%)

Query: 84   TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
               L+  +G+ D SF     I TPN   L  NG+ILN  Y    C+PSRAS +TG++P H
Sbjct: 1144 VFVLVDDWGFADASFRNP-AIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHH 1202

Query: 144  TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                 P   G     + +T   LP  L+  GY+T  +GKWH G +   Y P+ RGF++  
Sbjct: 1203 VHQWNPTPLGMVGTNINMT--MLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFDTSS 1260

Query: 204  GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
            G+L     +++  +       V+   +D     +   D+       ++ K+   +  +  
Sbjct: 1261 GFLQAEEGHFNQTI----GCAVDFWKNDAPDTRNGTCDSY------IYNKDLTTVFNEHD 1310

Query: 264  VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVI 322
              KPLFLYL     HA      +EAPQE ++ +    T   RR   AMV   D+  G V+
Sbjct: 1311 ASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTGHVV 1365

Query: 323  SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            + L+   M +N++++  +DNG    +          GSNYP +G K T +EGGV+V A  
Sbjct: 1366 NLLRENDMWDNTLMVVSADNGGADCK----------GSNYPLKGSKYTFFEGGVRVIAFA 1415

Query: 383  WSPQIQQNPR--VSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGLDQW 428
                I    R   +   +HI+DW  T    AG    D+ +    +DGLD W
Sbjct: 1416 SGGLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHNDSGKGKFPVDGLDVW 1466


>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 495

 Score =  161 bits (407), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 113/363 (31%), Positives = 173/363 (47%), Gaps = 40/363 (11%)

Query: 76  YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           + A  K   +  ++V  +G+ D+ F     I +PN D LA  G++LN  Y    C PSRA
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           SL+TG++P H       +      G  L     P  L+   YST  +GKWH GFF   Y 
Sbjct: 76  SLLTGRWPHHVYQWN--LATDATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133

Query: 194 PLYRGFESHFGYLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
           P+ RGF++  G+L G   +   + I +  Y +    N  D R          G Y   ++
Sbjct: 134 PINRGFDTSSGFLCGSEDHMTQNAICAIDYWKN---NAPDPRN---------GTYDAYIY 181

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAM 310
             +   +I     D+PLFLYL     H       L+AP+E ++ +   +  ++R T  AM
Sbjct: 182 RDDLTDIINSHNTDEPLFLYLPLHNVHLP-----LQAPKEWLDIYPANSTCDKRHTLQAM 236

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V   D+  G ++  L++KG+ +N+I++  +DNG               GSNYP RG K T
Sbjct: 237 VSVADNVTGHLVELLKKKGLWDNTIMVISADNGGAHCS----------GSNYPLRGCKET 286

Query: 371 LWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGL 425
           L+EGGV+  A +    + ++ R   +   +HI+DW  T    AG    D+     ++DGL
Sbjct: 287 LFEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFSVDGL 346

Query: 426 DQW 428
           D W
Sbjct: 347 DVW 349


>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
 gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
          Length = 440

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 111/340 (32%), Positives = 163/340 (47%), Gaps = 30/340 (8%)

Query: 80  TKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTG 138
           T +    L    GW DL  +G+ +  TP++D LA  G+  N  YA   VC+ +R +L+TG
Sbjct: 3   TPNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITG 62

Query: 139 KYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           +Y   +  G++ P        G+P     LP  L++ GY T  IGKWHLG     Y P  
Sbjct: 63  RYQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQR 121

Query: 197 RGFESHFGYLNGVISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
            G++  FG  +G I Y+ H   + +Q+S  +      + R           Y T +   E
Sbjct: 122 SGYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGNEPVERT---------GYYTYILGDE 172

Query: 255 AVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
           A + + E +  DKP FL L   A H    G   E    +I    +    + + Y  +V+ 
Sbjct: 173 ATRYVHERKDQDKPFFLSLHFTAPHWPWEGPGDEHVARSIKDLFHYDGGSLKKYGEIVEA 232

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD +VG V+ AL   G  +N+I+IF SDNG              +   +P+ G K  L E
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLE 281

Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           GG++VP +L WS +I+  P+V  Q+    DWLPTL  AAG
Sbjct: 282 GGIRVPTLLRWSARIR--PQVQEQVTASFDWLPTLLAAAG 319


>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
          Length = 476

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 120/356 (33%), Positives = 183/356 (51%), Gaps = 48/356 (13%)

Query: 73  RRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
           RR+  A+ K +   ++I++    G++D+   G+++I TP++D LA  G+   N Y AQ V
Sbjct: 18  RRSDEAVEKQSLPNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAV 77

Query: 128 CTPSRASLMTGKYP----IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
           C+ SRA+L+TG Y     IH  +     +G  P      E  + + L+ LGY+T   GKW
Sbjct: 78  CSASRAALLTGVYSNRLGIHGALDHMSRYGLHPE-----EATIADILKPLGYATAMFGKW 132

Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
           HLG +  E+ P  +GF+ + G       + +H  +  Y   + L  +D  + + T W+  
Sbjct: 133 HLGHYP-EFLPTNQGFDEYLGIPYSNDMWPNHPQTKDYYPPLPLYQND--KVIDTIWNDQ 189

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
             + T LFT++++  IE    D+P FLYLAH   H          P    ++F+     N
Sbjct: 190 SMFTT-LFTEKSIDFIERNK-DRPFFLYLAHPMPHV---------PLFVSDKFK--GKSN 236

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           R  Y  ++ ++D SVG ++  L+   + EN+IIIF+SDNG P + Y   S     GS YP
Sbjct: 237 RGLYGDVIMEIDWSVGQIMETLKHYDLDENTIIIFLSDNG-PWLSYSGHS-----GSAYP 290

Query: 364 YRGVKNTLWEGGVKVPAIL-W-----SPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
            R  K T W+GGVKVP I+ W     S  +Q NP +S+      D LPTL    G 
Sbjct: 291 LREGKGTSWDGGVKVPVIMRWKGTWASGLVQPNPAMSI------DILPTLAYLTGA 340


>gi|343086704|ref|YP_004775999.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342355238|gb|AEL27768.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 462

 Score =  160 bits (406), Expect = 1e-36,   Method: Compositional matrix adjust.
 Identities = 130/402 (32%), Positives = 189/402 (47%), Gaps = 49/402 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DL  +G+ ++ T NID LA  G+     YA  PVC+PSRA+LMTGKY  + G+ G  
Sbjct: 41  GYKDLGSYGAKDLVTTNIDRLAEEGVKFTQFYAAAPVCSPSRAALMTGKYNFNAGLFGNV 100

Query: 149 -PPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
            PP    E + G+P  E  + E  +  GY T  IGKWHLG    E  P  +GF+  FG+ 
Sbjct: 101 SPPHSDPEGKSGLPTEEITMAEMFKSNGYRTGLIGKWHLGH-TPEKLPNGQGFDYFFGHQ 159

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
            G I  + H     Y     L  HD+ RN +  +   GE+  DL  +E  Q I ++  ++
Sbjct: 160 RGCIDNFSHFF---YWHGPNL--HDLYRNETEIYRP-GEFFGDLMVQEMKQFISEK-SEE 212

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
           P F Y A    H    G       + ++ +  +  P R+ YAA V  +D+ VG +   L+
Sbjct: 213 PFFTYWAINMPHYPYQGD-----PKWLDYYADLESP-RKEYAAFVSTIDEMVGEIYHHLK 266

Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
             G LEN+II++ SD+G  +VE R    +   G   P  G K ++ EGG++VPAI+  P 
Sbjct: 267 AIGELENTIIVYQSDHGY-SVEERA---FFGGGDAGPLSGAKFSMLEGGIRVPAIIRFPS 322

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
           +           +  DW+PTL    G                    +  P+ +  NIDG 
Sbjct: 323 MLPKDEARHHWANSIDWMPTLAELTG--------------------IKIPASQ--NIDGK 360

Query: 447 DQWSSLL-LNTPSRRNSVLINI---DEKKRTAAVRLDSWKLV 484
                +L  N PS   ++       + KK   AVR   WKL+
Sbjct: 361 SLLPLILDANVPSPHETMFWGTGDPENKKHAWAVRKGPWKLL 402


>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
           mulatta]
          Length = 503

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 16  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 75

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 76  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 131

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 132 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 187

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 188 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 236

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDSVG ++  L    + +N+ + F SDNGA  +   E +     GSN P+   K
Sbjct: 237 DAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGK 291

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D   T    AG    R    IDGL+  
Sbjct: 292 QTTFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAGLTPPR-DRAIDGLNLL 350

Query: 429 SSLL 432
            +LL
Sbjct: 351 PALL 354


>gi|338213632|ref|YP_004657687.1| arylsulfatase [Runella slithyformis DSM 19594]
 gi|336307453|gb|AEI50555.1| Arylsulfatase [Runella slithyformis DSM 19594]
          Length = 535

 Score =  160 bits (406), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 128/415 (30%), Positives = 195/415 (46%), Gaps = 78/415 (18%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           T AA   +    L+   G++D+  +G  E+ TPNID +A NGI L + Y    C P+RAS
Sbjct: 37  TQAAKRPNILYILVDDMGFSDIGCYGG-EVNTPNIDQMAANGIKLRSFYNNARCCPTRAS 95

Query: 135 LMTGKYPIHTGMQ---GPPIWGAEPRGVP--LTERF--LPEYLRELGYSTKAIGKWHLGF 187
           L+TG+YP   GM      P    +P      L +R+  + E L++ GY T  +GKWH+G 
Sbjct: 96  LLTGQYPHTVGMGLMVTMPNAAIQPGSYQGFLDDRYPTIAEQLKKTGYHTYMLGKWHVG- 154

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
            R ++ PL RGF+++FG ++G  SYY+ I +++  R + L+  +          + G Y 
Sbjct: 155 ERPQHWPLKRGFDNYFGLISGASSYYEIIPAEKGKRFMVLDDKEFTP------PSDGFYV 208

Query: 248 TDLFTKEAVQLIEDQ---PVDKPLFLYLAHLAAH----------------------AGNA 282
           TD FT  AVQ +  Q     DKP F+YLA+ A H                         +
Sbjct: 209 TDAFTDYAVQYLNKQKQEAADKPFFMYLAYTAPHFPLHAYESDIAKYEKLYAQGWDVTRS 268

Query: 283 GKHLEAPQ--------------ETINQFQYITDPNR-----RTYAAMVKKLDDSVGTVIS 323
            ++ +  Q              + I  ++  TD +R       YAAM+ ++D ++G +I 
Sbjct: 269 KRYQKMKQLGFIDKRYPLTPRPKDIPDWESATDKDRWIRKMAVYAAMIDRMDQNIGRLIK 328

Query: 324 ALQRKGMLENSIIIFMSDNGA------------PTVEYRETSNYRNWG------SNYPYR 365
            L+  G  EN++I+FMSDNG+            PT +  +  +Y  +       SN P+R
Sbjct: 329 TLKANGQYENTLIVFMSDNGSSNENMESRKLNDPTKKIGDKGSYVTYDTPWANVSNTPFR 388

Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
             K  L EGG+  P I+ W   IQ        + H+ D LPT    AG   + LP
Sbjct: 389 KYKRFLHEGGMITPCIMQWPRTIQPKAGFVDGIGHVMDLLPTGLELAGIPDNALP 443


>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
           gorilla]
          Length = 528

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 48  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 104

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 105 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 163

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 164 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 219

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q    P FLY A  A HA                  ++    R  Y   V+++D
Sbjct: 220 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 268

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K T +EGG
Sbjct: 269 DSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 323

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+   +LL
Sbjct: 324 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPTLL 379


>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
          Length = 520

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 122/365 (33%), Positives = 173/365 (47%), Gaps = 40/365 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 91  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           + +EA+  I  Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 206 YLQEALDFIRTQHARQGPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG +++ LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 255 AVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
           T +EGG++ PAI W P      +VS Q+  I D   T  + AG    + R+   IDGLD 
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366

Query: 428 WSSLL 432
             ++L
Sbjct: 367 LPTML 371


>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
           (Silurana) tropicalis]
          Length = 520

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 163/577 (28%), Positives = 234/577 (40%), Gaps = 145/577 (25%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL   G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 32  LLLMDDMGWGDLGVFGEPSRETPNLDKMASEGMLFPNFYTANPLCSPSRAALLTGRLPIR 91

Query: 144 TGMQGP-----------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            G                I G    G+P +E   PE L++ GY  K IGKWHLG  R  Y
Sbjct: 92  NGFYTTNSHARNAYTPQEIMG----GIPDSEVLFPELLKKAGYVNKIIGKWHLG-HRPSY 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATD- 249
            PL RGF+  FG  N     YD               +  R N+     WD VG Y  D 
Sbjct: 147 HPLRRGFDEWFGSPNCHFGPYD---------------NKQRPNIPVYRDWDMVGRYYEDF 191

Query: 250 -------------LFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQ 295
                        ++ +EA++ I  Q  D +P FLY +  A HA                
Sbjct: 192 NIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDATHA-----------PVYAS 240

Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
             ++    R  Y   V+++D S+G ++  L +K +++++ + F SDNGA  +     S  
Sbjct: 241 RPFLGTSRRGLYGDAVREIDFSIGKILGLLNQKDIIKDTFVFFTSDNGAALI-----SAP 295

Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
              GSN P+   K T +EGG++VP I W P      +VS Q+  I D   T  + AG   
Sbjct: 296 MQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLAG--- 352

Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
                            +  P  R   IDG+D    +L +  + R       +E     A
Sbjct: 353 -----------------VEIPKDR--AIDGIDLSPVILESKETERPIFYYRGNE---LMA 390

Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPIS 534
           VRL   K    T  N                              S ++LSQ I F P  
Sbjct: 391 VRLGQHKAHYWTWSN------------------------------SWEELSQGINFCPGQ 420

Query: 535 NIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS---S 591
           NI  + +  Q+       NP              LF+LG DP E+ +I+    +     S
Sbjct: 421 NITGVTTHVQEEH---SINP-------------LLFHLGRDPGEKYSISVLSTEYEAALS 464

Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           ++  +++ HR ++VP    +P L   D    N  WSP
Sbjct: 465 RISPVVQQHRNSMVPG---EPQLNVCDLAVMN--WSP 496


>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
           paniscus]
          Length = 482

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 59  TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 117

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q    P FLY A  A HA                  ++    R  Y   V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K T +EGG
Sbjct: 223 DSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 277

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+   +LL
Sbjct: 278 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPTLL 333


>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 500

 Score =  160 bits (405), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 112/352 (31%), Positives = 167/352 (47%), Gaps = 36/352 (10%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
              L+  +G+ D SF     I TP+   L  NG+ILN  Y    C+PSRAS +TG++P H
Sbjct: 31  VFVLVDDWGFADASFRNP-AIKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHH 89

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                P   G     + +T   LP  L+  GYST  +GKWH G +   Y P+ RGF++  
Sbjct: 90  VHQWNPTPPGMVGTNINMT--MLPAKLKTAGYSTHMVGKWHQGLYDPAYLPVNRGFDTSS 147

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQ 262
           G+L     +++  +             D  +N   A DT  G Y + ++ K+   +    
Sbjct: 148 GFLQAGEGHFNQTIGCAV---------DFWKN--HAPDTRNGTYDSYIYNKDLTTIFSKH 196

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTV 321
              KPLFLYL     HA      +EAPQE ++ +    T   RR   AMV   D+  G V
Sbjct: 197 DASKPLFLYLPLHNVHAP-----IEAPQEWLDLYPVNSTCATRRVLQAMVSVADNVTGHV 251

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           ++ L+   M +N+I++  +DNG    +          GSNYP +G K+T +EGGV+V A 
Sbjct: 252 VNLLKENDMWDNTIMVISADNGGADCK----------GSNYPLKGSKHTFFEGGVRVIAF 301

Query: 382 LWSPQI--QQNPRVSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGLDQW 428
                I   +  + +   +H++DW  T    AG    D+ +    +DGLD W
Sbjct: 302 ASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHNDSGKGKFPVDGLDVW 353


>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
 gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
           CL02T12C19]
          Length = 483

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 125/374 (33%), Positives = 186/374 (49%), Gaps = 49/374 (13%)

Query: 75  TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
           T A  TKS     + +Y    G++D+S +G     TPNIDALA  GI   + YA  P+ +
Sbjct: 24  TEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISS 83

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           PSRA  +TG+YP   G+QG   +     G+   E  + E L+  GY+T  IGKWHLG  R
Sbjct: 84  PSRAGFLTGRYPARMGIQG-VFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SR 141

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
            +Y PL +GF+ +FG     I Y + + +  Y R  E+    +  N            T 
Sbjct: 142 EKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRGNEVEEFHIDIN----------NVTK 186

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
            +T+EAV  I  +  D+P FL+LAH   H          P    ++F   +      Y  
Sbjct: 187 KYTEEAVDYIR-RKADQPFFLFLAHSMMH---------VPIYVSDEFAGKSGAG--IYGD 234

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V ++D SVG ++  L+  G+ +N++++F SDNG P ++          G   P R  K 
Sbjct: 235 AVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLG-----GRALPLREGKT 288

Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQ 427
           T +EGGV+VP I  W  QI+  P V+  ++ + DW PT+   +GG    LP + +DG D 
Sbjct: 289 TAFEGGVRVPCIAYWKGQIK--PVVNTDVVSLLDWFPTVTALSGG---ILPDVRLDGYDL 343

Query: 428 WSSLLLNTPSRRNS 441
             + +LN   +R S
Sbjct: 344 --TAVLNGTGKRAS 355


>gi|431797495|ref|YP_007224399.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788260|gb|AGA78389.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 477

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 112/330 (33%), Positives = 166/330 (50%), Gaps = 27/330 (8%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
           +L+++    G  D + +GS ++ TPN+D +A+ G      YA  PVC+PSRASLMTGK+ 
Sbjct: 44  ILVIFADDQGTFDTNAYGSKDLETPNLDRIAHEGTRFTQFYAAAPVCSPSRASLMTGKFN 103

Query: 142 IHTGMQG---PPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
            H  + G   PP    E + G+P  E  + E  +  GY T  IGKWHLG    E  P  +
Sbjct: 104 HHARLFGNVAPPHADPEFKSGLPTDEITMAEIFKADGYKTGLIGKWHLGH-SPEKLPNGQ 162

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           GF+  FG+  G I  + H            N HD+ RN    +   G++  DL   E  +
Sbjct: 163 GFDYFFGHQRGCIDNFSHFFF-----WAGPNMHDLYRNEEEVY-YPGQFFGDLMVDEVKK 216

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
               +  D P F+Y A    H    G+      + +  ++ +  P RR YAA V   D+ 
Sbjct: 217 FTAPETED-PFFVYWAINMPHYPYQGR-----PKWLEHYKDLPSP-RREYAAFVSTYDEM 269

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
           +G V+  L+  G L+N+II++ SD+G  T    E   +   G+N  Y G K  + EGG++
Sbjct: 270 IGEVLDHLKATGQLDNTIIVYQSDHGHST----EERAFFGGGNNGVYHGAKFCMLEGGIR 325

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
           VPA++  P+     +V  QM +  DWLPTL
Sbjct: 326 VPAMIRYPKAVPANQVRNQMANSIDWLPTL 355


>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
 gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
          Length = 476

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 120/377 (31%), Positives = 182/377 (48%), Gaps = 62/377 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL F G   I TPNID LA  G+  ++ Y+   VC PSR+S +TG +  HT ++G  
Sbjct: 30  GYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSSFLTGMHTGHTPIRGNS 89

Query: 151 IWGAEPRG-VPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +P G  P+ +  L   + +++ GY T A GKW LGF      P  +GFE  +GY  
Sbjct: 90  E--VQPEGQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGSTGEPASQGFEEFYGY-- 145

Query: 208 GVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
                Y H     Y       ++L G+D        W T G+YA DL  ++ ++ I D+ 
Sbjct: 146 -NCQRYAHRYYPAYLWHNDEKIDLPGND--------WTTKGDYAPDLIQEKTLEFI-DEN 195

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR------------------ 305
            + P FL++  +  HA      L AP + I +      P+ +                  
Sbjct: 196 KENPFFLFMPIVTPHA-----ELAAPDDEIMEKYRAMFPDEKPYVAGNGADYGPDMRIPG 250

Query: 306 ---------TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNY 355
                    T+AAMV+++D  VG V+  L+  G+ EN++IIF SDNGA    +RE  ++ 
Sbjct: 251 YQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGA----HREGGADP 306

Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
             + SN  +RG K  L+EGGV+ P I W P   +    +  +    D LPT + A GG+ 
Sbjct: 307 EFFDSNGKFRGFKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPT-FAAIGGE- 364

Query: 416 SRLPLNIDGLDQWSSLL 432
            ++  +IDG+    +LL
Sbjct: 365 -KITKSIDGISFLPTLL 380


>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
          Length = 314

 Score =  160 bits (404), Expect = 2e-36,   Method: Compositional matrix adjust.
 Identities = 104/331 (31%), Positives = 168/331 (50%), Gaps = 35/331 (10%)

Query: 88  LIVYGWNDLSFHGSN--------EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGK 139
           + V G+ D+S+             IPTP+ID LA +G+ L + Y   +C+P+R SL++G+
Sbjct: 4   VAVLGFTDVSYKNQTYPDVAFGPMIPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGR 63

Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
           Y    GM    I    P  +PLT + + E+L+E G++T A GKW  G      TP  RGF
Sbjct: 64  YAYTLGMNAEVIVDGHPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGF 123

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
           ++  G+ N    Y+ H + +     ++L  HD   +L+    T G Y T+L T    Q I
Sbjct: 124 DTFSGFYNAFNDYFTHRVGNY----LDLR-HDFAPDLADDNHT-GVYMTELLTSRVQQFI 177

Query: 260 EDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT----DPNRRTYAAMVKKL 314
                 ++  F Y+AH A HA N     + P   +  +   T     P RR    M++ +
Sbjct: 178 TTAAETEQSTFAYVAHQAVHAPN-----QVPMSYLEGYCLDTIPEDRPTRRILCGMMRAV 232

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D+SV  + +  ++ G+  ++++IF +DNG          N    GSNYP RG K T +EG
Sbjct: 233 DESVRNITATYKQLGLWNDTVLIFTTDNGG---------NPETGGSNYPLRGQKATTFEG 283

Query: 375 GVKVPAILWSPQIQQNPR--VSLQMMHISDW 403
           G++    + SP + ++ R  +S +++H+SDW
Sbjct: 284 GMRGVGFVNSPLLNESQRGFISDELIHVSDW 314


>gi|432943494|ref|ZP_20140329.1| arylsulfatase [Escherichia coli KTE196]
 gi|433045335|ref|ZP_20232807.1| arylsulfatase [Escherichia coli KTE117]
 gi|431466713|gb|ELH46730.1| arylsulfatase [Escherichia coli KTE196]
 gi|431551968|gb|ELI25931.1| arylsulfatase [Escherichia coli KTE117]
          Length = 551

 Score =  160 bits (404), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 139/440 (31%), Positives = 213/440 (48%), Gaps = 69/440 (15%)

Query: 87  LLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LL   GW D+ F+G       PTP+IDA+A  G+IL + Y+QP  +P+RA+++TG+Y IH
Sbjct: 92  LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIH 151

Query: 144 TGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
            G+  PP++G +P G+  LT   LP+ L + GY T+AIGKWH+G   +E  P   GF+  
Sbjct: 152 HGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDF 207

Query: 203 FGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-WDTVGE 245
            G+ N V   Y      H+     LS   S  ++       + H +R     A  D   +
Sbjct: 208 RGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266

Query: 246 YATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
           Y  DL   + +  V++++     DKP FLY      H  N            N     + 
Sbjct: 267 YMEDLDQRWMEYGVKILDKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSS 316

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
           P R +Y   + +++D    +  AL++ G L+N++I+F SDNG P  E             
Sbjct: 317 PARTSYGDCMVEMNDVFANLYKALEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GR 368

Query: 362 YPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
            P+RG K + WEGGV+VP  + W   IQ  PR S  ++ ++D  PT    AG   +++  
Sbjct: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKV-- 424

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
                   ++L+  T     + IDG+DQ +S  L T  + N    +     + AAVR+D 
Sbjct: 425 --------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470

Query: 481 WKLVLGTQE--NGTMDGYYG 498
           +K  +  Q+    T  GY G
Sbjct: 471 FKYHVLIQQPYAYTQSGYQG 490


>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
 gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
          Length = 553

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 126/382 (32%), Positives = 178/382 (46%), Gaps = 70/382 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--- 147
           G+ND+S  G   +PTPNID++A  G    + Y+    C PSRA +MTG+Y   TG +   
Sbjct: 82  GFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTP 141

Query: 148 -------------------------------GPPIWGAEPRGVPLTERFLPEYLRELGYS 176
                                           PP      +G+P +E  L E L+  GY 
Sbjct: 142 TPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPF---REQGLPGSEITLAEALKPKGYH 198

Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---- 232
              IGKWHLG    E+ P  +GF+      +G+    D    D  +  +  +  D     
Sbjct: 199 NIHIGKWHLGN-APEFLPNAQGFDESVMLESGLFLPEDS--PDVVNAKLPFDPIDQFLWA 255

Query: 233 RRNLST-----AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
           R   +T     AW     Y TD +T EA++ IE    ++P FLYLAH   H       L+
Sbjct: 256 RMQYATSYNGSAWFEPKGYLTDFYTDEAIKAIEAN-RNRPFFLYLAHWGVHT-----PLQ 309

Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
           A +   +   +I D   R YAAM+  LD SVG V+ +L+  G+ EN+++IF SDNGAP  
Sbjct: 310 ASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPG- 368

Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPT 406
            Y    +      N PYRG K T +EGG++VP    W  +I      +  + H+ D  PT
Sbjct: 369 -YIGLPDV-----NKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL-DMFPT 421

Query: 407 LYTAAGGDTSRLPLN--IDGLD 426
           +  AAGG+   LP +  IDG+D
Sbjct: 422 IVAAAGGE---LPADRVIDGID 440



 Score = 44.3 bits (103), Expect = 0.20,   Method: Compositional matrix adjust.
 Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 6/74 (8%)

Query: 6   STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
            +AW     Y TD +T EA++ IE    ++P FLYLAH   H       L+A +   +  
Sbjct: 265 GSAWFEPKGYLTDFYTDEAIKAIEAN-RNRPFFLYLAHWGVHT-----PLQASKADYDAL 318

Query: 66  QYITDPNRRTYAAL 79
            +I D   R YAA+
Sbjct: 319 SHIEDERLRVYAAM 332



 Score = 40.8 bits (94), Expect = 2.2,   Method: Compositional matrix adjust.
 Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 4/53 (7%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH----RRTLVPQSHEQPDLV 615
            +LFNL  DP EQNN+A   P+  ++L  L++ H    R  L P   E P  V
Sbjct: 482 TWLFNLKTDPTEQNNVADENPEKVAELKALVEAHNATQREPLFPAVAEMPVTV 534


>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
          Length = 1591

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 114/364 (31%), Positives = 173/364 (47%), Gaps = 34/364 (9%)

Query: 84   TLTLLIVYGWNDLSFHGSNEIPTPN-IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
             L  +   G NDL    S ++P P  +  L  +G+ L + Y Q +C+P+RA+LMTGK+  
Sbjct: 1098 VLVFMDDVGLNDL--WRSTDLPKPTEMSKLVRDGVELTSYYGQSLCSPARATLMTGKFAH 1155

Query: 143  HTGM---QGP--PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
              G    QG    +       VPL    LP+ ++ LGY T AIGKW++G    +Y P  R
Sbjct: 1156 KIGFSDQQGGVREVTAYSNFSVPLGHDMLPQGMKRLGYQTHAIGKWNIGHCNVKYMPWQR 1215

Query: 198  GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
            GF++  GY    I Y DH+ SD  + T  +N   +  N S   +  G Y T LFT+ A +
Sbjct: 1216 GFDTFVGYFTDGIGYTDHV-SDT-ANTYTVNDGGLAFNGS---EYEGTYTTALFTERAEK 1270

Query: 258  LIEDQPVDKPLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            ++ D P D PLF++LA+   H       A    +  +   + I   +      R  +   
Sbjct: 1271 VLHDAPEDAPLFMWLAYHGMHDNDGVTDADTCAQTGDDDDDAIFMERGHLPFARFNFGCG 1330

Query: 311  VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
            ++ +D  VG V +AL  +    + +++  SDNG           Y   GSNYPYRG K  
Sbjct: 1331 LRAIDRGVGQVRTALDDRA--RDYVLVVTSDNGG------FACGYHCQGSNYPYRGAKFY 1382

Query: 371  LWEGGVKVPAILWSPQIQQNPRVSLQ---MMHISDWLPTLYTAAGGDTSRLPL---NIDG 424
             +EGG+++P +++SP      R       +MH  DW  T     GGD + L     + D 
Sbjct: 1383 EFEGGLRLPGVVYSPSRVPARRRGAAYDGLMHHVDWFGTFVHLGGGDVAALTTFDSSFDT 1442

Query: 425  LDQW 428
            ++ W
Sbjct: 1443 MNHW 1446


>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
 gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
 gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
          Length = 459

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 45/362 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPNID +A  G+ L   Y    V TPSRA+LMTG+ P+  G+ G  
Sbjct: 42  GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           +    P    G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+++FG   
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
             I Y + +        V+  G   R   ST     G+           T  +T++AV  
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPSTPLILDGKQIESEPDQGELTRRYTEKAVSF 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I+D    +P FLY AH   H          P  T  +F+  +   R  Y  +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V+ AL+  G+ EN+ +IF SDNG    E      + N GS  P +  K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           PAI W P  + NP ++ ++M   D  PT  + AG +  +  L +DG++Q   L     S 
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369

Query: 439 RN 440
           R+
Sbjct: 370 RD 371


>gi|241574524|ref|XP_002403353.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215500210|gb|EEC09704.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 255

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 104/283 (36%), Positives = 151/283 (53%), Gaps = 37/283 (13%)

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV  LD+S+G V  AL   G+LEN+II+  SDNG   + ++      N G N+P RG K 
Sbjct: 1   MVDALDESIGEVFQALGDAGILENTIIVLSSDNGGLPIGFQS-----NRGFNFPLRGGKG 55

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           TLWEGG +    +WSP +++    S QMMH++DWLPTLY+AAG D    PL++       
Sbjct: 56  TLWEGGCRATGFIWSPLLKRKGVASDQMMHVTDWLPTLYSAAGED----PLSL------- 104

Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
                        +DG D W+ L L   S R  +L NID     +A+R  ++KLVLG   
Sbjct: 105 -----------GKLDGTDMWAPLSLGYASPRYEILYNIDPINNISALRYTNYKLVLGRTN 153

Query: 490 NGTMDGYYGQTRSNKVPLLNFNAI-VESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
           +G  D  +  T  ++ P  + + + V+S   + L+ L     L   +       R++AT+
Sbjct: 154 DGQYDPRFLTTGGSR-PYDDLDLLMVQSTAARVLRDLRNKGDLQFPD-----QWREKATL 207

Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
            CG N  P   +  +  P YLF+L +DPCE NNIASS+  ++S
Sbjct: 208 DCG-NKVP--KNFVSLDPPYLFDLASDPCEINNIASSQAGVTS 247


>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 459

 Score =  159 bits (403), Expect = 3e-36,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 45/362 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPNID +A  G+ L   Y    V TPSRA+LMTG+ P+  G+ G  
Sbjct: 42  GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           +    P    G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+++FG   
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
             I Y + +        V+  G   R   ST     G+           T  +T++AV  
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPSTPLILDGKQIESEPDQGELTRRYTEKAVSF 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I+D    +P FLY AH   H          P  T  +F+  +   R  Y  +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V+ AL+  G+ EN+ +IF SDNG    E      + N GS  P +  K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           PAI W P  + NP ++ ++M   D  PT  + AG +  +  L +DG++Q   L     S 
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369

Query: 439 RN 440
           R+
Sbjct: 370 RD 371


>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
 gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
           maris DSM 8797]
          Length = 501

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 127/455 (27%), Positives = 192/455 (42%), Gaps = 99/455 (21%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM---- 146
           G+ DL   GS EI TP++D LA  G  L + Y   P CTPSR SL+TG+YP   G+    
Sbjct: 49  GYRDLGSFGSEEIMTPHLDRLAKEGAKLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMI 108

Query: 147 -QGPPIWGAEPR------------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
               P +G + +            G+ + E+ LP  L+  GY +   GKW LG  +R + 
Sbjct: 109 RNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FL 167

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           PL RGF+  +G+ N  I Y+ H   ++Y          M RN     +  G Y T LF +
Sbjct: 168 PLARGFDDFYGFTNTGIDYFTH---ERYGVP------SMYRNNQPTEEDKGTYCTYLFQR 218

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQETINQFQYITD------- 301
           EAV+ I++    KP FLYL   A H  ++         +AP++  N + ++ D       
Sbjct: 219 EAVRFIKENH-QKPFFLYLPFNAPHGASSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKK 277

Query: 302 ----------------------PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
                                   R  Y A +  +DD++G V+  L    + +N+I++F 
Sbjct: 278 TGRYEFRERPDGPVIHQGVSASKRRLEYVASITCMDDAIGEVLGLLDEYQIADNTIVVFF 337

Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
           SDNG                 N P +G K  ++EGG++VP ++  P   +   V+ +++ 
Sbjct: 338 SDNGG-----------SGGADNSPLKGKKGMMFEGGIRVPCLVRYPAKIKPGTVNDELLT 386

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
             + +PT    A                        P   N  IDG D    L+  T S 
Sbjct: 387 SLELVPTFLKEAA----------------------IPLPENVVIDGYDMLPVLMGKTTSP 424

Query: 460 RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
           RN +     E K   A R+  WK V   + +G  D
Sbjct: 425 RNEMYWQRREDK---AARVGHWKWVESEKGSGLFD 456


>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
          Length = 482

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 59  TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 117

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q    P FLY A  A HA                  ++    R  Y   V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DSVG ++  L    + +N+ + F SDNGA  +   E +     GSN P+   K T +EGG
Sbjct: 223 DSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGKQTTFEGG 277

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+   +LL
Sbjct: 278 MREPALAWWPGHIAAGQVSHQLGSIMD-LFTTSLALAGLTPPRDRAIDGLNLLPALL 333


>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
          Length = 616

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 118/383 (30%), Positives = 182/383 (47%), Gaps = 64/383 (16%)

Query: 85  LTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
             L+   G ND+ +  ++  E+ TP +D+L+  G+ L   Y   +CTPSRASLMTG+   
Sbjct: 124 FILIDDVGTNDIGYQSTDLWEL-TPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGRDTF 182

Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
            TGMQ   +  +   G+PL E  L E  +         GKWHLG +   + P  RGF++ 
Sbjct: 183 RTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGFDTF 234

Query: 203 FGYLNGVISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDT-------VGEYAT 248
            GY+  V  Y  H           +YS   +    D    ++   +T       VG Y+T
Sbjct: 235 LGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGNYST 294

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-------D 301
            + T  A+++ ++   D P FLY++H A H+         P +   + ++ T       D
Sbjct: 295 TIITDRAIEVAKEHGED-PFFLYVSHQAVHSPVD----PPPDDYFTEEEHATLARVVSED 349

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGS 360
             R+ +A ++  LD  +  +   L   G L+N++++  SDNGA PT            GS
Sbjct: 350 AYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTA----------GGS 399

Query: 361 NYPYRGVKNTLWEGGVKVPAILWS------PQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
           NYP RG K+T++EGGV+VPA ++S      P+  +  R S  MMH +DW PT     G  
Sbjct: 400 NYPLRGYKHTIFEGGVRVPAFVYSKSTDLIPEEARGTRYS-GMMHSTDWTPTF----GSV 454

Query: 415 TSRLPLN-----IDGLDQWSSLL 432
              LPL      + G D W +++
Sbjct: 455 VPDLPLGEMGKELSGFDHWDAII 477


>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 494

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 112/354 (31%), Positives = 170/354 (48%), Gaps = 42/354 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
             L+  +G+ D+ F     I +PN D LA  G++LN  Y    C+PSRASL+TG++P HT
Sbjct: 27  FVLVDDWGFADVGFKNP-AISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRWPHHT 85

Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
                 +      G  L    +P  L+   Y+T  +GKWH GFF   Y P+ RGF++  G
Sbjct: 86  HQWN--LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFDTSSG 143

Query: 205 YLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +L G   +   + I +  Y +    N  D R          G Y   ++  +   +I   
Sbjct: 144 FLCGSEDHMTQNAICAIDYWKN---NAPDPRN---------GTYDAYIYRDDLTDIINSH 191

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTV 321
             ++PLFLYL     H       L+AP+E ++ +    T   R T  AM+   D+  G +
Sbjct: 192 NTNEPLFLYLPLHNVHLP-----LQAPKEWLDIYPANSTCEKRHTLQAMISVADNVTGHL 246

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +  L++KG+ +N+I++  +DNG               GSNYP RG K TL+EGGV+  A 
Sbjct: 247 VELLKKKGIWDNTIMVISADNGGAHCS----------GSNYPLRGCKETLFEGGVRSLAF 296

Query: 382 LWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLNIDGLDQW 428
           +    + ++ R   +   +HI+DW  T    AG D     T + P  +DGLD W
Sbjct: 297 VNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFP--VDGLDVW 348


>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
 gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
          Length = 486

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 118/351 (33%), Positives = 180/351 (51%), Gaps = 44/351 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND+   GS +I TPN+D +A  G+  +N Y AQ VC+ SRA+L+TG Y    G+ G  
Sbjct: 52  GYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLTGTYSNRLGIHGA- 110

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           +  +   G+   E  + E L+ LGY T   GKWHLG    E+ P+ +GF+ +FG      
Sbjct: 111 LDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLGH-HPEFLPVQQGFDEYFGIPYSND 169

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---YATDLFTKEAVQLIEDQPVDKP 267
            + +H  +  Y   + L  +D         DT+       T + T++++  IE    + P
Sbjct: 170 MWPNHPETKNYYPPLPLYEND------EVVDTLHHDQSMLTTVLTEKSLDFIERNK-ETP 222

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            FLYLAH   H          P    ++F+  ++  R  Y  ++ ++D SVG V+ +L+ 
Sbjct: 223 FFLYLAHPMPHV---------PLFVSDKFEGKSE--RGLYGDVIMEIDWSVGQVLESLKS 271

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
           KG+ EN+++IF+SDNG P + Y   +     GS YP +  K T W+GGVK+P+I+ W  +
Sbjct: 272 KGLEENTLVIFLSDNG-PWLSYSLHA-----GSAYPLKEGKGTSWDGGVKIPSIMQWKGK 325

Query: 387 I-----QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           I     Q NP +S+      D LPT+    G     LP  IDG   W  +L
Sbjct: 326 IPEGKVQTNPAMSI------DILPTIAEITGAKLPELP--IDGKSIWKMIL 368


>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
          Length = 467

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 122/378 (32%), Positives = 186/378 (49%), Gaps = 47/378 (12%)

Query: 71  PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
           P+ R   A   +  L +    G  +L  +G  +I TP+ID LA  G      Y+  PVC 
Sbjct: 16  PSGRAADAPKPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCA 75

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLG 186
           PSR  LMTGK+  H  ++      A+P G   +   +  + + L+  GY+T A+GKW LG
Sbjct: 76  PSRCVLMTGKHSGHATVRNN--VEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLG 133

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY--SRTVELNGHDMRRNLSTAWDTVG 244
            F    +PL  GF+  FGY N     + H  +  Y   + VEL G+D +        T  
Sbjct: 134 MFDTAGSPLKHGFDLFFGY-NCQRHAHSHYPTYIYRNDKRVELKGNDGK--------TGK 184

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDP 302
           ++  DLF +EA+  IE     KP FLYL     H       ++ P++++N++  Q   DP
Sbjct: 185 QFTQDLFEEEALGFIEANKA-KPFFLYLPFTVPHVA-----VQVPEDSLNEYKGQLGDDP 238

Query: 303 N-------------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT--V 347
                            YAAMV ++D SVG V+  L   G+ +N++++F SDNG PT  V
Sbjct: 239 AYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNG-PTHNV 297

Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
              ++S + + G     RG+K +++EGG++VP I + P   +    S   ++  D LPTL
Sbjct: 298 GGADSSFFNSAGK---LRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTL 354

Query: 408 YTAAGGDTSRLPLNIDGL 425
              AG   ++ P  IDG+
Sbjct: 355 CAFAG---TKAPSAIDGI 369


>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
          Length = 181

 Score =  159 bits (402), Expect = 4e-36,   Method: Compositional matrix adjust.
 Identities = 85/192 (44%), Positives = 118/192 (61%), Gaps = 12/192 (6%)

Query: 183 WHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT 242
           WHLGFF ++YTP YRGF+S++G   G   Y++H  ++ YS      G D+  N +  +  
Sbjct: 1   WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS------GLDLHDNENGVFSQ 54

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITD 301
            G Y+T+++T EA+  I +    +PLFLYLA+ A H+ N  +  L+APQE I++F YI  
Sbjct: 55  DGNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKH 114

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
             RR YAAM+  +D  VG V  AL  K ML+NSIIIF +DNG P   +       NW +N
Sbjct: 115 EQRRKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFD-----YNWANN 169

Query: 362 YPYRGVKNTLWE 373
           +P RGVK TL+E
Sbjct: 170 FPLRGVKATLFE 181



 Score = 65.1 bits (157), Expect = 1e-07,   Method: Compositional matrix adjust.
 Identities = 32/80 (40%), Positives = 49/80 (61%), Gaps = 1/80 (1%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 59
           +  N +  +   G Y+T+++T EA+  I +    +PLFLYLA+ A H+ N  +  L+APQ
Sbjct: 44  LHDNENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQ 103

Query: 60  ETINQFQYITDPNRRTYAAL 79
           E I++F YI    RR YAA+
Sbjct: 104 EWIDKFSYIKHEQRRKYAAM 123


>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
          Length = 461

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 121/373 (32%), Positives = 183/373 (49%), Gaps = 46/373 (12%)

Query: 70  DPNRRTYAALTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
           D   ++  +L K     LL++    GWND+ +HGS +I TP +D+LA NG  L   Y  P
Sbjct: 22  DKKIKSSNSLPKEQPNILLVIADDAGWNDVGYHGS-KIKTPVLDSLANNGAKLERFYVAP 80

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
            C+PSRA+L+TG      G+  P I G     +P +   LP+ +++LGY T   GKWHLG
Sbjct: 81  TCSPSRAALLTGIPASRLGIVAP-IAGKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLG 139

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
               E  P   GF++ +G+L+G I  Y H    +Y            +N   +W   G++
Sbjct: 140 L-TPENGPQAYGFDTSYGFLHGQIDQYTH----EY------------KNGDPSWHKNGKF 182

Query: 247 ------ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQY 298
                  TDL T  A+    ++   + P F+ LA+ A H      +  + P  TI     
Sbjct: 183 LKEDGHVTDLLTDAAIAYFNQETKTETPSFVTLAYSAPHFPLQEEERYKRPYRTI----- 237

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-----PTVEYR-ET 352
            TD +R  YAA +  LD + G ++++L+R G L+N++I+FMSDNGA     P  +Y+ + 
Sbjct: 238 FTDSSRVDYAAAMTHLDIAFGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKH 297

Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                 G N P +  K T +EG ++VPA + W      N +     + + D LPTL    
Sbjct: 298 GPNTTLGDNTPLKDYKTTNYEGAIRVPAFVYWKNHF--NGKTLDDFIGVIDVLPTLVELT 355

Query: 412 GGDTSRLPLNIDG 424
           G      P  I+G
Sbjct: 356 GASVK--PNTIEG 366


>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
 gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
           DSM 14838]
          Length = 525

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 125/374 (33%), Positives = 185/374 (49%), Gaps = 49/374 (13%)

Query: 75  TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
           T A  TKS     + +Y    G++D+S +G     TPNIDALA  GI   + YA  P+ +
Sbjct: 66  TEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISS 125

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           PSRA  +TG+YP   G+QG   +     G+   E  + E L+  GY+T  IGKWHLG  R
Sbjct: 126 PSRAGFLTGRYPARMGIQG-VFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SR 183

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
            +Y PL +GF+ +FG     I Y + + +  Y R  E+    +  N            T 
Sbjct: 184 EKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRGNEVEEFHIDIN----------NVTK 228

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
            +T+EAV  I  +  D+P FL+LAH   H          P    ++F          Y  
Sbjct: 229 KYTEEAVDYIR-RKADQPFFLFLAHSMMH---------VPIYVSDEFA--GKSGAGIYGD 276

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V ++D SVG ++  L+  G+ +N++++F SDNG P ++          G   P R  K 
Sbjct: 277 AVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLG-----GRALPLREGKT 330

Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQ 427
           T +EGGV+VP I  W  QI+  P V+  ++ + DW PT+   +GG    LP + +DG D 
Sbjct: 331 TAFEGGVRVPCIAYWKGQIK--PVVNTDVVSLLDWFPTVTALSGG---ILPDVRLDGYDL 385

Query: 428 WSSLLLNTPSRRNS 441
             + +LN   +R S
Sbjct: 386 --TAVLNGTGKRAS 397


>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
          Length = 482

 Score =  159 bits (402), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 119/357 (33%), Positives = 169/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G +  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 2   GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 59  TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFD 117

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q    P FLY A  A HA                  ++    R  Y   V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DSVG ++  L    + +N+ + F SDNGA  +   E +     GSN P+   K T +EGG
Sbjct: 223 DSVGKMLELLHDLHIADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGKQTTFEGG 277

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+   +LL
Sbjct: 278 MREPALAWWPGHIAAGQVSHQLGSIMD-LFTTSLALAGLTPPRDRAIDGLNLLPALL 333


>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
 gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
          Length = 555

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 125/383 (32%), Positives = 188/383 (49%), Gaps = 70/383 (18%)

Query: 92  GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM 146
           G ND+S +G    + ++ TPNID LA +G   +  Y+    C PSRA L+TG+YP  TG 
Sbjct: 76  GINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAGTCAPSRAMLLTGRYPTRTGF 135

Query: 147 QGPP--------------------------------IWGAEPRGVPLTERFLPEYLRELG 174
           +  P                                I   + +G+P +E  L E LR+ G
Sbjct: 136 EFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSFDEQGLPSSEITLAEILRDRG 195

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM-- 232
           Y T  IGKWHLG   + ++P  +GFE      +G+    DH   D  +  V+ +  D   
Sbjct: 196 YYTAHIGKWHLG-RGKGFSPEEQGFEQSLLMHSGLYLPVDH--PDVVNAKVDFDPIDQFL 252

Query: 233 --RRNLSTAWDTVG------EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
             +   + A++  G       Y TD +T E++++I+    ++P FLYLAH  AH      
Sbjct: 253 WAKMRYAAAFNDGGPVFEPKSYLTDYWTDESIKIIKANK-NRPFFLYLAHWGAHT----- 306

Query: 285 HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
            L+A +E       I     R YAAMV+ +D SVG +++ L+ +G+ +N+I++F SDNG 
Sbjct: 307 PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILATLKEEGLSDNTIVVFSSDNGG 366

Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDW 403
               Y    +      N PYRG K T++EGG++VP  + W   I +N  +     HI D+
Sbjct: 367 --AGYIGIPDI-----NAPYRGWKLTMFEGGIRVPLFMRWPAHIARNTSIDTPAAHI-DF 418

Query: 404 LPTLYTAAGGDTSRLPLN--IDG 424
           LPTL  AA    ++LP +  IDG
Sbjct: 419 LPTLVAAA---QAKLPEDRIIDG 438


>gi|161502378|ref|YP_001569490.1| hypothetical protein SARI_00412 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:- str. RSK2980]
 gi|160863725|gb|ABX20348.1| hypothetical protein SARI_00412 [Salmonella enterica subsp.
           arizonae serovar 62:z4,z23:-]
          Length = 550

 Score =  159 bits (401), Expect = 5e-36,   Method: Compositional matrix adjust.
 Identities = 145/463 (31%), Positives = 214/463 (46%), Gaps = 75/463 (16%)

Query: 66  QYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEI---PTPNIDALAYNGIILN 120
           Q + D  ++T     K   + + I+   GW D+ F+G       PTP+IDA+A  G+IL 
Sbjct: 73  QKLADIEKKT----GKKPNIVVFILDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128

Query: 121 NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKA 179
           + Y+QP  +P+RA++MTG+Y +H G+  PP++G  P G+  LT   LP+ L E GY T+A
Sbjct: 129 SAYSQPSSSPTRATIMTGQYSVHHGILMPPMYGM-PGGLEGLTT--LPQLLHEQGYVTQA 185

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY----------DHILSD---QYSRTVE 226
           IGKWH+G       P   GF+   G+ N V   Y          +  LS    QY + + 
Sbjct: 186 IGKWHMG-ENTGSQPQNVGFDDFRGF-NSVSDMYTEWRDPNMNPEVALSPSRYQYIKNLP 243

Query: 227 LNGHDMRRNLSTAWDTVGE----YATDL---FTKEAVQLIEDQPV-DKPLFLYLAHLAAH 278
            + +D+        + V E    Y  DL   + K  V  I      DKP FLY      H
Sbjct: 244 FDKNDVHAVRGGKQEAVAEITPKYMEDLDQRWMKYGVDFINKMAKNDKPFFLYYGTRGCH 303

Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
             N      A +           P R +Y+  + +++D    +  AL+  G L+N++I+F
Sbjct: 304 FDNYPNAHYAGRS----------PARTSYSDCIVEMNDVFANLYRALETSGQLDNTLIVF 353

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQM 397
            SDNG P  E              P+RG K + WEGGV+VP  + W   IQ  PR S  +
Sbjct: 354 TSDNG-PEAEVPPH-------GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGI 403

Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP 457
           + ++D  PT  + AG   + L   +             PS+  + IDG+DQ SS  L T 
Sbjct: 404 VDLADLFPTSLSLAGHPGAELAKLV-------------PSK--TFIDGVDQ-SSFFLGTN 447

Query: 458 SRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE--NGTMDGYYG 498
            + N    +       +AVR+D +K  L  Q+    T  GY G
Sbjct: 448 GQSNRKAEHYFLNGELSAVRIDEFKYHLLIQQPFAFTQTGYQG 490


>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 476

 Score =  159 bits (401), Expect = 6e-36,   Method: Compositional matrix adjust.
 Identities = 120/364 (32%), Positives = 185/364 (50%), Gaps = 55/364 (15%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L L    G+ DLS +G+ ++ TPNIDAL  +G+  +  YA  PVC+P+RA+L+TG YP  
Sbjct: 35  LILADDLGYGDLSSYGAADLKTPNIDALVASGVRFDRFYANSPVCSPTRAALLTGCYPDL 94

Query: 144 TGMQG------PPIWGA-EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G+ G         WG   P+ V      LP+ L+  GY T  +GKWHLG       P  
Sbjct: 95  VGVPGVIRTHPDDSWGVLSPQAV-----LLPQVLKGAGYHTALVGKWHLGL-SGASLPSR 148

Query: 197 RGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           RGF+   G+L  ++  Y++H            +G +  R      D  G +ATDLF++ A
Sbjct: 149 RGFDLFHGFLGDMMDDYHNH----------RRHGINYMRRDDREIDPKG-HATDLFSQWA 197

Query: 256 VQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT---DPNRRTYAAMV 311
           +  + E +  D+P FL LA+   H       ++ PQE + + +      DP R    A++
Sbjct: 198 IDFLNESKGQDRPFFLELAYNVPHT-----PVQPPQEWLEKVRRRAPNLDPKRARLVALI 252

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
           + LD  +G V++AL+  G  E ++I+F SDNG       +     + G   PYRG K  +
Sbjct: 253 EHLDHGIGQVLAALRANGQAERTLIVFTSDNGG------QLDAGAHCG---PYRGGKQDM 303

Query: 372 WEGGVKVP-AILWSPQIQ---QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
           +EGG++VP   +W  +I+   ++ RV++ M    D  PTL  AAG   +R    +DG+  
Sbjct: 304 YEGGIRVPLCAVWPGRIEPGTRSDRVAVSM----DLFPTLCEAAG---ARFEHPVDGISL 356

Query: 428 WSSL 431
            S+ 
Sbjct: 357 LSTF 360


>gi|326927530|ref|XP_003209945.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Meleagris
           gallopavo]
          Length = 485

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 158/562 (28%), Positives = 239/562 (42%), Gaps = 123/562 (21%)

Query: 89  IVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ 147
           +  GW DL   G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ P+  G  
Sbjct: 1   MTMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAASPLCSPSRAALLTGRLPVRNGFY 60

Query: 148 GPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
                 A  R          G+  +E  LPE L++ GY+ K IGKWHLG  R ++ PL  
Sbjct: 61  ---TTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLGH-RPQFHPLKH 116

Query: 198 GFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFT 252
           GF+  FG  N     YD+  + +    R  E+ G    D + +L T    +    T ++ 
Sbjct: 117 GFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLKTGEANL----TQIYL 172

Query: 253 KEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           +EA+  I + Q   +P FLY A  A HA   A KH            ++    R  Y   
Sbjct: 173 QEALDFISKQQASQQPFFLYWAIDATHAPVYASKH------------FLGTSQRGRYGDA 220

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG ++  LQ+ G+ EN+ + F SDNGA  +     S  +  GSN P+   K T
Sbjct: 221 VREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQT 275

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P       VS Q+  + D   T  +  G                  
Sbjct: 276 TFEGGMREPAIAWWPGHIPAGGVSRQLGSVMDLFTTSLSLVG------------------ 317

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
             L  PS R   IDG+D   ++L      R       +E     AVR+  +K    T  N
Sbjct: 318 --LQPPSDR--QIDGIDLLPAILQGKLIDRPIFYYRGNE---MMAVRVGLYKAHYWTWSN 370

Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDKMRSTRQQATIH 549
                                         S ++ S+ I F P  N+  + +  Q+    
Sbjct: 371 ------------------------------SWEEYSKGIDFCPGQNVSGVTTHLQEEH-- 398

Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD---ISSQLYELLKYHRRTLVP 606
              +  P+           LF+LG DP E+  ++ +  +   +  Q+  +++ H+ T+VP
Sbjct: 399 ---SSLPL-----------LFHLGRDPGEKYPLSFASDEYQGVMRQISAVVQQHKETMVP 444

Query: 607 QSHEQPDLVQADPKRFNDTWSP 628
                P L   D    N  WSP
Sbjct: 445 GV---PQLNVCDKAVMN--WSP 461


>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 159

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 79/162 (48%), Positives = 108/162 (66%), Gaps = 5/162 (3%)

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
           +T+ AV LI      +PLFLYL++ A H G     LEAP+E + +F +I + NR TYA M
Sbjct: 1   YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V  LD S+GT++ ALQ   ++ N+I++F SDNGA  + +   SN    G N+P RG K T
Sbjct: 61  VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGA--LPWGVKSNR---GYNWPLRGGKFT 115

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           LWEGG++    +WSP ++ + RVS QMMHI+DWLPTLY+AAG
Sbjct: 116 LWEGGLRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAG 157



 Score = 56.6 bits (135), Expect = 4e-05,   Method: Compositional matrix adjust.
 Identities = 26/63 (41%), Positives = 38/63 (60%)

Query: 20 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
          +T+ AV LI      +PLFLYL++ A H G     LEAP+E + +F +I + NR TYA +
Sbjct: 1  YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60

Query: 80 TKS 82
            +
Sbjct: 61 VDA 63


>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
           familiaris]
 gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
           familiaris]
          Length = 522

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 124/368 (33%), Positives = 175/368 (47%), Gaps = 48/368 (13%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM-----------QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
            G                I G    G+P  E  LPE L+E GY +K +GKWHLG  R ++
Sbjct: 94  NGFYTTNRHARNAYTPQEIVG----GIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQF 148

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
            PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    
Sbjct: 149 HPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL---- 204

Query: 248 TDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
           T ++ +EA+  I+  Q   +P FLY A  A HA                  ++    R  
Sbjct: 205 TQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----------PVYASRPFLGTSQRGR 253

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   V+++D+SVG ++S LQ   + EN+ + F SDNGA  +     S     GSN P+  
Sbjct: 254 YGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLC 308

Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
            K T +EGG++ PAI W P      RVS Q+  I D   T  + AG    + R+   IDG
Sbjct: 309 GKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDG 365

Query: 425 LDQWSSLL 432
           LD   ++L
Sbjct: 366 LDLLPAML 373


>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
          Length = 528

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 118/357 (33%), Positives = 168/357 (47%), Gaps = 41/357 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI  G     
Sbjct: 48  GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 104

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          GVP +E+ LPE L++ GY +K +GKWHLG  R ++ PL  GF+
Sbjct: 105 TTNAHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 163

Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
             FG  N     YD+         R  E+ G    +   NL T    +    T ++ +EA
Sbjct: 164 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 219

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           +  I+ Q    P FLY A  A HA                  ++    R  Y   ++++D
Sbjct: 220 LDFIKRQARRHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAIREID 268

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           DS+G ++  L    + +N+ + F SDNGA  +   E       GSN P+   K T +EGG
Sbjct: 269 DSIGKMLELLHNLHIADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 323

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           ++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL    +LL
Sbjct: 324 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLSLLPTLL 379


>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
          Length = 513

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 122/374 (32%), Positives = 179/374 (47%), Gaps = 44/374 (11%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
            A   +  L L+   GW DL   G     TPN+D +A  G++  + YA  P+C+PSRA+L
Sbjct: 17  CAAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPSRAAL 76

Query: 136 MTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
           +TG+ P+  G        A  R          G+  +E  LPE L++ GY+ K IGKWHL
Sbjct: 77  LTGRLPVRNGFYTT---NAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHL 133

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAW 240
           G  R ++ PL  GF+  FG  N     YD+  + +    R  E+ G    D + +L T  
Sbjct: 134 GH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTGE 192

Query: 241 DTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQY 298
             +    T ++ +EA+  I + Q   +P FLY A  A HA   A KH            +
Sbjct: 193 ANL----TQIYLQEALDFISKQQASQQPFFLYWAIDATHAPVYASKH------------F 236

Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           +    R  Y   V+++DDSVG ++  LQ+ G+ EN+ + F SDNGA  +     S  +  
Sbjct: 237 LGTSQRGRYGDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALI-----SAPKQG 291

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
           GSN P+   K T +EGG++ PAI W P       VS Q+  + D   T  +  G      
Sbjct: 292 GSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLSLVGLQPPS- 350

Query: 419 PLNIDGLDQWSSLL 432
              IDG+D   ++L
Sbjct: 351 DRQIDGIDLLPAIL 364


>gi|72161757|ref|YP_289414.1| N-acetylgalactosamine-6-sulfatase [Thermobifida fusca YX]
 gi|71915489|gb|AAZ55391.1| n-acetylgalactosamine-6-sulfate sulfatase [Thermobifida fusca YX]
          Length = 471

 Score =  158 bits (400), Expect = 7e-36,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 171/357 (47%), Gaps = 45/357 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +GS  I TPN+D LA  GI   + YA  P C+ +R SL TG+YP  +  G++ 
Sbjct: 32  GWADLGCYGSTTIRTPNLDQLAAQGIRFTHGYAGSPWCSSTRISLYTGRYPGRLQAGLEE 91

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P +  +   G+P     L   L E GY+T   GKWH G+    Y+PL  GFE+ FG  +G
Sbjct: 92  PLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCGWLPW-YSPLRIGFETFFGNFDG 150

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            + Y++H+  D   +     G        T  + VG Y T++ ++ A + I     ++P 
Sbjct: 151 ALDYFEHV--DTLGKADLYEG-------ETPVEEVG-YYTEIISERAAEYITAH-RNRPF 199

Query: 269 FLYLAHLAAHAGNAG-KHLEAPQETINQFQ---------YITDPNRRTYAAMVKKLDDSV 318
           ++ L + A H    G    E  QE   ++Q         ++   +   Y  +V+ +D  +
Sbjct: 200 YVQLNYTAPHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAKYGELVEAMDAGI 259

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V++AL R G  +N+I++F SDNG              W  N+P+ G K  L EGG++V
Sbjct: 260 GQVLAALDRAGAADNTIVVFSSDNGG-----------ERWSKNWPFVGEKGDLTEGGIRV 308

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD---------TSRLPLNIDGLD 426
           P I+  P+     +VS   +   DW  TL  AAG +            LP  +DG D
Sbjct: 309 PLIVAWPEAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDWPLDGVDLLPWLVDGAD 365


>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
          Length = 544

 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 123/394 (31%), Positives = 185/394 (46%), Gaps = 79/394 (20%)

Query: 85  LTLLIVYGWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGK 139
           L L    G+ND+S H        + T NIDALA +GI+    YA    C PSRAS+MTGK
Sbjct: 64  LVLADDMGYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAANATCAPSRASIMTGK 123

Query: 140 YPIHTGMQGPPI---------WGAEP----------------------RGVPLTERFLPE 168
           YP   G +  PI         W AE                       +G+P  +  + E
Sbjct: 124 YPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQITIAE 183

Query: 169 YLRELGYSTKAIGKWHLGFFRREY--TPLYRGFESHFGYLNGVISYYDH--ILSDQYSRT 224
            LR+ GY T  IGKWHLG    EY   P+ +GF+   G +  +    DH  +++ ++   
Sbjct: 184 VLRDAGYYTAHIGKWHLG---HEYGMDPMSQGFQDSLGLVGPLYLPEDHPDVVNAKFDTR 240

Query: 225 VE------------LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYL 272
           ++             NG D+            +Y TD +T EA+++IE+   ++P FLYL
Sbjct: 241 IDKMIWGMGQYSANFNGGDLF--------APDKYVTDYYTDEALKVIENNK-NRPFFLYL 291

Query: 273 AHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLE 332
           +H A H       L+A +    Q  ++   N + Y+ M+  LD SVG +I  L+   +  
Sbjct: 292 SHWAIH-----NPLQALRSDFEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELDIYG 346

Query: 333 NSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR 392
            ++IIF SDNG     Y E ++      N PYRG K + ++GG++VP I+  P      +
Sbjct: 347 KTLIIFTSDNGG--ANYIELNDI-----NKPYRGWKISFFDGGIRVPYIISWPDEINPGK 399

Query: 393 VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            S   +H  D  PT+  AAG +++     +DG+D
Sbjct: 400 KSENAVHHFDIFPTILKAAGIESTN---ELDGVD 430



 Score = 40.8 bits (94), Expect = 2.5,   Method: Compositional matrix adjust.
 Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 6/69 (8%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y TD +T EA+++IE+   ++P FLYL+H A H       L+A +    Q  ++   N 
Sbjct: 265 KYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRSDFEQMSHMHGHNL 318

Query: 74  RTYAALTKS 82
           + Y+ +  S
Sbjct: 319 QVYSGMINS 327


>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
          Length = 274

 Score =  158 bits (400), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 81/209 (38%), Positives = 124/209 (59%), Gaps = 22/209 (10%)

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
           IHTG+Q   IW  +P  VPL E+ LP+ L+E GY+T  +GKWHLG +R+E  P  RGF++
Sbjct: 22  IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 81

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRR---------NLSTAWDTVGEYATDLFT 252
           +FGYL G   YY H       R   +N  ++ R          ++T ++ +  Y+T++FT
Sbjct: 82  YFGYLLGSEDYYSH------ERCTLINALNVTRCALDFRDGEEVATGYENM--YSTNIFT 133

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           + A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR  YA MV 
Sbjct: 134 QRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVS 188

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSD 341
            +D++VG V +AL+  G+  N++ IF ++
Sbjct: 189 LMDEAVGNVTAALKSHGLWNNTVFIFSTE 217



 Score = 55.1 bits (131), Expect = 1e-04,   Method: Compositional matrix adjust.
 Identities = 36/111 (32%), Positives = 58/111 (52%), Gaps = 14/111 (12%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y+T++FT+ A  LI + P +KPLFLYLA  + H     + L+ P+E +  + +I D NR 
Sbjct: 127 YSTNIFTQRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 181

Query: 75  TYAALTK------STTLTLLIVYG-WNDLSFHGSNEIPTPNIDALAYNGII 118
            YA +              L  +G WN+  F  S E+ +  +  +A++G I
Sbjct: 182 YYAGMVSLMDEAVGNVTAALKSHGLWNNTVFIFSTELLS--VLVVAFDGAI 230


>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 462

 Score =  158 bits (399), Expect = 8e-36,   Method: Compositional matrix adjust.
 Identities = 112/372 (30%), Positives = 183/372 (49%), Gaps = 56/372 (15%)

Query: 86  TLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
            LLI+    G++DL   G+ +I TP +D LA  G+ L N Y     CTPSR+ L+TG+YP
Sbjct: 27  VLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLTGRYP 86

Query: 142 -------------IHTGMQGPPI-WGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWH 184
                        ++ G Q     +   P    G  L E F+ E L+E GY+   IGKW 
Sbjct: 87  QRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYIGKWD 146

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           LG  +R Y P+ +G++ ++G+ N  I Y+ H   ++Y     ++G+ +      + +  G
Sbjct: 147 LGQLKR-YLPMQQGYDKYYGFANTGIDYFTH---ERYGSPSMIDGNQL------SLEDKG 196

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYI---- 299
            Y T LF +EA++ I +    KP FL LA  A H A N    ++   +   ++  +    
Sbjct: 197 RYTTALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPKS 255

Query: 300 ---TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
                  +R Y A V ++D S+GT++  ++  G+ EN+++IF+SDNG  ++         
Sbjct: 256 NTKRGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL--------- 306

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
               NYP +G K   +EGGV+VP ++  P+     + +   +   +  P +  AAG    
Sbjct: 307 --ADNYPLKGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAG---L 361

Query: 417 RLPLNI--DGLD 426
            LP ++  DG D
Sbjct: 362 ELPKDVIYDGFD 373


>gi|149195951|ref|ZP_01873007.1| sulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140798|gb|EDM29195.1| sulfatase [Lentisphaera araneosa HTCC2155]
          Length = 464

 Score =  158 bits (399), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 112/354 (31%), Positives = 176/354 (49%), Gaps = 40/354 (11%)

Query: 95  DLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM----QGPP 150
           D++ +GS ++ TPN+D LA +G+     YA  VC P+RA LMTG++P  + +    QG  
Sbjct: 46  DVNCYGSKDLYTPNMDKLAEDGVRFTQAYAHQVCCPARAMLMTGRHPQRSNVNHWTQGDA 105

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             G + R + L E  L E L++ GY T   GKWHLG    +Y P  +GF+  +G   G I
Sbjct: 106 K-GPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLG-AHLDYGPTKQGFDEFYGIRGGFI 163

Query: 211 SYYDHILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             Y+H           L+G   HD+       +D  G+Y  +L T  A+  I D+  + P
Sbjct: 164 DNYNHYF---------LHGEGFHDLYEGTKEVFDE-GKYFPNLVTDRALNFI-DRNKNNP 212

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            FL+LA    H        +A  +   +++ +  P R++YA M+   DD +G ++S LQ 
Sbjct: 213 FFLFLAFNIPHYPE-----QADPKFDERYKNMKMP-RQSYAKMISTTDDHMGQIMSKLQE 266

Query: 328 KGMLENSIIIFMSDNG-APTVEYRETSNYRN-WGSNYPY------------RGVKNTLWE 373
            G+ +N+IIIFMSDNG +    + +  N+++    N  Y            RG K+  +E
Sbjct: 267 HGIYDNTIIIFMSDNGHSRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYE 326

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
           GG++VPAI+  P       V  Q +   DW+PT+      +  ++  +   L Q
Sbjct: 327 GGIRVPAIITFPNKLPKGAVRDQAITAMDWMPTVLELCNIEPPKIKFDGKSLTQ 380


>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
           [Ailuropoda melanoleuca]
          Length = 519

 Score =  158 bits (399), Expect = 9e-36,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 176/364 (48%), Gaps = 40/364 (10%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 31  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 90

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 91  NGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 149

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 150 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANL----TQVY 205

Query: 252 TKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA+  ++ Q V  +P FLY A    HA                  ++    R  Y   
Sbjct: 206 LQEALDFMKRQQVAQRPFFLYWAIDGTHA-----------PVYASRPFLGTSRRGRYGDA 254

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG ++S L+   + EN+ + F SDNGA  +     S  +  GSN P+   K T
Sbjct: 255 VREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQT 309

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
            +EGG++ PAI W P      +VS Q+  + D   T    AG    + R+   IDGLD  
Sbjct: 310 TFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV---IDGLDLL 366

Query: 429 SSLL 432
            ++L
Sbjct: 367 PAML 370


>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
 gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
 gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
           With Galnac
          Length = 502

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+ +PSRA+L+TG+ PI 
Sbjct: 9   LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTGRLPIR 68

Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
            G        A  R          G+P +E+ LPE L++ GY +K +GKWHLG  R ++ 
Sbjct: 69  NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 124

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
           PL  GF+  FG  N     YD+         R  E+ G    +   NL T    +    T
Sbjct: 125 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 180

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
            ++ +EA+  I+ Q    P FLY A  A HA                  ++    R  Y 
Sbjct: 181 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 229

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ   + +N+ + F SDNGA  +   E       GSN P+   K
Sbjct: 230 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 284

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PA+ W P      +VS Q+  I D L T   A  G T      IDGL+  
Sbjct: 285 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 343

Query: 429 SSLL 432
            +LL
Sbjct: 344 PTLL 347


>gi|313246966|emb|CBY35811.1| unnamed protein product [Oikopleura dioica]
          Length = 388

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 103/305 (33%), Positives = 164/305 (53%), Gaps = 17/305 (5%)

Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           +C+PSRA  +TG+Y    G+   PI    P G+   E+ LPEYL+E+GY T A+GKWHLG
Sbjct: 20  LCSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLG 79

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
           +    + P  RGF++  G+  G + Y+ H  + Q +    LN H +        D   E+
Sbjct: 80  YCNESFQPHNRGFDTFLGHYGGGVDYHTH--ATQGALGSYLN-HFLNGEPHIPEDGF-EF 135

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPN- 303
           A+  ++    +++ +   DKP F+YLA  A H     + + APQ+ I + +  Y   P  
Sbjct: 136 ASYAWSNRTRKVLREN-TDKPNFVYLAFNAPH-----EKVAAPQDLIEEAKSMYPGIPGT 189

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           RR   A VK +D ++ +VI   + K +   +I++F +DNG   + Y    +    G +YP
Sbjct: 190 RRLQLAAVKSIDVAMQSVIE--EAKTLERETIVVFHTDNGGALLAYAGGFDGPR-GCSYP 246

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           Y+G K+ L EGG   P  ++S + Q + R    M+HI D+ PT+   AG D   +P ++D
Sbjct: 247 YKGYKSVLAEGGTLSPTWVYSTKRQFHSRYIDGMIHIMDFFPTILRWAGYDKP-MPRDLD 305

Query: 424 GLDQW 428
           G+DQ+
Sbjct: 306 GIDQY 310


>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
 gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
           8797]
          Length = 413

 Score =  158 bits (399), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 136/473 (28%), Positives = 213/473 (45%), Gaps = 89/473 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +GS    TP++D LA NGI   + ++   VC+P+RA L+TG+Y    G+ G  
Sbjct: 6   GYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAGIDG-- 63

Query: 151 IWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           +  A P+     G+   E  L + L++ GY T   GKWHLG+ +R+Y P +RGF+   GY
Sbjct: 64  VVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGY-QRQYNPTFRGFQQFVGY 122

Query: 206 LNGVISYYDHILSD---QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           ++G + Y+ H+       +    ELN  +              Y T L    A++ I  Q
Sbjct: 123 VSGNVDYFAHLDGTGVFDWWHNAELNREEQ------------GYVTHLINDHALEFIRQQ 170

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMVKKLDDSV 318
             +KP F+Y+AH A H+   G H +  ++       I    R+     Y  M  ++D  +
Sbjct: 171 Q-EKPFFVYIAHEAVHSPYQGPHDQPMRKEGG--GDIKSAKRKDIANAYREMNTEMDKGI 227

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G ++  L+   + E + I F+SDNGA              GSN   RG K +LWEGG +V
Sbjct: 228 GQIVDVLKEVNLTEKTFIFFLSDNGA-----------NKNGSNGKLRGFKGSLWEGGHRV 276

Query: 379 PAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           PAI  W  +I +       ++ I D +PT+   A    +++P                  
Sbjct: 277 PAIACWPGRIPEGTVCDEPVISI-DLMPTILELA---NAKIPA----------------- 315

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINID-EKKRTAAVRLDSWKLVLGTQENGTMDGY 496
                +DG+    SL+     R++ V   I  E    +A+R   WKLVL           
Sbjct: 316 --GHKLDGV----SLVSLLKDRKSLVPRQIFWEYNGKSAMRQGHWKLVL----------- 358

Query: 497 YGQTRSNKVPLLNFNA-IVESKTY-----QSLQQLSQNIFLPISNIDKMRSTR 543
             QTR   + L +    + ESK       Q +QQ+   +    S++ K  +T+
Sbjct: 359 -NQTRKEPIELYDLTRDMSESKNLADNQPQRVQQMQSALAAWKSDVQKTATTQ 410


>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
 gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
          Length = 470

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 109/358 (30%), Positives = 169/358 (47%), Gaps = 32/358 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G++D  F GS  + TP ID LA   ++    Y    VC PSRA L TGKY    G +   
Sbjct: 38  GYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGKYQQRFGFEENN 97

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+P T+  + E+L+ELGY T   GKWH G    +Y P  RGF++ 
Sbjct: 98  VPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NHDDYHPTKRGFDNF 156

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +G+  G   Y+ +   +Q +   +     M R+     +  G Y TD    +    I   
Sbjct: 157 YGFREGARGYFAYSNEEQQAYPSQ----KMERDFKHYIEHEG-YLTDALATQTSHFIGQS 211

Query: 263 PVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
            V+K P F  L+  A HA      ++A    + QF ++T   R+  AAM   +D ++  V
Sbjct: 212 VVNKQPFFAVLSFSAVHAP-----MQATNADLAQFPHLTG-KRKILAAMNFAMDRAIAVV 265

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           +  L    + +N++++F++DNG PT +        N+ +NYP  G K    EGG++VP +
Sbjct: 266 LDKLNALKISDNTLVVFINDNGGPTDQ--------NYANNYPLSGTKANHLEGGIRVPML 317

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           L W  ++  N       +   D LPT   AA G  +   + +DG+D    LL  +  R
Sbjct: 318 LKWPKRLSSNNTRYTYPVSSLDLLPTFIHAANGKVTE-QMQLDGVDLMPFLLSKSTQR 374


>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
 gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           europaea 6C]
          Length = 485

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 125/402 (31%), Positives = 186/402 (46%), Gaps = 70/402 (17%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           LLIV    G+NDL   G N I TP +D LA  G  L N Y A P CTPSRASL+TG+YP 
Sbjct: 33  LLIVSDDQGYNDLGQLG-NGIITPALDRLANEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91

Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
             G+        P +G                G+   E  +P  L+  GY +   GKW L
Sbjct: 92  RNGIYDMIRNEAPDYGHRYTPDQYAVTFERIGGMDEREVIIPAVLKRAGYKSGIYGKWDL 151

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G  RR   P  RGF+  +G++N  I Y+ H   ++Y          M RNL    +  G 
Sbjct: 152 GALRR-MLPNSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEEDKGT 201

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQE--------- 291
           Y T LF +EA++ +++   ++P FLY+   A H  ++        ++AP+          
Sbjct: 202 YCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHNSSSLDPKIRSSVQAPERFKAMYPPVE 261

Query: 292 ----TINQFQYIT-------DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
                 ++++Y T          RR Y A V  +D ++G ++  L+ K MLE++I++F S
Sbjct: 262 SETRVTDRYRYGTPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLEDTIVVFFS 321

Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
           DNG                 N P RG K   WEGG++VP I+ W+        V+ + + 
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCIVRWTAGKIPAGAVNDEFLT 370

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
             + LP+  +AAG +     + +DG D W +L   + S R  
Sbjct: 371 SLELLPSFASAAGLEPP-ADIVLDGFDWWPTLRGESASPREE 411


>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
          Length = 459

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 124/344 (36%), Positives = 178/344 (51%), Gaps = 41/344 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS  G+ ++ +PNIDALA NG+   N YA   VC+PSRA+L+TG+YP   G+ G  
Sbjct: 53  GYGDLSCQGATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVI 112

Query: 151 IWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
               E     L +    +P  L   GY T  IGKWHLG    + TP  RGF    G+L  
Sbjct: 113 RQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGD 171

Query: 209 VIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDK 266
           ++  Y+DH       R   +N   + R      D  G +ATDLFT   +  + E Q  ++
Sbjct: 172 MMDDYWDH-------RRGGINWMRLNRE---EIDPKG-HATDLFTDWTIDFLKERQGEEQ 220

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVGTVI 322
           P FLYLA+ A H       ++ P+E +++ +   +PN    R    A V+ LD SVG V+
Sbjct: 221 PFFLYLAYNAPHF-----PIQPPREWLDKVRE-REPNLTEKRAKNVAFVEHLDYSVGRVM 274

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL+  G+ EN++++F+SDNG   + Y +        SN P RG K  ++EGG++VPAI 
Sbjct: 275 EALKTTGLEENTLVVFVSDNGG-ALWYAQ--------SNGPLRGGKQDMYEGGIRVPAIF 325

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            W  +I      S     + D  PT    AG    + P N+DG+
Sbjct: 326 YWKGKIAPG-TTSDNTALLMDLFPTFCELAG---RKPPENVDGI 365


>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
           anophagefferens]
          Length = 304

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 104/321 (32%), Positives = 153/321 (47%), Gaps = 35/321 (10%)

Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           CTPSR SL++G+YP+  G+Q   +  A   G+    R LPE L+  GY T  +GKWHLG 
Sbjct: 1   CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT----- 242
           +    TP  RGF   +G+L G + ++ H+     +   E    D    L   W       
Sbjct: 61  YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALD--WQVNGLPR 118

Query: 243 ---VGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
               G YA       A +L+   +  ++PLFLY A    H          P + +  +  
Sbjct: 119 PRATGVYAAYACRDAARRLVASHEGKEEPLFLYYALPTPH------EPITPPDDVETYDL 172

Query: 299 I----TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
           +        RR YAA+V   D ++G + +ALQ+  +  N++++ +SDNGA          
Sbjct: 173 LPKLDVAGRRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG------ 226

Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAG 412
               GSN P RG K   +EGGV+  A+L    + QN R S    ++H++DW PTL  AAG
Sbjct: 227 ----GSNAPLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAG 282

Query: 413 --GDTSRLPLNIDGLDQWSSL 431
              D       +DG+DQW +L
Sbjct: 283 LRPDLVAGFDAMDGVDQWDAL 303


>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
 gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
           CL03T12C18]
          Length = 453

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 135/424 (31%), Positives = 208/424 (49%), Gaps = 68/424 (16%)

Query: 79  LTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
           + K+T   LLI+    G  DLS   + ++ TPNID +   G+ L+N YA   V +PSRA+
Sbjct: 26  VDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAA 85

Query: 135 LMTGKYPIHTGMQG---PPI---WG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           L+TG++P   G+ G   P I   WG  +P  V +     PE L+  GY T  IGKWHLG+
Sbjct: 86  LLTGRFPAMVGVPGVIRPSIDQNWGYFDPSAVTM-----PEVLKNGGYRTALIGKWHLGW 140

Query: 188 FRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
            +    P  RGFE   G+L  ++  YY H    Q    + LN  ++         T   +
Sbjct: 141 -KSPNLPNERGFEHFHGFLADMMDDYYTH--QRQGGNYMYLNDKEI---------TPKGH 188

Query: 247 ATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP--- 302
           AT+LFT  +V  I+ +  +K P FLYLA+ A H+      L+ P E +N+ Q        
Sbjct: 189 ATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPV 243

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R    A+++ LDD++G V+ AL+  G L N+++IF SDNG          +  +  +N 
Sbjct: 244 KRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGG---------DRGSMANNG 294

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL-- 420
           P RG K  ++EGG+ VP  L  P + +  R     + + D +PT+      D   +P+  
Sbjct: 295 PTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTIC-----DFVNVPVRH 349

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
            IDG+      +L+   R+  N +  D++   L N    +          K  +AVR + 
Sbjct: 350 EIDGIS-----VLDAIKRKKQNTE--DRFVFWLRNEGGPQFG-------GKSQSAVRYNE 395

Query: 481 WKLV 484
           +KL+
Sbjct: 396 YKLL 399


>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
 gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
           CL03T12C09]
          Length = 459

 Score =  157 bits (398), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 122/362 (33%), Positives = 177/362 (48%), Gaps = 45/362 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPNID +A  G+ L   Y    V TPSRA+LMTG+ P+  G+ G  
Sbjct: 42  GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           +    P    G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+++FG   
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
             I Y + +        V+  G   R    T     G+           T  +T++AV  
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I+D    +P FLY AH   H          P  T  +F   +   R  Y  +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFAGTS--KRGLYGDVVEEIDWSV 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V+ AL+  G+ EN+ +IF SDNG    E      + N GS  P +  K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           PAI W P  + NP ++ ++M   D  PT  + AG +  +  L +DG++Q   L     S 
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369

Query: 439 RN 440
           R+
Sbjct: 370 RD 371


>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
 gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
           Full=Chondroitinsulfatase; Short=Chondroitinase;
           AltName: Full=Galactose-6-sulfate sulfatase; AltName:
           Full=N-acetylgalactosamine-6-sulfate sulfatase;
           Short=GalNAc6S sulfatase; Flags: Precursor
 gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
          Length = 522

 Score =  157 bits (397), Expect = 1e-35,   Method: Compositional matrix adjust.
 Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 36/362 (9%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  + YA  P+C+PSRA+L+TG+ PI 
Sbjct: 34  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIR 93

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG     G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 94  TGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 152

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIY 208

Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA+  I+  Q    P FLY A  A HA                  ++    R  Y   
Sbjct: 209 LQEALDFIKRQQATHHPFFLYWAIDATHA-----------PVYASRAFLGTSQRGRYGDA 257

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG ++  L+   +  N+ + F SDNGA  V     S  +  GSN P+   K T
Sbjct: 258 VREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALV-----SAPKQGGSNGPFLCGKQT 312

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
            +EGG++ PAI W P      +VS Q+  + D   T  + AG +       IDGLD   +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPS-DRAIDGLDLLPA 371

Query: 431 LL 432
           +L
Sbjct: 372 ML 373


>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
 gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
          Length = 466

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 103/347 (29%), Positives = 173/347 (49%), Gaps = 37/347 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS E  TP +D LA   +     Y +  VC PSRA ++TG+Y    G +   
Sbjct: 38  GYADFGFQGSTEFKTPELDKLAKKSVKFTQAYVSAAVCGPSRAGMLTGQYQQKFGFEENN 97

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+P  +  + +YL++ GY+T   GKWH+G     + P  RGF+  
Sbjct: 98  VPGYMSSSGLLGDQMGLPTNQLTIADYLKKQGYTTALFGKWHMGN-ADLFHPTKRGFDEF 156

Query: 203 FGYLNGVISY--YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           +G+  G  SY  YD       +  +  N   + +      +  G Y TD    EA+  I 
Sbjct: 157 YGFRGGSRSYLPYDK------NNVLTKNEDRLEKGFGNYLEHKG-YLTDKLANEAISFI- 208

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
           ++    P F+YL+  A H       +EA +E +  F ++T   R+T AAM   +D ++GT
Sbjct: 209 NKNKKTPFFIYLSFNAVHT-----PMEATKEDLKYFSHLTG-KRKTLAAMTLAMDRAIGT 262

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V+  L+++G+ +N++++F +DNG P+          N  +N P  G K    EGG++VP 
Sbjct: 263 VLDELKKQGLDKNTLVVFTNDNGGPSDA--------NQSNNNPLSGTKANHLEGGIRVPF 314

Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           ++ W  ++++N   +  +  + D  PT    A G++S+L   +DG+D
Sbjct: 315 LMSWPGKLKKNTTYNYPISTL-DLFPTFLNVANGNSSKLK-QLDGVD 359


>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
          Length = 520

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 120/364 (32%), Positives = 175/364 (48%), Gaps = 38/364 (10%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 30  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 89

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+P  E  LPE L+  GY++K +GKWHLG  R ++ PL 
Sbjct: 90  NGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 148

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
            GF+  FG  N     YD+         R  E+ G    +   NL T    +    T ++
Sbjct: 149 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANL----TQVY 204

Query: 252 TKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
            +EA+  ++ Q V  +P FLY A    HA                  ++    R  Y   
Sbjct: 205 LQEALDFMKRQQVAQRPFFLYWAIDGTHA-----------PVYASRPFLGTSRRGRYGDA 253

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V+++DDSVG ++S L+   + EN+ + F SDNGA  +   +       GSN P+   K T
Sbjct: 254 VREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGQC---GSNGPFLCGKQT 310

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
            +EGG++ PAI W P      +VS Q+  + D   T    AG    + R+   IDGLD  
Sbjct: 311 TFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV---IDGLDLL 367

Query: 429 SSLL 432
            ++L
Sbjct: 368 PAML 371


>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 389

 Score =  157 bits (397), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 180/367 (49%), Gaps = 43/367 (11%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYN-GIILNNMYAQPVCTPSRA 133
           A +     L  ++V  +G+ D+SF     I +P+ ++LA   G+IL+  Y    C+PSRA
Sbjct: 17  ATVNAKPNLVFVLVDDWGFADVSFRNP-AISSPHFESLATKEGLILDRHYVFKYCSPSRA 75

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           S +TG++P H     PP  G     + +T   +P  L+  GY T  +GKWH GF+ +++ 
Sbjct: 76  SFLTGRFPHHAHQWNPPQSGLVGANINMT--MIPAKLKTAGYKTHMVGKWHEGFYLKKFL 133

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFT 252
           P+ RGF++  G+L G     DH+  +           D  +N   A DT  G Y    + 
Sbjct: 134 PINRGFDTMSGFLGGG---EDHMTQEIICAV------DYWKN--DAIDTRNGTYDDYTYV 182

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMV 311
           ++  +++ +     P FLYL     HA      L+AP E +N +    T  +RRT  AMV
Sbjct: 183 EDLSKILTNHDPSDPFFLYLPLHNVHAP-----LQAPDEFLNLYPVNSTCSDRRTIQAMV 237

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
              D+  G V+  L+ KGM +N++I+  +DNG    E          GSNYP +G K T 
Sbjct: 238 SVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPCE----------GSNYPLKGSKMTF 287

Query: 372 WEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDT-----SRLPLNIDG 424
           +EGGV+  A +    +  + R   +   +HI+DW  T    AG D+      + P  +DG
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSDDSGAGKFP--VDG 345

Query: 425 LDQWSSL 431
           ++ W  L
Sbjct: 346 MNVWPIL 352


>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
 gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
          Length = 459

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 121/362 (33%), Positives = 178/362 (49%), Gaps = 45/362 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPNID +A  G+ L   Y    V TPSRA+LMTG+ P+  G+ G  
Sbjct: 42  GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           +    P    G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+++FG   
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
             I Y + +        V+  G   R    T     G+           T  +T++AV  
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I+D    +P FLY AH   H          P  T  +F+  +   R  Y  +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V+ AL+  G+ EN+ +IF SDNG         + + N GS  P +  K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGP------WLTKHENGGSAGPLKDGKGTWWEGGFRV 311

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           PAI W P  + NP ++ ++M   D  PT  + AG +  +  L +DG++Q   L     S 
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369

Query: 439 RN 440
           R+
Sbjct: 370 RD 371


>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
           musculus]
 gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
          Length = 440

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/316 (34%), Positives = 153/316 (48%), Gaps = 35/316 (11%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 91  RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309

Query: 370 TLWEGGVKVPAILWSP 385
           T +EGG++ PAI W P
Sbjct: 310 TTFEGGMREPAIAWWP 325


>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
 gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
          Length = 499

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 118/365 (32%), Positives = 171/365 (46%), Gaps = 50/365 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP----IHTGM 146
           G+ D+   G + + TPN+D +A  G++L + Y   P+C+PSRA LMTG YP    + TG 
Sbjct: 56  GYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGSYPRRVDMATGS 115

Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
             P +  A+ +G+   E  + E L+ +GY+T   GKWHLG  + E+ P  +GF+  FG  
Sbjct: 116 NFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPTRQGFDEFFGL- 173

Query: 207 NGVISYYDHILSDQYSRTVELNGHD---MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
                 Y H ++  + R       D   M        +   EY T   T+ A+  IE   
Sbjct: 174 -----PYSHDIAPTHKRQAHFKFPDLPLMENENVIELNPNPEYLTRRITERAIDFIERHQ 228

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-------------DPNRRTYAAM 310
            D P FLYL H   H       L   QE +N     T             +  +  Y  +
Sbjct: 229 -DAPFFLYLPHPMPHG-----PLHVSQEFLNTVDKATLARLNISDADIAANSKKAMYPLV 282

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW-GSNYPYRGVKN 369
           + +LD++VG ++  L+R  +  N+I+IF SDNG         S  RN  G+     G K 
Sbjct: 283 IGELDNAVGNILDTLKRLNLDSNTIVIFTSDNGP--------SGRRNSDGTERYLSGHKT 334

Query: 370 TLWEGGVKVPAILWSP-QIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLD 426
              EGG++VP ++W P QI  N  VS  +    D LPT+   AG D  T R+   IDG +
Sbjct: 335 LTQEGGMRVPTVVWGPSQIPAN-TVSQALTTSMDILPTVAFLAGADVPTDRV---IDGKN 390

Query: 427 QWSSL 431
            W  L
Sbjct: 391 IWPIL 395


>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
           musculus]
          Length = 462

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 109/316 (34%), Positives = 153/316 (48%), Gaps = 35/316 (11%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 53  VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 112

Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
             G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG  R ++
Sbjct: 113 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 168

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
            PL  GF+  FG  N     YD+         R  E+ G            T     T L
Sbjct: 169 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 227

Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           +T+EA+  I+ Q   + P FLY A  A HA                 Q++    R  Y  
Sbjct: 228 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 276

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++S LQ  G+ +N+ + F SDNGA  +     S     GSN P+   K 
Sbjct: 277 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 331

Query: 370 TLWEGGVKVPAILWSP 385
           T +EGG++ PAI W P
Sbjct: 332 TTFEGGMREPAIAWWP 347


>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 490

 Score =  157 bits (396), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 113/366 (30%), Positives = 169/366 (46%), Gaps = 46/366 (12%)

Query: 76  YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           + A  K   +  ++V  +G+ D+ F     I +PN D LA  G++LN  Y    C PSRA
Sbjct: 17  FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           SL+TG++P H       +      G  L     P  L+   YST  +GKWH GFF   Y 
Sbjct: 76  SLLTGRWPHHVYQWN--LATDATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATDL 250
           P+ RGF++  GYL G + +++     +    V+    N  D R          G Y +  
Sbjct: 134 PINRGFDTSSGYLCGWVDHFNQ----KQGCAVDCWKNNAPDPRN---------GTYDSYY 180

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAA 309
           +  +   ++ +   + PLF+YL     H       LEAP+E ++ +    T   R T  A
Sbjct: 181 YRDDLTNIVNNHDANNPLFIYLPLHNVHTP-----LEAPKEWLDIYPANSTCKTRHTLQA 235

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           MV   D+  G ++  L+ K M +N+I++  +DNG                SNYP RG K 
Sbjct: 236 MVSVADNVTGHLVELLKEKEMWDNTIMVISADNGGDQCV----------ASNYPLRGCKY 285

Query: 370 TLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLNI 422
           T +EGGV+  A +    + ++ R   +   +HISDW  T    AG D     T + P  +
Sbjct: 286 TFFEGGVRSLAFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAGVDPDDSGTGKFP--V 343

Query: 423 DGLDQW 428
           DGLD W
Sbjct: 344 DGLDVW 349


>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
 gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
          Length = 488

 Score =  156 bits (395), Expect = 2e-35,   Method: Compositional matrix adjust.
 Identities = 119/377 (31%), Positives = 183/377 (48%), Gaps = 52/377 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-P 149
           G+ DLS +G  +  TPNID LA +GI+    Y+   VC PSR++L+TG +  HT ++G  
Sbjct: 46  GYGDLSSYGQQKFSTPNIDKLAKDGILFTQHYSGSTVCAPSRSALLTGMHTGHTVVRGNK 105

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN-- 207
            I       +P     L E +++ GY T A GKW LGF   E  PL +GF++ FGY    
Sbjct: 106 EILPEGQYPLPDNTYTLAEAMKKAGYITGAFGKWGLGFPGSEGDPLNQGFDAFFGYNCQR 165

Query: 208 -GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
            G   Y  H+ +++ S  +E N    +            YA +L   + ++ IE    D 
Sbjct: 166 LGHNYYPRHLWANKDSLVLEGNSGFKKET----------YAPELIHDKTLEFIEMNK-DN 214

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----------------------- 303
           P FLY+A +  HA      L AP+E + +++    P                        
Sbjct: 215 PFFLYVASIIPHA-----ELTAPEEIMEKYKGKFAPETVYEGVDDGPEYKLGPYGSQKEP 269

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
              +AAMV  LD  VG +++ L++ G+ +N++++F SDNG P  E      Y N  S+ P
Sbjct: 270 HAAFAAMVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNG-PHTEGGADPEYFN--SSGP 326

Query: 364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
           +RG K  L+EGG++VP +  W  +I+   R  L +    D  PT    AG +T   P  +
Sbjct: 327 FRGTKRDLYEGGIRVPMVAAWPGKIKPGSRTEL-VSAFWDVFPTFSDIAGIET---PEGL 382

Query: 423 DGLDQWSSLLLNTPSRR 439
           DG+    +LL +T  ++
Sbjct: 383 DGISFLPTLLGDTDQQK 399


>gi|423220023|ref|ZP_17206519.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
           CL03T12C61]
 gi|392623855|gb|EIY17953.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
           CL03T12C61]
          Length = 455

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 123/406 (30%), Positives = 188/406 (46%), Gaps = 60/406 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  DL  +G  +I TPNID LA  GI+  + Y+   V  PSR  LMTGK+  H+ ++G  
Sbjct: 39  GIGDLGCYGQTKIKTPNIDKLADEGILFTDHYSGSAVSAPSRCCLMTGKHTGHSYVRGNK 98

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--G 208
                   +P  E  + + L++ GY T  +GKW LG      TPL +GF+  +GYL+   
Sbjct: 99  SDNGFDLALPAEEVTVAQVLKQKGYKTMCVGKWGLG---GPGTPLQKGFDYFYGYLSQRE 155

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
              YY   L +   + V+L G                Y+  L   + +  I  +  D P 
Sbjct: 156 AHRYYPERLFEN-DKEVKLGGK--------------AYSHFLIMDKGLDFIR-KNADNPF 199

Query: 269 FLYLAHLAAHAG--------NAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSV 318
           F Y +    HA          +G+  E P  +E +  F+   +P + TYA+MV ++D +V
Sbjct: 200 FAYFSITLPHADLDYPDISEYSGEFPEKPYGREKVEAFKMQLNP-KATYASMVSEIDRNV 258

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G +I  L+ +G+ EN+II F SDNG   V      +Y+ + SN P+RG K  L+EGG++ 
Sbjct: 259 GQIIQLLKEEGIWENTIIFFSSDNG---VHLTGGHDYKFFNSNGPFRGHKRDLYEGGIRA 315

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           P I+  P I +  RV+  +    D++PT+    G D        DG+   S L L     
Sbjct: 316 PFIVSWPMIIKEKRVTDHLSAFWDFMPTVAEIVGADVKS-----DGI---SYLPLLKGRE 367

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           +N                P + + +     E+    ++R D WKLV
Sbjct: 368 KN----------------PQQHDYIYHEFYEQGGKQSIRKDGWKLV 397


>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
          Length = 459

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 121/362 (33%), Positives = 178/362 (49%), Gaps = 45/362 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G+  I TPNID +A  G+ L   Y    V TPSRA+LMTG+ P+  G+ G  
Sbjct: 42  GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           +    P    G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+++FG   
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
             I Y + +        V+  G   R    T     G+           T  +T++AV  
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I++    +P FLY AH   H          P  T  +F+  +   R  Y  +V+++D SV
Sbjct: 210 IKNHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V+ AL+  G+ EN+ +IF SDNG    E      + N GS  P +  K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
           PAI W P  + NP ++ ++M   D  PT  + AG +  +  L +DG++Q   L     S 
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369

Query: 439 RN 440
           R+
Sbjct: 370 RD 371


>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 486

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 130/417 (31%), Positives = 202/417 (48%), Gaps = 49/417 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYP--IHTGM--Q 147
           GW+DL  +G++   TPNID  A   +   + YA  VC+PSR++LMTGK+   +H  +  +
Sbjct: 37  GWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGKHAARLHFTIWAE 96

Query: 148 GPPIWGAEPR---------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           G    GA+ R          +P +E+ +  YL+  GY T  IGKWHLG +  E+ P   G
Sbjct: 97  GAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWHLGDW--EHYPEAHG 154

Query: 199 FESHFGYLN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           F+ + G  N G    +    S   +     +G + R      +   GEY TD  T EA++
Sbjct: 155 FDINIGGTNWGAPQTFWWPYSGSGT-----HGPEFRYIPHLEYGHPGEYLTDRLTDEAIK 209

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRR--TYAAMVKK 313
           +I D   D+P F+YLAH A H       +EA  + I  F  +Y    N R   YAAM K+
Sbjct: 210 VI-DHAGDQPFFVYLAHHAVHTP-----IEAKADDIQHFDAKYRDGMNHRHTIYAAMNKE 263

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD++VG V+  L+ +G+ +N+++IF SDNG      + +       +N P R  K  L+E
Sbjct: 264 LDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYE 323

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GG++VP I+  P +  N     + + ++D L T     G   +      DG+D   S LL
Sbjct: 324 GGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA--TDGMD--ISPLL 379

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
             PS + +         +L  + P   ++           +A+R   WKL+   ++N
Sbjct: 380 KDPSAKLNR-------DALFFHYPHYYHTT-------TPVSAIRARDWKLLEFYEDN 422



 Score = 44.3 bits (103), Expect = 0.19,   Method: Compositional matrix adjust.
 Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 10/73 (13%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITD 70
           GEY TD  T EA+++I D   D+P F+YLAH A H       +EA  + I  F  +Y   
Sbjct: 196 GEYLTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHTP-----IEAKADDIQHFDAKYRDG 249

Query: 71  PNRR--TYAALTK 81
            N R   YAA+ K
Sbjct: 250 MNHRHTIYAAMNK 262


>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
 gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
           CL03T12C61]
          Length = 463

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 154/301 (51%), Gaps = 36/301 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND  F GS E+ TPNIDAL   G++  + + A  V +PSRA L+TG+Y    G +   
Sbjct: 43  GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECN- 101

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+PL E  + E  +  GY T AIGKWHLG  R E  P  RGF+  +G   G  
Sbjct: 102 -LSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLGS-RDEQHPNNRGFDLFYGMKAGGR 159

Query: 211 SY-YDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            Y Y+   SD+    R + LN   ++           +Y TD F+++AV+ I +    +P
Sbjct: 160 DYFYNEKKSDRPGDERNLLLNDRQVK---------FEKYLTDAFSEKAVEFINESS--QP 208

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             +YLA+ A H       ++A  E + +F+    P R+  AAM   LD  VGTVI  L+ 
Sbjct: 209 FMMYLAYNAVHT-----PMQATDEDMAKFE--GHP-RQKLAAMTYALDRGVGTVIRGLKD 260

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +N++I F+SDNG  T          N  SNYP +G K   +EGG +VP  ++W   
Sbjct: 261 SGKFDNTLIFFLSDNGGATT---------NQSSNYPLKGFKGNKFEGGHRVPYFVVWKNG 311

Query: 387 I 387
           I
Sbjct: 312 I 312


>gi|373956421|ref|ZP_09616381.1| sulfatase [Mucilaginibacter paludis DSM 18603]
 gi|373893021|gb|EHQ28918.1| sulfatase [Mucilaginibacter paludis DSM 18603]
          Length = 510

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 122/386 (31%), Positives = 177/386 (45%), Gaps = 71/386 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ--- 147
           G+NDLS +G+  I TPNIDALA  G+     Y + P+C+PSRA L+TG+Y    G +   
Sbjct: 41  GYNDLSSYGNTLINTPNIDALAKEGVNFTQGYVSAPICSPSRAGLITGRYQQRFGYEFFA 100

Query: 148 -GPPIWGAE--------------------------------PRGVPLTERFLPEYLRELG 174
             P  WG +                                P+G+P  E  +   L++ G
Sbjct: 101 ATPATWGVQDAAHAEASKNALRKYGAYYTIDGLALETYNKTPQGLPANEITIAGLLKKQG 160

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----YDHILSD----QYSRTV 225
           Y T  IGKW+LG    ++ P   GF+ H+G+L+G   Y      D ++ D     ++  +
Sbjct: 161 YKTAVIGKWNLGE-TDDFIPEKYGFDYHYGFLSGGSRYGSTDDVDFVVKDLPHLYWNAQI 219

Query: 226 ELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA 282
              G     +R+N  T   T  EY T  F  EA   IE    ++P FLY+   A H    
Sbjct: 220 IKYGKGPVKLRKNAGTVLTT--EYLTTRFGNEAADFIEANK-NRPFFLYVPFNAPH---- 272

Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
                A +E       + D  RR Y AMVK LDD+VG +   L+   + +N++IIF SDN
Sbjct: 273 -DPFMAKKEDFAAVTTVKDSTRRVYQAMVKSLDDAVGVITKKLKELNLDKNTLIIFTSDN 331

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
           G         + Y +   N P RG K T +EGG++VP I+  P       V  + +   D
Sbjct: 332 GG--------AMYTHALDNKPLRGGKATHFEGGIRVPFIIKYPGGIPAGEVFNKPVSTLD 383

Query: 403 WLPTLYTAAGGDTSRLPLNI--DGLD 426
             PT+    G   + LP N+  DG++
Sbjct: 384 LFPTIAAVTG---ANLPGNVTYDGVN 406


>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
 gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
          Length = 463

 Score =  156 bits (395), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 109/301 (36%), Positives = 154/301 (51%), Gaps = 36/301 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ND  F GS E+ TPNIDAL   G++  + + A  V +PSRA L+TG+Y    G +   
Sbjct: 43  GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECN- 101

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+PL E  + E  +  GY T AIGKWHLG  R E  P  RGF+  +G   G  
Sbjct: 102 -LSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLGS-RDEQHPNNRGFDLFYGMKAGGR 159

Query: 211 SY-YDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            Y Y+   SD+    R + LN   ++           +Y TD F+++AV+ I +    +P
Sbjct: 160 DYFYNEKKSDRPGDERNLLLNDRQVK---------FEKYLTDAFSEKAVEFINESS--QP 208

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             +YLA+ A H       ++A  E + +F+    P R+  AAM   LD  VGTVI  L+ 
Sbjct: 209 FMMYLAYNAVHT-----PMQATDEDMAKFE--GHP-RQKLAAMTYALDRGVGTVIRGLKD 260

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +N++I F+SDNG  T          N  SNYP +G K   +EGG +VP  ++W   
Sbjct: 261 SGKFDNTLIFFLSDNGGATT---------NQSSNYPLKGFKGNKFEGGHRVPYFVVWKNG 311

Query: 387 I 387
           I
Sbjct: 312 I 312


>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
 gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
          Length = 446

 Score =  156 bits (394), Expect = 3e-35,   Method: Compositional matrix adjust.
 Identities = 120/368 (32%), Positives = 190/368 (51%), Gaps = 52/368 (14%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPI 142
            + L+   GW ++  +GS+   TP++D LA  G+   N YA   VC+P+RA+LMTG+YP 
Sbjct: 30  VMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMRFTNFYANSTVCSPTRAALMTGRYPD 89

Query: 143 HTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
             G+ G         WG   P+ V L     P+ L+  GY T  +GKWHLG    E  P 
Sbjct: 90  LVGVPGVIRGNVENSWGYFSPKAVTL-----PQVLKTAGYRTAMVGKWHLGL-EPENHPN 143

Query: 196 YRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
             GF    G+L  ++  YY H+   +    + LNG  +        D  G +ATDLFT+ 
Sbjct: 144 RHGFTHFHGFLEDMMDDYYTHLREGK--NWMRLNGQVI--------DPSG-HATDLFTQW 192

Query: 255 AVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAA 309
           A++ +  Q  + +P FLYLA+ A H       ++ P+E + + +   +PN    R    A
Sbjct: 193 AIEYLNTQKKNPQPFFLYLAYNAPHF-----PVQPPREWLEKVKK-RNPNLSEKRAKLTA 246

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
           +++ LD+S+G V+ AL++ G  +N++I+F SDNG    +  +         N  ++G K 
Sbjct: 247 LIEHLDESIGKVMEALRQNGQADNTLIVFSSDNGGLLSDEAD---------NGIWKGGKQ 297

Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
           T++EGG++VP I +W  +I    +   + + + D  PT   AAG   ++ P  +DG   +
Sbjct: 298 TMYEGGIRVPTIAVWKNKIIPGTKTDFRALTM-DLFPTFCEAAG---TKPPQAVDG-HSF 352

Query: 429 SSLLLNTP 436
            SLLL  P
Sbjct: 353 LSLLLGQP 360


>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
           laevis]
 gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
          Length = 520

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 163/581 (28%), Positives = 236/581 (40%), Gaps = 140/581 (24%)

Query: 79  LTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           +T+   L LL+   GW DL   G     TPN+D +A  G++  N Y A P+C+PSRA+L+
Sbjct: 25  VTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASEGMLFPNFYTANPLCSPSRAALL 84

Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           TG+ PI  G        +  R          G+  +E   PE L++ GY  K IGKWHLG
Sbjct: 85  TGRLPIRNGFFTT---NSHARNAYTPQEIVGGISDSEILFPELLKKAGYINKIIGKWHLG 141

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
             R  Y PL  GF+  FG  N     YD             N       L   W+ VG Y
Sbjct: 142 H-RPSYHPLRHGFDEWFGSPNCHFGPYD-------------NKQIPNIPLYRDWNMVGRY 187

Query: 247 --------------ATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
                          T ++ +EA++ I  Q V  +P FLY +  A HA            
Sbjct: 188 YEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDATHA-----------P 236

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
                 ++   NR  Y   V+++D S+G +++ L   G+ +++ + F SDNGA  +    
Sbjct: 237 VYASRPFLGTSNRGLYGDAVREIDFSIGKILNLLNLTGIAKDTFVFFTSDNGAALI---- 292

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
            S     GSN P+   K T +EGG++VP I W P      +VS Q+  I D   T  + A
Sbjct: 293 -SAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLA 351

Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK 471
           G                    L  P  R+  IDG+D    +L +  + R       +E  
Sbjct: 352 G--------------------LEIP--RDRAIDGIDLSPVILESKETERPIFYYRGNE-- 387

Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-F 530
              AVRL   K    T  N                              S ++LSQ I F
Sbjct: 388 -LMAVRLGQHKAHYWTWSN------------------------------SWEELSQGINF 416

Query: 531 LPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI---ASSRP 587
            P  NI  + +  Q   +    NP              LF+LG DP E+ +I   ++   
Sbjct: 417 CPGQNITGVTTHMQ---VEHSINP-------------LLFHLGRDPGEKYSISVLSTEYE 460

Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           D  +++  +++ HR ++VP    +P L   D    N  WSP
Sbjct: 461 DAVNRISYVVQQHRDSMVPG---EPQLNVCDLAVMN--WSP 496


>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
           rubripes]
          Length = 519

 Score =  156 bits (394), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 124/376 (32%), Positives = 178/376 (47%), Gaps = 50/376 (13%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           + L+   GW DL   G     TPN+DA+A  G++L N Y A P+C+PSRA+L+TG+ P+ 
Sbjct: 31  IMLMDDMGWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGRLPVR 90

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+   E  LP+ L++ GY  K +GKWHLG  R +Y PL 
Sbjct: 91  NGFYTTNGHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLGH-RPQYLPLE 149

Query: 197 RGFE-------SHFGYLNGVIS-----YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
            GF+        HFG  N  +      Y +  +  +Y    +++      NL        
Sbjct: 150 HGFDEWFGAPNCHFGPYNNSVRPNIPVYRNSWMLGRYYEEFKIDKKTGESNL-------- 201

Query: 245 EYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
              T ++  E +  I+ Q    KP FLY A  A HA      + A ++ + + Q      
Sbjct: 202 ---TQMYLLEGLDFIQSQAEAQKPFFLYWAPDATHA-----PVYASKDFLGKSQ------ 247

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           R  Y   V +LD SVG ++S L+R G+  N+ + F SDNGA  +     S     GSN P
Sbjct: 248 RGRYGDAVMELDYSVGQILSLLRRLGIDSNTFVFFTSDNGAALMSGPNES-----GSNGP 302

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           +   K T +EGG++ PAI W P   +   VS Q+ ++ D L T   A  G +      ID
Sbjct: 303 FLCGKETTFEGGMREPAIAWWPGHIREGTVSFQLANVMD-LFTTSLALAGMSPPDDRTID 361

Query: 424 GLDQWSSLLLNTPSRR 439
           G+D    LL NT   R
Sbjct: 362 GMDLTPVLLHNTVLNR 377


>gi|410029536|ref|ZP_11279368.1| sulfatase [Marinilabilia sp. AK2]
          Length = 486

 Score =  155 bits (393), Expect = 4e-35,   Method: Compositional matrix adjust.
 Identities = 126/370 (34%), Positives = 181/370 (48%), Gaps = 52/370 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL F G   I TPN+D LA  G+  +N YA   VC PSR++LMTG +  HT ++G  
Sbjct: 45  GYGDLGFLGQKIIETPNLDRLAAEGMFFSNHYAGATVCAPSRSALMTGLHTGHTPVRGN- 103

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
            +  +P G   +P T   +P   +E GY+T A GKW LGF   E  P ++GF+  FGY  
Sbjct: 104 -FEIQPEGQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQGFDQFFGYNC 162

Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             I+  YY   L +  +  V L G+D  + ++        YA D+  +E+++ I D   +
Sbjct: 163 QRIAHRYYPEYLWEN-AEKVFLEGNDWIQKIT--------YAPDIIHQESLKFIGDN-AN 212

Query: 266 KPLFLYLAHLAAHA-----------------GNAGKHLEAP------QETINQFQYITDP 302
            P  +++  +  HA                 G    H+ AP       E IN   Y ++P
Sbjct: 213 NPFLMFIPSVMPHAELAVPDGELLAYYREKIGEEKPHI-APVGWDYGDEPINIPGYQSNP 271

Query: 303 N-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
             + +YAAMV  LD  VG ++  L   G+ E +IIIF SDNG P  E     ++ N  SN
Sbjct: 272 YPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNG-PHQEGGNDPDFFN--SN 328

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             +RG K  L+EGG++VP I+  P + Q    S  +    D+LPT     G        N
Sbjct: 329 GIFRGYKRDLYEGGIRVPMIVKWPGVIQGGAQSGHVSAFWDFLPTFADLLGEK------N 382

Query: 422 IDGLDQWSSL 431
           +D LD  S L
Sbjct: 383 LDKLDGVSFL 392


>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 580

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 150/492 (30%), Positives = 212/492 (43%), Gaps = 119/492 (24%)

Query: 74  RTYAALTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
           +T  A T S    +LI+    G++D   +GS EI TPNID LAY G+ L   Y   +C P
Sbjct: 23  KTTPAPTPSKPNIILIMVDDLGYSDFGAYGS-EIQTPNIDKLAYGGLRLKEFYNNSICAP 81

Query: 131 SRASLMTGKYPIHTGMQ------GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
           +RASL+TG+YP   G+       G P +        LT     E L++ GY+T   GKWH
Sbjct: 82  TRASLITGQYPHKAGLGYFNTNLGLPAYQGWLNQESLT---FGEVLQQGGYNTYLTGKWH 138

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           +G     Y P  RGF   +G++ G  +YYD     + +  VEL  ++ R NL+      G
Sbjct: 139 VG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAPPVELVENNQRINLAP-----G 192

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ------- 297
           +Y TD  T  A+  I ++  DKP FLYLA  A H       L+AP E I +++       
Sbjct: 193 KYLTDEITNHALSYI-NESKDKPFFLYLAFNAPHW-----PLQAPAEDIAKYKGKYKIGW 246

Query: 298 --------------YITDPNRR---------------------------TYAAMVKKLDD 316
                          I DP ++                            YAAMV  +D 
Sbjct: 247 DSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFDEKELWQRKMEVYAAMVDHVDQ 306

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAP----------------------TVEYRETSN 354
           SVG V+  L++    +N++I+ +SDNGA                       + +Y+E   
Sbjct: 307 SVGRVVEELKKLKKDDNTLIVLISDNGAQGGLNQLGQRRRQRNSGPIGSAGSYDYQE--- 363

Query: 355 YRNWG--SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            +NW   SN P+R  K T  EGG+  P I W P+  +   ++    H+ D  PT Y  AG
Sbjct: 364 -QNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITKGTGHLIDLAPTFYDVAG 422

Query: 413 GDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR 472
                 P N  G+    +  L   S +N   +G D    L+ N P           E+  
Sbjct: 423 ---VNYPTNYRGV---QTNTLPGVSLKNLLFEGND----LVRNEPI--------FWERAG 464

Query: 473 TAAVRLDSWKLV 484
             AVR   WKLV
Sbjct: 465 NRAVRKGKWKLV 476


>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
          Length = 339

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 98/305 (32%), Positives = 152/305 (49%), Gaps = 36/305 (11%)

Query: 91  YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           YG+ D  +  S +I TPNID L  +GI   N Y+   C+PSR++ ++G+Y   +GMQ   
Sbjct: 42  YGYQDAGYRNS-DIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGV 100

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           I   +   + L   F+ +YL+EL Y+T A GKWHLG+  +E TP YRGF++  G  +G  
Sbjct: 101 IGDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEG 160

Query: 211 SYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
            YY+H      Y      NG ++ R+ S      G ++ D+   + +  + D+    PLF
Sbjct: 161 EYYEHTTFQGMYDWH---NGTELDRSAS------GIHSQDMIEADLLNQL-DKNDGTPLF 210

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISA 324
            Y A    H         +P     + + + D     P RR Y  +V  +D  +G V++ 
Sbjct: 211 YYAAFHNTH---------SPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVTK 261

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           L+ K M +N+ I F SDNG    E          G N P RG K++L++GG    +   S
Sbjct: 262 LKEKDMFDNTYIFFSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASS 311

Query: 385 PQIQQ 389
           P +++
Sbjct: 312 PLLKK 316


>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
          Length = 511

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 123/369 (33%), Positives = 181/369 (49%), Gaps = 38/369 (10%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL   G     TP +D +A  G++  + Y A P+C+PSRA+L+TG+ PI 
Sbjct: 24  LMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGRLPIR 83

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+P  E  LPE L++ GY +K +GKWHLG  + +Y PL 
Sbjct: 84  NGFYTTNGHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAKYLPLK 142

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEY-ATDLFTK 253
            GF+  FG  N     YD++ +      R  E+ G   R       +  GE   T L+TK
Sbjct: 143 HGFDEWFGAPNCHFGPYDNVHTPNIPVYRNEEMAG---RYYQDFKIEKNGESNLTQLYTK 199

Query: 254 EAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           EAV+ I       KP FLY A  A H     + L A +       ++   NR  Y  +V+
Sbjct: 200 EAVEFITRMHNKSKPFFLYWAVDATH-----EPLYASKP------FLGTSNRGLYGDVVR 248

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           +LD +VG ++++L+   +  N+++ F SDNG  T   +        GSN P+   K T +
Sbjct: 249 ELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYAKQHG------GSNGPFLCGKETTF 302

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSRLPLNIDGLDQWSS 430
           EGG++ P I W P   +  +VS Q+  + D   T   +      T R+   IDG+   +S
Sbjct: 303 EGGMREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFAEVAMPTDRV---IDGISLRAS 359

Query: 431 LLLNTPSRR 439
           LL NT  +R
Sbjct: 360 LLNNTNVQR 368


>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
 gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
          Length = 482

 Score =  155 bits (393), Expect = 5e-35,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 172/350 (49%), Gaps = 43/350 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS+ + TPN+D LA  G +    Y +  VC PSRA ++TGKY    G +   
Sbjct: 49  GYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 108

Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
           + G         +  G+PL ++ + +YLRE GY T  IGKWH G   R + P  RGF+  
Sbjct: 109 VPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQGNADR-FHPTKRGFDEF 167

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           +G+  G  SY+     +  S   +     + R  +   ++   Y T+    E V+ I+ +
Sbjct: 168 YGFRGGARSYFGFGAQNPVSYPED----KLERGFAHFQES-KRYLTEALATETVEFIK-R 221

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
               P F++L+  A H       +EA    + QF  +    R+  AAM   +D  +G V+
Sbjct: 222 NQKHPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLAAMTLSMDREIGKVL 275

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
           + L   G+ +N++++F +DNG P+          N  +N P  G K    EGG++VP I+
Sbjct: 276 NTLDELGLSDNTLVVFTNDNGGPSDT--------NASNNGPLSGTKANHLEGGIRVPFIM 327

Query: 383 WSP--QIQQ----NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
             P  ++ Q    NP  +L      D LPT ++AAGG+  +L   IDG+D
Sbjct: 328 RWPAGKVSQGEYPNPISTL------DLLPTFFSAAGGNEQQLAA-IDGVD 370


>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
 gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
          Length = 461

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 52/359 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           G+ DL  +G  +  TP ID +A  G++    YA   VC+ SR +L+TG+Y   +  G++ 
Sbjct: 32  GYADLGCYGRTDYATPAIDRIAREGMLFRQAYANSAVCSASRTALITGRYQNRLPVGLEE 91

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P   G   RG+P +   +P  LR  GY T  IGKWHLG   + Y PL  G++  +G   G
Sbjct: 92  P--LGRRVRGLPPSHPTMPSLLRSQGYETTLIGKWHLGQLPK-YGPLQSGYDHFWGLRGG 148

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-----YATDLFTKEAVQLIEDQP 263
            I Y+ H          + NG      +   W+   E     Y TDL  + A+  I+D  
Sbjct: 149 GIDYFSH----------DFNG------VPDLWEDDREIEQPGYMTDLLAERAMTAIDDYA 192

Query: 264 V-DKPLFLYLAHLAAHAGNAG-------KHLEAPQETINQFQYITDP-NRRTYAAMVKKL 314
              +P F+ L   A H    G       + L A  +  + F   TD  +R+TYA MV ++
Sbjct: 193 ARQQPFFMSLHFTAPHWPWEGPDDQTESQRLAASDKPFSMFD--TDGGSRKTYAEMVTRM 250

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D  +G ++  L R G+ E++II+F SDNG              +   +PY G+K  L EG
Sbjct: 251 DMEIGRILERLDRLGLAEDTIIVFTSDNGG-----------ERYSKTWPYNGLKTELLEG 299

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLL 432
           G++VPA++  P+     + S Q++   DW+PT  +        R P+  DG+D  ++LL
Sbjct: 300 GLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTFLSLIQASPDPRYPM--DGMDLSATLL 356


>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
 gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
           8797]
          Length = 459

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 177/346 (51%), Gaps = 29/346 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL+ +G+ ++ TP+ID LA + +   + + A  +CTP+RA+++TG+Y    G Q   
Sbjct: 46  GYGDLACYGNKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFES 105

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +     G+P     + E L++ GY+T   GKWHLG+ +  + P  +GF+   G  +
Sbjct: 106 ALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLGY-QPPWLPTNQGFDLFRGLTS 164

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           G   ++ H+           N   M +           Y  DL +K +V  +E     +P
Sbjct: 165 GDGDHHTHVDRSGNEDWWHNNEISMEKG----------YTADLLSKYSVAFMEANRT-RP 213

Query: 268 LFLYLAHLAAHAGNAGK----HLEAPQE-TINQFQYITDPNRRT--YAAMVKKLDDSVGT 320
            FLY+ HLA H    G     H +A Q+    ++  I DP   +    AM++ LD SVG 
Sbjct: 214 FFLYVPHLAIHFPWQGPQDPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGK 273

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           ++SAL+R  + +N+++IF SDNG   + Y +  N++N  SN P RG K TL+EGG +VP 
Sbjct: 274 ILSALKRLDLEQNTLVIFTSDNGG-YLTYGK--NFQNISSNGPLRGQKATLYEGGHRVPC 330

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           ++  P +     V+ Q  H  D LPTL  AAG   S      DGLD
Sbjct: 331 LISWPGV-ITAGVTDQTAHSVDLLPTLAQAAG--ISATNFQTDGLD 373


>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
 gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
          Length = 481

 Score =  155 bits (392), Expect = 6e-35,   Method: Compositional matrix adjust.
 Identities = 128/409 (31%), Positives = 201/409 (49%), Gaps = 70/409 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G+ +I TP ID+LA +G  L++ ++  PVCTP+RA+L+TG++P   G++   
Sbjct: 76  GYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGRWPQRVGLE--- 132

Query: 151 IWG-----AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
            W       EP G+P+ E  L   L+E+GY T  +GKWHLG+ R E+ P   GF+  FG 
Sbjct: 133 -WALSASDTEP-GLPVEEPILSRPLKEVGYRTVMVGKWHLGY-RPEFGPNAHGFDEFFGL 189

Query: 206 LNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIED--- 261
           L+G + +Y H          E+NG  D   N       V  Y+TDL +  AV  I+    
Sbjct: 190 LSGNVDHYSH---------REINGKEDWYENTKPV--RVEGYSTDLLSDRAVAAIQKTAA 238

Query: 262 QPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
           QP D+  PL+LY+A+ A H     +    PQ+  ++  +     R  Y  MV+ +D  VG
Sbjct: 239 QPPDQRQPLWLYVAYNAVHW--PFQPPGRPQDIRDRSTWFNG-TRADYVKMVESIDAGVG 295

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            ++ AL+  GM +++++IF +DNG   +             N P+   K T+WEGG +VP
Sbjct: 296 RILDALEAGGMADHTLVIFTNDNGGERLS-----------DNGPFFHHKGTVWEGGHRVP 344

Query: 380 AIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
            ++ W  +I          + + D   T+  AA     ++P N              P R
Sbjct: 345 CLIRWPGRIAAGTEFGQPTIGM-DLTATILAAA-----KIPPN--------------PER 384

Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVL 485
               +DG++    L    P    ++   ID   +K+ AA+R   WK V+
Sbjct: 385 ---PLDGVNLVPILRGEVPCPDRALFWRIDRPNRKQKAALR-GRWKYVV 429


>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
 gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
          Length = 452

 Score =  155 bits (392), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHCRFGNNYMRENLKEI-DPQG-HATEVF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  QI+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|437818613|ref|ZP_20842917.1| arylsulfatase, partial [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SARB17]
 gi|435307850|gb|ELO82888.1| arylsulfatase, partial [Salmonella enterica subsp. enterica serovar
           Enteritidis str. SARB17]
          Length = 555

 Score =  155 bits (391), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 147/487 (30%), Positives = 222/487 (45%), Gaps = 82/487 (16%)

Query: 53  KHLEAPQETI--NQFQYITDPN-----RRTYAALTKST------TLTLLIVYGWNDLSFH 99
           +++ AP   I  N    I+ P      R+  A L K T       + +L   GW D+ F+
Sbjct: 50  QYIAAPTTKIADNLMPVISHPGQDKETRQKLADLEKKTGKKPNIVVFILDDVGWMDMGFN 109

Query: 100 GSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEP 156
           G       PTP+IDA+A  G+IL + Y+QP  +P+RA++MTG+Y +H G+  PP++G  P
Sbjct: 110 GGGVAVGNPTPDIDAIASQGLILTSAYSQPSSSPTRATIMTGQYSVHHGILMPPMYGM-P 168

Query: 157 RGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY-- 213
            G+  LT   LP+ L + GY T+AIGKWH+G       P   GF+   G+ N V   Y  
Sbjct: 169 GGLEGLTT--LPQLLHDQGYVTQAIGKWHMG-ENTGSQPQNVGFDDFRGF-NSVSDMYTE 224

Query: 214 --------DHILSD---QYSRTVELNGHDMRRNLSTAWDTVGE----YATDL---FTKEA 255
                   +  LS    QY + +  + +D+        + + +    Y  DL   + K  
Sbjct: 225 WRDPNVNPEVALSPSRLQYIKNLPFDKNDVHAVRGGKQEAIADITPKYMEDLDQRWMKYG 284

Query: 256 VQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           V  I      DKP FLY      H  N            N +     P R +Y+  + ++
Sbjct: 285 VDFINKMAKNDKPFFLYYGTRGCHFDN----------YPNAYYAGRSPARTSYSDCIVEM 334

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           +D    +  AL+  G L+N++I+F SDNG P  E              P+RG K + WEG
Sbjct: 335 NDVFANLYKALETSGQLDNTLIVFTSDNG-PEAEVPPH-------GRTPFRGAKGSTWEG 386

Query: 375 GVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
           GV+VP  + W   IQ  PR S  ++ ++D  PT  + AG   + L          + L+ 
Sbjct: 387 GVRVPTFVYWKGMIQ--PRKSDGLVDLADLFPTSLSLAGYPGAEL----------AKLVP 434

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE--NG 491
            T     + IDG+DQ SS  + T  + +    +       +AVR+D +K  L  Q+    
Sbjct: 435 KT-----TFIDGIDQ-SSFFIGTNGQSSRKAEHYFLNGELSAVRIDEFKYHLLIQQPYAF 488

Query: 492 TMDGYYG 498
           T  GY G
Sbjct: 489 TQTGYQG 495


>gi|282900986|ref|ZP_06308919.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
           raciborskii CS-505]
 gi|281194077|gb|EFA69041.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
           raciborskii CS-505]
          Length = 434

 Score =  155 bits (391), Expect = 7e-35,   Method: Compositional matrix adjust.
 Identities = 112/336 (33%), Positives = 167/336 (49%), Gaps = 35/336 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG-P 149
           GW DLS  G     TP +D LA +GI  N  Y+  PVCTP+R    TG+YP    +    
Sbjct: 23  GWGDLSIDGQKNYQTPYLDQLARDGIRFNQAYSDSPVCTPTRIGFFTGRYPGRLAIGNYE 82

Query: 150 PIWG----AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           P+       +  G+P     +   LR+ GY T  +GKWH G+  R Y+PL  GF   FG 
Sbjct: 83  PLLSFQQIGDSVGLPPEHPTIASLLRDNGYETVLVGKWHCGYLPR-YSPLKSGFNKFFGN 141

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
            +G I Y+ H         V+ NG        T  + +G Y TD+FT+ AV  I+   V+
Sbjct: 142 FSGAIDYFRH---------VDTNGKPDLWEADTPIEKIG-YVTDIFTERAVDFIKGPHVN 191

Query: 266 KPLFLYLAHLAAHAGNAG-KHLEAPQETI---NQFQYITDPNRRTYAAMVKKLDDSVGTV 321
            P +L L + A H    G + +E   + I   N   +I    + +Y A+V+ LD+ VG V
Sbjct: 192 -PFYLSLHYTAPHWPWQGPEDVELSNDLIGRDNLENWINTGTKESYKAVVQSLDNGVGKV 250

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+  G+   +I+IF SDNG           + N+G    ++G K  L+EGG++VP +
Sbjct: 251 LQALEDVGIANRTIVIFASDNGG--------ERFSNFGL---FQGKKGNLYEGGIRVPTL 299

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           + WS  I  N ++S Q++   D   T+  A   +T+
Sbjct: 300 IRWSGVIAPN-QISDQVIITHDITATILAATKTNTN 334


>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
          Length = 328

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 113/337 (33%), Positives = 160/337 (47%), Gaps = 35/337 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALA-YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL +   + I TPNID LA   G+IL+N Y    C+PSRAS +TG+ PIH     P 
Sbjct: 11  GYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRVPIHVHQGNPG 69

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           +  A    +  T   +P  LR  GY T  +GKWH G    E  P+ RGF++ FGYL+G  
Sbjct: 70  LAAAGCTNLNYT--MIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRGFDTSFGYLSG-- 125

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
              DH+  DQ +   + N  D   +   A    G Y++  +    V +I+    ++PL L
Sbjct: 126 -EEDHM--DQTTNGGQCNVTDFWLDSGPAIRRNGTYSSFQYNDAIVDIIQQHAPEQPLML 182

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY-----------ITDPNRRTYAAMVKKLDDSVG 319
           Y A    H        E P+    +F              +   R    AMV  LDD VG
Sbjct: 183 YAALQNVHG-----PYEVPERYRAKFPADKNCTETGDNPTSQCTRDFMLAMVNVLDDIVG 237

Query: 320 TVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
            +  AL+  GM  N++++F SDNG   P      + N      N+P RG K   +EGGV+
Sbjct: 238 NITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMN------NHPLRGGKVAYFEGGVR 291

Query: 378 VPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAG 412
             A + SP + ++ R +    ++HISDW  T    AG
Sbjct: 292 TAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328


>gi|393719701|ref|ZP_10339628.1| sulfatase [Sphingomonas echinoides ATCC 14820]
          Length = 563

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 128/413 (30%), Positives = 186/413 (45%), Gaps = 83/413 (20%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNE----IPTPNIDALAYNGIILNNMYA-QPVCTP 130
           A T+   + L++    G+ND+SF+G       + TPNIDAL ++G+   + YA    C P
Sbjct: 56  AGTRPPNIVLIVADDLGYNDISFNGGGVAGGLVKTPNIDALGHDGVTFADGYAGNATCAP 115

Query: 131 SRASLMTGKYPIHTGMQ------GPPIW-------------------------------- 152
           SRA+LMTG+YP   G +       PP W                                
Sbjct: 116 SRAALMTGRYPTRFGYEFTPADTHPPAWFPSSLIRTDAMFSRTIANFPTTWEHKTIFDED 175

Query: 153 ------GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
                    P+GVP +E  + E LR  GY T  +GKWHLG   +   P  +GF+   G+L
Sbjct: 176 AARQPASTAPKGVPSSEVTIAELLRGKGYHTMHLGKWHLGE-AKGMRPEDQGFDESLGFL 234

Query: 207 NGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVG-------EYATDLFTKEAVQ 257
            G   Y      D  +     +  D  +  NL  +    G        Y TD  T +A  
Sbjct: 235 IGGQMYLPEASPDVENSKQPFDPIDKFLWANLPFSVQYNGGERFHPDRYMTDYLTVQAKA 294

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
            I     ++P FLYLA+ A H       L+A +   +    ITD   R YAAM++ LD  
Sbjct: 295 AITAN-RNRPFFLYLAYNAPHT-----PLQASKADYDALPQITDHRLRVYAAMIRALDRG 348

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGV 376
           VG V++ L+++G+ +N+++IF SDNG         +NY      N PYRG K T +EGGV
Sbjct: 349 VGQVMATLKQQGLDQNTLVIFTSDNGG--------ANYIGLPDINRPYRGWKATFFEGGV 400

Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
           KVP  + W  +I+   R++  + H         TA     + LP +  +DG+D
Sbjct: 401 KVPFFMRWPARIKPGTRIAGPVSHFD----IFATAGDAGHASLPRDRALDGVD 449


>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 374

 Score =  155 bits (391), Expect = 8e-35,   Method: Compositional matrix adjust.
 Identities = 105/316 (33%), Positives = 152/316 (48%), Gaps = 36/316 (11%)

Query: 84  TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
            + L+  +GW ++ +H    + E+ TPNID L   G+ LN  Y    C+PSR+SL++G+ 
Sbjct: 28  VMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEGLELNQHYVYKFCSPSRSSLISGRL 87

Query: 141 PIHTGMQG---------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           PIH   Q           P+ G     +P     + E +R  GY+T  +GKW  G    +
Sbjct: 88  PIHVNDQNLLPTNYNPDDPVSGFS--AIPRNMTGIAEKMRGAGYATHQVGKWDAGMATSD 145

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG----EYA 247
           +TP  RGF++ FGY +    YY    S   +RT  +   D+      A    G     Y 
Sbjct: 146 HTPKGRGFQTSFGYFHHANDYYKETAS-ACNRTQIV---DLWETDKPAHGINGTGPDNYE 201

Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
             LF +  + ++ +     PLFLY A    H       L+ P   +  F +I D +R+ Y
Sbjct: 202 EGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQVPDRYLKLFSFINDHDRQYY 256

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
            AMV  LDD VG +  AL+ KGM +N + +  SDNG P         Y   G+ NYP RG
Sbjct: 257 HAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPV--------YPGGGANNYPLRG 308

Query: 367 VKNTLWEGGVKVPAIL 382
            K T W+GGV+V A +
Sbjct: 309 GKVTDWQGGVRVNAFV 324


>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
 gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
          Length = 478

 Score =  155 bits (391), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 119/358 (33%), Positives = 170/358 (47%), Gaps = 36/358 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           G+ D+S  G+    TP+ID +A+ GI L   YA   VC+ +R +L+TG+Y   I  G++ 
Sbjct: 56  GYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALITGRYQDRIACGLEE 115

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P    +   G+P     LPE LR  GY T  +GKWHLG    E+ PL  G++  +G   G
Sbjct: 116 PIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLGN-PPEFGPLKSGYDHFYGISGG 174

Query: 209 VISYYDHILS---DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
              Y+ H      DQ+ R   L  HD              Y T L   EAV+L+E    +
Sbjct: 175 AADYFTHQGKPGEDQFYRDDHLI-HDH------------GYLTGLLGDEAVRLVETFASE 221

Query: 266 -KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-RRTYAAMVKKLDDSVGTVIS 323
             P FL L   A H    G   EA  + +    +  D   R+TY  M+K +DD +G V++
Sbjct: 222 TSPFFLSLHFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMIKAMDDQIGRVLA 281

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L    + EN+I++F SDNG              +   +P+ G+K  L EGG++VPA+L 
Sbjct: 282 TLDSYHLTENTIVVFTSDNGG-----------ERFSDTWPFTGMKGELLEGGLRVPAVLR 330

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
            P        S Q +   DWLP+ + AA G T       DG+     +L   PS+R S
Sbjct: 331 WPARVAAGSESDQTLITMDWLPS-FLAATGATPDATFPSDGISLLPFVL--DPSKRQS 385


>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
 gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
           CL02T12C01]
          Length = 456

 Score =  154 bits (390), Expect = 9e-35,   Method: Compositional matrix adjust.
 Identities = 120/353 (33%), Positives = 181/353 (51%), Gaps = 46/353 (13%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L ++   G  DLS   + ++ TPNID L + G+ ++N YA   V +PSRA L+TG +P  
Sbjct: 40  LIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLTGCFPDM 99

Query: 144 TGMQGPPIWGAEPRG----VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
            G+  P +   +P+G    +      LPE +++ GY T  IGKWHLG       P  RGF
Sbjct: 100 VGV--PGVIRTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLGL-ESPNLPNERGF 156

Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
           +   G+L  ++        D Y+  +    + MR N +   D  G +ATDLFT  A   I
Sbjct: 157 DFFHGFLGDMM--------DDYNTHLRRGFNYMRLN-TEEIDPQG-HATDLFTDWASDYI 206

Query: 260 -EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLD 315
            + +  DKP FLYLA+ A H+      L+ P E    + +        R    A+++ +D
Sbjct: 207 FKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISETRGKLVALIEHMD 261

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
           D VG VI++L++ G L+N++IIF SDNG          + ++  +N P RG K  +++GG
Sbjct: 262 DGVGRVINSLEKSGQLDNTLIIFCSDNGG---------DRKSEANNGPVRGDKGDMYDGG 312

Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
           +KV   L W   ++     +L MM  SD  PTL      D  +LP+N  IDG+
Sbjct: 313 IKVACSLYWKGHLEHRRVNNLVMM--SDIFPTLC-----DLVQLPVNHRIDGI 358


>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
           queenslandica]
          Length = 523

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 115/361 (31%), Positives = 173/361 (47%), Gaps = 56/361 (15%)

Query: 76  YAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
           ++  T +    L+   GW DL  +G     TPN+D +A  G++L + Y A P+C+PSRA+
Sbjct: 28  FSPTTPNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAA 87

Query: 135 LMTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWH 184
           ++TG+ PI  G        A  R          G+P +E   PE L++ GY+T  IGKWH
Sbjct: 88  MLTGRLPIRNGFY---TTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWH 144

Query: 185 LGFFRREYTPLYRGFE-------SHFGYLNG-----VISYYDHILSDQYSRTVELNGHDM 232
           LG  +  Y PL  GF+        HFG  +G     +  Y +  ++ +Y +   +N    
Sbjct: 145 LGQ-QTHYHPLKHGFDEFFGSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTG 203

Query: 233 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQE 291
             NL+             +T+EA++ I     +K P FLY    A H       L A ++
Sbjct: 204 ESNLTVE-----------YTQEAIKFINKNAANKKPFFLYWTPDATHT-----PLFASKD 247

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
            +   Q      R  Y   VK+LD SVG ++  L+  G+ +++++IF SDNG  T  Y +
Sbjct: 248 FLGTSQ------RGLYGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGAT--YAK 299

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
            S     GSN P+   K T +EGG++ P I W P   Q  +VS Q+  + DW  T    A
Sbjct: 300 ESG----GSNSPFLCGKETTFEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLA 355

Query: 412 G 412
           G
Sbjct: 356 G 356


>gi|284039849|ref|YP_003389779.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283819142|gb|ADB40980.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 533

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 122/398 (30%), Positives = 181/398 (45%), Gaps = 78/398 (19%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ---G 148
           G++D+  +G  E+ TPN+D LA  GI L + Y    C P+RASL+TG+YP   GM     
Sbjct: 52  GFSDIGCYGG-EVNTPNLDKLAAGGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 110

Query: 149 PPIWGAEPRGVP--LTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
            P    +P      L  R+  + E L+E GYST  +GKWH+G  R E+ PL RGFE +FG
Sbjct: 111 MPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVGE-RPEHWPLKRGFEHYFG 169

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP- 263
            ++G  SYY+ I +++  R + L+  +            G Y TD FT  AVQ +  Q  
Sbjct: 170 LISGASSYYEIIPAEKGKRFIVLDDKEFTPPAD------GFYMTDAFTDYAVQYLNQQKQ 223

Query: 264 --VDKPLFLYLAHLAAH----------------------AGNAGKHLEAPQETINQFQYI 299
              DKP F+YLA+ A H                           ++ +  Q  +   +Y 
Sbjct: 224 EQADKPFFMYLAYTAPHFPLHAYESDIAKYEKLYAQGWDVTRTKRYQKMQQLGLIDKRYQ 283

Query: 300 TDP-------------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
             P                       YAAM+ ++D ++G +I  L+  G  +N++I+FMS
Sbjct: 284 LTPRPANVPAWNSATDKAQWIRKMAVYAAMIDRMDQNIGRLIKTLKANGQYDNTLIVFMS 343

Query: 341 DNGA------------PTVEYRETSNYRNWGSNY------PYRGVKNTLWEGGVKVPAIL 382
           DNG+            PT +  E  +Y  + + +      P+R  K  L EGG+  P I+
Sbjct: 344 DNGSSNENMESRKLNDPTKKIGERGSYVTYDTPWANVSVTPFRKYKRFLHEGGMITPCIM 403

Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
            W   I+        + H+ D LPT    AG   + LP
Sbjct: 404 QWPRNIRPAAGYVDGIGHVMDLLPTSLELAGLSANDLP 441


>gi|440749357|ref|ZP_20928605.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
 gi|436482362|gb|ELP38485.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
          Length = 484

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 126/393 (32%), Positives = 183/393 (46%), Gaps = 53/393 (13%)

Query: 72  NRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
           +R+   ++  +    L    G+ DL   G  +I TPN+D LA  G++  N Y+   VC P
Sbjct: 25  DRQINKSVKPNIIFILADDLGYGDLGVTGQTKIETPNLDKLAREGMLFTNHYSGATVCAP 84

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           SRA+LMTG +  HT ++G      +P G   +P +   LP+ ++  GY+T A GKW LGF
Sbjct: 85  SRAALMTGLHTGHTPIRGNK--EIQPEGQHPMPDSVLTLPKMMKRAGYATGAFGKWGLGF 142

Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTV 243
                 P+ +GF+  FGY       Y H    +Y    S  V L G+D        W   
Sbjct: 143 VGTSGDPINQGFDQFFGY---NCQRYAHRYFPEYLWLNSEKVFLPGND--------WTNK 191

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAG--------KHLE-------- 287
             +A D+  K A++ IE      P F+Y+  +  HA  A         K+ E        
Sbjct: 192 ATFAPDVIQKAALEFIEANS-GNPFFMYVPMVIPHAELAADEESPIFKKYRERFGDEREF 250

Query: 288 --------APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
                    P   I  +Q +  P R TYA MV+KLD  VG +I+ L+  G+ EN+II+F 
Sbjct: 251 VAPAGWDYGPGINIPGYQSVKHP-RATYATMVEKLDAHVGEIIAKLEVLGIRENTIIVFA 309

Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
           SDNG P  E     ++  + SN  YRG K  L+EGG+K   I   P+  +   VS  +  
Sbjct: 310 SDNG-PHQEGGNDPDF--FDSNGIYRGYKRDLYEGGIKTSLIANWPRKIKAGSVSDHVSA 366

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
             D +PT    AG     +   +DG+    +LL
Sbjct: 367 FWDLMPTFAELAG---VSIDAELDGISMVPTLL 396


>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
 gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
           ATCC 8503]
          Length = 452

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  QI+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
 gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
          Length = 454

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 109/329 (33%), Positives = 159/329 (48%), Gaps = 36/329 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +G+++  TPN+D LA  G+     YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 23  GWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIALITGRYQYRLPAGLEE 82

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P        G+P     LP  LRE GY T  IGKWHLG     Y PL  G++  FG + G
Sbjct: 83  P--IARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLGK-PPTYGPLKSGYDRFFGNIGG 139

Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA---VQLIEDQP 263
            + Y+ H   +  Q  R +      + R           Y T++   EA   V+  ED+ 
Sbjct: 140 ALDYFTHKPGVGAQVPRDLWEGDVPVERT---------GYYTNILGDEASAYVRAREDE- 189

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
             KP FL L   A H    G   EA    I    +    N   Y  +V+ LD +VG V++
Sbjct: 190 -KKPFFLSLHFTAPHWPWEGPGDEAVSREITDLFHYDGGNLAKYGELVEALDAAVGQVLA 248

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
           AL   G  +N++++F SDNG              +   +P+ G K  L EGG++VP +L 
Sbjct: 249 ALDETGQADNTLVVFTSDNGG-----------ERFSKTWPFTGQKTELLEGGLRVPTLLR 297

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
           W  +++  P+V  Q+   +DWLPTL  AA
Sbjct: 298 WPARVR--PQVQAQVTITADWLPTLLAAA 324


>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 585

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 118/389 (30%), Positives = 184/389 (47%), Gaps = 55/389 (14%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           AA  K   + + +    GW DLS +G+ +I TPNID+LA++G +  N Y QPVC+P+RA 
Sbjct: 16  AAENKKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAE 75

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           L+TG+Y   +G++     G       L E+ + +  ++ GY+T A GKWH G  +  Y P
Sbjct: 76  LLTGRYAFRSGVRSTSEGGER---FNLDEQTIADVFKKAGYATGAFGKWHSG-MQYPYHP 131

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
             RGF+  +G+ +G    Y       YS  +E NG  ++            +  D FT +
Sbjct: 132 NGRGFDEFYGFCSGHWGNY-------YSPMLEHNGKIVKGQ---------GFCVDDFTSK 175

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY------- 307
           A+  IE+   +KP  +YL +   H+      ++ P     ++Q  T   + TY       
Sbjct: 176 AMDFIEENK-EKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTLTMKNTYNKKWGHG 229

Query: 308 -------AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
                   AM + +D +VG V++ L    + +N+I+++ SDNG          N   W  
Sbjct: 230 EDFVKAALAMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGP---------NSPRW-- 278

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           N   +G K ++ EGGV+ P  L  PQ     +V  ++    D LPTL + AG +      
Sbjct: 279 NGEMKGKKGSVDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMAGIEFQS-KK 337

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
            +DGLD +SSLL       N     L  W
Sbjct: 338 KLDGLD-FSSLLTGKEKNWNDQRTMLAYW 365


>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
          Length = 913

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 103/331 (31%), Positives = 160/331 (48%), Gaps = 39/331 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT---GMQG 148
           GW+D+ +H  + + TP + ALA +G++L+  YA   C+PSR+SL++G+YP+H     M G
Sbjct: 422 GWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRYPMHVNQYNMAG 480

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
             + G    GV +    + + L+  GY+T  +GKWH G    +  P  RGF++  GYLNG
Sbjct: 481 DALGG----GVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFDTSLGYLNG 536

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
                DH    + +  V  N  D+      A+   G Y   ++   A+ +I D     PL
Sbjct: 537 A---EDHWTQARPACGVG-NFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDASVPL 592

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPN----RRTYAAMVKKLDDSVGTV 321
           F+Y A    H         AP +  + +  +    DP     R TY AM    DD V   
Sbjct: 593 FVYFAFQINH---------APMQVPDAYARVYPCDDPKTCGVRSTYQAMTAMADDVVRNA 643

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
            +AL+ KGM ++++I+  SDNG P+    ++SN      N+P RG K +  EGGV+   +
Sbjct: 644 TAALKAKGMWDDTLIVLTSDNGGPSGTDADSSN------NFPLRGGKYSDLEGGVRAAGL 697

Query: 382 LWSPQIQQNPR-----VSLQMMHISDWLPTL 407
           +    +    R      +   +HI DW  T 
Sbjct: 698 VAGGFLPDAARGRTLDGAGAYVHICDWYATF 728


>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 458

 Score =  154 bits (390), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 111/355 (31%), Positives = 169/355 (47%), Gaps = 34/355 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM-QGP 149
           G+ DLS +G     TP +D +A  G+  ++ ++   VC+P+RA+L+TG+Y    G+ +  
Sbjct: 48  GYGDLSCYGCERYQTPFLDQMAAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVI 107

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
              G   RG+PL      E   + GY+T   GKWH G+  R + P+ +GF+   GY++G 
Sbjct: 108 TAAGHRDRGLPLEAVTFAETFSQHGYATAIYGKWHQGYQAR-FNPVRQGFDHFVGYVSGN 166

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           + +  HI  DQ  +    +  ++             Y T L T+ AV+ IED   ++P  
Sbjct: 167 VDFQSHI--DQAGQADWWHNDELTEEAG--------YVTHLITQYAVKFIEDNK-EQPFC 215

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
           LYL H   H    G +  A +    +F      D     YA M+ ++D  +G V+  LQ+
Sbjct: 216 LYLPHETPHYPYQGPNDLADRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQK 275

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
            G+ E ++++F SDNG  ++           G N   RG K + WEGG +VP I W+P+ 
Sbjct: 276 HGLAEQTLVLFFSDNGGTSL-----------GDNGGLRGTKGSDWEGGHRVPMIAWNPET 324

Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL--LNTPSRRN 440
               R   Q+    D  PTL      D + LPL  D      SLL  L  P R N
Sbjct: 325 VPAGRDCDQLAITHDIFPTL-----ADYADLPLPKDQSLDGISLLPYLEDPKREN 374


>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
 gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
          Length = 462

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 113/357 (31%), Positives = 177/357 (49%), Gaps = 32/357 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP--IHTGMQG 148
           G+ D S  GS  I TP ID++   G++L   Y+  P+C+P+R +L+TG Y      G++ 
Sbjct: 46  GYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGCYAQRFAIGVE- 104

Query: 149 PPIWGAEPRG--VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
            P+    P G  VPL    +   ++ LGY T  +GKWHLG     + PL  G++   G +
Sbjct: 105 EPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLGE-PPAHGPLKHGYDHFLGIV 163

Query: 207 NGVISYYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
            G   Y+ H  ++S + +  V L   D + + +        Y TD+F  EAV++IE+   
Sbjct: 164 EGGADYFVHRMVMSGKPA-GVGLAEDDAQTDRTG-------YLTDIFGDEAVRVIEEGG- 214

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           ++P FL L   A H    G+  E     +    +    N   Y  MV+ +D +V  V++A
Sbjct: 215 NQPFFLSLHFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMVETMDQNVAKVLAA 274

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
           + R G  +N++++F SDNG              +   +P+ G K  + EGGV+VP ++ W
Sbjct: 275 IDRSGKADNTVVVFTSDNGG-----------ERFSDTWPFVGHKGEVLEGGVRVPLMVRW 323

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
             +I+   R S Q+M   D+LPTL   AGGD +R+    DG D  + L    P  R 
Sbjct: 324 PRRIKAGSR-SEQVMVSMDFLPTLLGMAGGDAARIG-RFDGADLSAQLAGAAPVTRT 378


>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 444

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 130/410 (31%), Positives = 188/410 (45%), Gaps = 84/410 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G+ +  TP++D LA  GI   N Y    VCTPSRA L+TG+YP  + +    
Sbjct: 45  GYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTGQYPKRSNLHEAV 104

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
           ++     G+      + E L+  GYST  IGKWHLG  + EY P  +GF++ +G  Y N 
Sbjct: 105 LFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQGFDTFYGVPYSND 163

Query: 209 VISYYDHILSDQ------YSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
           + +YY   +  Q      Y  T V  NG D R            Y T  +T+E V+ I++
Sbjct: 164 MDNYYYKNIDFQSPPLPFYENTKVIENGSDQR------------YLTKRYTEETVKRIKN 211

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           +  +KP F+YLAH   H          P      F+         Y  ++ +LD S G +
Sbjct: 212 RG-EKPFFIYLAHNMPH---------TPLFASPAFE--GKSKNGLYGDVIMELDWSAGEI 259

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           I+AL+ +G+ EN+I +F SDNG               GS  P RG K   WEGG +VP I
Sbjct: 260 INALKEEGIYENTIFVFTSDNGPAK------------GSAKPLRGKKAQTWEGGQRVPGI 307

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDGLDQWSSLLLNTPSRR 439
           +  P       V+ + +   D  PT    A   ++++P  + IDG+D  S+ L+N    +
Sbjct: 308 ITWPGSIPRGVVTDEFVSTLDLFPTFAKLA---STKIPEEIKIDGMD-ISAFLMNP---K 360

Query: 440 NSNIDGLDQWSSLLLNTPSR------RNSVLINIDEKKRTAAVRLDSWKL 483
           N N+             P R      RN  L          A+RL  WKL
Sbjct: 361 NENL-------------PERPFYFYARNGEL---------EAIRLGKWKL 388


>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
 gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
 gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
           CL03T12C09]
          Length = 452

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  QI+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 463

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 118/372 (31%), Positives = 185/372 (49%), Gaps = 49/372 (13%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASL 135
           AA   +  L L+   G+ DLS +G+ ++ +PNID L   G+   N YA  PVC+P+RA++
Sbjct: 30  AADKPNVLLILVDDLGYGDLSCYGATDLQSPNIDKLVSRGLKFTNFYANCPVCSPTRAAI 89

Query: 136 MTGKYPIHTGMQG------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           ++GKYP   G+ G         WG      P  E  LP  L+  GY +  IGKWHLG   
Sbjct: 90  LSGKYPDRVGVPGVIRTHADNSWGYL---APEAE-LLPSLLQPAGYHSAIIGKWHLGL-E 144

Query: 190 REYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
               P  RGF+   GYL  ++  YYDH            +G +  R      D  G +AT
Sbjct: 145 APNRPNDRGFDHFKGYLGDMMDDYYDH----------RRHGINYMRENEQEIDPEG-HAT 193

Query: 249 DLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ---ETINQFQYITDPNR 304
           DLFT+ +   +++Q   ++P FLYLA+ A H       ++ PQ   E + + +      R
Sbjct: 194 DLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPPQDWLEKVKKREAGIGEQR 248

Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY-P 363
               A+++ +DD +G V++ L++ G  +++++IF SDNG             N G+N   
Sbjct: 249 AKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGGQL----------NVGANNGA 298

Query: 364 YRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
            R  K +++EGG+KVP A++W  +I+       Q + + D+  T+   AG +    P + 
Sbjct: 299 TRDGKQSMYEGGLKVPTAVVWPGKIEAGQTTEFQALSM-DFCSTILDVAGVEQ---PSDC 354

Query: 423 DGLDQWSSLLLN 434
           DG     +LL N
Sbjct: 355 DGRSILPTLLGN 366


>gi|241836593|ref|XP_002415110.1| arylsulfatase B, putative [Ixodes scapularis]
 gi|215509322|gb|EEC18775.1| arylsulfatase B, putative [Ixodes scapularis]
          Length = 254

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 106/289 (36%), Positives = 152/289 (52%), Gaps = 52/289 (17%)

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V +LD SVG V+ +L R+GMLENS+++F SDNGA     R      N GSN+P RG K+T
Sbjct: 8   VYELDQSVGMVMESLHRRGMLENSVVVFSSDNGA-----RPLGLGANAGSNWPLRGGKST 62

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           LWEGGV+  A +WSP + +  R+S QMMHISDWLPT YTAAGG+ SRL            
Sbjct: 63  LWEGGVRGAAFVWSPLLSRVGRLSDQMMHISDWLPTFYTAAGGNLSRL------------ 110

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
                      +IDG D W +L  N  S R+ +L+NID     +A+  ++ K++LG    
Sbjct: 111 ----------GDIDGKDMWDALSKNLASPRHEILLNIDPIYNYSALIFENRKVILGD--- 157

Query: 491 GTMDGYY-------GQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
            T  G Y       G TR    P+   +  ++ S+T   L+      F  +  +    + 
Sbjct: 158 -TAGGLYSLRTPVPGGTR----PVDGLDQMMLNSRTGAVLRD-----FYKVQQVPVRPNW 207

Query: 543 RQQATIHCG-ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS 590
           RQ+  ++C   NP     +   N   Y F++  DPCE +N+A++   +S
Sbjct: 208 RQEVAVNCSYYNP---RNNFSDNVSLYYFDVKQDPCELDNLAATNVTVS 253


>gi|381160679|ref|ZP_09869911.1| arylsulfatase A family protein [Thiorhodovibrio sp. 970]
 gi|380878743|gb|EIC20835.1| arylsulfatase A family protein [Thiorhodovibrio sp. 970]
          Length = 609

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 150/527 (28%), Positives = 228/527 (43%), Gaps = 86/527 (16%)

Query: 54  HLEAPQETINQFQYITDPN-----RRTYAALTKSTT------LTLLIVYGWNDLSFHGSN 102
           HL+A +   N +  I  P      R   A L K T       + LL   GW D  F+G  
Sbjct: 72  HLKAEKPADNMYPVIAHPEQENAAREKLAELEKKTGKKPNVLIFLLDDVGWMDPGFNGGG 131

Query: 103 EI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGV 159
                 TPN+D LA +G++L + Y+ P C+PSRA++ TG+ P+H G+  PP++G EP G+
Sbjct: 132 VAVGNATPNMDELAQDGLLLTSAYSTPSCSPSRATIHTGQNPLHHGILRPPMYG-EPGGL 190

Query: 160 PLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY------ 213
                 LP+ L++ GY T+ +GKWH+G   +   P   G++ + G+L GV   Y      
Sbjct: 191 D-GAITLPKLLKKHGYVTQGVGKWHMG-ENKGSLPQNVGYDDYVGFL-GVSDMYTEWRDL 247

Query: 214 ----DHILSDQYSRTVELNGHDMRRNLSTAWDTVG---------EYATDLFT--KEAVQL 258
               +  LS    + ++ +G D      TA DT           +Y  DL T  K+    
Sbjct: 248 YFNPEVALSPARFKMMQEDGFDHHEVHCTADDTEQCKDGKLIDLDYIKDLDTHWKDVSLA 307

Query: 259 IED--QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
             D  +  D+P FLY A    H  N       P E + +    T      Y+  + ++DD
Sbjct: 308 FLDKMKDSDEPFFLYHATRGCHFDNY-----PPDEYLGKSAAHT-----VYSDCIVQMDD 357

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
             G ++  L+  G LEN++I+  SDNG P  E              P+RG K + WEGGV
Sbjct: 358 IFGALVDKLEETGQLENTLILLTSDNG-PECEIPPH-------GRTPFRGCKGSSWEGGV 409

Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
           +VP    W   I    R S  +   +D LPT+ +  G         + G D      L  
Sbjct: 410 RVPTFAYWKGMIPS--RRSDGLFDFADILPTVMSMTG---------VSGAD------LAG 452

Query: 436 PSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRTAAVRLDSWKLVL------GTQ 488
                + IDG+DQ   LL +   S R S    I++      +R+D +KLV       G  
Sbjct: 453 EFSDVTYIDGVDQLPFLLADDGLSSRRSRPYTINQ--YFGGIRIDEFKLVWTGEIENGVV 510

Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
           + G   G+ G   +     + FN     +   S+      +++P+  
Sbjct: 511 QKGDFGGFSGSVFTESGGAIMFNLYTNPQEDVSIGVRHIPMYVPLEG 557


>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
 gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
          Length = 478

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 117/345 (33%), Positives = 172/345 (49%), Gaps = 39/345 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL   G+++I TPNID +A  GI   + + A PVC+PSRA L+TG+ P   G+    
Sbjct: 58  GYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPSRAGLLTGRMPQRMGIN-TV 116

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            +     G+   E  + E L+  GY T  +GKWHLG   R + PL +GF  +FG     I
Sbjct: 117 FFPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLGHLER-FLPLNQGFYEYFG-----I 170

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y + + S  Y          MR N   A+     Y T  +T+E+++ I D   D+P FL
Sbjct: 171 PYSNDMASVVY----------MRGNEVEAYHVDQRYMTRTYTEESLKFI-DASGDQPFFL 219

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLAH   H          P     +F+  +  NR  Y  ++++LD SVG V+  L+ KG+
Sbjct: 220 YLAHNMPHV---------PIYASPEFEGTS--NRGLYGDVIQELDWSVGEVLRKLEEKGI 268

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQ 389
           LEN++I+F SDNG P +   +     + GS    R  K   +EGGV+VP + +W  +I+ 
Sbjct: 269 LENTLIVFSSDNG-PWLVMED-----HGGSAGHLREGKQFTFEGGVRVPTVAMWKGKIEP 322

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
             +V   +    DW PT     G +  +  + IDG D    L  N
Sbjct: 323 G-QVYEDLATQMDWFPTFSQLVGAEIPQDRI-IDGKDLGPVLFEN 365


>gi|260063387|ref|YP_003196467.1| arylsulfatase [Robiginitalea biformata HTCC2501]
 gi|88783481|gb|EAR14653.1| putative arylsulfatase [Robiginitalea biformata HTCC2501]
          Length = 561

 Score =  154 bits (389), Expect = 1e-34,   Method: Compositional matrix adjust.
 Identities = 124/375 (33%), Positives = 180/375 (48%), Gaps = 62/375 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G  +  TPNID LA  G++    Y+   VC PSRA+L+TG +  HT ++G  
Sbjct: 126 GYGDLGCYGQEKFNTPNIDRLAQEGLLFTQHYSGSTVCAPSRAALLTGMHTGHTVVRG-- 183

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
               +P G   +P +   L E  RE GY+T A GKW LG+   E  P  +GF++ FGY  
Sbjct: 184 NREVQPEGQHPIPDSTYTLAEMFRENGYATGAFGKWGLGYPGSEGDPNAQGFDTFFGYNC 243

Query: 206 -LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
              G   Y  H+  ++ S  +E N   ++          G YA +L  +E +  IE+   
Sbjct: 244 QRMGHHYYPHHLWYNRDSIVLEENAGKLK----------GRYAPELIHRETLNFIEEN-R 292

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-----------------QYITDP----- 302
           + P FLY+A +  HA      L AP+ET+ +F                 QY   P     
Sbjct: 293 ENPFFLYVASIIPHA-----ELAAPEETLQRFRGRYPPEKPYEGTDQGPQYRKGPYESQQ 347

Query: 303 -NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
             R  +AAM+  LD  VG ++  ++  G+ EN++I+F SDNG P +E     +Y  + SN
Sbjct: 348 KPRAAFAAMISILDRQVGEIMEKVRELGLEENTLIVFTSDNG-PHLEGGADPDY--FDSN 404

Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHIS---DWLPTLYTAAGGDTSR 417
            P RG K  L+EGG++VP +  W   I    R      HIS   D  PT     G     
Sbjct: 405 GPLRGYKRDLYEGGIRVPMLARWPGWIPPGTRTD----HISAFWDVFPTFAEILG---VA 457

Query: 418 LPLNIDGLDQWSSLL 432
            P+ +DG+    +LL
Sbjct: 458 PPVEMDGISFLPTLL 472


>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
 gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
 gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
          Length = 452

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  QI+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 481

 Score =  154 bits (389), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 148/526 (28%), Positives = 223/526 (42%), Gaps = 109/526 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL   G   I TPN+D +A+ G    N Y A PVCTPSRA L+TG+ PI +GM    
Sbjct: 44  GYGDLGVFGHPSIKTPNLDKMAFEGQKWTNFYVAAPVCTPSRAGLLTGRLPIRSGMSSDD 103

Query: 151 ---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--Y 205
              ++     G+P +E  + + L+  GY T AIGKWHLG  +  Y P   GF+S+FG  Y
Sbjct: 104 RRVLFPDSDGGLPQSEITIAKALKGNGYQTAAIGKWHLGH-KSPYLPTDHGFDSYFGIPY 162

Query: 206 LNGV--ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
            N +  I   DH       +    N   M+        T     T  +T+EAV  I+   
Sbjct: 163 SNDMDKIDKTDHFTLTDNEKFDAYNVPLMKDAEIAERPTDQRTLTKRYTEEAVSKIKAFK 222

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR---TYAAMVKKLDDSVGT 320
            D+P F+YLAH   H              I  F+     ++     Y  +++++D SVG 
Sbjct: 223 -DEPFFIYLAHNLPH--------------IPLFRSAAFKDQSLGGIYGDVIEEIDWSVGQ 267

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           ++S L+ +G+ EN++++F SDNG P   ++      + G+    RG K   +EGG++ P 
Sbjct: 268 ILSTLKEEGIAENTLVVFTSDNG-PWHVFK-----THGGTAGLLRGAKGGTFEGGMREPT 321

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
           + W P  Q  P V + M    D LPT    +G   + LP                    +
Sbjct: 322 VFWWPA-QIKPGVVMDMGTTMDLLPTFCAISG---TELP-------------------DD 358

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
              DG D    L     S R +V       +R  A+R   +K    TQ        YG  
Sbjct: 359 RIYDGYDISGLLKGTGKSERETVFYY--RGQRVYAIRKGDYKAHFITQLE------YGNP 410

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
            ++ V                                    TR +  +    N A +  +
Sbjct: 411 TAHPV------------------------------------TRPEVVVE---NKATVLET 431

Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
           P       L+N+  DP E+ NIA   P+I +++ E+L  H+ ++VP
Sbjct: 432 PL------LYNVNVDPGERFNIAEDHPEIIAEIREVLAAHQASIVP 471


>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
 gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
 gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
           CL09T03C24]
          Length = 452

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  QI+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|390944774|ref|YP_006408535.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
 gi|390418202|gb|AFL85780.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
          Length = 470

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 116/371 (31%), Positives = 180/371 (48%), Gaps = 46/371 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  D+  +G +   TPN+  LA  GI+  + YA   VC PSRASLMTG++     ++G  
Sbjct: 43  GVGDIEPYGQSIFETPNLSRLAQEGIMFKDFYAGSTVCAPSRASLMTGQHTGKIRIRGNG 102

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
            +  +P       + LPE L+E GY+    GKW LG    E +P  RG++   G+L+ V 
Sbjct: 103 EFPLDP-----DVKILPEMLKENGYTNLMFGKWGLGLQESESSPEKRGWDFFLGHLHHVS 157

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +++      Q   +++L    +   L       G YA D FT+ A+  IE    ++P F+
Sbjct: 158 AHF------QKPDSLDLF---LNNTLGRVKVPEGTYANDWFTEGAINYIETSSNEEPFFI 208

Query: 271 YLAHLAAHAGNAGKHLEAP----QETINQFQYITDPN----------------RRTYAAM 310
           YL++   HA      L  P    ++ +N+      PN                +  YAA+
Sbjct: 209 YLSYTIPHA-----ELLVPDRYFKDFVNENGQSVFPNEIAWPDGKHYGQQNYPKAAYAAL 263

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V  LDD VG V+ AL+ KG+ +N+I+IF SDNG      R   +   + S+  YRGVK  
Sbjct: 264 VHSLDDYVGQVMDALESKGIADNTILIFTSDNGTHIEGGRRMEDVDYFQSSGQYRGVKRD 323

Query: 371 LWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           L++GG+KVP I+ W  +I+QN   S+     S W   +Y   G     +    DG+    
Sbjct: 324 LYDGGIKVPLIIKWPDKIKQN---SVTEHRASFW--DVYNTFGDIVGNIDSTRDGISFLP 378

Query: 430 SLLLNTPSRRN 440
           +LL  +  +++
Sbjct: 379 TLLGKSKQKQH 389


>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
 gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
           HTCC2155]
          Length = 446

 Score =  154 bits (388), Expect = 2e-34,   Method: Compositional matrix adjust.
 Identities = 103/339 (30%), Positives = 170/339 (50%), Gaps = 19/339 (5%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASL 135
           AA   +  L      GW D+++HG  +  TP IDA+A  G+     YA   VC PSRA +
Sbjct: 16  AADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGI 75

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG+Y     + G    G   +G+P +++ + E L+  GY + A GKWHLG  + ++ P 
Sbjct: 76  LTGRY---QQLFGVVTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQF-PN 131

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
            RGF++ +G+  G   YY    +D+     +     +  N        G+Y T+  T  A
Sbjct: 132 DRGFDTFYGFHFGAHDYY---RADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHA 188

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
           V+ IE+   D+P F+Y+A+ + H+       + P E + +        RR + AMV  +D
Sbjct: 189 VEFIEENK-DQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMD 242

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE--TSNYRNWGSNYPYRGVKNTLWE 373
           D VG + + L+   + EN+I +F +DNG+P +  ++     YR   S   +RG K   +E
Sbjct: 243 DGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQ-GFRGYKGDTYE 301

Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
           GG++VP  + W  +I+   +    ++   D  PT  +AA
Sbjct: 302 GGIRVPFCMSWPKKIKSGNKFEAPVI-AYDLAPTFLSAA 339


>gi|443694992|gb|ELT96000.1| hypothetical protein CAPTEDRAFT_113428 [Capitella teleta]
          Length = 521

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 115/384 (29%), Positives = 187/384 (48%), Gaps = 55/384 (14%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
           + L++   + ++++   GW DL  +G     TPN+D +A  G++  + Y A  +C+PSRA
Sbjct: 21  SCLSRKPNIIVMLMDDMGWGDLGSYGHPARETPNLDRMAAEGMLFTDFYSAAGICSPSRA 80

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           S+MTG+ PI  G      +            G+  +E   PE L++ GY  K +GKWHLG
Sbjct: 81  SVMTGRLPIRNGFYSTNQFAKNSYLPQDVMGGISDSEVLFPELLKQNGYRNKIVGKWHLG 140

Query: 187 FFRREYTPLYRGFESHFG-------YLNGVIS----YYDHILSDQYSRTVELNGHDMRRN 235
             R ++ PL  GF+  FG       Y + V+     Y D ++  +Y    ++   D++  
Sbjct: 141 H-REQFLPLQHGFDEWFGAPDCHYKYDDKVMPNIPVYKDSLMQGRYYDHFKI---DIKTG 196

Query: 236 LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 295
           LS          T L+ +EA+  +E +  D+P FLY A  A H          P     +
Sbjct: 197 LSNL--------TQLYIQEALDFLEKE-RDRPFFLYWAPDATHT---------PLYASEK 238

Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
           F+  ++  R  Y   V +LD  VG +++ L+  G+  N+   F SDNGA T  + E S  
Sbjct: 239 FRGKSE--RGLYGDAVMELDSGVGKILNKLKELGIENNTFAFFSSDNGAATYAFTEVSII 296

Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
            + GSN P+   K T +EGG++ PA+ W P   +   +S Q+ ++ D   T+        
Sbjct: 297 ISGGSNGPFLCGKQTTFEGGMREPALAWWPTKIRPGSISKQVGNLMDIFSTVLK-----I 351

Query: 416 SRLP----LNIDGLDQWSSLLLNT 435
           + +P    L +DG+D  SS+L N+
Sbjct: 352 TSIPHPKGLYLDGVD-LSSVLFNS 374


>gi|313241546|emb|CBY33792.1| unnamed protein product [Oikopleura dioica]
          Length = 336

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 107/304 (35%), Positives = 152/304 (50%), Gaps = 23/304 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW+D+S++      TP +  L  + I L++ Y+   CTPSRASL+TGKY    G+   PI
Sbjct: 41  GWSDVSWNNKKIKATPFLGQLEKHSITLSSSYSTHRCTPSRASLLTGKYAWRFGLGTDPI 100

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                 G+ L E+ LPE LR+ GYST  +GKWHLG     Y P  RGF++ +G+  GV++
Sbjct: 101 DANTAAGLDLKEKLLPEILRKNGYSTHHVGKWHLGHCNSSYLPHNRGFDTFYGHTGGVLN 160

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
           Y+ H  +    + +     D   N +   +  G Y+T  F   A +L     +D P F+Y
Sbjct: 161 YFQHNRAVGNCKYL-----DYFENDTPIHEKTGVYSTFDFGDHARKLY--NKIDDPKFIY 213

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRR-TYAAMVKKLDDSVGTVISALQRK 328
           LA  A H       L AP+  I + +  Y   P  R TY AMVK +D  +  ++  +  K
Sbjct: 214 LALNAPHGP-----LMAPEHMIEEMRVLYPDSPRTRLTYLAMVKAIDLEMEKLLGTIWMK 268

Query: 329 GMLENSII--IFMSDNGAPTV-----EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
               ++II  +F SDNG P       +  + S YR    NYPY G K  L EGG   P +
Sbjct: 269 NEKRDTIIATVFQSDNGGPLYMNGRPKDNDPSRYRQ-ACNYPYLGEKTLLTEGGTLSPTL 327

Query: 382 LWSP 385
              P
Sbjct: 328 GTDP 331


>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
 gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
           bacterium Broad-1]
          Length = 463

 Score =  153 bits (387), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 111/337 (32%), Positives = 163/337 (48%), Gaps = 43/337 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DLS +GS +I TP++D LA +G+   + Y A  VC+P+R  L TG+YP  +  G+  
Sbjct: 60  GWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYTGRYPGRLAGGLPE 119

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P     +  G+PL    L   L+E GY+T   GKWH G+    Y+P   G++  FG  +G
Sbjct: 120 PISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCGYLPW-YSPTRVGWDEFFGNFSG 178

Query: 209 VISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
            + Y+  I      D Y   VE+   D+R            Y T++ T+ AV  IE +  
Sbjct: 179 GLDYFSKINHNGAHDLYEDEVEV--EDLR------------YYTEIVTERAVSFIE-RDH 223

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQ--------FQYITDPNR-RTYAAMVKKLD 315
           D P  L L     H    G   +A  + +            + TD     TY  MV+ LD
Sbjct: 224 DTPWLLNLNFTTPHWPWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSLATYKTMVEDLD 283

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            S+G VI AL+R G L ++++ F SDNG              + + +P+ G K+ L EGG
Sbjct: 284 QSIGQVIKALERTGQLRDTVVFFASDNGG-----------ERFSNVWPFTGAKSGLNEGG 332

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           ++VP IL  P I +  +VS + +  +DW  T    AG
Sbjct: 333 IRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAG 369


>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
          Length = 383

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 115/405 (28%), Positives = 183/405 (45%), Gaps = 69/405 (17%)

Query: 227 LNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
           + G+D+  N   AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      
Sbjct: 1   MCGYDLYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHSP----- 55

Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA- 344
           L+AP      ++ I + NRR YAAM+  LD+++  V  AL+  G  +NSI+I+ SDNG  
Sbjct: 56  LQAPGRYFEHYRSIININRRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQ 115

Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWL 404
           PT            GSN+P RG K T WEGG++    + SP ++    V  +++HI+DW 
Sbjct: 116 PTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWY 165

Query: 405 PTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL 464
           PTL + A G      + +DG D W ++   +   R+  +D L     +     +   +  
Sbjct: 166 PTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAG 221

Query: 465 INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQ 524
             I      +A+R+  WKL+ G        GY     S+ VP   F              
Sbjct: 222 YGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQAF-------------- 256

Query: 525 LSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIAS 584
                    SN+   R   ++ T+  G +              +LFN+  DP E+ +++S
Sbjct: 257 ---------SNLGPTRWHNERVTLSTGKS-------------VWLFNITADPYERVDLSS 294

Query: 585 SRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
             P I  QL   L    +T VP  +  P   +++P+     W PW
Sbjct: 295 RYPGIVKQLLRRLSQFNKTAVPVRYP-PKDPRSNPRLNGGVWGPW 338



 Score = 48.9 bits (115), Expect = 0.009,   Method: Compositional matrix adjust.
 Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)

Query: 1  MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
          +  N   AWD   G Y+T ++T+   Q++      KP+FLY+A+ A H+      L+AP 
Sbjct: 6  LYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----PLQAPG 60

Query: 60 ETINQFQYITDPNRRTYAAL 79
               ++ I + NRR YAA+
Sbjct: 61 RYFEHYRSIININRRRYAAM 80


>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 499

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 37/352 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ DL  +G   I TPNID +A  G+   + YA+  C+P+RASLMTG +    G+ G  +
Sbjct: 34  GYGDLGCYGHPIIKTPNIDKMADRGVRFTDFYARHKCSPARASLMTGAFNFRVGV-GSIV 92

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL---NG 208
           +     G+      +PE L+E GY+T  IGKWHLG     Y P  +GF+ +FG     +G
Sbjct: 93  YPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLGH-TAGYLPRDQGFDYYFGVPGTNHG 151

Query: 209 VISYYDHILSDQYSRTVELNGHD----------------MRRNLSTAWDTVGEYATDLFT 252
               +   +++ +  + E    D                M+ +    W T     T  +T
Sbjct: 152 DAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTILMKNDNVIEWPTDITQLTKRYT 211

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
            +AV+ I++   DKP FLY AH   H          P      F+  +D     Y  M++
Sbjct: 212 HDAVRYIKENK-DKPFFLYFAHGTPHH---------PYTVDAAFRGKSD--HGLYGDMIE 259

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           ++D SVG VI ALQ  G+ + +II F SDNGA +   +E   +   GSN P +G K +  
Sbjct: 260 EIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKE---HAEKGSNLPLKGWKGSSE 316

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           EGGV+VP +L  P      + + ++  + D  PT Y A  G    +P  IDG
Sbjct: 317 EGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPT-YAALAGIEPEVPQKIDG 367


>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
 gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
           HTCC2155]
          Length = 476

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 116/366 (31%), Positives = 179/366 (48%), Gaps = 43/366 (11%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
           ++I++    G+ DLS  G   + TP ID +A  G  L + Y A PVCTPSRA+LMTG YP
Sbjct: 34  IIIIFTDDQGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTGTYP 93

Query: 142 ----IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
               +  G     +   + +G+   E  + E L+ +GY T   GKWHLG  + E+ P  +
Sbjct: 94  KRIDMARGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLGD-QPEFLPTRQ 152

Query: 198 GFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           GF+  FG  Y + +  Y+      ++     L+G ++        D   +Y T  FT+ A
Sbjct: 153 GFDEFFGLPYSHDIHPYHPQQSHFKFPSLPLLDGEEV-----IEMDPDADYLTKRFTERA 207

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHA---GNAGKHLEAPQETINQFQ----YITDPNR-RTY 307
           VQ IE    D+P FLY+ H   H     +     + P+E   +      YI    R +  
Sbjct: 208 VQFIEKNK-DQPFFLYMPHPIPHTPLHASPAYMKDLPEEIKAKLSKENNYIDYKTRTQLL 266

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           +  + ++D SVG ++ AL++ G+ +N+I+IF SDNG P +           G   P RG 
Sbjct: 267 SRAIGEVDWSVGQILDALKKNGIDDNTIVIFSSDNG-PAI-----------GKATPLRGR 314

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGL 425
           KN   EGG++VP ++  P        + ++M   D LPT    AG +  + R+   IDG 
Sbjct: 315 KNQTLEGGMRVPTVIHWPGKIPAGIANDEIMTTMDLLPTFAKLAGAEIPSDRV---IDGK 371

Query: 426 DQWSSL 431
           D W+ +
Sbjct: 372 DIWAVI 377


>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
 gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
          Length = 452

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 123/362 (33%), Positives = 187/362 (51%), Gaps = 47/362 (12%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           AL  S  + L++V   G+ DLS  G+  +I TP+ID L   G+   N +A  PV +PSRA
Sbjct: 23  ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
           SL+TG+YP   G+ G      E     L+E    LP+ L++ GY    +GKW+LG     
Sbjct: 83  SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141

Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
            TP  RGF+ + G+L  ++        D Y        + MR NL    D  G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191

Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
           +  A++ + D +   +P FLYLA+ A H       ++ PQE + + +   +P+    R  
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
             A+++ LD +VG V  AL++ G LEN+IIIF SDNG                +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296

Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
            K  ++EGG++V   I W  +I+  P V    + +SD  PTL      D + +P++  ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNRIR--PAVRENFVMLSDMFPTLC-----DLTAVPVSHEID 349

Query: 424 GL 425
           G+
Sbjct: 350 GI 351


>gi|424046834|ref|ZP_17784396.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Vibrio cholerae HENC-03]
 gi|408884894|gb|EKM23622.1| type I phosphodiesterase / nucleotide pyrophosphatase family
           protein [Vibrio cholerae HENC-03]
          Length = 500

 Score =  153 bits (386), Expect = 3e-34,   Method: Compositional matrix adjust.
 Identities = 129/429 (30%), Positives = 197/429 (45%), Gaps = 66/429 (15%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG+ P+ TGM    + G  P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLNQNTPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--GVISYYD--------- 214
           LPE L+ELGY T   GK HLG  R E+ P   GF+ ++G+L     + Y +         
Sbjct: 113 LPELLKELGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDEA 171

Query: 215 ----------HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
                     H  +D +  ++++ +G      + T  D V  +A D F + AV+      
Sbjct: 172 FQKFAPRNVIHAKADGKGGQSIKDDGALTIERMRTLDDEVNRHAID-FIERAVK------ 224

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            +KP F +      H      HL    E +     +          ++K LDD VG +++
Sbjct: 225 AEKPFFTWYCPSRGHVWT---HLSPKYEAM-----LGTNGWGLQEVVMKDLDDHVGEMLA 276

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
            L+  G+ +N+I+IF +DNG   + + +       G   P+ G K T WEGGV+ P ++ 
Sbjct: 277 KLEELGVADNTIVIFTADNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPFLIS 329

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
           W  +I  N +VS  M    D LPTL  AAGG     P ++         +L       ++
Sbjct: 330 WPEKIPAN-KVSNGMFDGMDLLPTLVEAAGG-----PADL------KQQMLKGYKGHKAH 377

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
           +DG +Q   L  N  S RN +     E+ +  AVR+  WK     Q +    G+ G    
Sbjct: 378 LDGYNQMEMLTNNGKSNRNEIFYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEE 431

Query: 503 NKVPLLNFN 511
              PLL FN
Sbjct: 432 LNAPLL-FN 439


>gi|421614584|ref|ZP_16055638.1| arylsulfatase A [Rhodopirellula baltica SH28]
 gi|408494671|gb|EKJ99275.1| arylsulfatase A [Rhodopirellula baltica SH28]
          Length = 1012

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 148/513 (28%), Positives = 223/513 (43%), Gaps = 80/513 (15%)

Query: 68   ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
            ++  +R   A  TK   + +L    G+ DLS  G+  + TP ID +A  G  L + Y A 
Sbjct: 557  VSPADREKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 616

Query: 126  PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
            PVCTPSRA LMTG YP    M     +G     +P+G+   E  + E L+  GY T   G
Sbjct: 617  PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 676

Query: 182  KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
            KWHLG  + E+ P  +GF+  FG        Y H +   + R    +   +   + +   
Sbjct: 677  KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPPLPLLQNDTVI 729

Query: 239  AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
              D   ++ T   T++AV  IE    D+P FLYL H   HA          G A   + A
Sbjct: 730  EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 788

Query: 289  PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
             ++      Y T  N   +   + ++D SVG ++ AL+  G+ E ++++F SDNG P   
Sbjct: 789  IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 843

Query: 349  YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
              + + Y + G     RG K T +EGG++ P ++ W  QI      + ++M   D LPT 
Sbjct: 844  --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAE-HQNDELMTAMDLLPTF 897

Query: 408  YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
               AG                      TP+ R   IDG D W +L   T +  ++   + 
Sbjct: 898  AKLAGA--------------------ATPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 934

Query: 468  DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
                + AAVR   WKL +    NG     Y        KV ++  N  V  K    L++ 
Sbjct: 935  -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQHQLKEF 990

Query: 526  SQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
            + +I           ++R  A I   ANP P++
Sbjct: 991  AADI---------ASNSRPAAFI---ANPKPLS 1011



 Score =  108 bits (271), Expect = 7e-21,   Method: Compositional matrix adjust.
 Identities = 140/574 (24%), Positives = 223/574 (38%), Gaps = 127/574 (22%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
           T  A  +   + L+ V   G+ DLS +G+ ++ TPNID LA  G    + + A  VCTPS
Sbjct: 32  TSVAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDRLAAKGRRFTDAHSASAVCTPS 91

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
           R  L+TG+YP+   M G  IWG  P    L      + + +  +  GY+T  +GKWHLGF
Sbjct: 92  RYGLLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKWHLGF 150

Query: 188 FRREYT--------PLYRGFESHFGY-LNGVISYYDHILSDQYSRTVELNGHDMRRNLST 238
                         P   GF+ +FG  L    S Y ++  D       + G+D    L  
Sbjct: 151 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDS------IFGYDPNDPLVY 204

Query: 239 AWDTVG-----------------------------EYATDLFTKEAVQLIEDQPVDKPLF 269
               V                              E    L T+ AV+ I ++  D+P F
Sbjct: 205 GGKPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFF 263

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
           LY A    H          P     +F+  +      Y   V +LD  VG ++ +L+  G
Sbjct: 264 LYFATPNIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNG 312

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQ 388
           + +N++++F SDNGA        +       N    G K  +WEGG +VP I  W  +I+
Sbjct: 313 LTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK 372

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLD 447
                                 AG  + +L   +D    +S+L     PS      D ++
Sbjct: 373 ----------------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSIN 408

Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
              +LL +      + L+    + R  A+R   W L +G + +G  +G   Q  +   P 
Sbjct: 409 MLPALLDDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNGSKPQHHAWGGP- 466

Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
               A V+   +++            S+I   R  +           AP         P 
Sbjct: 467 ----AAVQFSGHKN------------SDIVNGRIKKN----------AP---------PA 491

Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
            L++L ND  +  N+    P++  ++  LL+ +R
Sbjct: 492 QLYDLENDRSQTTNVYREHPEVVEEMKALLESYR 525


>gi|410631840|ref|ZP_11342513.1| arylsulfatase B [Glaciecola arctica BSs20135]
 gi|410148741|dbj|GAC19380.1| arylsulfatase B [Glaciecola arctica BSs20135]
          Length = 544

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 130/392 (33%), Positives = 186/392 (47%), Gaps = 69/392 (17%)

Query: 82  STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKY 140
           +  L L    G ND+S  G   I TPNID LA  G +  N Y+    C PSRA+L+TG+ 
Sbjct: 60  NVVLILTDDMGINDVSTFGGGMINTPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGRD 119

Query: 141 PIHTGMQGPPI----------------WGAEP----------------RGVPLTERFLPE 168
              TG    PI                 G  P                RG+P  E  + E
Sbjct: 120 ASRTGFDVTPIPDGFGRMIAMIENKNSKGRPPVEYFPEIDKNNPTFNNRGLPGEEITIAE 179

Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
            +++ GY T  IGKWHLG    E  P  +GF+      +G+    DH   +  +  VE +
Sbjct: 180 LMQDAGYRTLHIGKWHLGR-TDEMMPNAQGFDESLMMASGLYLPVDH--PEAVNAQVEFS 236

Query: 229 GHD----MRRNLSTAWDTVGE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
           G D       + +  W+  GE      Y TD FT+EA + IE    ++P FLYLAH   H
Sbjct: 237 GLDRFIWATMDYAVTWNG-GERFAPRGYLTDYFTEEAERAIEAN-ANRPFFLYLAHWGPH 294

Query: 279 AGNAGKHLEAPQETINQFQYITDPNR-RTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
             N  +  ++  E +   Q    P+R R YAAM++ LD SV  V++ L+ +G+ +N+I+I
Sbjct: 295 --NPVQAKKSDYEAVGDIQ----PHRKRVYAAMLRSLDRSVERVMAKLEEQGIADNTIVI 348

Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQ 396
             SDNG    +Y    N      N PYRG K+T +EGG++VP +I W  +I+    +   
Sbjct: 349 LSSDNGG--ADYVAIDNL-----NKPYRGWKSTFFEGGIRVPFSITWPGEIEAGLIIDEP 401

Query: 397 MMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
           + H  D +PT+  A  G   +LP +  +DG D
Sbjct: 402 VNHF-DLMPTIVNAGQG---KLPQDRILDGRD 429



 Score = 39.7 bits (91), Expect = 4.8,   Method: Compositional matrix adjust.
 Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 8/69 (11%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR- 73
           Y TD FT+EA + IE    ++P FLYLAH   H  N  +  ++  E +   Q    P+R 
Sbjct: 263 YLTDYFTEEAERAIEAN-ANRPFFLYLAHWGPH--NPVQAKKSDYEAVGDIQ----PHRK 315

Query: 74  RTYAALTKS 82
           R YAA+ +S
Sbjct: 316 RVYAAMLRS 324


>gi|442317872|ref|YP_007357893.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
 gi|441485514|gb|AGC42209.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
          Length = 559

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 122/450 (27%), Positives = 203/450 (45%), Gaps = 64/450 (14%)

Query: 87  LLIVYG-----WNDLSF-HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
           +L+++G     WN  ++  G     TPNID +A  G ++ + Y Q  CT  RA+ +TG  
Sbjct: 30  ILVIWGDDIGLWNISAYNQGMMGYRTPNIDRIAKQGALMTDCYGQQSCTAGRAAFITGMN 89

Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
           P+ TG+    + GA+  G+  ++  + E L+ LGY+    GK HLG   R Y P   GF+
Sbjct: 90  PLRTGLTTIGMPGAD-YGLQDSDPTIAEMLKPLGYTCGQFGKNHLGDSNR-YLPTVHGFD 147

Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-------------VGEYA 247
              G L  + +  +    D Y +      H   R +  +W T             VG+  
Sbjct: 148 EFHGNLYHLNAENEPECPD-YPKDPAFKDHFGPRGVLHSWATDRDDPTEEHRWGRVGKQK 206

Query: 248 ---TDLFTKEAVQLIEDQPV-------------DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
              T   TK+ ++ ++++ +             DKP F++      H             
Sbjct: 207 VVDTGPLTKKRMETVDEEFLASSLAFMEQAVKSDKPFFIWHNTTRTHV-----------W 255

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
           T  Q +Y     +  YA  + +LDD VGT+++ L   G+ +N+I++F +DNG   + + +
Sbjct: 256 TFLQEKYRNKTGKGLYADAMTELDDIVGTLLAKLDELGIADNTIVVFSTDNGVEKMSWPD 315

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                  G N P+RG K + WEGGV+VP ++  P + +   V   +    DW+PTL  AA
Sbjct: 316 -------GGNAPFRGEKGSTWEGGVRVPCVVRWPGVIEPGTVINDLFAHEDWMPTLVAAA 368

Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK 471
           GG T       D +++       +  +   ++DG DQ   L    P +R+  +  +D   
Sbjct: 369 GGPT-------DLVEKCKQGYSASGKKFRVHLDGYDQRGLLSGKEPGQRHEFVYVLDSGN 421

Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
             AAVR D WK++   Q+    D ++   R
Sbjct: 422 -IAAVRYDDWKVIFAYQDGHGPDMWFSGKR 450


>gi|404486643|ref|ZP_11021833.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
           YIT 11860]
 gi|404336461|gb|EJZ62922.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
           YIT 11860]
          Length = 478

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 131/443 (29%), Positives = 195/443 (44%), Gaps = 75/443 (16%)

Query: 77  AALTKSTTLTLLIVY------GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCT 129
           +  TKS    + ++Y      G+ DL  +G   I TPNID LA  G++    Y+   V  
Sbjct: 18  SCTTKSKKQKMNVIYILADDLGYGDLGCYGQKIIKTPNIDRLAEEGMLFTQHYSGCTVSA 77

Query: 130 PSRASLMTGKYPIHT---GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           PSR+ LMTG++  HT   G Q     G EP  +P     L   +++ GY+T A GKW LG
Sbjct: 78  PSRSCLMTGQHTGHTYIRGNQSHGTEGEEP--LPGNTYTLARMMKDAGYATGAFGKWGLG 135

Query: 187 FFRREYTPLYRGFESHFGY--LNGVISYYD-HILSDQYSRTVELNGHDMRRNLSTAWDTV 243
           +   E  P   GF+  FGY        YY  H+  +Q    V L G++  +  +      
Sbjct: 136 YPCSEGDPTNLGFDEFFGYNCQRQAHHYYPYHLWHNQ--EKVMLPGNEGSKTET------ 187

Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG----------------NAGKHLE 287
             YA DL  ++A+Q I D    KP FLYL ++  HA                   GK  E
Sbjct: 188 --YAQDLIQEKALQFIVDNQ-SKPFFLYLPYILPHAELVSPEDSILAMYKGKIEEGKPYE 244

Query: 288 APQETINQF----QYITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
              +  N +     Y +  N    +A+MV + D  VG ++  L+R G+ +N+I+ F SDN
Sbjct: 245 GVDDIKNPYYKYGGYCSSENPHADFASMVTRFDAYVGEIMQTLKRLGLDKNTIVFFASDN 304

Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
           G P  E     ++ N  S  P+RG+K  ++EGG++VP I W+P   +    S  +    D
Sbjct: 305 G-PHQEGGADPDFFN--SYGPFRGIKRDMYEGGIRVPLIAWAPGKVEAGTCSDHISAFWD 361

Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
            +PTL   AGG+                             DG+    +L      +++ 
Sbjct: 362 VMPTLADIAGGEI------------------------KGECDGISFAPTLFGGKGQKQHE 397

Query: 463 VLI-NIDEKKRTAAVRLDSWKLV 484
            L     E+    AVR+ +WK V
Sbjct: 398 YLYWEFHERGGRVAVRMGNWKGV 420


>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
 gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH 1]
          Length = 485

 Score =  152 bits (385), Expect = 4e-34,   Method: Compositional matrix adjust.
 Identities = 127/401 (31%), Positives = 177/401 (44%), Gaps = 68/401 (16%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           LLIV    G+NDL   G N I TP +D LA  G  L N Y A P CTPSRASL+TG+YP 
Sbjct: 33  LLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91

Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
             G+        P +G                G+   E  LP  LR  GY +   GKW L
Sbjct: 92  RNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGKWDL 151

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G  +R   P  RGF+  +G++N  I Y+ H   ++Y          M RNL       G 
Sbjct: 152 GALQR-MLPTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGT 201

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH--------------AGNAGKHLEAPQE 291
           Y T LF +EA++ +++   ++P FLY+   A H              A +  K +  P E
Sbjct: 202 YCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKAMYPPVE 261

Query: 292 T----INQFQY-----ITDPN--RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
                 ++++Y     +  P   RR Y A V  +D ++G ++  L+ K ML+ +I++F S
Sbjct: 262 VETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDETIVVFFS 321

Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHI 400
           DNG                 N P RG K   WEGG++VP ++  P  Q    V       
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFLT 370

Query: 401 SDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           S  L   + AA G      + +DG D W +L     S R  
Sbjct: 371 SLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPRKE 411


>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
 gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
          Length = 221

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 84/201 (41%), Positives = 123/201 (61%), Gaps = 8/201 (3%)

Query: 77  AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           AA TK   L ++I   G++DL F G ++I TPNIDAL   G++   MY Q VC+PSRAS+
Sbjct: 25  AAGTKPHVLFIVIDDLGYDDLYFRG-HQIRTPNIDALQEEGLLFTQMYMQDVCSPSRASI 83

Query: 136 MTGKYPIHTGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           ++G+Y +H G+    PP    +  G+ L +  L + +RE GY T+A+GKWH+GF++  YT
Sbjct: 84  LSGRYAMHHGVTDWIPP---RDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWAYT 140

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           P +RGF S  GY +G   Y+ H   + Y    +   H        AWD  G+Y+T +F++
Sbjct: 141 PTFRGFNSFLGYYSGGEDYFTHETDNAYDMHRDEGRHCGPNCSIPAWDLKGQYSTTIFSE 200

Query: 254 EAVQLI-EDQPVDKPLFLYLA 273
           EA+++I + Q  D PLFLYLA
Sbjct: 201 EAIRIINQRQAADPPLFLYLA 221



 Score = 43.1 bits (100), Expect = 0.51,   Method: Compositional matrix adjust.
 Identities = 19/36 (52%), Positives = 28/36 (77%), Gaps = 1/36 (2%)

Query: 8   AWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLA 42
           AWD  G+Y+T +F++EA+++I + Q  D PLFLYLA
Sbjct: 186 AWDLKGQYSTTIFSEEAIRIINQRQAADPPLFLYLA 221


>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
 gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
          Length = 482

 Score =  152 bits (384), Expect = 5e-34,   Method: Compositional matrix adjust.
 Identities = 125/424 (29%), Positives = 198/424 (46%), Gaps = 64/424 (15%)

Query: 78  ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLM 136
           A+  +    L    G+ DL  +G   I TPN+DA+A  G      YA   VC PSR+S +
Sbjct: 32  AVKPNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFL 91

Query: 137 TGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           TG++  HT ++G    G +P G   +P +   + E L++ GY+T   GKW LG       
Sbjct: 92  TGQHTGHTPIRGNK--GVQPEGQWPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGD 149

Query: 194 PLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATD 249
           P+ +GF++ +GY    ++   Y DH+  +           D R + +    +   +Y+ D
Sbjct: 150 PIKQGFDAFYGYNCQTLAHDYYPDHLWEN-----------DTRIDFAANTPEHPTDYSAD 198

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYITDPN----- 303
           L  ++A+Q I DQ    P F++L++   HA      L+ P +++  +++ + +       
Sbjct: 199 LIHQKALQFI-DQHKGNPFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVP 252

Query: 304 -------------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
                        R  YAAMV +LD  VG V+  L+  G+ +N+++IF SDNG P  E  
Sbjct: 253 KWNGKNYAPQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNG-PHKEGG 311

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
              +Y N  SN P RG+K  L+EGG++ P I W P   +    S  +    D+LPT    
Sbjct: 312 YDPDYFN--SNGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAEL 369

Query: 411 AGGDTSRLPLNIDGLDQWSSLL--LNTPS----------RRNSNIDGLDQWSSLLLNTPS 458
           A    +  P NIDG+    +LL   N PS          +       + +W  + LN  S
Sbjct: 370 A---KALKPANIDGISIVPALLGQKNAPSHPWLYWEFHEQGGKQAVRMGKWKGIKLNVTS 426

Query: 459 RRNS 462
           + N 
Sbjct: 427 QPNG 430


>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
          Length = 2048

 Score =  152 bits (384), Expect = 6e-34,   Method: Composition-based stats.
 Identities = 119/391 (30%), Positives = 191/391 (48%), Gaps = 54/391 (13%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
           +A T+   + L++    G+ND+ +  ++    TP +D+LA  G+ L + Y  P CTP+R+
Sbjct: 191 SATTQQQNILLILADDMGYNDVGYQSTDLGAATPTLDSLASRGVELASYYTLPSCTPARS 250

Query: 134 SLMTGKYPIHTGM--QGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SL T  Y  + GM   GP  +  +   G+PL    + E L+  GY+T             
Sbjct: 251 SLFTANYAANNGMGYDGPGTYVIDSSYGLPLGATLMSEVLKGRGYAT------------- 297

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
            +   Y   E++    N   SY D +      R+  ++G      +      + EY+TDL
Sbjct: 298 -HIETYFSHEAYGTDPNNGTSYCDLM------RSNSVDG------VKVGECYLDEYSTDL 344

Query: 251 FTKEAVQLIEDQPV--DKPLFLYLAHLAAHA---GNAGKHLEAPQET-INQFQYITDP-- 302
           + +  V ++ +     D PLFLY A  A HA        +  APQE  +       D   
Sbjct: 345 YARRLVDVVGEHATTSDSPLFLYFAAQAPHAPLEEPPPMNATAPQEAMLAALVGAGDSWG 404

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            RRT+A MV  LD++V  V+ AL    +L+N+++ F SDNG            R+ GSN+
Sbjct: 405 KRRTFAKMVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNGGCP---------RDGGSNW 455

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPL 420
           P RG K + +EGGV+VPA LWS  +Q+    ++   + H++DW PTL  +AG        
Sbjct: 456 PLRGTKFSQYEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVASAG--VRATVG 513

Query: 421 NIDGLDQWSSLLLNTPSR-RNSNIDGLDQWS 450
            IDG++Q+ +L   T +  R+  +  L++W+
Sbjct: 514 KIDGVNQYDALFDATQNAPRDEVLIHLNKWT 544


>gi|149190919|ref|ZP_01869182.1| sulfatase, putative [Vibrio shilonii AK1]
 gi|148835274|gb|EDL52248.1| sulfatase, putative [Vibrio shilonii AK1]
          Length = 500

 Score =  152 bits (383), Expect = 6e-34,   Method: Compositional matrix adjust.
 Identities = 126/421 (29%), Positives = 192/421 (45%), Gaps = 50/421 (11%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG++P+ TGM    + G+ P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFITGQFPVRTGMHTVGLPGS-PVGLNAETPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--GVISY-------YDHI 216
           LPE L+ LGY+T   GK HLG  R E  P   GF+ ++G+L     + Y        D  
Sbjct: 113 LPEILKTLGYTTGQFGKNHLG-DRDEVLPTMNGFDEYWGWLYHLNAMEYTEDPDWPKDEA 171

Query: 217 LSDQYSRTVELNGHDMRRNLSTAWD---TVGEYAT--DLFTKEAVQLIEDQ-PVDKPLFL 270
                 R V  +  D +   +   D   T+    T  D   K A+  IE+    DKP F 
Sbjct: 172 FQKFAPRNVIHSKSDGKGGQTIKDDGPLTIERMRTLDDEVNKHAINFIENAVKQDKPFFT 231

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +        G+   HL    E +     +          ++K LDD VG +++ L+  G+
Sbjct: 232 W---YCPSRGHVWTHLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMLAKLEELGV 283

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +N+I+IF +DNG   + + +       G   P+ G K + WEGGV+ P ++  P     
Sbjct: 284 ADNTIVIFTADNGPEIMTWPD-------GGMTPFHGEKGSTWEGGVRAPLLVSWPGKIPA 336

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
            ++   M    DWLPTL  AAGG +      + G + +            +++DG +Q S
Sbjct: 337 GKIHNGMFDGMDWLPTLVEAAGGPSDLKQQMLTGYEGFK-----------AHLDGYNQIS 385

Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
            L    PS R  ++    E+ R  AVR+  WK     Q +    G+ G       PLL F
Sbjct: 386 MLTELAPSNRKEIIYY--ERDRLQAVRVGDWKAHFVVQNH----GWSGPKEELNAPLL-F 438

Query: 511 N 511
           N
Sbjct: 439 N 439


>gi|86147009|ref|ZP_01065327.1| sulfatase family protein [Vibrio sp. MED222]
 gi|85835259|gb|EAQ53399.1| sulfatase family protein [Vibrio sp. MED222]
          Length = 500

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 127/428 (29%), Positives = 193/428 (45%), Gaps = 64/428 (14%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG+ P+ TGM    + G  P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHSVGLPGG-PVGLSADTPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL----------------NGV 209
           LPE L+ +GY T   GK HLG  R E+ P   GF+ ++G+L                +G 
Sbjct: 113 LPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDGS 171

Query: 210 ISYYD-----HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           +  +      +  SD +  +T+E +G      + T  D V ++A + F + AV+      
Sbjct: 172 LDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDEVNKHAIN-FIERAVE------ 224

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            DKP F +      H      HL    E +     +          ++K LDD VG +++
Sbjct: 225 ADKPFFTWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMMA 276

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            ++  G+ +N+IIIF +DNG   + + +       G   PY G K T WEGGV+ PA++ 
Sbjct: 277 KMEELGIADNTIIIFTADNGPEIMTWPD-------GGMTPYHGEKGTTWEGGVRAPALVS 329

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
            P       V   +    DWLPTL  AAGG T      + G D +            +++
Sbjct: 330 WPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKLLKGHDGF-----------KAHL 378

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
           DG +Q   L     S R  +     E+ +  AVR+  WK     Q +    G+ G     
Sbjct: 379 DGYNQVDMLTEKGESNRKEIYYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEEL 432

Query: 504 KVPLLNFN 511
             PLL FN
Sbjct: 433 NAPLL-FN 439


>gi|424864690|ref|ZP_18288593.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
 gi|400759436|gb|EJP73618.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
          Length = 549

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 117/379 (30%), Positives = 182/379 (48%), Gaps = 63/379 (16%)

Query: 92  GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G+ND+S H        + T NIDALA +GI+    YA    C PSRAS+MTGKY    G 
Sbjct: 76  GYNDISIHNGGAADGTLKTKNIDALADSGILFTRGYAANATCAPSRASIMTGKYSTSFGY 135

Query: 147 QGPPI---------WGAEP----------------------RGVPLTERFLPEYLRELGY 175
           +  PI         W AE                       +G+P  +  + E LR+ GY
Sbjct: 136 EFTPIPALGRTVLGWLAEEDDSELKQRIDRDVVSNMPAFMDQGMPTEQITIAEILRDAGY 195

Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVE-----LN 228
            T  IGKWHLG   +   PL +GF+   G    +    DH  +++ ++  +++     + 
Sbjct: 196 YTAHIGKWHLGH-GQGMDPLSQGFQDSLGLDGPLYLPEDHPDVVNAKFDSSIDKMIWSIG 254

Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
            +    N  + +    +Y TD +T EA+++IE+   ++P FLYL+H A H       L+A
Sbjct: 255 QYSATFNGGSVF-APDKYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQA 307

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
            +    Q  ++   N + Y+ M+  LD SVG ++  L+   +   ++IIF SDNG     
Sbjct: 308 LRTDYEQMSHMHGHNLQVYSGMINALDRSVGKIVEKLKDLNIYGKTLIIFTSDNGG--AN 365

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
           Y E  +      N PYRG K + +EGG++VP I+ W  +I    +  + + H  D  PT+
Sbjct: 366 YIELDDI-----NKPYRGWKISFFEGGIRVPYIVSWPSEISPGKKSDIAVHHF-DIFPTI 419

Query: 408 YTAAGGDTSRLPLNIDGLD 426
             AAG +++     +DG D
Sbjct: 420 AKAAGIESAHA---LDGRD 435



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 7/92 (7%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y TD +T EA+++IE+   ++P FLYL+H A H       L+A +    Q  ++   N 
Sbjct: 270 KYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRTDYEQMSHMHGHNL 323

Query: 74  RTYAALTKSTTLTL-LIVYGWNDLSFHGSNEI 104
           + Y+ +  +   ++  IV    DL+ +G   I
Sbjct: 324 QVYSGMINALDRSVGKIVEKLKDLNIYGKTLI 355


>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
 gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
           3_8_47FAA]
          Length = 465

 Score =  152 bits (383), Expect = 7e-34,   Method: Compositional matrix adjust.
 Identities = 115/350 (32%), Positives = 163/350 (46%), Gaps = 38/350 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  DL  +G  +I TPNID LA  G++  N Y+   V  PSR SL+TGK+  H  ++G  
Sbjct: 40  GIGDLGCYGQKKIKTPNIDKLAEKGVLFTNHYSGSTVSAPSRCSLLTGKHTGHAYIRGNK 99

Query: 151 IWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
              +E       +P  E  + E L+  GY+T  +GKW LG      +P+ +GF+  FGYL
Sbjct: 100 GIKSEEGFFDLHLPSDEVTVAEILKRKGYATMCVGKWGLGGPHTTGSPVRKGFDYFFGYL 159

Query: 207 N--GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +      YY   L +   + V LN       L      + E   D   K A         
Sbjct: 160 SQGAAHRYYPEYLYENEDK-VMLNKKVYSHFL------IMEKGLDFIRKNA--------- 203

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR---------RTYAAMVKKLD 315
             P F Y A    HA      L   ++   +  YI +  +           YA+MV ++D
Sbjct: 204 GHPFFAYFAITPPHADLDYPDLSQYEDAFPETPYINNKQKGFKTQMKPKAAYASMVSEID 263

Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
            +VG +I  L+ KG+LEN+IIIF SDNG   V   E   +    SN PYRG K  L+EGG
Sbjct: 264 KNVGQIIQLLKEKGILENTIIIFSSDNGVHCVGGHEPDFFD---SNGPYRGYKRDLYEGG 320

Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           V+ P I+  P++ +  R    +    D+LPT+    G D  +    IDG+
Sbjct: 321 VRAPFIVSWPKMIKKKRTVEHITTFWDFLPTVTELVGADMVK---GIDGI 367


>gi|407071736|ref|ZP_11102574.1| arylsulfatase [Vibrio cyclitrophicus ZF14]
          Length = 500

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 119/402 (29%), Positives = 178/402 (44%), Gaps = 47/402 (11%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG+ P+ TGM    + G  P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLSADTPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
           LPE L+ +GY T   GK HLG  R EY P   GF+ ++G+L   ++  ++     + +  
Sbjct: 113 LPEVLKTMGYVTGQFGKNHLG-DRDEYLPTMHGFDEYWGWLY-HLNAMEYTEDPDWPKDG 170

Query: 226 ELNGHDMRRNLSTAWDTVGEYAT---------------DLFTKEAVQLIEDQ-PVDKPLF 269
            L+    R  + +  D  G  +                D   K A+  IE     DKP F
Sbjct: 171 SLDAFAPRNVIYSKSDGKGGQSIEDDGALNIERMRTLDDEVNKHAINFIERAVEADKPFF 230

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
            +      H      HL    E +     +          ++K LDD VG +++ +   G
Sbjct: 231 TWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMLAKIDELG 282

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +N+IIIF SDNG   + + +       G   P+ G K T WEGGV+ PA++  P    
Sbjct: 283 IADNTIIIFTSDNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPALVSWPGKIP 335

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V   +    DWLPTL  AAGG T      + G D +            +++DG +Q 
Sbjct: 336 AGSVGNGIFDGMDWLPTLVAAAGGPTDLKEKMLKGHDGF-----------KAHLDGYNQV 384

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
             L     S RN +     E+ +  AVR+  WK     Q  G
Sbjct: 385 DMLTEKGESNRNEIYYY--ERDKLQAVRVGDWKAHFIVQNRG 424


>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
 gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
          Length = 474

 Score =  151 bits (382), Expect = 8e-34,   Method: Compositional matrix adjust.
 Identities = 124/384 (32%), Positives = 188/384 (48%), Gaps = 43/384 (11%)

Query: 75  TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
           + A L  +    ++++Y    G+ DL   G   I TP +D +A  G+     Y A PVCT
Sbjct: 27  SVADLNAAPGPNVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYSAAPVCT 86

Query: 130 PSRASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           PSRA+L+TG+YPI +GM       ++     G+P +E  L E L+  GY T  +GKWHLG
Sbjct: 87  PSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKAAGYKTACVGKWHLG 146

Query: 187 FFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-V 243
               ++ P   GF+S+FG  Y N +    D +   ++ R++ L       N+    +T V
Sbjct: 147 HL-PQFLPTNNGFDSYFGIPYSNDM----DRVADRKHGRSIFLKPEVKFWNVPLMRNTEV 201

Query: 244 GEYATD------LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
            E   D       +T+EA++LI+ Q   +P F+YLAH   H          P     +FQ
Sbjct: 202 VELPADQTTITKRYTEEAIKLIQ-QNKQQPFFIYLAHNMPH---------VPLFRSPEFQ 251

Query: 298 YITDPNRR-TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
              D +RR  Y  +++++D SVG V+  L+ +G+ +N+I+ F SDNG P + + E     
Sbjct: 252 ---DKSRRGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNG-PWLIFNEQG--- 304

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
             GS    R  K + WEGG++ P I W P    +  VS ++    D   T    AGG   
Sbjct: 305 --GSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSIKLAGGSVP 362

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRN 440
              + +DG+D    LL    S R 
Sbjct: 363 TDQV-VDGVDLTPVLLKKENSPRE 385


>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
 gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SWK14]
          Length = 485

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 130/455 (28%), Positives = 193/455 (42%), Gaps = 94/455 (20%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
           LLIV    G+NDL   G N I TP +D LA  G  L N Y A P CTPSRASL+TG+YP 
Sbjct: 33  LLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91

Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
             G+        P +G                G+   E  LP  LR  GY +   GKW L
Sbjct: 92  RNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGKWDL 151

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G  +R   P  RGF+  +G++N  I Y+ H   ++Y          M RNL       G 
Sbjct: 152 GALQR-MLPTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGT 201

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDP-- 302
           Y T LF +EA++ +++   ++P FLY+   A H + +    + +  +  +QF+ +  P  
Sbjct: 202 YCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVE 261

Query: 303 ----------------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
                                  RR Y A V  +D ++G ++  L+ K ML+N+I++F S
Sbjct: 262 VETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNTIVVFFS 321

Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
           DNG                 N P RG K   WEGG++VP ++ W         V+ + + 
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFLT 370

Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
             + LP+   AAG                    +  P      +DG + W +L     S 
Sbjct: 371 SLELLPSFAIAAG--------------------VEPPP--GVVLDGFNWWPTLRGEAESP 408

Query: 460 RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
           R  +     +++     R+  WK V   ++ G  D
Sbjct: 409 RKEMFW---KRRDQIGARVGQWKWVQMGEQGGLFD 440


>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
           [Oryzias latipes]
          Length = 525

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 52/370 (14%)

Query: 80  TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           T S  + ++++   GW DL   G     TPN+DA+A  GI+  + Y A P+C+PSRA+L+
Sbjct: 28  TSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALL 87

Query: 137 TGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           TG+ PI  G     G       P+    G+   E  LP+ L+E GY  K +GKWHLG  R
Sbjct: 88  TGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLGH-R 146

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL---------SDQYSRTVELNGHDMRRNLSTAW 240
            +Y PL  GF+  FG  N     Y++ +         S+   R  E    D +   S   
Sbjct: 147 PQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGESNLT 206

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
               E   D  +++A          +P FLY A  A H+                  ++ 
Sbjct: 207 QMYLEAGLDFISRQA-------EAKRPFFLYWAADATHS-----------PVYASKPFLG 248

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
              R  Y   V +LD SVG +++ LQR G+  N+ + F SDNGA  +     S     GS
Sbjct: 249 KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALM-----SGPNEGGS 303

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG----GDTS 416
           N P+   K T +EGG++ PAI W P   +   VS Q  ++ D   T    AG    GD  
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRV 363

Query: 417 RLPLNIDGLD 426
                +DGLD
Sbjct: 364 -----LDGLD 368


>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
          Length = 493

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 108/363 (29%), Positives = 166/363 (45%), Gaps = 42/363 (11%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
           +A +       ++V  +G+ D+SF     I +PN + LA  G+IL+  Y    C+PSRAS
Sbjct: 16  SAFSTKPNFVFVLVDDWGYADVSFRNP-AIHSPNFEKLAKEGLILDRHYVFKYCSPSRAS 74

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
            +TG++P H     PP       G  L    +P  L+   Y T  IGKWH G ++  Y P
Sbjct: 75  FLTGRWPHHAHQWNPP--EDALVGANLKMTMIPAKLKLARYKTHMIGKWHEGLYKEAYLP 132

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATDLF 251
           + RGF++  G+L G  ++    ++ Q     +    +G D R          G Y    +
Sbjct: 133 INRGFDTMSGFLGGGENH----MNQQVGCATDFWKNDGPDSRN---------GSYDAYTY 179

Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAM 310
             +   +I +     P FLYL     H       L+AP E +N +    T  NRR Y AM
Sbjct: 180 RDDLTDIITNHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAM 234

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           V   D+  G ++  LQ  GM +N++++  +DNG               GSNYP +G K T
Sbjct: 235 VSVADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG----------GSNYPLKGSKGT 284

Query: 371 LWEGGVKVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGL 425
            +EGGV+  A +    +  N   + +   +HI+DW  T    AG    D+      +DG+
Sbjct: 285 FYEGGVRSLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLAGIDHDDSGSGKFPVDGM 344

Query: 426 DQW 428
           + W
Sbjct: 345 NVW 347


>gi|84386700|ref|ZP_00989726.1| sulfatase family protein [Vibrio splendidus 12B01]
 gi|84378506|gb|EAP95363.1| sulfatase family protein [Vibrio splendidus 12B01]
          Length = 500

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 126/428 (29%), Positives = 193/428 (45%), Gaps = 64/428 (14%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG+ P+ TGM    + G  P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHSVGLPGG-PVGLSADTPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL----------------NGV 209
           LPE L+ +GY T   GK HLG  R E+ P   GF+ ++G+L                +G 
Sbjct: 113 LPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDGS 171

Query: 210 ISYYD-----HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           +  +      +  SD +  +T+E +G      + T  D V ++A + F + AV+      
Sbjct: 172 LDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDEVNKHAIN-FIERAVE------ 224

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            DKP F +      H      HL    E +     +          ++K LDD VG +++
Sbjct: 225 ADKPFFTWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMMA 276

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            ++  G+ +N+IIIF +DNG   + + +       G   P+ G K T WEGGV+ PA++ 
Sbjct: 277 KMEELGIADNTIIIFTADNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPALVS 329

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
            P       V   +    DWLPTL  AAGG T      + G D +            +++
Sbjct: 330 WPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKMLKGHDGF-----------KAHL 378

Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
           DG +Q   L     S R  +     E+ +  AVR+  WK     Q +    G+ G     
Sbjct: 379 DGYNQVDMLTEKGESNRKEIYYY--ERDKLQAVRIGDWKAHFVVQNH----GWSGPKEEL 432

Query: 504 KVPLLNFN 511
             PLL FN
Sbjct: 433 NAPLL-FN 439


>gi|406834305|ref|ZP_11093899.1| arylsulfatase [Schlesneria paludicola DSM 18645]
          Length = 592

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 142/496 (28%), Positives = 217/496 (43%), Gaps = 91/496 (18%)

Query: 63  NQFQYITDPNRRTYAALTKSTTLT-------LLIVYG-----WNDLSFH-GSNEIPTPNI 109
           N F + + P      AL  + T T       +L++ G     WN  ++H G     TPN+
Sbjct: 62  NPFCFTSKPQNERVLALANAQTPTGSSKKPNILVIMGDDVGIWNIGAYHRGMMAGRTPNL 121

Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
           D LA  G++  + YA+  CT  RA+ +TG+ P  TGM      G+   G+P     +   
Sbjct: 122 DKLAAQGMLFTDYYAEASCTAGRANFITGQLPKRTGMTTVGQAGSA-IGLPAQAPTIATI 180

Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV--EL 227
           L+ LGY+T   GK HLG  R E+ P   GF+  FGYL     Y+   + D + +    EL
Sbjct: 181 LKSLGYATGQFGKNHLG-DRNEFLPTVHGFDEFFGYL-----YHLDAMEDPFHKNYPPEL 234

Query: 228 NGHDMRRNLSTAWDTVGEYATD--LFTKEAVQLIED---------QPVD----------- 265
                 RN+  +W T  + AT+   + K   Q IED         + VD           
Sbjct: 235 RDSVGPRNMLHSWATDKDDATEQPRWGKIGKQKIEDAGTLPPKRMETVDDEILEHTLKFI 294

Query: 266 -------KPLFLYL----AHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
                  KP F++L     H+  H     +++  PQ               T  A + +L
Sbjct: 295 DKAKKDEKPFFVWLNPTRIHIVTHLSEKYENMRTPQN-----------GWATVEAGMAQL 343

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLW 372
           DD VG+++  L+ +GM +N+I++F +DNG        T N+  W  G   P+ G K T+ 
Sbjct: 344 DDIVGSMMKKLKDEGMEDNTIVMFTTDNG--------TENF-TWPDGGQTPFAGGKGTVM 394

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG + PA++  P      +V   ++   DWLPT   AAG      P  +D L +   L 
Sbjct: 395 EGGFRAPAMVRWPNHIPAGKVENGIISGMDWLPTFVAAAGN-----PKIVDELKEGKKL- 448

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
               +    ++DG +Q   L   TPS R+ +    +      AVR+D +K     Q +  
Sbjct: 449 --GDATYKVHLDGYNQLDLLTGKTPSARHEIYYFAE--GTLGAVRIDDFKYRFIDQPS-- 502

Query: 493 MDGYYGQTRSNKVPLL 508
             G++G T    VP+L
Sbjct: 503 --GWFGGTVKVDVPIL 516


>gi|305665421|ref|YP_003861708.1| arylsulfatase A [Maribacter sp. HTCC2170]
 gi|88710176|gb|EAR02408.1| arylsulfatase A [Maribacter sp. HTCC2170]
          Length = 469

 Score =  151 bits (382), Expect = 9e-34,   Method: Compositional matrix adjust.
 Identities = 117/380 (30%), Positives = 184/380 (48%), Gaps = 45/380 (11%)

Query: 72  NRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQP 126
           N   +       T   +I++    G+ DLS +G+  I TPN+D +A+ G    N Y    
Sbjct: 18  NVDIFGQKANQQTPNFIIIFADDLGYGDLSTYGNPTIHTPNLDRMAHEGQKWTNFYVGAS 77

Query: 127 VCTPSRASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
           VCTPSRA+L+TG+ P+ +GM       ++     G+P  E  L E L+  GYST AIGKW
Sbjct: 78  VCTPSRAALLTGRLPVRSGMASSKTRVLFPNSKNGLPANEITLAEQLKTAGYSTAAIGKW 137

Query: 184 HLGFFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMR---RNLST 238
           HLG  + +Y P   GF+ +FG  Y N +    D+I    Y    + N  D++    N+  
Sbjct: 138 HLGH-KEQYLPTNNGFDYYFGIPYSNDM----DYIGKANYKNLAQNNPSDIKIENYNVPL 192

Query: 239 AWDTV-------GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
             +T            T  +T+E ++ I++   + P F+YLAH   H          P  
Sbjct: 193 LQNTKIIERPTNQNTITKRYTQETIKFIKENKRE-PFFVYLAHSLPHI---------PLF 242

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
           T  +F+         Y  +V+++D  VG +++AL+ +G+ +N+I++F SDNG P + ++ 
Sbjct: 243 TSKEFE--GHSKNGIYGDVVEEIDHGVGEILNALEEEGLDQNTIVVFTSDNG-PWLPFK- 298

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                + GS    R  K + WEGG++ P I W P + +   V+  M    D   T  T A
Sbjct: 299 ----THGGSAGLLRAGKGSTWEGGMREPGIFWGPGLVKKGLVT-DMGSTMDLFTTFSTMA 353

Query: 412 GGDTSRLPLNIDGLDQWSSL 431
           G +     + IDG D   +L
Sbjct: 354 GVEIPNDRI-IDGHDLSKTL 372


>gi|440715400|ref|ZP_20895947.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
 gi|436439744|gb|ELP33158.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
          Length = 1004

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 141/484 (29%), Positives = 211/484 (43%), Gaps = 68/484 (14%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
           ++   R   A  TK   + +L    G+ DLS  G+  + TP ID +A  G  L + Y A 
Sbjct: 549 VSPAGREKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 608

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
           PVCTPSRA LMTG YP    M     +G     +P+G+   E  + E L+  GY T   G
Sbjct: 609 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 668

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
           KWHLG  + E+ P  +GF+  FG        Y H +   + R    +   +   + +   
Sbjct: 669 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPSLPLLQNDTVI 721

Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
             D   ++ T   T++AV  IE    D+P FLYL H   HA          G A   + A
Sbjct: 722 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 780

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
            ++      Y T  N   +   + ++D SVG ++ AL+  G+ E ++++F SDNG P   
Sbjct: 781 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 835

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
             + + Y + G     RG K T +EGG++ P ++ W  QI      + ++M   D LPT 
Sbjct: 836 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 889

Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              AG                      TP+ R   IDG D W +L   T +  ++   + 
Sbjct: 890 AKLAGA--------------------ATPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 926

Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
               + AAVR   WKL +    NG     Y        KV ++  N  V  K  + L+  
Sbjct: 927 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQRQLKDF 982

Query: 526 SQNI 529
           + +I
Sbjct: 983 AADI 986



 Score =  106 bits (265), Expect = 3e-20,   Method: Compositional matrix adjust.
 Identities = 119/459 (25%), Positives = 185/459 (40%), Gaps = 79/459 (17%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
           A  +   + L+ V   G+ DL  +G+ ++ TPNID LA  G    + + A  VCTPSR  
Sbjct: 27  AAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYG 86

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           L+TG+YP+   M G  IWG  P    L      + + +  +  GY+T  +GKWHLGF   
Sbjct: 87  LLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEE 145

Query: 191 EYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR--- 233
                      P   GF+ +FG   +N    Y     D I     S  +   G  +    
Sbjct: 146 PCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTP 205

Query: 234 -----------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
                         S A      Y  +    L T+ AV+ I ++  D+P FLY A    H
Sbjct: 206 MFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFFLYFATPNIH 264

Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
                     P     +F+  +      Y   V +LD  VG ++ +L+  G+ +N++++F
Sbjct: 265 ---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLF 313

Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM 397
            SDNGA        +       N    G K  +WEGG +VP I  W  +I+         
Sbjct: 314 TSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK--------- 364

Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLLLNT 456
                        AG  + +L   +D    +S+L     PS      D ++   +LL + 
Sbjct: 365 -------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALLDDP 409

Query: 457 PSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
                + L+    + R  A+R   W L +G + +G  +G
Sbjct: 410 NEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 447


>gi|284035278|ref|YP_003385208.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283814571|gb|ADB36409.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 474

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 120/363 (33%), Positives = 179/363 (49%), Gaps = 50/363 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+  +G N I TPNID +A  GI     YA   VC PSR++LMTG++  HT ++G  
Sbjct: 37  GYGDIGINGQNLIKTPNIDRMAAQGIQFRQFYAGTTVCAPSRSALMTGQHTGHTYIRGNR 96

Query: 151 IWGAEPRG-VPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-L 206
               +P G  P+ +  +   E L++ GY+T A GKW LG    E  PL +GF+  +GY  
Sbjct: 97  EL--QPEGQFPIADSVMTMAEILKQAGYTTAAFGKWGLGPVGSEGDPLKQGFQRFYGYNC 154

Query: 207 NGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
            G+   YY + L D   + V        +NL+       E+A DL  K+A++ ++ +   
Sbjct: 155 QGLAHRYYPNHLWDNDKKVVLA----ANKNLTQ----TSEFAPDLIQKQALRFLDKRTGS 206

Query: 266 KPLFLYLAHLAAHA-----------GNAGKHLEAPQETINQFQYITDPNRRT-------- 306
           +P FL+L +   HA              GK  E P    +       PN +T        
Sbjct: 207 QPFFLFLPYTLPHAELVPPDDSLFASYKGKFTEKPYTGAD-----YGPNAKTGGYASQAY 261

Query: 307 ----YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
               YAAMV +LD  VG V++ L+ KG+ +N+++IF SDNG P VE      + N  SN 
Sbjct: 262 PHAAYAAMVARLDLYVGQVLAKLKAKGLDKNTLVIFSSDNG-PHVEGGGDPTFFN--SNG 318

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
            +RGVK  L+EGG++ P +   P + +    S  +    D LPT    AG   +++    
Sbjct: 319 GFRGVKRDLYEGGIREPFVARWPGVIKPGTQSDFVGAFWDLLPTFAELAG---AKVAART 375

Query: 423 DGL 425
           DG+
Sbjct: 376 DGI 378


>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
           [Oryzias latipes]
          Length = 523

 Score =  151 bits (382), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 52/370 (14%)

Query: 80  TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           T S  + ++++   GW DL   G     TPN+DA+A  GI+  + Y A P+C+PSRA+L+
Sbjct: 28  TSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALL 87

Query: 137 TGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           TG+ PI  G     G       P+    G+   E  LP+ L+E GY  K +GKWHLG  R
Sbjct: 88  TGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLGH-R 146

Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL---------SDQYSRTVELNGHDMRRNLSTAW 240
            +Y PL  GF+  FG  N     Y++ +         S+   R  E    D +   S   
Sbjct: 147 PQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGESNLT 206

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
               E   D  +++A          +P FLY A  A H+                  ++ 
Sbjct: 207 QMYLEAGLDFISRQA-------EAKRPFFLYWAADATHS-----------PVYASKPFLG 248

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
              R  Y   V +LD SVG +++ LQR G+  N+ + F SDNGA  +     S     GS
Sbjct: 249 KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALM-----SGPNEGGS 303

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG----GDTS 416
           N P+   K T +EGG++ PAI W P   +   VS Q  ++ D   T    AG    GD  
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRV 363

Query: 417 RLPLNIDGLD 426
                +DGLD
Sbjct: 364 -----LDGLD 368


>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
 gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
          Length = 596

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/349 (32%), Positives = 167/349 (47%), Gaps = 54/349 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW DLSF+G+  + TPNIDA+A NG    N Y QPVC+P+RA L+TGKY    G+     
Sbjct: 48  GWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGKYAARLGVYSTST 107

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G         E  + E  ++ GY T A GKWH G  +  Y P  RGF+ ++G+ +G   
Sbjct: 108 GGERFNS---KETTIAEIFKKAGYKTTAYGKWHSG-MQPPYHPNSRGFDDYYGFTSGHWG 163

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-YATDLFTKEAVQLIEDQPVDKPLFL 270
            Y       +S  +E NG  ++          GE +  D  T + +  I +   + P FL
Sbjct: 164 NY-------FSPMLEHNGEIVK----------GEGFLVDDLTNKGLDFITENK-NNPFFL 205

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR------------RTYAAMVKKLDDSV 318
           YL +   H+      ++ P E   +F+      R            R   AMV+ +D ++
Sbjct: 206 YLPYNTPHS-----PMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNM 260

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G + + L+  G+ EN+II+++SDNG              W  N   RG K +  EGGV+ 
Sbjct: 261 GRLTNKLKELGLEENTIIVYLSDNGP-----------NGWRWNGGMRGRKGSTDEGGVRS 309

Query: 379 P-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           P  I W   I +N ++S Q+    D LPTL + AG +   +  +IDG D
Sbjct: 310 PFFIQWKNTIPKNKKIS-QIAGAIDILPTLTSLAGINQPTIK-SIDGKD 356


>gi|71280931|ref|YP_269082.1| sulfatase [Colwellia psychrerythraea 34H]
 gi|71146671|gb|AAZ27144.1| sulfatase family protein [Colwellia psychrerythraea 34H]
          Length = 492

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 122/373 (32%), Positives = 179/373 (47%), Gaps = 45/373 (12%)

Query: 66  QYITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA 124
           Q +  P++ T    +K   + LL+  +G  DLS +GSN   TPNID LA +G+  +N YA
Sbjct: 19  QAVATPDKST----SKPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAADGMKFDNAYA 74

Query: 125 -QPVCTPSRASLMTGKYPIHTGM-QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK 182
             P C PSR ++ +G YP   G+ QG  +       +PL+     E+L+E GY T  IGK
Sbjct: 75  AHPRCVPSRVAIFSGSYPTRYGVPQGERV---GKHHLPLSAVTFGEHLKEAGYQTGYIGK 131

Query: 183 WHLGFFRREYTPLYRGFESHF--GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
           WHLG  +    P  +GF+S    G+     SYY       +  T +++     +  +   
Sbjct: 132 WHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYY-------FPYT-KMSKSGKNKGFAKVE 181

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA--------------------- 279
            +  EY TD  T EA+  IE Q  D+P  L LAH A H                      
Sbjct: 182 GSEEEYLTDRLTDEALTFIE-QKKDQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGI 240

Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
            NAG   +A     +   + T  N   YAAMV+ +D SVG +   L+R G+ +N+III  
Sbjct: 241 ANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILT 300

Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
           SD+G  +    + SN     SN PYR  K  +++GG +VP I+  P+  +   +S   + 
Sbjct: 301 SDHGGLSSRGLK-SNRVLATSNNPYRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVT 359

Query: 400 ISDWLPTLYTAAG 412
            +D  PT+   AG
Sbjct: 360 GTDHYPTILQMAG 372


>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
 gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
          Length = 522

 Score =  151 bits (381), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 133/446 (29%), Positives = 204/446 (45%), Gaps = 69/446 (15%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
           AA   +    L+   G  D+  +G   I TPN+  +A  G++ +  YA  PVC PSRA+L
Sbjct: 40  AAKKPNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAAL 99

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERF-----------LPEYLRELGYSTKAIGKWH 184
           MTG++  H  ++G      E  G    E F           + + +++ GY+T  IGKW 
Sbjct: 100 MTGQHSGHNQIRG----NFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWG 155

Query: 185 LGFFRREYTPLYRGFESHFGYLNGVIS---YYDHI--------LSDQYSRTVELNGHDMR 233
           LG       P  +GF+  FGYL+   +   Y  H+        L+++Y    +  G    
Sbjct: 156 LGGPGSYGVPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEYLDPHKGLGKHDD 215

Query: 234 RNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
            N   +++    G++A    T EA+  I D   D+P FLYL++ A HA      L+AP E
Sbjct: 216 PNDEASYEKFKRGDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDE 269

Query: 292 TINQFQYITD------------PN---RRTYAAMVKKLDDSVGTVISALQRKGMLENSII 336
            I ++ +  D            P+   R T AAM+  +D+ VG V+  L++ G+ EN+++
Sbjct: 270 EIAKYNHFEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLV 329

Query: 337 IFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ 396
           IF SDNG P+ E    ++   + SN  YRG K  L++GG+++P I   P      +VS  
Sbjct: 330 IFSSDNG-PSFE--GGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDH 386

Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLN- 455
           +    D LPTL           P NIDG+    +LL           DG  Q  SL    
Sbjct: 387 VSAFWDVLPTLADLV---EIEAPKNIDGISFLPTLLAE---------DGQQQHDSLYWEF 434

Query: 456 ---TPSRRNSVLINIDEKKRTAAVRL 478
                S + +V I  ++     A+R 
Sbjct: 435 HHWNGSHKQAVRIQAEKNADWKAIRF 460



 Score = 39.7 bits (91), Expect = 5.3,   Method: Compositional matrix adjust.
 Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 6/58 (10%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
           G++A    T EA+  I D   D+P FLYL++ A HA      L+AP E I ++ +  D
Sbjct: 228 GDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDEEIAKYNHFED 279


>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
 gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
          Length = 550

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 145/486 (29%), Positives = 216/486 (44%), Gaps = 109/486 (22%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           A TK   + L++V   G++DL  +GS EI TPNID LAY G+ L   Y   +C P+RASL
Sbjct: 25  AQTKKPNIILIMVDDMGYSDLGAYGS-EIQTPNIDKLAYEGLRLREFYNNSICAPTRASL 83

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFL-------PEYLRELGYSTKAIGKWHLGFF 188
           +TG+YP   G+    +      G+P  + +L        E LR+ GYST   GKWH+G  
Sbjct: 84  ITGQYPHKAGVGYFDV----NLGIPPYQGYLNKESLTFGEVLRQAGYSTLLSGKWHVGND 139

Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
              + P  RGF+  FG + G  +Y+D     L  QY   V L  +  ++  + ++     
Sbjct: 140 SLHW-PKQRGFDRFFGVIGGGSNYFDAEPMPLGRQYP-VVILEDNQRQKPKANSY----- 192

Query: 246 YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAH----------AGNAGKH--------- 285
           Y TD  T  AVQ +++Q  +DKP FLYLA+ A H          A   GK+         
Sbjct: 193 YFTDEITNHAVQFLDEQNKMDKPFFLYLAYTAPHWPLQALPEDIAKYRGKYDTGWDALRK 252

Query: 286 ------------------------LEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVG 319
                                   +E P      F+   +       YAAM+ + D  VG
Sbjct: 253 ERLARQVKLGIITQAQADAAAARDVEVPDWDNLTFEEKQLWKAKMEVYAAMLDRADQGVG 312

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPT--VEYRETSNY-RNWG----------------- 359
            V++ L+     +N++IIF+SDNGAP   + +R+ ++  RN G                 
Sbjct: 313 KVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGTAGSFESQGKPWSY 372

Query: 360 -SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
            SN P+R  K+  +EGG+  P I W P   +   ++    H+ D  PT Y  AG    + 
Sbjct: 373 VSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAPTFYELAG---LKY 429

Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
           P   +G        ++T + +  ++      S+LL  T +      I   E+    AVR 
Sbjct: 430 PSTYNG--------VSTNALQGKSL------SNLLFKTEAWERGEPI-FWERAGNRAVRQ 474

Query: 479 DSWKLV 484
             WKLV
Sbjct: 475 GKWKLV 480


>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
 gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
          Length = 496

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 113/320 (35%), Positives = 164/320 (51%), Gaps = 40/320 (12%)

Query: 103 EIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPL 161
           +I TPNID LA +G+     Y    + +P+RA LMTGKY    G  G  +      G+P 
Sbjct: 60  KIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYG--LSTPHVGGIPQ 117

Query: 162 TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY 221
            ++ L EYL E GY+T  IGKWHLG + R + P  RGF++ FG++NG+  YYD ++   +
Sbjct: 118 DQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PNNRGFQTFFGFINGLHDYYDPLVGGSW 176

Query: 222 SRTVELNGHDMRRNLSTAWD-----TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLA 276
                 NG      L+   D     T  EY+T  +TK AV  I+    D P FLYL + A
Sbjct: 177 DGV--YNG------LAFTLDNMEPVTEMEYSTYEYTKRAVDFIQKN-ADHPFFLYLPYNA 227

Query: 277 AHAGNAGKHLEAPQETINQF----QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLE 332
            H+      L+AP+E I +     Q I   +     AM   LD  VG V+  L++ G+ +
Sbjct: 228 IHS-----PLQAPEELIGELAINPQEIGKDD--IARAMTFALDQGVGKVVETLEQLGLRD 280

Query: 333 NSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR 392
           N+II ++SDNGA  VEY +          + +RG K + +EGG++VP I+  P       
Sbjct: 281 NTIIFYLSDNGA--VEYSD---------KWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGT 329

Query: 393 VSLQMMHISDWLPTLYTAAG 412
           +  + +   D  PT+   AG
Sbjct: 330 IYNKPVMSIDIAPTVMELAG 349


>gi|254294928|ref|YP_003060951.1| sulfatase [Hirschia baltica ATCC 49814]
 gi|254043459|gb|ACT60254.1| sulfatase [Hirschia baltica ATCC 49814]
          Length = 487

 Score =  150 bits (380), Expect = 1e-33,   Method: Compositional matrix adjust.
 Identities = 112/392 (28%), Positives = 188/392 (47%), Gaps = 41/392 (10%)

Query: 69  TDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-Q 125
           T P   +  A  K   +  ++    G+ D+S +G  E  TP ID+LA+ G+     Y+  
Sbjct: 41  TSPASHSNIAQKKRPNIIFIMADDLGYADVSCYGRREYKTPAIDSLAHTGVKFTQAYSNS 100

Query: 126 PVCTPSRASLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
           PVC+ +R S++TG+Y   +  G++ P  +     G+P T   LP  L++ GY T  +GKW
Sbjct: 101 PVCSATRTSVITGRYQYRLPVGLEEPLTF--RDVGLPTTHPTLPGQLKKSGYQTALVGKW 158

Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
           HLG   + Y PL  G++  +G   G + Y+ H   D  ++  +L   ++  + +      
Sbjct: 159 HLGALPK-YGPLKSGYDHFWGVRGGSVDYFTH---DSMAKQNDLWDGEVEIHET------ 208

Query: 244 GEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ-----FQ 297
            EY T L   ++++LIE     ++P F+ L + A H    G   +   + + Q       
Sbjct: 209 -EYLTKLIGGKSIELIEKMNENEEPFFMSLHYTAPHWPWEGPEDKTESDRLKQGGPFSLL 267

Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
           ++   +  TYAAMV  +D+ +G +++ L   G+ E ++I+F SDNG              
Sbjct: 268 HLDGGSLETYAAMVTSMDNQIGRLLNKLDELGITEETLIVFTSDNGG-----------ER 316

Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           +   +P+ G+K  L EGG++VP I+  P I +    S Q+    D +PTL   AG     
Sbjct: 317 FSDTWPFSGIKTELLEGGLRVPLIIRWPGITKANTTSDQVSMSMDLMPTLL-KAGQAAPH 375

Query: 418 LPLNIDGLDQWSSLLLNTPSRRN-----SNID 444
               +DG+D   +L+    S R      SN+D
Sbjct: 376 PDFPLDGIDLTEALVSERTSTRQIYFRYSNLD 407


>gi|395804316|ref|ZP_10483556.1| sulfatase [Flavobacterium sp. F52]
 gi|395433415|gb|EJF99368.1| sulfatase [Flavobacterium sp. F52]
          Length = 539

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 119/400 (29%), Positives = 190/400 (47%), Gaps = 69/400 (17%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRA 133
           + + K T + +L+    G  D+S +G    PTP ID+LA +G+   + YA   +C+PSRA
Sbjct: 57  SIIGKKTNIIILLADDLGKYDISLYGGKSTPTPQIDSLATSGVTFTDGYASAAICSPSRA 116

Query: 134 SLMTGKYPIHTGMQGPP----------IWG-----------------------AEPRGVP 160
            L+TG+Y    G +  P           +G                        + +G+P
Sbjct: 117 GLITGRYQERFGHEYQPGDRYPKNNLEYYGFKYLINTDGWRLNDKINYPNEASIKTQGLP 176

Query: 161 LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-------- 212
            +E    +  +  GYST  IGKWHLG   + + PL RGF+ H+G+      Y        
Sbjct: 177 KSEITFADLAKRQGYSTGIIGKWHLGH-NKGFFPLDRGFDHHYGFYQAFSLYAPEDDNPD 235

Query: 213 -YDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
             +H   D   +T+  NG      +RR+ ST  D    Y T+ F +EA   I D+  +KP
Sbjct: 236 IINHHHKDFTDKTIWGNGRVGIGQIRRD-STIIDEKA-YLTEKFAEEAEAFI-DKNKNKP 292

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             LY+   A H        +  ++  ++F  + D N+R Y AMV  LDD++G +   +++
Sbjct: 293 FLLYIPFNAPHT-----PFQVRKKYYDRFPNVKDENKRVYFAMVSALDDAIGRIREKVKK 347

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
           +G+ +N++IIF SDNG         ++Y    +N P +G K + +EGG+ VP A+ W  +
Sbjct: 348 EGLEDNTLIIFASDNGG--------ADYTFATTNAPLKGGKFSHFEGGINVPFALSWKGK 399

Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           I+ N  V    +   D   T+ TA G D  +  +  DG+D
Sbjct: 400 IKPN-TVYKNPVSTLDIFTTIATAIGSDLPKDRI-YDGVD 437


>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
           africana]
          Length = 513

 Score =  150 bits (380), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 123/363 (33%), Positives = 175/363 (48%), Gaps = 36/363 (9%)

Query: 84  TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
            L L+   GW DL  +G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ PI
Sbjct: 34  VLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 93

Query: 143 HTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
             G     G       P+    G+P +E  LPE L++  Y+TK +GKWHLG  R ++ PL
Sbjct: 94  RNGFYTTNGHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLG-HRPQFHPL 152

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDL 250
             GF+  FG  N     YD+         R  E+ G    +   NL T    +    T +
Sbjct: 153 KHGFDEWFGSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQI 208

Query: 251 FTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
           + +EA+  I+  Q   +P FLY A  A HA                  ++    R  Y  
Sbjct: 209 YLQEALDFIKRQQSQQRPFFLYWAIDATHA-----------PVYASKPFLGTSQRGRYGD 257

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V+++DDSVG ++  L+  G+ E++ I F SDNGA  V     S  +  GSN P+   K 
Sbjct: 258 AVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDNGAALV-----SAPKEAGSNGPFLCGKQ 312

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
           T +EGGV+ PAI W P      +VS Q+  + D   T  + AG +       IDGLD   
Sbjct: 313 TTFEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLEPPS-DREIDGLDLLP 371

Query: 430 SLL 432
            LL
Sbjct: 372 VLL 374


>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
 gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
          Length = 433

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 113/330 (34%), Positives = 159/330 (48%), Gaps = 32/330 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
           G+ DL  +G  E  +P +D+LA NG++L   YA  PVC+P+R +LMT +Y     G    
Sbjct: 14  GYADLGCYGGREPVSPVLDSLAANGLMLTQGYANSPVCSPTRFALMTARYQYRLRGAAEE 73

Query: 150 PI----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           PI     G+   G+P     LP  LR  GY T  IGKWHLGF    + PL  G+++ FG 
Sbjct: 74  PINSKSRGSSALGLPPAHPTLPSLLRAQGYRTALIGKWHLGF-APAFGPLLSGYQAFFGP 132

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
           + G + Y+ H   D   R      HD+  N   A      Y TDL ++ AV  +     +
Sbjct: 133 MAGGVDYFTHC--DSSGR------HDLWHN--DAEHQEPGYLTDLLSRRAVDYVTRMSGE 182

Query: 266 -KPLFLYLAHLAAHA--GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
             P FL L + A H           APQ + N F ++   N  TY  M+  +D+ +G ++
Sbjct: 183 GAPFFLSLHYTAPHWPWETRDDAALAPQVSNNLF-HLDGGNIHTYRRMISHMDEGIGWLV 241

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
            AL++ G L+N++I+F SDNG              +  N+P  G K  L EGG++VP I 
Sbjct: 242 QALRQLGQLDNTLIVFTSDNGG-----------ERFSDNWPLVGGKMDLTEGGIRVPWIA 290

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             P +      S Q     DW  TL  AAG
Sbjct: 291 HWPAMIAPGGTSAQHCMTMDWSATLLDAAG 320


>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
 gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
          Length = 508

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 132/414 (31%), Positives = 185/414 (44%), Gaps = 82/414 (19%)

Query: 92  GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G ND+S  G       + TPNID LA  G + +  Y+    C PSRA LMTG+YP  TG 
Sbjct: 30  GINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAMLMTGRYPTRTGF 89

Query: 147 QGPPI----------------WGAEP----------------RGVPLTERFLPEYLRELG 174
           +  P                  G  P                +G+P  E  L E L++ G
Sbjct: 90  EYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTEEVTLAEVLKDRG 149

Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV---------------ISYYDHILSD 219
           Y T  IGKWHLG     + P  +GF+      +G+                   D  L  
Sbjct: 150 YHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGVEARLDFDPIDKFLWA 208

Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
           +       NG D        W   G Y TD +T E++++I D   ++P FLYLAH   H 
Sbjct: 209 RMDFAASYNGSD--------WFEPGGYLTDYWTDESLKVI-DANKNRPFFLYLAHWGVHT 259

Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
                 L+A +E       I     R YAAM++ +D SVG ++  L+ +G+ +N+II+F 
Sbjct: 260 -----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEGLADNTIIVFS 314

Query: 340 SDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM 397
           SDN GA  +   E         N PYRG K TL+EGG++VP  + W   IQ    +   +
Sbjct: 315 SDNGGAGYIGIPEV--------NAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVIDEPV 366

Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
            HI D +PTL  A G   +RLP +++ +D  + L L T +   +  D    WSS
Sbjct: 367 AHI-DLMPTLAAATG---ARLPSSVE-IDGQNLLPLATGTGPITRPDDALFWSS 415



 Score = 39.7 bits (91), Expect = 5.5,   Method: Compositional matrix adjust.
 Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 4/50 (8%)

Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH----RRTLVPQSHEQP 612
            +LFNL  DP EQ N+A+S P+  ++L  L+  H    R  L P + E P
Sbjct: 437 SWLFNLNEDPTEQTNLAASEPEKLAELQALIDAHWKDAREPLYPYTIESP 486



 Score = 39.3 bits (90), Expect = 6.8,   Method: Compositional matrix adjust.
 Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 6/75 (8%)

Query: 9   WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
           W   G Y TD +T E++++I D   ++P FLYLAH   H       L+A +E       I
Sbjct: 221 WFEPGGYLTDYWTDESLKVI-DANKNRPFFLYLAHWGVHT-----PLQATKEDYEAVGDI 274

Query: 69  TDPNRRTYAALTKST 83
                R YAA+ ++ 
Sbjct: 275 EPHRLRVYAAMLRAV 289


>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
           [Ciona intestinalis]
          Length = 513

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 110/343 (32%), Positives = 159/343 (46%), Gaps = 49/343 (14%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
             L+   GW DL  +G     TPN+D +A  G +  + Y A P+C+PSRA+L+TG+ PI 
Sbjct: 25  FMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTDFYSANPLCSPSRAALLTGRLPIR 84

Query: 144 TGMQGPPIWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G       G     P+    G+P  E  + E L   GY+ K IGKWHLG  + +Y PL 
Sbjct: 85  NGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSAGYTNKLIGKWHLGQ-QEQYLPLK 143

Query: 197 RGFESHFGYLNGVISYYDHI------------LSDQYSRTVELNGHDMRRNLSTAWDTVG 244
            GF   FG  N     YD              +  +Y     +  H    N+        
Sbjct: 144 HGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRYYEEFAIESHKYLSNM-------- 195

Query: 245 EYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
              T  + +EA+  IE  +  +KP FLY A  A H+         P  +   F+  +   
Sbjct: 196 ---TQYYIQEALDFIERMERNEKPFFLYWAPDATHS---------PVYSSPMFRGAS--R 241

Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
           R  Y   V +LD  VG +I  L++ G+ +N++++F SDNGA  +     S + N GSN P
Sbjct: 242 RGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMI----GSAFEN-GSNGP 296

Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
           +   K T +EGG++ P I W P   Q  +VS Q+  + DW  T
Sbjct: 297 FLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFAT 339


>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 472

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 49/355 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL  +GS    TPNID L   G+     Y A PVC+PSRAS++TGK P + G  G  
Sbjct: 40  GWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILTGKNPANIGFTGHI 99

Query: 151 I------WGAEPRGVP--------LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
                  +  E R +P        L E+ +PE L + GY++ +IGKWH+G    ++ P +
Sbjct: 100 TAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKWHVG-EEEKFFPTH 158

Query: 197 RGFESHF-GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           +GF  +  GY +G    Y      + S    +   D R          G+Y T+  T EA
Sbjct: 159 QGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNREE--------GQYLTNRLTDEA 210

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----TDPNRRTYAAMV 311
           +  I D+  + P FLYL+H A H       LEAP   I +++       +     YAAM+
Sbjct: 211 INFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYELKLEGQMEQKNAIYAAMI 264

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
           + +D +VG ++ +L   G+  N+I+I  SDNG    E R T       +N P R  K  +
Sbjct: 265 ENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGG---EGRVT-------NNVPLREGKGYI 314

Query: 372 WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           +EGG++VP ++ W  +++ N    + ++   D LPT+   AG   S    +IDG+
Sbjct: 315 YEGGIRVPLVIKWPGKVKPNSSSDVPVI-TDDMLPTIVEMAG--LSNENQDIDGV 366



 Score = 39.7 bits (91), Expect = 5.0,   Method: Compositional matrix adjust.
 Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 6/54 (11%)

Query: 13  GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
           G+Y T+  T EA+  I D+  + P FLYL+H A H       LEAP   I +++
Sbjct: 199 GQYLTNRLTDEAINFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYE 246


>gi|254482499|ref|ZP_05095738.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
 gi|214037190|gb|EEB77858.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
          Length = 602

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 115/354 (32%), Positives = 169/354 (47%), Gaps = 45/354 (12%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
           L  L  +G+NDL+ +  ++ PTP +DA+A  G+     YA+  CT SR +L+TG+YP   
Sbjct: 20  LLALDDFGYNDLAINNGSDSPTPRLDAIAAQGVRFTRHYAESSCTASRVALLTGRYPARV 79

Query: 145 GMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
                   GA P   G+      LP+ L   GY    +GKWH G   RE  P Y+GF+  
Sbjct: 80  --------GAHPYLNGIDHELMTLPDALGSEGYIRHMVGKWHTGDSHRESRPEYQGFDHW 131

Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           FG++N +     H  S  Y R      +    N          + TD+ T  A+ +I+ +
Sbjct: 132 FGFINQLYLRGPH-RSANYRRGKPTYINPWLENELGDLQQYEGHLTDILTDRALDVIKRE 190

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGT 320
               P FLYL++ A H          P E   +F  +Y  DP  R Y AM  +LD ++G 
Sbjct: 191 --QNPWFLYLSYYAPH---------TPIEPAARFSERYADDPAGR-YQAMKDQLDSNIGR 238

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           +I  L   G ++N++II +SDNG            ++W SN P+ G K T  EGGV+ P 
Sbjct: 239 IIDWLTESGEIDNTMIIVVSDNGGTA---------KSWPSNLPFYGSKATYTEGGVRTPL 289

Query: 381 IL-WS---PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           +L W    P  QQ+ ++++    I D  PT+  A G      P  +DG D + S
Sbjct: 290 LLSWPGHWPVGQQDDQIAM----IFDLYPTILAALG---KAQPEGLDGADLFDS 336


>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 443

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 109/343 (31%), Positives = 173/343 (50%), Gaps = 34/343 (9%)

Query: 77  AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
           A + +   + L++V   G+ DL  +GS  I TP +D +A +G+ L + YA  P+CTP+RA
Sbjct: 28  AEIQQQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRA 87

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           +LMTG Y    G+   P+   +  G+  +E  L E +++ GY T  +GKWHLG   R Y 
Sbjct: 88  ALMTGCYATRVGLP-TPLHVYDEIGINESEFTLGEAMQQCGYETVCVGKWHLGHQPRFY- 145

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           P   GF  ++G   G + +    +      T +L   D  R +    D      T+  T+
Sbjct: 146 PTEHGFNHYWGTPLGHM-FNRPAVGKAIGDTSDLFLDDT-REIPFPEDA---DLTERLTE 200

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAMVK 312
           +AV+ IE +  D+P FL+LAH              P E +   +   D ++   Y  +V+
Sbjct: 201 KAVEFIEAK-RDRPFFLFLAH------------PMPHEPLAASEKFADRSQAGLYGDVVE 247

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
            LD S G V+ AL+R G  EN+++ F+SDNG            +  G+    RG K+  +
Sbjct: 248 CLDWSTGEVLDALERTGQAENTLVAFLSDNGPK----------KGHGTTAGLRGFKHDPY 297

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
           EGG++VP I+ +  +    + S ++ ++ DW PTL   AGG T
Sbjct: 298 EGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTT 340


>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
 gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
          Length = 483

 Score =  150 bits (379), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 120/388 (30%), Positives = 178/388 (45%), Gaps = 68/388 (17%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
            +A   K   +  ++V   GW D++ +GS    TPNID LA  GI     YA  P+C+P+
Sbjct: 22  VFAQTDKQPNIVFILVDDLGWMDIAANGSTFYETPNIDQLAKEGIRFTKAYAASPICSPT 81

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEP------RGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           RAS++TGK P    +     W   P      R +PL E   PE L++ GY T  +GKWHL
Sbjct: 82  RASILTGKNPARIDLTQ---WIGGPGNPDYLRNLPLEEVLFPELLQDAGYKTAFMGKWHL 138

Query: 186 GFFRRE--YTPLYRGFESHF-GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT 242
                E  + P  +GF+ +  G+  G +   +   S      +E NG +           
Sbjct: 139 NNVAGEEKFWPDKQGFDVNVAGHFRGGLYIKNKYFSPWNIPNLE-NGPE----------- 186

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ------- 295
            GEY TD   KEA+  I D+    P  LY +  + HA       +AP + + +       
Sbjct: 187 -GEYMTDRLAKEAIDFI-DENSKAPFLLYFSLYSVHAP-----FDAPADRVEKYEKKKKA 239

Query: 296 ------------------FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
                             F+Y    +  TYAAMV+ +D +VG ++  LQ KG+ +N+++I
Sbjct: 240 LALTDKERFAEEKNAQKPFKYRIKQDHPTYAAMVESMDMAVGKILGKLQEKGIADNTVVI 299

Query: 338 FMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ 396
           F SDNG   T E   T       +N P R  K  L+EGG++ PAI+  P   +   VS  
Sbjct: 300 FFSDNGGLSTSEGIPT-------ANTPLRAGKGWLYEGGIREPAIIRWPGTIKPGTVSDA 352

Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           ++   D+ PT+     G   R  L++DG
Sbjct: 353 VITSMDFYPTILEMT-GQALRPDLHVDG 379


>gi|146302377|ref|YP_001196968.1| sulfatase [Flavobacterium johnsoniae UW101]
 gi|146156795|gb|ABQ07649.1| sulfatase [Flavobacterium johnsoniae UW101]
          Length = 539

 Score =  150 bits (378), Expect = 2e-33,   Method: Compositional matrix adjust.
 Identities = 136/513 (26%), Positives = 227/513 (44%), Gaps = 98/513 (19%)

Query: 73  RRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
           + T AA  K   + +L+    G  D+S +G    PTP ID+LA +G+   + Y +  +C+
Sbjct: 53  KDTSAASEKKPNIIILLADDLGKYDISLYGGKSTPTPQIDSLAASGVTFTDGYVSSSICS 112

Query: 130 PSRASLMTGKYPIHTGMQGPP---------------------IWGAEPR----------- 157
           PSRA L+TG+Y    G +  P                      W   P+           
Sbjct: 113 PSRAGLLTGRYQERFGHEYQPGDRYPKNNLEYYAFKYLLNTNSWRLNPKIEYPNDASIAT 172

Query: 158 -GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY---- 212
            G+P +E    +  ++ GYST  IGKWHLG   + + PL RGF+ H+G+      +    
Sbjct: 173 QGLPKSEITFADLAKKQGYSTAIIGKWHLGH-TKGFFPLDRGFDYHYGFYQAFSLFAPED 231

Query: 213 -----YDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
                 +H  +D   +T+  NG      +RR+ ST  D   +Y T+ F +EA   I D+ 
Sbjct: 232 NNPDIINHHHTDFTDKTIWGNGRVGTGQIRRD-STIIDE-KKYLTEKFAEEAEAFI-DKN 288

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            +KP  LY+   A H        +  ++  ++F  + D N+R Y AM+  LDD++G + +
Sbjct: 289 KNKPFLLYVPFNAPHT-----PFQVRKKYYDRFPNVKDENKRVYFAMISALDDAIGLIRA 343

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
            ++++G+ EN++I F SDNG     Y  T        N P +G K + +EGGV VP A+ 
Sbjct: 344 KVKKEGLEENTLIFFASDNGGADYTYATT--------NAPLKGGKFSHFEGGVNVPFALS 395

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR--RN 440
           W  +I+ +               T+Y             +  LD +S++   T S   ++
Sbjct: 396 WKGKIKPH---------------TIYKTP----------VSSLDIFSTIAAVTHSGLPKD 430

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
              DG+D    +  N  + +N    + D K    A+R   WKL++  + + T      + 
Sbjct: 431 RVYDGVDLVDVVNNNKQAHQNLYWRSGDAK----AIRSGDWKLIISGKTHETWLYNLAKD 486

Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
           +S    L + N     +   +LQ   + +  P+
Sbjct: 487 KSETTDLASKNPEKVKELQTALQNWEKGLIKPL 519


>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 466

 Score =  150 bits (378), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 119/369 (32%), Positives = 181/369 (49%), Gaps = 37/369 (10%)

Query: 82  STTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           ST   +++++    G+ DL  +GS  I TP++D +A  G+   + Y A  VCTPSRA+L+
Sbjct: 33  STKPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTPSRAALL 92

Query: 137 TGKYPIHTGMQGP--PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           TG+ PI +GM G    ++     G+P  E  + E L+E GY+T  IGKWHLG       P
Sbjct: 93  TGRLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLGIHPGS-RP 151

Query: 195 LYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVGEYA-TD 249
           L +GF+  FG  Y N + +  D       S    L+G +  + RN         +   T 
Sbjct: 152 LDQGFDQSFGLPYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQPANQTTLTK 211

Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
            +T+EA++ I ++  + P FLY+ H   H          P      F+  +      Y  
Sbjct: 212 RYTEEAIKFITEK-KNVPFFLYMPHTFPHV---------PMFASQDFKGKSRAG--IYGD 259

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
            V++LD SVG V+ AL R+G+ EN+++ F SDNG   +   +       GS    +  K 
Sbjct: 260 AVEELDWSVGQVLGALCREGIAENTLVFFSSDNGPWLIMGDQG------GSAGLLKDGKG 313

Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI--DGLDQ 427
           + WEGG++VP I W P  +  P V+ Q+    D  PT    AG   + LP ++  DG+D 
Sbjct: 314 STWEGGMRVPGIAWMPS-RIKPGVTSQLASAMDVFPTALALAG---ASLPKDVVFDGVDL 369

Query: 428 WSSLLLNTP 436
              L  + P
Sbjct: 370 APLLFESRP 378



 Score = 40.4 bits (93), Expect = 3.1,   Method: Compositional matrix adjust.
 Identities = 15/41 (36%), Positives = 28/41 (68%)

Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
           P  LF+LG DP E+ N+A++ P++  ++ E +K H+  ++P
Sbjct: 420 PELLFHLGRDPSEKRNVAAAHPEVLIRIQEAVKAHQSQVIP 460


>gi|255035933|ref|YP_003086554.1| sulfatase [Dyadobacter fermentans DSM 18053]
 gi|254948689|gb|ACT93389.1| sulfatase [Dyadobacter fermentans DSM 18053]
          Length = 497

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 127/426 (29%), Positives = 191/426 (44%), Gaps = 65/426 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ +L  +G  +I TPN+D LA  GI     Y   PVC P+RA LMTGK+  H+ ++G  
Sbjct: 38  GYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGKHAGHSAIRGNF 97

Query: 151 IWGA----EPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             G     E RG   +P  E  + E L++ GY+T   GKW +G    E TP  +GF+ ++
Sbjct: 98  ELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEGTPTRQGFDYYY 157

Query: 204 GYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--------EYATDLFT 252
           GYL+   +   Y  H+  +    T+     D+ R L  A  T          EYA    T
Sbjct: 158 GYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADFESFKGKEYAPAKMT 217

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNR------ 304
           ++A+  I D+    P FLY+ +   H       L+AP E + ++  Q+   P        
Sbjct: 218 EKALAFI-DRSKAGPFFLYMPYTLPHVS-----LQAPDEYVKKYIGQFDEKPYYGEKNYA 271

Query: 305 ------RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
                  TYA+M+  LDD VG ++  L+  G+ +N+I++F SDNGA    +    N + +
Sbjct: 272 STKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGA---TFNGGVNPQFF 328

Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
            S    RG+K  ++EGG++ P I+  P   +  RVS  +    D +PTL    G  +   
Sbjct: 329 NSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLMPTLAELTGQASP-- 386

Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
               DG+     LL  T                   N   +   +     EK    AVR+
Sbjct: 387 --PTDGISFLPELLGQT-------------------NRQKKHEFLYFEYPEKGGQIAVRM 425

Query: 479 DSWKLV 484
             WK V
Sbjct: 426 GDWKGV 431


>gi|440289645|ref|YP_007342410.1| arylsulfatase A family protein [Enterobacteriaceae bacterium strain
           FGI 57]
 gi|440049167|gb|AGB80225.1| arylsulfatase A family protein [Enterobacteriaceae bacterium strain
           FGI 57]
          Length = 550

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 130/435 (29%), Positives = 200/435 (45%), Gaps = 69/435 (15%)

Query: 92  GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
           GW D+ F+G       PTP+IDA+A  G+IL + Y+QP  +P+RA+++TG+Y +H G+  
Sbjct: 97  GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSVHHGILM 156

Query: 149 PPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           PP++G  P G+  LT   LP+ L + GY T+AIGKWH+G       P   GF+   G+ N
Sbjct: 157 PPMYGM-PGGLEGLTT--LPQLLHDQGYVTQAIGKWHMG-ENEGSQPQNVGFDDFRGF-N 211

Query: 208 GVISYY----------DHILS---DQYSRTVELNGHDMRR----NLSTAWDTVGEYATDL 250
            V   Y          +  LS    +Y + +  +  D+            D   +Y  DL
Sbjct: 212 SVSDMYTEWRDVNVNPEVALSPARSEYIKKLPFSKDDVHAVRGGKQQAVADITPKYMEDL 271

Query: 251 ---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
              + K  V  I      DKP FLY      H  N            N     + P R +
Sbjct: 272 DQRWMKYGVDFINKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSSPARTS 321

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y   + +++D    +   L++ G L+N++I+F SDNG P  E              P+RG
Sbjct: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GRTPFRG 373

Query: 367 VKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            K + WEGGV+VP  + W   +Q  PR S  ++ ++D  PT  + AG   + L       
Sbjct: 374 AKGSTWEGGVRVPTFVYWKGMVQ--PRKSDGIVDLADLFPTALSLAGHPGAEL------- 424

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK--L 483
              + L+  T     + IDG+DQ +S  +    + N    +     + AA+R+D +K  +
Sbjct: 425 ---AKLVPKT-----TFIDGVDQ-TSFFVGDNGQSNRKAEHYFLNGQLAAIRMDEFKYHM 475

Query: 484 VLGTQENGTMDGYYG 498
           V+      T  GY G
Sbjct: 476 VIQQPYAFTQSGYQG 490


>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 460

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 115/343 (33%), Positives = 171/343 (49%), Gaps = 29/343 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP- 149
           G+ DL  +GS  I TP++D +A  G+   + Y A  VCTPSRA+L+TG+YP+ +GM G  
Sbjct: 41  GYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAALLTGRYPVRSGMYGKR 100

Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YL 206
             ++     G+P  E  LPE L+  GY+T  +GKWHLG       PL +GF+  FG  Y 
Sbjct: 101 RVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIHEGS-RPLDQGFDQSFGLPYS 159

Query: 207 NGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVGE-YATDLFTKEAVQLIEDQP 263
           N + +  D       S T  ++G +  + RN         + + T  +T+EAV+ I+ Q 
Sbjct: 160 NDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVHLTGHYTEEAVKFIQ-QK 218

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
             +P FLY+AH   H          P      F+  +      Y   V++LD SVG V+ 
Sbjct: 219 KSQPFFLYMAHSFPH---------VPLFASENFKGRSRAG--IYGDTVEELDWSVGQVLQ 267

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
            L+++G+ EN+++ F SDNG   +   +       GS  P +  K + WEGG++VP I W
Sbjct: 268 TLRKEGLSENTLVFFTSDNGPWLIMGAQG------GSAGPLKDGKGSHWEGGMRVPGIAW 321

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
            P     P VS    +  D   T    AG +  +  + IDG D
Sbjct: 322 WPG-HIKPGVSRTPANSMDLFVTSVALAGAEVPKDRV-IDGTD 362


>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
 gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
          Length = 377

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 86/223 (38%), Positives = 121/223 (54%), Gaps = 19/223 (8%)

Query: 87  LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
           LLIV    GW+D+S++    + TPN+  LA  G+I N  YAQP C+PSR +L+TGK+P  
Sbjct: 28  LLIVADDLGWSDVSWNNPYVV-TPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGKFPFR 86

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
            GMQ   +   +P G+PL E  LP+ L++LGY+T  +GKWHLG  + EYTP  RGF+S +
Sbjct: 87  LGMQRV-MDSKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFY 145

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           GY +G   YY H             G D     ++  D  G Y+T+ F   A  +I    
Sbjct: 146 GYHHGSQDYYTH---------KSARGLDFWDGKTSISDQNGVYSTESFATRAENIISQHD 196

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
            + PLFLYL   + H  +     + P   +  F  I D NR++
Sbjct: 197 PNTPLFLYLPFQSVHTPH-----QVPSSYLQTFSTIQDDNRKS 234


>gi|224064555|ref|XP_002192706.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Taeniopygia guttata]
          Length = 549

 Score =  149 bits (377), Expect = 3e-33,   Method: Compositional matrix adjust.
 Identities = 118/350 (33%), Positives = 171/350 (48%), Gaps = 35/350 (10%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
           AA T +  L L+   GW DL   G     TPN+D +A  G++  + Y A P+C+PSRA+L
Sbjct: 53  AATTTNIVLLLMDDMGWGDLGAFGEPSKETPNLDQMAAEGMLFLDFYTASPLCSPSRAAL 112

Query: 136 MTGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
           +TG+ P+  G     G       P+    G+P +E  LPE L++ GY+ K IGKWHLG  
Sbjct: 113 LTGRLPVRNGFYTTNGHARNAYTPQDIVGGIPDSELLLPELLKKAGYTNKIIGKWHLGH- 171

Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTV 243
           R ++ PL  GF+  FG  N     Y+          R  E+ G    D + +L T    +
Sbjct: 172 RAQFHPLKHGFDEWFGSPNCHFGPYNSTERPNIPVYRDWEMVGRYYEDFKIDLKTGEANL 231

Query: 244 GEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
               T L+ +EA+  I + Q   +P FLY A  A HA                 Q++   
Sbjct: 232 ----TQLYLQEALDFISKQQASQQPFFLYWAIDATHA-----------PVYASKQFLGTS 276

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  Y   V+++DDSVG ++  LQ+ G+ E++ + F SDNGA  +     S  +  GSN 
Sbjct: 277 QRGLYGDAVREIDDSVGKILKHLQQLGISESTFVFFTSDNGAALI-----SAPKQGGSNG 331

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           P+   K T +EGG++ PAI W P       VS Q+  + D   T  + AG
Sbjct: 332 PFLCGKQTTFEGGMREPAIAWWPGHIPAGGVSHQLGSVMDLFSTSLSLAG 381


>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
 gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
          Length = 464

 Score =  149 bits (376), Expect = 4e-33,   Method: Compositional matrix adjust.
 Identities = 118/338 (34%), Positives = 165/338 (48%), Gaps = 34/338 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT-GMQGP 149
           GW DLS +GS EI TPN+D LA +G+     Y A  VC+P+R +L TG+ P  T G    
Sbjct: 59  GWADLSSYGSPEIRTPNLDRLARSGVRFTQGYSAGAVCSPTRVALYTGRLPGRTPGGLPE 118

Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           PI     R G+P T   L   L++ GY+T  IGKWH GF    ++PL  G++  FG  +G
Sbjct: 119 PIGSVNERDGIPATHPTLASLLKKTGYATALIGKWHGGFLP-NFSPLRTGWDYFFGNYSG 177

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKP 267
            + YY      +YS T +   +D+    +   D  G Y TD+   +A   I + +   KP
Sbjct: 178 GVDYYS-----KYSHTGK---YDLFEGETEVPDD-GRYYTDVIADKAEDWITKAKKQGKP 228

Query: 268 LFLYLAHLAAH----AGNAGKHLEAPQETIN----QFQYITDPNR-RTYAAMVKKLDDSV 318
             L L   + H    A    K  E     I     Q  +  D     TY  MV+ LD  +
Sbjct: 229 WLLNLNFTSPHWPWEAPGDRKISEEITAKIKAGNTQAIFHNDGGSLETYTKMVENLDARI 288

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V++AL+R G  EN++++F SDNG     Y           N+P  G K +L EGG++V
Sbjct: 289 GQVLNALRRSGQEENTLVVFASDNGGERFSY-----------NWPLTGNKGSLSEGGIRV 337

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           P IL  P   ++ +VS + +   DW  TL   AG   S
Sbjct: 338 PTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPS 375


>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
 gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
          Length = 482

 Score =  149 bits (376), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 105/327 (32%), Positives = 164/327 (50%), Gaps = 29/327 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           G+ D+  +G  ++ TP+ID +A  G+     YA   VC+ +R +L+TG+Y   +  G++ 
Sbjct: 48  GYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTALITGRYQYRLPVGLE- 106

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            P+      G+P     LP  LR  GY T  +GKWHLG   + + PL RG+++ +G+  G
Sbjct: 107 EPVTTRFTGGLPAEHPTLPGLLRRAGYRTALVGKWHLGSVAK-FGPLQRGYDTFWGFRGG 165

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKP 267
            I Y+ H  +D      +L   D+R       + VG YAT L  + AV+ I       +P
Sbjct: 166 TIDYFSHRGTDDKP---DLWDGDVRV------EEVG-YATHLIGRRAVETIGTAVRAGRP 215

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETI--NQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
            FL L   A H        EA    I  +  ++     + TY  +V ++D  +G V+ AL
Sbjct: 216 FFLSLHFNAPHWPWETPEDEAESRRIAGSNLRHFDGGTQATYHRIVAEMDAQIGRVLDAL 275

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
           ++ G+ +++I++F SDNG              +   +P+ G K  L EGG+++PAI+  P
Sbjct: 276 EQHGIAQDTIVVFTSDNGG-----------ERFSDTWPFTGRKTELLEGGLRIPAIVSWP 324

Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +      VS Q M   DWLPTL  AAG
Sbjct: 325 RRLPAGVVSAQPMITMDWLPTLLAAAG 351


>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
 gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
          Length = 478

 Score =  149 bits (375), Expect = 5e-33,   Method: Compositional matrix adjust.
 Identities = 117/354 (33%), Positives = 179/354 (50%), Gaps = 39/354 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G+ +I TPN+D +A NG+   N Y +Q VC+ SRASL+TG Y    G+ G  
Sbjct: 44  GYGDLGAYGAEDIATPNLDQMAENGVKFTNFYVSQAVCSASRASLLTGTYANRLGIHGAL 103

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              A+  G+   E  + E L+E GY T   GKWHLG     + P  +GF+  +G      
Sbjct: 104 DHTAK-HGLNPEEITIAEMLKEQGYQTAMFGKWHLG-HHPPFLPTKQGFDEFYGIPYSND 161

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            +  H  +  Y   + L  +D    + T  +      T+L T+ AV  I+ +  D+P FL
Sbjct: 162 MWPQHPQTKNYYPPLPLFFND--NVVDTLHNDQSRLTTEL-TERAVNFIQRKK-DQPFFL 217

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLAH   H          P     +F+      R  Y  ++ ++D SVG ++  L+  G+
Sbjct: 218 YLAHPMPH---------VPLFVSEKFR--GKSKRGLYGDVLMEIDWSVGQILKTLRDNGL 266

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI-- 387
            E +++IF+SDNG P + +   +     GS YP R  K T W+GGVKVPA++ W  +I  
Sbjct: 267 EEKTLVIFISDNG-PWLSFSGHA-----GSAYPLREGKGTSWDGGVKVPALMYWKGKIPA 320

Query: 388 ---QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
              Q NP +++ M      LPT+      D  +LP  IDG D  S+++L+  ++
Sbjct: 321 GTVQTNPAMTIDM------LPTIAKLTRADLPKLP--IDGKDI-STMILDAKAK 365


>gi|154501502|ref|ZP_02039203.1| hypothetical protein BACCAP_04855 [Bacteroides capillosus ATCC
           29799]
 gi|150269818|gb|EDM97356.1| arylsulfatase [Pseudoflavonifractor capillosus ATCC 29799]
          Length = 494

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 137/440 (31%), Positives = 195/440 (44%), Gaps = 105/440 (23%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
           ++++Y    G+ DL   G+  I TPNIDALA  G++L N YA  P+C+ SRA L+TG+YP
Sbjct: 73  VVVIYVDDMGYGDLGCTGATAISTPNIDALAEGGVLLTNYYAPAPICSASRAGLLTGRYP 132

Query: 142 IHT----------GMQG------PPIWGAEP---RGVPLTERFLPEYLRELGYSTKAIGK 182
           I T          G+ G        + G  P    G+P  E  LPE L++ GY T  +GK
Sbjct: 133 IRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDEILLPEVLQQAGYETALVGK 192

Query: 183 WHLGFFRREYTPLYRGFESHFGYLNG------VISYYDHILSDQYSRTVELNGHDMRRNL 236
           WHLG  R E  P  RGF+  +G L         I + D ++ D+                
Sbjct: 193 WHLGI-REEERPYNRGFDLFYGALYSDDNDPHRIYHNDEVVHDE---------------- 235

Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQ 295
              +D  G   T   T+ A Q I+D   D P FLY A    H   NA +           
Sbjct: 236 --PYDQSG--MTKELTQVAKQFIDDNQ-DGPFFLYYASPFPHWPSNASE----------- 279

Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
            +++       Y   ++++D SVG ++  L+  G+LEN+++IF SDNG            
Sbjct: 280 -EWLGTSQAGIYGDCMQEVDWSVGEIMDTLEENGLLENTLVIFTSDNGP----------- 327

Query: 356 RNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
             W  G+    RG K+T + GG  VP I + P       V   +M   D  PT+      
Sbjct: 328 --WYDGATGGQRGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVDVFPTI------ 379

Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
                 LN+ G++         P  R   IDG+D W  L   + S R  + +N D  K T
Sbjct: 380 ------LNLLGIE--------LPQDRV--IDGMDMWPFLTGQSDSPRTELFLNKD--KDT 421

Query: 474 AAVRLDSWK-LVLGTQENGT 492
            A+  D++K L     ENGT
Sbjct: 422 FALIEDNFKYLERSYSENGT 441


>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
 gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
          Length = 457

 Score =  149 bits (375), Expect = 6e-33,   Method: Compositional matrix adjust.
 Identities = 116/356 (32%), Positives = 172/356 (48%), Gaps = 54/356 (15%)

Query: 77  AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
           AA TK   + +LI   G  D   +G+    TP+ID LA  G+   + YA PVC+P+RASL
Sbjct: 27  AAPTKPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRFTDAYAAPVCSPTRASL 86

Query: 136 MTGKYP--IHTGMQGPPIWGAEPRG----------VPLTERFLPEYLRELGYSTKAIGKW 183
           MTGK+P  +H     P I    P G          +PL E+ + + L   GY    IGKW
Sbjct: 87  MTGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKW 146

Query: 184 HLGFFRR-EYTPLYRGFE-----SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
           HLG     EY P  RGF+      H G  N    +Y  +   ++     L G+       
Sbjct: 147 HLGEEHGPEYRPQNRGFDRVVLSEHHGIFN---YFYPFVDQQKWPYAGPLPGNP------ 197

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
                 G+Y  D  T EA+  + +   ++P FLYL+H + H    G++  AP+  I +++
Sbjct: 198 ------GDYLPDRLTDEAIDFVREN-RERPFFLYLSHWSVH----GRYF-APESLIAKYR 245

Query: 298 YITDPNR-RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
                 R   YAAM++ +D+SVG +++ L    + +N++ +FMSDNG   +         
Sbjct: 246 ERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGERIT-------- 297

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHIS-DWLPTLYTAA 411
              S  P RG K +L+EGGV+VP I+  P + + P  +  +  IS D  PT    A
Sbjct: 298 ---SMAPLRGSKGSLYEGGVRVPLIVRYPGVVK-PNTTCSVPVISHDLFPTFLDFA 349


>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
 gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
          Length = 471

 Score =  149 bits (375), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 114/354 (32%), Positives = 173/354 (48%), Gaps = 48/354 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+ F+G   I TPN+D LA  GI  N  YA   VC PSR+SLM+G++  HT ++G  
Sbjct: 37  GYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMSGQHTGHTFIRGN- 95

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
             G EP G   +P +   + E L++ GY T A GKW LG    E  P  +GF+  +GY  
Sbjct: 96  -LGVEPEGQYPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPNKQGFDRFYGYNC 154

Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             ++  YY   L D   + V     D+  +         +YA DL  ++A+  IE+   D
Sbjct: 155 QSLAHRYYPTHLWDNSEKVVLTENGDLEYHK--------QYAPDLIQEQALGFIEEHK-D 205

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQF---QYITDPNR------------------ 304
           +P FL+L ++  HA      L  P ++I Q+   ++   P +                  
Sbjct: 206 RPFFLFLPYILPHA-----ELLVPDDSIFQYYKGKFEEKPFKGFDYGKGAKSGGYASQQY 260

Query: 305 --RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
              T+AAMV +LD  VG +   L+  G+ EN++I+F SDNG P  E     N+  + S+ 
Sbjct: 261 PHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNG-PHKEGGADPNF--FQSSG 317

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
             RG+K  L+EGG++VP +   P+  +  + S  +    D  PT     G  T+
Sbjct: 318 GLRGIKRDLYEGGIRVPFVAKWPKAIRAGQQSEFIGAFWDIFPTFAELGGAATA 371


>gi|170727990|ref|YP_001762016.1| sulfatase [Shewanella woodyi ATCC 51908]
 gi|169813337|gb|ACA87921.1| sulfatase [Shewanella woodyi ATCC 51908]
          Length = 484

 Score =  148 bits (374), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 114/346 (32%), Positives = 164/346 (47%), Gaps = 63/346 (18%)

Query: 80  TKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV-CTPSRAS 134
           +K     ++I+Y    G  D   +GS + PTPNID LA +G+     YA    C PSRAS
Sbjct: 33  SKLKQANVVIIYVDDLGIMDTGIYGSAQYPTPNIDKLANSGVRFTQAYANAANCAPSRAS 92

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLP---------------EYLRELGYSTKA 179
           LMTG  P   G+      G+  RG     + +P               +  ++ GY+T  
Sbjct: 93  LMTGLTPAEHGIL---TVGSSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAV 149

Query: 180 IGKWHLGFFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
           IGKWHLG    +  P   GF++     +L    SY+ +  S    + + L    ++    
Sbjct: 150 IGKWHLG----KTAPTEYGFDTAIAASHLGHPPSYF-YPYSKGKRKLIGLEEGGLK---- 200

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF- 296
                  EY ++  T+EAV  I  Q   +P FLYL   A H       +EAP+E +NQ  
Sbjct: 201 ------DEYLSNRITREAVNYISSQ--RQPFFLYLPFYAVHTP-----IEAPKEWVNQHN 247

Query: 297 --QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
             Q   +     YAAM+  LD  VG ++ AL + G  EN++++F SDNGA          
Sbjct: 248 ARQQAGEIKSAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGA---------- 297

Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNP--RVSLQM 397
           Y    S+ PYRG K++L+EGG+K+P +L W  QI  N   R  +QM
Sbjct: 298 YDPATSSLPYRGYKSSLFEGGIKIPLVLSWPKQIPPNSQNRTPVQM 343


>gi|372210887|ref|ZP_09498689.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 476

 Score =  148 bits (374), Expect = 7e-33,   Method: Compositional matrix adjust.
 Identities = 133/453 (29%), Positives = 203/453 (44%), Gaps = 85/453 (18%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G  DL  +G  +I TPN+DALA  GI     Y+   VC PSR+SLMTG++  HT ++G  
Sbjct: 42  GIGDLGTYGQKKIKTPNLDALAAGGIKFTQHYSGSAVCAPSRSSLMTGQHTGHTPIRGNM 101

Query: 149 --PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
               I G +P  +P +   L E L++ GY+T   GKW LGF   E  P  +GF+  +GY 
Sbjct: 102 ELKGIGGQQP--LPESSITLAEMLKKAGYTTGMFGKWGLGF--GEGDPNKQGFDEFYGY- 156

Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
                  D  L+ +Y  T   +  ++       W +   YA D   ++ ++ I+    DK
Sbjct: 157 ------NDQKLAHRYYPTHLNHNQEIDSLEGNDWTSTEVYAQDKIQQKTLEFIQTNK-DK 209

Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQ-------------FQYITD--PNR------- 304
           P F Y+  +  HA      L +PQ++I Q               Y++D  PN        
Sbjct: 210 PFFAYVPMVLPHA-----ELISPQDSILQQYVGKFDEVPYKNHPYLSDYGPNMVYYKYCS 264

Query: 305 -----RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
                  YA+MV ++D  +G ++S L++ G+  N+II+F SDNGA        +N   + 
Sbjct: 265 QETPYAVYASMVTRMDAYIGQIVSKLKKLGIDNNTIIMFSSDNGAHAA---GGANPEFFD 321

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           S    RG+K  L+EGG++ P I+  P    +  V+  +    D +PTL    G    + P
Sbjct: 322 STDGLRGIKRDLYEGGIRAPFIVAWPGTIPSGVVTDHISAFWDVMPTLADIVG---IKAP 378

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
              DG+    +LL    +++  NI     W                NI   ++  AVRL 
Sbjct: 379 KQSDGISFLPTLL----NQKQQNIHEYMYWE--------------YNIKGGRK--AVRLG 418

Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNA 512
            WK V+          Y    + +K+ L N  A
Sbjct: 419 KWKGVI----------YKANNKKSKIELYNLEA 441


>gi|212379227|gb|ACJ24843.1| n-acetylgalactosamine-6-sulfate sulfatase-like protein
           [Streptomyces pactum]
          Length = 524

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 109/342 (31%), Positives = 160/342 (46%), Gaps = 43/342 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
           GW DLS +GS  I TP++D LA  G+   + YA    C+P+R SL TG++P  T G    
Sbjct: 116 GWADLSSYGSPHIHTPHLDRLARQGVRFTHAYAGSSTCSPTRFSLYTGRFPGRTPGGLHE 175

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
           PI G    G+P     L   LR  GY+T  IGKWH G+   +++P   G+E+ FG   G 
Sbjct: 176 PIPGGSDAGLPPNHPTLASLLRGAGYATALIGKWHCGYLP-DHSPTKSGWETFFGNFGGA 234

Query: 210 ISYYDHI-LSDQYS-RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           + YY  + L+ +Y     E++  D+R            Y T + T+ A + I  +   KP
Sbjct: 235 LEYYSKLGLTGEYDLYEGEVSHQDLR------------YYTRIITERAAEYI-GRDHRKP 281

Query: 268 LFLYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
             L L     H           +      ++A Q       +    +   Y  +V+ LD 
Sbjct: 282 WLLNLNFTTPHWPWIAEGDTAESARVTARIKAGQR--GALNHRDGGSLEKYRELVEDLDR 339

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           SVG V++AL+R G  EN++++F SDNG     Y            +P  G K TL EGG+
Sbjct: 340 SVGEVLAALRRSGQEENTLVVFASDNGGERFSY-----------QWPLSGEKFTLLEGGI 388

Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
           +VP I+ W  +I    +VS + ++  DW  TL    G    R
Sbjct: 389 RVPTIVRWPARIDGGAQVSHEPVYTPDWTATLLEVGGARPDR 430


>gi|418459428|ref|ZP_13030547.1| n-acetylgalactosamine-6-sulfate sulfatase [Saccharomonospora azurea
           SZMC 14600]
 gi|359740510|gb|EHK89351.1| n-acetylgalactosamine-6-sulfate sulfatase [Saccharomonospora azurea
           SZMC 14600]
          Length = 447

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 167/350 (47%), Gaps = 37/350 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
           GW DL  +GS  I TPN+D LA  G+   + YA  P C+ +R SL TG+YP  +  G++ 
Sbjct: 10  GWKDLGCYGSTAIRTPNLDRLAAEGLRFTHGYAGSPWCSSTRISLYTGRYPGRLPAGLEE 69

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           P +  +E  G+P     LP  L   G+ T   GKWH G+    Y+PL  GF++ FG L+G
Sbjct: 70  PLVTRSEGHGIPADHPTLPSLLAGAGHRTAMFGKWHCGWLPW-YSPLRIGFQTFFGNLDG 128

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
            + Y++H+  D   R     G        T  + VG Y TDL +  A + +       P 
Sbjct: 129 ALDYFEHV--DTLGRPDLYEGE-------TPVEQVG-YYTDLISDRAAEYVAAD-HGAPF 177

Query: 269 FLYLAHLAAH-------AGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
           +L L + A H         + G+ + A  +   T +   ++   +   Y  +V+ +D  +
Sbjct: 178 YLQLNYTAPHWPWEGPGDSDVGRDIRARYDREWTRSPLMHLDGGSVAKYGELVEAMDAGI 237

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V++AL   G  +N+I++F SDNG              W  N+P+ G K  L EGG++V
Sbjct: 238 GRVLNALAAAGRSDNTIVVFTSDNGG-----------ERWSKNWPFVGEKGDLTEGGIRV 286

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
           P +L  P      + S Q +   D   TL  A G + + LPL+   L  W
Sbjct: 287 PLLLRWPAALAPGQESDQPVVTMDLTATLLDACGVEPA-LPLDGVSLAPW 335


>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 486

 Score =  148 bits (374), Expect = 8e-33,   Method: Compositional matrix adjust.
 Identities = 113/363 (31%), Positives = 174/363 (47%), Gaps = 39/363 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ DL+ +G+ +I TP ID +A  G+  N+ Y   VC+P+RASLMTG + I  G+ G  +
Sbjct: 41  GYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTGSHSIRVGIGG-VM 99

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
           +     G+   E  LPE L++ GY+T  IGKWHLG     + P+  GF+  +G       
Sbjct: 100 FPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHGFDYWYGTPASNNQ 158

Query: 212 YY---------DHILSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYATDLFTKEAVQ 257
           +Y         D +  + Y+R   L         +R N+           T  +T+E ++
Sbjct: 159 FYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQSQFTQRYTRETIR 218

Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
            I +   ++P F+YLAH   H       L A +      +++   +R  Y   +++LD S
Sbjct: 219 FITENH-EQPFFIYLAHNMPHI-----PLHASE------RFVGSSDRGIYGDTIQELDWS 266

Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
            G ++ AL+  G+  N+++IF SDNG          N    GS  P +G K + +EGGV+
Sbjct: 267 TGEILRALKELGLDSNTLVIFTSDNG---------PNTGKGGSAGPLKGGKGSTYEGGVR 317

Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
           VP +   P        S + M   D  PTL   AGGD     + IDG D  S LL   P+
Sbjct: 318 VPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAGGDVPDDRV-IDGKD-ISPLLTGKPN 375

Query: 438 RRN 440
            ++
Sbjct: 376 AKS 378


>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
 gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
          Length = 476

 Score =  148 bits (373), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 125/397 (31%), Positives = 188/397 (47%), Gaps = 56/397 (14%)

Query: 77  AALTK-STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
           AA TK +  L L    G+ D+  +G  +I TP+IDALA  G+     Y    VC PSR+S
Sbjct: 23  AAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEGLRFTRFYCGTAVCAPSRSS 82

Query: 135 LMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
           L+TG++  HT ++G    G +P G   +P     +   L+  GYST   GKW LGF    
Sbjct: 83  LLTGQHTGHTPIRGN--IGVKPEGQWPLPGNTVTIATLLKSAGYSTGCFGKWGLGFIGTS 140

Query: 192 YTPLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
             PL +GF+  +GY    ++   + DH+ +++  + VEL+    ++ +         Y+ 
Sbjct: 141 GDPLKQGFDRFYGYNCQTLAHDYFPDHLWNNE--QRVELSNTHEKQTI---------YSA 189

Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR--- 305
            L  ++A+Q I  Q   +P FL LA+   HA      L+ P++      Y    N +   
Sbjct: 190 QLIHEQAMQFIR-QHQRQPFFLLLAYTLPHAA-----LQLPKDDTLYNYYKKLFNEKPVP 243

Query: 306 ------------------TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
                              YAAMV KLD  VG V   L+  G+ +N++I+F SDNG    
Sbjct: 244 VAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILFASDNGP--- 300

Query: 348 EYRETSNY-RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
            +RE  N    + SN   RG+K  L+EGG++ P I W P   Q    S Q+    D LPT
Sbjct: 301 -HREGGNDPAFFNSNGGLRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQVGAFWDLLPT 359

Query: 407 LYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
               AG     +P N DG+    +LL     +++ ++
Sbjct: 360 FAALAG---IGMPANNDGISMLPTLLARGKQQQHDHL 393


>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
 gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
          Length = 477

 Score =  148 bits (373), Expect = 9e-33,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 170/367 (46%), Gaps = 40/367 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G   I TPN+D +A  G    N Y    VCTPSRA++MTG+ P+  GM    
Sbjct: 43  GYGDLSSYGHPTIKTPNLDKMAMEGQKWTNFYVGSSVCTPSRAAIMTGRLPVRNGMTSKV 102

Query: 151 IWGAEP---RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
               +P    G+P +E  + E L++ GY T A GKWHLG  + EY P+  GF+  +G   
Sbjct: 103 HRVLQPWSTNGLPQSEITIAEQLKKGGYGTAAFGKWHLGD-KEEYLPVNNGFDYWYGVPY 161

Query: 206 ---------LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
                    +N    Y++   S++       N   MR              T  +T EA+
Sbjct: 162 SNDMDITLEMNEPQDYWNFWKSEERKEISSFNIPLMRNTEIIERPANQHTITKRYTNEAI 221

Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
           + I++   + P F+YLA+   H          P     +F+      R  Y  +V+++D 
Sbjct: 222 EYIKEHK-ENPFFIYLAYNLPHV---------PLFASEEFE--GKSKRGLYGDVVEEIDH 269

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           +VG V+ AL+ +G+ EN+I++F SDNG   V   E       GS    R  K + WEGG+
Sbjct: 270 NVGRVMEALKEQGIAENTIVVFTSDNGPWLVMDEEG------GSAGLLRDGKGSTWEGGM 323

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQWSSLLLN 434
           + P I WSP     P +  ++    D   T    AG     LP +  +DG D  S+L   
Sbjct: 324 REPGIFWSPG-NLKPGIVTELGSTMDLFTTFSKLAG---VELPDDRILDGEDLSSTLFEG 379

Query: 435 TPSRRNS 441
             S R +
Sbjct: 380 KESPRET 386


>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
 gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
          Length = 470

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 102/352 (28%), Positives = 168/352 (47%), Gaps = 38/352 (10%)

Query: 69  TDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-Q 125
           T  + +    L K   +  ++    G+ DLS +G   I TP ID +  +G+ L   Y+  
Sbjct: 22  TGASGKAAVKLNKKPNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNS 81

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           PVC+ +R ++MTG+Y     +           G+P ++  LP  L++ GY T  IGKWHL
Sbjct: 82  PVCSATRTAIMTGQYQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHL 141

Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
           G + + Y PL  G++  +G+    +SYY+H             G D   + +   +  G 
Sbjct: 142 GAYPK-YGPLKSGYDHFYGFRGSALSYYNH-------------GKDFWEDDAPV-EKAG- 185

Query: 246 YATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAH-----AGNAGKHLEAPQETINQFQYI 299
           Y TDL   + V+LI+     ++P F  +   A H      G  GK   A ++  N     
Sbjct: 186 YFTDLLGDKTVELIQKSDSCERPFFASVHFNAPHWPWEAPGEQGKKEAARKK--NSLAAF 243

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
               +  Y  +V+++D  VG ++ AL  K + EN+I++F SDNG              + 
Sbjct: 244 DAGTQDVYRQIVERMDFQVGRILQALTDKKIRENTIVVFTSDNGG-----------ERFS 292

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
           + +P+ G K+ L EGG+++P+++  P    + + S Q+    DWLPTL  AA
Sbjct: 293 NTWPFSGKKSELLEGGIRIPSLICWPAKIPSGQTSEQVTLSMDWLPTLLEAA 344


>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
 gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
           CL03T12C04]
          Length = 453

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 116/345 (33%), Positives = 175/345 (50%), Gaps = 47/345 (13%)

Query: 79  LTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
           + KST   LLI+    G  DLS   + ++ TPNID +   G+ L+N YA   V +PSRA+
Sbjct: 26  VDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAA 85

Query: 135 LMTGKYPIHTGMQG---PPI---WG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           L+TG++P   G+ G   P +   WG  +P  V      +PE L+  GY T  IGKWHLG 
Sbjct: 86  LLTGRFPAMVGVPGVIRPSVDQNWGYFDPSAVT-----MPEVLKNGGYRTALIGKWHLG- 139

Query: 188 FRREYTPLYRGFESHFGYLNGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
           +     P  RGF+   G+L  ++  YY H    Q    + LN  ++        D  G +
Sbjct: 140 WESPNLPNERGFDHFHGFLADMMDDYYTH--RRQGGNYMYLNDKEI--------DPKG-H 188

Query: 247 ATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP--- 302
           AT+LFT  +V  I+ +  +K P FLYLA+ A H+      L+ P E +N+ Q        
Sbjct: 189 ATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPV 243

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R    A+++ LD ++G VI +L+  G L N+++IF SDNG          +  +  +N 
Sbjct: 244 KRARLIALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGG---------DRGSMANNG 294

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
           P RG K  ++EGG+ V   L  P + +  R     + + D +PT+
Sbjct: 295 PTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339


>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
 gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
          Length = 479

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 123/410 (30%), Positives = 188/410 (45%), Gaps = 58/410 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+  +G   I +PN+D +A  GI   N YA   VCTPSRA L+TG+ P+ +GM    
Sbjct: 49  GYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQ 108

Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
           I    P    G+P TE  + + L+E  Y T  +GKWHLG     + PL  GF+ +FG   
Sbjct: 109 IRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLGHLPG-FQPLDHGFDEYFGIPY 167

Query: 206 -----LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
                L   +SY   I    +++  + N   M+              T  +T+EAV  I+
Sbjct: 168 SNDHDLKKELSYIQTI---THAKDGDFNVPLMQNRSIIERPANQNTITKRYTQEAVSFIK 224

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
               ++P FLYLAH   H          P    +QF+  +D  R  Y  +++++D SVG 
Sbjct: 225 KNS-NQPFFLYLAHSMPHV---------PLFASDQFRGSSD--RGLYGDVIEEIDWSVGQ 272

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           V+S L  +G+ EN++++F SDNG   +         + GS    +  K T +EGG++ PA
Sbjct: 273 VLSTLSEQGISENTLVVFTSDNGPWLIM------GAHGGSAGLLKSGKGTSYEGGMREPA 326

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
           I W P+ +  P V+       D  PT+ + AG                    ++ PS R 
Sbjct: 327 IFWWPE-KIKPAVAHNTASTLDLFPTIMSIAG--------------------IDMPSDR- 364

Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
            + DG D   ++     + R ++     +K    AVR   WK+   T  N
Sbjct: 365 -SYDGYDLSPTMFEQKSNERKNIFYYHGDK--IFAVRQGDWKVHFKTVAN 411


>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
 gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
          Length = 469

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 110/336 (32%), Positives = 161/336 (47%), Gaps = 30/336 (8%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL + GS  IPTPNID LA  G+     Y     C PSRA  +TG+Y    G +  P
Sbjct: 43  GYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNP 102

Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            +       GVP +ER   E L+ +GY T  +GKWH+G     ++  +        +   
Sbjct: 103 PYDRHNTIAGVPASERLFAERLQAVGYKTGIVGKWHIG----SHSIHHPNNRGFDFFFGF 158

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           +   +D    D      E     M RN S+    V  Y T   T EA+  IE    D P 
Sbjct: 159 LGGGHDFFRVDTREPMDEGYLDPMMRNGSSV--DVEGYLTTQLTDEAIGFIERNEKD-PF 215

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
           FL+L++ A HA      L+AP+E+I +F ++    RR Y+AMV ++D  +G ++S L+ +
Sbjct: 216 FLFLSYNAPHAP-----LQAPEESIAKFSHVEGKERRVYSAMVYEMDLEIGRILSMLEDR 270

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP--- 385
            + E +II F+SDNG P    +   +Y    SN  +RG K   ++GGV VP + + P   
Sbjct: 271 DLAEETIIFFLSDNGGPPHWDKSQDSYT---SNGAFRGYKGDTYDGGVHVPFLAYWPGTL 327

Query: 386 ---QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
              +   +P VSL +        T     GGD S +
Sbjct: 328 PAGKQFTSPVVSLDIAR------TAVALGGGDESEM 357



 Score = 47.8 bits (112), Expect = 0.016,   Method: Compositional matrix adjust.
 Identities = 31/87 (35%), Positives = 45/87 (51%), Gaps = 8/87 (9%)

Query: 1   MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
           M RN S+    V  Y T   T EA+  IE    D P FL+L++ A HA      L+AP+E
Sbjct: 182 MMRNGSSV--DVEGYLTTQLTDEAIGFIERNEKD-PFFLFLSYNAPHAP-----LQAPEE 233

Query: 61  TINQFQYITDPNRRTYAALTKSTTLTL 87
           +I +F ++    RR Y+A+     L +
Sbjct: 234 SIAKFSHVEGKERRVYSAMVYEMDLEI 260


>gi|32477899|ref|NP_870893.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
           1]
 gi|32448456|emb|CAD77971.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
           1]
          Length = 1012

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 140/484 (28%), Positives = 209/484 (43%), Gaps = 68/484 (14%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
           ++   R   A  TK   + +L    G+ DLS  G+  + TP ID +A  G  L + Y A 
Sbjct: 557 VSPAGREKTAETTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 616

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
           PVCTPSRA LMTG YP    M     +G     +P+G+   E  + E L+  GY T   G
Sbjct: 617 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 676

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
           KWHLG  + E+ P  +GF+  FG        Y H +   + R    +   +   + +   
Sbjct: 677 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPPLPLLQNDTVI 729

Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
             D   ++ T   T++AV  IE    D+P FLYL H   HA          G A   + A
Sbjct: 730 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 788

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
            ++      Y T  N   +   + ++D SVG ++ AL+  G+ E ++++F SDNG P   
Sbjct: 789 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 843

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
             + + Y + G     RG K T +EGG++ P ++ W  QI      + ++M   D LPT 
Sbjct: 844 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 897

Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              AG                       P+ R   IDG D W +L   T +  ++   + 
Sbjct: 898 AKLAGA--------------------AIPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 934

Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
               + AAVR   WKL +    NG     Y        KV ++  N  V  K    L+  
Sbjct: 935 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQHQLKDF 990

Query: 526 SQNI 529
           + +I
Sbjct: 991 AADI 994



 Score =  105 bits (262), Expect = 8e-20,   Method: Compositional matrix adjust.
 Identities = 119/462 (25%), Positives = 186/462 (40%), Gaps = 79/462 (17%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
           T  A  +   + L+ V   G+ DL  +G+ ++ TPNID LA  G    + + A  VCTPS
Sbjct: 32  TSVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPS 91

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
           R  L+TG+YP+   M G  IWG  P    L      + + +  +  GY+T  +GKWHLGF
Sbjct: 92  RYGLLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGF 150

Query: 188 FRREYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR 233
                         P   GF+ +FG   +N    Y     D I     S  +   G  + 
Sbjct: 151 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVS 210

Query: 234 --------------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHL 275
                            S A      Y  +    L T+ AV+ I ++  ++P FLY A  
Sbjct: 211 PTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KNEPFFLYFATP 269

Query: 276 AAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSI 335
             H          P     +F+  +      Y   V +LD  VG ++ +L+  G+ +N++
Sbjct: 270 NIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTL 318

Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVS 394
           ++F SDNGA        +       N    G K  +WEGG +VP I  W  +I+      
Sbjct: 319 VLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK------ 372

Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLL 453
                           AG  + +L   +D    +S+L     PS      D ++   +LL
Sbjct: 373 ----------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALL 414

Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
            +      + L+    + R  A+R   W L +G + +G  +G
Sbjct: 415 DDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 455


>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
 gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
          Length = 461

 Score =  148 bits (373), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 118/343 (34%), Positives = 162/343 (47%), Gaps = 49/343 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +GS  I TPNID LA  G    + Y A PVC+PSRA+LMTG+YP  T ++ P 
Sbjct: 39  GYGDLGCYGS-PIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGRYP--TRVEVPV 95

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           + G    G+P +E  + + L+  GY T  IGKWH+G     Y P  RGF+  FG     +
Sbjct: 96  VLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGS-TPGYLPTNRGFDEFFG-----V 149

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            Y   I      R   +        ++ A D      T  FT+EA+  +  +  D P FL
Sbjct: 150 PYSADITPCPLMRGSSV--------VAPAVDC--STLTSSFTQEALDFMR-RAQDNPFFL 198

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           YLAH A H   A     A Q  +             YA +V++LD S G V++AL+  G+
Sbjct: 199 YLAHTAPHLPLAASPRFAGQSGLGM-----------YADVVQELDWSTGQVMAALKATGL 247

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
             N++++F SDNG              W  GS    RG K   +EGG++ P +   P + 
Sbjct: 248 DSNTLVMFSSDNGP-------------WYQGSQGKLRGRKGETYEGGMREPFLARYPGVI 294

Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
            +      +    D LPTL   AG  T   PL  DG+D W  L
Sbjct: 295 PSGIGCAGLATTMDLLPTLARLAGAQTPSNPL--DGVDIWPVL 335


>gi|160891517|ref|ZP_02072520.1| hypothetical protein BACUNI_03968 [Bacteroides uniformis ATCC 8492]
 gi|317478374|ref|ZP_07937538.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|156858924|gb|EDO52355.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
 gi|316905533|gb|EFV27323.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 448

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 133/410 (32%), Positives = 186/410 (45%), Gaps = 83/410 (20%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G+  I TPNID +A  G+     Y    V TPSR++LMTG+ P+  G+ G  
Sbjct: 43  GYGDLGCYGNPTIHTPNIDRMASEGMKFTQFYVGASVSTPSRSALMTGRLPVRNGLYGDK 102

Query: 151 ---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
              ++     G+   E  + + L++ GY+T  +GKWHLG F   Y P   GF+  FG   
Sbjct: 103 RDVLFPDSKAGLGQDEVTVAKLLQQNGYATACVGKWHLGHFS-PYLPTDHGFDDFFG--- 158

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA------TDLFTKEAVQLIED 261
             I Y                 +DMR       D+V E        T  +T+ AV  I+ 
Sbjct: 159 --IPY----------------SNDMRPASLMRGDSVLEIKPEQGELTRRYTEYAVDYIKK 200

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              ++P FLYLAH   H          P  T  +F+  +  NR  Y  +V++LD SVG V
Sbjct: 201 NK-NRPFFLYLAHTFPHT---------PLHTNERFEGRS--NRGLYGDVVEELDWSVGEV 248

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+  G+ EN+++IF SDNG   V+        N GS    R  K+T WEGG++VPA+
Sbjct: 249 LKALKENGLDENTLVIFTSDNGPWLVQ------RLNGGSAGALRQGKSTCWEGGLRVPAV 302

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL----NIDGLDQWSSLLLNTPS 437
              P  +  P     MM   D LPT    AG     +PL     +DG+DQ   L      
Sbjct: 303 FRMPG-RIAPCTQQGMMATMDILPTFLNMAG-----IPLPEGVTLDGMDQTPMLW----E 352

Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
           +R S+ D +  W          R S L+         AVR   WK+   T
Sbjct: 353 QRESSRDEMYYW----------RGSHLL---------AVRKGPWKIHFST 383


>gi|87307904|ref|ZP_01090047.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
 gi|87289518|gb|EAQ81409.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
           marina DSM 3645]
          Length = 459

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 113/346 (32%), Positives = 176/346 (50%), Gaps = 27/346 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL   G  +I TPN+D +A  G+ L + YA   VC PSR  LMTG +  H  ++G  
Sbjct: 26  GYGDLGCFGQQKISTPNLDQMAQEGMRLTDFYAGCTVCAPSRCVLMTGLHTGHCFIRGNA 85

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG-- 208
                P  V + E      L+  GY T  +GKW LG       P  +GF+  +GYL+   
Sbjct: 86  RDNLRPSDVTVAE-----VLKAAGYQTALMGKWGLGHEGSNGVPTRQGFDYFYGYLDQGH 140

Query: 209 VISYYDHILSDQYSRT----VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
             +YY   L     R     V LN  +  + +  A + V EY+ DL  +EA+  ++    
Sbjct: 141 AHNYYTTFLMRNEERVALKNVPLNEDESGKGV--AKEKV-EYSHDLIMEEALAFVDRAAK 197

Query: 265 -DKPLFLYLAHLAAHAGN-AG-KHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
            D P FLYL+    HA N AG K +E P       +   +P ++ +AAM+ ++D  +GT+
Sbjct: 198 KDAPFFLYLSLTIPHANNQAGDKGMEVPDYGQYADKDWPEP-QKGHAAMISRMDRDLGTL 256

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPA 380
            + L++ G+ +N+++IF SDNG     ++E  N  ++  SN P +G+K +L EGG++VP 
Sbjct: 257 FAQLKQDGIDDNTLVIFTSDNGP----HKEGGNDPDFADSNGPLQGIKRSLHEGGIRVPT 312

Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           I+ W   ++ +           D +PTL  A  G +  +P  IDG+
Sbjct: 313 IVRWPGHVKADSESDFAGA-FWDVMPTL-AAVAGVSKEVPAGIDGI 356


>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
           niloticus]
          Length = 525

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 119/383 (31%), Positives = 175/383 (45%), Gaps = 42/383 (10%)

Query: 78  ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           A T +  + L+   GW DL   G     TPN+DA+A  G++L N Y A P+C+PSRA+L+
Sbjct: 28  APTPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALL 87

Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           TG+ PI  G        A  R          G+   E  LP+ L+  GY +K +GKWHLG
Sbjct: 88  TGRLPIRNGFYTT---NAHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLG 144

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDM----RRNLSTAWD 241
             R +Y PL  GF+  FG  N     Y+    DQ    + + N  +M      +     +
Sbjct: 145 H-RPQYLPLKNGFDEWFGSPNCHFGPYN----DQSKPNIPVYNNSEMLGRFYEDFKIDRN 199

Query: 242 TVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
           T     T ++  E +  I  Q    +P FLY A  A HA                 +++ 
Sbjct: 200 TGESNLTQIYLMEGLDFILRQTKAQQPFFLYWAVDATHA-----------PVYASKRFLG 248

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
              R  Y   V +LD S+G ++  L+  G+  N+ + F SDNGA  +     S     GS
Sbjct: 249 KSQRGRYGDAVMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGAAVMSGPNES-----GS 303

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           N P+   K T +EGG++ PAI W P   +   V+ Q+ ++ D   T    AG +     +
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLVLAGINPPDDRI 363

Query: 421 NIDGLDQWSSLLLNTPSRRNSNI 443
            +DGLD    LL ++   +N  I
Sbjct: 364 -LDGLDLTPVLLNSSQPLKNRPI 385


>gi|436835023|ref|YP_007320239.1| sulfatase [Fibrella aestuarina BUZ 2]
 gi|384066436|emb|CCG99646.1| sulfatase [Fibrella aestuarina BUZ 2]
          Length = 485

 Score =  147 bits (372), Expect = 1e-32,   Method: Compositional matrix adjust.
 Identities = 151/532 (28%), Positives = 224/532 (42%), Gaps = 103/532 (19%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASL 135
           AA   +  L  +   G+ DLS  G+    TPN+D +A  G    N + AQPVCT SRASL
Sbjct: 33  AAQKPNVVLFFMDDLGYGDLSCTGAMGYLTPNLDRMAAEGTRFTNFLAAQPVCTASRASL 92

Query: 136 MTGKYPIHTGMQGPPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           +TG YP   G+ G   +G +   G+   E  L E L+E GY+T   GKWHLG  ++ + P
Sbjct: 93  LTGCYPNRIGIAG--AYGPSSTVGLNPDEETLAELLKEQGYATGIFGKWHLG-SKQAFLP 149

Query: 195 LYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
           L  GF+ +FG  Y + +  ++      +Y     + G+     +    D  GE  T   T
Sbjct: 150 LQHGFDEYFGVPYSHDMWPHHPDQAHAKYPPLYLIEGNQPTNEVKYLTD-AGEL-TAAIT 207

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
             AV  I  Q    P FLY+ H   H          P     QF+  +   R  +  ++ 
Sbjct: 208 DRAVSFIR-QHRKAPFFLYIPHPLPH---------VPLAASPQFRGKS--RRGLFGDVLM 255

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           +LD SVG V+  L+ +G+ +N++++FM DNG P + Y + +     GS    R  K T +
Sbjct: 256 ELDWSVGQVMQELKAQGLDKNTLVLFMGDNG-PWLNYGDHA-----GSTAGLREGKGTTY 309

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG +VP ++  P +    RVS +++   D LPT+    G    + P  IDG+D W++LL
Sbjct: 310 EGGNRVPCLVRWPGVVPAGRVSNKLLTTMDVLPTVAKLCGARQPKRP--IDGVD-WTALL 366

Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
               ++                 TP  R         K    AVR   WKLV        
Sbjct: 367 KGDDTQ-----------------TPRDR---FYYYYRKNNLEAVRKGDWKLVFAH----- 401

Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
                                   +TY+    L +N                        
Sbjct: 402 ----------------------PGRTYEGF-PLGRN-----------------------G 415

Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
            P P T +   N P  L++L +DP E+ ++ +  PDI   L +L    R  L
Sbjct: 416 QPGPSTEN--FNFPTALYDLAHDPGERYDVQAQHPDIVQSLQKLADEARADL 465


>gi|16125424|ref|NP_419988.1| sulfatase [Caulobacter crescentus CB15]
 gi|13422494|gb|AAK23156.1| sulfatase family protein [Caulobacter crescentus CB15]
          Length = 521

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 158/350 (45%), Gaps = 77/350 (22%)

Query: 92  GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G+ND++ +G       +PTP ID++A  G+   N Y+    C PSRA++MTG+Y    G 
Sbjct: 44  GYNDITLNGGGVAGGTVPTPAIDSIAKEGVTFANGYSGNATCAPSRAAIMTGRYATRFGF 103

Query: 147 QGPPIWGAEPR------------------------------GVPLTERFLPEYLRELGYS 176
           +  P   A  R                               VP  E  + E L+  GY 
Sbjct: 104 EFTPTPVAFSRVVGGHAGDPLHPSRFNQAEVKNMPKDENVLAVPAAEVTIAEALKTKGYH 163

Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----------------YDHILSD 219
           T  +GKWHLG  +    P  +GF+   G++ G   +                  D  L  
Sbjct: 164 TIHLGKWHLGGVKGS-RPEDQGFDESLGFMAGAALFAPVGDPGVVESRQDWDPIDKFLWG 222

Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
                V+ NG  +             Y TD  T EAV+ I D   ++P F+YLA+ A H 
Sbjct: 223 AAPFAVQFNGGKLF--------NPSHYMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT 273

Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
                 L+AP+   +   +I D   R YAAMV+ LD +VG V+ AL+ +G+ +N+++IF 
Sbjct: 274 -----PLQAPKADYDALSHIKDHRMRVYAAMVRNLDRNVGKVLQALKDRGLDDNTLVIFT 328

Query: 340 SDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
           SDNG         +NY      N PYRG K T +EGG+KVP +L W  Q+
Sbjct: 329 SDNGG--------ANYIGLPDINKPYRGWKATFFEGGIKVPFLLRWPEQL 370



 Score = 43.5 bits (101), Expect = 0.32,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD  T EAV+ I D   ++P F+YLA+ A H       L+AP+   +   +I D   R
Sbjct: 241 YMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT-----PLQAPKADYDALSHIKDHRMR 294

Query: 75  TYAALTKS 82
            YAA+ ++
Sbjct: 295 VYAAMVRN 302


>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
           [Saccoglossus kowalevskii]
          Length = 518

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 122/372 (32%), Positives = 173/372 (46%), Gaps = 41/372 (11%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L L+   GW DL   G+    TPN+D +A  G ++ + YA  P+C+PSRASL+TG+ PI 
Sbjct: 31  LMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGRLPIR 90

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G             P+    G+P +E  LPE L + GY +K IGKWHLG  + +Y PL 
Sbjct: 91  NGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQYHPLK 149

Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEY-ATDLFTK 253
            GF+ +FG  N  +  YD+         R  ++ G   R       D  GE   T +F +
Sbjct: 150 HGFDEYFGAPNCHVGPYDNKKQPNIPVYRDADMIG---RYYEEFKIDKSGESNLTQMFIE 206

Query: 254 EAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
           EA+  IE Q    +  FLY    A+H         +P      F  +    R  Y   V 
Sbjct: 207 EAIAFIEKQHQTGEQFFLYWTPDASH---------SPHYASKSF--LGTSQRELYGDAVM 255

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
           +LD  VG ++  L+   +  N+ + F SDNGA        S     GSN P+   K+T +
Sbjct: 256 ELDYGVGQILGKLKELQIENNTFVFFSSDNGA------ALSGGAYVGSNGPFLCGKHTTF 309

Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
           EGG++ P+I W P   +  +VS Q+  I D   T    AG     + L  D +   +SLL
Sbjct: 310 EGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAG-----VTLPTDRIIDGTSLL 364

Query: 433 LNTPSRRNSNID 444
              P+  N  ID
Sbjct: 365 ---PALVNGKID 373


>gi|323452998|gb|EGB08871.1| hypothetical protein AURANDRAFT_71524 [Aureococcus anophagefferens]
          Length = 495

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 108/361 (29%), Positives = 167/361 (46%), Gaps = 48/361 (13%)

Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI----------WGAEP 156
           P++ ALA +G+ L N Y Q  CTP+RA+LM+GK+   TG     +          WG   
Sbjct: 16  PHMLALAQDGVRLTNYYGQAFCTPARAALMSGKFGHRTGFANVDVDSAATLEISAWGNFS 75

Query: 157 RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHI 216
                 + FL + L   GY    +GKW+LG     + P  RGF+S+ GY    I Y  H 
Sbjct: 76  LANAGDQIFLSQRLATAGYEVHGVGKWNLGHCNAAFLPPSRGFKSYLGYYGAGIGYVSHE 135

Query: 217 LSDQYSRTVELNGH-------DMRR-----NLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +    +       H       DM+R      LS A   +GEY+T LFT E+++ I+   V
Sbjct: 136 VEQSGNVENSFTRHFRDYSLVDMQRCTPQGCLSDA-SVIGEYSTILFTNESLRRIDGFSV 194

Query: 265 DKPLFLYLAHLAAH---AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
           + P +LY+A+   H     NA   L+ P   +   +      RR +   +  +D  VG +
Sbjct: 195 ENPTYLYVAYHGVHDDKQSNASG-LDYPAAALEALE----ERRRNFGIALHAVDHGVGLL 249

Query: 322 ISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
              L+++    + +++  SDN G+P   Y ++       SN PYRG+K   +EGG+K+PA
Sbjct: 250 QQRLEKRS--SDYVLVVHSDNGGSPCGTYCDS-------SNAPYRGMKFFDFEGGLKLPA 300

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG---DTSRLPLNIDGLDQWSSLLLNTPS 437
            ++SP    +      +MH  DW  T      G   D S    + D  D WS++  N+  
Sbjct: 301 FVYSPTRLASRGTYDGLMHHVDWTATFLRGLAGTLLDCS----DCDSRDHWSAISSNSDD 356

Query: 438 R 438
           +
Sbjct: 357 K 357


>gi|325111172|ref|YP_004272240.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324971440|gb|ADY62218.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 474

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 120/392 (30%), Positives = 192/392 (48%), Gaps = 42/392 (10%)

Query: 64  QFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIIL 119
           +F ++         AL  + T   +IV+    G+ D+   G+    TPN+D +A  G   
Sbjct: 2   RFHFVILALLLLAPALATAETPNFVIVFTDDQGYADVGCFGAEGYETPNLDRMAQEGRKF 61

Query: 120 NNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK 178
            + Y +Q VC+ SR SL+TG Y +  G+ G  +      G+   E  L E +++ GY+T 
Sbjct: 62  TSFYVSQAVCSASRTSLLTGCYNVRVGINGA-LSHKSTVGISTEEMTLGELVKQQGYTTA 120

Query: 179 AIGKWHLGFFRREYTPLYRGFESHFG--YLNGVISYY--------DHILSDQYSRTVELN 228
           A+GKWHLG+ ++++ PL  GF+ +FG  Y N +  Y+        +      Y      +
Sbjct: 121 AVGKWHLGY-QQKFLPLQHGFDRYFGLPYSNDMWPYHPRLEGMTLEEARVKGYPDLPLYS 179

Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
           G+++     T  D      T  +T++A+  I+D   DKP  LYLAH   H          
Sbjct: 180 GNEIIDAQVTPEDQ--RQLTTQYTEQALDFIDDNH-DKPFLLYLAHSMPHV--------- 227

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
           P    ++FQ  T+  R  Y  +++++D SVG V++ L++ G+ +N+++IF SDNG P + 
Sbjct: 228 PLFVSDKFQDKTEQGR--YGDVIQEIDWSVGQVLARLKKHGLDDNTLVIFTSDNG-PWLS 284

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
           Y   +     GS  P+R  K T WEGGV+VP I+ W  +I        ++    D  PT+
Sbjct: 285 YGNHA-----GSAGPFREGKGTSWEGGVRVPCIMRWPGKIPAGTSCD-ELAATIDIFPTV 338

Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
             AA  D       IDG D   SL+  +P  +
Sbjct: 339 --AALADAPLPDHKIDGKD-IRSLMFESPCAK 367


>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
 gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
          Length = 438

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/338 (33%), Positives = 163/338 (48%), Gaps = 44/338 (13%)

Query: 92  GWNDLSFHGSNEIP----TPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH-TG 145
           G+ DL  +G  +      +P +D LA NG+ L   YA  PVC+P+R  ++TG+Y     G
Sbjct: 15  GYADLGCYGGRDAVFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFGMITGRYQYRLRG 74

Query: 146 MQGPPI----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
               PI     G+   G+P     LP  L+  GYST  IGKWHLG+    + PL  G++ 
Sbjct: 75  AAEEPINSRSRGSATLGLPTDHPTLPSLLKASGYSTALIGKWHLGY-PPSFGPLRSGYDE 133

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-YATDLFTKEAVQLIE 260
            FG ++G + Y+ H   D   R      HD+    S   D V E Y TD+ +K AV+ +E
Sbjct: 134 FFGPMSGGVDYFTHC--DSTGR------HDL---WSGEEDKVEEGYLTDILSKRAVEYVE 182

Query: 261 DQPV-DKPLFLYLAHLAAH-----AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
                + P FL L + A H       +A K   AP    N F  +   N   Y  M+  +
Sbjct: 183 RMAKQEAPFFLSLHYTAPHWPWETRDDAAK---APAVKDNLFD-LAGGNIHVYRRMIHHM 238

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D+ +G +++ALQ+ GM +N++++F SDNG              +  N+P  G K  L EG
Sbjct: 239 DEGIGWIMAALQKHGMADNTLVVFTSDNGG-----------ERFSDNWPLVGGKMDLTEG 287

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           G++VP I   P +      S Q+    DW  T+  AAG
Sbjct: 288 GIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAG 325


>gi|325108958|ref|YP_004270026.1| sulfatase [Planctomyces brasiliensis DSM 5305]
 gi|324969226|gb|ADY60004.1| sulfatase [Planctomyces brasiliensis DSM 5305]
          Length = 499

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 121/364 (33%), Positives = 175/364 (48%), Gaps = 49/364 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--G 148
           G+ +L  +G   I TP +D LA  GI L   Y+   VC PSR  LMTGK+P H  ++  G
Sbjct: 56  GYQELGCYGQKWIRTPFVDQLAAEGIKLTQFYSGNAVCAPSRCCLMTGKHPGHAWVRNNG 115

Query: 149 PPIWGAEPRG-----------VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
            P    E R            +P +E  + E L++ GY+T AIGKW LG F     P  +
Sbjct: 116 DPKLPEELREKYQWEFAGQFPLPESEVTVAELLKQKGYATAAIGKWGLGHFGTTGAPDRQ 175

Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELN-------GHDMRRNLSTAWDTVGEYATDL 250
           G +  +G+ N     ++H     Y R +  N       G+D   N  T       Y+ D 
Sbjct: 176 GIDLFYGF-NSQRHAHNH-----YPRFLWRNDQKEMQPGNDRTLNGET-------YSQDR 222

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQYITDPN 303
           FT+ A++ I DQ  D+P FLYL     H       +  A    E P+E      Y+  P 
Sbjct: 223 FTEVALEFI-DQNKDQPFFLYLPFAIPHLSIQVPESSLAEYTDEIPEEEYKHRGYLKHPK 281

Query: 304 RRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            R  YAAM+  +D  +G +++ L   G+ +N++IIF SDNG PT +    S+   + S  
Sbjct: 282 PRAGYAAMITHMDRDIGKIMARLNEYGLTDNTLIIFTSDNG-PTYDRLGGSDSDFFESAG 340

Query: 363 PYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
             RG+K +L+EGG++VP +  W  QI       L +  + D  PT+  AAG +    P  
Sbjct: 341 KMRGLKGSLYEGGIRVPCVARWPGQIPAGNTSDL-ISALWDLFPTIAEAAGVEP---PEE 396

Query: 422 IDGL 425
           IDG+
Sbjct: 397 IDGI 400


>gi|149196206|ref|ZP_01873261.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149140467|gb|EDM28865.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 454

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/362 (30%), Positives = 180/362 (49%), Gaps = 44/362 (12%)

Query: 76  YAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTP 130
           +A + K+    ++I++    G+ D+S +G+  I TP++D +AY G    N Y A PVCTP
Sbjct: 12  FALVAKAEKPNVIIIFADDMGYGDMSCNGNPSIKTPHLDRMAYEGQKWTNFYVAAPVCTP 71

Query: 131 SRASLMTGKYPIHTGM---QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
           SRA L+TG+ PI  GM   +   ++     G+P +E  L E  +  GY T  +GKWHLG 
Sbjct: 72  SRAGLLTGRLPIRNGMCSNKRRVLFPDSQGGLPQSEYTLAEMFKSAGYETGMVGKWHLG- 130

Query: 188 FRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAW---- 240
            + E+ P+  GF+S +G  Y N + +  + I   +  +++  + G       S  W    
Sbjct: 131 HKAEFLPIKHGFDSWYGIPYSNDMDANRELINKVKKDKSLPWHKGKHWEEPKSEYWEVPL 190

Query: 241 ---DTVGEYA------TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
              +TV E A      T  +T+++ Q I     DKP FLYLAH   H          P  
Sbjct: 191 MQGETVLERAPNQELLTKTYTEKSQQFIRVNK-DKPFFLYLAHSMPH---------VPLF 240

Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
             N+F+ ++      Y  ++++LD SVG V+  ++  G+ + +I++F SDNG P + ++ 
Sbjct: 241 RSNKFKDMS--TMGLYGDVIEELDWSVGQVLKTVKEMGLEQQTIVVFTSDNG-PWLSFKT 297

Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
                  G   P R  K + WEGG++ P ++W P   + P +   +    D + T    +
Sbjct: 298 QG-----GIAGPLRDGKGSTWEGGMREPTVIWGPGYVK-PGIIHDLGSTLDMMATFAKMS 351

Query: 412 GG 413
           GG
Sbjct: 352 GG 353


>gi|221234167|ref|YP_002516603.1| sulfatase family protein [Caulobacter crescentus NA1000]
 gi|220963339|gb|ACL94695.1| sulfatase family protein [Caulobacter crescentus NA1000]
          Length = 551

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 110/350 (31%), Positives = 158/350 (45%), Gaps = 77/350 (22%)

Query: 92  GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G+ND++ +G       +PTP ID++A  G+   N Y+    C PSRA++MTG+Y    G 
Sbjct: 74  GYNDITLNGGGVAGGTVPTPAIDSIAKEGVTFANGYSGNATCAPSRAAIMTGRYATRFGF 133

Query: 147 QGPPIWGAEPR------------------------------GVPLTERFLPEYLRELGYS 176
           +  P   A  R                               VP  E  + E L+  GY 
Sbjct: 134 EFTPTPVAFSRVVGGHAGDPLHPSRFNQAEVKNMPKDENVLAVPAAEVTIAEALKTKGYH 193

Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----------------YDHILSD 219
           T  +GKWHLG  +    P  +GF+   G++ G   +                  D  L  
Sbjct: 194 TIHLGKWHLGGVKGS-RPEDQGFDESLGFMAGAALFAPVGDPGVVESRQDWDPIDKFLWG 252

Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
                V+ NG  +             Y TD  T EAV+ I D   ++P F+YLA+ A H 
Sbjct: 253 AAPFAVQFNGGKLF--------NPSHYMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT 303

Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
                 L+AP+   +   +I D   R YAAMV+ LD +VG V+ AL+ +G+ +N+++IF 
Sbjct: 304 -----PLQAPKADYDALSHIKDHRMRVYAAMVRNLDRNVGKVLQALKDRGLDDNTLVIFT 358

Query: 340 SDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
           SDNG         +NY      N PYRG K T +EGG+KVP +L W  Q+
Sbjct: 359 SDNGG--------ANYIGLPDINKPYRGWKATFFEGGIKVPFLLRWPEQL 400



 Score = 43.5 bits (101), Expect = 0.34,   Method: Compositional matrix adjust.
 Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 6/68 (8%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
           Y TD  T EAV+ I D   ++P F+YLA+ A H       L+AP+   +   +I D   R
Sbjct: 271 YMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT-----PLQAPKADYDALSHIKDHRMR 324

Query: 75  TYAALTKS 82
            YAA+ ++
Sbjct: 325 VYAAMVRN 332


>gi|392413324|ref|YP_006449931.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
 gi|390626460|gb|AFM27667.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
          Length = 566

 Score =  147 bits (371), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 120/418 (28%), Positives = 195/418 (46%), Gaps = 68/418 (16%)

Query: 92  GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
           GW D  F+G  E    PTP +D LA +G++L + Y+ P CTP+RA++ TG+ P+H G+  
Sbjct: 108 GWMDPGFNGGGESVGNPTPTMDRLARDGLVLTSAYSTPSCTPTRATIHTGQNPLHHGLTR 167

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
           PP++G EP G+      +   L++LGY T+ +GKWH+G       P   G+++++G+L G
Sbjct: 168 PPMYG-EPGGLD-GAVTVASVLQKLGYVTQGVGKWHMG-ENEASLPHNVGYDNYYGFL-G 223

Query: 209 VISYYDHILSDQYSRTVELN------------GHDMRRNLSTAWDTVGEYATDLFTKEAV 256
           V   Y       ++  + L+             H+  R +  A      Y  DL T + +
Sbjct: 224 VSDMYTEWRDTYFNPEIALSPGRFALMQKLPFNHNNVRCVKGAKSCENLYEIDLTTIKDL 283

Query: 257 ---------QLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
                    + I D   + KP FLY A    H  N           +N       P R  
Sbjct: 284 DQDWAAYSEKFIRDMKDNTKPWFLYHATRGCHFDN----------YLNDKYAGKSPARTV 333

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
           Y+  + ++DD +  ++  L+  G LEN++I+  SDNG P  E              P+RG
Sbjct: 334 YSDCMVEMDDILARLVKTLEETGQLENTLILLTSDNG-PECEIPPH-------GRTPFRG 385

Query: 367 VKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
            K + WEGGV+VP  + W   ++  PR S  +  ++D   T  + AG   + +   +   
Sbjct: 386 CKGSNWEGGVRVPTFVYWKGMVK--PRRSDGLFDLADLFNTCISLAGSPGAEVAKFV--- 440

Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT-PSRRNSVLINIDEKKRTAAVRLDSWK 482
                     P+ R   +DG+DQ S L+ +   S R S++  +++    A VR+D +K
Sbjct: 441 ----------PADR--YVDGIDQASFLIGDQGQSNRRSIIYTLNQ--FVAGVRVDEFK 484


>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
          Length = 492

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 133/448 (29%), Positives = 197/448 (43%), Gaps = 72/448 (16%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
            +IV+    G+ DLS  G   I T N+D +A  G    N Y A  VCTPSRA L+TG+ P
Sbjct: 52  FIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLP 111

Query: 142 IHTGMQGPPIWGAEP---RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           +  G+    I    P    G+P +E  L E L++ GY+T  +GKWHLG  + EY P   G
Sbjct: 112 VRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLGH-KEEYLPPNHG 170

Query: 199 FESHFG--------YLNGVISYYDHI--LSDQYS--RTVELNGHDMRRNLSTAWDTVGEY 246
           F+ +FG        +     SY D+    +++Y   +T E N   +R             
Sbjct: 171 FDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQNT 230

Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
            T  +  EAV+ I +   D+P F+YLAH   H          P  T ++F+  +   R  
Sbjct: 231 ITKRYNDEAVKWIREHK-DEPFFMYLAHSLPHV---------PLFTSDEFRGTS--ARGL 278

Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPY 364
           Y  +V+++D  VG ++  L+ +G+ EN+I++F SDNG   PT          + GS    
Sbjct: 279 YGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNGPWLPTG--------ISGGSAGLL 330

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           R  K T WEGG++ P I W+P +    +V + M    D   T  + AG            
Sbjct: 331 REGKGTTWEGGMREPTIFWAPGMLP-AKVVMDMGSTLDLFNTFSSLAG------------ 377

Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
                      P   +  +DG+D    L  +  S R  +     +     AVRL ++K  
Sbjct: 378 ----------VPMPDDREMDGVDLSPILFGDAESPRKEMFYY--QGADLYAVRLGAYKAH 425

Query: 485 LGTQENGTMDGYYGQTRSNKVPLLNFNA 512
             T+E   M    G  R    P L +N 
Sbjct: 426 FYTKEAYVM----GAERVEHNPPLLYNV 449


>gi|410635289|ref|ZP_11345904.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
 gi|410145262|dbj|GAC13109.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
          Length = 493

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/371 (30%), Positives = 173/371 (46%), Gaps = 56/371 (15%)

Query: 60  ETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIIL 119
           ET+ +    ++      AAL  +  +      G+ D+S  G++ I TPNID++   G   
Sbjct: 21  ETLKETSTESNKGSVQKAALKPNIVIIFADDLGYGDISSFGADGIRTPNIDSIGQEGFTS 80

Query: 120 NNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGV-PLTERF------------ 165
            + +    VC+PSRASL+TG+YP+  GM         P  V PL+E+             
Sbjct: 81  RDFFIPANVCSPSRASLLTGRYPMRNGM---------PVAVNPLSEKHVSSHFGLHPDEI 131

Query: 166 -LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT 224
            +PE L+  GY +  +GKWHLGF ++   PL  GF+ H G L    +YY    +D     
Sbjct: 132 TIPEMLKPAGYRSLMVGKWHLGFQQKGSHPLDAGFDEHLGLLG---NYYKARENDPRYPI 188

Query: 225 VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
           ++ N    R + +   +   E  T  +T E +  IE +  D P F+Y AH   H      
Sbjct: 189 LKDNQTLYRGHEAVKEEIELEEVTQRYTDEVISFIEREK-DGPFFVYFAHNIVH------ 241

Query: 285 HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
              +P     +FQ  ++  +  Y   + ++D S G +++ L+R  + EN+++IF SDNGA
Sbjct: 242 ---SPHLPRKEFQGTSEKGK--YGDFIHEMDYSTGRILATLKRLNLEENTLVIFTSDNGA 296

Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHIS 401
                    NY   G+N    GVK T  EGG  VPA+L  P +    Q+ + +L  M   
Sbjct: 297 ---------NYA--GNNGELAGVKYTTMEGGHIVPAMLKWPSVIPAGQDSKTTLTSM--- 342

Query: 402 DWLPTLYTAAG 412
           D  P     AG
Sbjct: 343 DLFPLFADIAG 353


>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
 gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
          Length = 470

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 114/381 (29%), Positives = 185/381 (48%), Gaps = 50/381 (13%)

Query: 81  KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
           KS    +++++    G+NDL  +GS  I TPN+D LA  G    + Y A  VC+PSRA+L
Sbjct: 24  KSDKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAAL 83

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           +TG YP   G+    ++     G+   E  + ++L+  GY+T  +GKWHLG   +E  P 
Sbjct: 84  LTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPT 142

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGE 245
             GF+S++G     I Y + +      R  +++  D   + S+A   W+T       + E
Sbjct: 143 SNGFDSYYG-----IPYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIE 197

Query: 246 YATD------LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
              D       +T  A++ +E    DKP FLYL H   H       L  P++      Y 
Sbjct: 198 LPVDQRTVTRRYTDRAIEFVEANQ-DKPFFLYLPHSMPHI-----PLYVPEDV-----YD 246

Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
            DP +  Y  +++ +D  VG ++  ++  G+ E ++I++ SDNG P ++++      + G
Sbjct: 247 PDP-QNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGG 299

Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSR 417
           S  P R  K T +EGG +VP I+W+P        S       D LPT+  +T    +  R
Sbjct: 300 SAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDR 359

Query: 418 LPLNIDGLDQWSSLLLNTPSR 438
               IDG+D  S+   +  +R
Sbjct: 360 ---KIDGIDLTSTFTSDESAR 377


>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
 gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
          Length = 487

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 112/364 (30%), Positives = 177/364 (48%), Gaps = 42/364 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+NDL  +GS  I TPN+D LA  G    + Y A  VC+PSRA+L+TG YP   G+    
Sbjct: 58  GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 117

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           ++     G+   E  + ++L+  GY+T  +GKWHLG   +E  P   GF+S++G     I
Sbjct: 118 LFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 171

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
            Y + +      R  +++  D   + S+A   W+T       + E   D       +T  
Sbjct: 172 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDR 231

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           A++ +E    DKP FLYL H   H       L  P++      Y  DP +  Y  +++ +
Sbjct: 232 AIEFVEAN-QDKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHV 279

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D  VG +I  ++  G+ E ++I++ SDNG P ++++      + GS  P R  K T +EG
Sbjct: 280 DTEVGRLIQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 333

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G +VP I+W+P        S       D LPT+ +  G    +    IDG+D  S+   +
Sbjct: 334 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGV-ALKNDRKIDGVDLTSTFTSD 392

Query: 435 TPSR 438
             +R
Sbjct: 393 ESAR 396


>gi|223940145|ref|ZP_03632007.1| sulfatase [bacterium Ellin514]
 gi|223891162|gb|EEF57661.1| sulfatase [bacterium Ellin514]
          Length = 508

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 186/375 (49%), Gaps = 47/375 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D+   G  +I TPNID +A  G+     Y+  PVC PSR  LMTGK+  H+ ++   
Sbjct: 49  GYADVGCFGQKKIHTPNIDRIATEGMKFTQHYSGSPVCAPSRCVLMTGKHSGHSAVRD-- 106

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +P G   +P     +   L++ GY T A GKW LG       PL +GF   FGY  
Sbjct: 107 NRELKPEGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESSGKPLDQGFTRFFGYNC 166

Query: 208 GVISY--YDHILSDQYSRTVELN---GHDMR------RNLSTAWD--TVGEYATDLFTKE 254
             +++  +   L D   R    N   G D +       N   ++   T   YA DL+ ++
Sbjct: 167 QRVAHNLFPTYLWDDNHRLALDNPPIGEDQKLPADADSNDPASYKAFTGKSYAPDLYAEQ 226

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-------------YITD 301
           A++ I D   D P FL+   +  H       L+ P++++ +++             Y+  
Sbjct: 227 ALRFIRDN-KDHPFFLFFPTIVPHVA-----LQVPEDSLKEYEGKLPETPYTGGKGYL-- 278

Query: 302 PNR---RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
           PNR     YAAM+ ++D  +G +++ ++   + +++I +F SDNG P  +    ++ + +
Sbjct: 279 PNRTPHAAYAAMITRMDRDLGRMLALIKELNLDDDTIFVFTSDNG-PAPQDMGGTDTKFF 337

Query: 359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
            S+ P+R  K +++EGG+++P I+ W  +IQ N   S ++    DWLPTL   +G   S 
Sbjct: 338 NSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPN-STSDRVTGFEDWLPTLLELSGNKKS- 395

Query: 418 LPLNIDGLDQWSSLL 432
           +P  IDGL   S+LL
Sbjct: 396 VPTGIDGLSFASTLL 410


>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
 gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
          Length = 454

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/348 (33%), Positives = 179/348 (51%), Gaps = 54/348 (15%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
           +LI++    G++D+S++G+ +I TP+IDA+  +G+  +  Y   PVC P+RASLM+G+YP
Sbjct: 34  ILIIFTDDQGYHDVSYYGTEDIQTPHIDAIRNDGMQFDYFYTNSPVCAPTRASLMSGRYP 93

Query: 142 IHTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
            H G+ G         WG   P  +      LP  L++ GY T  IGKW+LG       P
Sbjct: 94  DHVGVPGLVRSNPANNWGYLNPETI-----LLPAMLKKGGYHTAHIGKWNLGLMHPN-LP 147

Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
              GF+   G+L  ++  Y          T   +G +  R      D  G +ATDLFT+ 
Sbjct: 148 NDHGFDYFHGWLEDMMDDY---------LTHRRHGKNYMRLNEEIIDPKG-HATDLFTQW 197

Query: 255 AVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ---ETINQFQYITDPNRRTYAAM 310
           +V  I E +  ++P FLYLA+ A H       ++ P+   E +N+ +      R    A 
Sbjct: 198 SVNYIKEREKEERPFFLYLAYNAPHFP-----VQPPKSWVEKVNKREPGLPEKRANLIAF 252

Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
           ++ LDD +G VI+AL+  G  EN++IIF SDNG          +  +  +N   R  K +
Sbjct: 253 IEHLDDGIGQVINALKETGQYENTLIIFTSDNGG---------HLPDLANNGLLRDGKQS 303

Query: 371 LWEGGVKVP-AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGD 414
           ++EGG++VP AI W  +I+    + +V+L M    D  PTL   AG D
Sbjct: 304 MYEGGLRVPTAISWPAKIKAGSTSDQVNLSM----DIFPTLTEIAGVD 347


>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
 gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
           baltica SH28]
          Length = 485

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 117/394 (29%), Positives = 175/394 (44%), Gaps = 67/394 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---- 146
           G+NDL   G N I TP +D LA  G  L N Y A P CTPSRASL+TG+YP   G+    
Sbjct: 41  GYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQRNGIYDMI 99

Query: 147 -QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
               P +G                G+   E  LP  LR  GY +   GKW LG  +R   
Sbjct: 100 RNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYKSGIYGKWDLGALQR-ML 158

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
           P  RGF+  +G++N  I Y+ H   ++Y          M RNL       G Y T LF +
Sbjct: 159 PTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGTYCTYLFQR 209

Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDP---------- 302
           EA++ +++   ++P FLY+   A H + +    + +  +  +QF+ +  P          
Sbjct: 210 EALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEVETRVTDR 269

Query: 303 --------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
                          RR Y A V  +D ++G ++  L+ K ML+ ++++F SDNG     
Sbjct: 270 YRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDETMVVFFSDNGG---- 325

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
                       N P RG K   WEGG++VP ++ W         V+ + +   + LP+ 
Sbjct: 326 -------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFLTSLELLPS- 377

Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
           + AA G      + +DG + W +L     S R  
Sbjct: 378 FAAAAGVEPPPGVVLDGFNWWPTLRGEAESPRKE 411


>gi|327289638|ref|XP_003229531.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Anolis
           carolinensis]
          Length = 521

 Score =  147 bits (370), Expect = 2e-32,   Method: Compositional matrix adjust.
 Identities = 157/573 (27%), Positives = 239/573 (41%), Gaps = 122/573 (21%)

Query: 78  ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
           A + +  L L+   GW DL   G     TP++D +A  G++  + Y A P+C+PSRA+L+
Sbjct: 25  ATSPNLVLLLMDDMGWGDLGVFGQPSKETPHLDQMAAEGMLFPSFYTANPLCSPSRAALL 84

Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
           TG+ PI  G        A  R          G+P +E  LPE L++ GY+ K +GKWHLG
Sbjct: 85  TGRLPIRNGFY---TTNAHARNAYTPQDITGGIPDSELLLPELLKKAGYANKIVGKWHLG 141

Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWD 241
             + ++ PL  GF+  FG  N     YD+  +      R  E+ G    D++ N  T   
Sbjct: 142 H-QPQFHPLRHGFDEWFGSPNCHFGPYDNKAAPNIPVYRDWEMIGRYYEDIKINHKTGES 200

Query: 242 TVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
            +    T ++ +EA+  +  Q    +P FLY A  A HA                  ++ 
Sbjct: 201 NL----TQMYLQEALDFMSSQQAKQQPFFLYWAIDATHA-----------PVYASRAFLG 245

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
              R  Y   V+++DDSVG ++S L++ G+ E++ + F SDNGA  V   E +     GS
Sbjct: 246 SSQRGRYGDAVREIDDSVGKILSHLRKLGISEDTFVFFTSDNGAALVSAPEQA-----GS 300

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSP-QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           N P+   K T +EGG++ PAI W P  I     VS Q+  + D   T    AG       
Sbjct: 301 NGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQLVSHQLGSVMDLFATSLALAG------- 353

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
                           P   +  IDG+D    +L      R       +E     AVR+ 
Sbjct: 354 ---------------LPPPGDREIDGIDLSPVVLQGKLIDRPIFYYRGNE---MMAVRVG 395

Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDK 538
            +K    T  N                              S ++  Q I F P  N+  
Sbjct: 396 LYKAHYWTWSN------------------------------SWEEFGQGIDFCPGQNVSG 425

Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP---DISSQLYE 595
           + S  Q+       +  P+           LF+LG DP E+  I+ S     D+  ++  
Sbjct: 426 VTSHDQEEH-----SKLPL-----------LFHLGKDPGERFPISFSSAEYQDVMGRISP 469

Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
           +++ H+ T++P    QP L   D    N  W+P
Sbjct: 470 VVEQHKETMLPG---QPQLNVCDKAVMN--WAP 497


>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 465

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 113/347 (32%), Positives = 173/347 (49%), Gaps = 32/347 (9%)

Query: 78  ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSR 132
           +L +S     +I++    G+ D+   G   I TPN+D +A  G    N Y A  VCTPSR
Sbjct: 22  SLAQSQKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNFYVAANVCTPSR 81

Query: 133 ASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
           +++MTG+ P+ TGM       ++     G+P TE  + + L+  GYST AIGKWHLG   
Sbjct: 82  SAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLLKTSGYSTAAIGKWHLGHLP 141

Query: 190 REYTPLYRGFESHFG--YLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGE 245
            EY P   GF+++FG  Y N +    D    + ++  +    N   MR            
Sbjct: 142 -EYLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRDKEIIERPADQT 200

Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
             T  +T+EAV  I+    D+P F+YLAH   H       L A ++      ++T   R 
Sbjct: 201 TITKRYTEEAVSYIKANK-DQPFFIYLAHSLPHI-----PLFASED------FLTKSERG 248

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y  +++++D SVG ++S L+ +G+ +N+ +IF SDNG P + Y E              
Sbjct: 249 LYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNG-PWLVYNEQGGSSGGLFG---- 303

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
             K T +EGGV+VP I+W P     P+V  ++    D LPT+ + +G
Sbjct: 304 -GKGTSYEGGVRVPTIIWGPG-NIKPQVVSKIGSTLDLLPTISSLSG 348


>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 728

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 128/432 (29%), Positives = 189/432 (43%), Gaps = 62/432 (14%)

Query: 77  AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASL 135
           +A   +  L L+   G+ DL  +G+ E+ TP ID +A  GI   + YA  P+C+PSRA L
Sbjct: 2   SAEKPNIVLILVDDQGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGL 61

Query: 136 MTGKYPIHTGMQGPPIW---GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
           +TG YP   G     +W        G+   E  L E  ++ GY T  IGKWHLG F   +
Sbjct: 62  LTGCYPRRVGNH---VWVHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLG-FHEPF 117

Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
            P  +GF+ +FG L+ +         D     ++ +   ++R          +  T L+T
Sbjct: 118 LPQNQGFDHYFGLLHNLDPVETVYFEDVGGVPLQRDRDVVKRPADP------DELTKLYT 171

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
            EA+  IE    + P  LYL H   H       L   +E     Q+        Y   ++
Sbjct: 172 NEAIDFIEANK-EGPFLLYLPHTMLH-----NPLGVSEEFKGTSQW------GEYGDAIQ 219

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN--YPYRGVKNT 370
           +LD  VG +I  L+R  + +N+++++ SDNG            R  G N     RG K +
Sbjct: 220 ELDHHVGRIIDVLKRLNIDDNTVVVYASDNG------------RGPGRNPQQKIRGRKLS 267

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLD--- 426
            +EGG++VPAI W P +     +S   ++   DW PTL T A  +     + IDG D   
Sbjct: 268 TYEGGIRVPAIAWGPGLGLRAGISSPAVVRAMDWYPTLATLADIEIPSERV-IDGRDLSP 326

Query: 427 -------------QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
                           SL    P RR    DG  +W SL+    ++ N        +   
Sbjct: 327 LLRGETDFVPPPGLKKSLNAGVPLRRRWEPDG--EWGSLI--NRNQYNDAFFYHGSQGAL 382

Query: 474 AAVRLDSWKLVL 485
           AAVR  +WKL L
Sbjct: 383 AAVRWRNWKLYL 394


>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
 gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
          Length = 526

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 111/341 (32%), Positives = 167/341 (48%), Gaps = 25/341 (7%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G++D+  +G+ +IPTPN+DA+A +G++L N Y AQPVC+ SRA L+TG YP   G+    
Sbjct: 85  GYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTGCYPNRVGIHN-A 143

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           +    P G+   E  L E LR+ GY T   GKWHLG    ++ P   GF+  FG      
Sbjct: 144 LMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLG-DHPDFLPTRHGFDEFFGIPYSND 202

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
            +  H L         L  ++  R + T  D      T   T+ +V  I ++  ++P FL
Sbjct: 203 MWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQ--RLLTRQITERSVDFI-NRHKEEPFFL 259

Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           Y+ H   H          P    + F+  +   R  Y  ++ ++D SVG V+ AL+  G+
Sbjct: 260 YVPHPQPH---------VPLFVSDAFRGKS--GRGLYGDVIMEIDWSVGQVLGALEDNGL 308

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            +++ +IF SDNG P + Y   S     G   P R  K T WEGGV+ P I+  P     
Sbjct: 309 TDDTWVIFTSDNG-PWLAYGNHS-----GRAEPLREGKGTNWEGGVREPCIMKFPGRLPR 362

Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
            +V  + +   D LPT+ +  G  + +    IDG + W  L
Sbjct: 363 GKVLDEPLMAIDLLPTIASVTG--SPQPGREIDGKNAWGLL 401


>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
 gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
          Length = 452

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 179/364 (49%), Gaps = 60/364 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQG-- 148
           G  + S  G+ +I TPNID L + G+  +N YA   VC+PSRA+LMTG +P   G+ G  
Sbjct: 39  GRGEYSDFGTKDIQTPNIDRLCHEGLTFDNFYANSCVCSPSRAALMTGCFPNRVGVPGVI 98

Query: 149 -----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                   WG   +GV    R  P+ L+  GY +  +GKWHLG      TP  RGF+   
Sbjct: 99  REEDQDNNWGWLAQGV----RLFPQLLKPAGYQSAIVGKWHLGI-NSPNTPTERGFDRFD 153

Query: 204 GYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           G+L  ++  Y+ H+   Q         + MR N      +   +ATD+FT  A++ ++++
Sbjct: 154 GFLGDMMDDYWTHLRHGQ---------NFMRHNQEVV--SPEGHATDVFTDWALRYLDER 202

Query: 263 PVDK-PLFLYLAHLAAHAGNAGK-----HLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
              K P FLYLA+ A H     K      ++A +  +++        R    A+++ LDD
Sbjct: 203 SKSKEPFFLYLAYNAPHDPIQPKPDWLAKVKAREPGLSE-------KRAKLVALIEHLDD 255

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
            +G V++ L   G+ +N+I++F SDNG             N  +N P+R  K  ++EGG+
Sbjct: 256 GIGKVLAKLDETGLAKNTIVLFSSDNGG---------VLSNEANNGPWRSGKTHMYEGGL 306

Query: 377 KVPAIL-W----SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +VP ++ W    +P    N RV+L M    D LPT     G      P NIDG+    +L
Sbjct: 307 RVPCMVRWPGHVAPGTHSN-RVALLM----DILPTALEIVG---QTPPPNIDGVSFLPTL 358

Query: 432 LLNT 435
           L  T
Sbjct: 359 LGET 362


>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
          Length = 453

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 136/522 (26%), Positives = 218/522 (41%), Gaps = 117/522 (22%)

Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
           MTG+YP+  G+Q   I   EP G+PL E  LP+  RE GYST   GKWHLG++ + + P 
Sbjct: 1   MTGRYPMRYGLQRGVIRPDEPEGIPLREILLPQGFRECGYSTLMSGKWHLGYYTKAHRPQ 60

Query: 196 YRGFESHFGYLNGVISYYDHIL-----------SDQYSRTVELNGHDMRRNLSTAWDTVG 244
            RGF+  +G+  G   +Y H              + YS   E+  HD       AW   G
Sbjct: 61  NRGFDRFYGFYVGSQDFYFHRTGFKSDGASFWSEEGYSTGEEIIRHD-------AW---G 110

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP---QETINQFQYITD 301
            Y+T L   +A++ + ++    PLF YL+    HA       + P   +++I  F    +
Sbjct: 111 HYSTTLLVDDAIKQLRNRNPGNPLFHYLSIQDPHAPE-----QRPLRLEQSIKHFALRGE 165

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLE--NSIIIFMSDNGAPTVEYRETSNYRNWG 359
             R+ +   VK +D  +G  +  L+R  + +  N+I+IF SDNG P          R  G
Sbjct: 166 --RKIFVTKVKSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP----------RKLG 213

Query: 360 -SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
            SN+P RG K  +++G  +  A++  P ++   +    M H  DW+PTL         + 
Sbjct: 214 ASNWPLRGGKGCIFDGAFRSRALIHIPGMKH--KNYPHMFHAVDWMPTLMEMLSCKLPKD 271

Query: 419 PLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN---IDEKKRTA 474
            L +DG+    ++L +   + R   +  +D    L       R+   +     D + R A
Sbjct: 272 HLPLDGVSHMKTILGMTDRAPREEMLLDIDPEEVLQKTASDARDFSYLKKSGFDSRMR-A 330

Query: 475 AVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPIS 534
           A+R   WK++ G   +GT  G                          +Q  ++N +    
Sbjct: 331 ALRYKEWKIITGPPCSGTKCG--------------------------VQSENRNYYKAEE 364

Query: 535 NIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY 594
            + K RS R                         L+N+  DP E  +++   P+++  L 
Sbjct: 365 YVSKSRSVR-------------------------LYNITADPRESYDLSDKYPEVTKAL- 398

Query: 595 ELLKYHRRTLVPQSHEQPDLV-------QADPKRFNDTWSPW 629
            LLK      +   HE+  L        +A P+    TW PW
Sbjct: 399 -LLK------LSVYHEEQSLFMDRMRDKKALPETSEHTWFPW 433


>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
 gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
          Length = 489

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 46/366 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+NDL  +GS  I TPN+D LA  G    + Y A  VC+PSRA+L+TG YP   G+    
Sbjct: 58  GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 117

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           ++     G+   E  + ++L+  GY+T  +GKWHLG   +E  P   GF+S++G     I
Sbjct: 118 LFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 171

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
            Y + +      R  +++  D   + S+A   W+T       + E   D       +T  
Sbjct: 172 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDR 231

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           A++ +E    DKP FLYL H   H       L  P++      Y  DP +  Y  +++ +
Sbjct: 232 AIEFVEANQ-DKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHI 279

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D  VG ++  ++  G+ E ++I++ SDNG P ++++      + GS  P R  K T +EG
Sbjct: 280 DTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 333

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSRLPLNIDGLDQWSSLL 432
           G +VP I+W+P        S       D LPT+  +T    +  R    IDG+D  S+  
Sbjct: 334 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDR---KIDGIDLTSTFT 390

Query: 433 LNTPSR 438
            +  +R
Sbjct: 391 SDESAR 396


>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
           HF0010_16H03]
          Length = 543

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 121/409 (29%), Positives = 193/409 (47%), Gaps = 61/409 (14%)

Query: 62  INQFQYITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFH----GSNEIPTPNIDALAYNG 116
           I++ Q I  P   +    TK   + +L    G+ND+SF+        + TPNID LA  G
Sbjct: 38  ISENQEINWPIVESSVITTKPNIILILADDLGFNDVSFYNGGAADGSLMTPNIDNLAKEG 97

Query: 117 I-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI------------------------ 151
           +  LN   A PVC+PSRA++MTG+Y    G +  P                         
Sbjct: 98  VAFLNGYAASPVCSPSRAAIMTGRYSSRYGFEFTPYPAQAAKIMNLLREDDELETIYLED 157

Query: 152 -----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
                 G    G+P  E  + E L++ GY T  IGKWHLG F     P  +GF+     L
Sbjct: 158 VQLDEVGLTIGGLPDKEVTVAEMLQKNGYYTAHIGKWHLGGFTDGMKPNDQGFDDSL-ML 216

Query: 207 NGVISYYDHILSDQYSRTVELNGHDM---RRNLSTAWDTV-----GEYATDLFTKEAVQL 258
           N  + Y      D  +  ++ +  DM       + +++       G Y TD +T EA+++
Sbjct: 217 NSSL-YLPKNHPDIVNAKIDSSIEDMVWASSQFAASFNGSEPFKPGGYITDYYTDEAIKV 275

Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           I +   D+P FLYL H A H       L++ ++   +  ++ +   + Y+ M++ LD S+
Sbjct: 276 INNNK-DRPFFLYLGHFAPH-----NPLQSLRKDYEKHNHMENHTLQVYSGMIEALDRSI 329

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G +++ L+  G+ EN++IIF SDNG     Y    N      N PYRG K T +EGG+ +
Sbjct: 330 GKIVTTLEENGLTENTLIIFTSDNGG--AGYIGLDNI-----NKPYRGWKLTHFEGGMHI 382

Query: 379 PAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           P    W  +I+++ R   ++ H +D   T+  AA  +  +  + IDG++
Sbjct: 383 PFFAKWPAKIKKDMRYDKRIHH-TDIFSTILGAASIEPPK-EIKIDGIN 429



 Score = 39.3 bits (90), Expect = 5.7,   Method: Compositional matrix adjust.
 Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 7/80 (8%)

Query: 540 RSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
           ++   QA IH   N   +  S   N   YL+NL  DP EQNN+A S   + S L+E L  
Sbjct: 449 KNASYQAVIH---NDWKLLRSKFPNEQEYLYNLKKDPYEQNNLAMSELTLKSLLHEKLDT 505

Query: 600 HRRTLV----PQSHEQPDLV 615
           H  ++     PQS   P +V
Sbjct: 506 HIESMPEPNWPQSVFMPVVV 525


>gi|336413808|ref|ZP_08594157.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
           3_8_47FAA]
 gi|335934825|gb|EGM96808.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
           3_8_47FAA]
          Length = 472

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 139/457 (30%), Positives = 204/457 (44%), Gaps = 71/457 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G  DL  +G N I TP IDALA  G+     Y+   V  PSR  L+TGK+  H+ ++G  
Sbjct: 48  GIGDLGCYGQNRIKTPAIDALAAQGMKFTQHYSGSTVSAPSRCVLLTGKHTGHSYIRGNK 107

Query: 151 IWGAEP-RGVPLT----ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
            + AE  R   L     E  + E  +   Y T  +GKW LG    E  P  +GF+  FGY
Sbjct: 108 GYKAEDGRSYDLNLADEEITIGEMFKTKNYVTACVGKWGLGGPSAEGHPNKQGFDYFFGY 167

Query: 206 LN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           L  G    Y  +   + +  V LN H +             Y+ D+   +A++ I D+  
Sbjct: 168 LGQGNAHNYYPVQLFENNTPVMLN-HKV-------------YSHDMIMDKALEFI-DKNA 212

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETI----NQFQYITDPN---------RRTYAAMV 311
           DKP F+YL     HA      +  P   +     +F+ +  P          R T+AAMV
Sbjct: 213 DKPFFVYLTPTIPHA-----DIIVPNNELFDYDGKFEEVPYPGKGYRPQEKPRATFAAMV 267

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS-NYRNWGSNYPYRGVKNT 370
            +LD  V  VI  L  KG+L+N+IIIF SDNG     ++E   + R + SN P+RG+K  
Sbjct: 268 TRLDRDVQRVIDLLADKGVLDNTIIIFTSDNGT----HKEGGHDPRYFDSNGPFRGMKRD 323

Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
           L+EGG++ P I+  P +     VS Q+    D++PT+    G   +  P NIDGL    +
Sbjct: 324 LYEGGIRTPFIVKWPGVIPAGSVSFQVSAFWDFMPTMCELIG---APAPANIDGLSYLPT 380

Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
           L      +++                    + +     E+    AV  D WKL L  Q N
Sbjct: 381 LTGKGKQKQH--------------------DYLYFEFHEQGGKQAVIKDGWKL-LHLQVN 419

Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQ 527
                +Y     N  P  N N +  ++  Q +Q+L +
Sbjct: 420 NPQKEHYELYNLNADPGENANVL--NQYPQKVQELEK 454


>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 470

 Score =  146 bits (369), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 112/373 (30%), Positives = 181/373 (48%), Gaps = 46/373 (12%)

Query: 87  LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
           +++V+    G+NDL  +GS  I TPN+D LA  G    + Y A  VC+PSRA+L+TG YP
Sbjct: 30  VIVVFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYP 89

Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
              G+    ++     G+   E  + ++L+  GY+T  +GKWHLG   +E  P   GF+S
Sbjct: 90  KRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDS 148

Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD-- 249
           ++G     I Y + +      R  +++  D   + S+A   W+T       + E   D  
Sbjct: 149 YYG-----IPYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQR 203

Query: 250 ----LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
                +T  A++ +E    DKP FLYL H   H       L  P++      Y  DP + 
Sbjct: 204 TVTRRYTDRAIEFVEANQ-DKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QN 251

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
            Y  +++ +D  VG ++  ++  G+ E ++I++ SDNG P ++++      + GS  P R
Sbjct: 252 AYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLR 305

Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
             K T +EGG +VP I+W+P        S       D LPT+ +  G         IDG+
Sbjct: 306 AGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGVALEN-DRKIDGI 364

Query: 426 DQWSSLLLNTPSR 438
           D  S+   +  +R
Sbjct: 365 DLTSTFTSDESAR 377


>gi|149195908|ref|ZP_01872964.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
 gi|149140755|gb|EDM29152.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
          Length = 490

 Score =  146 bits (368), Expect = 3e-32,   Method: Compositional matrix adjust.
 Identities = 114/351 (32%), Positives = 172/351 (49%), Gaps = 30/351 (8%)

Query: 75  TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
            Y    K T   ++ +     G   L   G   + TP ID L       ++ +  P CTP
Sbjct: 18  VYCEELKGTKPNIIFILTDDQGKGQLGCEGHPWLETPYIDKLFSESSYFSDFHMSPTCTP 77

Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
           SRA+LMTG YP   G+      GA  R + L    LPE L+ LGY+T   GKWHLG +  
Sbjct: 78  SRAALMTGNYPFRNGVT--HTGGARAR-MTLKIETLPEALKSLGYTTGIFGKWHLG-YEE 133

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
            Y P  RGF+  F +  G I     + +++Y   +      +R N  T   T G + TD+
Sbjct: 134 AYQPHSRGFDEVFIHGYGGIGQPMDVPNNKYENPI------IRHN-ETFVKTEG-FCTDV 185

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-DPNRRTYAA 309
           F  +A+  I+ +  DKP F Y++  A H    G ++ AP++   +F+ +        +  
Sbjct: 186 FFDQALAWIK-KSKDKPFFAYISTNAPH----GPYI-APKDKREKFKKLGFKSGDAGFYG 239

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS-NYRNWGSNYPY---- 364
           M++ +DD+VG +++ L++  + EN+++IFMSDNG  ++   E     RN    Y Y    
Sbjct: 240 MIENIDDNVGRLMTKLKQWDLDENTLVIFMSDNGFASLVINEKKLGQRNGEDLYSYQAGL 299

Query: 365 RGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
           RG K T +EGG  VPA   W  + +   ++S    HI D  PTL   AGG+
Sbjct: 300 RGYKKTPYEGGTLVPAFFRWKGKFKAGNKISGLSAHI-DLFPTLVELAGGE 349


>gi|149179345|ref|ZP_01857904.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Planctomyces maris DSM
           8797]
 gi|148841817|gb|EDL56221.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Planctomyces maris DSM
           8797]
          Length = 461

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 118/366 (32%), Positives = 178/366 (48%), Gaps = 50/366 (13%)

Query: 75  TYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSR 132
           T A  T+   L +L+   G+ DLS +G+ ++ +P+ID L   G+  +N YA  PVC+P+R
Sbjct: 26  TTAQQTRPNVLVILVDDLGYGDLSSYGATDLKSPHIDELLNRGMKFSNFYANCPVCSPTR 85

Query: 133 ASLMTGKYPIHTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
           A+L+TG Y    G+ G         WG  +P  V L + F        GY T  IGKWHL
Sbjct: 86  AALLTGHYQDMVGVPGVIRTHPENSWGYLKPSAVTLADVF-----HSAGYQTAIIGKWHL 140

Query: 186 GFFRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
           G      TP  RGF+   G+L  ++  YY H       R   +N   MRRN  T  D  G
Sbjct: 141 GL-ESPNTPNERGFDLFRGFLGDMMDDYYLH-------RRHGVN--YMRRNQKTV-DPQG 189

Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYIT 300
            +ATDLFT    + ++ Q   + P FLYLA+ A H       ++ P++    + Q +   
Sbjct: 190 -HATDLFTDWTCEYLKQQATSESPFFLYLAYNAPHT-----PIQPPEDWLGKVKQRETGI 243

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
           DP+R    A+++ LD  +G VI  L    + +N++I+F SDNG                +
Sbjct: 244 DPDRARLVALIEHLDAGIGKVIQTLDETKLSDNTLIVFSSDNGG---------QLGVGAN 294

Query: 361 NYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           N   R  K +++EGG+KVP  ++W   I  +       M + D  PT+  A G    ++P
Sbjct: 295 NGALRDGKQSMYEGGLKVPTGVVWKKHIAPHTETDFMAMSM-DIFPTVCEAGG---IKVP 350

Query: 420 LNIDGL 425
             +D +
Sbjct: 351 SGLDAV 356


>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
 gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
           12058]
          Length = 457

 Score =  146 bits (368), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 117/350 (33%), Positives = 171/350 (48%), Gaps = 54/350 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G  DLS   + ++ TPNID L  +G+ LNN YA   V +PSRA L+TG YP   G+ G  
Sbjct: 48  GLGDLSCQYARDLQTPNIDKLFSDGVRLNNFYANSSVSSPSRAGLLTGCYPDMVGVPGVI 107

Query: 149 ----PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                  WG   P  +      LP+ +++  Y+T  IGKWHLG       P  RGF+   
Sbjct: 108 RTETTASWGYLSPEAI-----LLPQMMKKAEYNTAIIGKWHLGL-EEPNLPNSRGFDFFH 161

Query: 204 GYLNGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           G+L  ++ +YY H+            GH+  R      +  G +ATDLF+  A+   +  
Sbjct: 162 GFLGDMMDNYYTHLR----------RGHNYMRLNEEVIEPTG-HATDLFSSWALDYFDTM 210

Query: 263 PVDK-PLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
              K P FLYLA+ A H       L+ P E    + + +      R    A+++ LD ++
Sbjct: 211 RKSKVPFFLYLAYNAPHVP-----LQPPTEWLERVREREPGISEKRAKLVALIEHLDYNI 265

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V  AL+  G L+N++IIF SDNG          +     +N P RG K  ++EGG+KV
Sbjct: 266 GRVYKALEESGQLDNTLIIFCSDNGG---------DAGAEANNGPVRGSKGDMYEGGIKV 316

Query: 379 P-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
           P A+ W   ++  PR    ++ +SD  PTL      D   +P+N  IDG+
Sbjct: 317 PCAVYWRNHLK--PRSIDDLVIMSDIFPTL-----CDLVSIPVNHRIDGI 359


>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
 gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
          Length = 477

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 114/345 (33%), Positives = 171/345 (49%), Gaps = 33/345 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G+ D+   GS    TPN++A+A  G+ L   YA PVC+PSRA+LMTG YP         +
Sbjct: 42  GYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGCYPKRALTIPHVL 101

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNGV 209
           +     G+   E  + E ++E GY+T  IGKWHLG  + ++ P  +GF+ ++G  Y N +
Sbjct: 102 FPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGFDYYYGLPYSNDM 160

Query: 210 ISYYDHILSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYATDL---FTKEAVQLIED 261
               D + S+  +   +  G       + RN +     + +  T+L   +T+EA+Q I D
Sbjct: 161 GPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTNYTEEAIQFIRD 220

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              +KP FLYL H A H          P    + F+     +   Y   V+++D SVG V
Sbjct: 221 HQ-EKPFFLYLPHSAVH---------FPMYPGDAFR--GKNSHGLYNDWVEEVDWSVGQV 268

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+  G+ + +++IF SDNG  T         R    N P R  K T +EGG++VP I
Sbjct: 269 LQALKDLGLDQRTLVIFTSDNGGQT---------RFGAVNKPLRAGKATTYEGGMRVPTI 319

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
           +  P        S  ++ + D LPTL   AGG T+     IDG D
Sbjct: 320 VRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGG-TTPTDRKIDGAD 363


>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
          Length = 491

 Score =  145 bits (367), Expect = 4e-32,   Method: Compositional matrix adjust.
 Identities = 122/361 (33%), Positives = 177/361 (49%), Gaps = 37/361 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL   G+    TPN+D +A  G+     Y AQPVC+ SR +LMTG YP   G++G  
Sbjct: 45  GYGDLGCFGAKGQATPNLDRMAAEGVKFERFYVAQPVCSASRMALMTGCYPNRVGIKGAL 104

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
             GA+  G+   E  L E +++ GY+T A GKWHLG    ++ P+  GF+ + G  Y N 
Sbjct: 105 GPGAK-VGISKEETTLAELVKQNGYATAAFGKWHLGD-DPQFLPVRHGFDEYLGLPYSND 162

Query: 209 VISYYDHILSDQYSRTVELNGH------DMRRNLSTAWDTVGE-YATDLFTKEAVQLIED 261
           +  Y+  +++    +  +  G       D  R +     TV +   T  +T+ AV+ I  
Sbjct: 163 MWPYHPELVNLTPEQRKKRRGFPALPLVDGDRIILPEVTTVEQTRLTTWYTQRAVKFINT 222

Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
              DKP  LYLAH   H                  +Y  +     Y  +++++D SVG V
Sbjct: 223 NK-DKPFLLYLAHSMPHV-----------PLFVSDKYKGNSANGVYGDVIEEIDWSVGEV 270

Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
           + AL+  G+ EN+++IF SDNG P + Y   S     GS  P R  K T+WEGGV+ P +
Sbjct: 271 LKALKENGLDENTLVIFTSDNG-PWLSYGTHS-----GSPGPLREGKGTVWEGGVREPFV 324

Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRR 439
             W  +I     V+   M I D LPT+    G   S+LP   IDG + W  LL  T   +
Sbjct: 325 ARWPGKIPSGKVVTEPAMTI-DILPTVAALTG---SKLPDHKIDGRNIW-PLLSATDGAK 379

Query: 440 N 440
           N
Sbjct: 380 N 380


>gi|410447290|ref|ZP_11301386.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
 gi|409979565|gb|EKO36323.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
           bacterium SAR86E]
          Length = 548

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 117/370 (31%), Positives = 168/370 (45%), Gaps = 60/370 (16%)

Query: 92  GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
           G+ND+S H        + T NIDALA +G++    YA    C PSRAS+MTGKYP   G 
Sbjct: 73  GYNDISMHNGGAADGTLQTQNIDALAKSGMLFTRGYAANATCAPSRASIMTGKYPTRFGY 132

Query: 147 QGPPI---------WGAEP----------------------RGVPLTERFLPEYLRELGY 175
           +  P+         W A                        +G+P  E  + E LR+ GY
Sbjct: 133 EFTPVPATGRLIMNWLANESDEELKARIDREVASKIPPFMQQGMPTEEITIAEVLRDAGY 192

Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRN 235
            T  IGKWHLG       PL +GF+     +  +    DH   D  +   +     M   
Sbjct: 193 YTAHIGKWHLG-HAYGMDPLSQGFQDSLSMVGPLYLPEDH--PDVVNAKFDTRIDQMIWG 249

Query: 236 LSTAWDTVGE--------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
           +     T  E        Y TD +T EA+++IE+   ++P FLYL+H A H       L+
Sbjct: 250 IGQYSATFNEGELFAPDKYLTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQ 303

Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
           A +  + Q  ++   N + Y+ M+  LD SVG VI  L+   +   ++IIF SDNG    
Sbjct: 304 ALRSDVEQMSHMEGHNLQVYSGMIAALDRSVGKVIQKLKDLDIHGKTLIIFTSDNGG--A 361

Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
            Y E  +      N PYRG K + +EGG++VP I+  P      + S   +H  D   T+
Sbjct: 362 NYIELDDI-----NKPYRGWKISFFEGGIRVPYIVSWPDEINPAQKSNSAVHHFDIFSTI 416

Query: 408 YTAAGGDTSR 417
            +AAG  +S 
Sbjct: 417 ASAAGIQSSH 426



 Score = 43.5 bits (101), Expect = 0.36,   Method: Compositional matrix adjust.
 Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 7/92 (7%)

Query: 14  EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
           +Y TD +T EA+++IE+   ++P FLYL+H A H       L+A +  + Q  ++   N 
Sbjct: 267 KYLTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRSDVEQMSHMEGHNL 320

Query: 74  RTYAALTKSTTLTL-LIVYGWNDLSFHGSNEI 104
           + Y+ +  +   ++  ++    DL  HG   I
Sbjct: 321 QVYSGMIAALDRSVGKVIQKLKDLDIHGKTLI 352


>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
 gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
          Length = 460

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 121/391 (30%), Positives = 184/391 (47%), Gaps = 63/391 (16%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
            +AA   +  + L    G+ DL  +G   I TPN+D +A  GI   + Y A  VCTPSRA
Sbjct: 23  VFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLDRMAAEGIRFTDFYVAANVCTPSRA 82

Query: 134 SLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
            LMTG++PI +GM G     +      G+P  E  +   L+  GY+T  IGKWHLG    
Sbjct: 83  GLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITIAAALKSKGYATACIGKWHLGH-EL 141

Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA-------W--- 240
           ++ P + GF+  +G               ++S  +E     + +N S++       W   
Sbjct: 142 QFLPTHHGFDYFYGL--------------RFSNDMEPVRGKIPKNASSSLHPKLEWWNSA 187

Query: 241 ----DTVGEYATDL------FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
               D + E  TDL      +T+EA++ I  Q   +P FLY  H   H          P 
Sbjct: 188 LLQNDKILEQPTDLSTLTRRYTEEAIKFIH-QNKKQPFFLYFPHTFPHV---------PL 237

Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
              + F+      R  Y  +V++LD SVG V++ L+++G+ EN+++ F SDNG   +  R
Sbjct: 238 FASDAFK--EHSARGLYGDVVEELDWSVGQVLNTLRKEGLAENTLVFFTSDNGPWLI--R 293

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
           + +     GS  P +  K + WEGG++ PAI W P  +  P ++ +++   D   T    
Sbjct: 294 DLAG----GSAGPLKDGKGSTWEGGMREPAIAWWPG-KIKPAINHELVCSLDLFTTSLCL 348

Query: 411 AGGDTSRLPLN--IDGLDQWSSLLLNTPSRR 439
           AG     LP +  IDGLD    L     S+R
Sbjct: 349 AG---VALPPDRVIDGLDMRPMLFGTGHSQR 376


>gi|434394677|ref|YP_007129624.1| N-acetylgalactosamine-6-sulfatase [Gloeocapsa sp. PCC 7428]
 gi|428266518|gb|AFZ32464.1| N-acetylgalactosamine-6-sulfatase [Gloeocapsa sp. PCC 7428]
          Length = 469

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 120/400 (30%), Positives = 189/400 (47%), Gaps = 55/400 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G  +I TP++D +A  G+   N Y+   VC PSR +LMTG +  H  ++G  
Sbjct: 44  GYGDLGCYGQKQIKTPHLDQMASEGMRFTNFYSGSTVCAPSRCTLMTGYHTGHCRIRGNN 103

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
             GA+   +   +  + + L+  GY+T   GKW LG       P  +GF+  FGYLN   
Sbjct: 104 --GADDISLHAEDITVAQVLKAAGYTTAMFGKWGLGGPDSPGEPHLQGFDEWFGYLN--- 158

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
               H+ +  Y  +       + RN        G Y+ +LFT+ A+  I+ +  D+P FL
Sbjct: 159 ----HVHAHDYYPSY------LWRNQQRVAIAPGTYSHNLFTQAALDFIKAKQ-DEPFFL 207

Query: 271 YLAHLAAHAGN--AGKHLEAPQETINQFQYITDP---NRRTYAAMVKKLDDSVGTVISAL 325
           YL +   HA N    + ++ P +      Y  +P    ++ YAAM+  LD  VG +++ L
Sbjct: 208 YLPYTIPHANNELGSEGMQVPSDA----PYSDEPWPQQQKNYAAMISLLDADVGKILALL 263

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
              G+ E++I+ F SDNG P  E   T+ +  + SN P  G+K  L++GGV+VP I+ W+
Sbjct: 264 AELGLEEDTIVFFSSDNG-PHEEGGATAEF--FDSNGPLNGIKRDLYDGGVRVPMIVRWT 320

Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
             IQ   + + Q   + D+LPT    AG    R                         ID
Sbjct: 321 GTIQPK-QENHQPYALWDFLPTAAEIAGVSAPR------------------------GID 355

Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
           G+    +LL    ++ N +     E+    A+R+  WK V
Sbjct: 356 GVSILPALLGQPANKHNFLYWEFHERGFEQAIRMGKWKAV 395


>gi|449282487|gb|EMC89320.1| N-acetylgalactosamine-6-sulfatase, partial [Columba livia]
          Length = 488

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 118/364 (32%), Positives = 176/364 (48%), Gaps = 42/364 (11%)

Query: 86  TLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT 144
            LL   GW DL   G     TPN+D +A  G++  + Y A P+C+PSRA+L+TG+ P+  
Sbjct: 1   VLLTQMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYTANPLCSPSRAALLTGRLPVRN 60

Query: 145 GMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
           G        A  R          G+P +E  LPE L++ GY+ K IGKWHLG  R ++ P
Sbjct: 61  GFY---TTNAHARNAYTPQDIVGGIPDSELLLPELLKKAGYTNKIIGKWHLGH-RPQFHP 116

Query: 195 LYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATD 249
           L  GF+  FG  N     +D+  + +    R  E+ G    D + +L T    +    T 
Sbjct: 117 LKHGFDEWFGSPNCHFGPFDNRALPNIPLYRDWEMIGRYYEDFKIDLKTGEANL----TQ 172

Query: 250 LFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
           ++ +EA+  I + Q   +P FLY A  A HA                 Q++    R  Y 
Sbjct: 173 IYLQEALDFISKQQASQQPFFLYWAIDATHA-----------PVYASKQFLGTSQRGLYG 221

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             V+++DDS+G ++  LQ+ G+  N+ + F SDNGA  +     S  +  GSN P+   K
Sbjct: 222 DAVREIDDSIGKILKHLQKLGISNNTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGK 276

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG++ PAI W P       VS Q+ ++ D   T  + AG         IDG+D  
Sbjct: 277 QTTFEGGMREPAIAWWPGHIPAGGVSHQLGNVMDLFTTSLSLAGLQPPS-DRQIDGIDLL 335

Query: 429 SSLL 432
            ++L
Sbjct: 336 PAIL 339


>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
 gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
           DSM 18315]
          Length = 461

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 101/308 (32%), Positives = 153/308 (49%), Gaps = 36/308 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS +I TPNID LA  G I  + + A  V +PSR+ ++TG+Y    G +   
Sbjct: 42  GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+P  E  LP  L+  GY T  IGKWHLG       P  +GF++ +G L G  
Sbjct: 101 -LDKPGDGIPDDEELLPALLKRYGYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158

Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           SY YD   SD+    +  + NG  +         +   Y TD    +A Q + +   ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNGQKL---------SFDGYFTDELASKARQFVAES--EQP 207

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             LY++  A H+ N     EA +E + +F+      R+ YAAM+  LD  VG ++  L+ 
Sbjct: 208 FMLYMSFTAPHSPN-----EAAEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +++II F+SDNG  T          N  SN P +G K   +EGG +VP  ++W  +
Sbjct: 260 AGKFDDTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310

Query: 387 IQQNPRVS 394
            +++ R +
Sbjct: 311 FKRDQRFT 318


>gi|390958067|ref|YP_006421824.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390958408|ref|YP_006422165.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390412985|gb|AFL88489.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
 gi|390413326|gb|AFL88830.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
          Length = 470

 Score =  145 bits (367), Expect = 5e-32,   Method: Compositional matrix adjust.
 Identities = 115/363 (31%), Positives = 173/363 (47%), Gaps = 43/363 (11%)

Query: 74  RTYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
           +++AA TK     + I+    G+ D S +G  +  TP+ID LA  G+     YA  PVC 
Sbjct: 29  KSFAAQTKRRPNIIFILADDMGYADTSVYGQRKFKTPHIDRLAAEGVRFAQAYAGAPVCA 88

Query: 130 PSRASLMTGKYPIHTGMQGPPIWGA---------EPRGVPLTE--RFLPEYLRELGYSTK 178
           PSR+ LMTG    HT ++      A         E R   LT+    + +YL   GY T 
Sbjct: 89  PSRSVLMTGLNTGHTRVRDNFALAAGHVGFKGKEEIRRASLTDADHTIADYLARSGYKTA 148

Query: 179 AIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY--SRTVELNGHDMRRNL 236
             GKWHL  +    TP   GF++  G+L    S   +  + +Y   R ++L         
Sbjct: 149 LFGKWHLDGYDPAATPTRHGFQTFKGWLTQTASTQGYFPTQRYDNERLIDLP-------- 200

Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
             A    G Y T + T ++V  I+ Q  D P FLY+A+ A H+         P    +Q 
Sbjct: 201 ENANGKQGRYDTTMITDDSVNYIQ-QSSDTPFFLYVAYDAPHS---------PYIAPDQG 250

Query: 297 QYITDP---NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG---APTVEYR 350
            Y  +P   + + YAAM+  LD+ VG +++AL++KG+ E++I+ F SDNG    PT +  
Sbjct: 251 SYAKEPWDEDEKIYAAMIGHLDNGVGKILAALKQKGIDEDTIVFFASDNGPRSEPTPQQT 310

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
           +  N+  + SN   +G K  ++EGG++ P I   P      +VS   +   D LPT    
Sbjct: 311 KVVNF--FDSNGALQGYKRDMYEGGIREPFIARWPGHIPAGKVSQVPIFFPDVLPTALDL 368

Query: 411 AGG 413
           AG 
Sbjct: 369 AGA 371


>gi|431838572|gb|ELK00504.1| N-acetylgalactosamine-6-sulfatase [Pteropus alecto]
          Length = 456

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 131/412 (31%), Positives = 190/412 (46%), Gaps = 86/412 (20%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
           L L+   GW DL  +G     TPN+D +A  G++  N Y A P+C+PSRA+L+TG+ PI 
Sbjct: 37  LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIR 96

Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
            G     G       P+    G+P TE  LPE L++ GY +K +GKWHLG  R ++ PL 
Sbjct: 97  NGFYTTNGQARNAYTPQEVVGGIPDTELLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLR 155

Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEY-------- 246
            GF+  FG  N     YD               +  R N+     W+ VG +        
Sbjct: 156 HGFDEWFGSPNCHFGPYD---------------NRARPNIPVYRDWEMVGRFYEEFPIDL 200

Query: 247 ------ATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHA-----------------GNA 282
                  T ++ +EA+  I+  Q   +P FLY A  A HA                 G+A
Sbjct: 201 RTGEANLTQIYLQEALDFIKRQQAAQRPFFLYWAIDATHAPVYASKPFLGTSQRGRYGDA 260

Query: 283 --------GKHLEAPQE---TINQFQYITD---------PNRRTYAAMVKKLDDSVGTVI 322
                   GK L   Q+   T + F + T          P +  Y   V+++DDSVG ++
Sbjct: 261 VREIDDSVGKILSLLQDLKITKDTFVFFTSDNGAALISAPKQGRYGDAVREIDDSVGKIL 320

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
           S LQ   + +++ + F SDNGA  +   +       GSN P+   K T +EGG++ PAI+
Sbjct: 321 SLLQDLKITKDTFVFFTSDNGAALISAPKQGRC---GSNGPFLCGKQTTFEGGMREPAIV 377

Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQWSSLL 432
           W P      +VS Q+ ++ D   T  + AG     LP +  IDGLD   S+L
Sbjct: 378 WWPGHIAAGQVSHQLGNVMDLFTTSLSLAG---LELPSDRVIDGLDLLPSML 426


>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
 gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
          Length = 536

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 122/384 (31%), Positives = 183/384 (47%), Gaps = 57/384 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           GW ++   G ++IPTPNID LA  G+ L   Y+  P C PSR  LMTGK+  H  ++G  
Sbjct: 50  GWGEVGCFGQSKIPTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTGKHLGHAEIRGNQ 109

Query: 149 ------PPIWGAEPRGVPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
                 P     +    PL+++ L      ++ GY+T A GKW LG       P  +GF+
Sbjct: 110 QAKVKLPQFTEGQ---HPLSDKALTIARQFQKAGYATGAFGKWGLGPVGSTGEPNRQGFD 166

Query: 201 SHFGYLNGVI--SYYDHILSDQYSRTVE----LNGHDMRRNLSTAWDTV-GE-YATDLFT 252
             FGY    +  SY+   L       V     + GH  +       +   GE YA  L  
Sbjct: 167 EFFGYNCQALAHSYFPKALWKNAESIVNNEKPVPGHKKQPEGEVTMEAYQGENYAPRLIM 226

Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-------------YI 299
            EA+  I D+   +P FLYL     H       ++ P + + +F              Y+
Sbjct: 227 AEALSFI-DRHHQQPFFLYLPFTEPHVA-----MQPPPKIVEEFPVEWDERVYRGDGGYL 280

Query: 300 TDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE-----YRETS 353
             P  R  YAAM++ LD+ VG VI++L++ G+LE ++I+F SDNGA         +   +
Sbjct: 281 PHPRPRAAYAAMIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGA 340

Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM-MHISDWLPTLYTAA 411
           +   + S    +G K +++EGG++VPAI+ W  QI   P  ++    +  DW PTL  A 
Sbjct: 341 DPLFFNSTRELKGFKGSIYEGGLRVPAIVSWPGQIP--PATTINTPSYFPDWFPTLCNA- 397

Query: 412 GGDTSRLPLNIDGLDQWSSLLLNT 435
               ++LPL  +GLD  + L L T
Sbjct: 398 ----TQLPLP-EGLDGVNLLPLLT 416


>gi|410097286|ref|ZP_11292268.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409224604|gb|EKN17536.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 446

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 107/355 (30%), Positives = 175/355 (49%), Gaps = 55/355 (15%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY 123
           ++ P  +  A   K+    ++I+Y    G+ D+   G  +I TPN+D +A +G+   N Y
Sbjct: 12  LSVPLCQAAAKENKNDRPNIVIIYCDDMGYGDIGVTGHPDIKTPNLDRMALDGMRFTNYY 71

Query: 124 -AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEP---RGVPLTERFLPEYLRELGYSTKA 179
            A P  T SR SL+TG+YP+  G +    W   P   RG+   E  + E L+E GY+T  
Sbjct: 72  SASPASTASRYSLLTGRYPVRAGFR----WVLSPDAERGIHPRELTIAELLKEQGYATAI 127

Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-MRRNLST 238
            GKWHLG  ++EY PL  GF+ + G     + Y + ++  +Y     + G+D +R N   
Sbjct: 128 YGKWHLGSTKKEYLPLQNGFDEYVG-----LPYSNDMIPPKYPDIALMCGNDTLRMNPDQ 182

Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY----LAHLAAHAGNAGKHLEAPQETIN 294
           +        T L+T++A+  I+    +   F+Y    + H+  + G A            
Sbjct: 183 S------ELTALYTEKAISFIKKHKKEN-FFVYVPYAMPHVPLYPGKA------------ 223

Query: 295 QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
              +I    R TY   V+++D  VG ++SAL+++G+ +N+I+ FMSDNG P +   E   
Sbjct: 224 ---FIGKSKRGTYGDAVEEIDWGVGEILSALKKEGIDKNTIVWFMSDNG-PWLPRNEEG- 278

Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAILWSP-----QIQQNPRVSLQMMHISDWL 404
               GS   +R  K + WEGGV+VP  +  P     Q+ +    S+ +   S W+
Sbjct: 279 ----GSAGIFRDGKGSTWEGGVRVPCFVSWPDHLNGQVNEEVITSMDVFATSVWM 329


>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
 gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
          Length = 489

 Score =  145 bits (366), Expect = 6e-32,   Method: Compositional matrix adjust.
 Identities = 120/348 (34%), Positives = 173/348 (49%), Gaps = 43/348 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ--- 147
           G+ D+   GS  I TP++D +A  G+ L + Y AQ VC+ SRA L+TG YP   G+    
Sbjct: 54  GYQDVGVFGSPNIKTPHLDRMASEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGIHNAL 113

Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--Y 205
           GP        G+  +E  + E L+  GY T   GKWHLG    E+ P   GF+  FG  Y
Sbjct: 114 GP----GNTHGINSSETTMAEMLKNNGYKTAIFGKWHLGH-HPEFMPNKHGFDEFFGIPY 168

Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA--TDLFTKEAVQLIEDQP 263
            N +  Y+      Q           +  N  T  DT+ E +  T   T+ +V  I ++ 
Sbjct: 169 SNDMWPYHP-----QQGPIFNFPDLPLYEN-ETVIDTLTEQSNLTTQITERSVDFI-NRN 221

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
            D P FLY+ H   H          P     +F+  +D  R  Y  ++ ++D SVG +I 
Sbjct: 222 KDNPFFLYVPHPQPHV---------PLFVSEKFKGKSD--RGLYGDVIMEIDWSVGQIIE 270

Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
           AL++ G+ EN+I+IF SDNG P + Y   +     GS  P+R  K T WEGG + P I+ 
Sbjct: 271 ALKKNGLEENTIVIFTSDNG-PWLSYGNHA-----GSALPFREGKGTAWEGGQREPFIMK 324

Query: 384 SP-QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWS 429
            P +++    + + MM I D LPTL   A    S+LP L+IDG + W+
Sbjct: 325 YPNKLKAGEVIDVPMMAI-DLLPTL---ASLTNSKLPELSIDGKNVWN 368


>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
 gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
          Length = 468

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 117/341 (34%), Positives = 173/341 (50%), Gaps = 36/341 (10%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGP- 149
           G+ D+  +G  +I TPNID LA  G  L + YA  PVC+PSR  L+TG    H  ++G  
Sbjct: 46  GYGDVGVYGQTKILTPNIDLLAREGTKLTDFYAGAPVCSPSRGVLLTGLNTGHATIRGNM 105

Query: 150 PIWG--AEPRGVPLTER--FLPE------YLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
            I G  A  +G     R   LP+       L+  GY+T  IGKWH+  F    TP   GF
Sbjct: 106 AIKGGLAGNKGSDKIRRAGLLPQDSTIGDLLQRSGYTTGLIGKWHVDGFDTLATPQRHGF 165

Query: 200 ESHFGYLNGVISYYDHILS----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           +  +G+L   +SY     S    D + R  +L      +N    +     Y +++ T +A
Sbjct: 166 DYFYGWL---VSYPQTYASTYWPDTWYRDGKLVAVPQNQNGQKNY-----YTSEIITDDA 217

Query: 256 VQ-LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           ++ L + +   KP FL + H   H+      L+AP  TI + +  T  +++TYAAMV  L
Sbjct: 218 IRYLAKHKKSKKPFFLMVNHSNPHS-----PLDAPHNTIYEGKDWTS-DQKTYAAMVTYL 271

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNG---APTVEYRETSNYRNWGSNYPYRGVKNTL 371
           D SVG +   L   G+ +N+I+IF SDNG    PT      S++  + S+ P RG K  +
Sbjct: 272 DSSVGKIRKYLIESGLDKNTIVIFTSDNGPRSEPTTSLTAVSDF--FHSSGPLRGYKRDV 329

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            EGG++VP I+W+    ++  VS    + +D LPTL   AG
Sbjct: 330 TEGGIRVPMIVWNKNRIKSDAVSHTPGYFADILPTLAGIAG 370


>gi|126663704|ref|ZP_01734700.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
 gi|126624287|gb|EAZ94979.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
          Length = 468

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 118/380 (31%), Positives = 186/380 (48%), Gaps = 46/380 (12%)

Query: 80  TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLM 136
           TK   +  ++    G+N+L  +G   I TPNID LA  G+  +N Y    +C PSR +LM
Sbjct: 26  TKKPNIVFILADDMGYNELGSYGGKIIETPNIDQLAKEGMKFSNHYCGSNICAPSRGTLM 85

Query: 137 TGKYPIHTGMQGP---PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           TGK+  H  ++     P  G EP  +P +E  + E L+  GY+T A GKW LG+   E +
Sbjct: 86  TGKHTGHAYIRDNKPLPYEGNEP--IPASEITVAEILKTAGYTTGAFGKWGLGYPASEGS 143

Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRT---VELNGHDMRRNLSTAWDTVGEYATDL 250
           P  +GF+  +GY NG I  +++  S  Y R    VELN      N+   +     Y+ D+
Sbjct: 144 PNNQGFDQFYGY-NGQIHAHNYFTS--YLRKNDLVELNA-----NIDAPYSV---YSADI 192

Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-----ITDPNRR 305
               A++ +E    + P FLY      H      + +   +T+  +       I D +  
Sbjct: 193 IKDRALEFVEVNK-NNPFFLYFCPTLPH----NPYHQPDDKTLEYYAKKTGFPIGDAHSE 247

Query: 306 -----TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
                 YAA+  +LD  VG +++ L+   +L+N++IIF SDNG+     +E  +Y   G 
Sbjct: 248 EFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSALT--KEEDSYLRTGG 305

Query: 361 NYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           +   RG K+ ++EGG+K P I  W  +I      +    HIS +   L T A    ++ P
Sbjct: 306 D--LRGRKSEVYEGGIKSPLIAFWKGKIIPGSSSN----HISAFWDFLPTCAEIVKAKTP 359

Query: 420 LNIDGLDQWSSLLLNTPSRR 439
            NIDG+    +LL  T +++
Sbjct: 360 DNIDGISYLPTLLGKTDNQK 379


>gi|284041117|ref|YP_003391047.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820410|gb|ADB42248.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 486

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 128/429 (29%), Positives = 211/429 (49%), Gaps = 53/429 (12%)

Query: 78  ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLM 136
           A   +  L  +   G+ DLS  G+ +  TPN+D +A  G    N + AQ VC+ SRA+L+
Sbjct: 35  ATPPNVVLFFMDDLGYGDLSVTGALDYTTPNLDKMAAEGTRFTNFLAAQAVCSASRAALL 94

Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
           TG YP   G+ G  +    P G+   E  L E L+E GY+T   GKWHLG   +++ P+ 
Sbjct: 95  TGCYPNRLGLYGA-LGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGD-NKQFLPMQ 152

Query: 197 RGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
           +GF+ ++G  Y + +   +      +Y     ++G++    +    D      T   T++
Sbjct: 153 QGFDEYYGVPYSHDMWPLHPAQAQAKYPPLRWIDGNEPGPEIKDLNDA--GKITGTITEK 210

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           AV  I +    KP FLY+ H   H          P  T  +F+      R  +  ++ +L
Sbjct: 211 AVSFIRNHK-KKPFFLYVPHPLPHV---------PLATSARFK--GQSARGIFGDVLTEL 258

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D SVG +++ L+++G+ +N+++IF+SDNG P + Y + +     GS+  +R  K T +EG
Sbjct: 259 DWSVGQIMNELKQQGLDKNTLVIFISDNG-PWLNYGDHA-----GSSGGFREGKGTSFEG 312

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLL 433
           G +VP ++  P +    RVS +++   D LPT+    G   +RLP   IDG+D W +LL 
Sbjct: 313 GHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCG---ARLPKQRIDGVD-WVALL- 367

Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL---GTQEN 490
               + ++++   D++         R+NS+           AVR   WKLV    G    
Sbjct: 368 ----KGDNSVTPRDKFYYYY-----RKNSL----------EAVRQGDWKLVFAHPGRTYE 408

Query: 491 GTMDGYYGQ 499
           G + G  G+
Sbjct: 409 GFLPGQGGK 417


>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
 gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
          Length = 496

 Score =  145 bits (366), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 120/396 (30%), Positives = 174/396 (43%), Gaps = 56/396 (14%)

Query: 41  LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDL 96
           L    A +G AG+    P  T          N   Y          +++VY    GW D 
Sbjct: 16  LGAGVALSGCAGRRETTPSAT----------NAPDYPGAPLPERPNIVVVYVDDLGWLDT 65

Query: 97  SFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKY-PIH-----TGMQGP 149
            F GS    TPN+DALA  G++  + YA  P C PSRA   TG+Y P H     T     
Sbjct: 66  GFQGSQYYDTPNLDALADEGVVATSGYANAPNCAPSRACFNTGQYTPRHGVGTVTDPPPR 125

Query: 150 PIWGAEPRGVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                     P +ER       + E L + GY+T  +GKWHLG     + PL RGF+ + 
Sbjct: 126 RPEARRLETPPNSERLGTDNATIAEVLSDAGYATGFVGKWHLGEADSRHGPLARGFDRNV 185

Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
           G              D   R     G+     L    D  G Y  D  T +A++ + D  
Sbjct: 186 G-------------GDIVGRPPA--GYFSPYRLPNVTDD-GGYLPDRLTDDAIEFVADHA 229

Query: 264 VD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR---TYAAMVKKLDDSVG 319
               P FL  +  + H       L+AP + ++ ++ +   N +   TY AMV  LD +VG
Sbjct: 230 GGPDPFFLLFSTYSVHG-----PLQAPDDAVDAYRDVDCWNGQCNPTYGAMVSNLDANVG 284

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG---SNYPYRGVKNTLWEGGV 376
            +++AL  +G+ E ++++F SDNG    +Y E    R+W    S  P RG K TL+EGG+
Sbjct: 285 RLLAALDERGIREETLVVFTSDNGG-LGDYDEIGLPRDWKRVTSQGPLRGGKRTLYEGGI 343

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +VP ++  P +    R S   +  +D+ PT    AG
Sbjct: 344 RVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAG 379


>gi|87308971|ref|ZP_01091109.1| Arylsulfatase [Blastopirellula marina DSM 3645]
 gi|87288314|gb|EAQ80210.1| Arylsulfatase [Blastopirellula marina DSM 3645]
          Length = 482

 Score =  145 bits (365), Expect = 7e-32,   Method: Compositional matrix adjust.
 Identities = 118/365 (32%), Positives = 178/365 (48%), Gaps = 36/365 (9%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--- 147
           G+ +L  +G   I TPNID LA  GI   + Y+   VC PSR  LMTGK+P H  ++   
Sbjct: 35  GYAELGCYGQKWIKTPNIDRLAAEGIRFTDFYSGNAVCAPSRCCLMTGKHPGHAYVRNNF 94

Query: 148 ------------GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
                       G   W  + R +   E  + E L++ GY+T AIGKW LG F     P 
Sbjct: 95  AQKKLPAESLALGLTDWTGQ-RPLASEEVTIAEVLKQRGYATAAIGKWGLGQFGTTGDPN 153

Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
            +GF+  +G+ N     ++H     Y + +  N     +  +    T   ++ D FTK A
Sbjct: 154 QQGFDLFYGF-NCQTHAHNH-----YPKFLMRNREKEVQPGNNRGATGETFSQDQFTKVA 207

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAG------NAGKHLEA-PQETINQFQYITDPN-RRTY 307
           ++ I DQ  DKP FLYL     H        +  ++  A P+E      YI  P  R  Y
Sbjct: 208 LEFI-DQNKDKPFFLYLPFAIPHLSIQVPDESTAEYTNAIPEEKYVHKGYIQHPTPRAGY 266

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
           AAM+  +D  +G ++S ++  G+  ++++IF SDNGA T +    S+   + S    RG+
Sbjct: 267 AAMITHMDRDIGKILSKIETLGLDGDTLVIFTSDNGA-TYDRLGGSDSDFFESVANLRGL 325

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
           K +L+EGGV+VP +   P   ++ +VS     + D LPT+   A    ++ P  IDGL  
Sbjct: 326 KGSLYEGGVRVPLVARWPGKIKSGQVSHVPSALWDMLPTI---ADITETKAPQMIDGLSL 382

Query: 428 WSSLL 432
             +LL
Sbjct: 383 APTLL 387


>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
          Length = 500

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 92/267 (34%), Positives = 143/267 (53%), Gaps = 34/267 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           G ND+ ++   E+ TPN+D LA NG+IL + Y  PVC+PSRA+ MTG+Y    G Q  P+
Sbjct: 41  GNNDIGYNNP-EVDTPNLDNLANNGVILESNYVYPVCSPSRAAFMTGRYAHKIGFQRGPV 99

Query: 152 WGAEPRGVPLTERFLPEYL-RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
              +P  +    + + E L    GY+   IGKWHLG+ +   TP  RGF+S +G+  G  
Sbjct: 100 EHKQPAYIESNYKTVAEKLTTNYGYAAHMIGKWHLGYCKDAVTPTNRGFDSFYGFYGGQE 159

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW--------DTVGEYATDLFTKEAVQLIEDQ 262
           +YY +  S +Y         D R NL+           + V  Y+T  + K A++++ + 
Sbjct: 160 NYYTYT-SARYK--------DFRDNLTAVTPQNPNYPREDVDGYSTFEYKKRAIEIVGNH 210

Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ---YITD-------PNRRTYAAMVK 312
               PLFLYLA  A H+      L+APQE  +++      TD        +R T+ AMV 
Sbjct: 211 DKSVPLFLYLAFQAPHS-----PLQAPQELKDKYPGDWTYTDWDGSTKTTSRATFYAMVT 265

Query: 313 KLDDSVGTVISALQRKGMLENSIIIFM 339
            +D +VG V + ++ +G+L+N+II+ +
Sbjct: 266 AVDQAVGEVYTEMETQGLLDNAIIVLL 292



 Score = 42.7 bits (99), Expect = 0.59,   Method: Compositional matrix adjust.
 Identities = 26/82 (31%), Positives = 44/82 (53%), Gaps = 17/82 (20%)

Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLL 454
           + ++H SDWLPT  + AGG ++ L  +ID                 ++IDG+DQ   LL 
Sbjct: 289 IVLLHASDWLPTFISLAGG-SADLTTDID----------------LTSIDGVDQTQMLLN 331

Query: 455 NTPSRRNSVLINIDEKKRTAAV 476
           N  S R+ +++NI + +R + +
Sbjct: 332 NGESARSKIILNIYDLERPSTI 353


>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
 gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
          Length = 442

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 110/331 (33%), Positives = 158/331 (47%), Gaps = 39/331 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D SF  +    TPNID +A  G    N Y    VC+PSRA L++G      G     
Sbjct: 35  GYADFSFQSTKLTTTPNIDRIAQEGAKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNY 94

Query: 151 IWGAE------PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
           I G E        G+P   + + EYL+ LGY+T A GKWH GF   E+ P  +GF+  +G
Sbjct: 95  IKGVEYSIQQHEYGIPHNIKTIGEYLQPLGYATAAFGKWHEGF-AEEFQPEQKGFDHFWG 153

Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +L G   Y+         + +     D+  +      T   Y TD  T EA+        
Sbjct: 154 FLWGANHYHT-------GKAI-----DVIDDGKPVDPTTIPYMTDAITDEALSFASAHEE 201

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVI 322
           D P FLY+A+ A H+      LEA  E I  ++  +  DP R   AAM   LD++VG + 
Sbjct: 202 D-PFFLYIAYNAVHSP-----LEAKPEHIALYEGKFPDDPKREILAAMTHSLDENVGKIF 255

Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AI 381
           + L+  G L+N+II F++DNG      +  S       N P +G K  L+EGG++VP A+
Sbjct: 256 NRLEAMGELDNTIIFFINDNGGS----KHIS-----ADNSPLKGYKGQLYEGGIRVPFAV 306

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            W  +I +   V   +    D LP++  A G
Sbjct: 307 RWPGKI-KGKTVCHSITSSLDILPSILDAVG 336


>gi|417305609|ref|ZP_12092563.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327538042|gb|EGF24732.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 1004

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 139/484 (28%), Positives = 209/484 (43%), Gaps = 68/484 (14%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
           ++  +R   A  TK   + +L    G+ DLS  G+  + TP ID +A  G  L + Y A 
Sbjct: 549 VSPADRAKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 608

Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
           PVCTPSRA LMTG YP    M     +G     +P+G+   E  + E L   GY T   G
Sbjct: 609 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLNTAGYRTGMFG 668

Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
           KWHLG  + E+ P  +GF+  FG        Y H +   + R    +   +   + +   
Sbjct: 669 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNXYHFPSLPLLQNDTVI 721

Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
             D   ++ T   T++AV  IE    D+P FLYL H   HA            A   + A
Sbjct: 722 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEDVADDVIAA 780

Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
            ++      Y T  N   +   + ++D SVG ++ AL+  G+ E ++++F SDNG P   
Sbjct: 781 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 835

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
             + + Y + G     RG K T +EGG++ P ++ W  QI      + ++M   D LPT 
Sbjct: 836 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 889

Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
              AG                       P+ R   IDG D W +L   T +  ++   + 
Sbjct: 890 AKLAGA--------------------AIPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 926

Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
               + AAVR   WKL +    NG     Y        KV ++  N  V  K  + L+  
Sbjct: 927 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQRQLKDF 982

Query: 526 SQNI 529
           + +I
Sbjct: 983 AADI 986



 Score =  105 bits (263), Expect = 5e-20,   Method: Compositional matrix adjust.
 Identities = 119/462 (25%), Positives = 186/462 (40%), Gaps = 79/462 (17%)

Query: 75  TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
           T  A  +   + L+ V   G+ D+  +G+ ++ TPNID LA  G    + + A  VCTPS
Sbjct: 24  TSVAAERPPNVVLIFVDDLGYGDVGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPS 83

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
           R  L+TG+YP+   M G  IWG  P    L      + + +  +  GY+T  +GKWHLGF
Sbjct: 84  RYGLLTGQYPVRA-MGGKGIWGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKWHLGF 142

Query: 188 FRREYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR 233
                         P   GF+ +FG   +N    Y     D I     S  +   G  + 
Sbjct: 143 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVS 202

Query: 234 --------------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHL 275
                            S A      Y  +    L T+ AV+ I ++  D+P FLY A  
Sbjct: 203 PTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFFLYFATP 261

Query: 276 AAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSI 335
             H          P     +F+  +      Y   V +LD  VG ++ +L+  G+ +N++
Sbjct: 262 NIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTL 310

Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVS 394
           ++F SDNGA        +       N    G K  +WEGG +VP I  W  +I+      
Sbjct: 311 VLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK------ 364

Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLL 453
                           AG  + +L   +D    +S+L     PS      D ++   +LL
Sbjct: 365 ----------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALL 406

Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
            +      + L+    + R  A+R   W L +G + +G  +G
Sbjct: 407 DDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 447


>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
 gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
          Length = 480

 Score =  145 bits (365), Expect = 8e-32,   Method: Compositional matrix adjust.
 Identities = 111/339 (32%), Positives = 163/339 (48%), Gaps = 48/339 (14%)

Query: 78  ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
           A++    L ++I    G+ +    G+ EIPTP IDALA +G+   + Y     C+PSRA 
Sbjct: 30  AVSSQPNLVVIIADDLGYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAG 89

Query: 135 LMTGKYPIHTGMQGPPIWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
            M+G+Y    G    P          G+P  ++   E+L+  GY T  IGKWHLG  R  
Sbjct: 90  FMSGRYQSRFGYDLNPTGERNNHPNAGLPPQQKTFVEHLQSAGYHTSLIGKWHLGT-RPP 148

Query: 192 YTPLYRGFESHFGYLNGVISY-----YDHI---LSDQY--SRTVELNGHDMRRNLST--- 238
             P  +GF+  FG+L+    Y     Y+++   L D    +   E N   +R N +    
Sbjct: 149 QVPTSKGFDRFFGFLHEGHFYVPGPPYENVWTMLRDNSLPAGQFETNQRTIRGNYARINE 208

Query: 239 -AWDTV------GE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
            A+DT       GE      Y TD  T +AV  I  Q    P  + +++ A H+      
Sbjct: 209 PAYDTGNPVLDGGEPIDDWNYLTDTITDKAVDTIS-QAASNPFAMVVSYNAVHS-----P 262

Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
           ++A  E      +I DP RR +A M+  LD  VG +I  L R+ + + ++++F SDNG P
Sbjct: 263 MQASLEDHAAMDHIADPQRRIFAGMLIALDRGVGRIIKKLDRQKLRQETLVVFFSDNGGP 322

Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
           T E           SN P RG K +L+EGGV++P I WS
Sbjct: 323 TAELTS--------SNAPLRGGKGSLYEGGVRIPMI-WS 352


>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
 gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
          Length = 495

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 108/336 (32%), Positives = 153/336 (45%), Gaps = 42/336 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
           GW DLS +GS  I TPN+D LA  G+   + Y+    C+P+R SL TG+YP  T G    
Sbjct: 91  GWADLSSYGSPHIKTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPARTRGGLAE 150

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
           PI   +  G+  T   L   L+E GY+T  IGKWH G+   +Y+P   G+++ FG   G 
Sbjct: 151 PI-ADKSAGLDPTHPTLASLLKEAGYATALIGKWHAGYLP-DYSPTKSGWDTFFGNFGGA 208

Query: 210 ISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
           + YY  +      D Y    E    D+R            Y T + T+ A + +      
Sbjct: 209 LEYYSKLGLGGEYDLYEGDAEYK--DLR------------YYTRIITERASEYVSRDHGG 254

Query: 266 KPLFLYLAHLAAH-----AGNAGKHLEAPQET----INQFQYITDPNRRTYAAMVKKLDD 316
           +P  L L     H      G+  +  E  +           +    + R Y  MV+ LD 
Sbjct: 255 RPWLLNLNFTTPHWPWIADGDTAESAEIVRRIKAGDARALWHQDGGSVRKYKQMVEDLDR 314

Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
           SVG V+ AL+R G  EN++++F SDNG     Y           N+P  G K +L EGG+
Sbjct: 315 SVGEVLQALRRSGQEENTLVVFSSDNGGERFSY-----------NWPLSGNKASLQEGGI 363

Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           +VP I+  P      +VS   +   DW  TL   AG
Sbjct: 364 RVPNIVRWPARLDGGQVSRVPVFSPDWTATLLEVAG 399


>gi|32471441|ref|NP_864434.1| arylsulfatase [Rhodopirellula baltica SH 1]
 gi|32443282|emb|CAD72113.1| Arylsulfatase [Rhodopirellula baltica SH 1]
          Length = 538

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 113/375 (30%), Positives = 173/375 (46%), Gaps = 46/375 (12%)

Query: 70  DPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
           +PN +       +  L +    G+ +L  +G  +I TP +D LA  GI L N Y+   VC
Sbjct: 63  EPNAKDATVSRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVC 122

Query: 129 TPSRASLMTGKYPIHTGMQ--GPPIWGAEPRG-----------VPLTERFLPEYLRELGY 175
            PSR  LMTGK+P H  ++  G P      R            +P+ E  + EYL+ +GY
Sbjct: 123 APSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGY 182

Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMR 233
            T A GKW LG F     P  +GF+  +G+       ++Y + L     + V+  G+D  
Sbjct: 183 RTGAFGKWGLGHFGTTGDPNEQGFDLFYGFNCQRHAHNHYPNFLWRNRVKEVQ-PGNDRT 241

Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQ 290
            +  T       Y+ D F  EA + I     +   +P F YL     H       ++ P+
Sbjct: 242 LHGET-------YSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLS-----IQVPE 289

Query: 291 ETINQFQ------------YITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
           E ++ +             Y+  P  R  YAAMV ++D+ VG V+  +   G+ EN++I+
Sbjct: 290 EEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQVVDLVDSLGLGENTLIM 349

Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQM 397
           F SDNG PT +    S+   + S    +G+K  L EGG++VP I     +    R S  +
Sbjct: 350 FTSDNG-PTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWI 408

Query: 398 MHISDWLPTLYTAAG 412
               D+LPT+  AAG
Sbjct: 409 GAWWDFLPTITDAAG 423


>gi|449138176|ref|ZP_21773471.1| arylsulfatase [Rhodopirellula europaea 6C]
 gi|448883200|gb|EMB13738.1| arylsulfatase [Rhodopirellula europaea 6C]
          Length = 516

 Score =  145 bits (365), Expect = 9e-32,   Method: Compositional matrix adjust.
 Identities = 115/371 (30%), Positives = 174/371 (46%), Gaps = 49/371 (13%)

Query: 80  TKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
           T S    +LIV    G+++L  +G  +I TP +D LA  GI L N Y+   VC PSR  L
Sbjct: 48  TASRPNIVLIVADDLGYSELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCCL 107

Query: 136 MTGKYPIHTGMQ--GPPIWGAEPRG-----------VPLTERFLPEYLRELGYSTKAIGK 182
           MTGK+P H  ++  G P      R            +P+ E  + EYL+ +GY T A GK
Sbjct: 108 MTGKHPGHAYVRNNGDPKIDPAVREELKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFGK 167

Query: 183 WHLGFFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
           W LG F     P  +GF+  +G+       ++Y   L     + V+  G+D   +  T  
Sbjct: 168 WGLGHFGTTGDPNEQGFDLFYGFNCQRHAHNHYPKFLWRNRVKEVQ-PGNDRTLHGET-- 224

Query: 241 DTVGEYATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
                Y+ D F  EA + I     +   +P F YL     H       ++ P+E ++ ++
Sbjct: 225 -----YSQDQFVNEACEFIRQSVAEDTTQPFFAYLPFAVPHLS-----IQVPEEEVDAYE 274

Query: 298 ------------YITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
                       Y+  P  R  YAAMV ++D+ VG ++  +   G+ EN++I+F SDNG 
Sbjct: 275 GVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLIDSMGLGENTLILFTSDNG- 333

Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWL 404
           PT +    S+   + S    +G+K  L EGG++VP I     +    R S  +    D+L
Sbjct: 334 PTYDRLGGSDSDYFDSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFL 393

Query: 405 PTLYTAAGGDT 415
           PT+  AAG D 
Sbjct: 394 PTITDAAGVDV 404


>gi|294633612|ref|ZP_06712170.1| twin-arginine translocation pathway signal [Streptomyces sp. e14]
 gi|292830254|gb|EFF88605.1| twin-arginine translocation pathway signal [Streptomyces sp. e14]
          Length = 490

 Score =  145 bits (365), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 157/334 (47%), Gaps = 38/334 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
           GW DLS +G+  I TPN+D LA  G+   + Y+    C+P+R SL TG+YP  T G    
Sbjct: 85  GWADLSCYGAPHIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPGRTKGGLAE 144

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
           PI G+  +G+      +   L+  GY+T  IGKWH G+   +Y+P   G++  FG   G 
Sbjct: 145 PI-GSRTQGLDPHHPTVASLLKRAGYATALIGKWHCGWLP-DYSPTKSGWDEFFGNFGGA 202

Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
           + Y+        S+  +L  +D+    +T  D    Y T + T+ AV+ +  +  D+P  
Sbjct: 203 LEYF--------SKLGQLGDYDLYEGDATYQDL--RYYTTVLTERAVEYVGRRH-DRPWL 251

Query: 270 LYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
           L L     H           A      + A    +    +    +   Y  MV+ LD S+
Sbjct: 252 LNLNFTTPHWPWLTEDDEETAAGIAADIRA-GNVLGALMHTDGGSVEKYKEMVENLDASI 310

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G V++AL+R G  EN++++F SDNG     Y+           +P  G K+ L EGG++V
Sbjct: 311 GEVLAALRRSGQEENTVVVFASDNGGERFSYQ-----------WPLSGGKSELLEGGIRV 359

Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           P +L  P      +VS   ++  DW  TL    G
Sbjct: 360 PTVLRWPAAVDGHQVSRTPVYSPDWTATLLELGG 393


>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
 gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
           CL03T12C32]
          Length = 461

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 104/308 (33%), Positives = 156/308 (50%), Gaps = 36/308 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F G+ +I TPNID LA  G I  + + A  V +PSR+ ++TG+Y    G +   
Sbjct: 42  GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+P  E  LP  L+   Y T  IGKWHLG       P  +GF++ +G L G  
Sbjct: 101 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158

Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           SY YD   SD+    +  + NG    R LS  +D+   Y TD    +A Q + +   ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNG----RKLS--FDS---YFTDELASKAQQFVTES--EQP 207

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             LY++  A H+ N     EA +E + +F+      R+ YAAM+  LD  VG ++  L+ 
Sbjct: 208 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +N+II F+SDNG  T          N  SN P +G K   +EGG +VP  ++W  +
Sbjct: 260 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310

Query: 387 IQQNPRVS 394
            +++ R +
Sbjct: 311 FKRDQRFT 318


>gi|149196941|ref|ZP_01873994.1| putative arylsulfatase [Lentisphaera araneosa HTCC2155]
 gi|149140051|gb|EDM28451.1| putative arylsulfatase [Lentisphaera araneosa HTCC2155]
          Length = 469

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 109/370 (29%), Positives = 177/370 (47%), Gaps = 61/370 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DLS +G  +  TPNID +   G++  + Y+   VC PSRA+LMTGK+  H  ++G  
Sbjct: 32  GYGDLSLYGQKKFSTPNIDRIGKEGMVFTDHYSGSTVCAPSRAALMTGKHSGHGLVRGNY 91

Query: 149 ---PPIWGAEPRGVPL--TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
              P  +G E   +PL   +  L E ++  GY+T  IGKW +G       P  +GF+  +
Sbjct: 92  EVGPHGFGGE---LPLRPEDVSLAEVMKSAGYATGLIGKWGMGMDGTTGEPRKKGFDYSY 148

Query: 204 GYLNGVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
           G+LN   +++   ++I  +     +  N  D R          G Y +D F ++ ++ +E
Sbjct: 149 GFLNQAHAHHYYPEYIYENGEKLMIPENKDDAR----------GLYISDTFAEKGIEFVE 198

Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----------------------- 297
           +   DKP FL+ A +  HA      L  P +++N+F+                       
Sbjct: 199 ENK-DKPFFLFWAFVTPHA-----ELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFG 252

Query: 298 -YIT-DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
            Y + D  R  ++ M+ +LD  VG +   L+  G+ +N+II+F SDNG P  E     ++
Sbjct: 253 VYASQDHPRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNG-PHKEGGADPDF 311

Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
             + SN    G K  L EGG++VP ++  P + +    S       D +PT+   A  D+
Sbjct: 312 --FDSNAELTGYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEIANTDS 369

Query: 416 SRLPLNIDGL 425
              P +IDGL
Sbjct: 370 ---PEDIDGL 376


>gi|372208819|ref|ZP_09496621.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 459

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 118/382 (30%), Positives = 180/382 (47%), Gaps = 71/382 (18%)

Query: 81  KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMT 137
           K T + L +V   GW D S  GS+   TP++D L  +G++  N YA   VC+P+RAS++T
Sbjct: 12  KKTNVILFLVDDMGWTDASCLGSDLYKTPHLDKLVSDGMMFTNGYASSTVCSPTRASIVT 71

Query: 138 GKYPIH-------TGMQGP------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
           GKYP          G Q P      P W  +   + + E  + E  ++  Y T  +GKWH
Sbjct: 72  GKYPASLRLTDWIEGHQKPFAKMKIPKWNQK---LDIEEVTIAEMFKKNHYKTIHLGKWH 128

Query: 185 LGFFRREYTPLYRGFESHFG-------YLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
           LG    +Y P ++GF+ + G         +G   Y+       Y      NG+       
Sbjct: 129 LGE-DPKYWPEHQGFDINIGGYSAGSPKADGGQGYFS-----PYHNPRMSNGNK------ 176

Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
                 GEY TD    EAV  I+     KP F+ L     H       L+A +E + +++
Sbjct: 177 ------GEYLTDRLANEAVSFIKKNKT-KPFFMNLWLYNVHMP-----LQAKKEVVQKYK 224

Query: 298 YITDP----NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA---PTVEYR 350
              +     N  TYAAM++ +DD++G V+ AL+  G+  N+I++F SDNG    PT    
Sbjct: 225 SKINKGMTHNNATYAAMIEHMDDALGAVVQALKETGLENNTIVVFHSDNGGHLKPT---- 280

Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
                    SN P R  K  L+EGGV+VP +   P   +   VS ++   +D +PTL + 
Sbjct: 281 ---------SNAPLRSGKGDLYEGGVRVPLVFKWPLKIKAGAVSNELAISADIMPTLLSL 331

Query: 411 AGGDTSRLPLNIDGLDQWSSLL 432
              +  ++P  +DG+D   +LL
Sbjct: 332 V-NNKYKMPKGVDGIDLSPALL 352


>gi|227540350|ref|ZP_03970399.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
 gi|227239674|gb|EEI89689.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
          Length = 473

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 130/430 (30%), Positives = 203/430 (47%), Gaps = 67/430 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G  +I TPNID LA  G+     YA   VC PSR+SLMTG++  HT ++G  
Sbjct: 40  GYGDLGAYGQKKIQTPNIDRLAAEGMKFEQFYAGTSVCAPSRSSLMTGQHTGHTYIRGNR 99

Query: 151 IWGAEPRGV-PLTE--RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               EP G  PL +  +   + L++ GY+T A GKW LG       P  +GF+  +GY  
Sbjct: 100 --EIEPEGQQPLADSVQTFVQLLQKAGYATGAFGKWGLGMVGTTGAPDQKGFDEFYGYNC 157

Query: 208 GVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
              S+     H+  +   + VEL G+D++  +        +YA  L  ++ ++ I D   
Sbjct: 158 QRQSHRYFPTHLWHN--DQRVELQGNDLKNKV--------QYAPALIQEQTLRFI-DAHK 206

Query: 265 DKPLFLYLAHLAAHAGNAG---KHLE----------------APQETINQFQYITDPNRR 305
           D+P FL++  +  HA  +G   +H +                 P  T+  +  +  P R 
Sbjct: 207 DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFEETPHKGNDYGPGATVTGYASVEKP-RA 265

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNYRNWGSNYPY 364
           T+A+MV ++D  VG V+  L   G+ EN+I+IF SDNGA    +RE  ++   + S+   
Sbjct: 266 TFASMVSRMDAYVGQVLKKLDDLGLTENTIVIFSSDNGA----HREGGADPVFFNSSGGL 321

Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
           RG K  L+EGG++ P I+  P   ++   S       D +PTL    G   +R     DG
Sbjct: 322 RGYKRDLYEGGIRTPFIVKWPHKVKSHSTSEFTGAFWDVMPTLLDITGAGKARY---TDG 378

Query: 425 LDQWSSLLLNTPSRRNS-------NIDGLDQ------WSSLLLNTPSRRNSV--LINID- 468
           +    +LL  T  ++           DG  Q      W ++ L   +++ +V  L N+  
Sbjct: 379 VSLVPTLLGKTNEQQQHPYLYWEFQEDGGRQAVRAGDWKAVKLQVKNKQKTVTELYNLKK 438

Query: 469 ---EKKRTAA 475
              EKK  AA
Sbjct: 439 DPAEKKNVAA 448


>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
 gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
           5305]
          Length = 749

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 124/417 (29%), Positives = 187/417 (44%), Gaps = 62/417 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G+ E+ TP IDALA  G    + YA  P+C+PSRA L+TG YP   G     
Sbjct: 40  GYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTGCYPRRVGNH--- 96

Query: 151 IW---GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
           IW        G+   E  L E   + GY+T  IGKWHLG F   + P  +GF+ +FG L+
Sbjct: 97  IWVHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLG-FHEPFLPQNQGFDHYFGLLH 155

Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
            +         +Q    +  N   ++R    A        T  +T EA+  +E Q  D+P
Sbjct: 156 NLDPVETVYFEEQGGVPLLRNDQVVQRPADPA------ELTKQYTDEAISWME-QHRDQP 208

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
            FLYL H   H          P    ++F+     N   Y   +++LD ++G ++  L+R
Sbjct: 209 FFLYLPHTMLH---------NPLGVSDEFR--GSSNWGEYGDAIQELDHNMGRLMQTLKR 257

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
             + +N+++++ SDNG       +             +G K + WE G++VPAI W P +
Sbjct: 258 LEIDKNTVVVYASDNGRGPGRTDQQK----------LQGRKLSTWEAGIRVPAIAWGPGL 307

Query: 388 Q-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD----------------QW 428
             Q+  VS  ++   DW PTL + AG +   +P +  IDG D                Q 
Sbjct: 308 GLQSGAVSSTVVSAMDWYPTLASLAGFE---VPADRVIDGRDFSPLLKGESKIVPPAGQK 364

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
            S+    P +R+ N  G  +W  L+       +        +   AAVR   WKL L
Sbjct: 365 LSINAAVPLQRSWNPAG--EWKDLV--NRYEYSEAFFYHGAQGELAAVRWRDWKLFL 417


>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
 gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
           18658]
          Length = 590

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/334 (30%), Positives = 156/334 (46%), Gaps = 51/334 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW DLS HG+  + TPNID+LA +G +    Y  PVC P+RA  +TG+Y    G++G   
Sbjct: 35  GWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGRYHPRGGVRGVTS 94

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
            G     + L E+ + E  +  GY+T A GKWH G  +  Y P  RGF+ ++G+ +G   
Sbjct: 95  GGER---LDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGFDEYYGFTSGHWG 150

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            Y     +   R V+ NG                + TD  T  A+  I+    D+P F Y
Sbjct: 151 EYFDPPLEHNGRPVQGNG----------------FITDDLTDHAISFIKASK-DRPFFCY 193

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYIT------DPNR------RTYAAMVKKLDDSVG 319
           L     H+      ++ P    ++F+         D  R      R   AM + +D++VG
Sbjct: 194 LPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRAALAMCENIDENVG 248

Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
            V+  L+   + + +I+++ SDNG             +W  N   RG K +  EGGV+ P
Sbjct: 249 RVLKTLEDLSLDQKTIVLYFSDNGP-----------NSWRWNGGMRGRKGSTDEGGVRSP 297

Query: 380 AIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
            ++ W  QI+   RV+ ++    D LPTL   AG
Sbjct: 298 LLIRWPKQIRPETRVA-KISAAIDLLPTLTDLAG 330


>gi|154493323|ref|ZP_02032643.1| hypothetical protein PARMER_02660 [Parabacteroides merdae ATCC
           43184]
 gi|423346574|ref|ZP_17324262.1| hypothetical protein HMPREF1060_01934 [Parabacteroides merdae
           CL03T12C32]
 gi|423723129|ref|ZP_17697282.1| hypothetical protein HMPREF1078_01342 [Parabacteroides merdae
           CL09T00C40]
 gi|154086533|gb|EDN85578.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
 gi|409219725|gb|EKN12685.1| hypothetical protein HMPREF1060_01934 [Parabacteroides merdae
           CL03T12C32]
 gi|409241554|gb|EKN34322.1| hypothetical protein HMPREF1078_01342 [Parabacteroides merdae
           CL09T00C40]
          Length = 483

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 128/426 (30%), Positives = 196/426 (46%), Gaps = 74/426 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G  +  TPNID LA +G+     Y+   V  PSR+ L+TG +  HT ++G  
Sbjct: 39  GYGDLSCYGQEKFETPNIDKLAQSGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTPVRGN- 97

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +P G   +P   R + E +++ GY T A GKW LG+      P  +G ++ +GY  
Sbjct: 98  -LELDPEGQFPLPDDARTIFEVMKDAGYKTSAFGKWGLGYIGSTGDPKNQGCDTFYGYNC 156

Query: 208 GVI--SYY-DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
            ++  SYY DH+  +   + VEL  +    NL+  +   G Y+ DL   +A++ +++   
Sbjct: 157 QLLAHSYYPDHLWDN--DKRVELEDN----NLNVQYGK-GTYSQDLIHGKALEYLDNMNP 209

Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQ-FQYI--------TDPN------------ 303
           D+P F++   +  HA      L  P+++I Q F+ +        T+P             
Sbjct: 210 DEPFFMWYPTIIPHA-----ELIVPEDSIIQKFRGMYPEKPYKGTEPGSPAFRKGGYCSQ 264

Query: 304 ---RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
                T+AAMV +LD  VG +I  L+ KG+ +N+IIIF SDNG P +E     ++ N  S
Sbjct: 265 FHPHATFAAMVYRLDVYVGQIIQKLKDKGLYDNTIIIFASDNG-PHMEGGADPDFFN--S 321

Query: 361 NYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
           N  YRG K  L+EGG++VP I+ W   +Q N      M    D LPT             
Sbjct: 322 NGIYRGYKRDLYEGGIRVPMIISWPGHVQPNTETDF-MCSFWDVLPTFEE---------- 370

Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL-INIDEKKRTAAVRL 478
                        +  P  +   +DG+     L      + +  L     E     AVR 
Sbjct: 371 -------------IIHPKAKQKEMDGVSMLPLLENRKGQKEHEFLYFEFQELNGRQAVRK 417

Query: 479 DSWKLV 484
             WKLV
Sbjct: 418 GPWKLV 423


>gi|47216038|emb|CAG11369.1| unnamed protein product [Tetraodon nigroviridis]
          Length = 474

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 118/406 (29%), Positives = 184/406 (45%), Gaps = 71/406 (17%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G     TPN+D LA  G+   + Y   PVC+PSRASL+TG+Y   +G+    
Sbjct: 33  GFGDLGCYGHPTSLTPNLDGLAAGGLRFTDFYCTSPVCSPSRASLLTGRYQTRSGVYPGV 92

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           ++     G+PL E  + E L+  GY+T A+GKWHLG   +  T      +  FG L  V 
Sbjct: 93  LYPGSRGGLPLNETTIAEVLKPRGYATAAVGKWHLGGPCQNLTCFPPDVKC-FG-LCDVG 150

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
           +    ++ D+  +   +N  D+ +  S       ++A D  T  A +        +P FL
Sbjct: 151 TVTVPLMHDEVIKQQPVNFLDLEKAYS-------DFAKDFITTSAKR-------KQPFFL 196

Query: 271 YLAHLAAH------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
           Y      H       G AGK L  P                 +   + + D ++G++++ 
Sbjct: 197 YFPSHHTHYPQYAGPGAAGKSLRGP-----------------FGDALLEFDQTIGSLLAT 239

Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
           L+R G++ N++I F SDNG       E       G+  P R  K T +EGG++ PAI  W
Sbjct: 240 LERTGVINNTLIFFTSDNGP------ELMRMSRGGNAGPLRCGKGTTYEGGMREPAIAYW 293

Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSN 442
              IQ  P V+ +M    D LPT  + AG   ++LP + +DG+D  + L     S+R + 
Sbjct: 294 QGLIQ--PGVTHEMASTLDILPTFASLAG---AKLPQVMLDGVDMTNILFSQGKSKREAM 348

Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
           +             PS +N +           A+RL+ +K    TQ
Sbjct: 349 M--------FYPTDPSEKNGLF----------AIRLEKYKAHFYTQ 376


>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
 gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
          Length = 426

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 111/339 (32%), Positives = 155/339 (45%), Gaps = 48/339 (14%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
           GW DLS +G+ EI TPN+D LA  G+   + Y+    C+P+R SL TG+YP  T G    
Sbjct: 23  GWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPGRTKGGLAE 82

Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
           PI      G+  T   L   LR+ GY T  IGKWH G+   +Y+P   G+E  FG   G 
Sbjct: 83  PI-ADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCGYLP-DYSPTRSGWEEFFGNFGGA 140

Query: 210 ISYYDHILS----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
           + YY  + S    D Y    E    D+R            Y T + T+ AV+ +      
Sbjct: 141 LEYYSKLSSGGEYDLYEGDAEY--RDLR------------YYTRILTERAVEYVGRDHGR 186

Query: 266 KPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDPNRRT-----------YAAMVKK 313
            P  L L     H    A    E   E + +   I   +RR            Y  +V+ 
Sbjct: 187 TPWLLNLNFTTPHWPWIADGDEETSAEIVRR---IKAGDRRALWHQDGGSLAKYRQLVED 243

Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
           LD SVG V++AL+R G  E++++ F SDNG     Y           N+P  G K +L +
Sbjct: 244 LDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY-----------NWPLSGNKGSLLD 292

Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           GG++VP+IL  P   +  +VS   ++  DW  TL    G
Sbjct: 293 GGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGG 331


>gi|307726728|ref|YP_003909941.1| sulfatase [Burkholderia sp. CCGE1003]
 gi|307587253|gb|ADN60650.1| sulfatase [Burkholderia sp. CCGE1003]
          Length = 527

 Score =  144 bits (364), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 130/440 (29%), Positives = 196/440 (44%), Gaps = 62/440 (14%)

Query: 92  GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
           GW DL  +G   +   PTPN+D +A  G+   N Y Q  CT  RAS +TG+ PI T +  
Sbjct: 49  GWGDLGSYGGGTMRGAPTPNLDRMAAEGMRFLNYYGQASCTAGRASFITGRQPIRTSLSS 108

Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF---GY 205
             + G +P G+      + E+L++ GY+T  +GKWHLG  + E  P   GF+  +    Y
Sbjct: 109 VLVPG-DPNGLTKQTPTVAEFLKKAGYTTVQLGKWHLG-DKPENYPTAHGFDEMYDMLAY 166

Query: 206 LNGVISYYDHIL-SDQYSRTVELNGHDMRRNLSTAWDTVGE---------------YATD 249
             GV +Y +  L +D  +          R N+S      G+                A D
Sbjct: 167 YAGVYAYSNRDLHADWPAHDAAFQKVWDRLNVSMVEQKAGQPVKVVKQKFTYEDLATADD 226

Query: 250 LFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
               +AV  I+    D+ P F+YL  L  H  N      +P+           P    Y 
Sbjct: 227 NMRAKAVDWIKAHAKDQHPFFMYLNFLKVHNPNN----PSPRWKGKS------PGGGNYL 276

Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
             + +LDD+ G ++ A++  G+ EN+++I+ +DNGA  V+    + Y       P+RG K
Sbjct: 277 DSLMELDDNSGQIVQAIRDLGIAENTLVIWTTDNGA-WVDAWPDAGYT------PFRGEK 329

Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
            T +EGG +VPAI W P   +   V+  M    DW PT + + GG+T   P   +  D  
Sbjct: 330 GTPFEGGFRVPAIAWWPGHIKPGSVNTDMFSHMDWWPT-FASLGGET---PPPHEWKDNE 385

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
              ++          DG+D   SLL   P +R   +      +   AVR+ S+K V   Q
Sbjct: 386 GKPII---------FDGIDLSDSLLGKGPGKRTDFIYF--SGQSFGAVRVKSFKAVYTAQ 434

Query: 489 ENGTMDGYYGQTRSNKVPLL 508
                D + G  RS K P +
Sbjct: 435 -----DTWLGPQRSMKAPAI 449


>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
 gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
           CL02T12C29]
          Length = 461

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 102/306 (33%), Positives = 154/306 (50%), Gaps = 40/306 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F GS +I TPNID LA  G I  + + A  V +PSR+ ++TG+Y    G +   
Sbjct: 42  GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF--FRREYTPLYRGFESHFGYLNG 208
                  G+P  E  LP  L+  GY T  IGKWHLG   F+R   P  +GF++ +G L G
Sbjct: 101 -LDKPGDGIPDDEELLPALLKRYGYRTGCIGKWHLGSKPFQR---PNAKGFDTFYGLLAG 156

Query: 209 VISY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
             SY YD   SD+    +  + NG  +         +   Y TD    +A Q + +   +
Sbjct: 157 HRSYFYDPETSDKDGNLQQYQYNGQKL---------SFDGYFTDELASKARQFVAES--E 205

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
           +P  LY++  A H+ N     EA +E + +F+      R+ YAAM+  LD  VG ++  L
Sbjct: 206 QPFMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDEL 257

Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWS 384
           +  G  +++II F+SDNG  T          N  SN P +G K   +EGG +VP  ++W 
Sbjct: 258 KAVGKFDDTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWG 308

Query: 385 PQIQQN 390
            + +++
Sbjct: 309 DRFKRD 314


>gi|417303299|ref|ZP_12090357.1| arylsulfatase A [Rhodopirellula baltica WH47]
 gi|327540271|gb|EGF26857.1| arylsulfatase A [Rhodopirellula baltica WH47]
          Length = 616

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 99/317 (31%), Positives = 149/317 (47%), Gaps = 39/317 (12%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
           GW DL+ HG+ +I TP +DALA     L+  Y  PVC P+RA+L+TG+YP  +G+ G   
Sbjct: 73  GWGDLAAHGNPKISTPTLDALANKSARLDRFYVSPVCAPTRAALLTGRYPERSGVAGVTG 132

Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
                R   +T   L E  R  GY T   GKWH G  +    P  +GF+  FG+  G  +
Sbjct: 133 RREVMRAEEIT---LAELYRSAGYVTGCFGKWHNG-AQMPLHPNGQGFDEFFGFCGGHFN 188

Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
            YD  L ++    V+ NG                Y TD+ T  AV+ I++   D+P F Y
Sbjct: 189 LYDDALLERNGTPVQTNG----------------YITDVLTDAAVEFIQNHH-DRPFFCY 231

Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYIT-DPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
           +   A H        +  Q+  +++   + D       AMV+ +D +V  ++  L    +
Sbjct: 232 VPFNAPHG-----PFQVRQDLFDRYNDGSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSL 286

Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
            E +I++F++DNG     +           N   RG K ++ EGG +VP  +  P   Q 
Sbjct: 287 DEETIVVFLTDNGPNGKRF-----------NGGMRGTKGSVHEGGCRVPCFIRWPGNIQP 335

Query: 391 PRVSLQMMHISDWLPTL 407
             +S   +HI D LPTL
Sbjct: 336 QTISQVAVHI-DLLPTL 351


>gi|393784512|ref|ZP_10372675.1| hypothetical protein HMPREF1071_03543 [Bacteroides salyersiae
           CL02T12C01]
 gi|392665493|gb|EIY59017.1| hypothetical protein HMPREF1071_03543 [Bacteroides salyersiae
           CL02T12C01]
          Length = 483

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 111/343 (32%), Positives = 166/343 (48%), Gaps = 40/343 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G  +  TPNID LA NG+     Y+   V  PSR+ L+TG +  HT ++G  
Sbjct: 45  GYGDLSCYGQEKFDTPNIDQLAQNGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTAIRGNK 104

Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +P G   +P   R +    ++ GY T A GKW LGF      P  +G +  +GY  
Sbjct: 105 --ELDPEGQFPLPADARTIFHLFKDAGYKTAAFGKWGLGFIGSTGDPNKQGIDYFYGYNC 162

Query: 208 GVI--SYY-DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
            ++  SYY DH+  +   + +EL     + N+       G Y+ DL   +A+  ++ +  
Sbjct: 163 QLLAHSYYPDHLWEN--DQRIEL-----KDNVLEVEYGKGTYSQDLIHSKALSFLDARKQ 215

Query: 265 DKPLFLYLAHLAAHA-----------GNAGKHLEAPQE-------TINQFQYITDPN-RR 305
           ++P FL+   +  HA              GK  E P +          +  Y T  N R 
Sbjct: 216 EEPFFLFYPTIIPHAELIVPEDSIIKKFRGKFPEVPYKGFEPGSPAFRKGGYCTQLNPRA 275

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
           T+AAM+ +LD  VG ++  L+ KGM EN++IIF SDNG P  E     ++ N  SN  +R
Sbjct: 276 TFAAMIYRLDVYVGQLVQKLKEKGMYENTLIIFASDNG-PHQEGGADPDFFN--SNGIWR 332

Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
           G K  L+EGG++VP I+ W   +Q        M    D +PT 
Sbjct: 333 GYKRDLYEGGIRVPMIVSWPGHVQSGTETDF-MCSFWDLMPTF 374


>gi|300771164|ref|ZP_07081040.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
           spiritivorum ATCC 33861]
 gi|300761834|gb|EFK58654.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
           spiritivorum ATCC 33861]
          Length = 473

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 130/431 (30%), Positives = 205/431 (47%), Gaps = 69/431 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DL  +G  +I TPNID LA  G+     YA   VC PSR+SLMTG++  HT ++G  
Sbjct: 40  GYGDLGAYGQKQIQTPNIDRLAAEGMKFEQFYAGASVCAPSRSSLMTGQHTGHTYIRGNR 99

Query: 151 IWGAEPRGV-PLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               EP G  PL +      + L++ GY+T A GKW LG       P  +GF+  +GY  
Sbjct: 100 --EIEPEGQQPLADSVQTFVQLLQKAGYATGAFGKWGLGMVGTTGAPDQKGFDEFYGYNC 157

Query: 208 GVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
              S+     H+  +   + VEL+G+D++  +        +YA  L  ++ ++ I D   
Sbjct: 158 QRQSHRYFPTHLWHN--DQRVELHGNDLKNKV--------QYAPALIQEQTLRFI-DVHK 206

Query: 265 DKPLFLYLAHLAAHAGNAG---KHLE----------------APQETINQFQYITDPNRR 305
           D+P FL++  +  HA  +G   +H +                 P  T+  +  +  P R 
Sbjct: 207 DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFQETPHKGNDYGPGATVTGYASVEKP-RA 265

Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNYRNWGSNYPY 364
           T+A+MV ++D  VG V+  L   G+ EN+I+IF SDNGA    +RE  ++   + S+   
Sbjct: 266 TFASMVSRMDAYVGQVLKKLDDLGLTENTIVIFSSDNGA----HREGGADPVFFNSSGGL 321

Query: 365 RGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
           RG K  L+EGG++ P I+ W  +++ N           D +PTL    G   +R     D
Sbjct: 322 RGYKRDLYEGGIRTPFIVKWPHKVKPNSTSEFAGA-FWDVMPTLLDITGAGKARY---TD 377

Query: 424 GLDQWSSLLLNTPSRRNS-------NIDGLDQ------WSSLLLNTPSRRNSV--LINID 468
           G+    +LL  T  ++         + DG  Q      W ++ L   +++ +V  L N+ 
Sbjct: 378 GVSLVPALLGKTKEQQQHPYLYWEFHEDGGRQAVRAGDWKAVKLQVKNKQKTVTELYNLK 437

Query: 469 ----EKKRTAA 475
               EKK  AA
Sbjct: 438 KDPAEKKNVAA 448


>gi|408528003|emb|CCK26177.1| hypothetical protein BN159_1798 [Streptomyces davawensis JCM 4913]
          Length = 588

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 119/365 (32%), Positives = 175/365 (47%), Gaps = 46/365 (12%)

Query: 77  AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
           A  + S  L  +++     GW +L  +G   I TP +DALA  G+   + YA  PVC PS
Sbjct: 44  AVRSGSGRLNFVVILADDLGWGELGSYGQKLIRTPRLDALAAEGLRFTDAYATAPVCAPS 103

Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLG--F 187
           R SL+TG +  H+ ++  P WG   +G  LT+R     E LR LGY T  IGKW  G   
Sbjct: 104 RCSLLTGLHAGHSTVRENP-WGPGGQGA-LTDRDFTFAEALRSLGYRTGIIGKWGFGPEL 161

Query: 188 FRREYTPLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
             +   P  RGFE  +GY+    +   + DH+  +   + +  N  + R           
Sbjct: 162 PDQPSHPNSRGFEEFYGYIGHRHAHDYFPDHLWHNGTRQEIPANRGEARE---------- 211

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLA----HLAAHAGNAGKHLEAPQETINQFQYIT 300
            YA DL  + A+  + D    +P  LYL     H  +HA + G++ + P           
Sbjct: 212 VYAPDLLKERALAFL-DAHRAEPFLLYLTPTLPHAPSHAPDQGEYADRPWP--------- 261

Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
            P+RR +AA +   D  VG V+  L   G+   ++++  SDNG P  E R  +N   + +
Sbjct: 262 GPDRR-HAAQITTFDSLVGEVVDRLTALGVAGRTLVLVTSDNG-PHEEGR--TNPELFDA 317

Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
           N P RG K  L+EGGV+VP I WSP+       + +   ++D LPTL   AG      P 
Sbjct: 318 NGPLRGYKRNLYEGGVRVPLIAWSPR-HVPVGTTDRPTPLTDLLPTLADLAGAPA---PT 373

Query: 421 NIDGL 425
           ++DGL
Sbjct: 374 DVDGL 378


>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
          Length = 651

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 116/407 (28%), Positives = 174/407 (42%), Gaps = 77/407 (18%)

Query: 104 IPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GPPIWGAEPRGVPL 161
           IPTP +D LA NGII  N + A P C PSRA+LMTG  P   G+Q   P       G+PL
Sbjct: 58  IPTPELDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSRIGVQYNLPNDINTSLGIPL 117

Query: 162 TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY 221
            E + P+ L++  Y T A GKWHLGF + +Y PL RGF+  FG+L G  +Y++    D Y
Sbjct: 118 EETYFPKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYFFGFLGGGKAYFEREYEDLY 177

Query: 222 SR--------TVELNGHDMRRNLSTAWDTVG--EYATDLFT------------------- 252
            R        T E      R+    A D     EY TD+ T                   
Sbjct: 178 YRRLGGSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILTDEAINYIAENKTKSDPFFM 237

Query: 253 ---------------KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
                          +E  Q  E+ P  + L     +L          +E   + +  + 
Sbjct: 238 YVAYNAPHTPLQAPAEEIAQFKEENPNFESLVRNSDYLKESGAYELDKVERELKKVKDYD 297

Query: 298 Y-----------ITD----------PNRRTYAAMVKKLDDSVGTVISALQR-KGMLENSI 335
           Y           + D           +R  YA MV  +D ++G +++ L++ + + +N++
Sbjct: 298 YNSLSSSEKEEKVADYREHLLDKFTESRIVYATMVSNMDKNIGRIVAELKKDQKVFDNTM 357

Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL 395
           I+F+SDNG  T        Y     N+P   +K +++EGG +VP  +  P   ++P+   
Sbjct: 358 IVFLSDNGGYT--------YSKGAVNHPLYALKGSVFEGGHRVPFFIHWPNKIKSPKKYK 409

Query: 396 QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
             +   D  PTL   AG         +DGL     +L N   R N++
Sbjct: 410 YQISSLDLYPTLVDLAGASVPSTK-KLDGLSFMDKVLDNKQIRPNND 455


>gi|323449079|gb|EGB04970.1| hypothetical protein AURANDRAFT_66827 [Aureococcus anophagefferens]
          Length = 918

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 106/357 (29%), Positives = 164/357 (45%), Gaps = 46/357 (12%)

Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI----------WGAEP 156
           P++ ALA +G+ L N Y Q  CTP+RA+LM+GK+   TG     +          WG   
Sbjct: 102 PHMLALAQDGVRLTNYYGQAFCTPARAALMSGKFGHRTGFANVDVDSAATLEISAWGNFS 161

Query: 157 RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHI 216
                 + FL + L   GY    +GKW+LG     + P  RGF+S+ GY    I Y  H 
Sbjct: 162 LANAGDQIFLSQRLATAGYEVHGVGKWNLGHCNAAFLPPSRGFKSYLGYYGAGIGYVSHE 221

Query: 217 LSDQYSRTVELNGH-------DMRR-----NLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
           +    +       H       DM+R      LS A   +GEY+T LFT E+++ I+   V
Sbjct: 222 VEQSGNVENSFTRHFRDYSLVDMQRCTPQGCLSDA-SVIGEYSTILFTNESLRRIDGFSV 280

Query: 265 DKPLFLYLAHLAAHAGNA--GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
           + P +LY+A+   H         L+ P   +   +      RR +   +  +D  VG + 
Sbjct: 281 ENPTYLYVAYHGVHDDKQSNASGLDYPAAALEALE----ERRRNFGIALHAVDHGVGLLQ 336

Query: 323 SALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
             L+++    + +++  SDN G+P   Y ++       SN PYRG+K   +EGG+K+PA 
Sbjct: 337 QRLEKRS--SDYVLVVHSDNGGSPCGTYCDS-------SNAPYRGMKFFDFEGGLKLPAF 387

Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG---DTSRLPLNIDGLDQWSSLLLNT 435
           ++SP    +      +MH  DW  T      G   D S    + D  D WS++  N+
Sbjct: 388 VYSPTRLASRGTYDGLMHHVDWTATFLRGLAGTLLDCS----DCDSRDHWSAISSNS 440


>gi|148978065|ref|ZP_01814612.1| sulfatase, putative [Vibrionales bacterium SWAT-3]
 gi|145962749|gb|EDK28023.1| sulfatase, putative [Vibrionales bacterium SWAT-3]
          Length = 500

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 121/422 (28%), Positives = 182/422 (43%), Gaps = 52/422 (12%)

Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
           TPNID++A  G++L + YAQP CT  R++ +TG+ P+ TGM    + G  P G+      
Sbjct: 54  TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLHADTPT 112

Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
           LPE L+ +GY T   GK HLG  R E+ P   GF+ ++G+L   ++  ++     + +  
Sbjct: 113 LPEILKTMGYVTGQFGKNHLG-DRDEFLPTMHGFDEYWGWLY-HLNAMEYTEDPDWPKDG 170

Query: 226 ELNGHDMRRNLSTAWDTVG---------------EYATDLFTKEAVQLIEDQ-PVDKPLF 269
            L+    R  +    D  G                   D   K A+  IE     DKP F
Sbjct: 171 SLDAFAPRNVIYAKTDGKGGQIIEDDGALTIERMRTLDDEVNKHAINFIERAVEADKPFF 230

Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
            +      H      HL    E +     +          ++K L D VG +++ ++  G
Sbjct: 231 TWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLYDHVGEMMAKMEELG 282

Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
           + +N+IIIF +DNG   + + +       G   PY G K T WEGGV+ PA++  P    
Sbjct: 283 IADNTIIIFTADNGPEIMTWPD-------GGMTPYHGEKGTTWEGGVRAPALVSWPGKIP 335

Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
              V   +    DWLPTL  AAGG        + G D +            +++DG +Q 
Sbjct: 336 AGSVGNGIFDGMDWLPTLVAAAGGPADLKEKMLKGYDGF-----------KAHLDGYNQV 384

Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
             L     S R  +     E+ +  AVR+  WK     Q +    G+ G       PLL 
Sbjct: 385 DMLTEKGESNRKEIYYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEELNAPLL- 437

Query: 510 FN 511
           FN
Sbjct: 438 FN 439


>gi|311746014|ref|ZP_07719799.1| sulfatase family protein [Algoriphagus sp. PR1]
 gi|126576230|gb|EAZ80508.1| sulfatase family protein [Algoriphagus sp. PR1]
          Length = 489

 Score =  144 bits (363), Expect = 1e-31,   Method: Compositional matrix adjust.
 Identities = 114/356 (32%), Positives = 171/356 (48%), Gaps = 54/356 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM---- 146
           GW D+   GS+   TPNI  LA  GI+  N YA   +C+P+RA+L+TGKYP   G+    
Sbjct: 54  GWADIEPLGSDFYETPNITKLAKEGILFTNSYAAAAICSPTRAALLTGKYPARLGITDWI 113

Query: 147 ---------QGPP----IWGAEPRGVPLTERFLP-------EYLRELGYSTKAIGKWHLG 186
                     G P    ++  +P   P  + FLP       E ++  GY T  +GKWHLG
Sbjct: 114 RAKFNQNSTSGLPGEYEVFENKPLKTPKIQGFLPLEEITIAERMKAHGYGTLHVGKWHLG 173

Query: 187 FFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
                + P  +GF+ + G   L    SY+D  L  +     E+     R+         G
Sbjct: 174 --EEGFYPEDQGFDVNIGGNDLGQPPSYFDPYLPAKPREFYEITTLKPRKE--------G 223

Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
           E+ TD    E V  I++Q   K  F++ A  A H    GK      + + +++     N+
Sbjct: 224 EFLTDREGDEVVNYIQNQK-GKKFFVHWAPYAVHTPIMGK-----PDLVEKYEQKEPGNQ 277

Query: 305 R--TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN-WGSN 361
           R   YAA+V+ +D +VG V+S L+R G+ EN+++IF SDNG          NY N   +N
Sbjct: 278 RNPVYAALVESVDQNVGKVLSELERMGLRENTLVIFTSDNGG------LIGNYDNPITNN 331

Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           YP +  K   +EGG+++P I+ W  +I Q   V    +   DW+PT+    G D +
Sbjct: 332 YPLKSQKGYPYEGGIRIPTIVSWPGKIPQG-FVDETPIITMDWIPTILDFMGEDPT 386


>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
 gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
           araneosa HTCC2155]
          Length = 456

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 108/345 (31%), Positives = 174/345 (50%), Gaps = 38/345 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+  L  +G   I TP +D +A  G+ L + YA   VC PSR SLMTG++  HT ++G  
Sbjct: 31  GYGQLGSYGQKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTGQHVGHTYIRGNK 90

Query: 151 IW--GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
            +  G EP  +P     + E ++E GY+T  IGKW LG+   E  P  +GF+  FGY N 
Sbjct: 91  EYPTGQEP--IPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPNKQGFDYFFGY-ND 147

Query: 209 VISYYDH----ILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
               ++H    +L ++ + T++ N G ++            EY+  + T EA   I+   
Sbjct: 148 QKHAHNHFPKFLLRNEETLTLKNNSGKEI------------EYSQYMLTDEAKGFIKKNK 195

Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD---PNRRTYAAMVKKLDDSVGT 320
            D P FLYLA++  H+      L+ P +     QY  +     ++ +A M+ +LD  VG+
Sbjct: 196 -DNPFFLYLAYVIPHS-----RLQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGS 249

Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
           ++  L+   + EN++++F SDNGA          + + G   P  G+K +++EGGV+VP 
Sbjct: 250 ILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFNDSG---PLSGIKRSMYEGGVRVPF 306

Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
           I   P + +  +VS  +    D +PT     G    + P  IDG+
Sbjct: 307 IAHWPGVIKPGQVSNHIGAHWDLMPTACELGG---VQPPEGIDGI 348


>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
 gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
           8797]
          Length = 470

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 121/379 (31%), Positives = 177/379 (46%), Gaps = 51/379 (13%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-A 124
           IT P   T+AA  K   +   +V   GW DL  +GS+   +PNID LA  G+     Y A
Sbjct: 22  ITQP---THAADEKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAAEGMKFTQNYSA 78

Query: 125 QPVCTPSRASLMTGKYPIHTGMQG-PPIWGAEPRGVPLT---------ERF--LPEYLRE 172
              C+P+R +L+TG YP  T +    P W       PL          +++  LPE LR 
Sbjct: 79  CNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRT 138

Query: 173 LGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM 232
            GY T  +GKWHLG   R   P   GF+ +    N  +    H             G  M
Sbjct: 139 AGYQTFHVGKWHLG--GRGNLPQDHGFDVNISGTNRGLPRSYHF---------PYGGDAM 187

Query: 233 RRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
           + + S T  +    Y TD    EAV LI  Q  DKP FLY +  + H+   G+      +
Sbjct: 188 KWDSSLTEAERQDRYLTDRMADEAVALIRQQQ-DKPFFLYCSFYSVHSPIQGR-----PD 241

Query: 292 TINQFQYITDPNRRT---YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
            + +++ +    R     YAAM++ +D+++G V + L+  G+ + ++I+F SDNG     
Sbjct: 242 LVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNGGVR-- 299

Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
            R+TSN      N P RG K   WEGG +VPAI+  P +     V  + +   D+ PT+ 
Sbjct: 300 -RKTSN------NDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTIL 352

Query: 409 --TAAGGDTSRLPLNIDGL 425
             T   G+T     ++DGL
Sbjct: 353 NITGVAGNTEH-NQSVDGL 370


>gi|265752714|ref|ZP_06088283.1| sulfatase [Bacteroides sp. 3_1_33FAA]
 gi|263235900|gb|EEZ21395.1| sulfatase [Bacteroides sp. 3_1_33FAA]
          Length = 461

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 141/467 (30%), Positives = 206/467 (44%), Gaps = 71/467 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA-SLMTGKYPIHTGMQ--G 148
           GW DL   GS+   TPNID LA  GI+ +N YA    +      +MTG Y    G+    
Sbjct: 31  GWTDLGVMGSDYYETPNIDRLATEGILFDNAYAAAANSAPSRACMMTGMYTPRHGVYTVS 90

Query: 149 PPIWGAEPR----GVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           PP  G   +     +P  E        + E L+E GY    IGKWHLG       PL +G
Sbjct: 91  PPDRGDRTKRKYIAIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQG 150

Query: 199 FESHFGYLNGVISY---YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F  + G       Y   Y + L D+    V L               +GEY TD  T+EA
Sbjct: 151 FIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLEE-----------GILGEYLTDRLTEEA 199

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMV 311
           V  I+    + P FL+L+H A H       L+AP   IN+++  T P +      YAAM+
Sbjct: 200 VSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYRNKT-PGKYHKNPIYAAMI 252

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKN 369
           +KLDDSVG +   ++  G+ + +I+IF SDNG   P  +            NYP  G K 
Sbjct: 253 EKLDDSVGRICQVIKTLGIADRTIVIFYSDNGGSEPVTD------------NYPLNGGKG 300

Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
             +EGG +VP I+ W+ +I+   R S+ +  + D+ PT  T A G   ++P N+DG D +
Sbjct: 301 MPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFVTLAQG---KIPANLDGKDIF 356

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLG 486
            +L+ N  + R+           L  + P+   S L    +   K  +++R   WKL+  
Sbjct: 357 -TLINNNETERD-----------LFWHFPAYLESYLNGGRDFRAKPYSSIRSGDWKLIYH 404

Query: 487 TQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
            ++                 L   N +   + YQ L +  Q    PI
Sbjct: 405 YEDKSMELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPI 451



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 6/55 (10%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
           +GEY TD  T+EAV  I+    + P FL+L+H A H       L+AP   IN+++
Sbjct: 187 LGEYLTDRLTEEAVSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYR 235


>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
 gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
          Length = 453

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 108/335 (32%), Positives = 167/335 (49%), Gaps = 46/335 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DLS +G+ +I TP+ID+L   G+  ++ YA   VC+PSRA+L++G+YP   G+ G  
Sbjct: 40  GYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLSGRYPEQVGVPGVI 99

Query: 149 ----PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
                  WG   P  V      LP  L++ GY T  +GKWHLG       P  RGF+   
Sbjct: 100 RTMPDDNWGYLSPSAV-----LLPSILKKNGYYTALVGKWHLGL-EPPNLPNDRGFDLFH 153

Query: 204 GYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
           G+   ++  YY H+  D+         + MR N  T       +ATDLFT+ A   +E +
Sbjct: 154 GFEGDMMDDYYTHLRHDR---------NYMRLNRQTI--NPQGHATDLFTQWATDYLEQR 202

Query: 263 P-VDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
                P FLYLA+ A H       ++ P +    +   Q      R     +++ +DD +
Sbjct: 203 AGQSNPFFLYLAYNAPH-----DPIQPPADWLAKVKARQPGISEKRAKLVGLIEHMDDGI 257

Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
           G VI  L+ KG+ EN++I+F+SDNG    +           +N P R  K  ++EGG+++
Sbjct: 258 GKVIQTLRAKGLYENTLIVFVSDNGGKLFD---------GATNGPLRSGKGHMYEGGIRI 308

Query: 379 PA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
           PA ++W  ++    + S Q + + D  PTL  A G
Sbjct: 309 PACVVWPGKVAAQSQ-SQQPLLLMDIFPTLAEATG 342


>gi|149198283|ref|ZP_01875329.1| N-acetylgalactosamine-6-sulfate sulfatase [Lentisphaera araneosa
           HTCC2155]
 gi|149138579|gb|EDM26986.1| N-acetylgalactosamine-6-sulfate sulfatase [Lentisphaera araneosa
           HTCC2155]
          Length = 443

 Score =  144 bits (363), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 120/429 (27%), Positives = 195/429 (45%), Gaps = 68/429 (15%)

Query: 75  TYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSR 132
           T  A  K   + ++I  +G+ D   +G+ +I TP I+ LA +G+   N YA  PVC+P+R
Sbjct: 13  TLVAQDKPNIVFIIIDDFGYADSEPYGAKDIKTPGINELAKDGLKFTNFYANAPVCSPTR 72

Query: 133 ASLMTGKYPIHTGMQGPPIWGA--------------EPRGVPL--TERFLPEYLRELGYS 176
            + +TG++   +G +    +G               +  G+ L   +  LP+ L++ GY 
Sbjct: 73  CAFITGRWQQRSGFEWALGYGGTNSQLKNGQYEAVTDIHGIGLLPEKNHLPKLLKKAGYK 132

Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL 236
           T A GKWHLG  + ++ P++ GF+ ++G L G      H     Y    +     +R   
Sbjct: 133 TGAFGKWHLGS-QDKFNPIHHGFDEYYGPLLG------HCDYYTYKYYDDTY--TLREGA 183

Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
               D+   Y T    + AV  I D+  DKP F+Y+ H+A H+       +  Q T    
Sbjct: 184 KVIKDS--GYLTTNINERAVDFI-DRHADKPFFMYVPHMAVHSPYQSADKKPKQITKTN- 239

Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
             + D NR  YAAMV+++D  V  +I+ L+ K +   ++ +  SDNG             
Sbjct: 240 --LNDGNRADYAAMVEEVDKGVEMIIAKLKEKKIFHKTLFVVSSDNGGA----------- 286

Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
           ++  N P    K TL+EGG++VP I+  P+      VS Q+    D   T    AG D  
Sbjct: 287 HFSDNAPLFHRKTTLFEGGIRVPCIMHWPEKIGKGVVSDQIAITMDLSKTFLALAGIDEP 346

Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAV 476
               + DG++     LL   + +N+ ++    W S   N+ +RR              AV
Sbjct: 347 ----SYDGIN-----LLPMMTDKNNKVERTLFWRS---NSKARRQK------------AV 382

Query: 477 RLDSWKLVL 485
           R+  WK +L
Sbjct: 383 RMGKWKYIL 391


>gi|189230138|ref|NP_001121393.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Xenopus (Silurana)
           tropicalis]
 gi|158253505|gb|AAI54124.1| Galns protein [Danio rerio]
 gi|183986312|gb|AAI66093.1| LOC100158481 protein [Xenopus (Silurana) tropicalis]
          Length = 488

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 115/367 (31%), Positives = 164/367 (44%), Gaps = 57/367 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           GW DL   G     TP +D +A  G++  N Y A P+C+PSRA+L+TG+ P+  G     
Sbjct: 7   GWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRNGFY--- 63

Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
              A  R          G+   E  LPE+L+   Y +K +GKWHLG  R +Y PL  GF+
Sbjct: 64  TTNAHARNAYTPQEIVGGISADEILLPEHLKNKHYISKIVGKWHLGH-RTQYLPLKHGFD 122

Query: 201 -------SHFGYLN-----GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
                   HFG  N      +  Y +  +  +Y    E+N      NL           T
Sbjct: 123 EWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNL-----------T 171

Query: 249 DLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
            L+ KE +  I  Q +  +P FLY A  A HA                  ++    R  Y
Sbjct: 172 QLYLKEGLDFISQQAMAQRPFFLYWAPDATHA-----------PVYASKPFLGKSQRGRY 220

Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
              V +LDDS+G +++ L   G+  ++++ F SDNGA  +     S     GSN P+   
Sbjct: 221 GDAVMELDDSIGQILAHLVSLGIQNDTLVFFTSDNGAALM-----SGPLQSGSNGPFLCG 275

Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
           K T +EGG++ PAI W P       VS Q+  + D   T  + AG       + IDG+D 
Sbjct: 276 KETTFEGGMREPAIAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVSPPDDRV-IDGVD- 333

Query: 428 WSSLLLN 434
            S +L N
Sbjct: 334 LSPVLFN 340


>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
          Length = 441

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 122/435 (28%), Positives = 194/435 (44%), Gaps = 64/435 (14%)

Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH-IL 217
           + L   F+ +YL+EL Y+T A GKWHLG+  +E TP YRGF++  G  +G   YY+H   
Sbjct: 1   MDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTTF 60

Query: 218 SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAA 277
              Y      NG ++ R+ S      G ++ D+   + +  + D+    PLF Y A    
Sbjct: 61  QGMYDWH---NGTELDRSAS------GIHSQDMIEADLLNQL-DKNDGTPLFYYAAFHNT 110

Query: 278 HAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
           H+    K    P+           P RR Y  +V  +D  +G V++ L+ K M +N+ I 
Sbjct: 111 HSPMHPK----PEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVAKLKEKDMFDNTYIF 166

Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQM 397
           F SDNG    E          G N P RG K++L++GG    +   SP +++    +  +
Sbjct: 167 FSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGL 216

Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP 457
           +HI+DWLPT+   AGG+   +P  IDG++Q   +    PS+R   I  +D    L   TP
Sbjct: 217 LHITDWLPTIVKLAGGE---VPDGIDGIEQVDLITNAGPSKRTDMIYNIDM--ELETGTP 271

Query: 458 SRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESK 517
                          T A R   +KL+ G +  G  DGY           ++   I E  
Sbjct: 272 DF------------GTMAARDLRYKLIWGFE--GRSDGYG----------MDAQFIYE-- 305

Query: 518 TYQSLQQLSQNIFLPISNIDKM-----RSTRQQATI--HCGANPAPMTPSPCTNGPCYLF 570
               + Q+ +     + ++DK       +T+Q   +  +       +       GP  LF
Sbjct: 306 -VDRVIQVVKEADAAVEDVDKRGGPYPMNTQQHHILRTYLQTKLVTLDDIEAGTGPMMLF 364

Query: 571 NLGNDPCEQNNIASS 585
           +L  DP E  ++++S
Sbjct: 365 DLQEDPNETTDLSAS 379


>gi|410631133|ref|ZP_11341814.1| arylsulfatase B [Glaciecola arctica BSs20135]
 gi|410149335|dbj|GAC18681.1| arylsulfatase B [Glaciecola arctica BSs20135]
          Length = 454

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 105/322 (32%), Positives = 153/322 (47%), Gaps = 44/322 (13%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +GS +I TP ID LA  G      YA   +C+PSRA LMTG+YPI T +    
Sbjct: 48  GFGDLSINGSGKIKTPAIDQLARTGFNFTQFYASANICSPSRAGLMTGRYPIRTSLGYNV 107

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
           I   +  G+P TE  L E      Y T  IGKWHLG F  E+ P+  GFE+ +G  + N 
Sbjct: 108 ITAGDTHGLPKTEDTLAELALRADYRTMLIGKWHLGRF-PEHAPMKHGFETFYGVPHSND 166

Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
           + ++  +   +   + VE +   +R                 +T+++V  I D     P 
Sbjct: 167 MANFALYSGDNIVEQPVEQSTLTLR-----------------YTEKSVSFI-DANAKHPF 208

Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
            L+++H   H          P      F   +D     Y  +V++LD S G +++AL+R 
Sbjct: 209 LLFVSHTMPH---------IPLYASEDFAGKSDAG--VYGDVVEELDYSTGKIVAALKRN 257

Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
           G+ +NSIII  SDNG P  E          G+    +G K + +EG  +VP I+  P   
Sbjct: 258 GIFDNSIIIITSDNG-PFFE----------GATAGLKGGKGSTYEGAYRVPLIISWPAGI 306

Query: 389 QNPRVSLQMMHISDWLPTLYTA 410
            NP V+  +    D LPT+  A
Sbjct: 307 ANPAVTDAIAMNIDILPTIAEA 328


>gi|410096843|ref|ZP_11291828.1| hypothetical protein HMPREF1076_01006 [Parabacteroides goldsteinii
           CL02T12C30]
 gi|409225460|gb|EKN18379.1| hypothetical protein HMPREF1076_01006 [Parabacteroides goldsteinii
           CL02T12C30]
          Length = 488

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 125/425 (29%), Positives = 191/425 (44%), Gaps = 72/425 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ DLS +G  +  TPNID LA NG+     Y+   V  PSR+ L+TG +  HT ++G  
Sbjct: 48  GYGDLSCYGQEKFETPNIDQLAQNGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTPVRGNK 107

Query: 151 IWGAEPRG-VPLTE--RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
               +P G  PL E  R +   +++ GY T A GKW LG+      P  +G E+ +GY  
Sbjct: 108 --ELDPEGQFPLPEGTRTIFHVMQDAGYKTSAFGKWGLGYIGTSGDPKNQGCETFYGYNC 165

Query: 208 GVI--SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
            ++  SYY   L D   R       +++ N+       G Y+ DL   +A+  +++   D
Sbjct: 166 QLLAHSYYPDHLWDNDQRV------ELKDNVLEVQYGKGTYSQDLIHSKALDYLDNLKED 219

Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYI--------TDPN------------- 303
           +P F++   +  HA      L  P+++I  +F+ +        T+P              
Sbjct: 220 EPFFMWYPTILPHA-----ELIVPEDSIIKKFRGMYPEKAYKGTEPGSPAFRKGGYCTQL 274

Query: 304 --RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
               T+AAM+ +LD  VG ++  L+ KG+ +N+IIIF SDNG P +E     ++ N  SN
Sbjct: 275 HPHATFAAMIYRLDVYVGQIVQKLKDKGLYDNTIIIFASDNG-PHMEGGADPDFFN--SN 331

Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
             YRG K  L+EGG++VP I+ W   IQ     +  M    D LPT              
Sbjct: 332 GMYRGYKRDLYEGGIRVPMIISWPGHIQPGTETNF-MCSFWDVLPTFEE----------- 379

Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL-INIDEKKRTAAVRLD 479
                       + +P  +   +DG+     L      + +  L     E     AVR  
Sbjct: 380 ------------IISPKAKQKEMDGVSMLPLLQNRKGQKEHEFLYFEFQEMNGRQAVRQG 427

Query: 480 SWKLV 484
            WKLV
Sbjct: 428 PWKLV 432


>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
 gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
          Length = 470

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 111/364 (30%), Positives = 176/364 (48%), Gaps = 42/364 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+NDL  +GS  I TPN+D LA  G    + Y A  VC+PSRA+L+TG YP   G+    
Sbjct: 39  GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 98

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
           ++     G+   E  + ++L+  GY+T  +GKWHLG   +E  P   GF+S++G     I
Sbjct: 99  LFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 152

Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
            Y + +      R  +++  D   + S+A   W+T       + E   D       +T  
Sbjct: 153 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDR 212

Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
           A++ +E    DKP FLYL H   H       L  P++      Y  DP +  Y  +++ +
Sbjct: 213 AIEFVEAN-QDKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHI 260

Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
           D  VG ++  ++  G+ + ++I++ SDNG P ++++      + GS  P R  K T +EG
Sbjct: 261 DTEVGRLVQTVRDLGLSKKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 314

Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
           G +VP I+W+P        S       D LPT+ +  G         IDG+D  S+   +
Sbjct: 315 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGVALEN-DRKIDGVDLTSTFTSD 373

Query: 435 TPSR 438
             SR
Sbjct: 374 ESSR 377


>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
           43184]
 gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
          Length = 464

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 36/308 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F G+ +I TPNID LA  G I  + + A  V +PSR+ ++TG+Y    G +   
Sbjct: 45  GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 103

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+P  E  LP  L+   Y T  IGKWHLG       P  +GF++ +G L G  
Sbjct: 104 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 161

Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           SY YD   SD+    +  + NG    R LS        Y TD    +A Q + +   ++P
Sbjct: 162 SYFYDPETSDKDGNLQQYQYNG----RKLS-----FDGYFTDELASKAQQFVTES--EQP 210

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             LY++  A H+ N     EA +E + +F+      R+ YAAM+  LD  VG ++  L+ 
Sbjct: 211 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 262

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +N+II F+SDNG  T          N  SN P +G K   +EGG +VP  ++W  +
Sbjct: 263 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 313

Query: 387 IQQNPRVS 394
            +++ R +
Sbjct: 314 FKRDQRFT 321


>gi|392391665|ref|YP_006428268.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
 gi|390522743|gb|AFL98474.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
           15997]
          Length = 510

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 119/442 (26%), Positives = 200/442 (45%), Gaps = 72/442 (16%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
           G+ DL+ +G   + TPNID LA  G++    YA  PVC PSRA++MTG++  HT ++G  
Sbjct: 38  GYGDLACYGQKHVETPNIDKLAKQGMMFTQAYAGAPVCAPSRATIMTGQHTGHTHVRGNK 97

Query: 149 ------PPIW----------GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
                 P I           G EP      +  LPE +++ GY+T   GKW  G+     
Sbjct: 98  EYWQKSPKIMYGQNEEYSLVGQEPYDA--NQVILPEVMKKNGYTTGMFGKWAGGYEGSHS 155

Query: 193 TPLYRGFESHFGYLNG--VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE----- 245
            P  RG +  +GY+      SYY + L+    +  +     +  + +  +   GE     
Sbjct: 156 VPEKRGVDEFYGYICQFQAHSYYPNFLNRYSQKEGDTQTIRVVMDENVKYPMTGEDYYKR 215

Query: 246 --YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-----NQF-- 296
             Y+ D+   +A++ +++Q   KP + +  +   HA      L  P +++      QF  
Sbjct: 216 PQYSADMIHHKALEWLDEQNGKKPFYGFFTYTLPHA-----ELMQPNDSLLKKYKKQFFE 270

Query: 297 -----------QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
                       + T      +AAM+ +LD  VG ++  L+ KG+ +N+I++F SDNG P
Sbjct: 271 DKTWGGSDGSRYHATTHTHAQFAAMITRLDSYVGEIMEKLKEKGLDKNTIVVFSSDNG-P 329

Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
             E      +  +G +   RG+K   +EGG++VP I+  P   +   VS   +   D +P
Sbjct: 330 HEEGGADPEF--FGRDGKLRGIKRQCYEGGIRVPFIVKWPGKIKPGTVSDHQIAFYDIMP 387

Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRR--NSV 463
           T              +I G  ++     N  S +NS+IDG+    +LL +   ++  + +
Sbjct: 388 TF------------ADIAGDKKFPKRYFNKKS-QNSSIDGISFLPTLLGDDQKQKAHDHL 434

Query: 464 LINIDEKKRTAAVRLDSWKLVL 485
                E  +  AVR   WKL++
Sbjct: 435 YWEFHETNQI-AVRKGDWKLIV 455


>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
 gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
           CL09T00C40]
          Length = 461

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 36/308 (11%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
           G+ D  F G+ +I TPNID LA  G I  + + A  V +PSR+ ++TG+Y    G +   
Sbjct: 42  GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100

Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
                  G+P  E  LP  L+   Y T  IGKWHLG       P  +GF++ +G L G  
Sbjct: 101 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158

Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
           SY YD   SD+    +  + NG    R LS        Y TD    +A Q + +   ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNG----RKLS-----FDGYFTDELASKAQQFVTES--EQP 207

Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
             LY++  A H+ N     EA +E + +F+      R+ YAAM+  LD  VG ++  L+ 
Sbjct: 208 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259

Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
            G  +N+II F+SDNG  T          N  SN P +G K   +EGG +VP  ++W  +
Sbjct: 260 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310

Query: 387 IQQNPRVS 394
            +++ R +
Sbjct: 311 FKRDQRFT 318


>gi|408672294|ref|YP_006872042.1| sulfatase [Emticicia oligotrophica DSM 17448]
 gi|387853918|gb|AFK02015.1| sulfatase [Emticicia oligotrophica DSM 17448]
          Length = 478

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 135/445 (30%), Positives = 203/445 (45%), Gaps = 67/445 (15%)

Query: 75  TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRA 133
           T   L+ +  L  +   G+ DLS +G+ +  TPNID LA  G+   N + AQ VC+ SRA
Sbjct: 18  TKKELSPNIILIFMDDMGYGDLSCYGAIDYRTPNIDKLASEGVRFTNFLAAQAVCSASRA 77

Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
           +L+TG YP   G+ G  +      G+   E  L E L++ GY+T   GKWHLG   + + 
Sbjct: 78  ALLTGCYPNRIGISGA-LMPTSKNGINPEETTLAEMLKQKGYATGIFGKWHLGH-HKTFL 135

Query: 194 PLYRGFESHFG--YLNGV-ISYYD--------HILSDQYSRTVELNGHDMRRNLSTAWDT 242
           PL  GF+ + G  Y N +   YYD        H+    Y     +  +D +  + T  D 
Sbjct: 136 PLQHGFDEYVGLPYSNDMWPVYYDGKPATSTQHM--SAYPVLPLIQNNDKKEEIKTLDDQ 193

Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
                T L+T++AV+ I+     KP FLYL H   H          P    ++F+     
Sbjct: 194 AK--LTTLYTEKAVEFIKKNN-KKPFFLYLPHSMPHV---------PIAVSDKFK--GKS 239

Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
            +  YA ++ ++D SVG ++  L+  G+ +N+++IF SDNG P + +   +     GS+ 
Sbjct: 240 KQGLYADLMMEIDWSVGEIMKTLKENGLDKNTLVIFTSDNG-PWLNFGNHA-----GSSG 293

Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN- 421
            +R  K T +EGG +VP I+          V  Q+    D LPT+    G +   LP N 
Sbjct: 294 GFREGKGTSFEGGQRVPCIMRWKGTTPEGVVCNQLASTIDVLPTIARITGAE---LPKNK 350

Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
           IDG++  S +  +           L Q          RRNS+           AVR+  W
Sbjct: 351 IDGVNILSLIKGD-----------LSQSPRKTFYYYYRRNSL----------EAVRMGHW 389

Query: 482 KLVLGTQENGTM------DGYYGQT 500
           KLV   +    M      DGY G+ 
Sbjct: 390 KLVFEHKGRSYMNQLPGNDGYAGKA 414


>gi|223936271|ref|ZP_03628184.1| sulfatase [bacterium Ellin514]
 gi|223895133|gb|EEF61581.1| sulfatase [bacterium Ellin514]
          Length = 488

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 132/427 (30%), Positives = 196/427 (45%), Gaps = 63/427 (14%)

Query: 85  LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
           L L    G+ DL  +G  +I TPNID LA +G+   + YA   VC PSRA+LMTGK   H
Sbjct: 47  LILADDLGYGDLGCYGQTQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMTGKNTGH 106

Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
             ++G         G  LT   + + L+  GY+T  IGKW LG       P  +GF+ + 
Sbjct: 107 VNIRGNADLSLN--GEELT---IAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEYL 161

Query: 204 GYLNGVIS---YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
           GYL+ V +   Y  H+    S      + L  +D         D  G Y+ D FT+ A+ 
Sbjct: 162 GYLDQVQAHDYYPTHLFRSDSKGEESKIALTENDA--------DHKGLYSNDFFTQSALN 213

Query: 258 LIEDQPVDKP--------LFLYLAHLAAHAGN-----AGKHLEAPQE---TINQFQYITD 301
            +    ++KP         FLYL +   HA N      G  +E P     T  Q+  +  
Sbjct: 214 YLR---INKPSKLNKHRSFFLYLPYTLPHANNELGNRTGNGMEVPSTEPYTNEQWPQV-- 268

Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
              +  AAM+ +LD  VG ++  L++  + EN+++IF SDNG P  E      Y N  S 
Sbjct: 269 --EKNKAAMITRLDHYVGEIMDYLKKSKLDENTVVIFASDNG-PHKEGGVNPKYFN--SA 323

Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
              RG+K  L+EGG++VP I+  P   +   +S   +   D+LP   TAA    +  P N
Sbjct: 324 GGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLAFWDFLP---TAAEIARTSSPTN 380

Query: 422 IDGLDQWSSLLLNTPSRRNSNI------DGLDQ------WSSLL--LNTPSRRNSVLINI 467
           IDG+    +LL    + R+  +       G DQ      W ++   +N P    ++  ++
Sbjct: 381 IDGISFLPTLLGKAQTNRHQYLYWEFHEQGFDQAVRMGDWKAVRHGINGPIELYNLKTDV 440

Query: 468 DEKKRTA 474
            EK   A
Sbjct: 441 SEKDNVA 447


>gi|317479852|ref|ZP_07938971.1| sulfatase [Bacteroides sp. 4_1_36]
 gi|316903981|gb|EFV25816.1| sulfatase [Bacteroides sp. 4_1_36]
          Length = 541

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 126/397 (31%), Positives = 175/397 (44%), Gaps = 88/397 (22%)

Query: 68  ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ 125
           + D   RT     +   + L++    G++D+  +G  EIPTPNID LA  G+     Y  
Sbjct: 22  MADETIRTSKETDERPNIVLIMADDMGYSDVGCYGG-EIPTPNIDRLAQKGVRYTQFYNS 80

Query: 126 PVCTPSRASLMTGKYPIHTGM----QGPPIWGAEP----RGVPLTERFL-------PEYL 170
               P+RASL+TG YP   G+    + P I   E     RGV     FL        E L
Sbjct: 81  GRSCPTRASLLTGLYPQQAGIGAMSEDPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVL 140

Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
           +E GY T   GKWH+G   +E  PL RGFE  +G L G  SY    L  Q  R + L+  
Sbjct: 141 KEAGYHTYMTGKWHVGMHGKEKWPLQRGFEHFYGILAGASSY----LKPQGGRGLTLDNT 196

Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH------------ 278
           ++    S        Y TD FT  A++ I++Q  D P FLYLA+ A H            
Sbjct: 197 NLPAPQSPY------YTTDAFTDYAIRFIDEQTDDNPFFLYLAYNAPHWPLHAKDKDLEK 250

Query: 279 -AGNAGKHLEAPQE----------TIN-----------QFQYITDPNRRT-------YAA 309
            AG   K  EA +E           +N            +  +T+  R +       YAA
Sbjct: 251 FAGKYDKGWEAVREERYRKMVELGIVNPDWKLAEWESRAWNELTEAERDSSSLRMSVYAA 310

Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-----TSNYRN------- 357
            V  +D ++G +I  L+ +G L+N++I+F+SDNGA    Y E     TS           
Sbjct: 311 QVYCMDYNIGKLIDYLEEEGKLDNTLILFLSDNGACAEPYSEKGFGSTSEINKLDSWVHP 370

Query: 358 -----WG--SNYPYRGVKNTLWEGGVKVPAILWSPQI 387
                W   SN PYR  K   +EGG+  P I+  P++
Sbjct: 371 SYGLPWAQVSNTPYRKYKVRAYEGGIATPLIISWPEV 407



 Score = 39.3 bits (90), Expect = 5.9,   Method: Compositional matrix adjust.
 Identities = 17/33 (51%), Positives = 22/33 (66%)

Query: 15  YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
           Y TD FT  A++ I++Q  D P FLYLA+ A H
Sbjct: 206 YTTDAFTDYAIRFIDEQTDDNPFFLYLAYNAPH 238


>gi|150003728|ref|YP_001298472.1| sulfatase [Bacteroides vulgatus ATCC 8482]
 gi|212692493|ref|ZP_03300621.1| hypothetical protein BACDOR_01989 [Bacteroides dorei DSM 17855]
 gi|237709187|ref|ZP_04539668.1| sulfatase [Bacteroides sp. 9_1_42FAA]
 gi|345514223|ref|ZP_08793736.1| sulfatase [Bacteroides dorei 5_1_36/D4]
 gi|423230607|ref|ZP_17217011.1| hypothetical protein HMPREF1063_02831 [Bacteroides dorei
           CL02T00C15]
 gi|423240766|ref|ZP_17221880.1| hypothetical protein HMPREF1065_02503 [Bacteroides dorei
           CL03T12C01]
 gi|423244316|ref|ZP_17225391.1| hypothetical protein HMPREF1064_01597 [Bacteroides dorei
           CL02T12C06]
 gi|149932152|gb|ABR38850.1| sulfatase [Bacteroides vulgatus ATCC 8482]
 gi|212664968|gb|EEB25540.1| arylsulfatase [Bacteroides dorei DSM 17855]
 gi|229456883|gb|EEO62604.1| sulfatase [Bacteroides sp. 9_1_42FAA]
 gi|345456018|gb|EEO47281.2| sulfatase [Bacteroides dorei 5_1_36/D4]
 gi|392630751|gb|EIY24737.1| hypothetical protein HMPREF1063_02831 [Bacteroides dorei
           CL02T00C15]
 gi|392642497|gb|EIY36263.1| hypothetical protein HMPREF1064_01597 [Bacteroides dorei
           CL02T12C06]
 gi|392643728|gb|EIY37477.1| hypothetical protein HMPREF1065_02503 [Bacteroides dorei
           CL03T12C01]
          Length = 473

 Score =  144 bits (362), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 141/467 (30%), Positives = 206/467 (44%), Gaps = 71/467 (15%)

Query: 92  GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA-SLMTGKYPIHTGMQ--G 148
           GW DL   GS+   TPNID LA  GI+ +N YA    +      +MTG Y    G+    
Sbjct: 43  GWTDLGVMGSDYYETPNIDRLATEGILFDNAYAAAANSAPSRACMMTGMYTPRHGVYTVS 102

Query: 149 PPIWGAEPR----GVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
           PP  G   +     +P  E        + E L+E GY    IGKWHLG       PL +G
Sbjct: 103 PPDRGDRTKRKYIAIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQG 162

Query: 199 FESHFGYLNGVISY---YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           F  + G       Y   Y + L D+    V L               +GEY TD  T+EA
Sbjct: 163 FIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLEE-----------GILGEYLTDRLTEEA 211

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMV 311
           V  I+    + P FL+L+H A H       L+AP   IN+++  T P +      YAAM+
Sbjct: 212 VSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYRNKT-PGKYHKNPIYAAMI 264

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKN 369
           +KLDDSVG +   ++  G+ + +I+IF SDNG   P  +            NYP  G K 
Sbjct: 265 EKLDDSVGRICQVIKTLGIADRTIVIFYSDNGGSEPVTD------------NYPLNGGKG 312

Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
             +EGG +VP I+ W+ +I+   R S+ +  + D+ PT  T A G   ++P N+DG D +
Sbjct: 313 MPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFVTLAQG---KIPANLDGKDIF 368

Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLG 486
            +L+ N  + R+           L  + P+   S L    +   K  +++R   WKL+  
Sbjct: 369 -TLINNNETERD-----------LFWHFPAYLESYLNGGRDFRAKPYSSIRSGDWKLIYH 416

Query: 487 TQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
            ++                 L   N +   + YQ L +  Q    PI
Sbjct: 417 YEDKSMELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPI 463



 Score = 40.8 bits (94), Expect = 2.1,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 6/55 (10%)

Query: 12  VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
           +GEY TD  T+EAV  I+    + P FL+L+H A H       L+AP   IN+++
Sbjct: 199 LGEYLTDRLTEEAVSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYR 247


>gi|392965484|ref|ZP_10330903.1| sulfatase [Fibrisoma limi BUZ 3]
 gi|387844548|emb|CCH52949.1| sulfatase [Fibrisoma limi BUZ 3]
          Length = 477

 Score =  143 bits (361), Expect = 2e-31,   Method: Compositional matrix adjust.
 Identities = 127/419 (30%), Positives = 191/419 (45%), Gaps = 55/419 (13%)

Query: 92  GWNDLSFH-----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTG 145
           G+ DL  +     G   I TPN+D +A  G+     YA   VC PSR++LMTG +  HT 
Sbjct: 48  GYGDLGCYRNTGSGQQTIRTPNLDRMAREGMRFTQHYAGSTVCAPSRSTLMTGLHTGHTR 107

Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
           ++G      +P      ++ + EYL+  GY+T  IGKW LG       P  +GF+  +GY
Sbjct: 108 IRGNARVPLQP-----DDKTVAEYLKGAGYTTALIGKWGLGEDGSTGIPRQQGFDYFYGY 162

Query: 206 LNGVIS--YYDHILSDQYSRT--------VELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
           LN   +  +Y   L    ++         VE  G    + + +A     E+  DLFT EA
Sbjct: 163 LNQTHAHNHYPDFLWRNETKVKLPNEVIYVENGG---VKGIGSAATKRVEWTQDLFTNEA 219

Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMV 311
           +Q IE Q   +P FLYL++++ HA N  K + A    +  +    +      ++ +AA +
Sbjct: 220 LQFIE-QSKQQPFFLYLSYVSPHANNEHKLVAAHGIEVPDYGPYANEEFPDVQKAFAASI 278

Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
             LD  VG V++ L    + +N+++IF SDNG P  E      Y N  SN   RG+K  L
Sbjct: 279 TYLDAHVGKVLAKLHELKLDDNTLVIFSSDNG-PHAEGGNDPAYFN--SNGGLRGIKRDL 335

Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
           +EGG++VP +   P   +   +S       D+LPT    AG      P  +DGL    +L
Sbjct: 336 YEGGIRVPMLARWPGRIKAGTISNHPSAFWDFLPTACEVAG---VPQPAGLDGLSYLPAL 392

Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
           L                       T  +   +     E+ R  AVR  S+KLV   ++N
Sbjct: 393 LGK--------------------GTQPQHPYLYWEFHEQGRKQAVRQGSYKLVYLLKDN 431



 Score = 41.6 bits (96), Expect = 1.5,   Method: Compositional matrix adjust.
 Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 1/55 (1%)

Query: 3   RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 57
           + + +A     E+  DLFT EA+Q IE Q   +P FLYL++++ HA N  K + A
Sbjct: 198 KGIGSAATKRVEWTQDLFTNEALQFIE-QSKQQPFFLYLSYVSPHANNEHKLVAA 251


  Database: nr
    Posted date:  Mar 3, 2013 10:45 PM
  Number of letters in database: 999,999,864
  Number of sequences in database:  2,912,245
  
  Database: /local_scratch/syshi//blastdatabase/nr.01
    Posted date:  Mar 3, 2013 10:52 PM
  Number of letters in database: 999,999,666
  Number of sequences in database:  2,912,720
  
  Database: /local_scratch/syshi//blastdatabase/nr.02
    Posted date:  Mar 3, 2013 10:58 PM
  Number of letters in database: 999,999,938
  Number of sequences in database:  3,014,250
  
  Database: /local_scratch/syshi//blastdatabase/nr.03
    Posted date:  Mar 3, 2013 11:03 PM
  Number of letters in database: 999,999,780
  Number of sequences in database:  2,805,020
  
  Database: /local_scratch/syshi//blastdatabase/nr.04
    Posted date:  Mar 3, 2013 11:08 PM
  Number of letters in database: 999,999,551
  Number of sequences in database:  2,816,253
  
  Database: /local_scratch/syshi//blastdatabase/nr.05
    Posted date:  Mar 3, 2013 11:13 PM
  Number of letters in database: 999,999,897
  Number of sequences in database:  2,981,387
  
  Database: /local_scratch/syshi//blastdatabase/nr.06
    Posted date:  Mar 3, 2013 11:18 PM
  Number of letters in database: 999,999,649
  Number of sequences in database:  2,911,476
  
  Database: /local_scratch/syshi//blastdatabase/nr.07
    Posted date:  Mar 3, 2013 11:24 PM
  Number of letters in database: 999,999,452
  Number of sequences in database:  2,920,260
  
  Database: /local_scratch/syshi//blastdatabase/nr.08
    Posted date:  Mar 3, 2013 11:25 PM
  Number of letters in database: 64,230,274
  Number of sequences in database:  189,558
  
Lambda     K      H
   0.317    0.134    0.411 

Lambda     K      H
   0.267   0.0410    0.140 


Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,710,577,095
Number of Sequences: 23463169
Number of extensions: 474644220
Number of successful extensions: 1055192
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4727
Number of HSP's successfully gapped in prelim test: 4818
Number of HSP's that attempted gapping in prelim test: 1013310
Number of HSP's gapped (non-prelim): 24058
length of query: 632
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 483
effective length of database: 8,863,183,186
effective search space: 4280917478838
effective search space used: 4280917478838
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)