BLASTP 2.2.26 [Sep-21-2011]
Reference: Altschul, Stephen F., Thomas L. Madden, Alejandro A. Schaffer,
Jinghui Zhang, Zheng Zhang, Webb Miller, and David J. Lipman (1997),
"Gapped BLAST and PSI-BLAST: a new generation of protein database search
programs", Nucleic Acids Res. 25:3389-3402.
Reference for compositional score matrix adjustment: Altschul, Stephen F.,
John C. Wootton, E. Michael Gertz, Richa Agarwala, Aleksandr Morgulis,
Alejandro A. Schaffer, and Yi-Kuo Yu (2005) "Protein database searches
using compositionally adjusted substitution matrices", FEBS J. 272:5101-5109.
Query= psy12301
(632 letters)
Database: nr
23,463,169 sequences; 8,064,228,071 total letters
Searching..................................................done
>gi|328699373|ref|XP_001945817.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 567
Score = 651 bits (1679), Expect = 0.0, Method: Compositional matrix adjust.
Identities = 313/553 (56%), Positives = 405/553 (73%), Gaps = 37/553 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWNDLSFHGS+EIPTPNIDALA+NGI+LNN+Y QPVCTPSR +LMTGKYPI GMQGPP
Sbjct: 39 GWNDLSFHGSDEIPTPNIDALAFNGIVLNNLYTQPVCTPSRVALMTGKYPIKLGMQGPPT 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+GAEP G+PL+E+ LPEYLRELGY+T+AIGKWHLGF+++ YTP RGF+SHFGY G +S
Sbjct: 99 YGAEPNGLPLSEKLLPEYLRELGYTTRAIGKWHLGFYKQAYTPTRRGFDSHFGYYTGYVS 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD++L D Y E G DMRRN + AWD VG+YATD+FT EAV+LI++QP ++PLF+Y
Sbjct: 159 YYDYLLQDVYQNFGEFQGFDMRRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMY 218
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH+A H GN GK+LEAPQ +N+F +I DPNRR YAAMV KLD+SVG V+ AL K ML
Sbjct: 219 LAHVAVHTGNRGKYLEAPQSEVNKFNHILDPNRRIYAAMVSKLDESVGRVVEALTEKKML 278
Query: 332 ENSIIIFMSDNGAPTVE-----YR-ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+N+II+FMSDNG+P+ + +R E NWGSN+PYRG+KNTLWEGGVK + +W+P
Sbjct: 279 QNTIIVFMSDNGSPSFDGSGRNFRPEAGVTANWGSNFPYRGIKNTLWEGGVKSASFIWAP 338
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
Q+NPRVS Q+MHI+DWLPTLY+AAGG+ LP N+DG+DQW+SL L
Sbjct: 339 YFQENPRVSKQLMHITDWLPTLYSAAGGNPGFLPKNLDGVDQWTSLTL------------ 386
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
N PS+RN VL+NIDEK+R A + D+WKL++G+ NGT+DG++G TR +
Sbjct: 387 ---------NLPSQRNFVLLNIDEKQRYAGILKDNWKLIVGSTANGTLDGFFGATRP-RT 436
Query: 506 PLLNFNAIVESKTYQSLQQLSQNI------FLPISNIDKMRSTRQQATIHCGANPA-PMT 558
P N A++ S ++L +LS ++ F P+S++ ++ + R ++TI C + A P T
Sbjct: 437 P-YNATAVLYSPAGRALTKLSDSLPFATAGFGPMSSVRELLAMRYESTIRCNPSAAGPRT 495
Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
SPC G LF+L DPCE NN+A +S QLYE LKY+RR LVPQ++ +P A+
Sbjct: 496 QSPCPAGEACLFDLVADPCETNNVAKKYVAVSGQLYEALKYYRRLLVPQTN-RPFDPAAN 554
Query: 619 PKRFNDTWSPWIY 631
P RFN+TWS W+Y
Sbjct: 555 PARFNNTWSTWMY 567
Score = 119 bits (297), Expect = 7e-24, Method: Compositional matrix adjust.
Identities = 51/80 (63%), Positives = 66/80 (82%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN + AWD VG+YATD+FT EAV+LI++QP ++PLF+YLAH+A H GN GK+LEAPQ
Sbjct: 179 MRRNDTIAWDVVGKYATDVFTDEAVRLIKEQPANQPLFMYLAHVAVHTGNRGKYLEAPQS 238
Query: 61 TINQFQYITDPNRRTYAALT 80
+N+F +I DPNRR YAA+
Sbjct: 239 EVNKFNHILDPNRRIYAAMV 258
>gi|242025544|ref|XP_002433184.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518725|gb|EEB20446.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 610
Score = 564 bits (1454), Expect = e-158, Method: Compositional matrix adjust.
Identities = 287/554 (51%), Positives = 378/554 (68%), Gaps = 46/554 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWNDLSFHGS++I TPN+DALAYNG+ILN+ Y PVCTPSRA+LMTG YPIH GMQG P+
Sbjct: 52 GWNDLSFHGSDQIQTPNLDALAYNGVILNSQYVLPVCTPSRAALMTGMYPIHNGMQGLPL 111
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+EPR +P + LP YL++LGY+T+ +GKWHLG++++E+TP YRGF+SH GY NG++S
Sbjct: 112 EASEPRALP-AGKLLPSYLKDLGYTTRMVGKWHLGYYQKEFTPTYRGFDSHLGYWNGIVS 170
Query: 212 YYDHILSDQYSRT--VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YYD+IL + +R LNG DMRRN++ A+D G YAT++FT EAV LI + PLF
Sbjct: 171 YYDYILQEDDNRKPRSSLNGFDMRRNITPAYDLQGRYATEMFTDEAVHLIRSHNKNTPLF 230
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LY++HLA HAGN GK LEAPQE IN+F +I DPNRRT+A MV+KLD+SVG +I+AL+
Sbjct: 231 LYMSHLAVHAGNPGKFLEAPQEAINKFLHIADPNRRTFAGMVQKLDESVGRLIAALEESN 290
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML NSII+F +DNGAPT+ Y NWGSNYP RG+K TLWEGG + + +WSP I+
Sbjct: 291 MLHNSIIVFSADNGAPTI-----GGYPNWGSNYPLRGIKETLWEGGTRGVSFIWSPLIKN 345
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
PRVS +++HISDWLPTLY AAGG+ + N+DG++QW
Sbjct: 346 TPRVSNELIHISDWLPTLYAAAGGNMENVK-NLDGVNQW--------------------- 383
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDS----WKLVLGTQENGTMDGYYGQTRSNKV 505
S+ N PS RN +L+NIDE +TAA+R++S WKLV+GT NGT D Y+G+ +NK
Sbjct: 384 PSISENLPSERNEILLNIDEILQTAAIRMNSRKYHWKLVVGTVNNGTYDEYFGEG-TNKF 442
Query: 506 PLLN----FNAIVESKTYQSLQQLSQNIFL--PISNIDKMRSTRQQATIHCG---ANPAP 556
P N FNA+ S+ Y++L++L+Q L P S +KM+ R+ ATI C NP
Sbjct: 443 PENNPPYDFNAVANSQVYRTLERLAQKRLLTYPTSK-EKMKVLRRDATIRCDYKIINPNR 501
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
PS N C L+++ DPCE NN+ P+++S L+ L HR +LVPQ ++
Sbjct: 502 CNPSDGPNQVC-LYDIKRDPCELNNLTPFYPNVASYLFRALVQHRESLVPQESKELQQDL 560
Query: 617 ADPKRFNDTWSPWI 630
ADP FN TW+PWI
Sbjct: 561 ADPVNFNGTWTPWI 574
Score = 107 bits (267), Expect = 2e-20, Method: Compositional matrix adjust.
Identities = 48/81 (59%), Positives = 61/81 (75%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN++ A+D G YAT++FT EAV LI + PLFLY++HLA HAGN GK LEAPQE
Sbjct: 193 MRRNITPAYDLQGRYATEMFTDEAVHLIRSHNKNTPLFLYMSHLAVHAGNPGKFLEAPQE 252
Query: 61 TINQFQYITDPNRRTYAALTK 81
IN+F +I DPNRRT+A + +
Sbjct: 253 AINKFLHIADPNRRTFAGMVQ 273
>gi|189236319|ref|XP_975218.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 536
Score = 509 bits (1310), Expect = e-141, Method: Compositional matrix adjust.
Identities = 257/540 (47%), Positives = 352/540 (65%), Gaps = 34/540 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAYNGIILN+ Y+Q TPSRA+L+TGKYP+ G+QGP I
Sbjct: 19 GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 78
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AE R +P + + EY +++GY+T +GKWHLG R TP +RGF+ FG+ NG S
Sbjct: 79 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 137
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD++ S+ E +G D+RR+ +W+ G+YATDLF + AV +I+ V+ PLF+
Sbjct: 138 YYDYV-SNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMM 196
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+AHLA H GN GK LEAPQET+N+F++I DPNRRTYAAMV KLDDS+G V AL+ K ML
Sbjct: 197 IAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALEAKNML 256
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+I++F+SDNGAPTV + NWGSNYP RG+K+TL+EGGV+ A +WSP + Q+
Sbjct: 257 QNTIVVFISDNGAPTV-----GPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 311
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS ++HI+DWLPTL+TA GGD S L ++DG+DQWS S
Sbjct: 312 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWS---------------------S 350
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L+ + PS RN + +NIDEK R AA+R WKL++GT NG+ +GY+G + + +N
Sbjct: 351 LVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYN 410
Query: 512 --AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
AI +S + ++++ N P+S D R+ AT+ C A P +G L
Sbjct: 411 TSAINDSPVGRIAKKINYN---PLSETD-FDGLRRVATLKCLDAKAKRNPCDPASGAVCL 466
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+N+ NDPCE+N++A P + ++ L +R+ LVPQ D+ ADPK F W+PW
Sbjct: 467 YNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAWNPW 526
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 65/91 (71%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+RR+ +W+ G+YATDLF + AV +I+ V+ PLF+ +AHLA H GN GK LEAPQE
Sbjct: 157 LRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQE 216
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
T+N+F++I DPNRRTYAA+ ++ V+
Sbjct: 217 TVNKFKHIRDPNRRTYAAMVSKLDDSIGAVF 247
>gi|270005853|gb|EFA02301.1| hypothetical protein TcasGA2_TC007966 [Tribolium castaneum]
Length = 558
Score = 508 bits (1309), Expect = e-141, Method: Compositional matrix adjust.
Identities = 257/540 (47%), Positives = 352/540 (65%), Gaps = 34/540 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAYNGIILN+ Y+Q TPSRA+L+TGKYP+ G+QGP I
Sbjct: 41 GWNDVGFHGSNQIPTPNIDALAYNGIILNSHYSQSFGTPSRAALLTGKYPMKLGLQGPSI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AE R +P + + EY +++GY+T +GKWHLG R TP +RGF+ FG+ NG S
Sbjct: 101 TPAEGRSLP-EGKIMSEYFKDMGYATHLVGKWHLGHSRWNDTPTFRGFDHFFGFYNGFTS 159
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD++ S+ E +G D+RR+ +W+ G+YATDLF + AV +I+ V+ PLF+
Sbjct: 160 YYDYV-SNWKINDKEYSGFDLRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMM 218
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+AHLA H GN GK LEAPQET+N+F++I DPNRRTYAAMV KLDDS+G V AL+ K ML
Sbjct: 219 IAHLAVHVGNEGKWLEAPQETVNKFKHIRDPNRRTYAAMVSKLDDSIGAVFEALEAKNML 278
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+I++F+SDNGAPTV + NWGSNYP RG+K+TL+EGGV+ A +WSP + Q+
Sbjct: 279 QNTIVVFISDNGAPTV-----GPHHNWGSNYPLRGIKDTLFEGGVRTVACIWSPLLVQSS 333
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS ++HI+DWLPTL+TA GGD S L ++DG+DQWS S
Sbjct: 334 RVSTDLIHITDWLPTLFTAVGGDLSVLDPDLDGIDQWS---------------------S 372
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L+ + PS RN + +NIDEK R AA+R WKL++GT NG+ +GY+G + + +N
Sbjct: 373 LVYDLPSARNDIPLNIDEKTRNAALRFSYWKLIVGTSGNGSYNGYFGAPLNENIEEQQYN 432
Query: 512 --AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
AI +S + ++++ N P+S D R+ AT+ C A P +G L
Sbjct: 433 TSAINDSPVGRIAKKINYN---PLSETD-FDGLRRVATLKCLDAKAKRNPCDPASGAVCL 488
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+N+ NDPCE+N++A P + ++ L +R+ LVPQ D+ ADPK F W+PW
Sbjct: 489 YNIPNDPCEENDLAKFFPSVVRRIKRTLVDYRQGLVPQIETSIDIENADPKLFQFAWNPW 548
Score = 104 bits (260), Expect = 1e-19, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 65/91 (71%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+RR+ +W+ G+YATDLF + AV +I+ V+ PLF+ +AHLA H GN GK LEAPQE
Sbjct: 179 LRRDTVPSWNDAGKYATDLFAEHAVDVIQKHNVNTPLFMMIAHLAVHVGNEGKWLEAPQE 238
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
T+N+F++I DPNRRTYAA+ ++ V+
Sbjct: 239 TVNKFKHIRDPNRRTYAAMVSKLDDSIGAVF 269
>gi|328783191|ref|XP_396281.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 713
Score = 504 bits (1299), Expect = e-140, Method: Compositional matrix adjust.
Identities = 263/563 (46%), Positives = 346/563 (61%), Gaps = 54/563 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPNIDALAYNGIILNN Y +CTPSR++LMTGK PIH GMQ +
Sbjct: 83 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 142
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ EPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+F++EYTP YRGF+SHFGY NG+
Sbjct: 143 FPTEPRGLPLSEKLLPEYLREIGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 202
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI + E G DMRRNL+ AWDTVG+Y+TDLFT EA++LI + D+P+FLY
Sbjct: 203 YYTHITQEPDPAFSEFKGFDMRRNLTVAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFLY 262
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAHLA H GN + AP E I +F YI DP RR AA+V KLD SVG V+ AL+ +GML
Sbjct: 263 LAHLAVHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGDVMDALRNRGML 322
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
ENSI++FMSDNGAP+ N GSNYP RG+KN+ WEGG + A +WSP I+++
Sbjct: 323 ENSIVLFMSDNGAPS-----QGILNNKGSNYPLRGIKNSPWEGGTRGVAAIWSPLIKKSK 377
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS QMM ISDWLPTL +AAG + +L NIDG D WS+
Sbjct: 378 RVSNQMMFISDWLPTLLSAAGVNRKQL----------------------GNIDGFDLWSA 415
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT---------MDGYYGQTRS 502
L+LN S R+ V++NID+ A+ R ++K ++G E G+ +G Q
Sbjct: 416 LVLNKVSSRSEVVLNIDDLSDYASFRRGNFKYIIGRTETGSDWLGASGDPSEGISPQYDP 475
Query: 503 NKVPL----LNFNAIVESKTYQSLQQ------------LSQNIFLPISNIDKMRSTRQQA 546
K+ + + I+ +K L++ L I +++ + R++A
Sbjct: 476 YKILYSKTGVAISGIITAKQAMELKERRKRSIKIYDTALETKFEEKILSVENISELRRKA 535
Query: 547 TIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
I C P PC LFN+ DPCE N+A RP I + L +L +R T +P
Sbjct: 536 QIKCNVQEKDRIPCEPMTAPC-LFNIEKDPCEMVNLAERRPVIMAILERILMKYRITAIP 594
Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
S+ D +ADP +N+TW+ W
Sbjct: 595 ASNLDGD-PRADPTLWNNTWTSW 616
Score = 106 bits (265), Expect = 4e-20, Method: Compositional matrix adjust.
Identities = 47/80 (58%), Positives = 58/80 (72%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL+ AWDTVG+Y+TDLFT EA++LI + D+P+FLYLAHLA H GN + AP E
Sbjct: 223 MRRNLTVAWDTVGKYSTDLFTNEAIRLINEHDTDRPMFLYLAHLAVHKGNEEQLFRAPDE 282
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F YI DP RR AA+
Sbjct: 283 EIAKFSYILDPERRIQAAVV 302
>gi|380012883|ref|XP_003690503.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 671
Score = 502 bits (1293), Expect = e-139, Method: Compositional matrix adjust.
Identities = 266/564 (47%), Positives = 350/564 (62%), Gaps = 55/564 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPNIDALAYNGIILNN Y +CTPSR++LMTGK PIH GMQ +
Sbjct: 39 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+F++EYTP YRGF+SHFGY NG+
Sbjct: 99 FPAEPRGLPLSEKLLPEYLREVGYKTHAVGKWHLGYFKKEYTPTYRGFDSHFGYWNGLQD 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI + E G DMRRNL+ AWDTVG+Y+TDLFT EAV+LI + +D+P+FLY
Sbjct: 159 YYTHITQEPDPVYSEYKGFDMRRNLTVAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFLY 218
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAHLA H GN + AP E I +F YI DP RR AA+V KLD SVG V+ AL+ +GML
Sbjct: 219 LAHLAPHKGNEEQLFRAPDEEIAKFSYILDPERRIQAAVVSKLDQSVGDVMDALRNRGML 278
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
ENSI++FMSDNGAP+ N GSNYP RG+KN+ WEGG + A +WSP I+++
Sbjct: 279 ENSIVVFMSDNGAPS-----QGLLSNEGSNYPLRGIKNSPWEGGTRGVAAIWSPLIKKSK 333
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS QMM +SDWLPTL +AAG D +L NIDG D W S+
Sbjct: 334 RVSNQMMFMSDWLPTLLSAAGVDRRQLG-NIDGYDLW---------------------SA 371
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM---------DGYYGQTRS 502
L+LN S R+ V++NID+ AA R +K ++G E G+ +G Q
Sbjct: 372 LVLNKISSRSEVMLNIDDLSDYAAFRRGDFKYIIGRTETGSAWLGASGDPSEGISPQYDP 431
Query: 503 NKVPL----LNFNAIVESKTYQSLQQ-------------LSQNIFLPISNIDKMRSTRQQ 545
+K+ + + I+ +K L++ L N I +++ + R++
Sbjct: 432 HKILYSKTGVAISGIITAKQAMELKEERRKRSVKIYDTALKTNFEEKILSVENISELRRK 491
Query: 546 ATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
A I C P PC LFN+ DPCE N+A RP I + L +L +R T++
Sbjct: 492 AQIKCNVKEKDRIPCEPMTAPC-LFNIEKDPCEMVNLAEKRPMIMAILERILMKYRLTVI 550
Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
P S+ D +A+P +N+TW+ W
Sbjct: 551 PPSNLDGD-PRANPSLWNNTWTSW 573
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 48/80 (60%), Positives = 59/80 (73%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL+ AWDTVG+Y+TDLFT EAV+LI + +D+P+FLYLAHLA H GN + AP E
Sbjct: 179 MRRNLTVAWDTVGKYSTDLFTNEAVRLINEHNIDQPMFLYLAHLAPHKGNEEQLFRAPDE 238
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F YI DP RR AA+
Sbjct: 239 EIAKFSYILDPERRIQAAVV 258
>gi|307191747|gb|EFN75189.1| Arylsulfatase B [Harpegnathos saltator]
Length = 583
Score = 491 bits (1264), Expect = e-136, Method: Compositional matrix adjust.
Identities = 249/540 (46%), Positives = 341/540 (63%), Gaps = 34/540 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+IPTPNIDALAY G++L N Y +CTPSRA+L+TGKYPIH GMQ I
Sbjct: 52 GWNDVSFHGSNQIPTPNIDALAYYGVLLKNHYVAALCTPSRAALLTGKYPIHLGMQHEAI 111
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ +EPRG+PL E+ LP+YL+++ Y T +GKWHLG+++ EYTPLYRGF++HFGY NG+
Sbjct: 112 FPSEPRGLPLEEKLLPQYLKDMNYVTHIVGKWHLGYYKMEYTPLYRGFDTHFGYWNGLQD 171
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H ++ Y+ + G DMRRN + AWDT+G+Y+ DL+T EAV+LI D P+FLY
Sbjct: 172 YYSHKTAEPYTLNI---GMDMRRNFTVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLY 228
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA +A HAGNA + +A E I +F YI DP R+ YAA+V KLD+SVG V+ AL+ + ML
Sbjct: 229 LAQIAPHAGNANQLPQALPEEIEKFSYIIDPKRKRYAAVVSKLDESVGKVVEALKNRNML 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NSII+FMSDNGAPT + N GSNYP+RG+K TLWEGGV+ A +WSP I++
Sbjct: 289 SNSIILFMSDNGAPTSGF-----LSNGGSNYPFRGIKKTLWEGGVRGAAAIWSPLIKKRE 343
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS +MH++DWLPTL +AAGG+ + IDGLD W +
Sbjct: 344 RVSYHLMHVTDWLPTLLSAAGGNKEEI----------------------GTIDGLDMWET 381
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG-TQENGTMDGYYGQTRSNKVPLLNF 510
++ S R VLINID+ + A+R+ +K V G +++N G G+ S + P +
Sbjct: 382 IVYGDCSPRKEVLINIDDICDSEAIRIGKYKYVRGESKKNNDWVGDSGRNSSEQRPPYSP 441
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
++ SK ++ L N + +++ KM RQQA +HC PC LF
Sbjct: 442 ETVLNSKVGIAIASLPGNNNVALTSA-KMLELRQQAEVHCNVTEEEKVQCDTQLAPC-LF 499
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
+L DPCE NI P ++ +L E+L HR T++ ++ D V+A P +N TW W+
Sbjct: 500 DLERDPCEMINIIKQEPLVAKELQEVLLQHRSTMLLPNNRIID-VRASPILWNCTWVCWM 558
Score = 95.1 bits (235), Expect = 1e-16, Method: Compositional matrix adjust.
Identities = 43/80 (53%), Positives = 56/80 (70%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN + AWDT+G+Y+ DL+T EAV+LI D P+FLYLA +A HAGNA + +A E
Sbjct: 189 MRRNFTVAWDTMGKYSVDLYTDEAVRLINTHNTDNPMFLYLAQIAPHAGNANQLPQALPE 248
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F YI DP R+ YAA+
Sbjct: 249 EIEKFSYIIDPKRKRYAAVV 268
>gi|383859596|ref|XP_003705279.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 689
Score = 486 bits (1252), Expect = e-134, Method: Compositional matrix adjust.
Identities = 269/614 (43%), Positives = 366/614 (59%), Gaps = 53/614 (8%)
Query: 39 LYLAHLAAHAGNAG-KHLEAPQETINQFQYITDPNR--RTYAALTKSTTLTLLIV--YGW 93
L+LA L + G L +P+ IN F + P + + + + +++ GW
Sbjct: 13 LFLAELTVNVGAIFFSSLLSPKVKIN-FPSLGKPGKDPKHVDGCRQKPHIIIILADDLGW 71
Query: 94 NDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWG 153
ND+SFHGS++IPTPNIDA+AYNGIILN+ Y +CTPSR +LMTGK PIH GMQ +
Sbjct: 72 NDVSFHGSDQIPTPNIDAIAYNGIILNSHYVAALCTPSRTALMTGKNPIHLGMQHSVLLP 131
Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
+EPRG+PL+E+ LPEYLRE+GY T A+GKWHLG+FRREYTP +RGF++HFGY NG+ YY
Sbjct: 132 SEPRGLPLSEKLLPEYLREVGYRTHAVGKWHLGYFRREYTPTFRGFDTHFGYWNGLQDYY 191
Query: 214 DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLA 273
HI + + E G DMRRNL+ AWDT G+Y+TDLFT+EAV+LI + D P+FLYLA
Sbjct: 192 THITQEPDPQFGEFMGLDMRRNLTAAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFLYLA 251
Query: 274 HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
HLA H GN + L A E I +F YI DP RR AA+V K+D SVG V+ AL+++GMLEN
Sbjct: 252 HLAPHKGNPNRLLRASDEDIARFSYILDPERRIQAAVVSKVDQSVGEVMDALRKRGMLEN 311
Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRV 393
SI++FMSDNGAPT + N GSNYP RG+K++ WEGG + A +WSP I+++ RV
Sbjct: 312 SIVLFMSDNGAPT-----QGSLSNQGSNYPLRGIKDSPWEGGTRGVAAIWSPLIKESQRV 366
Query: 394 SLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL 453
S QMM I+DWLPTL +AAG D+ +L NIDG D W +L
Sbjct: 367 SNQMMFIADWLPTLLSAAGVDSKQL----------------------GNIDGFDLWPALR 404
Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM-DGYYGQTRSNKVPLLNFNA 512
+ S R+ V++NID+ + A++R +K V+G E G G G P + +
Sbjct: 405 SDKISPRSDVVLNIDDISKYASIRRGDFKYVIGETETGRAWVGATGAPSEGIPPKYDPDK 464
Query: 513 IVESKTYQSLQQL---------------SQNIFL--PISNIDKMRSTRQQATIHCGANPA 555
++ SK ++ + S I I + +K+ R A + C
Sbjct: 465 VLYSKAGVAISGVLTAKQAKELKKKKNTSSKIDFEEKILSAEKIFQLRNDARVQCNVKEE 524
Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
P T PC LFN+ DPCE N+A RP I + + +L +R T VP S+ D
Sbjct: 525 NRIPCDPTAAPC-LFNIEKDPCEMVNLADKRPLILAIMERVLMNYRLTTVPPSNVNGD-P 582
Query: 616 QADPKRFNDTWSPW 629
+A+P +N+TW+ W
Sbjct: 583 RANPALWNNTWTTW 596
Score = 102 bits (254), Expect = 6e-19, Method: Compositional matrix adjust.
Identities = 72/203 (35%), Positives = 99/203 (48%), Gaps = 42/203 (20%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL+ AWDT G+Y+TDLFT+EAV+LI + D P+FLYLAHLA H GN + L A E
Sbjct: 210 MRRNLTAAWDTQGKYSTDLFTEEAVRLINEHDKDDPMFLYLAHLAPHKGNPNRLLRASDE 269
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN 120
I +F YI DP RR AA+ ++ V +DAL G++ N
Sbjct: 270 DIARFSYILDPERRIQAAVVSKVDQSVGEV------------------MDALRKRGMLEN 311
Query: 121 N--MYAQPVCTPSRASLMT--GKYPIHTGMQGPPIWGAEPRGV-----PLTER------- 164
+ ++ P++ SL YP+ G++ P W RGV PL +
Sbjct: 312 SIVLFMSDNGAPTQGSLSNQGSNYPLR-GIKDSP-WEGGTRGVAAIWSPLIKESQRVSNQ 369
Query: 165 ------FLPEYLRELGYSTKAIG 181
+LP L G +K +G
Sbjct: 370 MMFIADWLPTLLSAAGVDSKQLG 392
>gi|270006267|gb|EFA02715.1| hypothetical protein TcasGA2_TC008439 [Tribolium castaneum]
Length = 648
Score = 472 bits (1215), Expect = e-130, Method: Compositional matrix adjust.
Identities = 255/570 (44%), Positives = 347/570 (60%), Gaps = 50/570 (8%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
A TK + +++ G+ND+ FHGSNEIPTPNIDALAYNG+ILN+ Y Q +CTPSR++
Sbjct: 18 AQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAF 77
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TGKYPIH GMQ I EP G+PL E LP+YL+ GY+T AIGKWHLGFFR+EYTP
Sbjct: 78 LTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPT 137
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
YRGF+SHFGY G+ YY H + ++ +G+DMRRN++ W G+Y+T LFT EA
Sbjct: 138 YRGFDSHFGYWQGLQDYYKHTVHATFTPE---HGYDMRRNMTVDWSAQGKYSTTLFTDEA 194
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
V+LI + + P+F+YLAHLA H+GN L+AP E I +F +I DP RR YAAMV LD
Sbjct: 195 VRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLD 254
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SVG+VI+AL+ K MLENSII+FMSDNGA + + N GSNYP RG KN+ WEG
Sbjct: 255 KSVGSVIAALRDKHMLENSIIVFMSDNGA-----KPDGIHANHGSNYPLRGNKNSAWEGA 309
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ A +WSP I++ RVS +MHISDWLPT YTAAG + + LP
Sbjct: 310 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP---------------- 353
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+DG+D W+S+ S R +L NIDE A+R+ +WK + G+ NG DG
Sbjct: 354 ------KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDG 407
Query: 496 YYGQTRSNKVPLLNFNAIVESK---------TYQSLQQLSQ---NIFLPISNIDKMRSTR 543
+YG + + + + +A++ S+ TYQ +++ Q N + + + +++ R
Sbjct: 408 WYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLR 467
Query: 544 QQATIHC-GANPAPMTPSPCTNG---PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
A + C N + S N PC LFN+ DPCEQ N+A+ RP I + L
Sbjct: 468 GAAEVKCPRVNFEEIPESKKCNAVESPC-LFNIKEDPCEQINLAAERPMIVLNMEMALAR 526
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
++T +P P ADP ++N+TW W
Sbjct: 527 FKQTALP-IRNVPRDPNADPAKWNNTWVNW 555
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 55/80 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN++ W G+Y+T LFT EAV+LI + + P+F+YLAHLA H+GN L+AP E
Sbjct: 171 MRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDE 230
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F +I DP RR YAA+
Sbjct: 231 EIAKFGHIADPERRIYAAMV 250
>gi|189236827|ref|XP_972832.2| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
Length = 646
Score = 470 bits (1209), Expect = e-129, Method: Compositional matrix adjust.
Identities = 255/570 (44%), Positives = 345/570 (60%), Gaps = 52/570 (9%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
A TK + +++ G+ND+ FHGSNEIPTPNIDALAYNG+ILN+ Y Q +CTPSR++
Sbjct: 18 AQTKKPNIIVIVADDMGFNDVGFHGSNEIPTPNIDALAYNGVILNSHYTQALCTPSRSAF 77
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TGKYPIH GMQ I EP G+PL E LP+YL+ GY+T AIGKWHLGFFR+EYTP
Sbjct: 78 LTGKYPIHLGMQHLVILEPEPWGLPLNETILPQYLKRNGYATHAIGKWHLGFFRKEYTPT 137
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
YRGF+SHFGY G+ YY H + +G+DMRRN++ W G+Y+T LFT EA
Sbjct: 138 YRGFDSHFGYWQGLQDYYKHTV-----HFTPEHGYDMRRNMTVDWSAQGKYSTTLFTDEA 192
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
V+LI + + P+F+YLAHLA H+GN L+AP E I +F +I DP RR YAAMV LD
Sbjct: 193 VRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDEEIAKFGHIADPERRIYAAMVSMLD 252
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SVG+VI+AL+ K MLENSII+FMSDNGA + + N GSNYP RG KN+ WEG
Sbjct: 253 KSVGSVIAALRDKHMLENSIIVFMSDNGA-----KPDGIHANHGSNYPLRGNKNSAWEGA 307
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ A +WSP I++ RVS +MHISDWLPT YTAAG + + LP
Sbjct: 308 MRCVAAIWSPLIKKPQRVSNSLMHISDWLPTFYTAAGLNKTELP---------------- 351
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+DG+D W+S+ S R +L NIDE A+R+ +WK + G+ NG DG
Sbjct: 352 ------KMDGVDMWASISEGKDSPRTELLHNIDEIYNYGALRVGNWKYLYGSTTNGKSDG 405
Query: 496 YYGQTRSNKVPLLNFNAIVESK---------TYQSLQQLSQ---NIFLPISNIDKMRSTR 543
+YG + + + + +A++ S+ TYQ +++ Q N + + + +++ R
Sbjct: 406 WYGSSGRDPLYTYDDSAVLASQTGSTLAGLTTYQQIKEKHQGDTNFTHKLLDSETIKTLR 465
Query: 544 QQATIHC-GANPAPMTPSPCTNG---PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
A + C N + S N PC LFN+ DPCEQ N+A+ RP I + L
Sbjct: 466 GAAEVKCPRVNFEEIPESKKCNAVESPC-LFNIKEDPCEQINLAAERPMIVLNMEMALAR 524
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
++T +P P ADP ++N+TW W
Sbjct: 525 FKQTALP-IRNVPRDPNADPAKWNNTWVNW 553
Score = 92.8 bits (229), Expect = 5e-16, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 55/80 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN++ W G+Y+T LFT EAV+LI + + P+F+YLAHLA H+GN L+AP E
Sbjct: 169 MRRNMTVDWSAQGKYSTTLFTDEAVRLIREHNTENPMFMYLAHLAPHSGNDDDPLQAPDE 228
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F +I DP RR YAA+
Sbjct: 229 EIAKFGHIADPERRIYAAMV 248
>gi|156552077|ref|XP_001604760.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 710
Score = 468 bits (1203), Expect = e-129, Method: Compositional matrix adjust.
Identities = 256/569 (44%), Positives = 348/569 (61%), Gaps = 61/569 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPNIDALAYNG+ILN+ Y +CTPSR++L+TGKYPIHTGMQ I
Sbjct: 55 GWNDVSFHGSDQIPTPNIDALAYNGVILNSHYVSALCTPSRSALLTGKYPIHTGMQHLVI 114
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRG+PL E+ LP+YL+E GY+T AIGKWH GF RREYTP YRGF+SHFGY G+
Sbjct: 115 LEAEPRGLPLHEKILPQYLKEAGYATHAIGKWHQGFHRREYTPTYRGFDSHFGYWQGLQD 174
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
YY H + + L G DMRRN+S A DT G+Y+TDLFT EAV+LIE+ +P P+FL
Sbjct: 175 YYTHEVGSSNPKEGFL-GFDMRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFL 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLAHLA H+GN + L+AP E + +F Y+ DP RR YAAM+ KLD SVG V+SAL+RK M
Sbjct: 234 YLAHLAPHSGNDNEPLQAPDEEVAKFSYVEDPERRIYAAMMSKLDQSVGEVVSALRRKNM 293
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+NSI++FM+DNGA T + N GSNYP RG+K + WEG V+ A +WSP IQ+
Sbjct: 294 LQNSIVVFMADNGAAT-----QGIHYNRGSNYPLRGIKASAWEGAVRGAAAVWSPLIQRP 348
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R+ ++M I+DWLPTL +A+G L R ++NIDG+DQW
Sbjct: 349 KRIYNELMSIADWLPTLLSASG--------------------LRDVVRVSANIDGVDQWP 388
Query: 451 SLLLNTPSR-RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
++ PS RN +L+NID +A+R +K VLGT NG + +YG+T + L
Sbjct: 389 AISGVAPSPPRNEILVNIDPIFNYSALRRGEFKYVLGTVGNG--EEWYGETGRPENQGLE 446
Query: 510 -------------------FNAIVESKTYQSLQQLSQNIFLPISN----------IDKMR 540
++ +K ++ + ++ + + + +D++
Sbjct: 447 GASPTYDPETVLMSKAGTAIAGLLTAKQVSEIRAVRKDTYGRVKDDEMATSKLLTVDELL 506
Query: 541 STRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH 600
R A++ C + PC LFN+ DPCEQ N+A+SR I + L E L +
Sbjct: 507 KLRSSASLRCTVPESERVACHPLQSPC-LFNIKEDPCEQRNLAASRAMILATLEEALLKY 565
Query: 601 RRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
R T +P S+ P+ +A+P +N TW W
Sbjct: 566 RVTALPPSN-VPNDPKANPAFWNHTWVNW 593
Score = 99.8 bits (247), Expect = 4e-18, Method: Compositional matrix adjust.
Identities = 47/80 (58%), Positives = 61/80 (76%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
MRRN+S A DT G+Y+TDLFT EAV+LIE+ +P P+FLYLAHLA H+GN + L+AP
Sbjct: 194 MRRNMSLARDTYGKYSTDLFTDEAVRLIEEHRPEAGPMFLYLAHLAPHSGNDNEPLQAPD 253
Query: 60 ETINQFQYITDPNRRTYAAL 79
E + +F Y+ DP RR YAA+
Sbjct: 254 EEVAKFSYVEDPERRIYAAM 273
>gi|270005303|gb|EFA01751.1| hypothetical protein TcasGA2_TC007349 [Tribolium castaneum]
Length = 543
Score = 462 bits (1190), Expect = e-127, Method: Compositional matrix adjust.
Identities = 252/537 (46%), Positives = 336/537 (62%), Gaps = 46/537 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS +IPTPNIDALAY+G+IL N Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 34 GWNDVGFHGSGQIPTPNIDALAYSGLILQNYYVTPICTPSRSALMTGKYPIHTGMQHTVL 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+GAEPRG+PLTE+ LPEYLRELGY+ + +GKWHLG + +EYTPLYRGF+SH GY G
Sbjct: 94 FGAEPRGLPLTEKILPEYLRELGYTNRLVGKWHLGSYTKEYTPLYRGFDSHLGYWTGHQD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH + G DMRRN+ A+D G+Y+TD+FT+EAV++IE+ PLFLY
Sbjct: 154 YYDHTAVENPGW-----GFDMRRNMDLAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLY 208
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH+A H+ N L AP ET+ +F I R+ +AAM+ KLDDSVG V+ +L ++ ML
Sbjct: 209 LAHVAVHSANPYNPLPAPDETVEKFSNIPSYKRQRFAAMLSKLDDSVGAVVESLSKRKML 268
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSIIIF +DNG + N SN+P RGVKNT+WEGGV+ +LWSP I+Q
Sbjct: 269 KNSIIIFSTDNGGAAAGFN-----LNAASNFPLRGVKNTVWEGGVRGAGLLWSPLIKQPQ 323
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV+ Q MHISDWLPTL +A G ++S NIDGLD W SL N
Sbjct: 324 RVAQQFMHISDWLPTLLSAVGANSSL--KNIDGLDLWESLSQNK---------------- 365
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
N+P R+ +L NID+ AA+ + +WKLV GT NG DG+YG + + + +
Sbjct: 366 ---NSP--RSEILHNIDDIYGNAALTVGAWKLVKGTTYNGAWDGWYGPSGRDYT--YDLS 418
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN-PAPMTPSPCTNGPCYLF 570
+V S T +L+ ++ ++ +P D++ R + I C + P P P T C LF
Sbjct: 419 LVVNSPTGVALKSINMSL-VP----DEITKLRNASRIDCATDQPIPCKPLEET---C-LF 469
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
N+ DPCE N+A P I L E L + T +P + D + +PK +N+TW+
Sbjct: 470 NVIQDPCELRNLAKENPKIVMNLEETLNKYNNTALPPGNLPLD-PRGNPKYWNNTWT 525
Score = 82.4 bits (202), Expect = 7e-13, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 54/79 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN+ A+D G+Y+TD+FT+EAV++IE+ PLFLYLAH+A H+ N L AP E
Sbjct: 169 MRRNMDLAYDLHGQYSTDVFTQEAVKIIENHNTTNPLFLYLAHVAVHSANPYNPLPAPDE 228
Query: 61 TINQFQYITDPNRRTYAAL 79
T+ +F I R+ +AA+
Sbjct: 229 TVEKFSNIPSYKRQRFAAM 247
>gi|332024600|gb|EGI64798.1| Arylsulfatase J [Acromyrmex echinatior]
Length = 528
Score = 461 bits (1185), Expect = e-127, Method: Compositional matrix adjust.
Identities = 243/546 (44%), Positives = 335/546 (61%), Gaps = 44/546 (8%)
Query: 83 TTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
+ + LL V+GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR++LMTGKYPI
Sbjct: 12 SLILLLAVHGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPI 71
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
HTGMQ + GAEPRG+PL E+ LPEYLRELGY+T +GKWHLGF+++EYTP YRGF++H
Sbjct: 72 HTGMQHGVLKGAEPRGLPLREKLLPEYLRELGYNTHIVGKWHLGFYKKEYTPTYRGFDTH 131
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
G+ G Y+DH + G D+RR + AWD G+Y+TD+FTKEAV+LI++
Sbjct: 132 IGFWTGHHDYFDHTAVEN-----PYWGLDIRRGMQPAWDLHGQYSTDIFTKEAVRLIDNH 186
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+P+FLYLAH A H+GN L AP E + +F I D NRR +A M+ KLD SVG V+
Sbjct: 187 NSSRPMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGMLSKLDQSVGQVV 246
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL+++ ML +SIIIF SDNG P + N SN+P RGVKNTLWEGGV+ ++
Sbjct: 247 DALRKRDMLRDSIIIFSSDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGTGLI 301
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
WSP++ + RVS QM+HI+DWLPTL TAAGGD+S L
Sbjct: 302 WSPRLVRPGRVSRQMLHITDWLPTLITAAGGDSSNL-----------------------T 338
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
+DG+D W++L +T S R +VL NID+ +A+ + WKL+ G+ G DG+YG S
Sbjct: 339 VDGMDLWNALSEDTESPRANVLHNIDDIYGVSAITVGDWKLIRGSTYKGVWDGWYGP--S 396
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
+ + + + ++ S T ++ + I D + R+ A I C +
Sbjct: 397 GREWVYDVDGVISSTTAHAIASVGLGI-----TTDVVLMLRENALIKCPLRNDSLPICKP 451
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKR 621
C LFN+ DPCE NN+ P I +L EL K++ L+P + P + DP
Sbjct: 452 LEEAC-LFNIYQDPCEDNNLVKQFPTIVRKLQDELKKFNSSALLPGN--LPWDSKGDPNL 508
Query: 622 FNDTWS 627
++ TW+
Sbjct: 509 WDHTWN 514
Score = 86.3 bits (212), Expect = 5e-14, Method: Compositional matrix adjust.
Identities = 39/79 (49%), Positives = 54/79 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+RR + AWD G+Y+TD+FTKEAV+LI++ +P+FLYLAH A H+GN L AP E
Sbjct: 156 IRRGMQPAWDLHGQYSTDIFTKEAVRLIDNHNSSRPMFLYLAHAAVHSGNPYNPLPAPDE 215
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I D NRR +A +
Sbjct: 216 EVAKFNNIFDYNRRRFAGM 234
>gi|307207313|gb|EFN85063.1| Arylsulfatase B [Harpegnathos saltator]
Length = 532
Score = 455 bits (1170), Expect = e-125, Method: Compositional matrix adjust.
Identities = 245/537 (45%), Positives = 332/537 (61%), Gaps = 44/537 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR++LMTGKYPIH GMQ +
Sbjct: 26 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALMTGKYPIHIGMQHGVL 85
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GAEPRG+PL E+ LPEYLR+LGYST +GKWHLGF+ +EYTP YRGF SH G+ G
Sbjct: 86 KGAEPRGLPLHEKILPEYLRDLGYSTHIVGKWHLGFYTKEYTPTYRGFASHTGFWTGHQD 145
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+DH + G DMRR++ AWD G+Y+TD+FTKEA++LI+ +PLFLY
Sbjct: 146 YFDHTAVES-----PYWGLDMRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLY 200
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H+GN L AP E + +F I D NRR +A ++ KLD SVG V+ L++KGML
Sbjct: 201 LAHAAVHSGNPYNPLPAPDEEVAKFDNIFDYNRRRFAGILSKLDKSVGQVVEGLRKKGML 260
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+S+IIF +DNG P + N SN+P RGVKNTLWEGGV+ ++WSP++ ++
Sbjct: 261 RDSVIIFSTDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGAGLIWSPKLTRSG 315
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q++HI+DWLPTL TAAGGD S LNIDG+D WS+L S+
Sbjct: 316 RVSRQLLHITDWLPTLITAAGGDPSN--LNIDGMDLWSAL---------SD--------- 355
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+T S R SVL NID+ +AV + WKLV G+ NG DG+YG S + + +
Sbjct: 356 ---DTESPRTSVLHNIDDIYGVSAVTVGDWKLVQGSTYNGAWDGWYGP--SGREWVYDVG 410
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
++ S+ +++ + I + +R R++A I C + PC LFN
Sbjct: 411 GVIGSEAGRAVASVGLGI-----TAETVRMLREKAMIKCPPKNDSLPLCKPLQEPC-LFN 464
Query: 572 LGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+ DPCE NN+ P I +L EL K++ + P + P + DP ++ TW+
Sbjct: 465 VYQDPCEDNNLVRQSPTIVRKLQEELKKFNSSAIYPGN--LPWDNKGDPSLWDHTWN 519
Score = 86.3 bits (212), Expect = 4e-14, Method: Compositional matrix adjust.
Identities = 40/79 (50%), Positives = 54/79 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR++ AWD G+Y+TD+FTKEA++LI+ +PLFLYLAH A H+GN L AP E
Sbjct: 161 MRRDMEPAWDLHGQYSTDVFTKEALRLIDRHNSSRPLFLYLAHAAVHSGNPYNPLPAPDE 220
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I D NRR +A +
Sbjct: 221 EVAKFDNIFDYNRRRFAGI 239
>gi|307167595|gb|EFN61139.1| Arylsulfatase B [Camponotus floridanus]
Length = 519
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 239/541 (44%), Positives = 329/541 (60%), Gaps = 41/541 (7%)
Query: 87 LLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM 146
++ GWND+ FHGS +IPTPNIDALAY+G++L+ Y +CTPSR++LMTGKYPIHTGM
Sbjct: 5 VVFFEGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTSICTPSRSALMTGKYPIHTGM 64
Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
Q + GAEPRG+PL E+ LPEYLRELGYST +GKWHLGF++REYTP YRGF++H GY
Sbjct: 65 QHSILKGAEPRGLPLHEKILPEYLRELGYSTHIVGKWHLGFYKREYTPTYRGFDTHIGYW 124
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
G YYDH + G DMRR + AWD GEY+TD+FTKEAV+LI + +
Sbjct: 125 TGHHDYYDHTAVEN-----PYWGLDMRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSR 179
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
P+FLYLAH A H+GN L AP E + +F I D NRR +A ++ KLD SVG V+ AL
Sbjct: 180 PMFLYLAHAAVHSGNPYNPLPAPDEEVAKFNNIFDYNRRRFAGILSKLDQSVGQVVEALY 239
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+ ML++S+I+F +DNG P + N SN+P RGVKNTLWEGGV+ ++WSP+
Sbjct: 240 KNNMLQDSVIVFSTDNGGPAAGFN-----LNAASNWPLRGVKNTLWEGGVRGAGLIWSPR 294
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ + RVS QM+HI+DWLPTL TAAGG+ S L ++IDG+
Sbjct: 295 LVRAGRVSRQMIHITDWLPTLITAAGGNASNL----------------------TSIDGV 332
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
+ W +L +T S R ++L NID+ +A+ + WK + G+ NG DG+YG S +
Sbjct: 333 NLWDALSEDTESPRTNILHNIDDIYGVSAITVGDWKFIQGSTYNGVWDGWYGP--SGREW 390
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
+ + + ++ S +++ + ++ I + S R+ A I C + P
Sbjct: 391 IYDADGVISSTAGRAIASVGLSVTTEI-----ISSLRENAMIKCPPRNDSLPICRPLQEP 445
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LFN+ DPCE NN+ P I +L E LK T + + D +ADP ++ TW
Sbjct: 446 C-LFNIYQDPCEDNNLVKQFPTIVRKLQEELKKFNNTAILPGNLPWD-SKADPNLWDHTW 503
Query: 627 S 627
+
Sbjct: 504 N 504
Score = 88.2 bits (217), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 41/79 (51%), Positives = 53/79 (67%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR + AWD GEY+TD+FTKEAV+LI + +P+FLYLAH A H+GN L AP E
Sbjct: 145 MRRGMKPAWDLHGEYSTDVFTKEAVKLINNHNSSRPMFLYLAHAAVHSGNPYNPLPAPDE 204
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I D NRR +A +
Sbjct: 205 EVAKFNNIFDYNRRRFAGI 223
>gi|242024962|ref|XP_002432895.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
gi|212518404|gb|EEB20157.1| arylsulfatase J precursor, putative [Pediculus humanus corporis]
Length = 533
Score = 454 bits (1168), Expect = e-125, Method: Compositional matrix adjust.
Identities = 253/542 (46%), Positives = 339/542 (62%), Gaps = 42/542 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +HGS+EIPTPNIDALAYNGIILN Y PVCTPSR++LMTG++PIH GMQ +
Sbjct: 21 GWNDVGYHGSDEIPTPNIDALAYNGIILNRYYVLPVCTPSRSALMTGRHPIHNGMQHRVL 80
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G E RG+PLTE+ LPEYL++LGYST +GKWHLGF+++EYTPLYRGFESH G+ G
Sbjct: 81 FGVETRGLPLTEKLLPEYLQKLGYSTHIVGKWHLGFYKKEYTPLYRGFESHIGFWTGHQD 140
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH ++ L G DMR + AW GEY+T ++T+E+V++I++ KPLFLY
Sbjct: 141 YYDHTAEEE-----RLWGLDMRHGMKPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLY 195
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+AH A H+GN L AP +T+++ +I + NRR YAAMV +LD SVG +ISAL+ ML
Sbjct: 196 VAHAAVHSGNKYNPLPAPDKTVDKLDHIQNYNRRRYAAMVSELDTSVGNIISALKDANML 255
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NSI+IF +DNG P NY N+ SN+P RGVKNTLWEGGV+ ++WS + + N
Sbjct: 256 SNSIVIFSTDNGGPA----NGLNY-NYASNWPMRGVKNTLWEGGVRGVGVVWSSKFKPNS 310
Query: 392 --RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS +MMHI DWLPTLY AAGGD L N+DG + W+SLL D L
Sbjct: 311 GGRVSNRMMHICDWLPTLYAAAGGDVRDLGENLDGYNMWNSLL-----------DDL--- 356
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
PS RN +L NID+ +A+ + WKLV GT NG D ++G T N N
Sbjct: 357 -------PSPRNEILHNIDDIFGNSALTMSQWKLVKGTTYNGFYDSWFGPTGRNPGYFYN 409
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN----G 565
++ S + ++L L +P+ N + +R R++ATI CG + + S +N
Sbjct: 410 IGQVLSSLSNKALSTLQT---VPL-NPESIRRIREEATIDCGNHQLISSESRFSNCNPLK 465
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
LFN+ DPCE +N+A S P + ++ E L Y T V + DL + +PK +
Sbjct: 466 ESCLFNVDEDPCEIDNVAKSFPQVVKEMEERLLYWNSTAVKPAKVINDL-RGNPKFWGYV 524
Query: 626 WS 627
W+
Sbjct: 525 WT 526
Score = 78.2 bits (191), Expect = 1e-11, Method: Compositional matrix adjust.
Identities = 35/80 (43%), Positives = 54/80 (67%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR + AW GEY+T ++T+E+V++I++ KPLFLY+AH A H+GN L AP +
Sbjct: 156 MRHGMKPAWYLHGEYSTHVYTRESVKIIKNYNSTKPLFLYVAHAAVHSGNKYNPLPAPDK 215
Query: 61 TINQFQYITDPNRRTYAALT 80
T+++ +I + NRR YAA+
Sbjct: 216 TVDKLDHIQNYNRRRYAAMV 235
>gi|340727296|ref|XP_003401982.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 579
Score = 451 bits (1161), Expect = e-124, Method: Compositional matrix adjust.
Identities = 245/544 (45%), Positives = 329/544 (60%), Gaps = 46/544 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+IPTPNIDAL YNGIILN Y P TPSR + TG+YPI GMQG I
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL + LPE+LR LGY+TK IGKWH+G++ +YTPL+RGF++ G+ N IS
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S+Q ++G+DM R A+ EYATD+FT EA+ +IE+ +++PL+L
Sbjct: 162 YYDYNYSNQ-----NMSGYDMHRGDDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQ 216
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L+HLA H+ LE P N + +I +PNRR YA MV KLD+SVG + AL KG
Sbjct: 217 LSHLAVHS-----PLEQPANVYNDREPIHIREPNRRKYAKMVSKLDESVGRIFHALGEKG 271
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML++S+I+F++DNGA +V YRN+GSNYP RG K TL+EGGV+ A LWS ++Q+
Sbjct: 272 MLKDSLILFLTDNGAASV-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLQR 326
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS +++HI+DWLPTLY+AAGGD L IDG+DQW
Sbjct: 327 AARVSNELVHITDWLPTLYSAAGGDLKDL----------------------GEIDGIDQW 364
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L R+ +L+NIDE +T +KL+ GT E+G DGYYG + L
Sbjct: 365 RLLSEGRGRGRDKLLLNIDEVLKTEGAIYSRFKLLRGTIESGYYDGYYGDSGKTLETLPT 424
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
+ IV T + L+ ++ P++ M RQ AT+ C AN +PC
Sbjct: 425 YTDIVMKSTVS--ESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLTPCNVTE 482
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ NDPCE NIA + I+ L L+ + + LV Q+ + D ADP+ N TW
Sbjct: 483 C-LFDIVNDPCETRNIADTYVRIARDLNYYLEQYSQILVKQTKKPIDW-SADPRNKNGTW 540
Query: 627 SPWI 630
PW+
Sbjct: 541 GPWL 544
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 48/82 (58%), Gaps = 7/82 (8%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R A+ EYATD+FT EA+ +IE+ +++PL+L L+HLA H+ LE P
Sbjct: 177 MHRGDDPAYGMNREYATDMFTSEAINIIENHELNRPLYLQLSHLAVHS-----PLEQPAN 231
Query: 61 TINQFQ--YITDPNRRTYAALT 80
N + +I +PNRR YA +
Sbjct: 232 VYNDREPIHIREPNRRKYAKMV 253
>gi|383853606|ref|XP_003702313.1| PREDICTED: arylsulfatase J-like [Megachile rotundata]
Length = 544
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 246/555 (44%), Positives = 335/555 (60%), Gaps = 44/555 (7%)
Query: 73 RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
R AA L GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR
Sbjct: 18 RHVTAAERPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSR 77
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
++LMTGKYPIHTGMQ + AEPRG+PL E+ LPEYL+ELGY T +GKWHLGF+ ++Y
Sbjct: 78 SALMTGKYPIHTGMQHGVLKCAEPRGLPLQEKLLPEYLKELGYRTHIVGKWHLGFYTKQY 137
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
TP YRGF+SH G+ +G Y+DH + G DMRR + AWD G+Y+TD+FT
Sbjct: 138 TPTYRGFDSHIGFWSGHQDYFDHTAVES-----PYWGLDMRRGMEAAWDLHGQYSTDVFT 192
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
EAV+LI + KPLFLYLAH A H+GN L AP + +F I D NRR +AAM+
Sbjct: 193 SEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDVDVAKFTNIFDYNRRRFAAMLS 252
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
KLD+SVG V+ ALQRKGML+NSII+F +DNG P + N SN+P RG KNTLW
Sbjct: 253 KLDESVGEVVEALQRKGMLKNSIIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLW 307
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGGV+ ++WSP++ + RVS Q+ HISDWLPTL TAAGGD S L+IDGL
Sbjct: 308 EGGVRGAGLIWSPRLIRPGRVSRQLFHISDWLPTLLTAAGGDPSN--LSIDGL------- 358
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
D W +L +T S R SVL NID+ + + + WK + G+ G
Sbjct: 359 --------------DLWHALSEDTESPRQSVLHNIDDIFGVSGITIGEWKYIQGSTYEGQ 404
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
DG+YG S + + + ++ S T +++ + ++ + + +R R+QA I C
Sbjct: 405 WDGWYGP--SGREWVYDIGGVIGSPTGRAVASVGYSL-----SSENVRRLREQAMIKCRP 457
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQ 611
+ PC LF++ DPCE NN+ + P+I+ + E L K++ L+P +
Sbjct: 458 RNDTLPLCKPLIAPC-LFHVLKDPCEDNNLINEYPEIARAMREDLKKWNATALLPGN--L 514
Query: 612 PDLVQADPKRFNDTW 626
P +A+P+ ++ TW
Sbjct: 515 PWDKRANPELWDHTW 529
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 41/79 (51%), Positives = 52/79 (65%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR + AWD G+Y+TD+FT EAV+LI + KPLFLYLAH A H+GN L AP
Sbjct: 172 MRRGMEAAWDLHGQYSTDVFTSEAVKLINNHNDSKPLFLYLAHAAVHSGNPYDPLPAPDV 231
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I D NRR +AA+
Sbjct: 232 DVAKFTNIFDYNRRRFAAM 250
>gi|383847821|ref|XP_003699551.1| PREDICTED: arylsulfatase B-like [Megachile rotundata]
Length = 575
Score = 449 bits (1155), Expect = e-123, Method: Compositional matrix adjust.
Identities = 247/548 (45%), Positives = 337/548 (61%), Gaps = 56/548 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDAL YNGIILN Y P TPSR++ +TG YPI GMQG I
Sbjct: 38 GWNDVGFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRSAFLTGLYPIRIGMQGDGI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL + LPE+LR+LGY+TK IGKWH+G+ +YTPL+RGF+ G+ N IS
Sbjct: 98 RGGEPRGLPLDIKILPEHLRDLGYTTKLIGKWHMGYHTPQYTPLHRGFDFFLGFYNSHIS 157
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S+Q ++G+D+ R A EYATDLFTKEAV++IE + +PL+L
Sbjct: 158 YYDYHYSNQ-----NMSGYDLHRGDDPAHGINREYATDLFTKEAVRMIETHELPRPLYLQ 212
Query: 272 LAHLAAHAGNAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
++HLA HA LE P++ N +F YI +PNRR YA MV +LDDSVG ++ AL KG
Sbjct: 213 ISHLAVHAP-----LEQPRDEYNDGRFSYIREPNRRKYARMVSRLDDSVGRIVHALGEKG 267
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML +S+I+F++DNGA +V YRNWGSNYP RG K TL+EGGV+ A LWSP++Q+
Sbjct: 268 MLRDSLILFLTDNGAASV-----GRYRNWGSNYPLRGTKYTLYEGGVRSVAALWSPRLQK 322
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS Q++HI+DWLPTLY+AAGGD L IDG+D W
Sbjct: 323 VARVSNQLIHITDWLPTLYSAAGGDLKDL----------------------GQIDGIDLW 360
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
L R+ +L+NIDE ++T +KL+ G+ ENG DG+YG + R+ ++P
Sbjct: 361 RVLSEGYGRGRDKLLLNIDEIQKTEGAIYSRFKLLRGSVENGYYDGFYGDSGRTLEIPPY 420
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP------SPC 562
+ ++ S QS ++ ++ P++ M R+ AT+ C MT + C
Sbjct: 421 T-DVVLRSSVSQS---ITYHLGGPVTQPSTMIQLRRDATVQC----QNMTYHYRHAFTTC 472
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
C LF++ NDPCE N+A + P I+ L L+++ L+ Q D + ADP+R
Sbjct: 473 NVTEC-LFDIVNDPCETKNVADAYPRIARDLDLYLEHYGHILMRQIKHPVDWL-ADPRRR 530
Query: 623 NDTWSPWI 630
N+TW PW+
Sbjct: 531 NNTWEPWM 538
Score = 67.8 bits (164), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 35/69 (50%), Positives = 47/69 (68%), Gaps = 7/69 (10%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN--QFQYITDP 71
EYATDLFTKEAV++IE + +PL+L ++HLA HA LE P++ N +F YI +P
Sbjct: 186 EYATDLFTKEAVRMIETHELPRPLYLQISHLAVHAP-----LEQPRDEYNDGRFSYIREP 240
Query: 72 NRRTYAALT 80
NRR YA +
Sbjct: 241 NRRKYARMV 249
>gi|340710385|ref|XP_003393772.1| PREDICTED: arylsulfatase J-like [Bombus terrestris]
Length = 545
Score = 448 bits (1153), Expect = e-123, Method: Compositional matrix adjust.
Identities = 237/551 (43%), Positives = 333/551 (60%), Gaps = 42/551 (7%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA+ L GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR++LM
Sbjct: 23 AAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALM 82
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGK+PIHTGMQ + AEPRG+PL E+ LP+YLRELGYST +GKWHLGF+ +EYTP+Y
Sbjct: 83 TGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMY 142
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGF+SH G+ +G Y+DH + G DMRR L++AWD G+Y+TD+FTKEAV
Sbjct: 143 RGFDSHIGFWSGHHDYFDHSAVES-----PYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+LI D +P+FLYL+H A H+GN+ L AP + + +F I + RR +A M+ KLD+
Sbjct: 198 KLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQDVAKFTNIFNYERRRFAGMLSKLDE 257
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
SVG V+ AL++KGML+NS+I+F +DNG P + N SN+P RG KNTLWEGGV
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLWEGGV 312
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
+ +LWSP++ + RVS Q+ HISDWLPTL TAAGG+ S L
Sbjct: 313 RGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNL------------------ 354
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+IDG+D W +L +T S R VL NID+ A + WK + G+ NG DG+
Sbjct: 355 -----SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGW 409
Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
YG S + + + ++ S +++ + + + D +R R+ A I C
Sbjct: 410 YGP--SGREWVYDIGGVIGSTAGRAVASVGLAL-----SADAIRRLREDAMIKCPPKNDS 462
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
+ PC LFN+ DPCE NN+ + P + ++L + L+ T + + D +
Sbjct: 463 LPLCKPLEAPC-LFNVQQDPCEDNNLINKYPSLVTELQDKLRKLNATAILPGNLPWD-NK 520
Query: 617 ADPKRFNDTWS 627
A+P ++ TW+
Sbjct: 521 ANPDYWDHTWT 531
Score = 85.9 bits (211), Expect = 7e-14, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 55/79 (69%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR L++AWD G+Y+TD+FTKEAV+LI D +P+FLYL+H A H+GN+ L AP +
Sbjct: 173 MRRGLNSAWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLSHAAVHSGNSYNPLPAPDQ 232
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I + RR +A +
Sbjct: 233 DVAKFTNIFNYERRRFAGM 251
>gi|350415537|ref|XP_003490674.1| PREDICTED: arylsulfatase J-like [Bombus impatiens]
Length = 545
Score = 446 bits (1146), Expect = e-122, Method: Compositional matrix adjust.
Identities = 236/551 (42%), Positives = 331/551 (60%), Gaps = 42/551 (7%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA+ L GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR++LM
Sbjct: 23 AAVQPHIVFILADDLGWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVTPICTPSRSALM 82
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGK+PIHTGMQ + AEPRG+PL E+ LP+YLRELGYST +GKWHLGF+ +EYTP+Y
Sbjct: 83 TGKHPIHTGMQHGVLKCAEPRGLPLHEKLLPQYLRELGYSTHIVGKWHLGFYTKEYTPMY 142
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGF+SH G+ +G Y+DH + G DMRR L++AWD G+Y+TD+FTKEAV
Sbjct: 143 RGFDSHIGFWSGHHDYFDHSAVES-----PYWGLDMRRGLNSAWDLHGQYSTDIFTKEAV 197
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+LI D +P+FLYL H A H+GN+ L P + + +F I + RR +A M+ KLD+
Sbjct: 198 KLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQDVAKFTNIFNYERRRFAGMLSKLDE 257
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
SVG V+ AL++KGML+NS+I+F +DNG P + N SN+P RG KNTLWEGGV
Sbjct: 258 SVGQVVEALRKKGMLKNSVIVFSTDNGGPPAGFN-----LNAASNWPLRGTKNTLWEGGV 312
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
+ +LWSP++ + RVS Q+ HISDWLPTL TAAGG+ S L
Sbjct: 313 RGAGLLWSPRLVRPGRVSRQLFHISDWLPTLITAAGGNPSNL------------------ 354
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+IDG+D W +L +T S R VL NID+ A + WK + G+ NG DG+
Sbjct: 355 -----SIDGMDLWYALSEDTESPRKMVLHNIDDIFGVAGITYGDWKYIQGSTYNGQWDGW 409
Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
YG S + + + ++ S +++ + + + D +R R+ A I C
Sbjct: 410 YGP--SGREWVYDIGGVIGSTAGRAVASVGLTL-----SADAIRRLREDAMIKCPPKNDS 462
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
+ PC LFN+ DPCE NN+ + P + ++L + L+ T + + D +
Sbjct: 463 LPLCKPLEAPC-LFNVQQDPCEDNNLINKYPSLVTELRDRLRKLNATAILPGNLPWD-NR 520
Query: 617 ADPKRFNDTWS 627
A+P ++ TW+
Sbjct: 521 ANPDYWDHTWT 531
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 37/79 (46%), Positives = 53/79 (67%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR L++AWD G+Y+TD+FTKEAV+LI D +P+FLYL H A H+GN+ L P +
Sbjct: 173 MRRGLNSAWDLHGQYSTDIFTKEAVKLINDHNASRPMFLYLPHAAVHSGNSYNPLPVPDQ 232
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I + RR +A +
Sbjct: 233 DVAKFTNIFNYERRRFAGM 251
>gi|350422929|ref|XP_003493332.1| PREDICTED: arylsulfatase B-like [Bombus impatiens]
Length = 581
Score = 442 bits (1137), Expect = e-121, Method: Compositional matrix adjust.
Identities = 251/544 (46%), Positives = 333/544 (61%), Gaps = 44/544 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+IPTPNIDAL YNGIILN Y P TPSR + TG+YPI GMQG I
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRIGMQGADI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL + LPE+LR LGY+TK IGKWH+G++ +YTPL+RGF++ G+ N IS
Sbjct: 102 RGGEPRGLPLNIKILPEHLRGLGYTTKLIGKWHMGYYTPQYTPLHRGFDTFLGFYNSYIS 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S+Q ++G+DM R A+ EYATD+FT+EA+ +IE+ +++PL+L
Sbjct: 162 YYDYSYSNQ-----NMSGYDMHRGDDPAYGMNREYATDMFTREAINIIENHELNRPLYLQ 216
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L+HLA HA LE P N + +I +PNRR YA MV KLD+SVG + AL KG
Sbjct: 217 LSHLAVHA-----PLEQPMNVYNDREPIHIREPNRRKYAKMVSKLDESVGRIFHALGEKG 271
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML++S+I+F++DNGA +V YRN+GSNYP RG K TL+EGGV+ A LWS ++Q+
Sbjct: 272 MLKDSLILFLTDNGAASV-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLQK 326
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS +++HI+DWLPTLY+AAGGD L IDG+DQW LL+ S R
Sbjct: 327 AARVSNELVHITDWLPTLYSAAGGDLKDLG-KIDGIDQWR--LLSEGSGRGRG------- 376
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
R+ +L+NIDE +T +KL+ GT ENG DGYYG + L
Sbjct: 377 ----------RDKLLLNIDEVLKTEGAIYSRFKLLRGTNENGYYDGYYGDSGKTLETLPT 426
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
+ IV T + L+ ++ P++ M RQ AT+ C AN +PC
Sbjct: 427 YTDIVMKSTVS--ESLTYHLGGPVTQPSTMVRLRQDATVQCNANFTYYRRQMLTPCNVTE 484
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ NDPCE NIA + I+ L L+ + LV Q + D ADP+ NDTW
Sbjct: 485 C-LFDIVNDPCETRNIAETYVRIARDLNYFLEEYSHILVKQMKKPIDW-SADPRNRNDTW 542
Query: 627 SPWI 630
PW+
Sbjct: 543 GPWV 546
Score = 63.9 bits (154), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 7/82 (8%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R A+ EYATD+FT+EA+ +IE+ +++PL+L L+HLA HA LE P
Sbjct: 177 MHRGDDPAYGMNREYATDMFTREAINIIENHELNRPLYLQLSHLAVHA-----PLEQPMN 231
Query: 61 TINQFQ--YITDPNRRTYAALT 80
N + +I +PNRR YA +
Sbjct: 232 VYNDREPIHIREPNRRKYAKMV 253
>gi|242025556|ref|XP_002433190.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212518731|gb|EEB20452.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 570
Score = 440 bits (1132), Expect = e-120, Method: Compositional matrix adjust.
Identities = 245/540 (45%), Positives = 328/540 (60%), Gaps = 38/540 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+I TPNIDALAYNGIILN+ Y +CTPSRASLMTGKYP GMQ I
Sbjct: 58 GWNDVSFHGSNQIQTPNIDALAYNGIILNSHYVPALCTPSRASLMTGKYPTSLGMQHLVI 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL E +PEY + GY+T A+GKWHLGFF++EYTP+YRGF+SHFG+ NG
Sbjct: 118 LSPEPWGLPLNETLMPEYFNKNGYATHAVGKWHLGFFKKEYTPIYRGFDSHFGHWNGFQD 177
Query: 212 YYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLF 269
YYDH +SD L G+DMRRN + G Y TD+FTKEA+++I++ K PLF
Sbjct: 178 YYDHTTMSDS------LKGYDMRRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLF 231
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL+HLA H+GN +AP++ I++ + I DP R+ YAAMV KLD+SVG V+SAL++
Sbjct: 232 LYLSHLAPHSGNPDNPFQAPEDEISKHECINDPGRKIYAAMVTKLDESVGQVVSALEKNK 291
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML NSIIIFMSDNGA T + N GSNYP RG+K + WEGGV+ A +WSP + +
Sbjct: 292 MLNNSIIIFMSDNGAATYGL-----HSNRGSNYPLRGLKESPWEGGVRGTAAIWSPFLNK 346
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS Q+MH+SDWLPTL TAA GL+ S+ L+N IDG+D W
Sbjct: 347 TKRVSKQLMHMSDWLPTLLTAA------------GLNYSSTQLINK-------IDGIDMW 387
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ L + PS R V N DE + +++ +DSWK V GT + G D ++ + N
Sbjct: 388 NVLSNDLPSPRKEVFNNYDEIENYSSLMIDSWKYVEGTAQEGKADYWFEEPSRNNCSEYR 447
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
V ++ L++ S I + + TR T +T P C L
Sbjct: 448 ----VSNEDIFRLRRDSTIICDNPTFSSSLSITRNNHTDVKNKTKYVLTCDPLLKRFC-L 502
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FNL +DPCE+ N+A PD+ ++ L ++++V + +P+ ++P +N TW W
Sbjct: 503 FNLNDDPCERLNLADVFPDVVKRIKNRLLELKKSVV-KPLNKPEDPYSNPMFYNGTWVNW 561
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 37/81 (45%), Positives = 54/81 (66%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MRRN + G Y TD+FTKEA+++I++ K PLFLYL+HLA H+GN +AP+
Sbjct: 193 MRRNFEVDYSYQGMYTTDVFTKEAIKIIDNHNSQKGPLFLYLSHLAPHSGNPDNPFQAPE 252
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ I++ + I DP R+ YAA+
Sbjct: 253 DEISKHECINDPGRKIYAAMV 273
>gi|307215080|gb|EFN89887.1| Arylsulfatase B [Harpegnathos saltator]
Length = 593
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 237/548 (43%), Positives = 331/548 (60%), Gaps = 43/548 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+IPTPNIDAL YNGIILN Y P TPSRA+ TG YPI GMQG I
Sbjct: 42 GWNDVSFHGSNQIPTPNIDALGYNGIILNRHYVLPSSTPSRAAFFTGLYPIRIGMQGEGI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL R LPEYLR LGY+TK IGKWH+G+ ++TPL+RGF++ G+ N +S
Sbjct: 102 QGGEPRGLPLNIRILPEYLRGLGYTTKLIGKWHVGYHTPQHTPLHRGFDAFLGFYNSHVS 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S Q ++G+DM R + A+ EYATDLFT EA+++I+ +PL+L
Sbjct: 162 YYDYRYSYQ-----NMSGYDMHRGDNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQ 216
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
++HLA HA + +F++I + NRR YA+MV +LD+SVG ++ L +GML
Sbjct: 217 ISHLAVHAPIESPDDDHRNSNRERFKHIPEVNRRNYASMVTRLDESVGRIVHTLGNRGML 276
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+S+I+F++DNGAPT+ +RNWGSN P RG+K TL+EGGV+ A+LWSP++++
Sbjct: 277 RDSLILFLTDNGAPTI-----GKFRNWGSNLPLRGMKYTLYEGGVRGVAVLWSPRLRKAA 331
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV +++HI+DWLPT Y+AAGGD L IDG++QW
Sbjct: 332 RVCNELVHITDWLPTFYSAAGGDLKDL----------------------GEIDGVNQWRM 369
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF- 510
L P R S+L+NIDE RT S+KL+ G+ E G DGY+ + + +
Sbjct: 370 LNDGEPVARQSLLLNIDEVSRTEGAIYKSFKLLRGSIEGGHYDGYHTIREPSWIEHVEHK 429
Query: 511 -NAIVESKTY-----QSL--QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
N E +Y +SL Q ++ ++ P++ + RQ+AT+HC +N + + C
Sbjct: 430 KNEPTEVPSYTETVLKSLVSQSITYHLGGPVTQPSTINQLRQEATVHCRSNGSYNRFTTC 489
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
C LF++ NDPCE NIA P I+ +L L+ + R + Q D + ADP+R
Sbjct: 490 NVTEC-LFDINNDPCETKNIAQQYPRIARELDLFLEKYGRNVTRQMKVPVDWL-ADPRRT 547
Query: 623 NDTWSPWI 630
N TW PW+
Sbjct: 548 NGTWEPWM 555
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 46/80 (57%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R + A+ EYATDLFT EA+++I+ +PL+L ++HLA HA +
Sbjct: 177 MHRGDNPAYGLNAEYATDLFTDEAMKIIQRHEPPRPLYLQISHLAVHAPIESPDDDHRNS 236
Query: 61 TINQFQYITDPNRRTYAALT 80
+F++I + NRR YA++
Sbjct: 237 NRERFKHIPEVNRRNYASMV 256
>gi|380026538|ref|XP_003697007.1| PREDICTED: arylsulfatase J-like [Apis florea]
Length = 543
Score = 440 bits (1131), Expect = e-120, Method: Compositional matrix adjust.
Identities = 233/535 (43%), Positives = 322/535 (60%), Gaps = 42/535 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS +IPTPNIDALAY+G++L+ Y P+CTPSR++LMTGK+PIHTGMQ +
Sbjct: 36 GWNDVGFHGSGQIPTPNIDALAYSGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRG+PL E+ LPEY R+LGYST +GKWHLGF+ +EYTP+YRGF+SH G+ +G
Sbjct: 96 KCAEPRGLPLHEKLLPEYFRDLGYSTHIVGKWHLGFYTKEYTPMYRGFDSHIGFWSGHHD 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+DH ++ G DMRR L AWD G+Y+TD+FTKEAV+LI++ +P+FLY
Sbjct: 156 YFDHSAVEE-----PYWGLDMRRGLEPAWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLY 210
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H+GN L A + + +F I + NRR +A ++ KLD+SVG V+ AL+R+ ML
Sbjct: 211 LAHAAVHSGNPYNPLPAHDQDVAKFTNIFNYNRRRFAGILSKLDESVGLVVEALRRRKML 270
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
ENS+I+F +DNG + N SN+P RG KNTLWEGGV+ +LWSP++ +
Sbjct: 271 ENSVIVFSTDNGGAPAGFN-----LNAASNWPLRGTKNTLWEGGVRGVGLLWSPKLIKPG 325
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV Q++HISDWLPTL TAAGGD S L IDG+D W +
Sbjct: 326 RVGRQLIHISDWLPTLLTAAGGDPSNL-----------------------TIDGMDIWHA 362
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L +T S R VL NID+ A + WK + G+ NG DG+YG S + + +
Sbjct: 363 LSEDTESPRKMVLHNIDDIYGVAGITYGDWKFIQGSTYNGQWDGWYGP--SGREWVYDVG 420
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
++ S +++ + ++ + D +R R+ A I C + PC LFN
Sbjct: 421 GVIGSAAGRAVASVGLSL-----SADNIRKLREDAMIKCPPRNDNLPLCKPLEAPC-LFN 474
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+ DPCE NN+ P + S L E L+ + V + D +A+P ++ TW
Sbjct: 475 VRQDPCEDNNLIEKYPSVVSDLKERLRVLNASAVLPGNLPWD-KRANPDYWDHTW 528
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 53/79 (67%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR L AWD G+Y+TD+FTKEAV+LI++ +P+FLYLAH A H+GN L A +
Sbjct: 171 MRRGLEPAWDLHGQYSTDVFTKEAVKLIDNHNTSRPMFLYLAHAAVHSGNPYNPLPAHDQ 230
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I + NRR +A +
Sbjct: 231 DVAKFTNIFNYNRRRFAGI 249
>gi|156547171|ref|XP_001603886.1| PREDICTED: arylsulfatase B [Nasonia vitripennis]
Length = 581
Score = 438 bits (1127), Expect = e-120, Method: Compositional matrix adjust.
Identities = 245/547 (44%), Positives = 331/547 (60%), Gaps = 45/547 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG+NEIPTPNIDALAYNG+ILN Y P+CTPSR++LMTG+YPI GMQG P+
Sbjct: 46 GWNDVSFHGANEIPTPNIDALAYNGVILNKYYTMPICTPSRSALMTGRYPIRDGMQGTPM 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRG+PL +PE +R LGY T+ +GKWHLG+ +YTP+ RGF++ FGY NG IS
Sbjct: 106 RPAEPRGIPLNVSLMPEQMRRLGYETRLVGKWHLGYTTEDYTPVRRGFDTFFGYYNGFIS 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDKPLF 269
YYD+ + ++ T E+ G+D+ R+ S +++ EY TDL T EA ++I + KPLF
Sbjct: 166 YYDYWIG--WNDTNEVTGYDLHRDESDSFELAHSSEYFTDLITDEAEKIIRNNKNAKPLF 223
Query: 270 LYLAHLAAHAGNAGKHLEAPQET-----IN-QFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
L ++HLA HAG+ K + P E +N F YI D R YA M+ LD+SVG V+
Sbjct: 224 LEISHLAVHAGS--KVHDDPLEVRRTDDVNASFPYIEDYQHRKYAGMMAALDESVGRVVK 281
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
AL+ MLENSIIIFMSDNGAPTV Y N GSNYP RG+K ++EG + A ++
Sbjct: 282 ALKEAEMLENSIIIFMSDNGAPTVGL-----YNNTGSNYPMRGIKGGMFEGAARAAACIF 336
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
SP I+ + RVS ++MHI DWLPTLYTAAGG+ L DG +L L+
Sbjct: 337 SPLIKAHSRVSEELMHIVDWLPTLYTAAGGNPMDLQSQFDG-----ALPLD--------- 382
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
G+ QWSS++ PS R S+L+NIDE + A + KLV G + DGYYG + ++
Sbjct: 383 -GVSQWSSIVAGGPSSRQSLLVNIDEAQGFEAAIIGRHKLVKGMTKE---DGYYGNSGND 438
Query: 504 -KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
P N ++ S S+ +L+ + + RQ++ I C P T +
Sbjct: 439 PSFPAYNVKKVLSSTAGASIGKLAG---FASPSARRALWLRQKSVITC----KPFTSAAN 491
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
+G C LF+L DPCE +++S P I +L L +RR L+PQ++ D PK F
Sbjct: 492 CSGTC-LFDLSKDPCETRDLSSKLPLIVKKLESFLGEYRRVLMPQTNSPQDACGL-PKYF 549
Query: 623 NDTWSPW 629
N + PW
Sbjct: 550 NGVYMPW 556
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 41/75 (54%), Gaps = 8/75 (10%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-----IN-QFQY 67
EY TDL T EA ++I + KPLFL ++HLA HAG+ K + P E +N F Y
Sbjct: 199 EYFTDLITDEAEKIIRNNKNAKPLFLEISHLAVHAGS--KVHDDPLEVRRTDDVNASFPY 256
Query: 68 ITDPNRRTYAALTKS 82
I D R YA + +
Sbjct: 257 IEDYQHRKYAGMMAA 271
>gi|357612332|gb|EHJ67925.1| hypothetical protein KGM_21236 [Danaus plexippus]
Length = 563
Score = 438 bits (1126), Expect = e-120, Method: Compositional matrix adjust.
Identities = 233/529 (44%), Positives = 322/529 (60%), Gaps = 43/529 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNID +A++G+ L+N Y P+CTPSRA+LMTGKYPIHTGMQ I
Sbjct: 60 GWNDVGFHGSNQIPTPNIDIMAWSGVSLHNYYVTPICTPSRAALMTGKYPIHTGMQHTVI 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PLTE+ LP+YL+ELGY T +GKWHLG +++EY PL RGF+SH G+ NG I
Sbjct: 120 FAAEPRGLPLTEKILPQYLKELGYKTHLVGKWHLGSYKKEYLPLNRGFDSHLGFWNGKID 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YDH ++ G D RR+ STA D G+YATD++T EAV++I+ PLFL
Sbjct: 180 MYDHTNQEK-----GYWGFDFRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLM 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+H A H GN + + AP++ F +I D RR +AA++ KLD+SVG V++AL+ KG+L
Sbjct: 235 LSHSAVHTGNPSEPIRAPEKLFVNFTHIQDFQRRKFAAVLTKLDESVGEVVAALKAKGVL 294
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+SI++F +DNG + + N SNYP RGVKNTLWEGGV+ LWSP I +
Sbjct: 295 NDSIVVFTTDNGGAAAGFND-----NAASNYPLRGVKNTLWEGGVRGAGWLWSPFIDKRS 349
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV+ Q MH+ DWLPTL +AAG + S + +IDG+DQW +
Sbjct: 350 RVATQRMHLVDWLPTLLSAAGMNVSSIK-HIDGVDQW---------------------CA 387
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L + PS R S++ NID++ +A++ WK+ GT G+ DG+YG + + +
Sbjct: 388 LSQDLPSARESLVHNIDDESGSASITYKQWKVHKGTNYGGSWDGWYGP--AGREGAYDTT 445
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
++ SK +L + LP + + + R +AT+ CG A P PC LFN
Sbjct: 446 RLLASKAAGALLDIG---MLP--DTEHILRLRSEATVECGDREA--LPCRPLEAPC-LFN 497
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
+ DPCE N+A PD+ + + L RT VP ++ QP DPK
Sbjct: 498 IDEDPCETRNLADIHPDVLQVMLKELDRVNRTAVPPNN-QPLTPGGDPK 545
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 52/82 (63%), Gaps = 1/82 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+ STA D G+YATD++T EAV++I+ PLFL L+H A H GN + + AP++
Sbjct: 195 FRRDFSTAHDLFGQYATDVYTNEAVKIIKSHNTSSPLFLMLSHSAVHTGNPSEPIRAPEK 254
Query: 61 TINQFQYITDPNRRTYAA-LTK 81
F +I D RR +AA LTK
Sbjct: 255 LFVNFTHIQDFQRRKFAAVLTK 276
>gi|170040779|ref|XP_001848165.1| arylsulfatase B [Culex quinquefasciatus]
gi|167864376|gb|EDS27759.1| arylsulfatase B [Culex quinquefasciatus]
Length = 585
Score = 436 bits (1122), Expect = e-119, Method: Compositional matrix adjust.
Identities = 241/541 (44%), Positives = 321/541 (59%), Gaps = 43/541 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS +IPTPNIDALAY+GIILN YA P+CTPSRASLMTGK+PI+ GMQ I
Sbjct: 42 GWNDVSFHGSLQIPTPNIDALAYSGIILNRHYAPPLCTPSRASLMTGKHPINIGMQHHVI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +PEY RE GY T+ +GKWHLGFFR+ YTP RGF+SH+GY+ I
Sbjct: 102 EVDEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTMRGFDSHYGYIGPYID 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED--QPVDKPLF 269
Y+DH L Q S T G DMRRNL + G YATDLFT+EAV+LI D Q PLF
Sbjct: 162 YWDHSL--QMSNT-STRGLDMRRNLQVDYSARGTYATDLFTREAVRLIHDHNQTSANPLF 218
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L + HLA H GN ++AP+E + F +I DP RR AAM+ KLDD VG ++ AL++ G
Sbjct: 219 LVVTHLAPHTGNEDDPMQAPEEDVELFSFIKDPKRRVLAAMISKLDDGVGQIVQALKKSG 278
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML N++++F +DNGAPT+ + N GSN+P RG K + WEG V+ A +WSP +
Sbjct: 279 MLNNTVVLFYADNGAPTI-----GKHSNGGSNFPLRGQKYSPWEGAVRTAAAIWSPLLNN 333
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS Q +H+SDWLPTL AAG + + IDG DQW +L N N+ G
Sbjct: 334 TGRVSNQWIHVSDWLPTLARAAGIEANFSGSEIDGQDQWEAL-------NNPNLTG---- 382
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
R +V+ NID+ + ++ + WKLV GT G D + G
Sbjct: 383 ----------RTTVMHNIDQFQHYSSYTKNGWKLVNGTSWEGAFDDWMGSLDEED----- 427
Query: 510 FNAIVESKTYQSLQQLSQNI-FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
S+ S++ +S + L ++D + R+ AT+HC P PC
Sbjct: 428 ----ELSEEEYSVRLISSVVGRLTKVDLDDVARIRRDATVHCPYIPGGTIDCEPQKSPC- 482
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ DPCE NNIA+ RPD+ QL + ++ +R + + QP +DP+ +N TW+
Sbjct: 483 LFNILEDPCETNNIANQRPDLLKQLGQDVERYRLSAT-EPRNQPADPYSDPRYYNRTWTW 541
Query: 629 W 629
W
Sbjct: 542 W 542
Score = 83.2 bits (204), Expect = 4e-13, Method: Compositional matrix adjust.
Identities = 41/81 (50%), Positives = 51/81 (62%), Gaps = 2/81 (2%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED--QPVDKPLFLYLAHLAAHAGNAGKHLEAP 58
MRRNL + G YATDLFT+EAV+LI D Q PLFL + HLA H GN ++AP
Sbjct: 179 MRRNLQVDYSARGTYATDLFTREAVRLIHDHNQTSANPLFLVVTHLAPHTGNEDDPMQAP 238
Query: 59 QETINQFQYITDPNRRTYAAL 79
+E + F +I DP RR AA+
Sbjct: 239 EEDVELFSFIKDPKRRVLAAM 259
>gi|328789569|ref|XP_624454.2| PREDICTED: arylsulfatase J-like [Apis mellifera]
Length = 546
Score = 433 bits (1114), Expect = e-118, Method: Compositional matrix adjust.
Identities = 235/536 (43%), Positives = 319/536 (59%), Gaps = 42/536 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHG ++IPTPNIDALAY G++L+ Y P+CTPSR++LMTGK+PIHTGMQ +
Sbjct: 39 GWNDVGFHGLSQIPTPNIDALAYTGLLLDRYYVSPICTPSRSALMTGKHPIHTGMQHGVL 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRG+PL E+ LPEYLR LGYST +GKWHLGF+ +EYTP YRGF+SH G+ +G
Sbjct: 99 KCAEPRGLPLQEKLLPEYLRNLGYSTHMVGKWHLGFYTKEYTPTYRGFDSHLGFWSGHHD 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+DH ++ G DMRR L AWD G+Y+TD+FTKEAV+LI++ +P+FLY
Sbjct: 159 YFDHTAVEE-----PYWGLDMRRGLEPAWDLHGQYSTDVFTKEAVRLIDNHNTSRPMFLY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+H A H+GN L A + +F I D NRR +A M+ KLD+SVG V+ AL+++ ML
Sbjct: 214 LSHAAVHSGNPYNPLPAHDHDVAKFPKILDYNRRRFAGMLSKLDESVGLVVEALRKRKML 273
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
ENS+I+F +DNG P + N SN+P RG KNTLWEGGV+ ++WSP++
Sbjct: 274 ENSVIVFSTDNGGPPAGFN-----LNAASNFPLRGTKNTLWEGGVRGTGLVWSPKLINPG 328
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R+S Q++HISDWLPTL TA GGD S L IDG+D W +
Sbjct: 329 RISRQLIHISDWLPTLLTAVGGDPSNLA-----------------------IDGVDMWRA 365
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L +T S R VL NID+ A + WK + G+ NG D +YG S + +
Sbjct: 366 LNEDTISPRKMVLHNIDDIYGIAGITYGDWKFIQGSTYNGQWDDWYGP--SGRDWAYDPY 423
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
A++ES ++ S + L NI+K+ +Q A I C + PC LFN
Sbjct: 424 AVMESAAGRA--AASVGLSLTAGNIEKL---QQGAAIKCSTRNENLPLCKPLEAPC-LFN 477
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+ DPCE+NN+ P I S L E L + + + D + +P ++ TW+
Sbjct: 478 VREDPCEKNNLIEKYPSIVSDLKEKLSLLNASAILPGNLPWD-KRGNPDYWDHTWT 532
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 38/79 (48%), Positives = 52/79 (65%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR L AWD G+Y+TD+FTKEAV+LI++ +P+FLYL+H A H+GN L A
Sbjct: 174 MRRGLEPAWDLHGQYSTDVFTKEAVRLIDNHNTSRPMFLYLSHAAVHSGNPYNPLPAHDH 233
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F I D NRR +A +
Sbjct: 234 DVAKFPKILDYNRRRFAGM 252
>gi|321470034|gb|EFX81012.1| hypothetical protein DAPPUDRAFT_303738 [Daphnia pulex]
Length = 557
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 239/560 (42%), Positives = 328/560 (58%), Gaps = 63/560 (11%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GWND+SFHGS +IPTPN+DALA++G+IL N Y P+CTPSR++LMTGK+PIHTGMQ
Sbjct: 36 FGWNDVSFHGSKQIPTPNLDALAFSGLILQNYYVTPLCTPSRSALMTGKHPIHTGMQHDV 95
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++G G+PL+E LPEYL++LGY +GKWHLG ++ YTPL+RGF+SH+GY G
Sbjct: 96 LYGYSRYGLPLSEITLPEYLKDLGYKNHIVGKWHLGHYKSVYTPLFRGFDSHYGYWTGHQ 155
Query: 211 SYYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YYDH VE N G+DMRRN S W G+Y T L T EA +I V PL
Sbjct: 156 DYYDHT-------AVEWNAWGYDMRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVSSPL 208
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY+AHLA H+ N L+AP+ET+ F I + RR YAAMV +LD SVG ++ +L
Sbjct: 209 FLYVAHLAVHSANPYSPLQAPEETVEMFSSIENLQRRRYAAMVHELDVSVGKIVKSLGDN 268
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
MLEN++I+F +DNG P + + N SN+P +GVKNT WEGGV+ ++WSP I
Sbjct: 269 NMLENTVIVFSTDNGGPAEGFNQ-----NAASNWPLKGVKNTPWEGGVRAAGLIWSPLIP 323
Query: 389 QNPR--VSLQMMHISDWLPTLYTAAG-----GDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
++ R V +M ISDWLPTL+ AAG GD + L +DG+ W
Sbjct: 324 KSRRGQVMSNLMDISDWLPTLFEAAGQSRICGDVATLK-GLDGVSHW------------- 369
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT- 500
+++L + PS RN VL NID++ AA+R ++WKLV GT G+ DG+YG +
Sbjct: 370 --------NTVLYDKPSARNHVLHNIDDQLGYAAIRKENWKLVKGTTYEGSWDGWYGPSG 421
Query: 501 RSNK-----------VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
R+N+ V L+ NA+++S Q + L +S + A I
Sbjct: 422 RTNESSTNFINSVDDVSLIYKNAVLDSDAAQYIAMLG------MSATQNLYQQLLDAEIS 475
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CG P P + C LF++ DPCE NN+A P I+ L LL ++ T V +
Sbjct: 476 CGERPVDAHPCLPLDALC-LFDIEQDPCEYNNLAEKMPHITEDLLSLLSWYNSTAVAPLN 534
Query: 610 EQPDLVQADPKRFNDTWSPW 629
PD + ++PK +N T++ W
Sbjct: 535 TSPDPM-SNPKYWNYTYTNW 553
Score = 80.1 bits (196), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 38/87 (43%), Positives = 50/87 (57%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN S W G+Y T L T EA +I V PLFLY+AHLA H+ N L+AP+E
Sbjct: 172 MRRNHSVDWSAYGKYTTTLLTDEACDVITKHDVSSPLFLYVAHLAVHSANPYSPLQAPEE 231
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTL 87
T+ F I + RR YAA+ +++
Sbjct: 232 TVEMFSSIENLQRRRYAAMVHELDVSV 258
>gi|328788250|ref|XP_624148.3| PREDICTED: arylsulfatase B-like isoform 2 [Apis mellifera]
Length = 564
Score = 432 bits (1111), Expect = e-118, Method: Compositional matrix adjust.
Identities = 239/544 (43%), Positives = 329/544 (60%), Gaps = 48/544 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAYNGIILN Y P TPSR + TG YPI GMQG I
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL + LPE+LR LGY TK IGKWH+GF +YTPL+RGF++ FG+ N I+
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYVTKLIGKWHMGFHTLQYTPLHRGFDTFFGFYNSHIT 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S+Q + G+DM A+ EYATDLFT EA+++IE+ + +PL+L
Sbjct: 162 YYDYEYSNQ-----NMTGYDMHCGDDPAYGMKREYATDLFTNEAIKIIENHELPRPLYLQ 216
Query: 272 LAHLAAHAGNAGKHLEAPQETI-NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
++HLA HA +E P ++ ++ I +PNRR YA MV KLD+SVG V+ AL KGM
Sbjct: 217 ISHLAVHAP-----IEQPDDSSRDEIVQIREPNRRKYAKMVSKLDESVGRVVHALGEKGM 271
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L +S+I+F++DNGA ++ YRN+GSNYP RG K TL+EGGV+ A LWS ++++
Sbjct: 272 LRDSLILFLTDNGAASI-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKG 326
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RV +++HI+DWLPTLY+AAGGD L IDG+DQW
Sbjct: 327 ARVFKKLIHITDWLPTLYSAAGGDLKDL----------------------GKIDGIDQWR 364
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLN 509
L R +L+NIDE T +KL+ G+ NG D YYG + R+ + P
Sbjct: 365 VLSEGQGHGREKLLLNIDEVMITEGAIYSRFKLLRGSVGNGYYDKYYGDSGRTLETPPYT 424
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
+++S QS ++ ++ P++ M R++AT+ C N + + + C
Sbjct: 425 -EVVLKSAVSQS---ITYHLGGPVTQPSTMVQLRREATVQCHPNMSYYYRHSFTFCNVTE 480
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ NDPCE NIA + I+ L L+++ R L+ Q + D + ADPKR N+TW
Sbjct: 481 C-LFDIVNDPCETKNIAEAYARIARDLDLYLEHYGRVLMKQIRKPVDWL-ADPKRRNNTW 538
Query: 627 SPWI 630
PW+
Sbjct: 539 EPWM 542
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/74 (40%), Positives = 47/74 (63%), Gaps = 6/74 (8%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQ 66
A+ EYATDLFT EA+++IE+ + +PL+L ++HLA HA +E P ++ ++
Sbjct: 184 AYGMKREYATDLFTNEAIKIIENHELPRPLYLQISHLAVHAP-----IEQPDDSSRDEIV 238
Query: 67 YITDPNRRTYAALT 80
I +PNRR YA +
Sbjct: 239 QIREPNRRKYAKMV 252
>gi|380025784|ref|XP_003696648.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 579
Score = 431 bits (1109), Expect = e-118, Method: Compositional matrix adjust.
Identities = 237/544 (43%), Positives = 329/544 (60%), Gaps = 48/544 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAYNGIILN Y P TPSR + TG YPI GMQG I
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYNGIILNRHYVLPSSTPSRIAFFTGLYPIRIGMQGDGI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL + LPE+LR LGY+TK IGKWH+G+ +YTPL+RGF++ FG+ N I+
Sbjct: 102 RGGEPRGLPLHIKILPEHLRGLGYTTKLIGKWHMGYHTPQYTPLHRGFDTFFGFYNSHIT 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ S+Q + G+DM R A EY TDLFTKEA+++IE+ + +PL+L
Sbjct: 162 YYDYEYSNQ-----NMTGYDMHRGDDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQ 216
Query: 272 LAHLAAHAGNAGKHLEAPQE-TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
++HLA HA +E P + + ++ I +PNRR YA MV KLD+SVG ++ AL KGM
Sbjct: 217 ISHLAVHAP-----IEQPDDSSSDEIIQIREPNRRKYAKMVSKLDESVGRIVHALGEKGM 271
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L +S+I+F++DNGA ++ YRN+GSNYP RG K TL+EGGV+ A LWS ++++
Sbjct: 272 LRDSLILFLTDNGAASI-----GRYRNYGSNYPLRGTKYTLYEGGVRGVAALWSSRLEKA 326
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RV +++HI+DWLPTLY+AAGGD L IDG+DQW
Sbjct: 327 ARVFKKLIHITDWLPTLYSAAGGDLKDL----------------------GEIDGIDQWR 364
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLN 509
L R +L+NIDE +T +KL+ G+ NG D YYG + R+ + P
Sbjct: 365 VLSEEQGRGREKLLLNIDEVMKTEGAIYSRFKLLRGSIGNGYYDKYYGDSGRTLETPPYT 424
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA---PMTPSPCTNGP 566
+++S QS ++ ++ P++ M R++AT+ C N + + C
Sbjct: 425 -EVVLKSAVSQS---ITYHLGGPVTQPSTMIQLRREATVQCHPNVTYYYRHSFTFCNVTE 480
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ NDPCE NIA + I+ L L+ + R L+ Q + D + ADP+R N+TW
Sbjct: 481 C-LFDIVNDPCETKNIAEAYARIARDLDLYLEQYGRVLMKQIRKPVDWL-ADPRRRNNTW 538
Query: 627 SPWI 630
PW+
Sbjct: 539 EPWM 542
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 48/81 (59%), Gaps = 6/81 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R A EY TDLFTKEA+++IE+ + +PL+L ++HLA HA +E P +
Sbjct: 177 MHRGDDPAHGIKREYVTDLFTKEAIKIIENHELPRPLYLQISHLAVHAP-----IEQPDD 231
Query: 61 -TINQFQYITDPNRRTYAALT 80
+ ++ I +PNRR YA +
Sbjct: 232 SSSDEIIQIREPNRRKYAKMV 252
>gi|345495280|ref|XP_001606377.2| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 545
Score = 431 bits (1108), Expect = e-118, Method: Compositional matrix adjust.
Identities = 242/572 (42%), Positives = 332/572 (58%), Gaps = 48/572 (8%)
Query: 62 INQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGI 117
I F Y+ AA ++T ++ + GWND+ FHGS +IPTPNIDALAY+G+
Sbjct: 5 IFLFSYLLAARNWIVAAAPETTPPHIIFIVADDLGWNDVGFHGSGQIPTPNIDALAYSGL 64
Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYST 177
ILN Y P+CTPSR++LMTGKYPIHTGMQ + GAEPRG+PL E+ LPEYLRELGY T
Sbjct: 65 ILNRYYVSPICTPSRSALMTGKYPIHTGMQRGVLKGAEPRGLPLKEKLLPEYLRELGYRT 124
Query: 178 KAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
+GKWHLGF+ +EYTP YRGFESH GY G YYDH ++ G DMRRN+
Sbjct: 125 HIVGKWHLGFYTKEYTPTYRGFESHLGYWTGHQDYYDHSAVEE-----PYWGMDMRRNME 179
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
AWD G+Y+TD+FTKEAV+LI+ +P+FLYLAH A H+ N L E +++ +
Sbjct: 180 PAWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDEIVDKIK 239
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
I + RR +A M +D SVG V+ AL+++ ML + +I+F +DNG P + N
Sbjct: 240 -IPEYKRRRFAGMASAMDASVGAVVEALEKRNMLRDCLIVFSTDNGGPAEGFN-----LN 293
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
SN+P RGVK+TL+EGGV+ ++WS ++ + R+S HI+DWLPTL AAGG
Sbjct: 294 AASNWPLRGVKDTLFEGGVRGVGLVWSERLARPGRLSEATFHIADWLPTLLRAAGGY--- 350
Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
+N G IDGLD W +L + S R VL NID+ TAA+
Sbjct: 351 --VNATG-----------------KIDGLDAWEALADDKESPRRVVLHNIDDVLGTAAIS 391
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESK-TYQSLQQLSQNIFLPISNI 536
+D WKL+ G+ G + +YG S + + A+ E ++L++L + + +
Sbjct: 392 VDQWKLIKGSTYRGRWNQWYGP--SGRDYPYDVEAVAEGCLVAKALKRLGRGL-----DR 444
Query: 537 DKMRSTRQQATIHCGANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
K+ R QATIHCG P P C L ++ DPCE+ N+A +PD+ +L E
Sbjct: 445 GKLVELRNQATIHCGEQPQQPQKACNPLEAAC-LIDVRADPCERRNLADEQPDVVRRLEE 503
Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
L + VP + D QADP ++ TW+
Sbjct: 504 ELARLNASAVPPGNLPWD-SQADPALWDHTWN 534
Score = 74.7 bits (182), Expect = 1e-10, Method: Compositional matrix adjust.
Identities = 36/90 (40%), Positives = 55/90 (61%), Gaps = 1/90 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN+ AWD G+Y+TD+FTKEAV+LI+ +P+FLYLAH A H+ N L E
Sbjct: 174 MRRNMEPAWDLHGQYSTDVFTKEAVKLIKSHNASQPMFLYLAHAAVHSANPYNPLPVSDE 233
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIV 90
+++ + I + RR +A + + ++ V
Sbjct: 234 IVDKIK-IPEYKRRRFAGMASAMDASVGAV 262
>gi|158300602|ref|XP_552160.3| AGAP012047-PA [Anopheles gambiae str. PEST]
gi|157013239|gb|EAL38777.3| AGAP012047-PA [Anopheles gambiae str. PEST]
Length = 564
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 231/569 (40%), Positives = 331/569 (58%), Gaps = 42/569 (7%)
Query: 62 INQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGI 117
Q + I+ P+ RT + S ++ + GWND+ FHGS +IPTPN+DALAY+GI
Sbjct: 16 FQQGEAISCPSNRTENSSMDSARPNIIFILADDLGWNDVGFHGSAQIPTPNLDALAYSGI 75
Query: 118 ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYST 177
ILN Y P+CTPSR++LMTGKYPIHTGMQ ++ EPRG+PL+E+ LP+YL++LGYS
Sbjct: 76 ILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVLYAMEPRGLPLSEKLLPQYLKDLGYSN 135
Query: 178 KAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
+GKWHLG ++ +TP+ RGF+SH G+ G H ++D + G DMRR
Sbjct: 136 HIVGKWHLGHYQLRFTPMQRGFDSHTGFWTG-----HHHMNDHTAVEHGHWGLDMRRGYD 190
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
A+D G+Y T + EA+ +++ PLFLY+AH A H+ N L AP ET+
Sbjct: 191 VAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHAAVHSANPYDFLPAPDETVANLG 250
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
+I + RR +AAM+ +LD +VG+++ AL + MLEN+II+F SDNG P + + N
Sbjct: 251 HIENYRRRKFAAMMVELDRTVGSLVDALHARDMLENTIIVFSSDNGGPADGFND-----N 305
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
SN+P RGVKNTLWEGG++ +WSP +Q RVS QM+ + DWLPTLY AAGG+ S
Sbjct: 306 AASNWPLRGVKNTLWEGGLRAAGFIWSPLLQNVSRVSHQMVQVCDWLPTLYEAAGGNVSA 365
Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
LP ++DG+ W L SS P+RR +L NID+ TAA+
Sbjct: 366 LPTDLDGISVWHEL------------------SS---GAPTRRIEILHNIDDIWGTAALT 404
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+ +WKLV G+ N T DG+YG + + S + +++LS LP +
Sbjct: 405 VGNWKLVKGSHYNRTWDGWYGPAGIRDEKAYALDKLANSPAGKVMEELS---LLPTP--E 459
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
++ R++AT+ CGA + PC L+++ +DPCE NN+A I L L
Sbjct: 460 RITQLRREATVSCGAGAHMASECDPLERPC-LYDVESDPCEYNNLAEEHLHILQSLLARL 518
Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+ T VP S+ + D + +P+R+N TW
Sbjct: 519 ADYNSTAVPPSNLEDD-PRGEPQRWNFTW 546
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 44/79 (55%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR A+D G+Y T + EA+ +++ PLFLY+AH A H+ N L AP E
Sbjct: 185 MRRGYDVAYDLHGQYTTHVLGAEAIAIVQGHNKSSPLFLYVAHAAVHSANPYDFLPAPDE 244
Query: 61 TINQFQYITDPNRRTYAAL 79
T+ +I + RR +AA+
Sbjct: 245 TVANLGHIENYRRRKFAAM 263
>gi|157108842|ref|XP_001650409.1| arylsulfatase b [Aedes aegypti]
gi|108879187|gb|EAT43412.1| AAEL005134-PA [Aedes aegypti]
Length = 675
Score = 428 bits (1101), Expect = e-117, Method: Compositional matrix adjust.
Identities = 236/538 (43%), Positives = 319/538 (59%), Gaps = 34/538 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAY+GIILN Y P+CTPSRASLMTGK PI+ GMQ I
Sbjct: 43 GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +PEY +E GY T +GKWHLGF ++YTP RGF++H GYL +
Sbjct: 103 VSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGYLGPYVD 162
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+D+ L ++S G+DMR NL+ +D+ G YATD FTK A +IE PLFL
Sbjct: 163 YWDYTL--KFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLV 220
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ HLA HA N L+AP+E I +F YI+D RR YAAMV KLDDSVG + ++L+ K ML
Sbjct: 221 VNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRSKNML 280
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSII+FMSDNGAPT + N GSNYP RG+K+ WE + A +WSP +Q+
Sbjct: 281 DNSIILFMSDNGAPTAAL-----HANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQ 335
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q +HISDWLPTL +AAG D +P S+ +S IDG DQW +
Sbjct: 336 RVSNQFIHISDWLPTLASAAGID---IPF----------------SKDHSEIDGQDQWEA 376
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L +T + R VL IDE ++ + +K V GT NG+ DG+YGQ ++ L +
Sbjct: 377 LSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQ 436
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
I + + + I + D ++ R+ A ++C P +P PC LF+
Sbjct: 437 YIDLVLQTEITRWAGETI-----SRDTIKYLRKHARVNCNHQPEANKCNPLKR-PC-LFD 489
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ NDPCE N+++ P +L ++ +RR L + +P A+P F W+ W
Sbjct: 490 IINDPCELNDLSHKFPMKFRELRSTVQTYRR-LATKPRNKPADPAANPANFGGVWTWW 546
Score = 82.4 bits (202), Expect = 6e-13, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 50/80 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR NL+ +D+ G YATD FTK A +IE PLFL + HLA HA N L+AP+E
Sbjct: 181 MRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLVVNHLAPHAANDDDPLQAPEE 240
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F YI+D RR YAA+
Sbjct: 241 DIRKFDYISDERRRIYAAMV 260
>gi|157108840|ref|XP_001650408.1| arylsulfatase b [Aedes aegypti]
gi|108879186|gb|EAT43411.1| AAEL005134-PB [Aedes aegypti]
Length = 607
Score = 427 bits (1097), Expect = e-116, Method: Compositional matrix adjust.
Identities = 236/538 (43%), Positives = 319/538 (59%), Gaps = 34/538 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAY+GIILN Y P+CTPSRASLMTGK PI+ GMQ I
Sbjct: 43 GWNDVGFHGSNQIPTPNIDALAYDGIILNRHYTAPMCTPSRASLMTGKNPINIGMQHYVI 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +PEY +E GY T +GKWHLGF ++YTP RGF++H GYL +
Sbjct: 103 VSDEPWGLGLDQKIMPEYFKEAGYRTHLVGKWHLGFSAKQYTPTMRGFDTHVGYLGPYVD 162
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+D+ L ++S G+DMR NL+ +D+ G YATD FTK A +IE PLFL
Sbjct: 163 YWDYTL--KFSPPKSFQGYDMRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLV 220
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ HLA HA N L+AP+E I +F YI+D RR YAAMV KLDDSVG + ++L+ K ML
Sbjct: 221 VNHLAPHAANDDDPLQAPEEDIRKFDYISDERRRIYAAMVSKLDDSVGQIFNSLRSKNML 280
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSII+FMSDNGAPT + N GSNYP RG+K+ WE + A +WSP +Q+
Sbjct: 281 DNSIILFMSDNGAPTAAL-----HANTGSNYPLRGIKSVPWEAATRCVAAIWSPLLQERQ 335
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q +HISDWLPTL +AAG D +P S+ +S IDG DQW +
Sbjct: 336 RVSNQFIHISDWLPTLASAAGID---IPF----------------SKDHSEIDGQDQWEA 376
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L +T + R VL IDE ++ + +K V GT NG+ DG+YGQ ++ L +
Sbjct: 377 LSYDTGNPRRVVLNMIDEIYGYSSYMENGFKFVNGTYSNGSYDGWYGQPNTSDQTLSDDQ 436
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
I + + + I + D ++ R+ A ++C P +P PC LF+
Sbjct: 437 YIDLVLQTEITRWAGETI-----SRDTIKYLRKHARVNCNHQPEANKCNPLKR-PC-LFD 489
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ NDPCE N+++ P +L ++ +RR L + +P A+P F W+ W
Sbjct: 490 IINDPCELNDLSHKFPMKFRELRSTVQTYRR-LATKPRNKPADPAANPANFGGVWTWW 546
Score = 82.4 bits (202), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 50/80 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR NL+ +D+ G YATD FTK A +IE PLFL + HLA HA N L+AP+E
Sbjct: 181 MRNNLNVDYDSNGTYATDHFTKAASSIIERHDTKDPLFLVVNHLAPHAANDDDPLQAPEE 240
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F YI+D RR YAA+
Sbjct: 241 DIRKFDYISDERRRIYAAMV 260
>gi|170050440|ref|XP_001861313.1| arylsulfatase b [Culex quinquefasciatus]
gi|167872047|gb|EDS35430.1| arylsulfatase b [Culex quinquefasciatus]
Length = 552
Score = 426 bits (1095), Expect = e-116, Method: Compositional matrix adjust.
Identities = 231/535 (43%), Positives = 314/535 (58%), Gaps = 40/535 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS +IPTPN+DALAY+GIILN Y P+CTPSRA+LMTG+YPIHTGMQ +
Sbjct: 36 GWNDVGFHGSAQIPTPNLDALAYSGIILNRYYVTPICTPSRAALMTGRYPIHTGMQHAVL 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G EPRG+PL E+ LPEYLRELGY +GKWHLG + R YTPL RGF+SH G+ G
Sbjct: 96 YGMEPRGLPLEEKLLPEYLRELGYKNHIVGKWHLGHYTRRYTPLERGFDSHVGFWTG--- 152
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
H + D + E G DMRR A+D G+Y T + EAV I + + PLFLY
Sbjct: 153 --HHHMFDHSAVETETWGLDMRRGYDVAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLY 210
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+AH A H+ N L AP T+ +++ RR +AAM+ +LD+SVG ++ AL+ KGML
Sbjct: 211 VAHAAVHSANPYDFLPAPDVTVAGLEHVEPYPRRKFAAMLSELDESVGAIVEALKIKGML 270
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+II+F SDNG P + N SN+P RGVKNTLWEGGV+ +WSP I ++
Sbjct: 271 DNTIIVFSSDNGGPAEGFNS-----NAASNWPLRGVKNTLWEGGVRAAGFIWSPLIAESR 325
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS QM+HISDWLPTL AAG + LP +DG++ W SLL N N
Sbjct: 326 RVSQQMIHISDWLPTLLDAAGYNVETLPKTLDGINVWKSLL-------NGN--------- 369
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+ R +L NID+ +AA+ +++WK++ GT G D +YG N
Sbjct: 370 -----STERREILHNIDDIWGSAALTVNNWKILKGTNYKGQWDNWYGPPGDRDPEAYNIT 424
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
A+ T ++L +++ LP +R+ R AT+HC + T +P PC L++
Sbjct: 425 AVQHCPTGKALSEMN---LLPTEA--TIRNLRLNATVHC-SKEVRATCNPLEQ-PC-LYD 476
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+ DPCE +N+A P I + + L + T VP + D + DP+ + TW
Sbjct: 477 VLQDPCELDNLAERFPAILDSMLKKLDAYNATAVPPGNLALD-TRGDPRFWGYTW 530
Score = 61.2 bits (147), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 43/79 (54%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR A+D G+Y T + EAV I + + PLFLY+AH A H+ N L AP
Sbjct: 171 MRRGYDVAYDLHGKYTTHVIRDEAVARIGNHSIGDPLFLYVAHAAVHSANPYDFLPAPDV 230
Query: 61 TINQFQYITDPNRRTYAAL 79
T+ +++ RR +AA+
Sbjct: 231 TVAGLEHVEPYPRRKFAAM 249
>gi|281363223|ref|NP_610807.3| CG8646 [Drosophila melanogaster]
gi|17945274|gb|AAL48694.1| RE14504p [Drosophila melanogaster]
gi|272432448|gb|AAF58475.2| CG8646 [Drosophila melanogaster]
Length = 562
Score = 424 bits (1091), Expect = e-116, Method: Compositional matrix adjust.
Identities = 239/541 (44%), Positives = 323/541 (59%), Gaps = 43/541 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS EIPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL ELGY++ GKWHLG ++ +YTPLYRGF SH G+ +G
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHVGFWSGHQD 156
Query: 212 YYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PL 268
Y DH VE N G DMR A+D G Y TD+ T +V++I + K PL
Sbjct: 157 YNDHT-------AVENNQWGLDMRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPL 209
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY+AH A H+ N L P + + +I + RR +AAMV K+D+SVG ++ L++
Sbjct: 210 FLYVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDNSVGQIVDQLRKS 269
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
MLENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++
Sbjct: 270 NMLENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLK 324
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
++ RVS Q MHI DWLPTL AAGG + S + IDG
Sbjct: 325 KSQRVSNQTMHIIDWLPTLLEAAGGQPAL-------------------SNLSKQIDGQSI 365
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
W +L+ + S R +VL NID+ +AA+ + WKLV GT G+ DG+YG L
Sbjct: 366 WRALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLY 425
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGP 566
++ + S+ ++L+ L LP S D+ R R AT+ C + T T + P
Sbjct: 426 DWQLVGRSRAGKALEALK---MLP-SRADQQR-IRAAATVSCPGQSSQGTSCVATAFSAP 480
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ +DPCEQ N+A P++ + L L+ T VP S++ D +ADP+ +N TW
Sbjct: 481 C-LFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAVPPSNKPAD-PRADPRFWNYTW 538
Query: 627 S 627
+
Sbjct: 539 T 539
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 43/81 (53%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T +V++I + K PLFLY+AH A H+ N L P
Sbjct: 172 MRNGTQVAYDLHGHYTTDVITDHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 231
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ + +I + RR +AA+
Sbjct: 232 NDVIKMSHIPNYKRRKFAAMV 252
>gi|307187655|gb|EFN72627.1| Arylsulfatase B [Camponotus floridanus]
Length = 591
Score = 423 bits (1088), Expect = e-115, Method: Compositional matrix adjust.
Identities = 237/557 (42%), Positives = 338/557 (60%), Gaps = 60/557 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ FHGS++I TPNIDAL YNGIILN Y P TPSR + TG+YPI GMQG I
Sbjct: 41 GWDDVGFHGSDQIRTPNIDALGYNGIILNRHYVLPSSTPSRTAFFTGQYPIRMGMQGEDI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+PL R LPE+LR+LGY TK IGKWHLG++ ++TPL RGF+S G+ N +S
Sbjct: 101 QGGEPRGIPLNVRILPEFLRDLGYMTKLIGKWHLGYYTPQHTPLRRGFDSFLGFYNSHVS 160
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY++ S Q ++G+DM R + A+ + +Y TD FT EA+++IE +PL+L
Sbjct: 161 YYNYKYSFQ-----NMSGYDMHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQ 215
Query: 272 LAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
++HLA HA LE PQ+ +QF +I + NRR YA MV +LD+SVG ++ AL K
Sbjct: 216 ISHLAVHA-----PLEGPQDYNHYDSQFLHIREINRRKYAKMVLRLDNSVGRIVQALGNK 270
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
GML+NS+I+F++DNGA + +RN+GSNYP RG+K TL+EGGV+ A+LWSP+++
Sbjct: 271 GMLKNSLILFLTDNGAAPI-----GKFRNYGSNYPLRGMKYTLYEGGVRGVALLWSPRLR 325
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ RVS ++MH++DWLPTLY+ AGGD L IDG+DQW L N ++
Sbjct: 326 KTARVSDELMHVTDWLPTLYSIAGGDVRDLG-EIDGIDQWCML--------NGSL----- 371
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
PS R+ +L+NIDE +T +KLV G+ E G DGY+ VP++
Sbjct: 372 --------PSARDRLLLNIDEVSKTEGAIYKQFKLVRGSIEGGLYDGYFYDHIERNVPIM 423
Query: 509 NF-----NAIVESKTYQSLQQLSQNI-------FL--PISNIDKMRSTRQQATIHCGA-N 553
IV+ + + +++ FL P++ + R++AT++C + N
Sbjct: 424 LIMPSDPKTIVQEDVPPYTETILRSVVSETITSFLGDPVTQPSTIIQLRREATVNCRSRN 483
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
+ +T C C LF++ NDPCE NNIA IS +L + L+ + +V + + P
Sbjct: 484 RSFLT---CNVTEC-LFDVNNDPCETNNIAEQYSRISRELDKSLEQYWH-IVKRQMKIPV 538
Query: 614 LVQADPKRFNDTWSPWI 630
ADP++ N+TW PWI
Sbjct: 539 DWLADPRKTNNTWEPWI 555
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 8/83 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R + A+ + +Y TD FT EA+++IE +PL+L ++HLA HA LE PQ+
Sbjct: 176 MHRGDAPAYGSTDKYVTDFFTDEAIKIIEYYDPSRPLYLQISHLAVHA-----PLEGPQD 230
Query: 61 ---TINQFQYITDPNRRTYAALT 80
+QF +I + NRR YA +
Sbjct: 231 YNHYDSQFLHIREINRRKYAKMV 253
>gi|380025315|ref|XP_003696421.1| PREDICTED: arylsulfatase B-like [Apis florea]
Length = 546
Score = 419 bits (1077), Expect = e-114, Method: Compositional matrix adjust.
Identities = 230/542 (42%), Positives = 330/542 (60%), Gaps = 44/542 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG+N+IPTPNIDALAYNG+IL Y P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 35 GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPR +PL LPEYLR+LGY+T +GKWH+G++ +TP RGF++ FGY NG IS
Sbjct: 95 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYNGYIS 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H + +L+ H+ +NLS A++ EY TDL T+ A +I++ KPL+L
Sbjct: 155 YFNHTIKQNNHVGYDLHYHN-SKNLSVAYNF--EYITDLITERAENIIKNHDRRKPLYLQ 211
Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L+HLA H+ +A + +E QET +YI D NRR YA +V +D+SVG VI AL +
Sbjct: 212 LSHLAVHSSDAKEVMEVRDEQETNATLEYIEDYNRRKYAGVVTAMDESVGRVIKALGQSS 271
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSII+FMSDNGA T E N+GSNYP RG+K TL+EGG++ A ++S IQ
Sbjct: 272 MLENSIIVFMSDNGAQTEGLLE-----NYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQN 326
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ R+S +++HI+DWLPT Y+AAGG+ L N+DG+DQW +++ S R
Sbjct: 327 SSRISNELIHITDWLPTFYSAAGGNLENLEENMDGVDQWDTIV----SGRE--------- 373
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
S+R SVL+NIDE + ++ L +KL+ G +N YYG S P
Sbjct: 374 --------SKRGSVLLNIDEVENVSSALLGKYKLING--KNIQYSDYYGDNGTSVSYPEY 423
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
N ++++ S ++ + N L I ++ K+R+ +A + C T P C
Sbjct: 424 NVSSVLHSFVGSTIFH-TTNFTLNIEDVIKLRN---KARVVCNN----FTSYPKCVDKC- 474
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LF++ NDPCE +++ P+I +L + + + L+ Q++ D + P+ FN TW P
Sbjct: 475 LFDVYNDPCETTDLSDEYPEIVKKLNYFITEYEKVLLHQTNAPVD-PHSYPENFNGTWMP 533
Query: 629 WI 630
WI
Sbjct: 534 WI 535
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 50/82 (60%), Gaps = 4/82 (4%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQE 60
+NLS A++ EY TDL T+ A +I++ KPL+L L+HLA H+ +A + +E QE
Sbjct: 176 KNLSVAYNF--EYITDLITERAENIIKNHDRRKPLYLQLSHLAVHSSDAKEVMEVRDEQE 233
Query: 61 TINQFQYITDPNRRTYAALTKS 82
T +YI D NRR YA + +
Sbjct: 234 TNATLEYIEDYNRRKYAGVVTA 255
>gi|198455736|ref|XP_001360091.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
gi|198135374|gb|EAL24665.2| GA21235 [Drosophila pseudoobscura pseudoobscura]
Length = 545
Score = 418 bits (1074), Expect = e-114, Method: Compositional matrix adjust.
Identities = 235/543 (43%), Positives = 318/543 (58%), Gaps = 62/543 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL +LGY++ GKWHLG ++ EYTPL+RGF SH
Sbjct: 97 YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSH--------- 147
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
L G DMR A++ G+Y TD+ TK +V +I + K PLFL
Sbjct: 148 -------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFL 194
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+GN L P + + + +I RR YAAMV K+D+SVG ++ L++ M
Sbjct: 195 YVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQLRKSNM 254
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP RGVKNTLWEGGV+ ++WSP +++
Sbjct: 255 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEKR 309
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN----IDGL 446
RV + MHISDWLPTL AAGG P+ N+N +DG+
Sbjct: 310 QRVVDETMHISDWLPTLIEAAGG---------------------IPALANANLSQSLDGM 348
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
W +L+ PS R +VL NID+ +AA+ + WK++ GT NG DG+YG
Sbjct: 349 SIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKMIQGTNYNGRWDGWYGPGGERDPH 408
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--N 564
L ++N +++S + +++ L LP S D+ R R+ AT+ C T T +
Sbjct: 409 LYDWNLVLKSSSGKAMTGLK---MLP-SRADQQR-LRETATVTCQGQSTQGTSCVATAFS 463
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
PC LF++ +DPCEQ N+A P + L L++ T VP S+ QPD +A P FN
Sbjct: 464 APC-LFHVRDDPCEQFNLAEQYPKVLDLLMAELQHINSTAVPPSN-QPDDPRASPMYFNY 521
Query: 625 TWS 627
TW+
Sbjct: 522 TWT 524
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A++ G+Y TD+ TK +V +I + K PLFLY+AH A H+GN L P
Sbjct: 155 MRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAHAAGHSGNPYNPLPVPD 214
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
+ + + +I RR YAA+ ++ ++
Sbjct: 215 DDVMKLDHILHYKRRRYAAMVAKMDESVGVI 245
>gi|195148952|ref|XP_002015426.1| GL11077 [Drosophila persimilis]
gi|194109273|gb|EDW31316.1| GL11077 [Drosophila persimilis]
Length = 545
Score = 417 bits (1073), Expect = e-114, Method: Compositional matrix adjust.
Identities = 235/543 (43%), Positives = 318/543 (58%), Gaps = 62/543 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL +LGY++ GKWHLG ++ EYTPL+RGF SH
Sbjct: 97 YAAEPRGLPLKEKILPQYLNDLGYTSHISGKWHLGHWKLEYTPLFRGFSSH--------- 147
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
L G DMR A++ G+Y TD+ TK +V +I + K PLFL
Sbjct: 148 -------------KNLWGLDMRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFL 194
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+GN L P + + + +I RR YAAMV K+D+SVG ++ L++ M
Sbjct: 195 YVAHAAGHSGNPYNPLPVPDDDVMKLDHILHYKRRRYAAMVAKMDESVGVIVDQLRKSNM 254
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP RGVKNTLWEGGV+ ++WSP +++
Sbjct: 255 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLRGVKNTLWEGGVRAAGLVWSPLLEKR 309
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN----IDGL 446
RV + MHISDWLPTL AAGG P+ N+N +DG+
Sbjct: 310 QRVVDETMHISDWLPTLIEAAGG---------------------IPALANANLSQSLDGM 348
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
W +L+ PS R +VL NID+ +AA+ + WK++ GT NG DG+YG
Sbjct: 349 SIWQTLVHAYPSPRKTVLHNIDDIWGSAALTVGDWKMIQGTNYNGRWDGWYGPGGERDPH 408
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--N 564
L ++N +++S + +++ L LP S D+ R R+ AT+ C T T +
Sbjct: 409 LYDWNLVLKSSSGKAMTGLK---MLP-SRADQQR-LRETATVTCQGQSTQGTSCVATAFS 463
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
PC LF++ +DPCEQ N+A P + L L++ T VP S+ QPD +A P FN
Sbjct: 464 APC-LFHVRDDPCEQFNLAEQYPKVLDLLVAELQHINSTAVPPSN-QPDDPRASPMYFNY 521
Query: 625 TWS 627
TW+
Sbjct: 522 TWT 524
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 31/91 (34%), Positives = 49/91 (53%), Gaps = 1/91 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A++ G+Y TD+ TK +V +I + K PLFLY+AH A H+GN L P
Sbjct: 155 MRNGTDVAYNLHGQYTTDVITKHSVSVIANHDAAKGPLFLYVAHAAGHSGNPYNPLPVPD 214
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
+ + + +I RR YAA+ ++ ++
Sbjct: 215 DDVMKLDHILHYKRRRYAAMVAKMDESVGVI 245
>gi|118779434|ref|XP_309303.3| AGAP011348-PA [Anopheles gambiae str. PEST]
gi|116131546|gb|EAA05277.3| AGAP011348-PA [Anopheles gambiae str. PEST]
Length = 573
Score = 417 bits (1072), Expect = e-114, Method: Compositional matrix adjust.
Identities = 236/540 (43%), Positives = 314/540 (58%), Gaps = 38/540 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN+IPTPNIDALAY+GIILN Y P+CTPSRASLMTGK+P++ GMQ I
Sbjct: 39 GWNDVSFHGSNQIPTPNIDALAYDGIILNRHYVPPLCTPSRASLMTGKHPMNIGMQDHVI 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +P+Y RE GY T +GKWHLGFFRR YTP YRGF+SHFGYL I
Sbjct: 99 ISDEPWGLGLDQKLMPQYFREAGYRTHLVGKWHLGFFRRAYTPTYRGFDSHFGYLGPYID 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+DH L Q + T G DMRRN + +D G YATDLF EAV+LI+ KPLFL
Sbjct: 159 YWDHSL--QMNET-SARGLDMRRNTAVNYDANGTYATDLFNDEAVRLIDSHNRSKPLFLV 215
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L HLA H GN L+AP + I +F YI DP RRT AAMV ++D VG + +L+R+ ML
Sbjct: 216 LTHLAPHTGNEDDPLQAPADEIAKFDYIQDPKRRTLAAMVSRIDTGVGRIYRSLERRRML 275
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
N+II+F +DNGAPT+ S GSNYP RG K + WEG V+ A++WS + +
Sbjct: 276 NNTIILFYADNGAPTLGIHANS-----GSNYPLRGQKESPWEGAVRGAALIWSRLLPRKG 330
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
VS Q +H+SDWLPTL AAG + I S NS IDG +QWS+
Sbjct: 331 IVSNQWLHVSDWLPTLGHAAG-------IRI--------------SPNNSPIDGQNQWST 369
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN-- 509
L ++ S R V+ N + ++ WK V GT G D + GQ + +
Sbjct: 370 LTSSSNSGRTVVMNNAHNEFTYSSYIKRGWKYVNGTSFKGAYDRWLGQLKDGEQISHEEY 429
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ ++ S+T S QL+++ + + + + Q T++ A P PC L
Sbjct: 430 YKRLIASQTIGSSMQLTRDDVKRLRSKAMVECSSAQGTVYVQAIPCEPLVKPC------L 483
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DPCE+ N+A P++ L ++ +R +P QP ++DP N TW+ W
Sbjct: 484 FNVVEDPCERVNLADQYPELLKDLQSDVERFKRDALP-PRNQPSDSRSDPALHNYTWTWW 542
Score = 87.8 bits (216), Expect = 1e-14, Method: Compositional matrix adjust.
Identities = 44/91 (48%), Positives = 54/91 (59%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRN + +D G YATDLF EAV+LI+ KPLFL L HLA H GN L+AP +
Sbjct: 176 MRRNTAVNYDANGTYATDLFNDEAVRLIDSHNRSKPLFLVLTHLAPHTGNEDDPLQAPAD 235
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
I +F YI DP RRT AA+ + +Y
Sbjct: 236 EIAKFDYIQDPKRRTLAAMVSRIDTGVGRIY 266
>gi|195057745|ref|XP_001995315.1| GH22700 [Drosophila grimshawi]
gi|193899521|gb|EDV98387.1| GH22700 [Drosophila grimshawi]
Length = 542
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 228/539 (42%), Positives = 314/539 (58%), Gaps = 54/539 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ F GS +IPTPNIDALAY+G+ILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 43 GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL + LP+YL ELGY++ GKWHLG ++R YTPLYRGF SH GY
Sbjct: 103 YAAEPRGLPLDLKILPQYLNELGYTSHIAGKWHLGHWKRVYTPLYRGFSSHHGYW----- 157
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
G DMR A+D G+Y TD+ T+ ++Q+I + +P PLFL
Sbjct: 158 -----------------GLDMRNGTEIAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFL 200
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+GN L A + + + I RR YAA+V +D+SVG ++ L++ M
Sbjct: 201 YVAHAAVHSGNPYNPLPASDDAVRRLDKIQHYKRRKYAALVTAMDESVGKIVEQLRKSRM 260
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSII+F +DNG P + N+ SNYP RGVKNTLWEGGV+ A+LWSPQ+ +
Sbjct: 261 LENSIIVFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEGGVRGAALLWSPQLTKR 315
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
PR + Q+MH+ DWLPTL AAGG + L + +DG+ W
Sbjct: 316 PRTAEQLMHMVDWLPTLVEAAGGKVALSKLEA------------------AKLDGVSLWQ 357
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ N S RNS+L NID+ +AA+ + WKLV G+ NG+ DG+YG N+
Sbjct: 358 ALVRNEASPRNSILHNIDDIWGSAALSVGVWKLVQGSHYNGSWDGWYGPAGLRNPHDYNW 417
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN--GPCY 568
+ + Q++QQL LP D +R R+ AT+ C +P N
Sbjct: 418 QLVTKCAAGQAMQQLK---MLP-QQADIIR-LRKAATVSCKEAKVKPSPDIACNPLKEIC 472
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LFN+ +DPCEQ N+A P + + L L+ T +P ++ +PD + DP+ +N TW+
Sbjct: 473 LFNVLDDPCEQRNLAKQHPKVLALLLNELQLFNATAIPPAN-KPDDPRGDPRLWNHTWT 530
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 47/83 (56%), Gaps = 1/83 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G+Y TD+ T+ ++Q+I + +P PLFLY+AH A H+GN L A
Sbjct: 161 MRNGTEIAYDLHGQYTTDVITQHSLQVIANHKPAKGPLFLYVAHAAVHSGNPYNPLPASD 220
Query: 60 ETINQFQYITDPNRRTYAALTKS 82
+ + + I RR YAAL +
Sbjct: 221 DAVRRLDKIQHYKRRKYAALVTA 243
>gi|156547173|ref|XP_001603910.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 578
Score = 416 bits (1070), Expect = e-113, Method: Compositional matrix adjust.
Identities = 225/545 (41%), Positives = 325/545 (59%), Gaps = 44/545 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHG+ +IPTPNIDAL YNGIILN Y P C+P+RA+ +TGKYPI GMQG I
Sbjct: 36 GWNDVGFHGATQIPTPNIDALGYNGIILNKHYVLPSCSPTRAAFLTGKYPIRMGMQGAGI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G EPRG+P+ + LPEYL+ LGY T IGKWH+G+ ++ P RGF+ +G+ N I
Sbjct: 96 AGGEPRGLPVHVQTLPEYLQGLGYETNLIGKWHVGYHTPKHLPNRRGFDYFYGFYNSHIG 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ +YS+ ++G DM N TA+ T G YATD FT+ A+ +I ++ P++L
Sbjct: 156 YYDY----RYSQG-NMSGFDMHINGETAYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQ 210
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
++HLA HA + P + ++F++I++P RR YA MV +LDDS+G ++S+L +GML
Sbjct: 211 VSHLAPHAPMDVPFEDNPYD--DEFRHISEPKRRAYAKMVARLDDSLGRIVSSLGDRGML 268
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+I+FMSDNG T+ +RNW SN+P RG K TL+EGGV+ A +WSP++
Sbjct: 269 KNSVIVFMSDNGGATI-----GKFRNWASNWPLRGTKYTLFEGGVRSVAAMWSPKLLSKG 323
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q+ H++DWLPTL+ AGGD L IDG+ QW
Sbjct: 324 RVSEQLFHVTDWLPTLFQVAGGDMRDL----------------------GPIDGVSQWDV 361
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLLNF 510
L + T + R+ +L+NIDE +T A +K+V G+ +G D G+ R +K P+ N
Sbjct: 362 LSIGTGNVRDKILLNIDEYTQTEAAIDKRFKIVRGSLYHGYYDNVEGEIGRGHKNPMYNT 421
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS-----PCTNG 565
+ I++S +++ +++ +P++ M ++QAT+ C N M + P N
Sbjct: 422 SRILKSAVSDAIR---EHLGVPVTQESVMWELQRQATVLCRPNMTNMQYTSRYTLPSCNQ 478
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
LF+LG DPCE NN+A S P +L EL L+ + P ADP+R+N T
Sbjct: 479 TACLFDLGIDPCETNNVAKSYPAQLQEL-ELFLDKYSNLLTRQIRMPVDYMADPRRWNYT 537
Query: 626 WSPWI 630
W PW+
Sbjct: 538 WQPWL 542
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 48/80 (60%), Gaps = 2/80 (2%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M N TA+ T G YATD FT+ A+ +I ++ P++L ++HLA HA + P +
Sbjct: 171 MHINGETAYGTDGVYATDRFTQAAIDVIYRHDLESPMYLQVSHLAPHAPMDVPFEDNPYD 230
Query: 61 TINQFQYITDPNRRTYAALT 80
++F++I++P RR YA +
Sbjct: 231 --DEFRHISEPKRRAYAKMV 248
>gi|307187654|gb|EFN72626.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 415 bits (1067), Expect = e-113, Method: Compositional matrix adjust.
Identities = 238/544 (43%), Positives = 329/544 (60%), Gaps = 49/544 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG++EIPTPNIDALAYNG+ILN Y P+CTPSR + +TGKYPI TGMQG P+
Sbjct: 15 GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPICTPSRTAFLTGKYPIRTGMQGYPL 74
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GAEPRG+ L LPEYL++LGY+T +GKWH+G+ R Y P +RGF++ GY NG I
Sbjct: 75 QGAEPRGILLNNILLPEYLQKLGYATHLVGKWHVGYHTRNYGPTHRGFDTFAGYYNGYIQ 134
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y++H L + E G+D+ R + + +Y TDL T EA +I KPL+
Sbjct: 135 YFNHTLYES-----EQLGYDLHRIIGDDHKIEYRYDYMTDLITDEAENIISSHNPAKPLY 189
Query: 270 LYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
L +AHLAAH+ +A + +E +ET YI D NRR YA++V LD+SVG VI AL++
Sbjct: 190 LQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDINRRKYASVVATLDESVGRVIDALKK 249
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
ML NSII+F+SDNGA T + + N+GSNYP RG+K +L+EGG++ A ++SP I
Sbjct: 250 TDMLNNSIIVFISDNGAQTEGFLQ-----NYGSNYPLRGLKFSLFEGGIRGAACIYSPLI 304
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ RVS Q+ HI+DWLPTLY+AAGG++S L +DG DQWS++ S +N
Sbjct: 305 DRPSRVSNQLFHITDWLPTLYSAAGGNSSDLK-QLDGFDQWSAI----KSAKN------- 352
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV-P 506
S+R+S+LINIDEK+ + A + +KLV T E YY + +N + P
Sbjct: 353 ----------SKRSSILINIDEKENSEAALIGHYKLVTDTSE---YQKYYNYSGNNALYP 399
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
N I+ S ++ +S + N K+ R++ATI C N S CTN
Sbjct: 400 KYNVTTILASPAASAIASISISTL----NASKIMQLRKEATIIC-KNFTDF--SNCTNRS 452
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF+L DPCE ++ S P +L + ++ L+ Q++ D P+ FNDTW
Sbjct: 453 C-LFDLNKDPCETTDLFSKHPKDVKRLNMFIDRYKSVLMNQTNAPVDPA-GFPRNFNDTW 510
Query: 627 SPWI 630
PW+
Sbjct: 511 MPWL 514
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%), Gaps = 2/71 (2%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
+Y TDL T EA +I KPL+L +AHLAAH+ +A + +E +ET YI D
Sbjct: 165 DYMTDLITDEAENIISSHNPAKPLYLQVAHLAAHSSDAEEEMEVRNWKETNATLGYIEDI 224
Query: 72 NRRTYAALTKS 82
NRR YA++ +
Sbjct: 225 NRRKYASVVAT 235
>gi|195333848|ref|XP_002033598.1| GM21416 [Drosophila sechellia]
gi|194125568|gb|EDW47611.1| GM21416 [Drosophila sechellia]
Length = 542
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 233/539 (43%), Positives = 315/539 (58%), Gaps = 59/539 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS EIPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL ELGY++ GKWHLG ++ +YTPLYRGF SH+G
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
DMR A+D G Y TD+ T ++V++I + K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFL 191
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+ N L P + + +I + RR +AAMV K+DDSVG ++ L++ M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q MHI DWLPTL AAGG + L + IDG W
Sbjct: 307 QRVSNQTMHIVDWLPTLLEAAGGQPALANL-------------------SKQIDGQSIWR 347
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + S R +VL NID+ +AA+ + WKLV GT G+ DG+YG L ++
Sbjct: 348 ALVHDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDW 407
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
+ S+ ++L+ L LP S D+ R R AT+ C + T T + PC
Sbjct: 408 QLVGRSRAGKALEALK---MLP-SRADQQR-IRAAATVSCPGQSSQGTSCVATAFSAPC- 461
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LF++ +DPCEQ N+A P++ S L L+ T VP S++ D +ADP+ +N TW+
Sbjct: 462 LFHIRDDPCEQYNLAKQYPEVVSALMAELERFNATAVPPSNKPAD-PRADPRFWNFTWT 519
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T ++V++I + K PLFLY+AH A H+ N L P
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITDQSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ + +I + RR +AA+
Sbjct: 212 NDVIKMSHIPNYKRRKFAAMV 232
>gi|328788246|ref|XP_395125.4| PREDICTED: arylsulfatase B-like [Apis mellifera]
Length = 562
Score = 414 bits (1063), Expect = e-112, Method: Compositional matrix adjust.
Identities = 226/542 (41%), Positives = 334/542 (61%), Gaps = 43/542 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG+N+IPTPNIDALAYNG+IL Y P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50 GWNDVSFHGANQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPR +PL LPEYLR+LGY+T +GKWH+G++ +TP RGF++ FGY +G IS
Sbjct: 110 KAGEPRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPTRRGFDTFFGYYSGYIS 169
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H + +L+ +D +NLS ++ EY TDL T+ A +I++ KPL+L
Sbjct: 170 YFNHTIKQDDHIGYDLH-YDNSKNLSIDYNF--EYTTDLITERAENIIKNHDRRKPLYLQ 226
Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L HLAAH+ +A + +E QET +YI D NRR YA +V +D+SVG VI AL +
Sbjct: 227 LCHLAAHSSDAKEVMEVRDEQETNATLKYIEDYNRRKYAGVVTAMDESVGRVIKALGQSS 286
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSII+F+SDNGA T E N+GSNYP RG+K TL+EGG++ A ++S IQ
Sbjct: 287 MLENSIIVFISDNGAQTEGLLE-----NYGSNYPLRGLKFTLFEGGIRGVACVYSRLIQN 341
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ R+S ++MHI+DWLPT Y+AAGG+ L N+DG+DQW ++ + G +
Sbjct: 342 SSRISNELMHITDWLPTFYSAAGGNLENLEENMDGVDQWDTI-----------VSGKE-- 388
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
S+R SVL+NIDE + ++ + +KL++ + N + YYG S P
Sbjct: 389 --------SKRESVLLNIDEVEDVSSALIGKYKLIINGK-NIQYNDYYGDNGTSVSYPEY 439
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
N ++++ S ++ + N L I ++ K+R+ +A + C + S C + C
Sbjct: 440 NVSSVLHSIAGSTIFH-ATNFTLNIKDVIKLRN---KARVVCN---NFTSYSKCVD-KC- 490
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LF++ NDPCE ++++ P+I +L + + + L+ Q++ D ++ P+ FN TW P
Sbjct: 491 LFDIYNDPCETTDLSNEHPEIVKKLNYFIAEYEKVLLHQTNAPVD-PRSYPENFNGTWMP 549
Query: 629 WI 630
WI
Sbjct: 550 WI 551
Score = 60.1 bits (144), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 49/82 (59%), Gaps = 4/82 (4%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQE 60
+NLS ++ EY TDL T+ A +I++ KPL+L L HLAAH+ +A + +E QE
Sbjct: 191 KNLSIDYNF--EYTTDLITERAENIIKNHDRRKPLYLQLCHLAAHSSDAKEVMEVRDEQE 248
Query: 61 TINQFQYITDPNRRTYAALTKS 82
T +YI D NRR YA + +
Sbjct: 249 TNATLKYIEDYNRRKYAGVVTA 270
>gi|193641058|ref|XP_001942872.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 575
Score = 410 bits (1053), Expect = e-111, Method: Compositional matrix adjust.
Identities = 234/556 (42%), Positives = 321/556 (57%), Gaps = 47/556 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS +IPTPNIDALAYNG ILN Y QP CTPSRA+L+TGKYPI G+QGPPI
Sbjct: 44 GWNDVGFHGSIQIPTPNIDALAYNGAILNRHYVQPTCTPSRAALLTGKYPIRYGLQGPPI 103
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ +P E+ LP+YL+ELGYST +GKWHLG +++ +TP RGF+SHFGY NG IS
Sbjct: 104 ASGKASALPTNEKILPQYLKELGYSTHLVGKWHLGHYQKRFTPTKRGFDSHFGYWNGYIS 163
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDK-PL 268
Y + S +RT ++G D RR A + + YATD+FT+EA ++IE + +
Sbjct: 164 YRN---STHATRT--MSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEM 218
Query: 269 FLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
FL ++HLA H+GN+G HLE +T N F YI + NRR YA MV LDDSVG +I +L
Sbjct: 219 FLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESL 278
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
MLENSI++F+SDNGAPT + + N+GSN+P RG K ++ EGGV+ A +WSP
Sbjct: 279 HEHEMLENSIVVFISDNGAPTDD--PLWGHDNFGSNWPLRGKKASVLEGGVRGVAAIWSP 336
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+++ VS + HI+DWLPTLYTAAGG+ L IDG
Sbjct: 337 WLKEKNTVSNNLFHITDWLPTLYTAAGGNFEDL----------------------GKIDG 374
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ-ENGTMDGYYGQTRSNK 504
++QW SL + R+ +LINIDE + A+ + WK+V + N + Y N
Sbjct: 375 VNQWKSLTDTSKQVRSKILINIDETRGEEALIFNQWKVVKSNRTSNMVFNLKYAGDPGNA 434
Query: 505 VPLLNFNAIVESKTYQSLQQ---LSQNIFLPISNI-DKMRSTRQQATIH---CG---ANP 554
P N + +V S + L + L+Q+ + S D +R A + C +NP
Sbjct: 435 GPKYNMSGVVASTVGKRLSKINCLTQDSCMNASTARDVFYVSRSLAKVDGQTCPERVSNP 494
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
P C +G C LF++ DPCE N+A P +L+ ++ + Q+ +
Sbjct: 495 NPDQQYECFDGYC-LFDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKEMTMQAPDPIID 553
Query: 615 VQADPKRFNDTWSPWI 630
ADP+RF W PW+
Sbjct: 554 PDADPRRFAGYWEPWL 569
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 4/72 (5%)
Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITD 70
YATD+FT+EA ++IE + +FL ++HLA H+GN+G HLE +T N F YI +
Sbjct: 195 YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIEN 254
Query: 71 PNRRTYAALTKS 82
NRR YA + S
Sbjct: 255 ENRRLYAGMVTS 266
>gi|195485249|ref|XP_002091013.1| GE12487 [Drosophila yakuba]
gi|194177114|gb|EDW90725.1| GE12487 [Drosophila yakuba]
Length = 544
Score = 409 bits (1050), Expect = e-111, Method: Compositional matrix adjust.
Identities = 233/539 (43%), Positives = 314/539 (58%), Gaps = 59/539 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS EIPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL ELGY++ GKWHLG ++ +YTPL+RGF SH+G
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLHRGFSSHWGL------ 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
DMR A+D G Y TD+ T +V++I K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFL 191
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+ N L P + + +I RR +AAMV K+DDSVG ++ L++ M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVLKMSHIPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q MHI DWLPTL AAGG P + L+Q IDG W
Sbjct: 307 QRVSNQTMHIIDWLPTLLEAAGG-----PPAVANLNQ--------------QIDGQSIWR 347
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + S R +VL NID+ +AA+ + WKLV GT G+ DG+YG ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERDPRQYDW 407
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
+ +S+ ++L L LP S D+ R R AT+ C A + T T + PC
Sbjct: 408 QLVAKSRVGKALDALK---MLP-SRADQQR-IRAAATVTCPAQSSQGTSCLATAFSAPC- 461
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LF++ +DPCEQ N+A P++ + L L+ T VP S++ D +ADP+ +N TW+
Sbjct: 462 LFHIRDDPCEQYNLAKQYPEVVNALMTELERFNATAVPPSNKPAD-PRADPRFWNYTWT 519
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T +V++I K PLFLY+AH A H+ N L P
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ + +I RR +AA+
Sbjct: 212 NDVLKMSHIPHYKRRKFAAMV 232
>gi|91084739|ref|XP_970972.1| PREDICTED: similar to arylsulfatase b [Tribolium castaneum]
gi|270008608|gb|EFA05056.1| hypothetical protein TcasGA2_TC015151 [Tribolium castaneum]
Length = 558
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 231/544 (42%), Positives = 323/544 (59%), Gaps = 43/544 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G ND+ +N+IPTPNIDAL YNG++L+ Y Q CTPSRA+ +TG YPI + MQG PI
Sbjct: 38 GHNDIGLR-TNQIPTPNIDALGYNGVVLDRYYVQNACTPSRAAFLTGNYPIRSAMQGLPI 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +PL +P++L+ LGY T +GKWHLG R TP +GF+SHFGY NG
Sbjct: 97 VAGENRSLPLNMPLMPQHLKNLGYRTHIVGKWHLGSAYRSSTPTEKGFDSHFGYWNGFTG 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYD+ +D S +E G D+ T G+YAT +FT+ A+ +IE +PLFL
Sbjct: 157 YYDY-FTDFNSTAIE--GFDLHDRFETERGYQGQYATRVFTERALDIIEGHNTTRPLFLL 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
+ HLAAHAG G L P E Q + YI DP RR YA +V +LD S+G V+ L +
Sbjct: 214 MTHLAAHAGRDGTELGVPNEVEAQRTYSYIQDPRRRLYAEIVAELDRSIGQVVRKLSERQ 273
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSII+F SDNGAPTV Y N GSN+P RG+K T +EGG++ A ++SP +++
Sbjct: 274 MLENSIILFFSDNGAPTV-----GPYTNSGSNWPLRGIKLTNFEGGIRGTATIFSPLLKK 328
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V+ +++H+SDWLPT Y AAGG+ + L IDG++QW
Sbjct: 329 RGYVNKELIHVSDWLPTFYAAAGGNLADLG-PIDGVNQW--------------------- 366
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDS--WKLVLGTQENGTMDGYYGQT-RSNKVP 506
+L L+TPS R+ +L+NI+E+ T ++ D+ +KLV G E GT DGY+G + RS P
Sbjct: 367 PTLSLDTPSPRSEILVNINEQDNTTSIITDNGRFKLVTGAFEGGTYDGYFGDSGRSPDTP 426
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
+ A+++S+T ++Q+L+Q PI+ ++R TR Q + N + P C+ P
Sbjct: 427 PYDPFAVLQSETNIAIQELTQT---PITR-QQIRVTRAQIDLSWCRNDSFRPPLNCSQ-P 481
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF+L NDPCE NI P+I+ +L + LVPQ D A+P+R+N+TW
Sbjct: 482 C-LFDLENDPCETTNIIEENPEIAERLLRRIADFFEVLVPQGDSTID-PNANPQRYNNTW 539
Query: 627 SPWI 630
W+
Sbjct: 540 CTWL 543
Score = 67.4 bits (163), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 33/70 (47%), Positives = 42/70 (60%), Gaps = 2/70 (2%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ--FQYITD 70
G+YAT +FT+ A+ +IE +PLFL + HLAAHAG G L P E Q + YI D
Sbjct: 186 GQYATRVFTERALDIIEGHNTTRPLFLLMTHLAAHAGRDGTELGVPNEVEAQRTYSYIQD 245
Query: 71 PNRRTYAALT 80
P RR YA +
Sbjct: 246 PRRRLYAEIV 255
>gi|156537546|ref|XP_001607560.1| PREDICTED: arylsulfatase B-like [Nasonia vitripennis]
Length = 571
Score = 407 bits (1045), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/550 (41%), Positives = 317/550 (57%), Gaps = 60/550 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSNEIPTPNIDALAY G+ILN YA P CTPSR + +TG++PI G+QG P+
Sbjct: 37 GWNDVGFHGSNEIPTPNIDALAYGGVILNRHYALPTCTPSRTAFLTGRHPIRMGLQGIPM 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRGVPL ER LPEYLRELGY T+ +GKWHLG++ ++TP RGF+S GY GVI+
Sbjct: 97 NVAEPRGVPLHERLLPEYLRELGYVTRLVGKWHLGYYTDKHTPTRRGFDSFVGYYGGVIT 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--------EYATDLFTKEAVQLIEDQP 263
Y++H ++ D + WDT G +Y TD + +A +I++
Sbjct: 157 YFNHTVT-----------KDKHTGIDYHWDTSGKIEPFDNDQYVTDFISDQAEAVIKNHD 205
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQ--ETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
KPLFL LAH+AAHA +E E + YI D NRR YA +V +DDSVG V
Sbjct: 206 RKKPLFLQLAHVAAHASENRDPIEVRNMTEVNDTLSYIPDINRRKYAGVVTAMDDSVGRV 265
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ ML NSIIIFMSDNG+PT E + Y N+GSNYP RG+K T++EGGV+VPA
Sbjct: 266 VKALKDANMLSNSIIIFMSDNGSPTAE----APYTNYGSNYPLRGIKATVFEGGVRVPAC 321
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
++SP+++ RVS ++ HI+DW PTLY AGGD S++ ++DG+DQWSS+ + S++
Sbjct: 322 VFSPRLKDRFRVSDELFHITDWFPTLYKLAGGDLSKIQ-DLDGVDQWSSI---SGSQK-- 375
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
S R S+L+NIDE A +KL+ G D YYG+
Sbjct: 376 ----------------SNRESLLVNIDEVSNPEAAISGYYKLIRGINR---YDDYYGKDG 416
Query: 502 SNKVP-LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
++ P + ++ S +++ L P I ++R+ +AT+ C
Sbjct: 417 NDYSPKTYDVTGVLSSLAGRAIASLGNQYLPPQKRITELRN---KATLRCEKK----DDR 469
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
P C LF++ DPCE NI P++ L ++ ++ L+ Q++ D P+
Sbjct: 470 PSCRDTC-LFDIVKDPCETTNIIDEHPEMEIVLNTYIQGYKAVLMNQTNTYIDPAGY-PE 527
Query: 621 RFNDTWSPWI 630
FN W PW+
Sbjct: 528 NFNGYWMPWM 537
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 31/84 (36%), Positives = 42/84 (50%), Gaps = 10/84 (11%)
Query: 9 WDTVG--------EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ- 59
WDT G +Y TD + +A +I++ KPLFL LAH+AAHA +E
Sbjct: 174 WDTSGKIEPFDNDQYVTDFISDQAEAVIKNHDRKKPLFLQLAHVAAHASENRDPIEVRNM 233
Query: 60 -ETINQFQYITDPNRRTYAALTKS 82
E + YI D NRR YA + +
Sbjct: 234 TEVNDTLSYIPDINRRKYAGVVTA 257
>gi|350422910|ref|XP_003493325.1| PREDICTED: arylsulfatase I-like [Bombus impatiens]
Length = 563
Score = 406 bits (1043), Expect = e-110, Method: Compositional matrix adjust.
Identities = 218/542 (40%), Positives = 320/542 (59%), Gaps = 44/542 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG+++IPTPNIDALAYNG+IL Y P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50 GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGYPL 109
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +PL LPEYLR+LGY+T +GKWH+G++ +TP YRGF++ GY +G I+
Sbjct: 110 KAGEERAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGYIT 169
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + +L+ +D+ NLS + EY TDL T+ A +I + KPL+L
Sbjct: 170 YFKHTIEQNLHVGYDLH-YDVAGNLSVKYSH--EYMTDLITERAEDIIFNHNRSKPLYLQ 226
Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L+H+AAH+ +A ++E +ET YI D +RR A +V +D+SVG V+ AL++
Sbjct: 227 LSHVAAHSSDAKANMEVRDEEETNATLGYIEDFDRRKLAGVVTAMDESVGRVVQALRQTN 286
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSII+FMSDNGA TV N+GSNYP RG+K TL+EGGV+ A ++SP I+
Sbjct: 287 MLENSIIVFMSDNGAQTVGL-----LNNYGSNYPLRGLKFTLFEGGVRGVACVYSPAIKN 341
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R+S Q++H++DWLPT Y+AAGG+ L N+DG+D QW
Sbjct: 342 PSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVD---------------------QW 380
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
++++ +RR +VL+NIDEK+ + + +KL+ G N + YYG + S P
Sbjct: 381 ATIVSEKKTRRKNVLLNIDEKQALSGALMGRYKLINGA--NVSYGDYYGDSGTSESYPEY 438
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
N + ++ S +L +S F N +++ R+ AT+ C T P C
Sbjct: 439 NVSNVLHSAAGSALASVSNATF----NSERIIELRRAATVVCDN----FTSYPRCLDRC- 489
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LF++ DPCE +++ P++ +L + + L Q+ D ++ P+ FN TW P
Sbjct: 490 LFDVYKDPCETTDVSREHPEVVDELNRFIAEYFAVLKTQTSGLVD-PRSYPEHFNGTWMP 548
Query: 629 WI 630
WI
Sbjct: 549 WI 550
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 45/81 (55%), Gaps = 4/81 (4%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQET 61
NLS + EY TDL T+ A +I + KPL+L L+H+AAH+ +A ++E +ET
Sbjct: 192 NLSVKYSH--EYMTDLITERAEDIIFNHNRSKPLYLQLSHVAAHSSDAKANMEVRDEEET 249
Query: 62 INQFQYITDPNRRTYAALTKS 82
YI D +RR A + +
Sbjct: 250 NATLGYIEDFDRRKLAGVVTA 270
>gi|195431744|ref|XP_002063888.1| GK15669 [Drosophila willistoni]
gi|194159973|gb|EDW74874.1| GK15669 [Drosophila willistoni]
Length = 556
Score = 404 bits (1038), Expect = e-110, Method: Compositional matrix adjust.
Identities = 228/540 (42%), Positives = 311/540 (57%), Gaps = 57/540 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTPNIDALAY+G+ILN Y P+CTPSR+SLMTGKY IHTGMQ +
Sbjct: 37 GFNDVGFHGSAQIPTPNIDALAYSGLILNRYYVAPICTPSRSSLMTGKYAIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+GAEPRG+PL E+ LP+YL +LGY++ GKWHLG ++ YTPL RGF SH G+
Sbjct: 97 YGAEPRGLPLEEKLLPQYLNDLGYTSHIAGKWHLGHWKMPYTPLRRGFNSHHGFW----- 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
G DMR A++ G+Y TD+ T+ ++ +I + PV K PLFL
Sbjct: 152 -----------------GLDMRNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFL 194
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH AAH+GN L + + + I RR YAAMV KLD+SVG ++ L+R M
Sbjct: 195 YVAHAAAHSGNPYNPLPVADDEVIKLGRIEHYKRRRYAAMVSKLDESVGIIVDQLRRSNM 254
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSII+F +DNG + N+ SNYP RGVKNTLWEGGV+ +LWS + +
Sbjct: 255 LENSIIVFATDNGGAAEGFN-----LNFASNYPLRGVKNTLWEGGVRGVGLLWSQRYLKR 309
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
PRV+ Q MHI+DWLPT AGG + + +++DG W
Sbjct: 310 PRVAEQTMHITDWLPTFVEVAGGKEAL-------------------ANFTTSLDGQSIWQ 350
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + S R SVL NID+ ++A+ + WKL+ GT G DG+YG L N+
Sbjct: 351 ALVQDEASPRKSVLHNIDDIWNSSALTVGDWKLIKGTNYKGAWDGWYGPAGERDPQLYNW 410
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN---GPC 567
+ +S +++ QL LP S D+ R R AT++C + N PC
Sbjct: 411 QQVAKSLAGKAMLQLK---VLP-SRADQQR-LRSAATVNCPNQQSNTQKGSVCNPLQTPC 465
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LFNL +DPCEQ N+A+ P + + L LK+ T +P S+ PD + +PK +N TW+
Sbjct: 466 -LFNLRDDPCEQYNLATQYPLVLNSLLSELKHFNSTAIPPSN-LPDDPRGNPKFWNYTWT 523
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 32/91 (35%), Positives = 50/91 (54%), Gaps = 1/91 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A++ G+Y TD+ T+ ++ +I + PV K PLFLY+AH AAH+GN L
Sbjct: 155 MRNGSRVAYELHGQYTTDVITQHSIDVIANHPVSKGPLFLYVAHAAAHSGNPYNPLPVAD 214
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIV 90
+ + + I RR YAA+ ++ I+
Sbjct: 215 DEVIKLGRIEHYKRRRYAAMVSKLDESVGII 245
>gi|91084737|ref|XP_970917.1| PREDICTED: similar to arylsulfatase B [Tribolium castaneum]
Length = 531
Score = 404 bits (1037), Expect = e-110, Method: Compositional matrix adjust.
Identities = 224/542 (41%), Positives = 310/542 (57%), Gaps = 64/542 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+SFHGS++IPTPN+ +A GIIL+ Y Q CTPSR +L+TG+YPI +GMQG P+
Sbjct: 35 GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +PL +P + + LGY T +GKWHLG +E TPL +GF+SHFGY NG +
Sbjct: 95 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+D++ + + G D+ W + G YAT+LFT+ ++ +IE V PLFL
Sbjct: 155 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 214
Query: 272 LAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
++HLAAH G G L P +T ++F YI DP RR YA +V LD S+G +++ L K
Sbjct: 215 VSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQ 274
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSI++F SDNGA TV Y N GSN+P RGVK + +EGGV+V A ++SP +
Sbjct: 275 MLDNSIVLFFSDNGAQTV-----GMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHK 329
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
VS ++HISDWLPTLY+AAGGD + L IDG+DQW +L +N
Sbjct: 330 KGYVSEHLIHISDWLPTLYSAAGGDVAHLG-QIDGIDQWDAL---------TN------- 372
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
N PS R +LINIDE A+ D +KL+ G+ GT D YYG + R + P
Sbjct: 373 -----NNPSNRTEILINIDEVDENFAIIRDKFKLIQGSYHEGTFDQYYGDSGRGPENPTP 427
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
N N +++ R+ Q ++ CT G C
Sbjct: 428 NPNHTT-------------------TDLSWCRAPDQTPILN------------CTKG-C- 454
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LF+L DPCE NI S P+I++QLYE + + LVPQ ++ D ++DP +N+TW
Sbjct: 455 LFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKELVPQRNKDTD-PKSDPIFYNNTWCT 513
Query: 629 WI 630
W+
Sbjct: 514 WL 515
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQF 65
W + G YAT+LFT+ ++ +IE V PLFL ++HLAAH G G L P +T ++F
Sbjct: 182 VWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEF 241
Query: 66 QYITDPNRRTYAALT 80
YI DP RR YA +
Sbjct: 242 SYIQDPRRRLYAGVV 256
>gi|195380485|ref|XP_002049001.1| GJ21349 [Drosophila virilis]
gi|194143798|gb|EDW60194.1| GJ21349 [Drosophila virilis]
Length = 531
Score = 403 bits (1036), Expect = e-109, Method: Compositional matrix adjust.
Identities = 234/540 (43%), Positives = 315/540 (58%), Gaps = 56/540 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ F GS +IPTPNIDALAY+G+ILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 32 GFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSALMTGKYPIHTGMQHTVL 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL + LP+YL +LGY++ GKWHLG ++R YTPLYRGF SH GY
Sbjct: 92 YAAEPRGLPLDLKILPQYLNDLGYTSHIAGKWHLGHWQRVYTPLYRGFSSHHGYW----- 146
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
G DMR A+D G+Y+TD+ T+ ++ +I K PLFL
Sbjct: 147 -----------------GLDMRNGTEVAYDLHGQYSTDVITQHSLNVISKHNATKGPLFL 189
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+GN L + + + I RR YAA+V +D+SVG ++ L++ M
Sbjct: 190 YVAHAAVHSGNPYNPLPVKDDAVRRLDTIQHYKRRKYAALVTAMDESVGKIVEQLRKSRM 249
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LE SII+F +DNG P + N+ SNYP RGVKNTLWEGGV+ A+LWSPQ+ +
Sbjct: 250 LETSIIVFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEGGVRGAALLWSPQLTKR 304
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
PR + Q MHI DWLPTL AAGG+ + L + +DG+ W
Sbjct: 305 PRTAEQTMHIVDWLPTLVEAAGGEAALAKLG------------------TAKLDGISLWQ 346
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+LL N S R SVL NID+ +AA+ + WKLV GT NG+ DG+YG ++
Sbjct: 347 ALLKNETSPRKSVLHNIDDIWGSAALSVGDWKLVKGTNYNGSWDGWYGPAGMRNPHDYDW 406
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANP--APMTPSPCTNGPC 567
++ +S T Q++QQL LP D +R R+ A + C ANP AP + PC
Sbjct: 407 QSVAKSATGQAMQQLK---MLP-KTADMIR-LREAANVICKDANPKDAPHIACNPLSEPC 461
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LFN+ DPCEQ N A P + L + L+ T VP ++ +PD +ADP+ +N TW+
Sbjct: 462 -LFNVREDPCEQRNQAKQHPKVLELLLKELQLLNATAVPPAN-KPDDPRADPRLWNHTWT 519
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 30/83 (36%), Positives = 44/83 (53%), Gaps = 1/83 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G+Y+TD+ T+ ++ +I K PLFLY+AH A H+GN L
Sbjct: 150 MRNGTEVAYDLHGQYSTDVITQHSLNVISKHNATKGPLFLYVAHAAVHSGNPYNPLPVKD 209
Query: 60 ETINQFQYITDPNRRTYAALTKS 82
+ + + I RR YAAL +
Sbjct: 210 DAVRRLDTIQHYKRRKYAALVTA 232
>gi|193641124|ref|XP_001950120.1| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 599
Score = 402 bits (1032), Expect = e-109, Method: Compositional matrix adjust.
Identities = 240/568 (42%), Positives = 329/568 (57%), Gaps = 56/568 (9%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
+LIV GWND+ FHGS +IPTPNIDALAYNG+ILN Y QP CTPSRA+L+TGKYPI
Sbjct: 37 ILIVADDLGWNDVGFHGSIQIPTPNIDALAYNGVILNRHYVQPTCTPSRAALLTGKYPIR 96
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
G+QG PI P +PL E+ LP+YL++LGYST +GKWHLG + ++TP+ RGF+SHF
Sbjct: 97 YGLQGFPIIAGVPLALPLNEKILPQYLKDLGYSTHLVGKWHLGANKNQHTPIKRGFDSHF 156
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIE-D 261
GY NG ISY + S T + G D RR A D V YATD+FT EA ++I+
Sbjct: 157 GYWNGFISYRNSTHS-----TGLMVGKDARRGFERAGDEMVDRYATDIFTDEANKVIKLC 211
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSV 318
+ DKP+FL ++HLA H G G + LE +T N +F YI + RR YA M+ LD+SV
Sbjct: 212 KNHDKPMFLMVSHLAVHTGVPGPNILEVSNKTHNDIRFDYIENKERRLYAGMLTSLDESV 271
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G++I +L GMLE+SI++F+SDNGAP + Y N GSN+P RG K + +GGV+
Sbjct: 272 GSIIESLDNNGMLEDSIVLFISDNGAPADD--PIWGYGNSGSNWPLRGEKGAVLDGGVRG 329
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
A +WSP +++ R+ + HI+DWLPTLYTAAGGD L IDG+DQW S L S+
Sbjct: 330 VAAIWSPWLKKKHRIFENLFHITDWLPTLYTAAGGDFEDLG-QIDGVDQWES--LTETSK 386
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD-GYY 497
R R+ VLINIDE A+ + WK+V + N T D YY
Sbjct: 387 RT-------------------RSMVLINIDETSGEEALIFNHWKVV---KSNRTSDIAYY 424
Query: 498 GQTR---SNKVPLLNFNAIVESKTYQSLQQLS-----QNIFLPISNIDKMRSTRQQATI- 548
+ N P N A+ ES L +++ + ++ D + R QA I
Sbjct: 425 LKYSGEPGNVGPDYNMTAVAESLAGSRLSKINCLVSKADCMDTLTTYDLFYNVRSQAKID 484
Query: 549 -HCG---ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
C ++P P C + PC LF++ DPCE NIAS P+ + ++L + +
Sbjct: 485 GKCSERISDPDPNARYECFDSPC-LFDIQRDPCEYQNIASRNPEALNMTADILAQFKNEM 543
Query: 605 VPQSHEQPDLVQADPKRFNDTWSPWIYR 632
Q++ + D ++DP+ F+ W W+ +
Sbjct: 544 TMQNYPETD-PKSDPRFFDGYWDTWLEK 570
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 37/86 (43%), Positives = 50/86 (58%), Gaps = 5/86 (5%)
Query: 2 RRNLSTAWD-TVGEYATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKH-LEAP 58
RR A D V YATD+FT EA ++I+ + DKP+FL ++HLA H G G + LE
Sbjct: 181 RRGFERAGDEMVDRYATDIFTDEANKVIKLCKNHDKPMFLMVSHLAVHTGVPGPNILEVS 240
Query: 59 QETIN--QFQYITDPNRRTYAALTKS 82
+T N +F YI + RR YA + S
Sbjct: 241 NKTHNDIRFDYIENKERRLYAGMLTS 266
>gi|340727298|ref|XP_003401983.1| PREDICTED: arylsulfatase B-like [Bombus terrestris]
Length = 563
Score = 401 bits (1031), Expect = e-109, Method: Compositional matrix adjust.
Identities = 218/542 (40%), Positives = 326/542 (60%), Gaps = 44/542 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG+++IPTPNIDALAYNG+IL Y P+CTPSR + +TG+YPI TGMQG P+
Sbjct: 50 GWNDVSFHGADQIPTPNIDALAYNGVILQRHYVLPICTPSRTAFLTGRYPIRTGMQGHPL 109
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +PL LPEYLR+LGY+T +GKWH+G++ +TP YRGF++ GY +G ++
Sbjct: 110 DPGEVRAIPLNNTLLPEYLRKLGYATHLVGKWHVGYYSDYHTPAYRGFDTFLGYYSGFMT 169
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H + + +L+ +D+ NLS + EY TDL T+ A +I + KPL+L
Sbjct: 170 YFNHTIEQNHHVGYDLH-YDVAGNLSVKYSH--EYMTDLITERAEDIILNHNHSKPLYLQ 226
Query: 272 LAHLAAHAGNAGKHLEA--PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L+H+AAH+ N K +E +ET YI D +RR A +V LD+SVG V+ AL++
Sbjct: 227 LSHIAAHSSNINKTVEVRDEEETNATLGYIEDFDRRKLAGVVTALDESVGRVVQALKQAN 286
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSII+FMSDNGA TV + S GSNYP RG+K TL+EGGV+ A ++SP I+
Sbjct: 287 MLENSIILFMSDNGAQTVGLLDNS-----GSNYPLRGMKFTLFEGGVRGAACVYSPAIKN 341
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R+S Q++H++DWLPT Y+AAGG+ L N+DG+DQW++++ S N
Sbjct: 342 PSRISNQLIHVTDWLPTFYSAAGGNLEDLEENLDGVDQWATIV----SEEN--------- 388
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
+RR ++L+NIDEK+ + + +KL+ G + YYG + S P
Sbjct: 389 --------TRRKNILLNIDEKEDLSGALMGRYKLINGAKLQ--YGDYYGDSGMSESYPEY 438
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
N + ++ S ++ +S N+ L N +++ R+ A + C T P C
Sbjct: 439 NVSNVLHSAAGFAVASVS-NVTL---NSERIIELRRAAMVVCDN----FTSYPKCLDRC- 489
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LF++ NDPCE +++ P++ ++ + + L+ Q+ + D ++ P+ FN TW P
Sbjct: 490 LFDVYNDPCETTDVSREHPEVVDEINRFIAEYFTVLMTQTPTKTD-PRSYPENFNGTWMP 548
Query: 629 WI 630
WI
Sbjct: 549 WI 550
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 44/81 (54%), Gaps = 4/81 (4%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA--PQET 61
NLS + EY TDL T+ A +I + KPL+L L+H+AAH+ N K +E +ET
Sbjct: 192 NLSVKYSH--EYMTDLITERAEDIILNHNHSKPLYLQLSHIAAHSSNINKTVEVRDEEET 249
Query: 62 INQFQYITDPNRRTYAALTKS 82
YI D +RR A + +
Sbjct: 250 NATLGYIEDFDRRKLAGVVTA 270
>gi|449680619|ref|XP_002157149.2| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 502
Score = 400 bits (1029), Expect = e-109, Method: Compositional matrix adjust.
Identities = 226/540 (41%), Positives = 321/540 (59%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSNEIPTPNID LA NG+IL+N Y P+CTPSR+++MTG+YPIHTGMQ I
Sbjct: 31 GWNDISFHGSNEIPTPNIDRLANNGVILDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 90
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G P GV L E+FLP+YL++ GY T +GKWHLGFF ++YTP YRGF+S++G G
Sbjct: 91 FGPNPYGVGLNEKFLPQYLKQQGYKTHGVGKWHLGFFAKQYTPTYRGFDSYYGSYLGKGD 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H ++ YS G D+ N + + G Y+T+++T EA+ I + +PLFLY
Sbjct: 151 YWNHSNTETYS------GLDLHDNENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLY 204
Query: 272 LAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
LA+ A H+ N + L+APQE I++F YI RR YAAM+ +D VG V AL +K M
Sbjct: 205 LAYQAVHSANTEEDPLQAPQEWIDKFSYIKHEQRRKYAAMLGYMDYGVGRVHDALAKKKM 264
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+NSIIIF +DNG P + NW +N+P RGVK TL+EGGV+ + ++S I ++
Sbjct: 265 LDNSIIIFTTDNGGPANGFD-----YNWANNFPLRGVKATLFEGGVRGVSFVYSKLI-ES 318
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
PRVS +++HI+DWLPTL AGG S DG +DG DQW+
Sbjct: 319 PRVSHELIHITDWLPTLVNLAGGKVS------DGF-----------------LDGFDQWA 355
Query: 451 SLLLNTPSRRNSVLINIDEKK-RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+L S+RN VL+NIDEK + A+R+ SWK++ +E DG+Y P +
Sbjct: 356 TLQNKQSSQRNEVLLNIDEKVWKNEALRVGSWKII---KEGNYWDGWY--------PPPS 404
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
FN + S N F +S +T+ CG + P+ + C + C L
Sbjct: 405 FN------------EQSNNSFSYLS-----------STVKCGHD-IPIVINHC-DSYC-L 438
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
F++ DPCE N+++ P++ ++L L +R+++VP + ++DPK N W PW
Sbjct: 439 FHIDEDPCEINDLSKKFPEVLAELINRLNTYRQSMVPPRNNMTIDPRSDPKLHNGVWQPW 498
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 31/68 (45%), Positives = 45/68 (66%), Gaps = 1/68 (1%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDP 71
G Y+T+++T EA+ I + +PLFLYLA+ A H+ N + L+APQE I++F YI
Sbjct: 177 GNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKHE 236
Query: 72 NRRTYAAL 79
RR YAA+
Sbjct: 237 QRRKYAAM 244
>gi|312382061|gb|EFR27642.1| hypothetical protein AND_05535 [Anopheles darlingi]
Length = 881
Score = 397 bits (1020), Expect = e-107, Method: Compositional matrix adjust.
Identities = 234/543 (43%), Positives = 311/543 (57%), Gaps = 46/543 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G ND+ FHGSN+IPTPNIDALAY+GIILN Y+ P+CTPSRA+LMTG++P++ GMQ I
Sbjct: 45 GLNDVGFHGSNQIPTPNIDALAYDGIILNRHYSAPMCTPSRAALMTGRHPMNVGMQHYVI 104
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L E+ LP+Y RE GY T IGKWHLGF+ Y P RGF++H GYL I
Sbjct: 105 DSDEPWGLGLQEKLLPQYFREAGYRTHLIGKWHLGFYAEPYLPTNRGFDTHIGYLGPYID 164
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
Y+ +I S S T E G+D+R+NL+ + G YATD FT+ AV++I + + L
Sbjct: 165 YWSYI-SKMDSATFE--GYDLRQNLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLL 221
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L HLA H GN L+AP+ETI +F YI D +RRTYAAM+ KLDDSVG V ALQ+ GM
Sbjct: 222 VLNHLAPHTGNDDAPLQAPEETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVYRALQQHGM 281
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSII+F+SDNG T + N GSNYP RG K++ WEG V+ A++WSP ++
Sbjct: 282 LENSIIVFISDNGGVT-----RGMHSNSGSNYPLRGQKHSPWEGAVRTAALIWSPLLKDR 336
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN---IDGLD 447
RVS Q HISDWLPTL +AAG P+ R N IDG+D
Sbjct: 337 QRVSNQWFHISDWLPTLASAAG----------------------IPAYRGDNLSDIDGID 374
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
QW +L T + R V+ N+DE + + K V GT +G DG+YG +
Sbjct: 375 QWEALAYGTGNPRQRVMNNLDEIFGYTSYVDNGLKYVNGTTLDGVNDGWYGA-------I 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
+ + + + Y S L + + R+ A I CG + +P PC
Sbjct: 428 DDTDTLPSNAEYLRTVMASHVGRLTKPDEKVVSYLREHARIDCGNDIGNRDCNPLRR-PC 486
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELL-KYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
LF++ NDPCE +++ PD Q+ E + KY R P++ +P ADP +F W
Sbjct: 487 -LFDIINDPCEMHDLFERYPDKVRQIQERVEKYRLRAAKPRN--RPHDPAADPSKFGGVW 543
Query: 627 SPW 629
+ W
Sbjct: 544 NWW 546
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 55/92 (59%), Gaps = 1/92 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+R+NL+ + G YATD FT+ AV++I + + L L HLA H GN L+AP+
Sbjct: 182 LRQNLAVNYKPNGTYATDYFTEAAVEIIRSHNRTGERMLLVLNHLAPHTGNDDAPLQAPE 241
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
ETI +F YI D +RRTYAA+ ++ VY
Sbjct: 242 ETIEKFAYIRDTDRRTYAAMMSKLDDSVGQVY 273
>gi|291244830|ref|XP_002742299.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 559
Score = 396 bits (1017), Expect = e-107, Method: Compositional matrix adjust.
Identities = 195/408 (47%), Positives = 264/408 (64%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS++IPTPNID LAY+G++L+N Y QP+CTP+RA+LMTG++PIH G+Q I
Sbjct: 39 GWDDVSFHGSDQIPTPNIDELAYSGVLLHNYYVQPICTPTRAALMTGRHPIHLGLQDGVI 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P G+PL E +P+YL+ LGY T +GKWHLGFF +YTPLYRGF++HFGY NG
Sbjct: 99 VASHPYGLPLNETIMPQYLKPLGYDTHIVGKWHLGFFAWQYTPLYRGFDTHFGYYNGEEG 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH + + G D R N GEY+T+LFT A ++I + +KPLFLY
Sbjct: 159 YYDHTAEEP-----KYIGLDFRNNTELFKSAYGEYSTELFTSYAEKIIHNHNKNKPLFLY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H+GN+ LEAP + ++F YI D RRT+A MV LDDSVG + AL+ G+
Sbjct: 214 LAHQAVHSGNSYSPLEAPYKYTSRFPYIQDERRRTFAGMVSALDDSVGNITRALKHSGLY 273
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
EN II+F +DNG P + N+ SN+P RG+K+TLWEGGV+ + SP +++
Sbjct: 274 ENCIILFSTDNGGPAAGFDA-----NYASNWPLRGIKHTLWEGGVRGDGFIHSPLLEKPG 328
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R+S M+H+ DWLPT+Y AGG+ S LP N+DG + W +L SR
Sbjct: 329 RISTDMIHVCDWLPTIYHIAGGNVSSLPTNLDGFNVWDTL-----SR------------- 370
Query: 452 LLLNTPSRRNSVLINIDEKK--RTAAVRLDSWKLVLGTQENGTMDGYY 497
TPS R +L NID K AA+R+ +K++LGT+E+G DG+Y
Sbjct: 371 ---GTPSPRTEILHNIDPHKNHNAAALRVGEFKIILGTEEHGEWDGWY 415
Score = 73.9 bits (180), Expect = 3e-10, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 50/82 (60%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
R N GEY+T+LFT A ++I + +KPLFLYLAH A H+GN+ LEAP +
Sbjct: 174 FRNNTELFKSAYGEYSTELFTSYAEKIIHNHNKNKPLFLYLAHQAVHSGNSYSPLEAPYK 233
Query: 61 TINQFQYITDPNRRTYAALTKS 82
++F YI D RRT+A + +
Sbjct: 234 YTSRFPYIQDERRRTFAGMVSA 255
>gi|194756524|ref|XP_001960527.1| GF13402 [Drosophila ananassae]
gi|190621825|gb|EDV37349.1| GF13402 [Drosophila ananassae]
Length = 541
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 229/539 (42%), Positives = 309/539 (57%), Gaps = 59/539 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 38 GFNDVGFHGSAQIPTPNIDALAYSGIILNRYYVTPICTPSRSALMTGKYPIHTGMQHAVL 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+ L E+ LP+YL +LGY++ GKWHLG ++ +YTPL+RGF SH+G
Sbjct: 98 YAAEPRGLSLKEKILPQYLNDLGYTSHIAGKWHLGHWKLKYTPLFRGFSSHWGL------ 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
DMR A+D G Y TD+ T AV++I + PLFL
Sbjct: 152 -------------------DMRNGTEVAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFL 192
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+ N L P + + +I RR +AAMV K+D+SVG ++ L++ M
Sbjct: 193 YVAHAACHSSNPYNPLPVPDNEVMKLGHIPHYKRRKFAAMVTKMDESVGLIVDQLRKSNM 252
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++
Sbjct: 253 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLIWSPLLKSR 307
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q +HISDWLPTL AAGG + S + IDG W
Sbjct: 308 QRVSTQTIHISDWLPTLLEAAGGAAAL-------------------SNLSKGIDGQSIWQ 348
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + PS R V+ NID+ +A + + WKLV GT G DG+YG L ++
Sbjct: 349 ALVHDEPSPRLKVIHNIDDIWGSAGLSVGDWKLVKGTNYYGKWDGWYGPAGERDPHLYDW 408
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANP-APMTPSPCTNGPCY 568
I +S++ ++L L LP S ++ R R AT+ C G +P + + PC
Sbjct: 409 QQIPKSRSGKALNGLK---MLP-SQAEQQR-LRAAATVKCLGQSPQGSSCKATAFSAPC- 462
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LFN+ +DPCEQ N+A P + L E LKY T VP ++ +P +A PK +N TW+
Sbjct: 463 LFNIHDDPCEQFNLAEQYPKVLKALMEELKYQNSTAVPPAN-KPGDRRASPKYWNYTWT 520
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 34/92 (36%), Positives = 46/92 (50%), Gaps = 2/92 (2%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T AV++I + PLFLY+AH A H+ N L P
Sbjct: 153 MRNGTEVAYDLHGRYTTDVITDHAVKVIANHNTTSGPLFLYVAHAACHSSNPYNPLPVPD 212
Query: 60 ETINQFQYITDPNRRTYAAL-TKSTTLTLLIV 90
+ + +I RR +AA+ TK LIV
Sbjct: 213 NEVMKLGHIPHYKRRKFAAMVTKMDESVGLIV 244
>gi|270008609|gb|EFA05057.1| hypothetical protein TcasGA2_TC015152 [Tribolium castaneum]
Length = 563
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 229/561 (40%), Positives = 310/561 (55%), Gaps = 85/561 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+SFHGS++IPTPN+ +A GIIL+ Y Q CTPSR +L+TG+YPI +GMQG P+
Sbjct: 50 GYNDVSFHGSSQIPTPNLAKMATRGIILDRFYTQSTCTPSRTALLTGQYPIRSGMQGYPL 109
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +PL +P + + LGY T +GKWHLG +E TPL +GF+SHFGY NG +
Sbjct: 110 KAGENRSLPLNMPTMPLHFQNLGYKTHLVGKWHLGAAYKEDTPLGKGFDSHFGYWNGFVG 169
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+D++ + + G D+ W + G YAT+LFT+ ++ +IE V PLFL
Sbjct: 170 YFDYVSFSKMDNGTLVKGLDLHDQFEPVWGSQGRYATELFTERSLDVIEGHDVRVPLFLV 229
Query: 272 LAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
++HLAAH G G L P +T ++F YI DP RR YA +V LD S+G +++ L K
Sbjct: 230 VSHLAAHTGQNGSELGVPDVDQTNHEFSYIQDPRRRLYAGVVSHLDASIGRIMAKLDEKQ 289
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSI++F SDNGA TV Y N GSN+P RGVK + +EGGV+V A ++SP +
Sbjct: 290 MLDNSIVLFFSDNGAQTV-----GMYENSGSNWPLRGVKFSDFEGGVRVAATIYSPLFHK 344
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
VS ++HISDWLPTLY+AAGGD + L IDG+DQW +L +N
Sbjct: 345 KGYVSEHLIHISDWLPTLYSAAGGDVAHLG-QIDGIDQWDAL---------TN------- 387
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
N PS R +LINIDE A+ D +KL+ GT + YY QT L
Sbjct: 388 -----NNPSNRTEILINIDEVDENFAIIRDKFKLI-----QGTPNYYYQQT-------LL 430
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP-------- 561
FN +S +Y T Q G P TP+P
Sbjct: 431 FNC--KSGSYH-------------------EGTFDQYYGDSGRGPENPTPNPNHTTTDLS 469
Query: 562 ------------CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CT G C LF+L DPCE NI S P+I++QLYE + + LVPQ +
Sbjct: 470 WCRAPDQTPILNCTKG-C-LFDLDKDPCETTNIIESEPEIANQLYEKIAQFWKELVPQRN 527
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
+ D ++DP +N+TW W+
Sbjct: 528 KDTD-PKSDPIFYNNTWCTWL 547
Score = 71.2 bits (173), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 46/75 (61%), Gaps = 2/75 (2%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQF 65
W + G YAT+LFT+ ++ +IE V PLFL ++HLAAH G G L P +T ++F
Sbjct: 197 VWGSQGRYATELFTERSLDVIEGHDVRVPLFLVVSHLAAHTGQNGSELGVPDVDQTNHEF 256
Query: 66 QYITDPNRRTYAALT 80
YI DP RR YA +
Sbjct: 257 SYIQDPRRRLYAGVV 271
>gi|195124259|ref|XP_002006611.1| GI18487 [Drosophila mojavensis]
gi|193911679|gb|EDW10546.1| GI18487 [Drosophila mojavensis]
Length = 528
Score = 395 bits (1015), Expect = e-107, Method: Compositional matrix adjust.
Identities = 233/556 (41%), Positives = 315/556 (56%), Gaps = 59/556 (10%)
Query: 76 YAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
YA L G+ND+ F GS +IPTPNIDALAY+G+ILN Y P+CTPSR++L
Sbjct: 16 YAIAQPHIIFILADDLGFNDVGFRGSAQIPTPNIDALAYSGLILNRYYVNPICTPSRSAL 75
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
MT KYPIHTGMQ ++ AEPRG+PL + LP+YL +LGY++ GKWHLG ++R YTPL
Sbjct: 76 MTAKYPIHTGMQHTVLYAAEPRGLPLNLKILPQYLNDLGYTSHIAGKWHLGHWKRVYTPL 135
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
YRGF SH+G DMR A+D G+Y TD+ T+ +
Sbjct: 136 YRGFSSHWGL-------------------------DMRNGTELAYDLHGQYTTDVITQHS 170
Query: 256 VQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
+ +I K PLFLY+AH A H+GN L A + + + I D RR YAA+V +
Sbjct: 171 LNVIAKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKDDIVRRLGTIQDYKRRKYAALVTAM 230
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
DDSVG ++ L++ ML+NSIIIF +DNG P + N+ SNYP RGVKNTLWEG
Sbjct: 231 DDSVGRIVEQLRKSHMLDNSIIIFSTDNGGPAEGFNS-----NFASNYPLRGVKNTLWEG 285
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
GV+ ++WS Q+ + PRV+ Q MHI+DWLPTL AAGG+ + LN
Sbjct: 286 GVRGAGLVWSAQLSKRPRVADQTMHIADWLPTLLEAAGGEAALAQLN------------- 332
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
N+ +DG+ W +LL N S R SVL NID+ +AA+ + WKLV GT NG+ D
Sbjct: 333 -----NAQLDGMSVWQALLKNEVSPRKSVLHNIDDIWGSAALSVGDWKLVKGTNYNGSWD 387
Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
G+YG + + ++ + +S Q+LQQL LP D MR+ R+ A + C
Sbjct: 388 GWYGPAGTRNLRDYDWYLVAKSLAGQALQQLK---VLP-KTADMMRA-REAANVICKDAR 442
Query: 555 APMTPSPCTN---GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
P+ N PC LFN+ DPCEQ N A P + + + L+ T VP ++ +
Sbjct: 443 LKDAPNISCNPLKAPC-LFNIREDPCEQRNQADVYPKVLELMLKELQLINSTAVPPAN-K 500
Query: 612 PDLVQADPKRFNDTWS 627
PD P+ +N TW+
Sbjct: 501 PDDPLGYPRFWNYTWT 516
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 45/83 (54%), Gaps = 1/83 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G+Y TD+ T+ ++ +I K PLFLY+AH A H+GN L A
Sbjct: 147 MRNGTELAYDLHGQYTTDVITQHSLNVIAKHNSTKGPLFLYVAHAAVHSGNPYNPLPAKD 206
Query: 60 ETINQFQYITDPNRRTYAALTKS 82
+ + + I D RR YAAL +
Sbjct: 207 DIVRRLGTIQDYKRRKYAALVTA 229
>gi|156408341|ref|XP_001641815.1| predicted protein [Nematostella vectensis]
gi|156228955|gb|EDO49752.1| predicted protein [Nematostella vectensis]
Length = 512
Score = 395 bits (1014), Expect = e-107, Method: Compositional matrix adjust.
Identities = 218/563 (38%), Positives = 313/563 (55%), Gaps = 81/563 (14%)
Query: 76 YAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
+ ALT + ++ + GW+D+SFHGS +IPTPNID LA G+ILNN Y P+CTP+
Sbjct: 15 FVALTANKKPHIVFIVADDLGWDDVSFHGSGQIPTPNIDGLAKTGVILNNYYVSPICTPT 74
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
R+++MTGKYPIHTGMQ I A+P G+ L E +P+YL+ LGY+T +GKWHLGFF+ E
Sbjct: 75 RSAIMTGKYPIHTGMQHSVILAAQPYGLGLNETLMPQYLKRLGYATHGVGKWHLGFFKYE 134
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
YTP+ RGF+S+FGY G Y+DH +++Y ++L HD +++ T W G Y++DLF
Sbjct: 135 YTPIQRGFDSYFGYWCGKGDYWDHSNNEKYGWGLDL--HDSEQDVWTEW---GHYSSDLF 189
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
++AV +I PLFLYL A H+ N + L+AP + I++F+ I D RR +AAMV
Sbjct: 190 AEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKNIKDERRRIFAAMV 249
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D ++ V+ +L+ + M NSII+F +DNG P + N SN+P RGVK TL
Sbjct: 250 SSMDGAIKKVVDSLKARSMYNNSIIVFTTDNGGPANGFDS-----NMASNFPLRGVKRTL 304
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGG++ A + SP I + RV ++MH+SDWLPTLYT AGGD L
Sbjct: 305 WEGGIRGTAFIHSPLITKPGRVMTELMHVSDWLPTLYTVAGGDIHDL------------- 351
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
N+DG D W S+ + S R ++ NID AA R WK+V+ +
Sbjct: 352 ---------QNLDGFDLWDSISTDAMSPREEMVHNIDPVNWEAAYRFREWKIVV--NQTK 400
Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR---STRQQATI 548
M G+Y P+ NI++ +T + A +
Sbjct: 401 YMSGWY----------------------------------PLPNIEEREPHPATLRDAVV 426
Query: 549 HCGANPAPMTPSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
CG P P P CT +GPC LFN+ NDPCE N+A +I + + L+ +++ +VP
Sbjct: 427 KCG--PPPEIPVNCTASDGPC-LFNIKNDPCEYVNLAKKELEILNNMLIWLEGYKKGMVP 483
Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
+ D A+P + W+ W
Sbjct: 484 IRNTPLD-PSANPANYGGVWTAW 505
Score = 63.5 bits (153), Expect = 3e-07, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
W G Y++DLF ++AV +I PLFLYL A H+ N + L+AP + I++F+
Sbjct: 177 VWTEWGHYSSDLFAEKAVNVISTHNASVPLFLYLPFQAVHSANFIQPLQAPPDLIDKFKN 236
Query: 68 ITDPNRRTYAALTKS 82
I D RR +AA+ S
Sbjct: 237 IKDERRRIFAAMVSS 251
>gi|307215079|gb|EFN89886.1| Arylsulfatase B [Harpegnathos saltator]
Length = 557
Score = 394 bits (1013), Expect = e-107, Method: Compositional matrix adjust.
Identities = 219/544 (40%), Positives = 313/544 (57%), Gaps = 49/544 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SF+G +EIPTPNID+LAYNG+ILN Y P+CTPSR + TG+YPI +GMQG P+
Sbjct: 48 GWNDVSFNGGDEIPTPNIDSLAYNGVILNRHYVLPICTPSRTAFFTGQYPIRSGMQGYPL 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GAEPR +PL LP+YLR+LGY+T +GKWH+G+ +TP RGF++ GY +G I
Sbjct: 108 QGAEPRSIPLNNILLPQYLRKLGYATHLVGKWHVGYQTNNHTPTNRGFDTFLGYYSGYIE 167
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H L + +G+D+ R++ + +Y TDL T EA +I KPL+
Sbjct: 168 YFSHNLVENGQ-----SGYDIHRSVGDNHTIEYRYDYMTDLITDEAENIISSHNPAKPLY 222
Query: 270 LYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
L LAHLA HA +E + T + YI D NRR +A+ V +D+S+G V+ AL+R
Sbjct: 223 LQLAHLAPHASTVDDVIEVRSWKATNDTLGYIRDINRRKFASAVVTMDESIGRVVDALRR 282
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
ML+NSII+FMSDNGAPT + N+GSNYP RG+K + +EG V+ A ++SP I
Sbjct: 283 ADMLKNSIIVFMSDNGAPT----KDQILYNFGSNYPLRGMKQSFFEGAVRGVACIYSPLI 338
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
RVS Q+ HI+DWLPTLY AAGGD S L +DG+D
Sbjct: 339 DFPSRVSTQLFHITDWLPTLYAAAGGDASDL----------------------KALDGID 376
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVP 506
QWS + RR +L+NI+EK + A + +KL+ ++ YYG T +
Sbjct: 377 QWSMIKNADNGRRQFLLVNINEKTDSKAALIGRYKLI---RDQSEYTKYYGSTGKDPSYC 433
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
N ++ S ++ +S++ +IDK++ R+++ + C + S CT
Sbjct: 434 EYNVTNVLASPVASAIADISRSSL----DIDKIKQLREKSRVVCRVSRF----SRCTKRT 485
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LF++ DPCE N++S P I +L L+ ++ LV Q+++ D P FNDTW
Sbjct: 486 C-LFDIKKDPCETTNVSSKYPKILKKLNALIDSYQSILVNQTNKPVDPA-GFPCNFNDTW 543
Query: 627 SPWI 630
PW+
Sbjct: 544 MPWL 547
Score = 49.3 bits (116), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/67 (41%), Positives = 36/67 (53%), Gaps = 2/67 (2%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
+Y TDL T EA +I KPL+L LAHLA HA +E + T + YI D
Sbjct: 198 DYMTDLITDEAENIISSHNPAKPLYLQLAHLAPHASTVDDVIEVRSWKATNDTLGYIRDI 257
Query: 72 NRRTYAA 78
NRR +A+
Sbjct: 258 NRRKFAS 264
>gi|194883566|ref|XP_001975872.1| GG20328 [Drosophila erecta]
gi|190659059|gb|EDV56272.1| GG20328 [Drosophila erecta]
Length = 478
Score = 393 bits (1010), Expect = e-106, Method: Compositional matrix adjust.
Identities = 219/495 (44%), Positives = 290/495 (58%), Gaps = 58/495 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSADIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL ELGY++ GKWHLG ++ +YTPLYRGF SH+G
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYASHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
DMR A+D G Y TD+ T +V++I K PLFL
Sbjct: 151 -------------------DMRNGTEVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFL 191
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+ N L P + + +I RR +AAMV K+DDSVG ++ L++ M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMAHIPHYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q MHI DWLPTL AAGG P + L + IDG W
Sbjct: 307 QRVSNQTMHIIDWLPTLLEAAGG-----PAAVANLSK--------------QIDGQSIWR 347
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + S R +VL NID+ +AA+ + WKLV GT G+ DG+YG L ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAALSVGDWKLVKGTNYRGSWDGWYGPAGERNPRLYDW 407
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCY 568
+ S+ ++L+ L LP S D+ R R AT+ C A + T T + PC
Sbjct: 408 QMVARSRAGKALEALK---MLP-SQADQQR-IRAAATVSCPAQSSKGTSCLATAFSAPC- 461
Query: 569 LFNLGNDPCEQNNIA 583
LF++ +DPCEQ N+A
Sbjct: 462 LFHIRDDPCEQYNLA 476
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 41/81 (50%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T +V++I K PLFLY+AH A H+ N L P
Sbjct: 152 MRNGTEVAYDLHGHYTTDVITDHSVKVIASHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ + +I RR +AA+
Sbjct: 212 NDVIKMAHIPHYKRRKFAAMV 232
>gi|156406805|ref|XP_001641235.1| predicted protein [Nematostella vectensis]
gi|156228373|gb|EDO49172.1| predicted protein [Nematostella vectensis]
Length = 498
Score = 392 bits (1007), Expect = e-106, Method: Compositional matrix adjust.
Identities = 227/546 (41%), Positives = 313/546 (57%), Gaps = 73/546 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTPN+DALA +G+ILNN Y P+ TPSRAS MTGKYPIH G+Q +
Sbjct: 12 GWDDISFHGSPQIPTPNLDALANSGVILNNYYVSPMDTPSRASFMTGKYPIHMGVQHDTL 71
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P GVPLTE+FLPE+LRE+GY T A+GKW LGFF +EYTP YRGF+S FG+
Sbjct: 72 HNRQPFGVPLTEKFLPEFLREMGYQTHAVGKWQLGFFAKEYTPTYRGFDSFFGFWTSHED 131
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY+H+ +D G D+RRNL + D G Y T+L +EA ++IE+ DKPLFLY
Sbjct: 132 YYNHVANDGGY------GIDLRRNLDVSNDHTGVYGTELLAREADEVIENHSGDKPLFLY 185
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H GN + L+AP+ +++F+YITD RRT+A MV LD+S+ ++++L+RKG+
Sbjct: 186 LAHQAVHVGNMDEPLQAPKRHVDKFKYITDERRRTFAGMVSSLDESMRQLVTSLKRKGLY 245
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSIIIF +DNG ++ GSN+P RG KNTLWEGGV+ A + SP I++
Sbjct: 246 QNSIIIFTTDNGGAAGGLDMSA-----GSNFPLRGNKNTLWEGGVRGVAFVHSPLIKRPG 300
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV +MH SDWLPTL+ AGG + +L IDG D W + TPS R + G+D
Sbjct: 301 RVYDGLMHASDWLPTLHLLAGG-SPKLLGQIDGYDLWDGISKGTPSPRYEVLHGIDTM-- 357
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+ AA+R+ +K++L D Y
Sbjct: 358 ------------------QGNRAALRVGDYKVILNQDLKFYSDWY--------------- 384
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTR--QQATIHCG-ANPAPM--TPSPCTN-- 564
K + +L D +R R AT+ C +P P+ T +P N
Sbjct: 385 -----KRPEGEHEL-----------DYIRKPRLLSNATVDCTIKSPHPLLYTHAPLCNPQ 428
Query: 565 -GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
PC LFN+ DPCE +N+A+ P+ L + LK++ + P + + D + A+P R
Sbjct: 429 KKPC-LFNIKWDPCEYHNLAAFMPNTLKVLLDRLKFYAEGMKPVIYPRADDL-ANPDRLG 486
Query: 624 DTWSPW 629
W+PW
Sbjct: 487 GVWTPW 492
Score = 88.6 bits (218), Expect = 9e-15, Method: Compositional matrix adjust.
Identities = 40/82 (48%), Positives = 56/82 (68%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+RRNL + D G Y T+L +EA ++IE+ DKPLFLYLAH A H GN + L+AP+
Sbjct: 146 LRRNLDVSNDHTGVYGTELLAREADEVIENHSGDKPLFLYLAHQAVHVGNMDEPLQAPKR 205
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+++F+YITD RRT+A + S
Sbjct: 206 HVDKFKYITDERRRTFAGMVSS 227
>gi|170040781|ref|XP_001848166.1| arylsulfatase b [Culex quinquefasciatus]
gi|167864377|gb|EDS27760.1| arylsulfatase b [Culex quinquefasciatus]
Length = 657
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 220/538 (40%), Positives = 307/538 (57%), Gaps = 54/538 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALAY GIILN Y P+CTPSRA++MTG+ PI GMQ I
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAYGGIILNRHYTAPMCTPSRAAIMTGRNPISVGMQHYVI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +PEY RE GY T +GKWHLGFF ++YTP RGF+SH Y
Sbjct: 102 DSDEPWGLGLDQKIMPEYFREAGYRTHLVGKWHLGFFAQQYTPTMRGFDSHTNY------ 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
G+DMR NL+ +D G YATD FT A ++I+ +PLFL
Sbjct: 156 ----------------TGYDMRHNLAVDYDANGTYATDHFTDAASRIIDKHNPSEPLFLM 199
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ HLA H GN L+AP+E I +F++I+D NRR YAAMV KLDDSVG + +L+ K M+
Sbjct: 200 VNHLAPHTGNDNDPLQAPEERIKKFEHISDENRRIYAAMVSKLDDSVGAIFKSLRAKNMI 259
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSII+F+SDNGAP+ + N GSN+P RG KN+ WEG + A +WSP +++
Sbjct: 260 QNSIILFISDNGAPSFGL-----HSNSGSNFPLRGQKNSPWEGATRNVAAIWSPLLEERQ 314
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q +HISDWLPT + AG RLP +R +S +DG+DQW +
Sbjct: 315 RVSNQYIHISDWLPTFASIAG---IRLP----------------QARNSSEVDGVDQWEA 355
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L +T R +L IDE ++ + +K V GT G DG+YGQ +N+ P +
Sbjct: 356 LSYDTGYPRRVILNMIDEIFGYSSYMENGFKYVNGTTGGGNFDGWYGQL-NNEDPHPSDE 414
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
+++ + QLS ++ + + ++ R+ + + C P T +P C LF+
Sbjct: 415 EYIKTVLETEIAQLSGDVL----SSELIKHLRKHSRVVCHKPLHPTTCNPLKRA-C-LFD 468
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ NDPCE N+I++ P +L ++ RR L + +P +P W+ W
Sbjct: 469 IVNDPCELNDISAREPFKFRELRTKVETLRR-LAKKPRNKPHDPAGNPANHGGVWTWW 525
Score = 81.3 bits (199), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 38/80 (47%), Positives = 52/80 (65%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR NL+ +D G YATD FT A ++I+ +PLFL + HLA H GN L+AP+E
Sbjct: 160 MRHNLAVDYDANGTYATDHFTDAASRIIDKHNPSEPLFLMVNHLAPHTGNDNDPLQAPEE 219
Query: 61 TINQFQYITDPNRRTYAALT 80
I +F++I+D NRR YAA+
Sbjct: 220 RIKKFEHISDENRRIYAAMV 239
>gi|195591209|ref|XP_002085335.1| GD14734 [Drosophila simulans]
gi|194197344|gb|EDX10920.1| GD14734 [Drosophila simulans]
Length = 579
Score = 392 bits (1006), Expect = e-106, Method: Compositional matrix adjust.
Identities = 234/559 (41%), Positives = 311/559 (55%), Gaps = 46/559 (8%)
Query: 80 TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
TK + +LI G ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+TG
Sbjct: 26 TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
KYPIHTGMQ I EP G+P ER +PE R+ GYST +GKWHLGF+R++ TP RG
Sbjct: 86 KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145
Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F+ HFGY NG I YYDH +L YS G D RR+L + G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRLLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++IE KPLF+ L+HLA H GN ++AP+E + +F +I DP RRTYA MV LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMVSSLD 260
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SV I AL+ GML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ LWSP +++ VS Q +H DWLPTL AAG SL +
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
P +DG++ W L N + +++ +DE ++ D+ K V G+ G D
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFEGRYDQ 413
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
+ G+ +N+ L+ + + +Q L N L D++R R +AT C G
Sbjct: 414 WLGELETNEDDPLS-ESYEQHVLASDVQSLLGNRGL---TEDRIRQMRSEATETCPPVEG 469
Query: 552 ANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
NP P PC+ F+L DPCE+ N+A P QL + ++ R+T +P +
Sbjct: 470 QNPLEPHFKCEPLKAPCF-FDLAKDPCERYNLAQIYPLQLQQLADEVEQIRKTAIPSARV 528
Query: 611 QPDLVQADPKRFNDTWSPW 629
+A+P N W W
Sbjct: 529 PHSDSRANPTFHNGNWEWW 547
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE KPLF+ L+HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMVSS 258
>gi|307187653|gb|EFN72625.1| Arylsulfatase B [Camponotus floridanus]
Length = 525
Score = 391 bits (1005), Expect = e-106, Method: Compositional matrix adjust.
Identities = 223/545 (40%), Positives = 318/545 (58%), Gaps = 64/545 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHG++EIPTPNIDALAYNG+ILN Y P+CTPSR + +TGKYPI TGMQG +
Sbjct: 25 GWNDVSFHGADEIPTPNIDALAYNGVILNRHYVLPLCTPSRTAFLTGKYPIRTGMQGYVL 84
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEPRG+PL + LPEYLR+LGY+T +GKWH+G+ + YTP RGF++ GY NG I
Sbjct: 85 QPAEPRGIPLNDTLLPEYLRKLGYATHLVGKWHVGYHTKNYTPTRRGFDTFLGYYNGYIH 144
Query: 212 YYDHILSDQYSRTVELNGHDM---RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
Y++H + D+ + + + H + R + +D Y TD+ T E +I KPL
Sbjct: 145 YFNHTILDEEQKYLGYDFHRIVGENRTIEYRYD----YITDIITDEVENIIFSHNPAKPL 200
Query: 269 FLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
+L ++H AAH+G G ++ +ET YI D NRR YA++V LD+SVG +I AL+
Sbjct: 201 YLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDINRRKYASVVATLDESVGRIIDALR 260
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+ ML+NSII+F+SDNGA T + + N+GSNYP RG+K +L+EGGV+ A ++SP
Sbjct: 261 KTDMLKNSIIVFISDNGAQTEGFLQ-----NYGSNYPLRGLKFSLFEGGVRGAAYIYSPL 315
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
I + RVS Q+ HI+DWLPTLY+AAGG++S L +DG DQWS++ S D
Sbjct: 316 IDRLSRVSTQLFHITDWLPTLYSAAGGNSSDLK-QLDGFDQWSAI--------KSAED-- 364
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV- 505
S+R S+LINIDE+ + + YY + +N +
Sbjct: 365 -----------SKRKSILINIDERDTSEYQK------------------YYNYSGNNALY 395
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
P N I+ S ++ +S ++ N +K+ R++ATI C M S CTN
Sbjct: 396 PKYNAINILASPVASAIASISTSML----NANKIMQLRKEATIVC---KNFMDFSNCTNR 448
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
C LF++ DPCE ++++ P +L + ++ LV Q + D FN+T
Sbjct: 449 SC-LFDVKEDPCETTDLSAKYPKEVERLNMFIDRYKSVLVKQPNTPIDPA-GYAYHFNNT 506
Query: 626 WSPWI 630
W PW+
Sbjct: 507 WIPWL 511
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 39/71 (54%), Gaps = 2/71 (2%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
+Y TD+ T E +I KPL+L ++H AAH+G G ++ +ET YI D
Sbjct: 177 DYITDIITDEVENIIFSHNPAKPLYLQVSHDAAHSGGIGIEMQVRDWKETNATLGYIEDI 236
Query: 72 NRRTYAALTKS 82
NRR YA++ +
Sbjct: 237 NRRKYASVVAT 247
>gi|270008947|gb|EFA05395.1| hypothetical protein TcasGA2_TC015567 [Tribolium castaneum]
Length = 513
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 224/544 (41%), Positives = 316/544 (58%), Gaps = 50/544 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G ND+ F+GS +IPTP+IDALAYNGIIL+ Y Q CTPSRA+L+TG+YPI GMQG PI
Sbjct: 6 GRNDVGFYGSGQIPTPSIDALAYNGIILDRFYTQCSCTPSRAALLTGQYPIRLGMQGLPI 65
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E + +PL +P+YL+ LGY T +GKWHLG+ E TPL RGF+SHFGY NG +
Sbjct: 66 RAGENKSLPLDVVTMPQYLKRLGYKTHLVGKWHLGYAHIEDTPLQRGFDSHFGYWNGFVG 125
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+++ + + + G D+ + AW G+YATDLFT +A+++I++ +KPLFL
Sbjct: 126 YFNYTAVYELANDTMVKGFDLFDGVVPAWQERGKYATDLFTHKAMKIIDEHNSEKPLFLV 185
Query: 272 LAHLAAHAGNAGKHLEAPQ--ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LAHLA H G G L P + +F +I +P RR YA +V +LDDSVG ++ L K
Sbjct: 186 LAHLAGHTGEDGVELGVPDVAQAETRFSFIKNPKRRLYADIVSRLDDSVGQIVEKLDEKN 245
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+L+NSII+F+SDNGA T+ Y N GSN+P RG+K T +EGG++ PA ++SP Q
Sbjct: 246 LLDNSIILFLSDNGAQTI-----GVYENSGSNWPLRGLKLTEFEGGIRAPAAIYSPLFHQ 300
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
VS +++HI+D+LPT Y AAGG+ L IDG++QW L S++
Sbjct: 301 KGYVSKELIHITDFLPTFYAAAGGNLEYLG-KIDGVNQWEVL-----SQQK--------- 345
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT-RSNKVPLL 508
P +R+ +L+N +E +A+ + +KLV G Y+G + R+ + P
Sbjct: 346 -------PGKRDEILVNFNEIDNVSAIIMGQYKLVQGRLAFPEYGDYHGDSGRNPEDPPY 398
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH-C-GANPAPMTPSPCTNGP 566
NF A+ LS+NI N +K+ S R Q + C G P+ CT G
Sbjct: 399 NFTAV-----------LSKNI--NQQNRNKIESLRNQTDLSWCRGEKNQPLI--DCTEG- 442
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
C LFNL +DPCE NI I+ +L L + LVPQ D +++P+ +N TW
Sbjct: 443 C-LFNLESDPCETTNIIQEEKIIAGKLLGRLFEFSQILVPQQVAIFD-PESNPEFYNGTW 500
Query: 627 SPWI 630
W+
Sbjct: 501 CTWL 504
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 34/75 (45%), Positives = 47/75 (62%), Gaps = 2/75 (2%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ--ETINQF 65
AW G+YATDLFT +A+++I++ +KPLFL LAHLA H G G L P + +F
Sbjct: 153 AWQERGKYATDLFTHKAMKIIDEHNSEKPLFLVLAHLAGHTGEDGVELGVPDVAQAETRF 212
Query: 66 QYITDPNRRTYAALT 80
+I +P RR YA +
Sbjct: 213 SFIKNPKRRLYADIV 227
>gi|291242646|ref|XP_002741217.1| PREDICTED: predicted protein-like [Saccoglossus kowalevskii]
Length = 526
Score = 390 bits (1003), Expect = e-106, Method: Compositional matrix adjust.
Identities = 225/547 (41%), Positives = 306/547 (55%), Gaps = 78/547 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS++IPTPNID LAY+G++L+N Y QPVCTP+R +L+TG+YP+H G+Q I
Sbjct: 38 GWDDVSFHGSHQIPTPNIDELAYSGVLLHNYYVQPVCTPTRGALLTGRYPMHLGLQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL E LP YL++LGYST +GKWHLGFF +EYTP YRGF+SH+GY G
Sbjct: 98 TPNEPVGLPLNETTLPTYLKKLGYSTHMVGKWHLGFFAKEYTPTYRGFDSHYGYFLGHQD 157
Query: 212 YYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY H + ++Q+ G D+R N+ T GEY +LFT +A +LI D KPLF
Sbjct: 158 YYTHNALWNNQW-------GFDLRHNMDLQRSTFGEYGPELFTTQAEKLIYDHDHKKPLF 210
Query: 270 LYLAHLAAHAGNAGKH---LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
LY AH A H GN+G + LEAP + ++F +I D RR YA MV LDDSVG + AL+
Sbjct: 211 LYFAHQAVHYGNSGPNGTLLEAPYKYTSRFPHIADHQRRIYAGMVSALDDSVGNITRALK 270
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+ G+ NSII+F +DNG P E + N P RG KNTLWEGG++ A + S
Sbjct: 271 KSGLYNNSIIVFSTDNGGP----HECEAF-----NTPLRGTKNTLWEGGIRGAAFIHSVL 321
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+++ RV MMH+ DWLPT+Y AGG+ S L N+DG
Sbjct: 322 LEKPKRVCEGMMHVVDWLPTMYHVAGGNLSDL----------------------HNLDGF 359
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
+ W +L S R +L NID +TAA+R+ + K+++G NK
Sbjct: 360 NMWDTLSSGVSSPRTEILHNIDPITQTAALRVGNLKVIVG---------------ENKSD 404
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC--TN 564
I K +L + + S + +HC + P P+ C
Sbjct: 405 WFVPPQITFGKNGTNLTK-------------QFGSDHRGIIVHCSSVP-PVAYKNCDPKQ 450
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDI-SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
PC L+N+ NDPCE N+A PD+ S LY+L Y+ ++ P P +++P N
Sbjct: 451 KPC-LYNIKNDPCEYYNLADEYPDVLQSLLYKLNDYNSTSVTPWV--IPPDPRSNPSLHN 507
Query: 624 DTWSPWI 630
DTWSPWI
Sbjct: 508 DTWSPWI 514
Score = 72.8 bits (177), Expect = 6e-10, Method: Compositional matrix adjust.
Identities = 37/85 (43%), Positives = 50/85 (58%), Gaps = 3/85 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH---LEA 57
+R N+ T GEY +LFT +A +LI D KPLFLY AH A H GN+G + LEA
Sbjct: 173 LRHNMDLQRSTFGEYGPELFTTQAEKLIYDHDHKKPLFLYFAHQAVHYGNSGPNGTLLEA 232
Query: 58 PQETINQFQYITDPNRRTYAALTKS 82
P + ++F +I D RR YA + +
Sbjct: 233 PYKYTSRFPHIADHQRRIYAGMVSA 257
>gi|24666175|ref|NP_649023.1| CG7402 [Drosophila melanogaster]
gi|7293925|gb|AAF49287.1| CG7402 [Drosophila melanogaster]
Length = 579
Score = 390 bits (1001), Expect = e-105, Method: Compositional matrix adjust.
Identities = 234/560 (41%), Positives = 311/560 (55%), Gaps = 48/560 (8%)
Query: 80 TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
TK + +LI G ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+TG
Sbjct: 26 TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
KYPIHTGMQ I EP G+P ER +PE R+ GYST +GKWHLGF+R++ TP RG
Sbjct: 86 KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145
Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F+ HFGY NG I YYDH +L YS G D RR+L + G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRMLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++IE KPLF+ L+HLA H GN ++AP+E + +F +I DP RRTYA M+ LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLD 260
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SV I AL+ GML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ LWSP +++ VS Q +H DWLPTL AAG SL +
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAVDWLPTLAGAAG----------------VSLPQDL 359
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
P +DG++ W L N + +++ +DE ++ D+ K V G+ G D
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQ 413
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
+ G+ +N+ L + + +Q L N L D++R R +AT C G
Sbjct: 414 WLGELETNEDDPLG-ESYEQHVLASDVQSLLGNRGL---TKDRIRQMRSEATETCPPIEG 469
Query: 552 ANPAPMTPSPCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
NP + C PC+ F+L DPCE+ N+A P QL + L+ R+T +P +
Sbjct: 470 QNPLE-SHFKCEPLKAPCF-FDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSAR 527
Query: 610 EQPDLVQADPKRFNDTWSPW 629
+A+P N W W
Sbjct: 528 VPHSDSRANPTFHNGNWEWW 547
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE KPLF+ L+HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258
>gi|195441662|ref|XP_002068622.1| GK20325 [Drosophila willistoni]
gi|194164707|gb|EDW79608.1| GK20325 [Drosophila willistoni]
Length = 550
Score = 389 bits (1000), Expect = e-105, Method: Compositional matrix adjust.
Identities = 229/567 (40%), Positives = 314/567 (55%), Gaps = 56/567 (9%)
Query: 78 ALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
A T ST ++I+ G+ND+SFHGSN+I TPNIDALAYNG++LN +Y +CTPSRA
Sbjct: 20 AETPSTKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGVLLNKLYVPNLCTPSRA 79
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+L+TGKYPIHTGMQ I EP G+P ER +PE+ R+ GYST+ IGKWHLGF+ ++ T
Sbjct: 80 TLLTGKYPIHTGMQHYVIITDEPWGLPKQERLMPEFFRDAGYSTQLIGKWHLGFWEKDLT 139
Query: 194 PLYRGFESHFGYLNGVISYYD---HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
P RGF+ H+GY NG I YYD H+L+ Y++ V D RR+L G YATD
Sbjct: 140 PTMRGFDHHYGYYNGYIDYYDHTLHMLTKNYTKGV-----DFRRDLDPCPQDNGTYATDA 194
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
FT EA ++IE +PLF+ L+HLA H GN ++AP+E + +F +ITDP RRTYA M
Sbjct: 195 FTAEAKRVIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHITDPKRRTYAGM 254
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
+ LD SVG + AL GML NSII+ SDNGAPTV + N GSNYP+RG K +
Sbjct: 255 ISSLDKSVGQTVKALSDNGMLNNSIILLYSDNGAPTVGI-----HSNQGSNYPFRGQKES 309
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
WEGG++ +WSP +QQ VS Q++H DWLPTL AA LP N+
Sbjct: 310 PWEGGIRSAGAIWSPLLQQRGYVSNQIIHAIDWLPTLAAAA---NVELPENL-------- 358
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
+DG++ W SL N + +L +D+ ++ D+ K V G+
Sbjct: 359 -----------KLDGVNMWPSLSGNIEPQPRRLLHVMDDVFGYSSYMRDNLKYVNGSSFE 407
Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNI----DKMRSTRQQA 546
G D + G S + L+ Y + Q LS + I + +++ R++A
Sbjct: 408 GQYDQWLGDLESGQDDPLS--------PYYAQQVLSSEVHAVIGSQSVTNERIEQMREEA 459
Query: 547 TIHCGANP----APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRR 602
T C N + PC+ FNL DPCE+ N+A+ P L + ++ R+
Sbjct: 460 THRCPLNEQDHLQEIYLCEPLKAPCF-FNLAKDPCERFNLANLYPLQLQFLSQEVEQFRQ 518
Query: 603 TLVPQSHEQPDLVQADPKRFNDTWSPW 629
VP + +A+PK W W
Sbjct: 519 NAVPSARVPISDERANPKLHGGYWEWW 545
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 38/82 (46%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L G YATD FT EA ++IE +PLF+ L+HLA H GN ++AP+E
Sbjct: 176 FRRDLDPCPQDNGTYATDAFTAEAKRVIEQHDKSRPLFMVLSHLAVHTGNEDNPMQAPEE 235
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +ITDP RRTYA + S
Sbjct: 236 EVAKFAHITDPKRRTYAGMISS 257
>gi|195494692|ref|XP_002094947.1| GE19935 [Drosophila yakuba]
gi|194181048|gb|EDW94659.1| GE19935 [Drosophila yakuba]
Length = 577
Score = 389 bits (999), Expect = e-105, Method: Compositional matrix adjust.
Identities = 226/554 (40%), Positives = 311/554 (56%), Gaps = 45/554 (8%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
+ L+ G ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+TGKYPIH
Sbjct: 31 VIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTGKYPIH 90
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TGMQ I EP G+P ER +PE R+ GYST +GKWHLGF+R++ TP RGF+ HF
Sbjct: 91 TGMQHFVIITDEPWGLPSRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRGFDHHF 150
Query: 204 GYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
GY NG I YYDH +L YS +G D RR+L + G YAT+ FT EA ++IE
Sbjct: 151 GYYNGYIDYYDHQVRMLDRNYS-----HGLDFRRDLEPCPEANGTYATEAFTSEAKRIIE 205
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
KPLF+ ++HLA H GN ++AP+E + +F +I DP RRTYA M+ LD SV
Sbjct: 206 QHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQ 265
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
I AL+ ML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG++
Sbjct: 266 TIGALKDNDMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGGIRSAG 320
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
LWSP +++ VS Q +H DWLPTL AAG SL + P
Sbjct: 321 ALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDLP---- 360
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
+DG++ W L N + +++ +DE ++ D+ K + G+ G D + G+
Sbjct: 361 --LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYINGSSFQGRYDHWLGEL 418
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----GANP-A 555
++N+ L+ + + +Q + N +L D++R R +AT C G NP
Sbjct: 419 KTNEDHPLS-ESYEQHVLASDVQSILGNRWL---TEDRIRQMRSEATEMCPPVEGLNPLE 474
Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
P PC+ F+L DPCE+ N+A + P QL + ++ R+T +P +
Sbjct: 475 PQFKCEPLKAPCF-FDLAKDPCERYNLAETYPLQLQQLADEVEQIRKTAIPSARVPYSDS 533
Query: 616 QADPKRFNDTWSPW 629
+A+P N W W
Sbjct: 534 RANPTFHNGNWEWW 547
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE KPLF+ ++HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258
>gi|391327192|ref|XP_003738089.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 594
Score = 388 bits (997), Expect = e-105, Method: Compositional matrix adjust.
Identities = 228/548 (41%), Positives = 310/548 (56%), Gaps = 57/548 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SF GS +IPTPN+DALA G+IL N Y QP CTPSRA+L+TG YPIH+GMQ I
Sbjct: 94 GWNDVSFTGSGQIPTPNLDALASAGVILQNHYVQPFCTPSRAALLTGMYPIHSGMQHYVI 153
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL + LP++L++LGY T IGKWHLG F++E+ P RGF+SH GY NG I
Sbjct: 154 RSREPWGLPLDFKLLPQHLKDLGYRTHLIGKWHLGQFKKEFLPTRRGFDSHLGYYNGYID 213
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + Y R L+ + EYAT LFT A ++I D VD PLFLY
Sbjct: 214 YFTH--NHTYKRDSALDFFKDEVPYHSE-----EYATRLFTDRAEEIIRDHDVDNPLFLY 266
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
AHLA H +APQETI++F Y+ D NR T+AAM+ +LD SVG V+ AL +KG+L
Sbjct: 267 FAHLAVHRATDRDPFQAPQETIDKFSYVGDRNRTTFAAMLAELDVSVGRVVEALAKKGIL 326
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+II+F SDNG + T+ N G+N+P RG K TL+EGG +VPA +WSP I++
Sbjct: 327 DNTIILFSSDNGG-----QATAPMENTGTNFPLRGQKRTLFEGGTRVPAFVWSPLIRRPR 381
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RV M+HI DWLPT+Y+ AGGD P N+ IDG++ W +
Sbjct: 382 RVFYDMVHIVDWLPTIYSLAGGD----PRNL------------------GRIDGMNVWGA 419
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+ S R +L NID +R A+R+ +KL+ G N T + N+ +
Sbjct: 420 VSDGMASPRTEILYNIDPVQRDLAIRVGRYKLLYGPMYNLT------EWYDNRGFGYASH 473
Query: 512 AIVE-----SKTYQSLQQLSQNIFLPISNIDKMRST----RQQATIHCGANPAPMTPSPC 562
+++E S + L+ L + D R R++ +HCG NP+ T
Sbjct: 474 SVLERYMDKSAVARVLRDLG--YWWKGREEDYSRRPTWQWREETRVHCG-NPSSATACNP 530
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
+ PC LF+L DPCE NN+ ++ L L+ H R+ P ++ +PD P
Sbjct: 531 SEQPC-LFDLATDPCEYNNLYGQLTQVARYLLSRLEAHARSARPINN-KPDDYSFQP--- 585
Query: 623 NDTWSPWI 630
N TW PWI
Sbjct: 586 NATWGPWI 593
Score = 77.8 bits (190), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 43/109 (39%), Positives = 58/109 (53%), Gaps = 18/109 (16%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EYAT LFT A ++I D VD PLFLY AHLA H +APQETI++F Y+ D NR
Sbjct: 240 EYATRLFTDRAEEIIRDHDVDNPLFLYFAHLAVHRATDRDPFQAPQETIDKFSYVGDRNR 299
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
T+AA+ +++ V ++ALA GI+ N +
Sbjct: 300 TTFAAMLAELDVSVGRV------------------VEALAKKGILDNTI 330
>gi|260795396|ref|XP_002592691.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
gi|229277914|gb|EEN48702.1| hypothetical protein BRAFLDRAFT_57230 [Branchiostoma floridae]
Length = 485
Score = 387 bits (994), Expect = e-105, Method: Compositional matrix adjust.
Identities = 193/408 (47%), Positives = 261/408 (63%), Gaps = 30/408 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPN+D+LAY+G+IL N Y P+CTP+R+++MTG++PIHTG+Q I
Sbjct: 3 GWNDVSFHGSDQIPTPNLDSLAYSGVILGNYYVSPICTPTRSAIMTGRHPIHTGLQHGVI 62
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GA P G+PL E LP+YL+ LGY+T +GKWHLG E+TP +RGF+S+FGYL G +
Sbjct: 63 SGATPFGLPLNETILPQYLKPLGYATHIVGKWHLGHHAWEFTPTFRGFDSYFGYLTGKDN 122
Query: 212 YYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YYDH D+ + EL G D+R W G Y+T+LF EA ++I KPLF
Sbjct: 123 YYDHT-DDESNSPEELGYKGLDLRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLF 181
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL H A H+GN L+APQ+ I++F +I P RRT+AAMV LDD+VG V AL +G
Sbjct: 182 LYLPHQAVHSGNPDNPLQAPQKYIDKFPHIQHPGRRTFAAMVSALDDAVGNVTKALSARG 241
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENS+IIF +DNG P + + N+ SN+P RGVKNTLWEGGV + SP I+Q
Sbjct: 242 MLENSVIIFTTDNGGPAAGFDQ-----NYASNWPLRGVKNTLWEGGVHGTGFVHSPLIKQ 296
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R + +++H+ D LPT+Y AGGD++ L N+DG + W
Sbjct: 297 PKRTTHELLHVCDLLPTIYELAGGDSTEL----------------------KNLDGTNVW 334
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
++ S R VL NID K++TAA+R +K++LG G DG+Y
Sbjct: 335 ETISRGVQSPRVEVLHNIDPKRKTAALRYGDYKIILGEAYKGAWDGWY 382
Score = 76.3 bits (186), Expect = 5e-11, Method: Compositional matrix adjust.
Identities = 35/82 (42%), Positives = 49/82 (59%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R W G Y+T+LF EA ++I KPLFLYL H A H+GN L+APQ+
Sbjct: 144 LRNGTEPVWTENGTYSTELFATEAERIITSHDTSKPLFLYLPHQAVHSGNPDNPLQAPQK 203
Query: 61 TINQFQYITDPNRRTYAALTKS 82
I++F +I P RRT+AA+ +
Sbjct: 204 YIDKFPHIQHPGRRTFAAMVSA 225
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 22/56 (39%), Positives = 28/56 (50%), Gaps = 1/56 (1%)
Query: 575 DPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
DPCE NNIA + + L L+ + T VP QP + +PK WSPWI
Sbjct: 399 DPCEFNNIADWNKPLVNFLMGRLEAYSETAVP-IRNQPTDPRGNPKYHGGVWSPWI 453
>gi|195328507|ref|XP_002030956.1| GM25726 [Drosophila sechellia]
gi|194119899|gb|EDW41942.1| GM25726 [Drosophila sechellia]
Length = 579
Score = 387 bits (994), Expect = e-104, Method: Compositional matrix adjust.
Identities = 232/559 (41%), Positives = 311/559 (55%), Gaps = 46/559 (8%)
Query: 80 TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
TK + +LI G ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+TG
Sbjct: 26 TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
KYPI+TGMQ I EP G+P ER +PE R+ GYST +GKWHLGF+R++ TP RG
Sbjct: 86 KYPIYTGMQHFVIITDEPWGLPQRERLMPEIFRDAGYSTHLVGKWHLGFWRKDLTPTMRG 145
Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F+ HFGY NG I YYDH +L YS G D RR+L + G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRLLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTSEA 200
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++IE KPLF+ L+HLA H GN ++AP+E + +F +I DP RRTYA MV LD
Sbjct: 201 KRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMVSSLD 260
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SV I AL+ GML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTIGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ LWSP +++ VS Q +H DWLPTL AAG SL +
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
P +DG++ W L N + +++ +DE ++ D+ K V G+ G D
Sbjct: 360 P------LDGINLWPMLSGNEEPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFEGRYDQ 413
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----G 551
+ G+ +N+ L+ + + +Q L N L D++R R +AT C G
Sbjct: 414 WLGELETNEDDPLS-ESYEKHVLASDVQSLLGNRGL---TEDRIRQMRSEATETCPPVEG 469
Query: 552 ANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
NP P PC+ F+L DPCE+ N+A P +L + ++ R+T +P +
Sbjct: 470 QNPLEPHFKCEPLKAPCF-FDLAKDPCERYNLAQIYPLQLQELADEVEQIRKTAIPSARV 528
Query: 611 QPDLVQADPKRFNDTWSPW 629
+A+P N W W
Sbjct: 529 PHSDSRANPTFHNANWEWW 547
Score = 80.9 bits (198), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE KPLF+ L+HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTSEAKRIIEQHDKSKPLFMVLSHLAVHTGNEDSPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMVSS 258
>gi|194871664|ref|XP_001972882.1| GG13640 [Drosophila erecta]
gi|190654665|gb|EDV51908.1| GG13640 [Drosophila erecta]
Length = 578
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 230/560 (41%), Positives = 308/560 (55%), Gaps = 48/560 (8%)
Query: 80 TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
TK + +LI G ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+TG
Sbjct: 26 TKPNIVIILIDDMGMNDVSFHGSNQILTPNIDALAYNGILLNKHYVPNLCTPSRATLLTG 85
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
KYPIHTGMQ I EP G+P ER +PE RE GYST +GKWHLGF+ ++ TP RG
Sbjct: 86 KYPIHTGMQHFVIITDEPWGLPQRERLMPEIFREAGYSTHLVGKWHLGFWHKDLTPTRRG 145
Query: 199 FESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F+ HFGY NG I YYDH +L YS G D RR+L + G YAT+ FT EA
Sbjct: 146 FDHHFGYYNGYIDYYDHQVRMLDRNYSA-----GLDFRRDLEPCPEANGTYATEAFTAEA 200
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++IE KPLF+ ++HLA H GN ++AP+E + +F +I DP RRTYA M+ LD
Sbjct: 201 KRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLD 260
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
SV + AL+ GML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG
Sbjct: 261 KSVAQTVGALKDNGMLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGG 315
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
++ LWSP +++ VS Q +H DWLPTL AAG SL +
Sbjct: 316 IRSAGALWSPLLKERGYVSNQAIHAIDWLPTLAGAAG----------------VSLPQDL 359
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
P +DG++ W L N + +++ +DE ++ D+ K V G+ G D
Sbjct: 360 P------LDGINLWPLLSGNEQPKPRTMIHVLDEVFGYSSYMKDTLKYVNGSSFQGRYDQ 413
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPIS-NIDKMRSTRQQATIHC---- 550
+ G+ ++ + + ES L Q+I D++R R +AT C
Sbjct: 414 WLGELETSAD-----DPLSESYEQHVLASDVQSILGNRGLTEDRIRQMRSEATELCPPVE 468
Query: 551 GANP-APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
G NP P PC+ F+L NDPCE+ N+A + P QL + + R+T + +
Sbjct: 469 GQNPLEPHFKCEPLKAPCF-FDLANDPCERYNLARTYPLQLQQLADEVVLIRKTAISSAR 527
Query: 610 EQPDLVQADPKRFNDTWSPW 629
+A+P N W W
Sbjct: 528 VAHSDSRANPAYHNGNWEWW 547
Score = 79.7 bits (195), Expect = 4e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE KPLF+ ++HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCPEANGTYATEAFTAEAKRIIEQHDKSKPLFMVMSHLAVHTGNEDSPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFPHIRDPKRRTYAGMISS 258
>gi|158287209|ref|XP_564139.3| AGAP011347-PA [Anopheles gambiae str. PEST]
gi|157019541|gb|EAL41524.3| AGAP011347-PA [Anopheles gambiae str. PEST]
Length = 634
Score = 384 bits (987), Expect = e-104, Method: Compositional matrix adjust.
Identities = 231/575 (40%), Positives = 328/575 (57%), Gaps = 53/575 (9%)
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIIL 119
T+ Y +P +R + L + + GWND+ FHGSN+I TP+IDALAY+G+IL
Sbjct: 19 RTLAVAAYGQNPPKRPHIVLITADDM------GWNDVGFHGSNQIATPHIDALAYDGVIL 72
Query: 120 NNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKA 179
N Y+ P+CTPSRA+LMTG++PI+ GMQ I EP G+ L +R +P+Y R GY T
Sbjct: 73 NRHYSAPMCTPSRAALMTGRHPINVGMQHYVIDSDEPWGLGLDQRIMPQYFRAAGYRTHM 132
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
IGKWHLGFF Y P RGF++H GYL + Y+ ++ S S T E G+DMR+N
Sbjct: 133 IGKWHLGFFTEHYIPTNRGFDTHIGYLGPYVDYWSYV-SKMNSGTFE--GYDMRQNQFVN 189
Query: 240 WDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
+ G YATD FT A +I +P+ L + HLA HAGN L+APQETI++F Y
Sbjct: 190 YAANGTYATDYFTSAARDIIAQHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQETIDRFAY 249
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
I + +RRTYAAM+ KLDDSVG V +L+ GML+NSII+F+SDNG T + N
Sbjct: 250 IGNRDRRTYAAMMSKLDDSVGAVYESLRANGMLDNSIIVFLSDNGGVT-----RGMHSNT 304
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG---GDT 415
GSNYP+RG K++ WEG V+ A++WSP ++ RVS Q HISDWLPTL +AAG G++
Sbjct: 305 GSNYPFRGQKHSPWEGAVRTAALIWSPLLKDTQRVSNQWFHISDWLPTLASAAGIALGNS 364
Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
S +IDG+DQW +L T P +R ++ N+DE +
Sbjct: 365 SF--SDIDGIDQWEALAYGT-------------------GNPRQR--LMNNLDEIFGYTS 401
Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
+ +K + GT NG D +YG+T PL + + + + Y S ++ +
Sbjct: 402 YMENGFKYINGTTLNGVNDAWYGETD----PL---DQLPDDEQYLSSVMATRIARAGRLD 454
Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
+ R+ AT++CG + A +P PC LF++ NDPCE ++I+ P +
Sbjct: 455 EKLISYLRRHATVNCGNSIATKPCNPLVK-PC-LFDIINDPCEMHDISDEYPKRLRGIRN 512
Query: 596 LLKYHR-RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
L+ +R R P++ +P +ADP +F W+ W
Sbjct: 513 RLELYRLRAAKPRN--RPHDPEADPSKFGGVWNWW 545
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 39/92 (42%), Positives = 53/92 (57%), Gaps = 1/92 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
MR+N + G YATD FT A +I +P+ L + HLA HAGN L+APQ
Sbjct: 182 MRQNQFVNYAANGTYATDYFTSAARDIIAQHGKSGQPMLLVMNHLAPHAGNDDDPLQAPQ 241
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
ETI++F YI + +RRTYAA+ ++ VY
Sbjct: 242 ETIDRFAYIGNRDRRTYAAMMSKLDDSVGAVY 273
>gi|242023422|ref|XP_002432133.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212517507|gb|EEB19395.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 514
Score = 384 bits (986), Expect = e-104, Method: Compositional matrix adjust.
Identities = 195/399 (48%), Positives = 253/399 (63%), Gaps = 33/399 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGSN+IPTPNIDALA+ GIILNN Y PVCTPSR++L+TGKYPIHTG+Q I
Sbjct: 42 GWNDVGFHGSNQIPTPNIDALAFTGIILNNYYVAPVCTPSRSALLTGKYPIHTGLQHGVI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ P G+ L E+ LPEYLR L Y T+ +GKWHLG F+++YTP YRGF+SH+GY G
Sbjct: 102 HGSAPYGLNLNEKLLPEYLRSLNYVTRHVGKWHLGSFKKDYTPEYRGFDSHYGYWTGHQD 161
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH + G+DMRR ++ G Y TDLFT EAV++I+ +KPLFLY
Sbjct: 162 YYDHTAIENPG----FWGYDMRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDSNKPLFLY 217
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAHLA H+GN L+AP ET+ +F YI D NRR +A M+ KLD+SVG V+ AL M+
Sbjct: 218 LAHLATHSGNKYSPLQAPAETVAKFNYIKDKNRRLFAGMLSKLDESVGKVVEALADSNMI 277
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
N +I+F +DNG P + N SN+P RGVK+TLWEGGV+ +WSP + +
Sbjct: 278 NNCVILFSTDNGGPAGGFN-----LNAASNWPLRGVKDTLWEGGVRGVGFIWSPFLPSS- 331
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+VS M+HI+DWLPTL L L S S+IDG++ W +
Sbjct: 332 KVSNAMIHITDWLPTL-----------------------LSLTNASNSISDIDGINVWPN 368
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
L PS R +L+NID +++ AA+R +WK G N
Sbjct: 369 LSKGLPSVRKEILLNIDTERKIAALRYKNWKYKKGNANN 407
Score = 89.0 bits (219), Expect = 7e-15, Method: Compositional matrix adjust.
Identities = 41/79 (51%), Positives = 53/79 (67%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR ++ G Y TDLFT EAV++I+ +KPLFLYLAHLA H+GN L+AP E
Sbjct: 178 MRRGMNVTRSDFGYYTTDLFTNEAVKVIKGHDSNKPLFLYLAHLATHSGNKYSPLQAPAE 237
Query: 61 TINQFQYITDPNRRTYAAL 79
T+ +F YI D NRR +A +
Sbjct: 238 TVAKFNYIKDKNRRLFAGM 256
>gi|443694453|gb|ELT95582.1| hypothetical protein CAPTEDRAFT_115907 [Capitella teleta]
Length = 561
Score = 384 bits (985), Expect = e-103, Method: Compositional matrix adjust.
Identities = 219/560 (39%), Positives = 310/560 (55%), Gaps = 58/560 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS ++ TPN+DALAY+G+IL N Y QP+CTPSRA+LMTG++PIHTGMQ I
Sbjct: 37 GWNDVGFHGSEQVLTPNLDALAYDGVILENYYVQPICTPSRAALMTGRHPIHTGMQQNVI 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEP G+ L E P+YL+++GY T +GKWHLGFF +YTP+ RGF+SH+GY G
Sbjct: 97 YSAEPYGLGLNEIIFPQYLKQIGYKTHIVGKWHLGFFAEQYTPIERGFDSHYGYYMGAED 156
Query: 212 YYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y+ HI + + + + G D RRN G+Y+T+LFT EA +I +PLF+
Sbjct: 157 YWVHIAGNAHEVSFNASWGLDFRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFM 216
Query: 271 YLAHLAAHAGN---AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
Y+A A H+ N LEAP + +F +I + RR +AAMV LDDSVG + AL
Sbjct: 217 YVAQQAVHSANPYTGDAELEAPFKYYEKFPHIKNEKRRKFAAMVSALDDSVGNITKALHA 276
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
KG+LEN+II F +DNG P + N SN+P RGVK +LWEGGV+ +WSP +
Sbjct: 277 KGILENTIICFSTDNGGPASGFD-----FNDASNWPLRGVKESLWEGGVRGTGFVWSPLM 331
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ + S MM I D+LPTL AAG D S+L ++DG+DQW + S R +
Sbjct: 332 KTSGYTSPAMMQIFDFLPTLLKAAGYDMSKLSPDLDGIDQWEII-----SERKDAV---- 382
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT--------MDGYYG- 498
R +L NID + T A+R++ K+V+ +Q T D +YG
Sbjct: 383 ------------RTEMLHNIDNTEGTRALRINEMKVVV-SQGGSTRSWPLWYSTDQFYGG 429
Query: 499 ------QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
++ + ++ V + L + +N+ P+ + ++ IHCG
Sbjct: 430 YNHTDHSVSADLSAKWDIDSEVAPRVNSDLTSVLRNLGRPLRSGTPLK-------IHCGP 482
Query: 553 NPAPMTPSPCTNG--PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
P P C G PC LFN+ DPCE +N+A RPD+ S + + L ++ T V ++
Sbjct: 483 VP-PSARGSCDVGKKPC-LFNITADPCEFHNLADERPDMLSTMLDRLDQYQSTAVAPRNK 540
Query: 611 QPDLVQADPKRFNDTWSPWI 630
D P + W PWI
Sbjct: 541 PVD-PNGLPSKHGGIWEPWI 559
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/85 (35%), Positives = 45/85 (52%), Gaps = 3/85 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN---AGKHLEA 57
RRN G+Y+T+LFT EA +I +PLF+Y+A A H+ N LEA
Sbjct: 178 FRRNGEVVKTAFGQYSTELFTTEAENIIASHNQSEPLFMYVAQQAVHSANPYTGDAELEA 237
Query: 58 PQETINQFQYITDPNRRTYAALTKS 82
P + +F +I + RR +AA+ +
Sbjct: 238 PFKYYEKFPHIKNEKRRKFAAMVSA 262
>gi|390361962|ref|XP_789345.3| PREDICTED: arylsulfatase I-like, partial [Strongylocentrotus
purpuratus]
Length = 514
Score = 383 bits (983), Expect = e-103, Method: Compositional matrix adjust.
Identities = 215/546 (39%), Positives = 306/546 (56%), Gaps = 72/546 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+S HGS++IPTPNID LA +G+ L N Y P+CTPSR+++MTG++PIHTG+Q I
Sbjct: 22 GWDDVSLHGSSQIPTPNIDTLAQDGVTLTNYYVSPLCTPSRSAIMTGRHPIHTGLQFGVI 81
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+ L E+ + +YL+ LGYST A+GKWHLG+F +EYTP +RGF+S FG+ NG
Sbjct: 82 SPEAPYGLGLEEKTMAQYLKTLGYSTHAVGKWHLGYFAKEYTPTWRGFDSFFGFYNGRGD 141
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H E++G+D+ +N G+Y+TD+F +EA Q+I+ +PLFLY
Sbjct: 142 YYTH-----EEVQSEVSGYDLHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNASQPLFLY 196
Query: 272 LAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
LAH A HAG + L+AP + +F +I RR YAAMV LDDSVG++ L+ G+
Sbjct: 197 LAHQAVHAGVYPDRLLQAPDKYYQRFPHIETEGRRMYAAMVSALDDSVGSISQTLRDAGL 256
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSII+F +DNG T ++ + ++ NW P RG K+TLWEGGV+ A + SP I++
Sbjct: 257 YDNSIIVFTTDNGGATYDFFDGTHASNW----PLRGGKHTLWEGGVRGTAFVNSPLIKKP 312
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R S QMMH+ DWLPTL++ AGGD L N DG DQW
Sbjct: 313 RRFSDQMMHVCDWLPTLHSVAGGDPKEL----------------------KNSDGFDQWD 350
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L N PS RN +L NI+ ++ AA+R+ +KL G F
Sbjct: 351 ALSNNAPSPRNEILHNIEPIEKYAAIRVGDFKLHYGA----------------------F 388
Query: 511 NAIVESKTYQSL---QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG-- 565
+ SK + + + N+ + N ++R +QA + C G
Sbjct: 389 GGKIGSKYFSGWYPPEGATSNMAEYVPNSFRVRCPSKQAN----------ASTNCDYGDR 438
Query: 566 -PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
PC L+N+ +DPCE NNIA DI + L+ +RR+ V + QP Q++PK +
Sbjct: 439 KPC-LYNIRDDPCEYNNIADWNQDIVEAMTARLREYRRSAV-EPRNQPLDPQSNPKLHGN 496
Query: 625 TWSPWI 630
W PW+
Sbjct: 497 VWEPWV 502
Score = 62.4 bits (150), Expect = 7e-07, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 1/83 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQ 59
+ +N G+Y+TD+F +EA Q+I+ +PLFLYLAH A HAG + L+AP
Sbjct: 157 LHKNGKVYRPAFGQYSTDIFNQEAEQIIKAHNASQPLFLYLAHQAVHAGVYPDRLLQAPD 216
Query: 60 ETINQFQYITDPNRRTYAALTKS 82
+ +F +I RR YAA+ +
Sbjct: 217 KYYQRFPHIETEGRRMYAAMVSA 239
>gi|443732842|gb|ELU17406.1| hypothetical protein CAPTEDRAFT_127365 [Capitella teleta]
Length = 502
Score = 379 bits (973), Expect = e-102, Method: Compositional matrix adjust.
Identities = 213/505 (42%), Positives = 292/505 (57%), Gaps = 43/505 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS ++PTPNIDALA +GIIL+N Y +C+PSR +LMTGK+PI G+Q I
Sbjct: 35 GWDDVSFHGSRQVPTPNIDALASDGIILDNYYVHTLCSPSRGALMTGKHPIQIGLQRGVI 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A+P G+ L E+ LPEYL LGY + +GKWHLG EYTP++RGFESHFG+ G S
Sbjct: 95 MPAQPSGLGLKEKLLPEYLNTLGYKSHMVGKWHLGMCAEEYTPMHRGFESHFGFYQGCES 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y H+ + +G D N G+Y+T LFT +A QL+ + P+FLY
Sbjct: 155 YTTHMCGN--------SGLDFWLNEEPDHSAGGQYSTSLFTAKAEQLLAEHDTASPMFLY 206
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H GN + AP + ++ +I+D RR A M+ LD+SVG V AL KGML
Sbjct: 207 LAHQAVHVGNQDQKFYAPDKYTDKLSFISDDRRRQMAGMLTALDESVGKVTKALHAKGML 266
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
EN+II+F SDNGAP + + GSN+P RG K ++WEGG +VPA +WSP +++
Sbjct: 267 ENTIIVFSSDNGAPHIGNSD----HTVGSNFPLRGGKPSIWEGGSRVPAFVWSPLFKKSG 322
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+S Q+MHISDWLPTL AAG DTS LP N+ G+ QW +
Sbjct: 323 YISKQLMHISDWLPTLLQAAGYDTSALPTNLYGVSQW---------------------QT 361
Query: 452 LLLNTPSRRNSVLINI--DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV--PL 507
L N PS R+SVL N+ K ++ A+R+ KL+ ++ G+Y S+ + P
Sbjct: 362 LQENGPSPRDSVLYNVIAGCKDKSWALRVGDMKLLRAQGKSKKNAGWYQPFSSDILSQPK 421
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NG 565
AI+ + LQ I + +K ST + I CGA PA + + C +
Sbjct: 422 SRDPAIISVVPAELLQSQVTPILRELGRGEKSSSTEHR--IDCGAPPAGVVYN-CIGMDK 478
Query: 566 PCYLFNLGNDPCEQNNIASSRPDIS 590
PC LFN+ DPCE +N+A+ P+I+
Sbjct: 479 PC-LFNITADPCEYHNLANDMPEIA 502
Score = 62.0 bits (149), Expect = 9e-07, Method: Compositional matrix adjust.
Identities = 27/67 (40%), Positives = 40/67 (59%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+T LFT +A QL+ + P+FLYLAH A H GN + AP + ++ +I+D
Sbjct: 179 GQYSTSLFTAKAEQLLAEHDTASPMFLYLAHQAVHVGNQDQKFYAPDKYTDKLSFISDDR 238
Query: 73 RRTYAAL 79
RR A +
Sbjct: 239 RRQMAGM 245
>gi|198466304|ref|XP_002135153.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
gi|198150538|gb|EDY73780.1| GA23896 [Drosophila pseudoobscura pseudoobscura]
Length = 577
Score = 379 bits (972), Expect = e-102, Method: Compositional matrix adjust.
Identities = 226/563 (40%), Positives = 312/563 (55%), Gaps = 50/563 (8%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
A+ + + L+ G+ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKYPIHTGMQ I EP G+P ER +PE R+ GY+T +GKWHLGF+R++ TP
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 197 RGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
RGF+ HFGY NG + YYD +L YS G D RR+L + G YAT+ FT
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYS-----TGLDFRRDLEPCREAEGTYATEAFTT 198
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
EA ++IE ++PLF+ L+HLA H GN ++AP+E + +F +I DP RRTYA M+
Sbjct: 199 EARKVIERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISS 258
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD SVG + AL GML NSI++ SDNGAPTV + N GSNYP+RG K + WE
Sbjct: 259 LDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVGI-----HSNAGSNYPFRGQKESPWE 313
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG++ LWSP +QQ VS Q +H DWLPTL AAG +
Sbjct: 314 GGIRSVGALWSPLLQQRGYVSNQAIHAIDWLPTLAAAAG--------------------V 353
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
+ PS + +DGL+ W SL + + ++L +D+ ++ D+ K V G+ G
Sbjct: 354 SLPS--DLRLDGLNLWPSLSASAQPQPRNLLHVLDDVFGYSSYTQDTLKYVNGSSFEGRY 411
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS--QNIFLPIS-NIDKMRSTRQQATIHC 550
D + G+ L + S+ Y L + S Q + ++ D++R R +AT C
Sbjct: 412 DHWLGE-------LEDGEEDPNSRDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC 464
Query: 551 GANPA----PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
P+ C LF+L NDPCE+ N+A+ P L + ++ R T +P
Sbjct: 465 PIQERDPEDPVYLCEPLKAAC-LFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALP 523
Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
+ V+ADP + W W
Sbjct: 524 SARVPRADVRADPALHDGHWEWW 546
Score = 80.1 bits (196), Expect = 3e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 52/82 (63%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE ++PLF+ L+HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCREAEGTYATEAFTTEARKVIERHDKNRPLFMVLSHLAVHTGNEDNPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFAHIRDPKRRTYAGMISS 258
>gi|427781895|gb|JAA56399.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 554
Score = 378 bits (970), Expect = e-102, Method: Compositional matrix adjust.
Identities = 217/541 (40%), Positives = 299/541 (55%), Gaps = 44/541 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS++IPTPN+D LA +G+ILNN Y P CTPSRA+LMTG YPI TGMQG PI
Sbjct: 47 GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP G+P R LP+YL+E GY T +GKWHLG ++ TP RGF+S +GY G
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H +S + G D N W +G Y+T +FTK A +IE++ KPL L
Sbjct: 167 YFAHTISYE-----NHTGLDFWLNKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLV 221
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ H A H + L+APQE I++F YI + NR YA MV LD SVG V+ AL GML
Sbjct: 222 ITHQATHCALERERLQAPQENIDKFPYIGEKNRTIYAGMVDALDQSVGQVVKALSDAGML 281
Query: 332 ENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
EN+II+F SDN G P E+ N R G N+P RG K T+WEGG + A +WSP + +
Sbjct: 282 ENTIIVFSSDNGGGPWGEH----NSR--GFNWPLRGAKGTVWEGGTRAAAFVWSPLLARR 335
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q+MHI+DWLPT Y+ AGG+ ++L +DG D W
Sbjct: 336 RRVSHQLMHITDWLPTFYSIAGGNAAKL----------------------VELDGHDMWR 373
Query: 451 SLLLNTPSRRNSVLINIDEK-KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L S R +L N D K ++AA+R +KLVL E G Y + P +
Sbjct: 374 PLSHGRRSPRVELLYNYDYKYTKSAALRNHRYKLVL--DETGRFADRY-KVPGGSRPRHD 430
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ + T S+ + F ++ +S R++A + CG T+ YL
Sbjct: 431 LDYLAAKSTAASVLR----DFYKKKQLNFPKSWRRKARLTCGKQKKINFSWSATS--VYL 484
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
F++ DPCE NN+AS P++ + L + L + +P+ + ++ P+ TW+PW
Sbjct: 485 FDVVTDPCELNNLASKLPNVVASLKKRLDAYGAAALPERNNDTYDPRSFPENHGGTWAPW 544
Query: 630 I 630
+
Sbjct: 545 V 545
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
N W +G Y+T +FTK A +IE++ KPL L + H A H + L+APQE I+
Sbjct: 185 NKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENID 244
Query: 64 QFQYITDPNRRTYAALTKS 82
+F YI + NR YA + +
Sbjct: 245 KFPYIGEKNRTIYAGMVDA 263
>gi|241378410|ref|XP_002409154.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497456|gb|EEC06950.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 511
Score = 377 bits (968), Expect = e-102, Method: Compositional matrix adjust.
Identities = 213/501 (42%), Positives = 291/501 (58%), Gaps = 44/501 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTPN+D LA +GIILNN Y QP CTPSRA+LMTG YPIHTGMQ +
Sbjct: 2 GWDDVSFHGSAQIPTPNMDTLAADGIILNNYYVQPACTPSRAALMTGLYPIHTGMQHGVL 61
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP G+PL+ +P+YL+ LGY T +GKW+LG ++ YTP +RGF+S +GY + V
Sbjct: 62 SPAEPYGLPLSVSIMPQYLKNLGYETHIVGKWNLGNYKLSYTPTFRGFDSFYGYYSAVED 121
Query: 212 YYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY+H +L D + G D N + G Y+T L+T+ LIE++ V KP FL
Sbjct: 122 YYNHTVLWDNQT------GLDFWLNTQPLRNVSGIYSTQLYTERTKFLIENRDVSKPFFL 175
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YL + A H GN L+APQE I++F YI + NR +A M LD+S+G V AL M
Sbjct: 176 YLPYQAVHCGNFDDPLQAPQENIDKFPYIGEENRTIFAGMTDALDESIGEVFRALFDADM 235
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+N+II+ +DNG + N G N+P RG K TLWEGG + A +WSP +++
Sbjct: 236 LDNTIIVLSADNGGLPFGFES-----NRGYNFPLRGGKGTLWEGGCRASAFVWSPLLKRK 290
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
VS QMMHI+DW+PTLY+AAGG+ L +DG D W
Sbjct: 291 GVVSDQMMHITDWVPTLYSAAGGNPQTL----------------------GKLDGTDMWV 328
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
L + S R +L NID R +A+R ++KLVLG G D Y T ++ P +
Sbjct: 329 PLAYGSASPRYEILYNIDPIDRNSALRYTNYKLVLGIANGGAYDQRYLTTGGSR-PYGDL 387
Query: 511 NAIV-ESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ ++ +S+ Q L++ +L + RQ+AT+ CG N P + P YL
Sbjct: 388 DLLMAQSEAAQVLREYYGTGYLKFPD-----QWRQRATLTCG-NKVPKNFVSLS--PPYL 439
Query: 570 FNLGNDPCEQNNIASSRPDIS 590
F+L DPCE NNIA+S+P +S
Sbjct: 440 FDLTADPCEINNIANSQPGVS 460
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/70 (45%), Positives = 45/70 (64%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y+T L+T+ LIE++ V KP FLYL + A H GN L+APQE I++F YI + N
Sbjct: 149 GIYSTQLYTERTKFLIENRDVSKPFFLYLPYQAVHCGNFDDPLQAPQENIDKFPYIGEEN 208
Query: 73 RRTYAALTKS 82
R +A +T +
Sbjct: 209 RTIFAGMTDA 218
>gi|443701814|gb|ELU00075.1| hypothetical protein CAPTEDRAFT_177949 [Capitella teleta]
Length = 545
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 214/546 (39%), Positives = 299/546 (54%), Gaps = 47/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+S HGS +IPTPNIDALA +GI+LNN Y QP+CTPSRA+L+TGK+P+H G+Q I
Sbjct: 38 GWDDISLHGSEQIPTPNIDALAADGILLNNYYVQPICTPSRAALLTGKHPVHLGLQHNTI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ L ER LPEYL LGY + +GKWHLG+F ++TP YRGF+SHFGYLNG
Sbjct: 98 PAPSAYGLGLNERILPEYLNTLGYDSHMVGKWHLGYFTPQHTPTYRGFKSHFGYLNGCED 157
Query: 212 YYDHILSDQYSRTVELNGH--DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y DH L+ +S T+ ++G D + + G+Y+T++FT A +LI +PLF
Sbjct: 158 YLDHTLAYDFS-TLGMDGWGLDFWNDTKIHRTSFGQYSTEIFTTRAEELIRSN-TGEPLF 215
Query: 270 LYLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
LY++H A H+GN G L+AP + N+F YI RR A M+ LDDSVG + L
Sbjct: 216 LYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNYIQSDERRRLAGMISALDDSVGNITRTLH 275
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
KGML N+II+F +DNG P + + N SN+P RG K T+WEGG++ +WSP
Sbjct: 276 AKGMLNNTIIVFSTDNGGPADRFDD-----NCASNWPLRGGKRTMWEGGLRGNGFVWSPL 330
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
++ + VS M I DWLPTL AAG D +LP +++G+ QW
Sbjct: 331 LKHSNYVSEHFMSIMDWLPTLLDAAGYDLGKLPRDVEGMSQW------------------ 372
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
SL N+PS R+ +L NID +RL K+V G G+Y P
Sbjct: 373 ---KSLSENSPSPRDYLLHNIDPLTGEGGLRLGDMKVVFGGGNTKAHRGWY-------PP 422
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN-- 564
+ + S + +IF + + Q + CG P +T + C +
Sbjct: 423 EMRWGNETVSAPSHLFETDLWDIFHDLGRPPVQKKISVQ--VECGPAPPDVTTN-CKSWI 479
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
PC LFN+ DPCE +N++ +PD+ +L ++K T+ P ++ D P R
Sbjct: 480 EPC-LFNISADPCEYHNLSEDQPDMLGKLLSIVKAINATVTP-PYDNSDDPNGYPGRHGG 537
Query: 625 TWSPWI 630
W PWI
Sbjct: 538 VWKPWI 543
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 44/75 (58%), Gaps = 4/75 (5%)
Query: 11 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQY 67
+ G+Y+T++FT A +LI +PLFLY++H A H+GN G L+AP + N+F Y
Sbjct: 189 SFGQYSTEIFTTRAEELIRSN-TGEPLFLYMSHQAVHSGNPGLNGSKLQAPWKYFNKFNY 247
Query: 68 ITDPNRRTYAALTKS 82
I RR A + +
Sbjct: 248 IQSDERRRLAGMISA 262
>gi|427793479|gb|JAA62191.1| Putative arylsulfatase b, partial [Rhipicephalus pulchellus]
Length = 512
Score = 377 bits (967), Expect = e-101, Method: Compositional matrix adjust.
Identities = 220/556 (39%), Positives = 303/556 (54%), Gaps = 67/556 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S+HG +I TPNIDALA+NGI L YAQP+CTPSRA+L+TG+YPI+ G+Q I
Sbjct: 1 GWNDVSYHGCPQIRTPNIDALAWNGIRLRRYYAQPLCTPSRAALLTGRYPINMGLQHSVI 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ EPRG+PL++ LP++L +LGY T +GKWH+GFF++EYTP RGFE H G+ I
Sbjct: 61 YNEEPRGLPLSDTLLPQWLADLGYVTHHLGKWHIGFFKKEYTPTMRGFERHVGFWGAYID 120
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H Y G DMRRNL A + G Y T L TKEA+++IE+ PVDKPLFLY
Sbjct: 121 YYKH--EKAYLGPTRSPGLDMRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLY 178
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAHLA H+ L+ P + I Q++ I R YA MV LD+SVG V AL + GML
Sbjct: 179 LAHLAPHSAGPQDPLQVPDKYIEQYRDIGSTERMKYAGMVSALDESVGLVFEALAKNGML 238
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
N++ +F SDNGA N+ S++P RG K T WEGGV+ A+LWS + P
Sbjct: 239 SNTLFVFTSDNGA-----DAAGPNANYASSWPLRGQKYTPWEGGVRAAALLWSHAL--GP 291
Query: 392 RVSL-----QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
RV + HI+DWLPTLY AGGD + L +IDG+
Sbjct: 292 RVRAGGDYEHLFHITDWLPTLYQLAGGDPAHL----------------------GDIDGV 329
Query: 447 DQWSSLLLNTPSR-----RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
++L+ + + R VL+NID +A+ +K V G E G + +Y
Sbjct: 330 SHVNALVGDASPQPPQPPRTEVLVNIDPIDNYSAIIDGRFKFVAGVAERGIYEQWY---- 385
Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK-------MRSTRQQATIHCGANP 554
I+ + + S + L Q P++ + K + A I CG
Sbjct: 386 ----------PILGNVDWNSTEPLEQCEASPVARVLKSFGPEPACGPSGGAAPIDCGVRD 435
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD-ISSQLYELLKYHRRTLVPQSHEQPD 613
P T PC LF+L DPCE N++++ RP+ + S L +L +Y ++ P +
Sbjct: 436 PAKACQP-TVAPC-LFDLREDPCEYNDVSAERPELVKSLLTKLRRYEDASMPPANIPHDP 493
Query: 614 LVQADPKRFNDTWSPW 629
L +DP +++ W PW
Sbjct: 494 L--SDPGLYDNAWIPW 507
Score = 84.7 bits (208), Expect = 1e-13, Method: Compositional matrix adjust.
Identities = 46/123 (37%), Positives = 67/123 (54%), Gaps = 18/123 (14%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL A + G Y T L TKEA+++IE+ PVDKPLFLYLAHLA H+ L+ P +
Sbjct: 139 MRRNLFLARNDTGRYVTQLLTKEALEVIENHPVDKPLFLYLAHLAPHSAGPQDPLQVPDK 198
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILN 120
I Q++ I R YA + + ++ +V+ +ALA NG++ N
Sbjct: 199 YIEQYRDIGSTERMKYAGMVSALDESVGLVF------------------EALAKNGMLSN 240
Query: 121 NMY 123
++
Sbjct: 241 TLF 243
>gi|241654408|ref|XP_002411325.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503955|gb|EEC13449.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 510
Score = 375 bits (964), Expect = e-101, Method: Compositional matrix adjust.
Identities = 217/544 (39%), Positives = 301/544 (55%), Gaps = 53/544 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+SFHGS +IPTPNID LA +G+ILNN Y P CTPSRA+LMTG YPIHTGMQ I
Sbjct: 14 GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPTCTPSRAALMTGLYPIHTGMQSDII 73
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A P G+PL + LP++ R+LGY IGKWHLGFF+ Y P+ RGF++ FG+ G
Sbjct: 74 EPAAPWGLPLENKILPQHFRDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 133
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY+H + + G + L+ AW + + L T + + +P F Y
Sbjct: 134 YYNHTSGSENNE----QGVSLVATLTAAWGRIAQGVEPLLTDSPLNVYP-----QPFFCY 184
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+H A H+ + +AP + +F YI + NR YA M+ LD+SVG V+ AL GML
Sbjct: 185 FSHHAVHSALMAEPFQAPARNVLKFPYIGESNRTIYAGMLDALDESVGRVVEALDNAGML 244
Query: 332 ENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
E++II+F SDN GAP E N G N+P RG K TLWEG V+VPA +WSP+ +
Sbjct: 245 EDTIIVFSSDNGGAPCGE------ASNQGFNWPLRGAKFTLWEGSVRVPAFVWSPKFLKQ 298
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q+MHISDWLPTLYTAAGGD S L + IDG+ D W
Sbjct: 299 SRVSNQLMHISDWLPTLYTAAGGDVSALGV-IDGV---------------------DMWH 336
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY---GQTR-SNKVP 506
SL ++ S+R +L+NID + +R +KLV+G +G ++ +Y G TR +N +
Sbjct: 337 SLTHSSASKRKEILLNIDASFNISGLRYKQYKLVVGGGFDGELNDHYYFTGGTRPTNNID 396
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
L N ++ + + + + P++ + + ++ + N PM P
Sbjct: 397 ELR-NESTMARVLKKFYEKQRRSWSPLTWSENVAVDCRRDRLT--ENFVPMQPP------ 447
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
YLFN+ DPCE NN+A ++ L + L + T+VP S QPD +A P+ W
Sbjct: 448 -YLFNIEQDPCELNNLAKTKVHTLRFLMKKLDAYNSTVVP-SLFQPDDPRAYPEYHGGIW 505
Query: 627 SPWI 630
SPW+
Sbjct: 506 SPWL 509
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 20/75 (26%), Positives = 34/75 (45%), Gaps = 5/75 (6%)
Query: 5 LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
L+ AW + + L T + + +P F Y +H A H+ + +AP + +
Sbjct: 154 LTAAWGRIAQGVEPLLTDSPLNVYP-----QPFFCYFSHHAVHSALMAEPFQAPARNVLK 208
Query: 65 FQYITDPNRRTYAAL 79
F YI + NR YA +
Sbjct: 209 FPYIGESNRTIYAGM 223
>gi|194748066|ref|XP_001956470.1| GF24578 [Drosophila ananassae]
gi|190623752|gb|EDV39276.1| GF24578 [Drosophila ananassae]
Length = 542
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 224/547 (40%), Positives = 302/547 (55%), Gaps = 47/547 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G ND+SFHGSN+I TPNIDALAYNG++LN Y +CTPSRA+L+TGKYPIHTGMQ I
Sbjct: 2 GMNDVSFHGSNQILTPNIDALAYNGVLLNKHYVPNLCTPSRATLLTGKYPIHTGMQHWVI 61
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+P ER +PE RE GYST +GKWHLGF+R++ TP RGF+ H+GY NG I
Sbjct: 62 ITDEPWGLPKKERLMPELFREAGYSTHLVGKWHLGFWRQDLTPTMRGFDHHYGYYNGYID 121
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YYDH +L YS G D RR+ G YAT+ FT EA ++IE+ KPL
Sbjct: 122 YYDHQVRLLGTNYSA-----GLDFRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPL 176
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
F+ L+HLA H GN ++AP+E + +F +I DP RRTYA M+ LD SVG + AL+
Sbjct: 177 FMVLSHLAVHTGNEDNPMQAPEEEVAKFSHIKDPKRRTYAGMISSLDKSVGQTMRALKDN 236
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML N+I++ SDNGAPTV + N GSNYPYRG K + WEGG++ LWSP ++
Sbjct: 237 DMLNNTIVLLYSDNGAPTV-----GIHSNAGSNYPYRGQKESPWEGGIRSVGALWSPLLE 291
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ VS Q +H DWLPTL AAG LP +++ +DG +
Sbjct: 292 KQGYVSNQAIHAIDWLPTLAAAAG---VSLPQDLE-------------------LDGENL 329
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
W SL + + +DE ++ D+ K V G+ +G D + G +N+ L
Sbjct: 330 WPSLSEKAEPKPRKFIHVLDEVFGYSSYMSDTLKYVNGSSFDGQYDYWLGDLEANEDDPL 389
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA--NPAPMTP----SPC 562
+ E +Q + + L I +MRS +AT C A P+ P P
Sbjct: 390 S-GIYGEQVLASEVQSVLGSHGLTEDRILQMRS---EATQKCPAIEGQDPLQPHYRCEPL 445
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
PC+ F+L DPCE+ N+A P +L + ++ R+T +P + +ADP
Sbjct: 446 V-APCF-FDLAQDPCERYNLARLYPLQLEKLRQEVEEFRQTAIPSARVPFSDERADPIFH 503
Query: 623 NDTWSPW 629
N W W
Sbjct: 504 NGNWEWW 510
Score = 79.0 bits (193), Expect = 8e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 50/82 (60%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+ G YAT+ FT EA ++IE+ KPLF+ L+HLA H GN ++AP+E
Sbjct: 140 FRRDFEPNPKANGTYATEAFTSEAKRIIEEHDKSKPLFMVLSHLAVHTGNEDNPMQAPEE 199
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 200 EVAKFSHIKDPKRRTYAGMISS 221
>gi|241598569|ref|XP_002404905.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502397|gb|EEC11891.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 533
Score = 375 bits (962), Expect = e-101, Method: Compositional matrix adjust.
Identities = 222/561 (39%), Positives = 312/561 (55%), Gaps = 55/561 (9%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
+A K + ++V GW+D+SFHGS +IPTPN+D LA +G+ILNN Y Q CTPSRA+
Sbjct: 20 SASKKPPNIIFMLVDDLGWDDVSFHGSPQIPTPNMDVLAGDGVILNNYYVQHFCTPSRAA 79
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
LMTG YPIH G+Q I A+P G+PL + +PE+ +++GY T IGKWHLG+FR+EYTP
Sbjct: 80 LMTGLYPIHNGLQDFVIDVAQPYGLPLYLKVMPEFFKDMGYETHMIGKWHLGYFRKEYTP 139
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
YRGF+S +GY NG YY+H ++ S++ +++R+ T
Sbjct: 140 TYRGFDSFYGYYNGAEDYYNHSITKVISQS-----YNIRQGSVTKKRIENYIKNTELVLL 194
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
++ +PLFLYLA+ + H LEAP+E I +F YI + NR +A V L
Sbjct: 195 SLTFYLSILFSQPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAVDAL 249
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D SVG V+ L ML+N+II+ +DNG + N G N+P RG K TLWEG
Sbjct: 250 DQSVGEVLETLYEADMLDNTIIVLSTDNGGLPFGVQS-----NRGYNFPLRGAKGTLWEG 304
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLL 433
G + A +WSP + QN RVS QM+HI+DWLPTLY+AAG + + L +DG D
Sbjct: 305 GTRGSAFIWSPLLNQNRRVSDQMLHITDWLPTLYSAAGMRGNPVSLGKVDGYD------- 357
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
W+ L N PS R VL NID +A+R ++KLVLGT E+G
Sbjct: 358 --------------MWTQLSYNLPSSRYEVLYNIDPTDLNSALRYTNYKLVLGTYEDGEF 403
Query: 494 DGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQ--NIFLPISNIDKMRSTRQQATIHC 550
D + T ++ P + A+ +SK + L+ N LP R+ RQ+AT++C
Sbjct: 404 DQRFRTTGGSR-PYGDLEIAMAQSKAARVLKGFYNRGNFLLP-------RNWRQKATVNC 455
Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK-YHRRTLVPQSH 609
+N P + + P YLF+L DPCE NNIAS+ + L + L+ Y ++ P +
Sbjct: 456 VSNNGP--KNFVSQAPPYLFDLTKDPCELNNIASAEAGLVDFLKQRLQSYASSSIAPLN- 512
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
+P + P+ N TW PWI
Sbjct: 513 -KPTDPRGYPENLNGTWGPWI 532
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 34/61 (55%), Gaps = 9/61 (14%)
Query: 35 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA----ALTKSTTLTLLIV 90
+PLFLYLA+ + H LEAP+E I +F YI + NR +A AL +S L +
Sbjct: 206 QPLFLYLAYQSVHG-----PLEAPEENIMKFPYIGEENRTIFAGAVDALDQSVGEVLETL 260
Query: 91 Y 91
Y
Sbjct: 261 Y 261
>gi|241619161|ref|XP_002407085.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500931|gb|EEC10425.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 588
Score = 374 bits (961), Expect = e-101, Method: Compositional matrix adjust.
Identities = 214/539 (39%), Positives = 301/539 (55%), Gaps = 38/539 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S++G +I TPNIDALA+NGI L Y QP+CTPSRA+LMTG+YPIHTGMQ I
Sbjct: 76 GWNDVSYNGCPQIRTPNIDALAWNGIRLQRYYTQPMCTPSRAALMTGRYPIHTGMQHFVI 135
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPRG+PL + LP++L +LGY ++ +GKWHLGF+++EYTP RGF+ H G G +
Sbjct: 136 LQNEPRGLPLKFKLLPQWLGDLGYVSQMLGKWHLGFYKKEYTPTMRGFQKHIGSWGGFVD 195
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI ++ +G D R+ LS + G+Y T+ T+ A ++IE+ P++KPLFLY
Sbjct: 196 YYSHIRFNKIG--FSHSGLDFRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLY 253
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAHLA H N L+ P++ +++ I NR YA MV LD+SVG V+ AL ++GML
Sbjct: 254 LAHLAPHGANRHDPLQVPKKYSDKYHDIGHWNRTMYAGMVSALDESVGAVVEALGKRGML 313
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+++++F SDNG T N+ S++P++G K TLWEGG+ VP +WSP
Sbjct: 314 SDTVLVFSSDNGGDT-----NGENPNYASSWPFKGQKRTLWEGGIHVPGFIWSPLFSGMR 368
Query: 392 RVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ HISDWLPTLY AGGD S L +IDG+
Sbjct: 369 GFDYNNIFHISDWLPTLYQLAGGDPSDL----------------------GDIDGISHLD 406
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
SL + + R +LINID + +A+ +KLV G G +D ++ Q N ++
Sbjct: 407 SLSRRSETPRKELLINIDPIENVSAIIEGHFKLVSGVVRGGVLDEWF-QVPGNIT--WDY 463
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
N + + ++ +N ++ + S I CG P P T PC LF
Sbjct: 464 NRARQECETSLVARVLRNAGHDVACGSEDGSF--PTPIKCGKR-DPSKPCVPTVAPC-LF 519
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+L DPCE NNIA ++ +L L+ +R T VP + P QAD N+ WS W
Sbjct: 520 DLSKDPCEYNNIAEQHNEVLQRLLGKLEGYRETSVPPGN-LPSDEQADAALHNNVWSSW 577
Score = 66.2 bits (160), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 31/82 (37%), Positives = 49/82 (59%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
R+ LS + G+Y T+ T+ A ++IE+ P++KPLFLYLAHLA H N L+ P++
Sbjct: 214 FRQGLSEGREFDGQYYTEFMTEAATRVIENHPLEKPLFLYLAHLAPHGANRHDPLQVPKK 273
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+++ I NR YA + +
Sbjct: 274 YSDKYHDIGHWNRTMYAGMVSA 295
>gi|195582835|ref|XP_002081231.1| GD10911 [Drosophila simulans]
gi|194193240|gb|EDX06816.1| GD10911 [Drosophila simulans]
Length = 633
Score = 373 bits (957), Expect = e-100, Method: Compositional matrix adjust.
Identities = 204/460 (44%), Positives = 271/460 (58%), Gaps = 55/460 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS EIPTPNIDALAY+GIILN Y P+CTPSR++LMTGKYPIHTGMQ +
Sbjct: 37 GFNDVGFHGSAEIPTPNIDALAYSGIILNRYYVAPICTPSRSALMTGKYPIHTGMQHTVL 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ AEPRG+PL E+ LP+YL ELGY++ GKWHLG ++ +YTPLYRGF SH+G
Sbjct: 97 YAAEPRGLPLEEKILPQYLNELGYTSHIAGKWHLGHWKLKYTPLYRGFSSHWGL------ 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
DMR A+D G Y TD+ T+ +V++I + K PLFL
Sbjct: 151 -------------------DMRNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFL 191
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H+ N L P + + +I + RR +AAMV K+DDSVG ++ L++ M
Sbjct: 192 YVAHAACHSSNPYNPLPVPDNDVIKMSHIPNYKRRKFAAMVSKMDDSVGQIVDQLRKSNM 251
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSIIIF SDNG P + N+ SNYP +GVKNTLWEGGV+ ++WSP ++++
Sbjct: 252 LENSIIIFSSDNGGPAQGFN-----LNFASNYPLKGVKNTLWEGGVRAAGLMWSPLLKKS 306
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS Q MHI DWLPTL AAGG + L + IDG W
Sbjct: 307 QRVSNQTMHIVDWLPTLLEAAGGQPALANL-------------------SKQIDGQSIWR 347
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L+ + S R +VL NID+ +AA+ + WKLV GT G+ DG+YG L ++
Sbjct: 348 ALVQDKASPRLNVLHNIDDIWGSAAISVGDWKLVKGTNYRGSWDGWYGPAGERDPRLYDW 407
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
+ S+ ++L+ L LP S D+ R AT+ C
Sbjct: 408 QLVGRSRAGKALEGLK---MLP-SRADQQR-IPAAATVSC 442
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/81 (35%), Positives = 44/81 (54%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
MR A+D G Y TD+ T+ +V++I + K PLFLY+AH A H+ N L P
Sbjct: 152 MRNGTQVAYDLHGHYTTDVITEHSVKVIANHNATKGPLFLYVAHAACHSSNPYNPLPVPD 211
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ + +I + RR +AA+
Sbjct: 212 NDVIKMSHIPNYKRRKFAAMV 232
>gi|427779723|gb|JAA55313.1| Putative arylsulfatase b [Rhipicephalus pulchellus]
Length = 593
Score = 372 bits (954), Expect = e-100, Method: Compositional matrix adjust.
Identities = 217/575 (37%), Positives = 301/575 (52%), Gaps = 73/575 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS++IPTPN+D LA +G+ILNN Y P CTPSRA+LMTG YPI TGMQG PI
Sbjct: 47 GWDDVSFHGSSQIPTPNLDTLAADGVILNNYYVTPFCTPSRAALMTGLYPIRTGMQGMPI 106
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP G+P R LP+YL+E GY T +GKWHLG ++ TP RGF+S +GY G
Sbjct: 107 DVAEPWGLPTDVRILPQYLKEFGYETHLVGKWHLGSYKESLTPTCRGFDSFYGYYYGESD 166
Query: 212 YYDHILS-------------------------------DQYSRTVELNGH---DMRRNLS 237
Y+ H +S D ++ T+ H D N
Sbjct: 167 YFAHTISYVRHLSWAFFLTRKCXCRGFDSFYGYYYGESDYFAHTISYENHTGLDFWLNKK 226
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
W +G Y+T +FTK A +IE++ KPL L + H A H + L+APQE I++F
Sbjct: 227 PVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENIDKFP 286
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYR 356
YI + NR YA MV LD SVG V+ AL GMLEN+II+F SDN G P E+
Sbjct: 287 YIGEKNRTIYAGMVDALDQSVGQVVKALSDAGMLENTIIVFSSDNGGGPWGEHNSR---- 342
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
G N+P RG K T+WEGG + A +WSP + + RVS Q+MHI+DWLPT Y+ AGG+ +
Sbjct: 343 --GFNWPLRGAKGTVWEGGTRAAAFVWSPLLARRRRVSHQLMHITDWLPTFYSIAGGNAA 400
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK-KRTAA 475
+L +DG D W L S R +L N D K ++AA
Sbjct: 401 KL----------------------VELDGHDMWRPLSHGRRSPRVELLYNYDYKYTKSAA 438
Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
+R +KLVL E G Y + P + + + T S+ + F
Sbjct: 439 LRNHRYKLVL--DETGRFADRY-KVPGGSRPRHDLDYLAAKSTAASVLR----DFYKKKQ 491
Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE 595
++ +S R++A + CG T+ YLF++ DPCE NN+AS P++ + L +
Sbjct: 492 LNFPKSWRRKARLTCGKQKKINFSWSATS--VYLFDVVTDPCELNNLASKLPNVVASLKK 549
Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
L + +P+ + ++ P+ TW+PW+
Sbjct: 550 RLDAYGAAALPERNNDTYDPRSFPENHGGTWAPWV 584
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 31/79 (39%), Positives = 45/79 (56%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
N W +G Y+T +FTK A +IE++ KPL L + H A H + L+APQE I+
Sbjct: 224 NKKPVWSEIGTYSTSVFTKRAQYIIENRTKSKPLLLVITHQATHCALERERLQAPQENID 283
Query: 64 QFQYITDPNRRTYAALTKS 82
+F YI + NR YA + +
Sbjct: 284 KFPYIGEKNRTIYAGMVDA 302
>gi|241638976|ref|XP_002410783.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215503545|gb|EEC13039.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 527
Score = 370 bits (951), Expect = e-100, Method: Compositional matrix adjust.
Identities = 220/558 (39%), Positives = 302/558 (54%), Gaps = 54/558 (9%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
+ L + L+ GW+D+SFHGS +IPTPN+DALA +GIILNN Y QPVCTPSRA+LM
Sbjct: 19 SKLPPNIIFMLVDGLGWDDVSFHGSPQIPTPNMDALAADGIILNNYYVQPVCTPSRAALM 78
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG YPIHTG+Q + AEP G+PL + +PEY ++LGY T IGKW+LG++ +EYTP Y
Sbjct: 79 TGMYPIHTGLQHGVLLAAEPNGLPLEFKIMPEYFKDLGYETHLIGKWNLGYYMKEYTPTY 138
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW--DTVGEYATDLFTKE 254
RGF+S +G+ N Y+ H N + ++ + W + Y T L
Sbjct: 139 RGFDSFYGFYNYEEDYFTH------------NLEFVNQSNAMVWRPSSFCVYLT-LSPDT 185
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
L+ P FLYL+H + H + L+AP+E I +F YI D R YA MV L
Sbjct: 186 GFDLLSASIERGPFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDAL 245
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D+S G V+ AL GML N+II+ S NG + N GSN+P RG K LWEG
Sbjct: 246 DESAGDVLEALYEAGMLANTIIVMSSANGG-----LSSGVESNAGSNFPLRGGKGALWEG 300
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G + A +WSP + Q RVS QMMHI+DWLPTLY AA G+ P N+
Sbjct: 301 GTRASAFIWSPLLYQKNRVSDQMMHITDWLPTLYAAAQGN----PFNL------------ 344
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
+DG + W+ L N PS R +L NID +T+A+R ++KLVLG +
Sbjct: 345 ------GTLDGQNLWNHLSYNLPSPRYELLYNIDPIDQTSAIRFLNYKLVLGESSGSEEN 398
Query: 495 GYYGQTRSNKVPLLNFN-AIVESKTYQSLQQL-SQNIFLPISNIDKMRSTRQQATIHCGA 552
+ +T N P + + A+ +SK + L+ ++ I SN RQ+A ++CG
Sbjct: 399 EHRLKTPGNSRPYRDLDIAMAQSKAARVLRNFHNRGILFMDSN------WRQKAALNCGD 452
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
+ + P YLF+L DPCE NI P S L L + +T + S++Q
Sbjct: 453 DE---RENVVPGAPPYLFDLAKDPCEMRNIVLIEPGWVSILTLRLHIYEQTSIAPSNKQI 509
Query: 613 DLVQADPKRFNDTWSPWI 630
D + P+ N TW+PWI
Sbjct: 510 D-PRGFPENLNGTWAPWI 526
Score = 46.2 bits (108), Expect = 0.057, Method: Compositional matrix adjust.
Identities = 19/47 (40%), Positives = 27/47 (57%)
Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
P FLYL+H + H + L+AP+E I +F YI D R YA + +
Sbjct: 198 PFFLYLSHQSVHGASGNDPLQAPEENIAKFPYIGDERRTKYAGMVDA 244
>gi|443724925|gb|ELU12719.1| hypothetical protein CAPTEDRAFT_140387 [Capitella teleta]
Length = 542
Score = 369 bits (946), Expect = 4e-99, Method: Compositional matrix adjust.
Identities = 215/551 (39%), Positives = 304/551 (55%), Gaps = 55/551 (9%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI G+Q
Sbjct: 34 FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 93
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I+ A+P G+ L E+ LPEYL+ LGY + +GKWHLGFF EYTP+ RGF+SH+G+
Sbjct: 94 IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 153
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y HI + G D N G Y+T LFT +A QL+ + +P+FL
Sbjct: 154 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 205
Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
Y +H A H + G L AP + ++F +I D + A M+ LD+SVG + +L
Sbjct: 206 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHT 265
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
GMLEN+IIIF SDNGAP N SNYP RG K ++WEGG +VPA +WSP +
Sbjct: 266 NGMLENTIIIFSSDNGAP------HENTDVCSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 319
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+++ VS QMMHISDWLPTL AAG + S LP ++ G+ QW
Sbjct: 320 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALPGDLYGVSQW------------------- 360
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-AVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
+L N PS RNS+L N D KK+ + A+R+ KL+ Q G + G + +
Sbjct: 361 --QALQENGPSARNSMLYNADHKKQGSWALRVGDMKLL---QAEGATQLHSGWQQPFAIN 415
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDK-MRSTRQQATIHCGANPAPMTPSPC--T 563
L + E +Q+ + + + + MR++ + ++CG + +T C T
Sbjct: 416 LQSQLPSNEPNNTNYVQEEQSQVASILEELGRGMRNSTTKYFVNCGFISSGVTSHSCIDT 475
Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDI----SSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
PC L+N+ DPCE +N+A P++ + L +LLK + + +P A P
Sbjct: 476 GRPC-LYNVTADPCEYHNLAKDLPEVVERLEASLDDLLKMYIEPI-----NKPADPAAYP 529
Query: 620 KRFNDTWSPWI 630
K W PWI
Sbjct: 530 KNHGGVWMPWI 540
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
G Y+T LFT +A QL+ + +P+FLY +H A H + G L AP + ++F +I
Sbjct: 179 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 238
Query: 70 DPNRRTYAALTKS 82
D + A + +
Sbjct: 239 DVECQHMAGMISA 251
>gi|195441668|ref|XP_002068625.1| GK20324 [Drosophila willistoni]
gi|194164710|gb|EDW79611.1| GK20324 [Drosophila willistoni]
Length = 525
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 223/550 (40%), Positives = 314/550 (57%), Gaps = 52/550 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y +CTPSR++L+TGKYPI TGMQ I
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTPAMCTPSRSALLTGKYPISTGMQHYVI 65
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + E ++ GY T +GKWHLG +R +TP RGF++H GYL I
Sbjct: 66 VNDQPWGLPLNETTMAEIFQQNGYYTSLLGKWHLGMSQRNFTPTKRGFDTHLGYLGAYID 125
Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KP 267
YYD S YSR GHD R NL + D VG Y TD+ + AV+LIE V KP
Sbjct: 126 YYDQTYLQSSQNYSR-----GHDFRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKP 180
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLA HA N L+AP E ++QF+YI + + R YAAMV +LD SVG VI AL
Sbjct: 181 LFLLLSHLAPHAANDNDPLQAPMEELSQFEYIQNKSHRYYAAMVSRLDKSVGRVIEALAN 240
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+ ML NSI++F+SDNG P+V + N SNYP RG K+T WEGG++ A +WS +
Sbjct: 241 REMLRNSIVLFLSDNGGPSV-----GEHANTASNYPLRGQKHTPWEGGIRSSAAIWSTEF 295
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSNIDGL 446
++ V Q ++I D LPTL +AAG S P +++DGL+ WS+L S + +
Sbjct: 296 ERLGSVWKQRIYIGDLLPTLCSAAG--ISLDPSMHLDGLNLWSALKYGYESVEREIVHSI 353
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
D +D + A +R WK++ GT +G DG+ GQ +++V
Sbjct: 354 DA------------------VDPQPHLAYMR-GKWKIINGTTNSGQYDGWLGQRITDEVD 394
Query: 507 --LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
++++ ++ + T QL + NI ++ RQQA I C P P PC
Sbjct: 395 PRSVHYDQLIRNTTVWQQLQLLADKPQQERNITEL---RQQAQILC---PTPANAKPCMP 448
Query: 565 --GPCYLFNLGNDPCEQNNIASSRPDISS--QLYELL-KYHRRTLVPQSHEQPDLVQADP 619
PC+ F++ DPCEQNN+ + S L+E + +Y R +P + +P + DP
Sbjct: 449 LIAPCF-FDIEVDPCEQNNLYDEYRNTSIFLDLWERIDQYARNAHLPNN--KPADPKCDP 505
Query: 620 KRFNDTWSPW 629
+ + + W+ W
Sbjct: 506 RYYRNEWTWW 515
Score = 76.6 bits (187), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 51/81 (62%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQ 59
R NL + D VG Y TD+ + AV+LIE V KPLFL L+HLA HA N L+AP
Sbjct: 144 FRDNLEASHDKVGHYVTDILSDAAVELIEKHNVTAKPLFLLLSHLAPHAANDNDPLQAPM 203
Query: 60 ETINQFQYITDPNRRTYAALT 80
E ++QF+YI + + R YAA+
Sbjct: 204 EELSQFEYIQNKSHRYYAAMV 224
>gi|241844558|ref|XP_002415497.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215509709|gb|EEC19162.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 529
Score = 368 bits (945), Expect = 5e-99, Method: Compositional matrix adjust.
Identities = 221/561 (39%), Positives = 308/561 (54%), Gaps = 50/561 (8%)
Query: 77 AALTKSTTLTLLIVY------GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
A +KS+T+ I++ GW D+SF G +IPTPN+D LA GIILNN Y QP+C P
Sbjct: 10 AVFSKSSTVPPNIIFILADDLGWADVSFRGDPQIPTPNLDVLASQGIILNNYYVQPLCAP 69
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR +LM+G YPIHTG+Q EP G+P +PEYL+ LGY+T IGKWHLG+ +
Sbjct: 70 SRGALMSGLYPIHTGLQHLVPGPGEPWGLPTNLTIMPEYLKNLGYATHMIGKWHLGYHKE 129
Query: 191 EYTPLYRGFESHFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
YTP RGF+S +GYLNG YYDH IL S G D N + + Y+T+
Sbjct: 130 SYTPTRRGFDSFYGYLNGGEDYYDHTILWSNAS------GLDFWENTTPVRNEGNHYSTE 183
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
LFTK+A LI+ KP+FLY +H A H G+ LEAP I F YI + NR +A
Sbjct: 184 LFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNRSIHAG 243
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V +LD SVG V+ AL ++GML NSI+IF +DNG S Y N+P RG K
Sbjct: 244 AVYELDKSVGLVMEALNKRGMLSNSIVIFSTDNGGLPWGVEPNSGY-----NWPLRGSKE 298
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
T WEGG + A +WSP + ++ R+S QMMHI+DWLPTLY+AAGG+ S L
Sbjct: 299 TNWEGGARGAAFVWSPLLFKSGRLSNQMMHITDWLPTLYSAAGGNVSTL----------- 347
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
NIDG D W +L + S R VLINID + ++A+ + K+VLG+
Sbjct: 348 -----------GNIDGKDMWKALSEDLESPRQEVLINIDPIENSSALIVGRHKVVLGSFN 396
Query: 490 NGTMDGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
G+ D ++ P+ + ++ S+T + L+ F + + + R +A +
Sbjct: 397 EGSHDMRMKAPGGSR-PVDGLDQMMLSSRTGKVLKD-----FYNVRQLTVRPNWRNEAVV 450
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
C P + P Y F+L +DPCE NN+A+S +L + +K + + +VP +
Sbjct: 451 RCDRYAPPNNFVAAS--PPYYFDLEHDPCELNNLAASNVTELEELIKKIKEYAKGMVPPA 508
Query: 609 HEQPDLVQADPKRFNDTWSPW 629
++ D + PK + W PW
Sbjct: 509 NKPLD-PRGLPKYNHGLWGPW 528
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 33/78 (42%), Positives = 43/78 (55%), Gaps = 4/78 (5%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
Y+T+LFTK+A LI+ KP+FLY +H A H G+ LEAP I F YI + NR
Sbjct: 179 HYSTELFTKKAQSLIKHHDPAKPMFLYFSHQAVHCGDYKVELEAPALAIAHFPYIKELNR 238
Query: 74 RTYAA----LTKSTTLTL 87
+A L KS L +
Sbjct: 239 SIHAGAVYELDKSVGLVM 256
>gi|241619159|ref|XP_002407084.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500930|gb|EEC10424.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 502
Score = 364 bits (935), Expect = 7e-98, Method: Compositional matrix adjust.
Identities = 194/503 (38%), Positives = 290/503 (57%), Gaps = 44/503 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S+HGS +I TPNIDALA+NGI LN Y QP+CTPSR++ +TG YP++TGMQ I
Sbjct: 37 GWNDVSYHGSPQILTPNIDALAWNGIRLNRYYTQPLCTPSRSAFLTGCYPMNTGMQHSVI 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPRG+PL + LP++L + GY ++ +GKWHLG+++ EYTP RGF+SH G G
Sbjct: 97 LTTEPRGLPLHYKLLPQWLGDFGYVSRMLGKWHLGYYKEEYTPTMRGFQSHVGSWEGFSD 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI+ + +T ++GHD RR++ + + G Y T + T+EA+++I+D P +KPLFLY
Sbjct: 157 YYSHIMDFSW-QTWSISGHDFRRDMQKSKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLY 215
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+AHLA H+GN + L+AP + + + I P+R YA M+ LD+SVG V AL R+G+L
Sbjct: 216 IAHLAVHSGNQPEPLKAPTKYTDPYMDIGHPSRTLYAGMLSVLDESVGAVFEALGRRGIL 275
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+I++ +DNGA T+ + ++GS +P+RG K T WEGGV+VPA++WSP ++
Sbjct: 276 NETIVVLTTDNGAGI-----TTQFSSFGSGWPFRGQKGTAWEGGVRVPAVVWSPLFSEHR 330
Query: 392 RVSLQ-MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
++ + H+SDWLPT Y AGGD S L IDG+ Q
Sbjct: 331 GAVVESLFHVSDWLPTFYELAGGDPSNL----------------------GEIDGISQLD 368
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY----GQTRSNKVP 506
+L R +++NID + +A+ + +KL+ G + G D +Y TR+ K
Sbjct: 369 TLRHREKLPRTEIVLNIDPIENVSAIIVYPFKLMQGDDQGGHYDDWYPFMGNVTRTQKDA 428
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
A V + + + F+ D + ++ CG P P
Sbjct: 429 REGCEASVVYRV------MKHSGFVVTCGQDPVTYA---TSVKCGEKDPTKACKPIVK-P 478
Query: 567 CYLFNLGNDPCEQNNIASSRPDI 589
C LF+L DPCE N+I++ ++
Sbjct: 479 C-LFDLSKDPCEYNDISAENIEV 500
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 32/79 (40%), Positives = 52/79 (65%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR++ + + G Y T + T+EA+++I+D P +KPLFLY+AHLA H+GN + L+AP +
Sbjct: 176 FRRDMQKSKEDDGRYYTHVMTEEALKIIKDHPNEKPLFLYIAHLAVHSGNQPEPLKAPTK 235
Query: 61 TINQFQYITDPNRRTYAAL 79
+ + I P+R YA +
Sbjct: 236 YTDPYMDIGHPSRTLYAGM 254
>gi|157103779|ref|XP_001648126.1| arylsulfatase b [Aedes aegypti]
gi|108880481|gb|EAT44706.1| AAEL003960-PA [Aedes aegypti]
Length = 472
Score = 363 bits (933), Expect = 1e-97, Method: Compositional matrix adjust.
Identities = 207/493 (41%), Positives = 283/493 (57%), Gaps = 44/493 (8%)
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
MTGKYPIHTGMQ ++G EPRG+PLTE+ LP+YL+ELGY GKWHLG + R++TPL
Sbjct: 1 MTGKYPIHTGMQHAVLYGMEPRGLPLTEKLLPQYLKELGYKNHIYGKWHLGSYTRKHTPL 60
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTK 253
RGF+SH G+ G +DH VE N G DMRR A+D G Y T +
Sbjct: 61 ERGFDSHVGFWTGHHHMFDHT-------AVETNAWGLDMRRGFDVAYDLHGYYTTHVIRD 113
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
E+V I +P+FLY++H A H+ N L AP ET+ + I++ +RR +AAM+ +
Sbjct: 114 ESVAAIRAHNTSQPMFLYVSHAATHSANPYDFLPAPDETVERLAGISNYSRRKFAAMLTE 173
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD SVG +I AL +GML+NSII+F +DNG P + N SN+P RG KNTLWE
Sbjct: 174 LDQSVGAIIVALSERGMLDNSIIVFSTDNGGPAEGFN-----NNAASNWPLRGTKNTLWE 228
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GGV+ +WSP I++ RVS Q+MHISDWLPTL AAG D ++LP N+DG+ W
Sbjct: 229 GGVRGAGCIWSPLIEEKRRVSHQIMHISDWLPTLLDAAGYDLNKLPSNLDGISVW----- 283
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
P RN ++ +RR+ +L NID+ +AA+ +D WK+V GT G
Sbjct: 284 --PELRNGDV--------------TRRHEILHNIDDIWGSAALTVDDWKVVKGTNYEGKW 327
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
D +YG + N ++I T ++L +L + P I K+ R++AT+ C AN
Sbjct: 328 DAWYGPAGDHDTKSYNLSSIFGCPTGKALSKLK--MLPPEEEIIKL---RREATVQC-AN 381
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
+P + PC LF+L NDPCE N+A PDI L + L + T VP + P
Sbjct: 382 GTRNKCNPLES-PC-LFDLFNDPCEFENLADQYPDILEVLLKKLSDYNATAVPPGN-MPL 438
Query: 614 LVQADPKRFNDTW 626
+ DP+ + TW
Sbjct: 439 DSRGDPRFWGYTW 451
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR A+D G Y T + E+V I +P+FLY++H A H+ N L AP E
Sbjct: 92 MRRGFDVAYDLHGYYTTHVIRDESVAAIRAHNTSQPMFLYVSHAATHSANPYDFLPAPDE 151
Query: 61 TINQFQYITDPNRRTYAAL 79
T+ + I++ +RR +AA+
Sbjct: 152 TVERLAGISNYSRRKFAAM 170
>gi|443722750|gb|ELU11510.1| hypothetical protein CAPTEDRAFT_23094, partial [Capitella teleta]
Length = 549
Score = 363 bits (931), Expect = 2e-97, Method: Compositional matrix adjust.
Identities = 220/581 (37%), Positives = 306/581 (52%), Gaps = 84/581 (14%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI G+Q
Sbjct: 11 FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 70
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I+ A+P G+ L E+ LPEYL+ LGY + +GKWHLGFF EYTP+ RGF+SH+G+
Sbjct: 71 IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 130
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y HI + G D N G Y+T LFT +A QL+ + +P+FL
Sbjct: 131 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 182
Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
Y +H A H + G L AP + ++F +I D + A M+ LD+SVG + +L
Sbjct: 183 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHT 242
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
GMLEN+IIIF SDNGAP N SNYP RG K ++WEGG +VPA +WSP +
Sbjct: 243 NGMLENTIIIFSSDNGAPH------ENTDVCSSNYPLRGAKTSVWEGGTRVPAFVWSPLL 296
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+++ VS QMMHISDWLPTL AAG + S LP ++ G+
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLEAAGYNMSALP---------------------GDLYGVS 335
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-AVRLDSWKLVLGTQENGTMDG----------- 495
QW +L N PS RNS+L N D KK+ + A+R+ KL+ G
Sbjct: 336 QWQALQENGPSARNSMLYNADHKKQGSWALRVGDMKLLQAEGATQLHSGWQQPFAINLQS 395
Query: 496 -----------YYGQTRSNKVPLLNFNAIVESKTYQS---------LQQLSQNIFLPISN 535
Y + +S +L + + K +QL++ + +
Sbjct: 396 QLPSNEPNNTNYVQEEQSQVASILEEHETLAGKRRHEATFWLASTLCKQLAEPAVIQQAR 455
Query: 536 IDKMRSTRQQATIHCGANPAPMTPSPCTNG--PCYLFNLGNDPCEQNNIASSRPDI---- 589
+K RS R +CG + P C N PC L+N+ DPCE N+A P++
Sbjct: 456 QNKRRSGR--TVPNCGFASTGVAPRSCVNTGRPC-LYNVTADPCEYYNLAEYLPEVVERL 512
Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
++L +LLK R++ P + +P A PK W PWI
Sbjct: 513 ETRLDDLLK---RSIEPIN--KPADPAAYPKNHGGVWMPWI 548
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
G Y+T LFT +A QL+ + +P+FLY +H A H + G L AP + ++F +I
Sbjct: 156 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 215
Query: 70 DPNRRTYAALTKS 82
D + A + +
Sbjct: 216 DVECQHMAGMISA 228
>gi|391345592|ref|XP_003747069.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 557
Score = 361 bits (926), Expect = 8e-97, Method: Compositional matrix adjust.
Identities = 208/518 (40%), Positives = 295/518 (56%), Gaps = 47/518 (9%)
Query: 81 KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
K T + +++ GWND SFHGS EIPTPN+DALA +G+IL + YAQP+CTP+RA+L+TG
Sbjct: 78 KPTNVVMILADDMGWNDASFHGSAEIPTPNLDALASSGVILQSHYAQPMCTPTRAALLTG 137
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
YP HTGMQ I EP G+PL + LP YL E Y + +GKWHLG + P R
Sbjct: 138 LYPFHTGMQNFVIRTGEPWGLPLDYKILPHYLDEAYYHSHLVGKWHLGMHNPAFLPTARH 197
Query: 199 FESHFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
F +H GY NG I Y+ H +S + L+ H ++ + YAT LFTK AV
Sbjct: 198 FNTHVGYYNGFIDYFTHEHISPGNDSLIGLDWH-----INEENENEEGYATHLFTKRAVN 252
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
LIE+ +PLF+ L+HLA HAG +AP+E+I +F +I D NR+ YAAMV +LD S
Sbjct: 253 LIENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRKVYAAMVDELDQS 312
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
VG V+ AL RK +L++++I+F+SDNG T T N GSNYP+RG K TL+EGG +
Sbjct: 313 VGQVVEALYRKRLLDSTMIVFLSDNGGQT-----TGVMNNTGSNYPFRGQKRTLFEGGTR 367
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
V A +WS I + PR+ +MH++DWLPT+ AG P +DG D
Sbjct: 368 VSAFVWSTDIVKKPRIESGLMHVTDWLPTILKRAG---LSYPRELDGRD----------- 413
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
QWS L PS R VL N++ ++AA+R D +KL++G + ++ +
Sbjct: 414 ----------QWSMLSEGAPSARAEVLYNLNYVDQSAALRNDRFKLIVGPRNGVSLLNDW 463
Query: 498 GQTRSNKVPLLNFNAIV-ESKTYQSLQQLSQNIFLPISNIDKMRST----RQQATIHCGA 552
Q K + ++ ES T ++L++ ++ + + D + + R +A I CG+
Sbjct: 464 DQLHKGKHSRAQLDGMMDESLTARTLKE--AGLWWKMRHGDTLSADHSTWRDKAEIKCGS 521
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS 590
+ T PC LF+L NDPCE NN+ + P +S
Sbjct: 522 HDK--TLCDVHRSPC-LFDLENDPCELNNVIAKYPAVS 556
Score = 73.2 bits (178), Expect = 4e-10, Method: Compositional matrix adjust.
Identities = 34/66 (51%), Positives = 45/66 (68%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
YAT LFTK AV LIE+ +PLF+ L+HLA HAG +AP+E+I +F +I D NR+
Sbjct: 241 YATHLFTKRAVNLIENHKSTEPLFILLSHLAPHAGCKRDPFQAPRESIEKFAHIKDQNRK 300
Query: 75 TYAALT 80
YAA+
Sbjct: 301 VYAAMV 306
>gi|194871676|ref|XP_001972885.1| GG13639 [Drosophila erecta]
gi|190654668|gb|EDV51911.1| GG13639 [Drosophila erecta]
Length = 584
Score = 360 bits (924), Expect = 1e-96, Method: Compositional matrix adjust.
Identities = 214/549 (38%), Positives = 306/549 (55%), Gaps = 51/549 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + E RE GY T +GKWHLGF +R +TP RGF+ H GYL +
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHLGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPL 268
YY Q + NGHD R NL ++ D VG Y TD+ T AV+ IED +PL
Sbjct: 166 YYTQSYEQQ---SKGYNGHDFRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP E +++F+YI + R YAAMV KLD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDNPMQAPAEEVSRFEYIRNKTHRYYAAMVSKLDQSVGSVIDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSI++F+SDNG PT T+ SNYP RG K++ WEG ++ A +WS + +
Sbjct: 283 EMLQNSIVLFLSDNGGPTQGQHSTT-----ASNYPLRGQKDSPWEGALRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ V Q ++I D LPTL AA G + L++DGL+ WS+L S + +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
D + WK++ GT G DG+ G +++V
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPR 437
Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
+ + +V + +Q LQQ++ S + R QA+I C P P T PC
Sbjct: 438 AVEYEELVRNTSVWQQLQQVT-------SGERNISELRDQASIEC---PNPATGVRPCLP 487
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISS--QLYELLKYHRRTLVPQSHEQPDLVQADPK 620
PC LF++ DPCE++N+ + + ++ L+ ++ + P +++ D DP+
Sbjct: 488 LEAPC-LFDIEADPCERSNLYAEYKNSTTFLDLWARIQQFAKQAQPPNNKPAD-PDCDPR 545
Query: 621 RFNDTWSPW 629
F++ W+ W
Sbjct: 546 FFHNEWTWW 554
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
R NL ++ D VG Y TD+ T AV+ IED +PLFL L HLA HA N ++A
Sbjct: 183 FRDNLKSSHDQVGHYITDVLTDAAVKEIEDHASKNSSQPLFLLLNHLAPHAANDDNPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI + R YAA+
Sbjct: 243 PAEEVSRFEYIRNKTHRYYAAMV 265
>gi|443734861|gb|ELU18717.1| hypothetical protein CAPTEDRAFT_218441 [Capitella teleta]
Length = 500
Score = 357 bits (917), Expect = 8e-96, Method: Compositional matrix adjust.
Identities = 213/544 (39%), Positives = 284/544 (52%), Gaps = 86/544 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+S HGS EIPTPNID LA +GI+LNN Y QP+CTPSRA+LMTG++P+H G+Q I
Sbjct: 36 GWDDISLHGSQEIPTPNIDLLATDGILLNNYYVQPICTPSRAALMTGRHPVHLGLQHDVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A+P G+ L E LP+YL+ LGYST +GKWHLGF+ +E+TP RGF+SH GY G
Sbjct: 96 VWAQPYGLGLNETLLPQYLKTLGYSTHMVGKWHLGFYDKEHTPTKRGFDSHLGYYTGCED 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH + T + G D + G+Y+T++FT EA ++I + V KPLFLY
Sbjct: 156 YYDHT----WGFTKQDWGLDFWHDREVDRSAFGQYSTEVFTSEAERVIAEHDVSKPLFLY 211
Query: 272 LAHLAAHAGNAGK--HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LA A H+GN G L+AP + + F I RR +A MV LDDSV V AL KG
Sbjct: 212 LAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKSEERRVFAGMVSALDDSVSNVTKALHEKG 271
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+L ++IIIF +DNG P +Y + N+P RG K T+WEGGV+ +WSP ++
Sbjct: 272 ILNDTIIIFSTDNGGPA----NRQDYND-ACNWPLRGSKRTMWEGGVRGNGFIWSPLLEN 326
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ VS +M I DW+PT+ AAG D S+LP ++DGL QW +L SR S
Sbjct: 327 SGYVSEHLMQIVDWVPTVLEAAGFDMSKLPPSLDGLSQWKAL-----SRHES-------- 373
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
S R +L NID A+R K+V G + G+Y
Sbjct: 374 --------SPRKVLLHNIDPINGDGALRTTDMKIVFGGGDTERYPGWYEPEE-------- 417
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A++ S + + P+ P PC +
Sbjct: 418 --ALIRSPSKR---------------------------------PSSTACKPWLK-PC-V 440
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF----NDT 625
FN+ DPCE NIA PDI +L + + +T VP P V +DPK N
Sbjct: 441 FNITADPCEYYNIADFHPDILQELLQKVDEFNQTAVP-----PRNVPSDPKGLPINNNGV 495
Query: 626 WSPW 629
W PW
Sbjct: 496 WGPW 499
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/72 (41%), Positives = 44/72 (61%), Gaps = 2/72 (2%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK--HLEAPQETINQFQYITD 70
G+Y+T++FT EA ++I + V KPLFLYLA A H+GN G L+AP + + F I
Sbjct: 184 GQYSTEVFTSEAERVIAEHDVSKPLFLYLAQQAVHSGNPGNKVRLQAPWKYVKNFMGIKS 243
Query: 71 PNRRTYAALTKS 82
RR +A + +
Sbjct: 244 EERRVFAGMVSA 255
>gi|403182690|gb|EJY57566.1| AAEL017192-PA [Aedes aegypti]
Length = 1007
Score = 357 bits (916), Expect = 1e-95, Method: Compositional matrix adjust.
Identities = 216/561 (38%), Positives = 296/561 (52%), Gaps = 62/561 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFH S +I TPNID LAY+G+ILN Y P T S+ +LMTG +P+ G Q
Sbjct: 467 GWNDVSFHSSKQIFTPNIDVLAYHGVILNRHYCAPFGTASQVALMTGSHPLSVGTQSASN 526
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P + + +PEY R+ GY+T IGKW LGF R++YTP RGF+SHFG+L I
Sbjct: 527 EPDQPWTLDPELKLMPEYFRDAGYATHLIGKWGLGFSRKDYTPTQRGFDSHFGFLGPYID 586
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+DH L R G DMRRNL + G YATDLF EAV+LI + KPL L
Sbjct: 587 YWDHSLR---LRNTSTRGLDMRRNLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLV 643
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L HLA H GN ++AP E + +F YI D RR AAM+ K+D+ VG +I L + ML
Sbjct: 644 LTHLAPHTGNEDDPMQAPAEEVEKFDYIRDEKRRVLAAMISKVDEGVGQIIQTLAERDML 703
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV---------------------KNT 370
+NSI++F +DNGAPTV + N GSN+P RGV K +
Sbjct: 704 DNSIVLFYADNGAPTV-----GMHSNSGSNFPLRGVCIDNWSTLHIRCTNGCFSFQQKYS 758
Query: 371 LWEGGVKVPAILWSPQIQQNP-RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
WEG V+ A +WSP + RVS Q +H+SDWLPTL AAG I+G+
Sbjct: 759 PWEGAVRTVAAIWSPLLNLTAGRVSNQWIHVSDWLPTLAHAAG---------IEGIPI-- 807
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
S IDG +QW +L + RN V+ NIDE ++ + WK V GT
Sbjct: 808 ----------GSEIDGRNQWEALKNPAITVRNVVMNNIDELHHYSSYSRNGWKYVNGTSW 857
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
G D + G+ + L +P+ +++ + R+ AT+
Sbjct: 858 EGKFDNWMGELEDEDELSEEEYVV-------RLAGSVVGRMMPL-DLEHVARLRRAATVE 909
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
C P PC LFNL +DPCE+NN+A + DI +L E ++ +R+T V +
Sbjct: 910 CEVEGQVKACQP-MESPC-LFNLMDDPCERNNVALDQLDILEELREEVERYRQTAV-EPR 966
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
+P ++DP +N+TW+ W+
Sbjct: 967 NKPADPRSDPGFYNNTWTWWL 987
Score = 81.6 bits (200), Expect = 1e-12, Method: Compositional matrix adjust.
Identities = 39/79 (49%), Positives = 47/79 (59%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL + G YATDLF EAV+LI + KPL L L HLA H GN ++AP E
Sbjct: 604 MRRNLDVDYSVNGSYATDLFNGEAVRLIREHDQKKPLLLVLTHLAPHTGNEDDPMQAPAE 663
Query: 61 TINQFQYITDPNRRTYAAL 79
+ +F YI D RR AA+
Sbjct: 664 EVEKFDYIRDEKRRVLAAM 682
>gi|24666163|ref|NP_649020.1| CG7408, isoform B [Drosophila melanogaster]
gi|281366395|ref|NP_001163462.1| CG7408, isoform C [Drosophila melanogaster]
gi|281366397|ref|NP_001163463.1| CG7408, isoform D [Drosophila melanogaster]
gi|23093214|gb|AAF49290.2| CG7408, isoform B [Drosophila melanogaster]
gi|272455230|gb|ACZ94733.1| CG7408, isoform C [Drosophila melanogaster]
gi|272455231|gb|ACZ94734.1| CG7408, isoform D [Drosophila melanogaster]
Length = 585
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 216/550 (39%), Positives = 305/550 (55%), Gaps = 53/550 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + E RE GY T +GKWHLG +R +TP RGF+ H GYL +
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHLGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
YY Q NGHD R +L + D VG Y TDL T AV+ IED + +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP E +++F+YI++ R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSII+F+SDNG PT T+ SNYP RG KN+ WEG ++ A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTQGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ V Q ++I D LPTL AA G + L++DGL+ WS+L S + +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
D + + WK++ GT G DG+ G +++V
Sbjct: 397 -------------------DVAEPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDPR 437
Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
+ + +V + + LQQ+S NI ++ R Q+ I C P P T PC
Sbjct: 438 AVEYEELVRNTSVWLQLQQVS----FGERNISEL---RDQSRIEC---PDPATGVKPCLP 487
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADP 619
GPC LF++ DPCE++N+ + + S ++ L + Q+H +P DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWSRIQQFAKQAHPPNNKPGDPNCDP 544
Query: 620 KRFNDTWSPW 629
+ +++ W+ W
Sbjct: 545 RFYHNEWTWW 554
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
R +L + D VG Y TDL T AV+ IED + +PLFL L HLA HA N ++A
Sbjct: 183 FRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI++ R YAA+
Sbjct: 243 PAEEVSRFEYISNKTHRYYAAMV 265
>gi|195494699|ref|XP_002094950.1| GE19934 [Drosophila yakuba]
gi|194181051|gb|EDW94662.1| GE19934 [Drosophila yakuba]
Length = 591
Score = 356 bits (914), Expect = 2e-95, Method: Compositional matrix adjust.
Identities = 215/548 (39%), Positives = 300/548 (54%), Gaps = 49/548 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + E RE GY T +GKWHLGF +R +TP RGF+ HFGYL +
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGFSQRNFTPTQRGFDRHFGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
YY Q NGHD R NL + D VG Y TD+ T AV+ IED + +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDHVGRYITDVLTDAAVKEIEDHGSKNSSQPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP E +++F+YI + R YAAMV KLD SVG VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDNPMQAPAEEVSRFEYIGNKTHRYYAAMVSKLDQSVGYVIDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSI++F+SDNG PT T+ SNYP RG K++ WEG ++ A +WS + +
Sbjct: 283 EMLQNSIVLFLSDNGGPTQGQHSTT-----ASNYPLRGQKDSPWEGALRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ V Q ++I D LPTL AAG S P L++DGL+ WS+L S + +D
Sbjct: 338 RLGSVWKQQIYIGDLLPTL--AAGAGISPDPALHLDGLNLWSALKYGYESVEREIVHVID 395
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV-- 505
+ D + WK++ GT G DG+ G +++V
Sbjct: 396 E-------------------DVAMPHLSYTRGKWKVISGTTNQGLYDGWLGHRETSEVDP 436
Query: 506 -PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
+ + + ++ LQQ+S NI ++ R QA+I C + P
Sbjct: 437 RAAVYEELVRNTSVWRQLQQVS----FGERNISEL---RDQASIECPDTANGVRPCLPLE 489
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADPKR 621
PC LF++ DPCE++N+ + + S + L + QSH +P DP+
Sbjct: 490 APC-LFDIEADPCERSNLYAEYKN--STTFLDLWARIQQFAKQSHPPNNKPGDPDCDPRF 546
Query: 622 FNDTWSPW 629
+++ W+ W
Sbjct: 547 YHNEWTWW 554
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
R NL + D VG Y TD+ T AV+ IED + +PLFL L HLA HA N ++A
Sbjct: 183 FRDNLKSTHDHVGRYITDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDNPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI + R YAA+
Sbjct: 243 PAEEVSRFEYIGNKTHRYYAAMV 265
>gi|195591201|ref|XP_002085331.1| GD14733 [Drosophila simulans]
gi|194197340|gb|EDX10916.1| GD14733 [Drosophila simulans]
Length = 585
Score = 356 bits (913), Expect = 3e-95, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 303/550 (55%), Gaps = 53/550 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GS+ TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+P+ E + E RE GY T +GKWHLG +R +TP RGF+ HFGYL +
Sbjct: 106 VNDQPWGLPINETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
YY Q NGHD R NL + D VG Y TD+ T AV+ IED + +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP E +++F+YI++ R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKTHRYYAAMVSRLDKSVGSVIDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSII+F+SDNG PT T+ SNYP RG KN+ WEG ++ A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTEGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ V Q ++I D LPTL AA G + L++DGL+ WS+L S + +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
D + WK++ GT G DG+ G +++V
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHRETSEVDPR 437
Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
+ + ++ + + LQQ+S S + R Q+ I C P P T PC
Sbjct: 438 AVEYEELIRNTSVWLQLQQVS-------SGERNISELRYQSRIEC---PDPATGVKPCLP 487
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE---QPDLVQADP 619
GPC LF++ DPCE++N+ + + S ++ L + Q+H +P DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWARIQQFAKQAHSPNNKPGDPNCDP 544
Query: 620 KRFNDTWSPW 629
+ +++ W+ W
Sbjct: 545 RFYHNEWTWW 554
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
R NL + D VG Y TD+ T AV+ IED + +PLFL L HLA HA N ++A
Sbjct: 183 FRDNLKSTHDYVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI++ R YAA+
Sbjct: 243 PAEEVSRFEYISNKTHRYYAAMV 265
>gi|195166553|ref|XP_002024099.1| GL22854 [Drosophila persimilis]
gi|194107454|gb|EDW29497.1| GL22854 [Drosophila persimilis]
Length = 548
Score = 355 bits (911), Expect = 4e-95, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 295/550 (53%), Gaps = 50/550 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y +CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+P E+ + E RE GY T +GKWHLG +R +TP RGF+ H GYL +
Sbjct: 66 VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125
Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VD 265
YYD Y+R GHD R NL+ D VG Y TD+ + AV+LIE
Sbjct: 126 YYDQTYQQNGKNYAR-----GHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+PLFL L+HLA HA NA ++AP E + +F+YI + R YAAMV +LD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRYYAAMVSRLDRSVGRVIDAL 240
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
R+ ML++SI++F+SDNG PTV ++ SNYP RG KN+ WEGG++ A +WS
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSST-----ASNYPLRGQKNSPWEGGLRSSAAIWST 295
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ ++ V Q +I D LPTL AA G L +DG++ WS+L S +
Sbjct: 296 EFERLGTVWKQQTYIGDLLPTL-AAAAGIAPDPDLKLDGMNLWSALKYGYESLEREIVHN 354
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
+D +D A WK+V GT DG+ G+ + +V
Sbjct: 355 ID-------------------MDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTREV 395
Query: 506 PLLNFN---AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
+ N I + +Q LQ + + N + S RQQA I C P P
Sbjct: 396 DPRSANYEELIRNTSVWQHLQ-----LLMGSQNERNITSLRQQAQIECPEPPGATKPCLP 450
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQADP 619
PC LF++ DPCE+NN+ + + L L + + Q+H P+ DP
Sbjct: 451 LLAPC-LFDIDADPCERNNLYEDYRNTTIFLDLLQRIEQ--FASQAHAPNNMPEDPDCDP 507
Query: 620 KRFNDTWSPW 629
+ +++ W+ W
Sbjct: 508 RFYHNEWTWW 517
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPLFLYLAHLAAHAGNAGKHLEA 57
R NL+ D VG Y TD+ + AV+LIE +PLFL L+HLA HA NA ++A
Sbjct: 144 FRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSSQPLFLLLSHLAPHAANADDPMQA 203
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E + +F+YI + R YAA+
Sbjct: 204 PAEELAKFEYIRNETHRYYAAMV 226
>gi|195328499|ref|XP_002030952.1| GM25725 [Drosophila sechellia]
gi|194119895|gb|EDW41938.1| GM25725 [Drosophila sechellia]
Length = 585
Score = 355 bits (910), Expect = 5e-95, Method: Compositional matrix adjust.
Identities = 213/550 (38%), Positives = 303/550 (55%), Gaps = 53/550 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GS+ TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSDNFLTPNIDALAYSGVILNNLYVAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + E RE GY T +GKWHLG +R +TP RGF+ HFGYL +
Sbjct: 106 VNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLGLSQRNFTPTERGFDRHFGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPL 268
YY Q NGHD R NL + D VG Y TD+ T AV+ IED + +PL
Sbjct: 166 YYTQSYEQQ---NKGYNGHDFRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP E +++F+YI++ R YAAMV +LD SVG+VI AL R+
Sbjct: 223 FLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKAHRYYAAMVSRLDKSVGSVIDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSII+F+SDNG PT T+ SNYP RG KN+ WEG ++ A +WS + +
Sbjct: 283 EMLQNSIILFLSDNGGPTEGQHSTT-----ASNYPLRGQKNSPWEGALRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ V Q ++I D LPTL AA G + L++DGL+ WS+L S + +D+
Sbjct: 338 RLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLDGLNLWSALKYGYESVEREIVHVIDE 396
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP-- 506
D + WK++ GT G DG+ G +++V
Sbjct: 397 -------------------DVAVPHLSYTRGKWKVISGTTNEGLYDGWLGHRETSEVDPG 437
Query: 507 LLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP-SPC-- 562
+ + +V + + LQQ++ S + R ++ I C P P T PC
Sbjct: 438 AVEYEELVRNTSVWLQLQQVT-------SGERNISELRDKSRIEC---PDPATGVKPCLP 487
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE---QPDLVQADP 619
GPC LF++ DPCE++N+ + + S ++ L + Q+H +P DP
Sbjct: 488 LEGPC-LFDIEADPCERSNLYAEYQN--STIFLDLWERIQQFAKQAHSPNNKPGDPNCDP 544
Query: 620 KRFNDTWSPW 629
+ +++ W+ W
Sbjct: 545 RFYHNEWTWW 554
Score = 70.1 bits (170), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
R NL + D VG Y TD+ T AV+ IED + +PLFL L HLA HA N ++A
Sbjct: 183 FRDNLKSTHDHVGHYVTDVLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI++ R YAA+
Sbjct: 243 PAEEVSRFEYISNKAHRYYAAMV 265
>gi|390364993|ref|XP_003730725.1| PREDICTED: arylsulfatase I-like isoform 1 [Strongylocentrotus
purpuratus]
Length = 479
Score = 354 bits (909), Expect = 7e-95, Method: Compositional matrix adjust.
Identities = 194/493 (39%), Positives = 274/493 (55%), Gaps = 60/493 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTP+IDALA G+IL N Y P+CTP+R+++MTGK+PIHTG+Q I
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ E +P+YLR LGY T +GKWHLGF+ +E+TP+ RGFES FGY G
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFYSKEHTPIERGFESTFGYYLGQQD 159
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H Q R L G D N S G+Y+T+++T++ ++I + +PL++Y
Sbjct: 160 YFTH--ETQVKRKHTLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYIY 217
Query: 272 LAHLAAHAGN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
LAH A H+ N G+ L+AP + +F IT+ NRR +AAMV LDDS+G + L+ +
Sbjct: 218 LAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESSL 277
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++I+F +DNG P + N+ +N+P RGVK+T WEGG++ LW I++
Sbjct: 278 YNNTVIVFTTDNGGPAHGFDA-----NYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKP 332
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R S MMH+ DW+PTLY AGG+TS L ++DG+D W
Sbjct: 333 GRTSDGMMHVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWP 370
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
L PS R +L NID + +A+R+ +KLV G NG+ +Y + V
Sbjct: 371 MLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGSWSDWYPPEGESSVD---- 426
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
V+SK P+ N + + A +P PC LF
Sbjct: 427 ---VDSK--------------PVPNAFVVSCPSKPANASTNCDPK---EKPC------LF 460
Query: 571 NLGNDPCEQNNIA 583
N+ +DPCE NNIA
Sbjct: 461 NIRHDPCEFNNIA 473
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 48/71 (67%), Gaps = 1/71 (1%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYITDP 71
G+Y+T+++T++ ++I + +PL++YLAH A H+ N G+ L+AP + +F IT+
Sbjct: 190 GQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNITNE 249
Query: 72 NRRTYAALTKS 82
NRR +AA+ +
Sbjct: 250 NRRKFAAMVSA 260
>gi|198466297|ref|XP_002135151.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
gi|198150535|gb|EDY73778.1| GA23895 [Drosophila pseudoobscura pseudoobscura]
Length = 548
Score = 354 bits (908), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 214/552 (38%), Positives = 296/552 (53%), Gaps = 54/552 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y +CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 6 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAAMCTPSRAALLTGKYPINTGMQHYVI 65
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+P E+ + E RE GY T +GKWHLG +R +TP RGF+ H GYL +
Sbjct: 66 VNNQPWGLPQQEKTMAEIFRENGYYTSLLGKWHLGMSQRNFTPTQRGFDHHLGYLGAYVD 125
Query: 212 YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VD 265
YYD Y+R GHD R NL+ D VG Y TD+ + AV+LIE
Sbjct: 126 YYDQTYQQNGKNYAR-----GHDFRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSS 180
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+PLFL L+HLA HA NA ++AP E + +F+YI + R YAAMV +LD SVG VI AL
Sbjct: 181 QPLFLLLSHLAPHAANADDPMQAPAEELAKFEYIRNETHRHYAAMVSRLDRSVGRVIDAL 240
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
R+ ML++SI++F+SDNG PTV T+ SNYP RG KN+ WEGG++ A +WS
Sbjct: 241 ARQQMLQDSIVLFLSDNGGPTVGEHSTT-----ASNYPLRGQKNSPWEGGLRSSAAIWST 295
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ ++ V Q +I D LPTL AAG + P + +DG
Sbjct: 296 EFERLGTVWKQQTYIGDLLPTLAAAAG-------------------IAPDPDLK---LDG 333
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQ--TR 501
++ WS+L S ++ NID A WK+V GT DG+ G+ TR
Sbjct: 334 MNLWSALKYGYESLEREIVHNIDMDPSAPYLAYTRGKWKVVNGTTNAALFDGWLGERVTR 393
Query: 502 SNKVPLLNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
N+ ++ + +Q LQ + + N + S RQQA I C P
Sbjct: 394 EMDPRSANYEELIRNTSVWQHLQ-----LLMGSQNERNITSLRQQAQIECPEPSGATKPC 448
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQA 617
PC LF++ DPCE+NN+ + + L L + + Q+H P+
Sbjct: 449 LPLLAPC-LFDIDADPCERNNLYEDYRNTTIFLDLLQRIEQ--FASQAHAPNNMPEDPDC 505
Query: 618 DPKRFNDTWSPW 629
DP+ +++ W+ W
Sbjct: 506 DPRFYHNEWTWW 517
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 49/83 (59%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP---VDKPLFLYLAHLAAHAGNAGKHLEA 57
R NL+ D VG Y TD+ + AV+LIE +PLFL L+HLA HA NA ++A
Sbjct: 144 FRLNLNVTHDQVGHYVTDVLSDAAVELIEQHSGSNSSQPLFLLLSHLAPHAANADDPMQA 203
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E + +F+YI + R YAA+
Sbjct: 204 PAEELAKFEYIRNETHRHYAAMV 226
>gi|346464549|gb|AEO32119.1| hypothetical protein [Amblyomma maculatum]
Length = 531
Score = 353 bits (907), Expect = 1e-94, Method: Compositional matrix adjust.
Identities = 212/536 (39%), Positives = 302/536 (56%), Gaps = 53/536 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D S HGS +IPTPN+DALA G++LNN Y QP+C+PSR +LMTG YP H G++ P +
Sbjct: 37 GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRGALMTGLYPAHNGIRMPLV 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GA+ G+PL + LPE+L++LGY T +GKWHLG+F YTP YRGF++ FG+ NG I
Sbjct: 97 -GAQVAGLPLQFKILPEHLKDLGYETHIVGKWHLGYFNLNYTPTYRGFDTFFGFHNGPID 155
Query: 212 YYDHILSDQYSRTVELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY I+ E GH D S YAT A +I ++ KPL
Sbjct: 156 YYRGIM--------EQEGHVGLDFWNGTSALPLKERTYATARLQNHAKSIIANRNTSKPL 207
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLAH A H+ + + L+AP E +F YI D +R+ A M+ LD SVG ++ L++
Sbjct: 208 FLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSRKILAGMMDALDQSVGELVEELEKA 267
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+LE++I++F +DNG + T N G N+P RG+K T+WEGG++ A +WSP+++
Sbjct: 268 GILEDTILVFSTDNGG----HPYTRADPNRGFNWPLRGIKGTVWEGGIRGTAFIWSPRLE 323
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ RVS Q+MHISDWLPTLY+AAGGD L PL DG+ D
Sbjct: 324 VSRRVSTQLMHISDWLPTLYSAAGGDVQTLGPL--DGI---------------------D 360
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
W SL PS R VL+ + + AA+R + K+VLGT + Y + P
Sbjct: 361 MWQSLSTGGPSPRIEVLLEFNNETDRAALRYLNHKIVLGTFSDEVSKRY--EVIGGVRPG 418
Query: 508 LNFNAIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
++ +A+ SK + L++ N L + DK S R++AT+ C + S +
Sbjct: 419 IDLDALQRNSKAGRVLKRFYGNPQLFLHGFDK--SWRRRATVRCNEHGRTNFNS---SSK 473
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
+LF+L +DPCE N+A S + + L+ L + RTL P+ ++ + DPK F
Sbjct: 474 VFLFDLESDPCELRNLADSNVTLLTDLFVKLGAYLRTLRPEINK-----EIDPKGF 524
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 38/68 (55%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
YAT A +I ++ KPLFLYLAH A H+ + + L+AP E +F YI D +R+
Sbjct: 185 YATARLQNHAKSIIANRNTSKPLFLYLAHQAVHSVYSPEFLQAPVENTKKFPYIRDSSRK 244
Query: 75 TYAALTKS 82
A + +
Sbjct: 245 ILAGMMDA 252
>gi|195436072|ref|XP_002066002.1| GK11604 [Drosophila willistoni]
gi|194162087|gb|EDW76988.1| GK11604 [Drosophila willistoni]
Length = 567
Score = 352 bits (903), Expect = 4e-94, Method: Compositional matrix adjust.
Identities = 215/571 (37%), Positives = 301/571 (52%), Gaps = 47/571 (8%)
Query: 75 TYAALTKSTTLT----LLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
T K+TT T ++I+ G++D+SF G E TPNIDALAY+G IL+N+YA
Sbjct: 14 TICCRVKATTTTKRPNIIIILADDMGFDDVSFRGGREFLTPNIDALAYHGRILDNLYAPA 73
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+CTPSR +L++G+YP HTG Q I EP G+ L +PE +E GYST IGKWHLG
Sbjct: 74 MCTPSRGALLSGRYPAHTGTQHFVISNEEPWGLTLNATLMPEIFKEAGYSTNLIGKWHLG 133
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
F EYTP RGF+ H+GY I YY Q G+D RRN+ G Y
Sbjct: 134 FASPEYTPTRRGFDYHYGYWGAYIDYYQR--RSQMPVANYSMGYDFRRNMDLECQNRGVY 191
Query: 247 ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
TDL T+EA +I E ++ FL L+HLA H N L+AP+E I +F YI DP RR
Sbjct: 192 ITDLLTQEAEHVIREKAAANETFFLMLSHLATHTANDNDPLQAPEEEIRKFAYIKDPRRR 251
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
YAAMV KLD SVG +I+ L + LENSI+IF SDNGAP+V + N GSN+P R
Sbjct: 252 KYAAMVAKLDQSVGRIITTLAQTEQLENSIVIFYSDNGAPSV-----GQFANTGSNWPLR 306
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
G KNT WEGG++V +WS Q+Q Q ++++DWLPTL AAG D L +DGL
Sbjct: 307 GQKNTPWEGGIRVAGAIWSTQLQARGNKFTQPIYVADWLPTLAHAAGIDLPESSLQLDGL 366
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
D W L S+ +++ +L +D+ +++ L WK +
Sbjct: 367 DLWPQL---AGSKDAAHVP----------------REILHALDDVWHVSSMLLGHWKYIN 407
Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQ 545
GT +GT D + + N +VE + + + L+ + K+ S R
Sbjct: 408 GTTASGTYDQVLTYRELDDLDPRNSRYVVEVRNSPASRALAHFDLQRLGQ-QKITSLRHS 466
Query: 546 ATIHCGANPAPMTP--SPCTNGPCYLFNLGNDPCEQNNIASSRPD--ISSQLYELLKYHR 601
A++ CG P C L+++ DPCEQNN+ S+ + L + L+ R
Sbjct: 467 ASVRCGNLQRGCNPLVEEC------LYDIAADPCEQNNLIYSKQHEAVLMALRKRLQEFR 520
Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPWIYR 632
+ +P + +P + Q+DP TW ++ +
Sbjct: 521 LSALPPGN-RPSMRQSDPSLHECTWDNFLVK 550
Score = 71.2 bits (173), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/81 (44%), Positives = 46/81 (56%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T+EA +I E ++ FL L+HLA H N L+AP+
Sbjct: 177 FRRNMDLECQNRGVYITDLLTQEAEHVIREKAAANETFFLMLSHLATHTANDNDPLQAPE 236
Query: 60 ETINQFQYITDPNRRTYAALT 80
E I +F YI DP RR YAA+
Sbjct: 237 EEIRKFAYIKDPRRRKYAAMV 257
>gi|390364995|ref|XP_798154.3| PREDICTED: arylsulfatase I-like isoform 2 [Strongylocentrotus
purpuratus]
Length = 476
Score = 352 bits (902), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 191/493 (38%), Positives = 274/493 (55%), Gaps = 63/493 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTP+IDALA G+IL N Y P+CTP+R+++MTGK+PIHTG+Q I
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ E +P+YLR LGY T +GKWHLGFF+ TP +RGFES++GY G+
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGYYGGMQD 159
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H ++ L G D N S G+Y+T+++T++ ++I + +PL++Y
Sbjct: 160 YFTHESTEH-----TLTGFDFHVNGSIYKPVFGQYSTEIYTEKTQEIIRNHNPQEPLYIY 214
Query: 272 LAHLAAHAGN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
LAH A H+ N G+ L+AP + +F IT+ NRR +AAMV LDDS+G + L+ +
Sbjct: 215 LAHQAVHSANYNGQRLQAPYKYYERFPNITNENRRKFAAMVSALDDSLGNITQTLKESSL 274
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++I+F +DNG P + N+ +N+P RGVK+T WEGG++ LW I++
Sbjct: 275 YNNTVIVFTTDNGGPAHGFDA-----NYANNWPLRGVKDTTWEGGLRGAGFLWGALIEKP 329
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R S MMH+ DW+PTLY AGG+TS L ++DG+D W
Sbjct: 330 GRTSDGMMHVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWP 367
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
L PS R +L NID + +A+R+ +KLV G NG+ +Y + V
Sbjct: 368 MLSRAEPSPREEILHNIDPVRNVSAIRIGDYKLVQGQNYNGSWSDWYPPEGESSVD---- 423
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
V+SK P+ N + + A +P PC LF
Sbjct: 424 ---VDSK--------------PVPNAFVVSCPSKPANASTNCDPK---EKPC------LF 457
Query: 571 NLGNDPCEQNNIA 583
N+ +DPCE NNIA
Sbjct: 458 NIRHDPCEFNNIA 470
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 48/71 (67%), Gaps = 1/71 (1%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYITDP 71
G+Y+T+++T++ ++I + +PL++YLAH A H+ N G+ L+AP + +F IT+
Sbjct: 187 GQYSTEIYTEKTQEIIRNHNPQEPLYIYLAHQAVHSANYNGQRLQAPYKYYERFPNITNE 246
Query: 72 NRRTYAALTKS 82
NRR +AA+ +
Sbjct: 247 NRRKFAAMVSA 257
>gi|194748074|ref|XP_001956474.1| GF24576 [Drosophila ananassae]
gi|190623756|gb|EDV39280.1| GF24576 [Drosophila ananassae]
Length = 583
Score = 351 bits (901), Expect = 5e-94, Method: Compositional matrix adjust.
Identities = 214/551 (38%), Positives = 303/551 (54%), Gaps = 54/551 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF GSN TPNIDALAY+G+ILNN+Y P+CTPSRA+L+TGKYPI+TGMQ I
Sbjct: 46 GFDDVSFRGSNNFLTPNIDALAYSGVILNNLYTAPMCTPSRAALLTGKYPINTGMQHYVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL E + + R GY T IGKWHLG +R YTP RGF+ H GYL +
Sbjct: 106 VNDQPWGLPLNETTMADIFRGNGYRTSLIGKWHLGMSQRNYTPTLRGFDYHLGYLGAYVD 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPL 268
YY+ +Q S+ GHD R NL + V +Y TD+ T AV+ I+D KPL
Sbjct: 166 YYNQSY-EQVSKGYR--GHDFRENLKPNHEHVDKYVTDILTDAAVREIDDHAAKNNSKPL 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL L HLA HA N ++AP + ++ F+YI D R YAAMV +LD SVG V+ AL R+
Sbjct: 223 FLLLNHLAPHAANDADPMQAPADELSGFEYIRDETHRYYAAMVSRLDKSVGHVVDALARQ 282
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
ML+NSI++F+SDNG PT+ T+ SNYP RG KN+ WEGG++ A +WS + +
Sbjct: 283 DMLQNSILLFLSDNGGPTLGEHATT-----ASNYPLRGQKNSPWEGGIRSSAAIWSTEFE 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ V Q ++I D LPTL AAG + P R +DGL+
Sbjct: 338 RLGSVWKQQIYIGDLLPTLAAAAG-------------------ISPDPDLR---LDGLNL 375
Query: 449 WSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
WS+L S ++ IDE+ + A WK+V GT +G DG+ G+ + +V
Sbjct: 376 WSALKYGYESVEREIVHVIDEEAPEPHLAYTRGKWKVVSGTTNSGRYDGWLGERVTTEVD 435
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNID----KMRSTRQQATIHCGANPA-PMTPSP 561
S+ Y++L Q + +++L + + + RQQA I C A TP
Sbjct: 436 -------PRSQEYEALIQ-NTSVWLQLQKVSGGERNISGLRQQARIECPATTEDEHTPCL 487
Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH---EQPDLVQAD 618
PC LF++ DPCEQNN+ + + +++ L + Q+H +P +
Sbjct: 488 PLEAPC-LFDIEADPCEQNNLYEEYKNTT--VFQDLWQRVQQFTAQAHPPNNKPGDPSCN 544
Query: 619 PKRFNDTWSPW 629
P +++ W+ W
Sbjct: 545 PAFYHNEWTWW 555
Score = 64.7 bits (156), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 34/83 (40%), Positives = 46/83 (55%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
R NL + V +Y TD+ T AV+ I+D KPLFL L HLA HA N ++A
Sbjct: 183 FRENLKPNHEHVDKYVTDILTDAAVREIDDHAAKNNSKPLFLLLNHLAPHAANDADPMQA 242
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P + ++ F+YI D R YAA+
Sbjct: 243 PADELSGFEYIRDETHRYYAAMV 265
>gi|332016485|gb|EGI57378.1| Arylsulfatase I [Acromyrmex echinatior]
Length = 502
Score = 350 bits (899), Expect = 9e-94, Method: Compositional matrix adjust.
Identities = 200/504 (39%), Positives = 286/504 (56%), Gaps = 57/504 (11%)
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
GMQG I G EPRG+PL+ R LPE+LR LGY+TK IGKWHLG+ ++TPL+RGF+S G
Sbjct: 2 GMQGESIQGGEPRGLPLSVRILPEHLRGLGYTTKMIGKWHLGYHTPQHTPLHRGFDSFLG 61
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ N +SYYD+ S Q ++G+DM R + A+ +Y TDLFT EAV++I+
Sbjct: 62 FYNSHVSYYDYKYSYQ-----NMSGYDMHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDP 116
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETIN---QFQYITDPNRRTYAAMVKKLDDSVGTV 321
+PL+L ++HLA HA LE PQ+ + +F +I + NRR YA MV KLDDSVG +
Sbjct: 117 SRPLYLQISHLAVHAP-----LENPQDYDHYDKRFMHIVEQNRRKYAKMVSKLDDSVGLI 171
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
++ L +GML +S+I+F++DNGA V +RN+GSN+P RG+K TL+EGGV+ A+
Sbjct: 172 VNTLGSRGMLRDSVILFLTDNGAAPV-----GKFRNYGSNFPLRGMKYTLYEGGVRGVAV 226
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
LWSP++ + RV +MHI+DWLPTLY AA GD L
Sbjct: 227 LWSPRLSKAARVCNNLMHITDWLPTLYAAADGDLRDL----------------------G 264
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
IDG++QW L P R+ +L+NIDE +T +KL+ G+ E+G DG++
Sbjct: 265 KIDGVNQWRMLNDGYPIVRDKLLLNIDEISKTEGAIYKQYKLLRGSIEDGHYDGHHRHDL 324
Query: 502 SNKVPLL----------NFNAIVESKTYQSL-----QQLSQNIFLPISNIDKMRSTRQQA 546
+L N V T L Q ++ ++ P++ M R +A
Sbjct: 325 ERGYNILPPFFGNNSRKNIQENVPPYTETILISPVSQSITYHLGGPVTQPSTMVQLRNEA 384
Query: 547 TIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
T+ C N + + C C LF++ NDPCE NIA P I+ +L ++L+ + R +
Sbjct: 385 TVSCRFNVSYYSFVTCNVTEC-LFDIENDPCETKNIAEQYPRIARELDQILEQYARNVAM 443
Query: 607 QSHEQPDLVQADPKRFNDTWSPWI 630
Q D + ADP+R N+TW PWI
Sbjct: 444 QIKPSVDWL-ADPRRTNNTWEPWI 466
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 32/83 (38%), Positives = 48/83 (57%), Gaps = 8/83 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M R + A+ +Y TDLFT EAV++I+ +PL+L ++HLA HA LE PQ+
Sbjct: 84 MHRGDAPAYGLTDKYVTDLFTDEAVRIIQTHDPSRPLYLQISHLAVHAP-----LENPQD 138
Query: 61 TIN---QFQYITDPNRRTYAALT 80
+ +F +I + NRR YA +
Sbjct: 139 YDHYDKRFMHIVEQNRRKYAKMV 161
>gi|195328473|ref|XP_002030939.1| GM24306 [Drosophila sechellia]
gi|194119882|gb|EDW41925.1| GM24306 [Drosophila sechellia]
Length = 554
Score = 348 bits (893), Expect = 5e-93, Method: Compositional matrix adjust.
Identities = 207/546 (37%), Positives = 290/546 (53%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFVTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE +E GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+D ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI DPNRR YAAM+ KLD SVG +IS L
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIISTLSS 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
L+NSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS ++
Sbjct: 273 TDQLDNSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AAG + L +DG+D W L
Sbjct: 328 QARGSIFSQPLYVADWLPTLSRAAGIELDS-SLKLDGIDLWPEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R +A+++ WK V GT +G D + +
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTADGRYDSVLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLTALRRRVR-ELRASASRPGNRASMPEADPTLHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+D ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255
>gi|195021979|ref|XP_001985494.1| GH14469 [Drosophila grimshawi]
gi|193898976|gb|EDV97842.1| GH14469 [Drosophila grimshawi]
Length = 560
Score = 347 bits (890), Expect = 1e-92, Method: Compositional matrix adjust.
Identities = 217/565 (38%), Positives = 298/565 (52%), Gaps = 90/565 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+S G+ E TPNIDALAY+G +L+ +YA +CTPSR +L++GKYPIHTG Q I
Sbjct: 44 GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHSVI 103
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L +PE R+ GYST +GKWHLGF R EYTP YRGF+ H+GY G I
Sbjct: 104 LNEEPWGLALNATLMPEIFRDAGYSTNLVGKWHLGFVRPEYTPTYRGFDYHYGYWGGYID 163
Query: 212 YY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKP 267
YY + SD YS G+D RRN+ G Y TDL T EA ++I +QP ++P
Sbjct: 164 YYQRRSQMPSDNYSM-----GYDFRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQP 218
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLA H GN L+AP+E + +F +I DPNRR YAAMV KLD+SVG VI+AL R
Sbjct: 219 LFLILSHLAPHTGNEIDPLQAPEEELQKFVHINDPNRRLYAAMVSKLDESVGRVITALAR 278
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P +G K++ WEGG++V +WSP +
Sbjct: 279 TDQLENSIVIFYSDNGAPSVGI-----FANTGSNWPLKGQKDSPWEGGLRVAGAIWSPLL 333
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ + Q +++ D+LPTL AAG + L++DG+D W L+ P
Sbjct: 334 KARGNLFTQPIYVGDFLPTLAHAAGIELDS-SLHLDGIDLWPQ--LSGPK---------- 380
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD------------- 494
+ P +L +D+ R ++ L +K V GT G D
Sbjct: 381 -------DAPHVPREILHQLDDVGRMQSLLLGQYKYVNGTTSKGQYDQLLTYRKLDDLDP 433
Query: 495 --GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-- 550
YG N +A+ + +QQ++Q D++ + A + C
Sbjct: 434 RASIYGMAVRN-------SAVSKELASFDIQQITQ---------DRIDILSRSAAVICSD 477
Query: 551 ---GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR------ 601
G N C L++L DPCEQNN+A S+ Q E+L R
Sbjct: 478 MQRGCN---ALLEEC------LYDLSTDPCEQNNLAYSK-----QHSEILAIMRQRVQQL 523
Query: 602 RTLVPQSHEQPDLVQADPKRFNDTW 626
R+ +P+ ADP TW
Sbjct: 524 RSGAVAPGNRPNERDADPSLHECTW 548
Score = 80.5 bits (197), Expect = 2e-12, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 52/81 (64%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I +QP ++PLFL L+HLA H GN L+AP+
Sbjct: 182 FRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFLILSHLAPHTGNEIDPLQAPE 241
Query: 60 ETINQFQYITDPNRRTYAALT 80
E + +F +I DPNRR YAA+
Sbjct: 242 EELQKFVHINDPNRRLYAAMV 262
>gi|195494733|ref|XP_002094965.1| GE22117 [Drosophila yakuba]
gi|194181066|gb|EDW94677.1| GE22117 [Drosophila yakuba]
Length = 565
Score = 347 bits (889), Expect = 2e-92, Method: Compositional matrix adjust.
Identities = 208/546 (38%), Positives = 289/546 (52%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE +E GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+ ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKGHAGKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI DPNRR YAAM+ KLD SVG ++S+L
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIMSSLAA 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AAG L +DG+D W+ L
Sbjct: 328 QARGSIFSQPLYVADWLPTLCRAAGIKLDS-TLKLDGIDLWAEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R AA++L WK V GT G D + +
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSRP--DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDISTDPCEQNNLVYSEKHSDVLAALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 79.0 bits (193), Expect = 7e-12, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 50/80 (62%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+ ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMELECRDRGVYVTDLLTAEAERLIKGHAGKEQPLFLMLSHLAAHTANEDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255
>gi|386771363|ref|NP_730304.2| CG32191 [Drosophila melanogaster]
gi|229368437|gb|ACQ59088.1| MIP05773p [Drosophila melanogaster]
gi|383291992|gb|AAN11683.2| CG32191 [Drosophila melanogaster]
Length = 564
Score = 345 bits (884), Expect = 5e-92, Method: Compositional matrix adjust.
Identities = 207/546 (37%), Positives = 291/546 (53%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE +E GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+D ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI DPNRR YAAM+ KLD SVG +I+AL
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDPNRRKYAAMISKLDQSVGRIITALSS 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS +
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWSSGL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AA + L +DG+D W L
Sbjct: 328 QARGSIFRQPLYVADWLPTLSRAADIELDS-SLKLDGIDLWPEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R +A+++ WK V GT +G D + +
Sbjct: 371 ---SGSADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTASGRYDSVLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
+ V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RDSRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ +DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDILSDPCEQNNLVYSERHSDVLTALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 82.0 bits (201), Expect = 9e-13, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+D ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMELECRDRGVYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFSYIKDPNRRKYAAM 255
>gi|194871740|ref|XP_001972898.1| GG15780 [Drosophila erecta]
gi|190654681|gb|EDV51924.1| GG15780 [Drosophila erecta]
Length = 565
Score = 344 bits (883), Expect = 7e-92, Method: Compositional matrix adjust.
Identities = 207/546 (37%), Positives = 287/546 (52%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE + GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNEEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+D ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI DPNRR YAA++ KLD SVG ++S L
Sbjct: 213 LFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLDQSVGRIMSTLAA 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AAG + L +DG+D W L
Sbjct: 328 QARGSIFSQPLYVADWLPTLSHAAGIELDS-SLKLDGIDLWPEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R AA++L WK V GT G D + +
Sbjct: 371 ---SGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLAALRRRVR-ELRASASRPGNRASMPEADPALHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+D ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFAYIKDPNRRKYAAI 255
>gi|195166561|ref|XP_002024103.1| GL22855 [Drosophila persimilis]
gi|194107458|gb|EDW29501.1| GL22855 [Drosophila persimilis]
Length = 559
Score = 344 bits (883), Expect = 8e-92, Method: Compositional matrix adjust.
Identities = 217/563 (38%), Positives = 297/563 (52%), Gaps = 72/563 (12%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
A+ + + L+ G+ND+SFHGSN+I TPNIDALAYNGI+LN Y +CTPSRA+L+
Sbjct: 24 ASAKPNIIIILIDDMGFNDVSFHGSNQILTPNIDALAYNGILLNRHYVPNLCTPSRATLL 83
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKYPIHTGMQ I EP G+P ER +PE R+ GY+T +GKWHLGF+R++ TP
Sbjct: 84 TGKYPIHTGMQHFVIVTDEPWGLPRQERLMPELFRDAGYATHLVGKWHLGFWRKDLTPTM 143
Query: 197 RGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
RGF+ HFGY NG + YYD +L YS G D RR+L + G YAT+ FT
Sbjct: 144 RGFDHHFGYYNGYMDYYDQTVRMLDRNYS-----TGLDFRRDLEPCREAEGTYATEAFTT 198
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
EA ++IE +PLF+ L+HLA H GN ++AP+E + +F +I DP RRTYA M+
Sbjct: 199 EARKVIERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEEEVAKFAHIRDPKRRTYAGMISS 258
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD SVG + AL GML NSI++ SDNGAPTV + N GSNYP+RG
Sbjct: 259 LDKSVGQTMRALADNGMLNNSIVLLYSDNGAPTVGI-----HSNAGSNYPFRG------- 306
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
Q VS Q +H DWLPTL AAG +
Sbjct: 307 ---------------QRGYVSNQAIHAIDWLPTLAAAAG--------------------V 331
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
+ PS + +DGL+ W SL + + ++L +D+ ++ D+ K V G+ G
Sbjct: 332 SLPS--DLRLDGLNLWPSLSASAQPQPRNLLHVLDDVFGYSSYTQDTLKYVNGSSFEGRY 389
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLS--QNIFLPIS-NIDKMRSTRQQATIHC 550
D + G+ L + SK Y L + S Q + ++ D++R R +AT C
Sbjct: 390 DHWLGE-------LEDGEEDPNSKDYGQLVRASEVQAVLGSVAPTDDRIRQMRDEATQRC 442
Query: 551 GANPA----PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
P+ C LF+L NDPCE+ N+A+ P L + ++ R T +P
Sbjct: 443 PIQERDPEDPVYLCEPLKAAC-LFDLANDPCERYNLANLYPLQLQLLAQEVEELRLTALP 501
Query: 607 QSHEQPDLVQADPKRFNDTWSPW 629
+ V+ADP + W W
Sbjct: 502 SARVPRADVRADPALHDGHWEWW 524
Score = 79.3 bits (194), Expect = 5e-12, Method: Compositional matrix adjust.
Identities = 36/82 (43%), Positives = 51/82 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L + G YAT+ FT EA ++IE +PLF+ L+HLA H GN ++AP+E
Sbjct: 177 FRRDLEPCREAEGTYATEAFTTEARKVIERHDKSRPLFMVLSHLAVHTGNEDNPMQAPEE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I DP RRTYA + S
Sbjct: 237 EVAKFAHIRDPKRRTYAGMISS 258
>gi|198466274|ref|XP_001353949.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
gi|198150525|gb|EAL29685.2| GA16747 [Drosophila pseudoobscura pseudoobscura]
Length = 575
Score = 344 bits (882), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 197/499 (39%), Positives = 280/499 (56%), Gaps = 41/499 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALA++G IL+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 48 GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L +PE ++ GYST IGKWHLGF R EYTP RGF+ H+GY I
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
YY S +R L G+D RRN+ G Y TDL T EA ++I E + ++PLFL
Sbjct: 168 YYQR-RSKMPARNYSL-GYDFRRNMELECRDRGVYVTDLLTNEAERVIREREGQEEPLFL 225
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+HLA H N L+AP+E I +F YI DPNRR YAAMV KLD SVG ++SAL G
Sbjct: 226 VLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQ 285
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS Q+Q
Sbjct: 286 LENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSAQLQAR 340
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ Q ++++DWLP+L AAG + LP ++ +DG+D W
Sbjct: 341 GNIFTQPIYVADWLPSLAHAAGIE---LPHTLE-------------------LDGIDLWP 378
Query: 451 SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
L ++ P +L +D++ R + ++ WK + GT G D + +
Sbjct: 379 QLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTYRELDDLDPR 438
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNGP 566
+V + + + L+++ ++ ++ + R+ A + CG P C
Sbjct: 439 ESRYVVTVRNSPTSRALARHDIRRMTQ-QRVMTLRRSAAVRCGDLQRGCNPLVEEC---- 493
Query: 567 CYLFNLGNDPCEQNNIASS 585
++++ +DPCE+NN+ S
Sbjct: 494 --VYDIWDDPCEKNNLVYS 510
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I E + ++PLFL L+HLA H N L+AP+
Sbjct: 186 FRRNMELECRDRGVYVTDLLTNEAERVIREREGQEEPLFLVLSHLATHTANEDDPLQAPE 245
Query: 60 ETINQFQYITDPNRRTYAALT 80
E I +F YI DPNRR YAA+
Sbjct: 246 EEIRKFAYIKDPNRRKYAAMV 266
>gi|195591175|ref|XP_002085318.1| GD12374 [Drosophila simulans]
gi|194197327|gb|EDX10903.1| GD12374 [Drosophila simulans]
Length = 554
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 206/546 (37%), Positives = 288/546 (52%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE +E GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNEEPWALTLNATLMPEIFKEAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+D ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI D NRR YAAM+ KLD SVG +IS L
Sbjct: 213 LFLMLSHLAAHTANEDDPLQAPEEEIQKFSYIKDSNRRKYAAMISKLDQSVGRIISTLSS 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +W+ ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSV-----GMFSNTGSNFPLRGQKNTPWEGGVRVAGAIWNSKL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AAG + L +DG+D W L
Sbjct: 328 QARGSIFSQPLYVADWLPTLSRAAGIELDS-SLKLDGIDLWPEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R +A+++ WK V GT G D + +
Sbjct: 371 ---SGTADAPHVPREILHILDDVWRLSALQMGQWKYVNGTTAAGRYDSVLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISLTRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLTALRRRVQ-ELRASASRPGNRASMPEADPTLHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 78.6 bits (192), Expect = 9e-12, Method: Compositional matrix adjust.
Identities = 40/80 (50%), Positives = 50/80 (62%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+D ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMDLECRDRGVYVTDLLTTEAERLIKDHADKEQPLFLMLSHLAAHTANEDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI D NRR YAA+
Sbjct: 236 EEIQKFSYIKDSNRRKYAAM 255
>gi|346465011|gb|AEO32350.1| hypothetical protein [Amblyomma maculatum]
Length = 500
Score = 343 bits (881), Expect = 1e-91, Method: Compositional matrix adjust.
Identities = 197/495 (39%), Positives = 284/495 (57%), Gaps = 40/495 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D S HGS +IPTPN+DALA G++LNN Y QP+C+PSRA+LMTG YP H G++ P+
Sbjct: 37 GWADTSLHGSAQIPTPNLDALASTGVLLNNYYVQPLCSPSRAALMTGLYPAHNGIR-MPL 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
GA+ G+PL + LPE+L++LGY T +GKWHLG YTP YRGF++ FG+ NG I
Sbjct: 96 MGAQVAGLPLQFKILPEHLKDLGYETHMVGKWHLGHSSLNYTPTYRGFDTFFGFYNGPID 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY I+ + + L+ + R L YAT F A +I ++ KPLFLY
Sbjct: 156 YYHGIMEQE--GHIGLDFWNGTRALPLEERI---YATTRFQDHANYIIANRNASKPLFLY 210
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LAH A H+ + L+AP E +F ++ D +R++ A MV LD SVG ++ L++ G+L
Sbjct: 211 LAHQAVHSAYEPEFLQAPGENTKKFPFLGDASRKSLAGMVDALDQSVGELVEELEKAGIL 270
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
E++I++F +DNG + T + N G N+P RG+K T+WEGG++ A +WSP+++ +
Sbjct: 271 EDTILVFSTDNGG----HPYTRSDPNRGFNWPLRGIKGTVWEGGIRGTAFIWSPRLELSS 326
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
RVS Q+MHISDWLPTLY+AAGG+ L +DG+ D W S
Sbjct: 327 RVSTQLMHISDWLPTLYSAAGGNVQNLG-RLDGI---------------------DMWQS 364
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L PS R VL+ + + AA+R + K+VLG N + Y + P ++ +
Sbjct: 365 LSTGGPSPRTEVLLEFNNETDRAALRYLNHKIVLGIYSNEVSERY--EVVGGVRPGIDLD 422
Query: 512 AIVE-SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
+ SK + L + N L + DK S R +AT+ C N P + + +LF
Sbjct: 423 VLQRNSKAGRVLARFYGNPQLFLRGFDK--SWRSRATVRCNEN-GPTNFNSSSQ--VFLF 477
Query: 571 NLGNDPCEQNNIASS 585
+L DPCE +N+A S
Sbjct: 478 DLETDPCELHNLADS 492
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 39/68 (57%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
YAT F A +I ++ KPLFLYLAH A H+ + L+AP E +F ++ D +R+
Sbjct: 185 YATTRFQDHANYIIANRNASKPLFLYLAHQAVHSAYEPEFLQAPGENTKKFPFLGDASRK 244
Query: 75 TYAALTKS 82
+ A + +
Sbjct: 245 SLAGMVDA 252
>gi|194919176|ref|XP_001983034.1| GG19815 [Drosophila erecta]
gi|190647645|gb|EDV45033.1| GG19815 [Drosophila erecta]
Length = 565
Score = 343 bits (880), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 206/546 (37%), Positives = 287/546 (52%), Gaps = 46/546 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 38 GFDDVSFRGGREFLTPNIDALAYHGRLLDRFYAPAMCTPSRGALLSGRYPIHTGTQHFVI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + L +PE + GYST +GKWHLGF R EYTP RGF+ HFGY I
Sbjct: 98 SNKEPWALTLNATLMPEIFKGAGYSTNLVGKWHLGFSRPEYTPTRRGFDYHFGYWGAYID 157
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
Y+ + S+ N G+D RRN+ G Y TDL T EA +LI+D ++P
Sbjct: 158 YF-----QRRSKMPVANYSLGYDFRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLAAH N L+AP+E I +F YI DPNRR YAA++ KLD SVG ++S L
Sbjct: 213 LFLMLSHLAAHTANKDDPLQAPEEEIQKFAYIKDPNRRKYAAIISKLDQSVGRIMSTLAA 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP++ + N GSN+P RG KNT WEGGV+V +WS ++
Sbjct: 273 TDQLENSIVIFYSDNGAPSL-----GMFANTGSNFPLRGQKNTPWEGGVRVAGAIWSSKL 327
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + Q ++++DWLPTL AAG + L +DG+D W L
Sbjct: 328 QARGSIFSQPLYVADWLPTLSHAAGIELDS-SLKLDGIDLWPEL---------------- 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S + P +L +D+ R AA++L WK V GT G D + +
Sbjct: 371 ---SGSADAPHVPREILHILDDIWRVAALQLGQWKYVNGTTAAGRYDSLLTYRELDDLDP 427
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNG 565
V + + + LS+ ++ ++ TR+ A + CG P C
Sbjct: 428 RESRYAVTVRNSATSRALSRYDLRRLTQ-QRISITRRLAAVRCGDLQRSCNPLLEEC--- 483
Query: 566 PCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
L+++ DPCEQNN+ S D+ + L ++ R + + + +ADP
Sbjct: 484 ---LYDISTDPCEQNNLVYSERHSDVLAALRRRVR-ELRASASRPGNRASMPEADPALHT 539
Query: 624 DTWSPW 629
W +
Sbjct: 540 CAWESF 545
Score = 82.0 bits (201), Expect = 8e-13, Method: Compositional matrix adjust.
Identities = 41/80 (51%), Positives = 51/80 (63%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI+D ++PLFL L+HLAAH N L+AP+
Sbjct: 176 FRRNMELECRDRGAYVTDLLTAEAERLIKDHADKEQPLFLMLSHLAAHTANKDDPLQAPE 235
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I +F YI DPNRR YAA+
Sbjct: 236 EEIQKFAYIKDPNRRKYAAI 255
>gi|195166525|ref|XP_002024085.1| GL22750 [Drosophila persimilis]
gi|194107440|gb|EDW29483.1| GL22750 [Drosophila persimilis]
Length = 575
Score = 343 bits (879), Expect = 2e-91, Method: Compositional matrix adjust.
Identities = 204/542 (37%), Positives = 296/542 (54%), Gaps = 44/542 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALA++G IL+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 48 GFDDVSFRGGREFLTPNIDALAFHGRILDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L +PE ++ GYST IGKWHLGF R EYTP RGF+ H+GY I
Sbjct: 108 SNEEPWGLTLNATLMPEIFQQAGYSTNLIGKWHLGFSRPEYTPTRRGFDYHYGYWGAYID 167
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
YY S +R L G+D RRN+ G Y TDL T EA ++I E + ++PLFL
Sbjct: 168 YYQR-RSKMPARNYSL-GYDFRRNMELECRDRGVYVTDLLTNEAERVIREREGQEQPLFL 225
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+HLA H N L+AP+E I +F YI DPNRR YAAMV KLD SVG ++SAL G
Sbjct: 226 VLSHLATHTANEDDPLQAPEEEIRKFAYIKDPNRRKYAAMVSKLDQSVGRIVSALNSTGQ 285
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS Q+Q
Sbjct: 286 LENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAIWSAQLQAR 340
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ Q ++++DWLP+L AAG LP ++ +DG+D W
Sbjct: 341 GNIFTQPIYVADWLPSLAHAAG---IGLPHTLE-------------------LDGIDLWP 378
Query: 451 SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
L ++ P +L +D++ R + ++ WK + GT G D + +
Sbjct: 379 QLAGPVDAPHVPREILHILDDEWRVTSFQMGHWKYINGTTSAGQYDQVLTYRELDDLDPR 438
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP--SPCTNGP 566
+V + + + L+++ ++ ++ + R+ A + CG P C
Sbjct: 439 ESRYVVTVRNSPTSRALARHDIRRMTQ-QRVMTLRRSAAVRCGDLQRGCNPLVEEC---- 493
Query: 567 CYLFNLGNDPCEQNNIASS--RPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
++++ +DPCE+NN+ S ++ + L + ++ R + Q + ++DP
Sbjct: 494 --VYDIWDDPCEKNNLVYSTRHSEVLAALRQRVQMLREGASAPGNRQ-GIRRSDPTLHKC 550
Query: 625 TW 626
TW
Sbjct: 551 TW 552
Score = 77.4 bits (189), Expect = 2e-11, Method: Compositional matrix adjust.
Identities = 39/81 (48%), Positives = 50/81 (61%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I E + ++PLFL L+HLA H N L+AP+
Sbjct: 186 FRRNMELECRDRGVYVTDLLTNEAERVIREREGQEQPLFLVLSHLATHTANEDDPLQAPE 245
Query: 60 ETINQFQYITDPNRRTYAALT 80
E I +F YI DPNRR YAA+
Sbjct: 246 EEIRKFAYIKDPNRRKYAAMV 266
>gi|194748096|ref|XP_001956485.1| GF25237 [Drosophila ananassae]
gi|190623767|gb|EDV39291.1| GF25237 [Drosophila ananassae]
Length = 570
Score = 342 bits (878), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 213/547 (38%), Positives = 286/547 (52%), Gaps = 52/547 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+SF G E TPNIDALAY+G +L+ +YA +CTPSR +L++G+YPIHTG Q I
Sbjct: 41 GFDDVSFRGGREFITPNIDALAYHGRLLDRLYAPAMCTPSRGALLSGRYPIHTGTQHFVI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP + +PE + GYST +GKWHLGF R EYTP +RGF+ H+GY I
Sbjct: 101 SNEEPWALDSNATLMPEIFQRAGYSTNLVGKWHLGFSRPEYTPTHRGFDYHYGYWGAYID 160
Query: 212 YYDHILSDQYSRTVELN---GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKP 267
YY + S+ N G+D RRN+ G Y TDL T EA +LI++Q DKP
Sbjct: 161 YY-----QRRSKMPVANYSLGYDFRRNMELECRDRGTYVTDLLTTEAERLIKEQAGKDKP 215
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL L+HLA H N L+AP+E I +F YI DPNRR YAAMV KLD SVG +IS L
Sbjct: 216 LFLMLSHLATHTANEDDPLQAPEEEIRKFSYIKDPNRRKYAAMVSKLDQSVGRIISTLAS 275
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
LENSI+IF SDNGAP+V + N GSN+P RG KNT WEGGV+V +WS ++
Sbjct: 276 TDQLENSIVIFYSDNGAPSV-----GMFANTGSNWPLRGQKNTPWEGGVRVAGAVWSSKL 330
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ + Q +++ DWLPTL AA D P + GL DG+D
Sbjct: 331 EARGSIFTQPIYVGDWLPTLAHAA--DIELEPKILAGL------------------DGID 370
Query: 448 QWSSL--LLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
W L + P +L ++D+ + ++ + WK V GT G D + +
Sbjct: 371 LWPQLSGAADAPHVPREILHSLDDVWKVGSLLMGHWKYVNGTTSAGQYDSVLTYREFDDI 430
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR--STRQQATIHCGANPAPMTP--SP 561
V + + + L++ L + + + R R+ A +HCG P
Sbjct: 431 DPRESRYAVTIRNSPASRALTR---LDLRRLTQQRIAGLRRSAAVHCGDFQRSCNPLLED 487
Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR--RTLVPQSHEQPDLVQADP 619
C LF++ DPCEQNN+A S S L L K R R + + + ADP
Sbjct: 488 C------LFDISADPCEQNNLAYSERH-SDVLAALRKRFRELRASASRPGNRASMPSADP 540
Query: 620 KRFNDTW 626
TW
Sbjct: 541 TWHTCTW 547
Score = 83.6 bits (205), Expect = 3e-13, Method: Compositional matrix adjust.
Identities = 42/81 (51%), Positives = 51/81 (62%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA +LI++Q DKPLFL L+HLA H N L+AP+
Sbjct: 179 FRRNMELECRDRGTYVTDLLTTEAERLIKEQAGKDKPLFLMLSHLATHTANEDDPLQAPE 238
Query: 60 ETINQFQYITDPNRRTYAALT 80
E I +F YI DPNRR YAA+
Sbjct: 239 EEIRKFSYIKDPNRRKYAAMV 259
>gi|195021983|ref|XP_001985495.1| GH14468 [Drosophila grimshawi]
gi|193898977|gb|EDV97843.1| GH14468 [Drosophila grimshawi]
Length = 619
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 214/548 (39%), Positives = 293/548 (53%), Gaps = 56/548 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+S G+ E TPNIDALAY+G +L+ +YA +CTPSR +L++GKYPIHTG Q I
Sbjct: 44 GFDDISIRGAREFLTPNIDALAYHGRLLDRLYAPSMCTPSRGALLSGKYPIHTGTQHYVI 103
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L +PE RE GYST +GKWHLGF R EYTP +RGF++H+GY I
Sbjct: 104 GNEEPWGLALNTTLMPEIFREAGYSTNLVGKWHLGFSRPEYTPTHRGFDNHYGYWGAYID 163
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFL 270
YY Q G+D RRN+ G Y TDL T EA ++I +QP ++PLFL
Sbjct: 164 YYQR--RSQMPLGNYSVGYDFRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFL 221
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+HLA H GN K L+AP+E + +F +I DPNRR YAAMV KLD+SVG VI+AL R
Sbjct: 222 ILSHLAPHTGNTNKPLQAPEEELRKFTHIKDPNRRLYAAMVSKLDESVGRVITALARTDQ 281
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
LENSI+IF SDNGAP+V + N GSN+P +G KN+ WEGG++V +WSP ++
Sbjct: 282 LENSIVIFYSDNGAPSV-----GMFSNTGSNWPLKGQKNSPWEGGLRVAGAIWSPLLKAR 336
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ Q +++ D+LPTL AAG + L++DG+D W L
Sbjct: 337 GNLFTQPIYVGDFLPTLAHAAGIELDS-SLHLDGIDLWPQL------------------- 376
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
S + P +L +D+ R ++ +K V GT G D + +
Sbjct: 377 SGPKDAPHVPREILHQLDDVWRVQSLFQGQYKYVNGTTLAGQFDQVLTYRELDDLDPRAS 436
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC--- 567
+VE + + + L++ D R T QA I+ + A + S G C
Sbjct: 437 RYVVEVRNSPASRALAR--------YDLQRIT--QARINTISRSAAVICSDMQRG-CNAL 485
Query: 568 ---YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR------RTLVPQSHEQPDLVQAD 618
L++L DPCEQNN+A S+ Q E+L R R + +P AD
Sbjct: 486 LEECLYDLSTDPCEQNNLAYSK-----QHSEILTSMRQRVQQLRAGAMAPNNRPSERDAD 540
Query: 619 PKRFNDTW 626
P TW
Sbjct: 541 PSLHECTW 548
Score = 84.3 bits (207), Expect = 2e-13, Method: Compositional matrix adjust.
Identities = 40/81 (49%), Positives = 53/81 (65%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I +QP ++PLFL L+HLA H GN K L+AP+
Sbjct: 182 FRRNMQVECTDRGVYVTDLLTNEAERIIREQPAKEQPLFLILSHLAPHTGNTNKPLQAPE 241
Query: 60 ETINQFQYITDPNRRTYAALT 80
E + +F +I DPNRR YAA+
Sbjct: 242 EELRKFTHIKDPNRRLYAAMV 262
>gi|241156195|ref|XP_002407716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215494207|gb|EEC03848.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 548
Score = 342 bits (877), Expect = 3e-91, Method: Compositional matrix adjust.
Identities = 203/544 (37%), Positives = 288/544 (52%), Gaps = 49/544 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS++IPTPNID LA +GI L+N Y QP+CTPSRA+LMTG YPI TGMQ I
Sbjct: 48 GWDDVSFHGSSQIPTPNIDVLAADGITLHNYYVQPMCTPSRAALMTGLYPIRTGMQHWVI 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGK--WHLGFFRREYTPLYRGFESHFGYLNGV 209
EP G+PL + +PE+L++LGYST +GK + L F L + +
Sbjct: 108 RSPEPWGLPLELKLMPEHLKDLGYSTHLVGKVLFDLKKFIVSVNRLCINESTEVCH---- 163
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
++ + Y G D R + G+YAT LFT A+ +IE KPLF
Sbjct: 164 -TFVSAVTLCIYFVYKSHAGLDFRNGEEPFHNDTGQYATTLFTDRAISIIEQHNQTKPLF 222
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL+HLA H + L+AP E + +F YI + +R YA MV LD SVG V+ AL R G
Sbjct: 223 LYLSHLAPHGATHDEPLQAPDENVEKFDYIGEEDRTIYAGMVDALDVSVGRVVEALSRAG 282
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
MLENSI+ F SDNGA +R N G N+P RG+K ++WEGGV+VP +WSP +++
Sbjct: 283 MLENSIVAFSSDNGAVPFGFRS-----NRGFNWPLRGIKASVWEGGVRVPGFVWSPLLRK 337
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ R+S QMMHI+DWLPTLY AAGG+T L +DG + W
Sbjct: 338 SARLSTQMMHITDWLPTLYAAAGGNTKNL----------------------GQLDGHNMW 375
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+L S R +L+NID +A +R KL++ NGT D + VP
Sbjct: 376 EALSRGWRSPRTEILLNIDPITGSAGLRFGKHKLLI-PGVNGTTDAH--------VPTTG 426
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKM---RSTRQQATIHCGANPAPMTPSPCTNGP 566
+ E Q L+ + N+D + R A + CG PA + +
Sbjct: 427 YPRPTEDMDTMMQQSLAAKVLRRFYNVDGIWFRRGWENSAEVDCG--PAEWQSNFVSGRV 484
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
LF++ DPCE N+A++R + L+ + ++P +++ D ++ P++ W
Sbjct: 485 YNLFDIERDPCEMINLAATRRGTVRMMMRKLQSYAAAMIPPRNKEED-PRSFPEKHGGVW 543
Query: 627 SPWI 630
+ W+
Sbjct: 544 TTWL 547
Score = 70.1 bits (170), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 31/75 (41%), Positives = 46/75 (61%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+YAT LFT A+ +IE KPLFLYL+HLA H + L+AP E + +F YI + +
Sbjct: 197 GQYATTLFTDRAISIIEQHNQTKPLFLYLSHLAPHGATHDEPLQAPDENVEKFDYIGEED 256
Query: 73 RRTYAALTKSTTLTL 87
R YA + + +++
Sbjct: 257 RTIYAGMVDALDVSV 271
>gi|390356459|ref|XP_003728793.1| PREDICTED: arylsulfatase I-like [Strongylocentrotus purpuratus]
Length = 613
Score = 341 bits (874), Expect = 9e-91, Method: Compositional matrix adjust.
Identities = 216/612 (35%), Positives = 323/612 (52%), Gaps = 91/612 (14%)
Query: 73 RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
+R Y + L GWND+SFHGS++IPTP+IDALA G+IL N Y P+CTP+R
Sbjct: 20 QRVYTDQPPNIIFILADDLGWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTR 79
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK---------- 182
+++MTGK+PIHTG++ I + P G+ L E+ +P+YLRELGY T +GK
Sbjct: 80 SAIMTGKHPIHTGLEHGVIGVSHPYGLGLEEKLMPQYLRELGYRTHMVGKVSLEHGVIGV 139
Query: 183 ---WHLGF-------FRRE------------YTPLYRGFESHFGYLNGVISYYDH-ILSD 219
+ LG + RE TP +RGFES +GY G+ YY H I SD
Sbjct: 140 SHPYGLGLEEKLMHQYLRELGYRTHMVGKESLTPSHRGFESFYGYYAGMGDYYTHEITSD 199
Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
+ G D + S G+Y+T++FT+ ++I +PL++YLAH A H+
Sbjct: 200 G-----NMTGFDFHMDGSVHKPVFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHS 254
Query: 280 GN-AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
N G+ L+AP E +F IT NRR YAAMV LDDS+G + +L+ + N++I+F
Sbjct: 255 ANYDGQRLQAPHEYYKRFPNITHENRRKYAAMVAALDDSIGNITQSLKESSLYNNTLIVF 314
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMM 398
+DNG P + N+ SN+P RG+K+T WEGG++ LW I++ R+S MM
Sbjct: 315 TTDNGGPANGFD-----FNYASNWPLRGMKHTTWEGGLRGVGFLWGALIEKPGRMSDGMM 369
Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
H+ DW+PTLY AGG+TS L ++DG+D W L S
Sbjct: 370 HVCDWVPTLYGLAGGNTSTL----------------------QHLDGIDVWPMLSRGKQS 407
Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY------GQTRSNKV-----PL 507
R +L ID +A+R+ +KLV G + G +G+Y Q+ ++ PL
Sbjct: 408 PRQEILHTIDPLLDLSAIRIGDYKLVQGQRGTGR-NGWYPPEGVGKQSPRQEILHTIDPL 466
Query: 508 LNFNAIV--ESKTYQSLQQLSQNIFLP-----ISNIDKMRSTRQQATIHCGANPAPMTPS 560
L+ +AI + K Q + +N + P ++++D + + C PA + +
Sbjct: 467 LDLSAIRIGDYKLVQGQRGTGRNGWYPPEGVSLADLDS-KPVPNAFVVSCPYKPANASTN 525
Query: 561 PCT--NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
C PC LFN+ +DPCE NNIA+ DI L ++ +R T+VP +++PD ++
Sbjct: 526 -CNPYKKPC-LFNIRHDPCEFNNIANWNQDIVELLLTRIEEYRATMVPIRNKEPD-PRSY 582
Query: 619 PKRFNDTWSPWI 630
PK ++ W PW+
Sbjct: 583 PKYHDNCWVPWV 594
Score = 60.8 bits (146), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 45/73 (61%), Gaps = 1/73 (1%)
Query: 11 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN-AGKHLEAPQETINQFQYIT 69
G+Y+T++FT+ ++I +PL++YLAH A H+ N G+ L+AP E +F IT
Sbjct: 217 VFGQYSTEIFTERTQEIILKHNPKEPLYIYLAHQAVHSANYDGQRLQAPHEYYKRFPNIT 276
Query: 70 DPNRRTYAALTKS 82
NRR YAA+ +
Sbjct: 277 HENRRKYAAMVAA 289
>gi|332016484|gb|EGI57377.1| Arylsulfatase B [Acromyrmex echinatior]
Length = 438
Score = 340 bits (872), Expect = 1e-90, Method: Compositional matrix adjust.
Identities = 186/470 (39%), Positives = 281/470 (59%), Gaps = 50/470 (10%)
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
PVCTPSR + ++G++P+ TGMQG P+ AEPRG+ L + LPEYLR+LGY+T +GKWH+
Sbjct: 2 PVCTPSRVAFLSGRHPLRTGMQGYPLKAAEPRGLHLNDTLLPEYLRKLGYTTHLLGKWHV 61
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTV 243
G+ R Y P RGF++ GY NGVI Y++H + + E G+D+ R + +
Sbjct: 62 GYLTRNYVPTRRGFDTFLGYFNGVIQYFNHTIEEN-----EQVGYDLHRIVGDNHTVEYR 116
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITD 301
+Y T+L T+EA +I +KP++L LAHLA+HA NA + +E +ET YI D
Sbjct: 117 YDYMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQD 176
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
NRR +A++V LD S+G V+ AL +K ML+N+II+F+SDNGA T E Y+N+GSN
Sbjct: 177 VNRRKFASVVTTLDKSIGRVVDALNKKDMLKNTIILFISDNGAQT----EGPLYQNFGSN 232
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
YP RG+K +L+EGGV+ A ++SP I+ + +VS Q+ HI+DWLPTLY+AAGG++S L
Sbjct: 233 YPLRGLKFSLYEGGVRGAACIYSPLIENSSKVSTQLFHITDWLPTLYSAAGGNSSDL-YQ 291
Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
+DG+DQ WS++ S+R SV+INI++ +AA+ + +
Sbjct: 292 LDGIDQ---------------------WSTIKTQEASKRKSVVINIEDVDYSAAL-MGRY 329
Query: 482 KLVLGTQENGTMDGYYGQTRSN--KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKM 539
KL+ Q+ +Y + N P N I+ES ++ +S +I +K+
Sbjct: 330 KLI---QDKSDFQKHYTNYKGNDPSYPKYNVTNILESPAASAIANISNHIVTA----NKI 382
Query: 540 RSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
R++AT+ C + S C + C LF++ DPCE +I+S P++
Sbjct: 383 EELRKEATVVC----ENLMTSDCESRTC-LFDIYEDPCEVTDISSQYPEV 427
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 44/71 (61%), Gaps = 2/71 (2%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
+Y T+L T+EA +I +KP++L LAHLA+HA NA + +E +ET YI D
Sbjct: 118 DYMTNLITEEAENIISSHNTEKPMYLQLAHLASHASNAEEAMEVYDWEETNATLGYIQDV 177
Query: 72 NRRTYAALTKS 82
NRR +A++ +
Sbjct: 178 NRRKFASVVTT 188
>gi|443705024|gb|ELU01769.1| hypothetical protein CAPTEDRAFT_23096, partial [Capitella teleta]
Length = 354
Score = 338 bits (868), Expect = 4e-90, Method: Compositional matrix adjust.
Identities = 171/356 (48%), Positives = 227/356 (63%), Gaps = 17/356 (4%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GW+D+ FHGS +IPTPN+DALA +GIIL+N Y+QP+CTPSR SL+TGK+PI G+Q
Sbjct: 11 FGWDDVGFHGSRKIPTPNLDALASDGIILSNHYSQPLCTPSRGSLLTGKHPIQIGLQRGV 70
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I+ A+P G+ L E+ LPEYL+ LGY + +GKWHLGFF EYTP+ RGF+SH+G+
Sbjct: 71 IYSAQPFGLGLKEKLLPEYLKTLGYKSHMVGKWHLGFFADEYTPMRRGFDSHYGFYGASE 130
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y HI + G D N G Y+T LFT +A QL+ + +P+FL
Sbjct: 131 DYMTHIGG--------MGGLDFWLNGQPDRSGQGHYSTTLFTTKAEQLLAEHNQTEPMFL 182
Query: 271 YLAHLAAHAGNA---GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
Y +H A H + G L AP + ++F +I D + A M+ LD+SVG + +L
Sbjct: 183 YFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIADVECQHMAGMISALDESVGNLTKSLHA 242
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
GMLEN+IIIF SDNGAP N SNYP RG K+++WEGG +VPA +WSP +
Sbjct: 243 NGMLENTIIIFSSDNGAP------HGNTDVCSSNYPLRGAKSSVWEGGTRVPAFVWSPLL 296
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
+++ VS QMMHISDWLPTL AAG D S LP ++ G+ QW +L N PS RNS +
Sbjct: 297 KKSGYVSKQMMHISDWLPTLLKAAGYDMSALPGDLYGVSQWQALQENGPSARNSML 352
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 24/73 (32%), Positives = 39/73 (53%), Gaps = 3/73 (4%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA---GKHLEAPQETINQFQYIT 69
G Y+T LFT +A QL+ + +P+FLY +H A H + G L AP + ++F +I
Sbjct: 156 GHYSTTLFTTKAEQLLAEHNQTEPMFLYFSHQAVHTSHPKEDGHRLFAPSQYTDKFPFIA 215
Query: 70 DPNRRTYAALTKS 82
D + A + +
Sbjct: 216 DVECQHMAGMISA 228
>gi|390360370|ref|XP_791935.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 374
Score = 338 bits (866), Expect = 6e-90, Method: Compositional matrix adjust.
Identities = 171/344 (49%), Positives = 221/344 (64%), Gaps = 12/344 (3%)
Query: 89 IVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
I GW+D+S HGS++I TPNID LA G+ L N Y P+CTP+R+++MTGK+PIHTGMQ
Sbjct: 40 IEQGWDDVSLHGSSQILTPNIDTLAQEGVTLTNYYVSPICTPTRSAIMTGKHPIHTGMQH 99
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
I EP G+ L E+ + ++L+ LGYST A+GKWHLG+F +Y P RGF+S FGY NG
Sbjct: 100 DTIGADEPWGLGLDEKTMAQHLKSLGYSTHAVGKWHLGYFAEDYIPTRRGFDSFFGYYNG 159
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H ++ G+D+ +N W G+Y+T++FT+EA Q+I+ +PL
Sbjct: 160 RGDYYTHEDTEG-----GFGGYDLHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPL 214
Query: 269 FLYLAHLAAHAGNAGKHL-EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FLYLAH A HAG GK L EAP + F I RR +AAMV LDDSVG + LQ
Sbjct: 215 FLYLAHQAVHAGVYGKDLVEAPHKYYQMFPNIKTEGRRMFAAMVAGLDDSVGNISQTLQD 274
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ NSII+F +DNG PT Y N SN+P RG K+TLWEGGV+ A + SP I
Sbjct: 275 AGLYNNSIIVFTTDNGGPTNGYD-----GNHASNWPLRGCKHTLWEGGVRGTAFVNSPLI 329
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
++ R S +MMH+ DWLPTLY AGGD L N+DG DQW +L
Sbjct: 330 EKPRRFSDRMMHVCDWLPTLYGVAGGDPKEL-TNLDGFDQWDAL 372
Score = 69.7 bits (169), Expect = 4e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHL-EAPQ 59
+ +N W G+Y+T++FT+EA Q+I+ +PLFLYLAH A HAG GK L EAP
Sbjct: 178 LHKNGEVHWPDFGQYSTEIFTEEAQQIIKTHNASQPLFLYLAHQAVHAGVYGKDLVEAPH 237
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ F I RR +AA+
Sbjct: 238 KYYQMFPNIKTEGRRMFAAMV 258
>gi|391325967|ref|XP_003737498.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 513
Score = 337 bits (865), Expect = 9e-90, Method: Compositional matrix adjust.
Identities = 204/554 (36%), Positives = 297/554 (53%), Gaps = 78/554 (14%)
Query: 80 TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMT 137
TK+ + L++V GW+D+ HGS++IPTPNID LA G++L+N Y QP+CTPSRASLMT
Sbjct: 26 TKAPNVVLIVVDDLGWDDIGLHGSSQIPTPNIDKLAEEGVVLDNYYTQPICTPSRASLMT 85
Query: 138 GKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
GKYP+ G+Q I A P G+P + +P+YL + Y +GKWHLG R E+ P R
Sbjct: 86 GKYPVRLGLQHDVISAATPFGLPSNFKIMPQYLHDKNYDCHIVGKWHLGHSRSEFLPTRR 145
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
GF+ HFGY G +Y H +D L G D+ N A + G+Y+TD++T +
Sbjct: 146 GFKDHFGYRLGSSDHYSHYGADDSDVPGSLFYGLDLWHNEVPAKEFNGKYSTDIYTHRST 205
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLD 315
++ +PLFLYLA+ A HAGN + L+APQ +++F I + RR YAAMV +D
Sbjct: 206 DILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVDRFSSSIRNDRRRRYAAMVSAVD 265
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
++G V+ A++ G N+++ F +DNG P +N R+ GSNYP R K TLWEGG
Sbjct: 266 TAIGNVMGAIRANGFAGNTLVFFTNDNGGPI-----NANDRSPGSNYPLRAGKFTLWEGG 320
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
V+ I W+PQ+ + P + HI D LPT+ +AAG S +P +DG+ W SL
Sbjct: 321 VRGTGIFWAPQVLK-PGKFGGLSHIVDVLPTILSAAG--MSSVP-ELDGVSLWKSL---- 372
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+R+ N R +L+NID TAA RL +KL+ + NG MDG
Sbjct: 373 SEKRSEN----------------PRTELLLNIDPVDHTAAYRLGDFKLI---KSNGLMDG 413
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA 555
++ + P +A S+T + +L +F+ +D+
Sbjct: 414 WF------RTP----DAANHSRTIKC--KLDDPVFI-CRGVDR----------------- 443
Query: 556 PMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV 615
PC LFN+ DPCEQ+++ + PDI ++ E L +R + P + +
Sbjct: 444 -----PC------LFNIAEDPCEQDDVYADNPDIVKEIMERLHRYRHEMHPVEDKNVTRL 492
Query: 616 QADPKRFNDTWSPW 629
ADP RF+ W PW
Sbjct: 493 -ADPARFDYIWMPW 505
Score = 61.6 bits (148), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 48/81 (59%), Gaps = 1/81 (1%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDP 71
G+Y+TD++T + ++ +PLFLYLA+ A HAGN + L+APQ +++F I +
Sbjct: 193 GKYSTDIYTHRSTDILRMHNKSRPLFLYLAYQAVHAGNPDQALQAPQSIVDRFSSSIRND 252
Query: 72 NRRTYAALTKSTTLTLLIVYG 92
RR YAA+ + + V G
Sbjct: 253 RRRRYAAMVSAVDTAIGNVMG 273
>gi|242008416|ref|XP_002425002.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
gi|212508631|gb|EEB12264.1| arylsulfatase B precursor, putative [Pediculus humanus corporis]
Length = 532
Score = 333 bits (855), Expect = 1e-88, Method: Compositional matrix adjust.
Identities = 203/533 (38%), Positives = 307/533 (57%), Gaps = 51/533 (9%)
Query: 77 AALTKSTT------LTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCT 129
A + KST +T+LI GW D FHGS++I TPN+DALAY+G+ILN Y P CT
Sbjct: 17 AVVCKSTAQQPPHIITILIDDLGWTDTGFHGSDQIKTPNMDALAYSGMILNRHYVLPSCT 76
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
PSR++L+TG YPI TGMQG P+ G + R +PL+ + PE+L+ LGY T +GKWHLG+
Sbjct: 77 PSRSALLTGLYPIRTGMQGMPLKGGDVRNLPLSFKLKPEFLKNLGYRTHLVGKWHLGYRT 136
Query: 190 REYTPLYRGFESHFGYLNGVISYYD--HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
+ P RGF+S FGY NG + Y+ H + + G+D+ RN YA
Sbjct: 137 INHLPNQRGFDSFFGYYNGYVDYFKFGHNQTVAGEKIEYFYGYDLHRNGEIYQTDKDTYA 196
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-INQ-FQYITDPNRR 305
T LFT+EA ++I++ +PL+LY +HLA H G+ +E P++ +N+ + +I RR
Sbjct: 197 TRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGHIKHYGRR 256
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
+A +++LD SVG V+ AL+ K ML+NSII+ MSDNG TV NW SN+P
Sbjct: 257 AFAGCLEELDKSVGEVMEALKEKNMLDNSIILIMSDNGGHTVSVDLPP---NWSSNWPLG 313
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
G K TL+EGGV+ A++WSP + + ++ +HI+DWLPTLY+AAGG+ L + DG+
Sbjct: 314 GTKFTLFEGGVRSVALIWSPLLPKG-VINDDFIHITDWLPTLYSAAGGNPKELGI-FDGI 371
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS-WKLV 484
DQW +L +RNS + R ++L+NID+K + V +++ WK++
Sbjct: 372 DQWEAL------KRNSAV---------------ARENILLNIDDKTGSEGVIIENRWKVL 410
Query: 485 LGTQEN-GTMDGYYGQTRS-NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
G + G+ D ++G+ + + P + N I+ S+T Q + L + N + +
Sbjct: 411 KGGYKTYGSYDFHWGKPENIHPNPKYDPNDIINSQTGQIVSNLKH--YKNSLNHSSIETL 468
Query: 543 RQQATIHCG--ANPAPMT------PSPCTNGPCY-LFNLGNDPCEQNNIASSR 586
R+++T+ C N PM PSP + LF++ ND CE N + ++
Sbjct: 469 RKKSTVTCQPLVNNIPMNDYGFIEPSPEVRCKDFCLFDIINDQCEVNPVENNE 521
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 42/66 (63%), Gaps = 2/66 (3%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET-INQ-FQYITDPN 72
YAT LFT+EA ++I++ +PL+LY +HLA H G+ +E P++ +N+ + +I
Sbjct: 195 YATRLFTREAEKIIKNHNESEPLYLYFSHLATHTGDDDIGMEVPEDADVNKTYGHIKHYG 254
Query: 73 RRTYAA 78
RR +A
Sbjct: 255 RRAFAG 260
>gi|156402612|ref|XP_001639684.1| predicted protein [Nematostella vectensis]
gi|156226814|gb|EDO47621.1| predicted protein [Nematostella vectensis]
Length = 380
Score = 333 bits (854), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 167/359 (46%), Positives = 230/359 (64%), Gaps = 11/359 (3%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTPN+D LA G+ILNN Y P+CTP+RASLMTGKYPIH GMQ I
Sbjct: 16 GWDDVSFHGSPQIPTPNLDYLATRGVILNNYYVSPICTPTRASLMTGKYPIHLGMQHFVI 75
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ A+P G+PL E LP+YL+ GY T IGKWHLGFF +EYTP YRGF+S +G +
Sbjct: 76 YAAQPYGLPLGEITLPQYLQIQGYKTAGIGKWHLGFFAKEYTPTYRGFDSFYGMWSAKAD 135
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H S G DMR N+ +YAT++FT+EA+++IE+ +PLFLY
Sbjct: 136 YWNHT-----SFENGFWGTDMRNNMEPVTTDKDKYATEVFTREALKVIENHNKSEPLFLY 190
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDP-NRRTYAAMVKKLDDSVGTVISALQRKGM 330
+AH A H+ N L+AP++ + +F + D R+ YAAMV +DDS+G V AL++ M
Sbjct: 191 IAHQAPHSANPHDPLQAPEDKVKKFSGVIDKIERQQYAAMVTCVDDSIGEVFRALEKNRM 250
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L NS+I+F +DNG + RN GSNYP RG K+ +WEGGV+ A ++S I+
Sbjct: 251 LNNSVILFTTDNGGAPHGFN-----RNQGSNYPLRGGKDMMWEGGVRGTAFIYSDLIKHK 305
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS ++ ++DW+PTLY AGG L ++DG + W ++ PS R + +D W
Sbjct: 306 GRVSTDLIDVTDWVPTLYYLAGGTPGYLEPHMDGKNVWETISNKEPSPRKEVLHNIDPW 364
Score = 65.5 bits (158), Expect = 9e-08, Method: Compositional matrix adjust.
Identities = 31/81 (38%), Positives = 51/81 (62%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR N+ +YAT++FT+EA+++IE+ +PLFLY+AH A H+ N L+AP++
Sbjct: 151 MRNNMEPVTTDKDKYATEVFTREALKVIENHNKSEPLFLYIAHQAPHSANPHDPLQAPED 210
Query: 61 TINQFQYITDP-NRRTYAALT 80
+ +F + D R+ YAA+
Sbjct: 211 KVKKFSGVIDKIERQQYAAMV 231
>gi|241676246|ref|XP_002411524.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215504222|gb|EEC13716.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 490
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 190/498 (38%), Positives = 269/498 (54%), Gaps = 55/498 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+SFHGS +IPTPNID LA +G+ILNN Y P+CTPSRA+LMTG YPI GMQ I
Sbjct: 37 GWGDVSFHGSTQIPTPNIDVLAGDGVILNNYYVLPLCTPSRAALMTGLYPIRNGMQLTSI 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A P G+PL + LP++ ++LGY IGKWHLGFF+ Y P+ RGF++ FG+ G
Sbjct: 97 QAAGPWGLPLENKILPQHFKDLGYDVNMIGKWHLGFFKTPYVPIKRGFDTFFGFYTGSND 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL--- 268
YY+H + + + + S +T+ + L+ + + + PL
Sbjct: 157 YYNHTSGSSHRKILAVTS-------SVQVNTLEKGRLSLWGPRELSVCFLHQIYSPLNFY 209
Query: 269 ----FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
F Y++H A H + +AP + +F YI +PNR YA M+ LD+S+G V+ A
Sbjct: 210 LQPFFCYISHQAVHHALNAEMFQAPARNVLKFSYIGEPNRTIYAGMLDALDESMGRVVEA 269
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L G+LE++II+F SDNG N G N+P RG K TLWEG V+VPA +W+
Sbjct: 270 LSSAGILEDTIIVFSSDNGGSPYGLES-----NRGYNWPLRGAKFTLWEGSVRVPAFVWN 324
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
P++ + RVS Q+MHISDWLPTLYTAAGG+ S L IDG+
Sbjct: 325 PKLLKKSRVSNQLMHISDWLPTLYTAAGGNASALG-GIDGV------------------- 364
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY---GQTR 501
D W SL + S+R +L+NID + +R +KLV+G +G +D +Y G TR
Sbjct: 365 --DMWHSLTHGSASKRKEILLNIDSLYNISGLRYKQYKLVVGGGFDGELDDHYYFPGGTR 422
Query: 502 -SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
+N + L N ++ + L + + P+ + + + C N +T +
Sbjct: 423 PTNNIDQLR-NQSTMARVLKELYEKQDRSWSPL-------AWSKNVAVDCRRN--RLTEN 472
Query: 561 PCTNGPCYLFNLGNDPCE 578
P YLFN+ DPCE
Sbjct: 473 FVPMQPPYLFNIEEDPCE 490
Score = 42.4 bits (98), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 16/45 (35%), Positives = 25/45 (55%)
Query: 35 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
+P F Y++H A H + +AP + +F YI +PNR YA +
Sbjct: 211 QPFFCYISHQAVHHALNAEMFQAPARNVLKFSYIGEPNRTIYAGM 255
>gi|391330458|ref|XP_003739677.1| PREDICTED: arylsulfatase J-like [Metaseiulus occidentalis]
Length = 633
Score = 333 bits (853), Expect = 2e-88, Method: Compositional matrix adjust.
Identities = 208/590 (35%), Positives = 306/590 (51%), Gaps = 74/590 (12%)
Query: 81 KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
++ + L IV GW+D+SFH + +IPTPNIDA+ + ++LN+ Y Q CTPSR +L+TG
Sbjct: 55 RAPDIILFIVDDMGWSDVSFHANGQIPTPNIDAMCSDAVLLNSHYVQASCTPSRGALLTG 114
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
KYPI G+Q I + L R LP+YLR+LGY+T +GKWHLGF+ +Y P RG
Sbjct: 115 KYPIKIGLQEYVIQPGRQEALHLKHRLLPQYLRDLGYATHLVGKWHLGFYAEDYLPENRG 174
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
F+S +G+ NG +YY+H SD R G+D N + D G+YATD+ T+ L
Sbjct: 175 FDSFYGFYNGAGTYYNHSASDADGRI----GYDWHLNKESDPDAHGKYATDIITQRVKHL 230
Query: 259 IEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLD 315
I+ + +KP+FL ++H+A H G N + E ++ I + +I +R YA M++ LD
Sbjct: 231 IQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIEDPEIAHIMVESRTKYAGMIRALD 290
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
D+ G + AL+ ML+NS+I+F SDNG P++ +Y+ SN+P RG K ++WEGG
Sbjct: 291 DAFGQTVVALKEAEMLQNSVILFTSDNGGPSI---VDESYKTGASNWPLRGQKFSMWEGG 347
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
V+V + LWSP ++ ++ + HI+DWLPT Y AGG+ LP++IDG D W S+ NT
Sbjct: 348 VRVSSFLWSPLLKSQRYINKHLYHITDWLPTFYKLAGGELRDLPIDIDGHDIWDSIDRNT 407
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
S R + + D + N RR + + + DSW G GTMD
Sbjct: 408 DSPRKNLLINYDTGGT--ENFAVRRGNFKLMHGKPPPEVDPWYDSWYATAG----GTMDD 461
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST-RQQATIHCGANP 554
+ ++ K +S TY L + + ++ R R+ + C N
Sbjct: 462 FMTLQKAMK----------KSVTYDILSRR--------PDFEERRDDWREAGKVKCKINR 503
Query: 555 APMTPSP------------------------CTNG---PCYLFNLGNDPCEQNNIASS-R 586
M SP CTN PC LF+L DPCE + + R
Sbjct: 504 G-MADSPRGLDYRTYTVDDMLKASVIKNDEICTNSNEQPC-LFDLLEDPCELRQYSGADR 561
Query: 587 PDISSQLYEL----LKYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPWI 630
I +L E+ L+ + P S E D ADP++FN TW P+I
Sbjct: 562 EQIIRELSEIAEQYLRLKEAAVAPWSSRMEHTD-ADADPQKFNGTWMPYI 610
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 43/76 (56%), Gaps = 3/76 (3%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETIN--QFQ 66
D G+YATD+ T+ LI+ + +KP+FL ++H+A H G N + E ++ I +
Sbjct: 213 DAHGKYATDIITQRVKHLIQSRDPEKPMFLMISHMAPHGGDNEDELFEVDRQWIEDPEIA 272
Query: 67 YITDPNRRTYAALTKS 82
+I +R YA + ++
Sbjct: 273 HIMVESRTKYAGMIRA 288
>gi|195379278|ref|XP_002048407.1| GJ13952 [Drosophila virilis]
gi|194155565|gb|EDW70749.1| GJ13952 [Drosophila virilis]
Length = 574
Score = 332 bits (850), Expect = 5e-88, Method: Compositional matrix adjust.
Identities = 201/567 (35%), Positives = 296/567 (52%), Gaps = 42/567 (7%)
Query: 67 YITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA 124
++ + TY K + +++ G++D+S G E TPNIDAL ++G +L+ +YA
Sbjct: 19 WVKASEKNTYRTGNKLPNVIIIMADDMGFDDISLRGGREFLTPNIDALGFHGRLLDRLYA 78
Query: 125 QPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
+CTPSR +L++GKYPIHTG Q I EP + L +PE R GYST +GKWH
Sbjct: 79 PAMCTPSRGALLSGKYPIHTGTQHFVISNEEPWSLMLNTTLMPEIFRSAGYSTNLVGKWH 138
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
LGF R EYTP +RGF+ H+GY I YY Q + G+D RRN+ G
Sbjct: 139 LGFARPEYTPTHRGFDYHYGYWGAYIDYYQR--RSQMPEKTYIVGYDFRRNMEVECTDRG 196
Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
Y TDL T EA ++I++ KPLFL + HLA H N L+AP+E + +F +I DPN
Sbjct: 197 VYVTDLLTNEAERIIQETAAKQKPLFLMINHLATHTANDNDPLQAPEEEVQKFLHIKDPN 256
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R YAAMV KLD SVG VI+AL R LENSI+IF SDNGAP+V + N GSN+P
Sbjct: 257 HRKYAAMVSKLDQSVGRVITALARAEQLENSIVIFYSDNGAPSV-----GMFANTGSNWP 311
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
+G KN+ WEGGV+V ++WS + + Q +++ D+LPTL AAG + L +D
Sbjct: 312 LKGQKNSPWEGGVRVAGVIWSQLLTARGHLFTQPIYVGDFLPTLAHAAGIELPE-SLQLD 370
Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
G+D W L + + P +L ++D+ +++ L WK
Sbjct: 371 GIDLWPQLAGSK-------------------DAPHVPREILHSLDDVSHFSSLLLGQWKY 411
Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
V GT G D +++ +VE + + + L++ ++ +M + R
Sbjct: 412 VNGTTLAGQFDQQLTHRELDELDPRARRYVVEVRNSPASKALARYDLQRLTQ-HRMHTLR 470
Query: 544 QQATIHCG--ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKY 599
+ AT+ CG C L++L DPCE NN+A S +I + + + ++
Sbjct: 471 RLATVRCGDMQRGCNALLEEC------LYDLSVDPCELNNLAYSERHAEILATMRQRVQQ 524
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTW 626
R +P + +P + A+P TW
Sbjct: 525 LRAEALPPGN-RPGVSYANPNLHECTW 550
Score = 71.6 bits (174), Expect = 1e-09, Method: Compositional matrix adjust.
Identities = 35/81 (43%), Positives = 47/81 (58%), Gaps = 1/81 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I++ KPLFL + HLA H N L+AP+
Sbjct: 184 FRRNMEVECTDRGVYVTDLLTNEAERIIQETAAKQKPLFLMINHLATHTANDNDPLQAPE 243
Query: 60 ETINQFQYITDPNRRTYAALT 80
E + +F +I DPN R YAA+
Sbjct: 244 EEVQKFLHIKDPNHRKYAAMV 264
>gi|22450123|emb|CAD33828.1| glucosinolate sulphatase [Plutella xylostella]
Length = 547
Score = 331 bits (848), Expect = 9e-88, Method: Compositional matrix adjust.
Identities = 197/561 (35%), Positives = 305/561 (54%), Gaps = 48/561 (8%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA + + GW+D S HGS + TPN+D L +G+ L+ Y +C+P+R +++
Sbjct: 19 AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKY GMQG P+ AE RG+PL ER + +YL++ GY T+ +GKWH+G E P Y
Sbjct: 79 TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGFE+HFG G I YY++ +Q + G + +L W T G Y TD++T+++
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+IE+ V +PL+L L H A H GN L+AP E + +++ RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
S+G +++ L++KGMLEN+II F +DNGAPTV S GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311
Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ A++W+ P++ +V MH +DW+PTL A G ++P +DG+ WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
+ N PS R + ID+ ++V L KLV GT +
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405
Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
++ +YG+ + P + +SK ++SL + I +P+ + D M + R +A +
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CG N P SP C L+++ DPCE +++ P ++ L L+ ++P+
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
+ ++ PK FN TW ++
Sbjct: 518 QYVADPKSAPKYFNYTWDAYL 538
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
+L W T G Y TD++T+++ +IE+ V +PL+L L H A H GN L+AP E +
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234
Query: 64 QFQYITDPNRRTYAALTK 81
+++ RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252
>gi|391326893|ref|XP_003737944.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 528
Score = 328 bits (842), Expect = 4e-87, Method: Compositional matrix adjust.
Identities = 203/562 (36%), Positives = 280/562 (49%), Gaps = 75/562 (13%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA + + L ++ GW+D+S HGS++IPTPNID LA G++L N Y Q +CTPSR +LM
Sbjct: 17 AASSPNIVLIVIDDLGWDDISLHGSDQIPTPNIDKLAAEGVLLENYYTQAICTPSRGALM 76
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKYPIH G+Q I GA+P G+P + +P+YL Y + IGKWHLG R E P
Sbjct: 77 TGKYPIHLGLQYDVIQGAQPYGLPTDFKIMPQYLSGTCYKSHIIGKWHLGHSRSELLPTR 136
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEA 255
RGF SHFG+ G Y++H + E+ G D+ N G YA DLFTK A
Sbjct: 137 RGFHSHFGFRLGHSDYFNHWGEESSPVKNEMYAGLDLWSNEVPIKKYHGTYANDLFTKRA 196
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKL 314
+ ++E PLFLYLAH A H G+ L+AP + +F+ I D R+ YAAMV +
Sbjct: 197 ISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDEKRKHYAAMVSAM 256
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D S+G ++ + G EN+II +DNG PT + + SN+P RG K TLWEG
Sbjct: 257 DASIGELMHGISTNGFAENTIIFLTNDNGGPTNGMASSGS-----SNHPLRGCKYTLWEG 311
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G + A+ W P+ + M HI+D LPTL TA+G + N DG D W +L N
Sbjct: 312 GTRGSALFWYPK-KLGAGTYKGMAHITDVLPTLLTASGNAVAG---NFDGYDLWDALSTN 367
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRR-------NSVLINIDEKKRTAAVRLDSWKLVLGT 487
R+ + P R +S+L NID + AVR +KL+
Sbjct: 368 AQPRKET-------------VEPQRTYGNEHDFHSLLYNIDPVTHSWAVRSGDFKLITDN 414
Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
NG G+Y ++ + TR T
Sbjct: 415 ALNGEYSGWY------------------------------------PTVESVNLTR---T 435
Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
+ CG+ S PC LFNL DPCE+N+I RPD+ +L L ++ ++ P
Sbjct: 436 VKCGSASKACDGS---KRPC-LFNLAADPCEENDIFDERPDMVRKLMVKLNEYQTSMRPM 491
Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
+ +P+ ADP RF+ W PW
Sbjct: 492 EN-RPNSAAADPARFDFVWMPW 512
Score = 63.2 bits (152), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 31/76 (40%), Positives = 44/76 (57%), Gaps = 1/76 (1%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
G YA DLFTK A+ ++E PLFLYLAH A H G+ L+AP + +F+ I D
Sbjct: 185 GTYANDLFTKRAISILETHNKTTPLFLYLAHQAVHVGDGENPLQAPLSFVKKFEGKIYDE 244
Query: 72 NRRTYAALTKSTTLTL 87
R+ YAA+ + ++
Sbjct: 245 KRKHYAAMVSAMDASI 260
>gi|195128415|ref|XP_002008659.1| GI13615 [Drosophila mojavensis]
gi|193920268|gb|EDW19135.1| GI13615 [Drosophila mojavensis]
Length = 576
Score = 327 bits (837), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 206/566 (36%), Positives = 295/566 (52%), Gaps = 48/566 (8%)
Query: 72 NRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV 127
+R T T S ++I+ G++DLS G E TPNIDALA++G +L+ +YA +
Sbjct: 22 DRETNTTRTGSKLPNIIIIMADDMGFDDLSIRGGREFLTPNIDALAFHGRLLDRLYAPAM 81
Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
CTPSR +L++GKYPIHTG Q I EP + L +PE R GYST +GKWHLG+
Sbjct: 82 CTPSRGALLSGKYPIHTGTQHFVISNQEPWSLKLNTTLMPEIFRAAGYSTNLVGKWHLGY 141
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
R E+TP +RGF+ H+GY I YY Q + G+D RRN+ G Y
Sbjct: 142 ARPEFTPTHRGFDYHYGYWGAYIDYYQR--RSQMPDKTYIMGYDFRRNMEVECADRGVYM 199
Query: 248 TDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
TDL T EA ++I++ +PLFL + HLA H GN L+ P+E + +F +I DPN R
Sbjct: 200 TDLLTNEAERVIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPEEELQKFTHIKDPNHRK 259
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YAAMV KLD SVG VI AL R LENSI+IF SDNGAP+V + N GSN+P +G
Sbjct: 260 YAAMVSKLDKSVGRVIGALARAEQLENSIVIFYSDNGAPSV-----GMFANTGSNWPLKG 314
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDG 424
KN+ WEGG++V +WSP ++ V Q +++ D+LPTL AAG + LP L +DG
Sbjct: 315 QKNSPWEGGLRVAGAIWSPLLKARSSVFTQPIYVGDFLPTLAHAAGIE---LPDSLQLDG 371
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
+D W L+ P P +L ++D+ +++ L WK V
Sbjct: 372 IDLWPQ--LSGPK-----------------EAPHVPREILHSLDDVWNVSSLLLGHWKYV 412
Query: 485 LGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQ 544
GT G D Q + + +VE + + + L++ ++ +M++
Sbjct: 413 NGTTLAGQYDQQLVQRELDDLDPRANRYVVEVRNSLASRALARFDQQRLTQ-HRMKNIMI 471
Query: 545 QATIHCG--ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR--PDISSQLYELLKYH 600
A + CG C L+++ DPCE+NN+A S +I + + + +K
Sbjct: 472 DAKVDCGDMQRGCNALLEEC------LYDMSVDPCERNNLAYSEQHAEILATMRQRIKQL 525
Query: 601 RRTLVPQSHEQPDLVQADPKRFNDTW 626
R VP P + A+P TW
Sbjct: 526 RAGAVP-IQNLPGVAYANPTLHECTW 550
Score = 70.5 bits (171), Expect = 2e-09, Method: Compositional matrix adjust.
Identities = 36/93 (38%), Positives = 51/93 (54%), Gaps = 1/93 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ 59
RRN+ G Y TDL T EA ++I++ +PLFL + HLA H GN L+ P+
Sbjct: 184 FRRNMEVECADRGVYMTDLLTNEAERVIQETAAKQQPLFLMINHLAVHTGNDNDPLQVPE 243
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIVYG 92
E + +F +I DPN R YAA+ ++ V G
Sbjct: 244 EELQKFTHIKDPNHRKYAAMVSKLDKSVGRVIG 276
>gi|22450115|emb|CAC86338.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 326 bits (836), Expect = 2e-86, Method: Compositional matrix adjust.
Identities = 195/554 (35%), Positives = 301/554 (54%), Gaps = 48/554 (8%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA + + GW+D S HGS + TPN+D L +G+ L+ Y +C+P+R +++
Sbjct: 19 AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKY GMQG P+ AE RG+PL ER + +YL++ GY T+ +GKWH+G E P Y
Sbjct: 79 TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGFE+HFG G I YY++ +Q + G + +L W T G Y TD++T+++
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+IE+ V +PL+L L H A H GN L+AP E + +++ RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
S+G +++ L++KGMLEN+II F +DNGAPTV S GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311
Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ A++W+ P++ +V MH +DW+PTL A G ++P +DG+ WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
+ N PS R + ID+ ++V L KLV GT +
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405
Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
++ +YG+ + P + +SK ++SL + I +P+ + D M + R +A +
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CG N P SP C L+++ DPCE +++ P ++ L L+ ++P+
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517
Query: 610 EQPDLVQADPKRFN 623
+ ++ PK FN
Sbjct: 518 QYVSDPKSAPKYFN 531
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
+L W T G Y TD++T+++ +IE+ V +PL+L L H A H GN L+AP E +
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234
Query: 64 QFQYITDPNRRTYAALTK 81
+++ RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252
>gi|22450117|emb|CAC86342.1| glucosinolate sulfatase [Plutella xylostella]
gi|22450119|emb|CAC86343.1| glucosinolate sulfatase [Plutella xylostella]
Length = 532
Score = 326 bits (835), Expect = 3e-86, Method: Compositional matrix adjust.
Identities = 195/554 (35%), Positives = 301/554 (54%), Gaps = 48/554 (8%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
AA + + GW+D S HGS + TPN+D L +G+ L+ Y +C+P+R +++
Sbjct: 19 AATKPHVIMIMADDMGWDDTSTHGSKSVLTPNLDVLTRSGVSLHRYYTHALCSPARTAVL 78
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TGKY GMQG P+ AE RG+PL ER + +YL++ GY T+ +GKWH+G E P Y
Sbjct: 79 TGKYAHTVGMQGMPLSNAEERGIPLEERLISQYLQDAGYRTQMVGKWHVGHAFFEQLPTY 138
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGFE+HFG G I YY++ +Q + G + +L W T G Y TD++T+++
Sbjct: 139 RGFENHFGVRGGFIDYYEYNAQEQLDGR-PVTGLCLFDDLQPDWTTEG-YITDVYTEKST 196
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+IE+ V +PL+L L H A H GN L+AP E + +++ RR +AAMVKKLDD
Sbjct: 197 TIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVRAQRHVELHPRRIFAAMVKKLDD 256
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
S+G +++ L++KGMLEN+II F +DNGAPTV S GSNYP RGVK + WEGG+
Sbjct: 257 SIGEIVATLEKKGMLENTIITFSTDNGAPTVGLGANS-----GSNYPLRGVKKSPWEGGI 311
Query: 377 KVPAILWS-PQIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ A++W+ P++ +V MH +DW+PTL A G ++P +DG+ WS +
Sbjct: 312 RGNAMIWAGPEVAPGNAWRGKVYDGNMHAADWVPTLLEAIG---EKIPAGLDGIPMWSHI 368
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
+ N PS R + ID+ ++V L KLV GT +
Sbjct: 369 IENKPSPRTE----------------------IFEIDDYFNHSSVTLGRHKLVKGTIDE- 405
Query: 492 TMDGYYGQTRSNKV--PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
++ +YG+ + P + +SK ++SL + I +P+ + D M + R +A +
Sbjct: 406 SLSKHYGEDLRGIIGTPPDYKQKLRDSKAWESL----ETIGIPL-DADVM-ADRDEAIVT 459
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CG N P SP C L+++ DPCE +++ P ++ L L+ ++P+
Sbjct: 460 CG-NVVPKPCSPSAESWC-LYDIIEDPCELRDLSEELPQLAQILLYRLEQEEAKIIPREG 517
Query: 610 EQPDLVQADPKRFN 623
+ ++ PK FN
Sbjct: 518 QYVADPKSAPKYFN 531
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/78 (37%), Positives = 46/78 (58%), Gaps = 1/78 (1%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
+L W T G Y TD++T+++ +IE+ V +PL+L L H A H GN L+AP E +
Sbjct: 176 DLQPDWTTEG-YITDVYTEKSTTIIENHNVSEPLYLLLTHHAPHNGNEDASLQAPPEEVR 234
Query: 64 QFQYITDPNRRTYAALTK 81
+++ RR +AA+ K
Sbjct: 235 AQRHVELHPRRIFAAMVK 252
>gi|328705055|ref|XP_001946210.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 470
Score = 325 bits (834), Expect = 4e-86, Method: Compositional matrix adjust.
Identities = 195/497 (39%), Positives = 272/497 (54%), Gaps = 46/497 (9%)
Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
P+ Q I EP G+PL E LPE+L +LGY++ A+GKWHLG+F++ YTP YRGF+
Sbjct: 5 PLLARTQHNVILEPEPWGLPLNEILLPEHLNKLGYTSHAVGKWHLGYFKKAYTPTYRGFK 64
Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
S +G+ NG YY H++ ++ G DMRR+L+ W +VG+Y+T LFTKEA +I
Sbjct: 65 SFYGFWNGYQDYYTHMVQATFA---SFEGFDMRRDLNPDWSSVGKYSTHLFTKEATDIIT 121
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
PLFLYLAHLA HAG L+APQE IN FQ I D RR YA M+K LD+SV
Sbjct: 122 KHNNSVPLFLYLAHLAPHAGTYENPLQAPQEDINSFQSIKDKYRRKYAGMMKNLDNSVAE 181
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V +AL ML+N+I +F+SDNGAPT +RN+GSN+P +G K T WEGG++ A
Sbjct: 182 VFNALHTNNMLDNTIFVFVSDNGAPT-----NGIHRNYGSNWPLKGEKATPWEGGIRTAA 236
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+WS + + + +MHISDWLPTLY AAGG+T L
Sbjct: 237 FVWSKLLSKKKTLPNPLMHISDWLPTLYQAAGGNTQDL---------------------- 274
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
IDG+ W S N S R +L NID+ AA+R ++K + G+ G +D Y+ +
Sbjct: 275 GAIDGISMWESFSNNIKSPRKQILHNIDDITGYAAIRDTNFKYIKGSTFLGYLD-YWSGS 333
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
S N +AI+ S T L Q++ + P + S R +T+ C M P
Sbjct: 334 LSPSTIRYNVDAILNSTTAGILSQINGDSLTP----SLIESLRNSSTVRCDFE-KDMKPC 388
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK--YHRRTL--VPQSHEQPDLVQ 616
PC LF++ DPCE N+ + +P+ + + K Y +L +S +P V+
Sbjct: 389 KPFIKPC-LFHIVKDPCELVNL-NYKPNSKMRKFVEAKIDYFETSLGKFRESASKPMNVR 446
Query: 617 ----ADPKRFNDTWSPW 629
A+P +N+TW W
Sbjct: 447 GTKDANPALYNNTWISW 463
Score = 90.9 bits (224), Expect = 2e-15, Method: Compositional matrix adjust.
Identities = 46/91 (50%), Positives = 58/91 (63%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRR+L+ W +VG+Y+T LFTKEA +I PLFLYLAHLA HAG L+APQE
Sbjct: 93 MRRDLNPDWSSVGKYSTHLFTKEATDIITKHNNSVPLFLYLAHLAPHAGTYENPLQAPQE 152
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
IN FQ I D RR YA + K+ ++ V+
Sbjct: 153 DINSFQSIKDKYRRKYAGMMKNLDNSVAEVF 183
>gi|449684458|ref|XP_002164438.2| PREDICTED: arylsulfatase I-like, partial [Hydra magnipapillata]
Length = 784
Score = 314 bits (805), Expect = 7e-83, Method: Compositional matrix adjust.
Identities = 197/549 (35%), Positives = 288/549 (52%), Gaps = 77/549 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTPNID+LA +G+ILNN Y P ++ MTGKY H G Q +
Sbjct: 1 GWDDVSFHGSPQIPTPNIDSLAKSGVILNNYYVSPSSFATKTEFMTGKYATHLGTQHGVL 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV-I 210
+P G+P TE+ LP+YL+E GY+ A+GKW LG+++ E P RGF+ +G L
Sbjct: 61 HNKQPFGLPHTEKILPQYLKEAGYNNYAVGKWALGYYKEEMLPWKRGFDFFYGGLTSSGK 120
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D E G D+RRN + G Y T+++T+EAV +I++ +KPLFL
Sbjct: 121 DYYTHSAFD------ENYGLDLRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFL 174
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+AH A H GNA L+AP+ + + +I + R+ +A MV LD+SV + AL +KG+
Sbjct: 175 YVAHQAVHTGNADDPLQAPESYLKKLNHIKNIKRKLFAGMVLALDESVLNITIALAQKGL 234
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+N++I+F +D G V +E S+ SN+P RG K T+WEGG++ A ++S I+
Sbjct: 235 LDNTVIVFTTDAGG-AVGGQELSS----ASNFPLRGSKLTVWEGGIRAVAFVYSDLIKNK 289
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVSL+M H +DWL T AG +N G+ +DG D W
Sbjct: 290 GRVSLEMYHSTDWLLTTLGLAG-------INTKGIK----------------VDGFDVWK 326
Query: 451 SLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQ--TRSNKVPL 507
S+ S R +L D K ++R+ +KLV+ G+YG R ++
Sbjct: 327 SISEGEVSPRFEILHQFDPLTKNLCSLRVGDYKLVINQD-----IGFYGDWYPRPAEIGE 381
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN-PAP---MTPSPC- 562
L + +E+ LP A + C N P P M PC
Sbjct: 382 LRYLKRIET--------------LP------------NAKVTCDKNYPHPFLEMHAPPCD 415
Query: 563 -TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
PC +FN+ DPCE +N+A P+ L + L+++R TLV ++ D QADP+
Sbjct: 416 PMKKPC-MFNIQWDPCEFHNLAEFMPNTLKVLLDRLQFYRLTLVKPTYPSVD-SQADPEN 473
Query: 622 FNDTWSPWI 630
+ WSPW+
Sbjct: 474 SDGVWSPWV 482
Score = 68.9 bits (167), Expect = 7e-09, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 50/80 (62%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+RRN + G Y T+++T+EAV +I++ +KPLFLY+AH A H GNA L+AP+
Sbjct: 136 LRRNNEVIHNETGNYITEVYTREAVNIIKNYNDNKPLFLYVAHQAVHTGNADDPLQAPES 195
Query: 61 TINQFQYITDPNRRTYAALT 80
+ + +I + R+ +A +
Sbjct: 196 YLKKLNHIKNIKRKLFAGMV 215
>gi|260794559|ref|XP_002592276.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
gi|229277492|gb|EEN48287.1| hypothetical protein BRAFLDRAFT_206928 [Branchiostoma floridae]
Length = 520
Score = 311 bits (797), Expect = 7e-82, Method: Compositional matrix adjust.
Identities = 193/541 (35%), Positives = 281/541 (51%), Gaps = 75/541 (13%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGWND+ +HGS I TPN+D LA G+ L N Y QP+CTPSR+ LMTG+Y IH G+Q
Sbjct: 52 YGWNDIGYHGS-VIRTPNLDRLAAEGVKLENYYIQPICTPSRSQLMTGRYQIHFGLQHSI 110
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P G+PL E LP+ L+E GYST +GKWHLGF++ EYTPL+RGF++ +G+L G
Sbjct: 111 IWPPQPSGLPLDEVTLPQRLKEGGYSTHIVGKWHLGFYKEEYTPLHRGFDTFYGFLTGSE 170
Query: 211 SYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
++Y H S NG D+R D G Y+T LF K+A+++I Q KP+F
Sbjct: 171 NHYSHRNSGGMPGFRPGWNGLDLRDQDRPVTDQNGTYSTHLFAKKAIEVIAQQDKSKPMF 230
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL A HA L+APQ+ I+ +++I D NRR YAAM +D++VG + AL++ G
Sbjct: 231 LYLPFQAVHA-----PLQAPQKYISMYRHINDYNRRMYAAMTTAMDEAVGNITDALKQYG 285
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +N++++F +DNG + G+N+P RG K +LWEGG++ + SP I+
Sbjct: 286 LWDNTVLVFSTDNGGQVMAG---------GNNWPLRGWKGSLWEGGIRGVGFVNSPLIKA 336
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ- 448
R S ++HISDW PTL AGG T+ +DG D W ++ PS R + +D
Sbjct: 337 KGRTSDALIHISDWFPTLVGLAGGSTNGTK-PLDGHDVWEAISEGKPSPRKEILHNIDPL 395
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
+ ++L P + N AA+R WKL + GY G + +P
Sbjct: 396 FRTILYLPPHQWGDDTFNTSMH---AAIRSGDWKL---------LTGYPGN--GSWMPPP 441
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
+ + I E Q L +
Sbjct: 442 SSDLICEESDDPPGQHL------------------------------------------W 459
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ DP E+ +++ P + L E L ++++ VP + PD +A+P D WSP
Sbjct: 460 LFNIREDPEERTDLSKKYPLVVQDLLEKLAAYKKSSVPIFYPPPD-PRANPALHGDLWSP 518
Query: 629 W 629
W
Sbjct: 519 W 519
Score = 64.3 bits (155), Expect = 2e-07, Method: Compositional matrix adjust.
Identities = 33/73 (45%), Positives = 47/73 (64%), Gaps = 5/73 (6%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF K+A+++I Q KP+FLYL A HA L+APQ+ I+ +++I
Sbjct: 202 DQNGTYSTHLFAKKAIEVIAQQDKSKPMFLYLPFQAVHA-----PLQAPQKYISMYRHIN 256
Query: 70 DPNRRTYAALTKS 82
D NRR YAA+T +
Sbjct: 257 DYNRRMYAAMTTA 269
>gi|156362330|ref|XP_001625732.1| predicted protein [Nematostella vectensis]
gi|156212578|gb|EDO33632.1| predicted protein [Nematostella vectensis]
Length = 491
Score = 310 bits (795), Expect = 1e-81, Method: Compositional matrix adjust.
Identities = 193/539 (35%), Positives = 282/539 (52%), Gaps = 83/539 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ FHGS +I TPNID LA NG+IL+N Y QPVCTP+RASLMTGKYPIHTG+Q I
Sbjct: 36 GWSDVGFHGS-KIQTPNIDRLAANGVILDNYYVQPVCTPTRASLMTGKYPIHTGLQHGII 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+PL LP+ LR+ GYST +GKWHLGF+ E TP YRGF++ +G+ +G +
Sbjct: 95 HNGRPYGLPLNLTLLPQKLRKAGYSTHMLGKWHLGFYNWESTPTYRGFDTFYGFYSGAEN 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+Y H+ D Y D+R N D G Y+ LFTK A Q++ PLF+Y
Sbjct: 155 HYTHV-QDHYL--------DLRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMY 205
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A H+ ++AP+E I+++ +I DP RRTYAAMV +DD++G + A + G+
Sbjct: 206 MAFQNVHSP-----VQAPKEYIDRYSFIKDPLRRTYAAMVTIMDDALGNLTRAFDKAGLW 260
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
EN+I+IF +DNG +N G +YP RG K+TLWEGGV+ A + ++Q+
Sbjct: 261 ENTILIFSTDNGGVP---------KNGGYDYPLRGRKDTLWEGGVRGVAFVHGVALEQSG 311
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+MH++DW PTL + AGG LD+ + ++DG D W S
Sbjct: 312 VKCKALMHVTDWYPTLVSLAGGS----------LDE------------DEDLDGYDVWES 349
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+ S R +L NID + + LG G R + LL
Sbjct: 350 ISHGVESPRKELLHNID------TINIPPGDGSLGFSTTGI------GLRVGDMKLL--- 394
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
V + +Y F+P + R+ IH N PM L+N
Sbjct: 395 MAVPNISY----------FIPPED----RNGSVDWYIHSN-NKVPMV-------EVALYN 432
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
+ DP E++++ PD+ ++L ++++R+T VP +++ D + N W+PW+
Sbjct: 433 ITADPYEKHDLHDKLPDVVTRLQLRVEHYRKTAVPPANKPKDPYARQVAKQNGAWTPWV 491
Score = 58.9 bits (141), Expect = 8e-06, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 45/80 (56%), Gaps = 5/80 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N D G Y+ LFTK A Q++ PLF+Y+A H+ ++AP+E
Sbjct: 166 LRDNEEIVRDQNGTYSAHLFTKRAEQIVRAHDPSTPLFMYMAFQNVHSP-----VQAPKE 220
Query: 61 TINQFQYITDPNRRTYAALT 80
I+++ +I DP RRTYAA+
Sbjct: 221 YIDRYSFIKDPLRRTYAAMV 240
>gi|405956212|gb|EKC22964.1| Arylsulfatase B [Crassostrea gigas]
Length = 491
Score = 310 bits (793), Expect = 2e-81, Method: Compositional matrix adjust.
Identities = 189/548 (34%), Positives = 286/548 (52%), Gaps = 89/548 (16%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LLI+ +G+ND+ +HGS EI TPN+D LA G+ L N Y QP+CTP+R+ LM+G+Y IH
Sbjct: 27 LLIIADDFGYNDIGYHGS-EIKTPNLDKLAGEGVKLENYYVQPICTPTRSQLMSGRYQIH 85
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q I +P G+PL LP+ L+E+GYST A+GKWHLGF++ EY P RGF+SHF
Sbjct: 86 TGLQHGVIRPPQPNGLPLDSAILPQKLKEVGYSTHAVGKWHLGFYKEEYLPTNRGFDSHF 145
Query: 204 GYLNGVISYYDHILSDQYSRTV-ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
GYL G Y+ H R + G D+R N++ A + G Y+T LF ++A +++ +
Sbjct: 146 GYLTGAEDYFKH------DRCFGAMCGTDLRDNMNPA-NYTGVYSTHLFAQKAAEVVNNH 198
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
DKPLFLYL + HA L+ P++ + +I D RRTYA MV +D+++G +
Sbjct: 199 NTDKPLFLYLPFQSVHAP-----LQVPEQYTKPYMHIQDKQRRTYAGMVSAMDEAIGNLT 253
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
+++GM N++++F +DNG G+N+P RG K++LWEGGV ++
Sbjct: 254 DVFKKRGMWNNTLMVFSTDNGG---------QIHAGGNNFPLRGWKDSLWEGGVHGVGLV 304
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL-LLNTPSRRNS 441
+++ VS ++H+SDW PTL + GG T +DG+DQW ++ ++P R S
Sbjct: 305 HGQMLKRKGSVSRDLIHVSDWFPTLISMVGG-TLNGTKPLDGMDQWKTINEGSSPGPRQS 363
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
+ +D + P + D + R AAVR+ WKLV G NG+
Sbjct: 364 ILHNIDP-----MYPPVGQRVPSSPFDNRIR-AAVRMGDWKLVTGFPGNGSW-------- 409
Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
VP + ++I SK S+NI
Sbjct: 410 ---VPPPHMDSITLSK-----DSASKNI-------------------------------- 429
Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
+LFN+ DP E N+++ + + + ELL + +T VP + D +A+PK
Sbjct: 430 ------WLFNITEDPNEHNDVSGTYQHVVINMLELLAQYNKTAVPCRYPAMD-SKANPKY 482
Query: 622 FNDTWSPW 629
W PW
Sbjct: 483 HGGYWGPW 490
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 47/82 (57%), Gaps = 6/82 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N++ A + G Y+T LF ++A +++ + DKPLFLYL + HA L+ P++
Sbjct: 169 LRDNMNPA-NYTGVYSTHLFAQKAAEVVNNHNTDKPLFLYLPFQSVHAP-----LQVPEQ 222
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +I D RRTYA + +
Sbjct: 223 YTKPYMHIQDKQRRTYAGMVSA 244
>gi|156378148|ref|XP_001631006.1| predicted protein [Nematostella vectensis]
gi|156218038|gb|EDO38943.1| predicted protein [Nematostella vectensis]
Length = 584
Score = 309 bits (792), Expect = 3e-81, Method: Compositional matrix adjust.
Identities = 190/549 (34%), Positives = 280/549 (51%), Gaps = 110/549 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTP ID A G+ILNN Y P+CTPSRAS+MTGKYPI+ GM
Sbjct: 38 GWDDVSFHGSPQIPTPYIDFYANRGVILNNYYVSPMCTPSRASMMTGKYPINLGM----- 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
WHLGFF +EYTP+YRGF+S +G+ N
Sbjct: 93 -------------------------------WHLGFFTKEYTPVYRGFDSFYGFWNAKTD 121
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H S G D+R N+ G Y T+LFT+EAV++IE PLFLY
Sbjct: 122 YWNH-----SSYENNFWGVDLRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTSTPLFLY 176
Query: 272 LAHLAAHAGNAGKHLEAPQETIN--------QFQ-YITDPNRRTYAAMVKKLDDSVGTVI 322
+AH A H N + L+APQ+ I+ +F+ I D R+ YAAMV LD SVG +
Sbjct: 177 VAHQAVHTANPNEPLQAPQDKIDVSLKQRQQRFKGTIDDDQRQVYAAMVTSLDQSVGDIF 236
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPA 380
+AL ++ ML +S++IF +DNG NW GSN+P RG K+ LWEGGVK A
Sbjct: 237 AALSKRHMLRDSVVIFTTDNGGAPYGL-------NWNRGSNFPLRGGKDMLWEGGVKGVA 289
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
++S I+Q RVS +++ ++DW+PT+Y AGG L N+DG
Sbjct: 290 FVYSDLIKQKGRVSKELIDVTDWVPTIYHLAGGTAEFLVPNMDG---------------- 333
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
N+ WS++ PS R+ +L NID ++ A +R +K+V G + G+Y +
Sbjct: 334 KNV-----WSTISEGAPSPRDEILHNIDPWRKFAGLRKGKYKIVQGMDDTYKGVGWYDRY 388
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
+ +L + Q LP + ID ++ ++ C +
Sbjct: 389 PGH-----------------ALSSMKQPELLPGAVIDCKKTFDEERK--CDS-------- 421
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
++G LF++ DPCE +++++ P++ +++ L+Y++ +P P A+P
Sbjct: 422 --SDGKFCLFDMEEDPCEYHDLSNQLPEVLAEMKTRLEYYKNIALPPWFP-PINKAANPA 478
Query: 621 RFNDTWSPW 629
F WSPW
Sbjct: 479 NFGGFWSPW 487
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 36/91 (39%), Positives = 51/91 (56%), Gaps = 9/91 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N+ G Y T+LFT+EAV++IE PLFLY+AH A H N + L+APQ+
Sbjct: 137 LRDNMEPVQSEDGTYGTELFTREAVKVIEAHDTSTPLFLYVAHQAVHTANPNEPLQAPQD 196
Query: 61 TIN--------QFQ-YITDPNRRTYAALTKS 82
I+ +F+ I D R+ YAA+ S
Sbjct: 197 KIDVSLKQRQQRFKGTIDDDQRQVYAAMVTS 227
>gi|195403369|ref|XP_002060263.1| GJ19825 [Drosophila virilis]
gi|194140907|gb|EDW57358.1| GJ19825 [Drosophila virilis]
Length = 324
Score = 309 bits (791), Expect = 4e-81, Method: Compositional matrix adjust.
Identities = 154/302 (50%), Positives = 202/302 (66%), Gaps = 17/302 (5%)
Query: 74 RTYAALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCT 129
R A +T+ T ++I+ G+ND+SFHGSN+I TPNIDA AYNG+ILN Y +CT
Sbjct: 17 RQVAGVTQPTNPNIIIILIDDMGFNDVSFHGSNQILTPNIDAFAYNGVILNRYYVPNLCT 76
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
PSRA+L+TGKYPIH GMQ EP G+PL ER +P++ R+ GYST+ +GKWHLGF+R
Sbjct: 77 PSRAALLTGKYPIHNGMQHFVQIPDEPWGLPLGERLMPQFFRDAGYSTQLVGKWHLGFWR 136
Query: 190 REYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+++TP+ RGF+ HFGY NG I YYDH +L Y+ G D RR+L G Y
Sbjct: 137 QDHTPIMRGFDHHFGYYNGYIDYYDHTHYMLDRNYTA-----GADFRRDLQRCHSDNGTY 191
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
AT+ FTKEA ++IE + +PLF+ L+HLA H GN + ++AP E + +F +I+DP RRT
Sbjct: 192 ATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYEEVAKFVHISDPKRRT 251
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV LD SVG I AL +GML NSI++ SDNGAPT+ S GSN+P+RG
Sbjct: 252 YAGMVSSLDKSVGQTIRALSDRGMLNNSIVLLYSDNGAPTLGIHSNS-----GSNHPFRG 306
Query: 367 VK 368
V
Sbjct: 307 VS 308
Score = 82.8 bits (203), Expect = 5e-13, Method: Compositional matrix adjust.
Identities = 37/82 (45%), Positives = 53/82 (64%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+L G YAT+ FTKEA ++IE + +PLF+ L+HLA H GN + ++AP E
Sbjct: 177 FRRDLQRCHSDNGTYATEAFTKEARRIIEQHDLSRPLFMVLSHLAVHTGNENQPMQAPYE 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ +F +I+DP RRTYA + S
Sbjct: 237 EVAKFVHISDPKRRTYAGMVSS 258
>gi|322778941|gb|EFZ09355.1| hypothetical protein SINV_05168 [Solenopsis invicta]
Length = 775
Score = 307 bits (787), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 189/502 (37%), Positives = 272/502 (54%), Gaps = 62/502 (12%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y + GMQG P+ GAE RG+PL LPEYLR LGY+T +GKWH+G+ + + P RGF
Sbjct: 295 YKLLRGMQGYPLRGAERRGIPLNNTLLPEYLRRLGYTTHLVGKWHVGYHTKNFGPTRRGF 354
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---------YATDL 250
++ GY G+I Y++H L + G+D+ R VGE Y TDL
Sbjct: 355 DTFLGYYTGMIQYFNHTLYESGQL-----GYDLHR-------IVGENHTVEYRYDYMTDL 402
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDPNRRTYA 308
T E +I +KP++L L+HLA HA +A + +E +ET + F YI D NRR YA
Sbjct: 403 LTDEVESIISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDLNRRKYA 462
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
++V LD SVG V+ AL+RK +L N+IIIFM+DNGA T Y N+GSNYP RGVK
Sbjct: 463 SVVATLDKSVGRVVDALKRKDILNNTIIIFMADNGAQTEGYLA-----NYGSNYPLRGVK 517
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
TL+EG V+ A ++SP I ++ RVS ++ HI+DWLPTLY+AAGG+ S LDQ
Sbjct: 518 FTLFEGAVRGVACIYSPLIDRSSRVSTKLFHITDWLPTLYSAAGGNPS-------DLDQ- 569
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
+DG++QWS++ R S+L+NI E A + +KLV
Sbjct: 570 --------------LDGINQWSTIKKVKIDERRSILVNIQEDNEAAL--MGKYKLVRDRS 613
Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
+N YY + P N ++ S ++ +S ++ K+++ R++ATI
Sbjct: 614 DNQKHYDYYSGNNVS-YPKYNVTNVLTSPAASAIASVSAHVLTAA----KIKALRKKATI 668
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
C PS C N C LFN+ DPCE +++S + +L + ++ L+ +S
Sbjct: 669 VC---KNLTNPSSCKNRTC-LFNIYKDPCEFTDLSSKNTKVVERLSVFIDSYKSVLMKES 724
Query: 609 HEQPDLVQADPKRFNDTWSPWI 630
+ D P FN+TW PW+
Sbjct: 725 NTPVDPA-GFPCHFNNTWMPWL 745
Score = 258 bits (660), Expect = 5e-66, Method: Compositional matrix adjust.
Identities = 144/346 (41%), Positives = 203/346 (58%), Gaps = 47/346 (13%)
Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
MQG I G EPRG+PL + LPE+L+ LGY+T IGKWHLG+ ++TP YRGF+ G+
Sbjct: 1 MQGEDIQGGEPRGLPLNVKILPEHLQGLGYTTNLIGKWHLGYHTLQHTPSYRGFDYFCGF 60
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
N +SY+D+ S Q ++G+DM + A+ +Y TDLFT +AV++IE+
Sbjct: 61 YNSHVSYHDYKYSYQ-----NMSGYDMHCGDAPAYGLNDKYVTDLFTDKAVKIIENHDSF 115
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+PL+L ++HLA HA LE PQ+ + +F +I + +RR YA MV KLDDSVG ++
Sbjct: 116 RPLYLQISHLAVHAP-----LENPQDYDHSDRRFIHIREQHRRKYARMVSKLDDSVGRIV 170
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
L +GML +S+I+F++DNGA + +RN+GSNYP RG GGV+ ++L
Sbjct: 171 HTLGNRGMLRDSLILFLTDNGAAPI-----GKFRNFGSNYPLRG-------GGVRGVSVL 218
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
WSP++ + RV ++HISDWLPTLY AA GD S L
Sbjct: 219 WSPRLNKPARVCDNLIHISDWLPTLYAAADGDLSDL----------------------GE 256
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
IDG+DQW L P R+ +L+NIDE +T +KL+ G Q
Sbjct: 257 IDGVDQWRMLNDGHPIVRDKLLLNIDEISKTEGAIYKHYKLLRGMQ 302
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 29/71 (40%), Positives = 43/71 (60%), Gaps = 2/71 (2%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP--QETINQFQYITDP 71
+Y TDL T E +I +KP++L L+HLA HA +A + +E +ET + F YI D
Sbjct: 397 DYMTDLLTDEVESIISSHNTEKPMYLQLSHLAPHASDAEEVMEVRDWKETNDTFGYIKDL 456
Query: 72 NRRTYAALTKS 82
NRR YA++ +
Sbjct: 457 NRRKYASVVAT 467
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/70 (40%), Positives = 44/70 (62%), Gaps = 8/70 (11%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITD 70
+Y TDLFT +AV++IE+ +PL+L ++HLA HA LE PQ+ + +F +I +
Sbjct: 95 KYVTDLFTDKAVKIIENHDSFRPLYLQISHLAVHAP-----LENPQDYDHSDRRFIHIRE 149
Query: 71 PNRRTYAALT 80
+RR YA +
Sbjct: 150 QHRRKYARMV 159
>gi|241595184|ref|XP_002404450.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215502341|gb|EEC11835.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 311
Score = 307 bits (786), Expect = 1e-80, Method: Compositional matrix adjust.
Identities = 152/321 (47%), Positives = 200/321 (62%), Gaps = 10/321 (3%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D S HGS++IPTPN+DA+A +GIILN Y QP+CTPSRA+LMTG+YP H GMQ I
Sbjct: 1 GWDDTSIHGSSQIPTPNMDAIAADGIILNQHYVQPLCTPSRAALMTGRYPFHVGMQHSVI 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP +PL +PEY R LGY T +GKWHLG++ R+Y P+ RGF++ G+ N +
Sbjct: 61 KPAEPWALPLNYTLMPEYFRCLGYKTHMVGKWHLGYYDRQYVPIKRGFDTFLGFYNPSLD 120
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY+ + GHD R W EYAT +T + V++I P+FL+
Sbjct: 121 YYNQNFTGN-----NHTGHDFRCGDQNYWAEEKEYATYYYTNKTVEIIRRHNKSAPMFLF 175
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+H A H L+ P + F YI + NR +A MV LD SVG V+ LQ+ GML
Sbjct: 176 LSHQAPHVSGGRPLLQVPTHGVRNFSYIGENNRTLFAGMVDALDQSVGIVVDTLQKAGML 235
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++II+F SDNGA ++ N G N+P RG K TL+EGGV+ A LWS +++
Sbjct: 236 NDTIIVFCSDNGALPWGFKS-----NRGYNWPLRGGKFTLYEGGVRGTAFLWSRRLKNTK 290
Query: 392 RVSLQMMHISDWLPTLYTAAG 412
RVS Q+MHISDWLPTLY+AAG
Sbjct: 291 RVSNQLMHISDWLPTLYSAAG 311
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 40/82 (48%)
Query: 9 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
W EYAT +T + V++I P+FL+L+H A H L+ P + F YI
Sbjct: 144 WAEEKEYATYYYTNKTVEIIRRHNKSAPMFLFLSHQAPHVSGGRPLLQVPTHGVRNFSYI 203
Query: 69 TDPNRRTYAALTKSTTLTLLIV 90
+ NR +A + + ++ IV
Sbjct: 204 GENNRTLFAGMVDALDQSVGIV 225
>gi|77993374|ref|NP_254278.1| arylsulfatase B precursor [Rattus norvegicus]
gi|148887336|sp|P50430.2|ARSB_RAT RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S
gi|81158016|tpe|CAI84982.1| TPA: arylsulfatase B [Rattus norvegicus]
gi|195539740|gb|AAI68241.1| Arylsulfatase B [Rattus norvegicus]
Length = 528
Score = 304 bits (779), Expect = 7e-80, Method: Compositional matrix adjust.
Identities = 198/572 (34%), Positives = 286/572 (50%), Gaps = 85/572 (14%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + AA L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 30 PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L++ GY+T +GKWHLG +R+
Sbjct: 89 SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 206 STNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRRI 260
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV LD++VG V AL+ +G+ N+++IF +DNG T R+ G+N+P RG
Sbjct: 261 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 311
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T +DG D
Sbjct: 312 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTK-PLDGFD 370
Query: 427 QWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVRL 478
W ++ +PS R N + D D NT +N T A +R
Sbjct: 371 VWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRY 430
Query: 479 DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
+WKL+ G G Q+ ++VP S+ ++ ++L
Sbjct: 431 KNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SVDSPTKTLWL------- 469
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
F++ DP E+++++ P I L L+
Sbjct: 470 -------------------------------FDINRDPEERHDVSREHPHIVQNLLSRLQ 498
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP S+ P + DPK WSPW+
Sbjct: 499 YYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 528
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 205 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 259
Query: 75 TYAALT 80
YA +
Sbjct: 260 IYAGMV 265
>gi|260794113|ref|XP_002592054.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
gi|229277268|gb|EEN48065.1| hypothetical protein BRAFLDRAFT_250400 [Branchiostoma floridae]
Length = 478
Score = 302 bits (774), Expect = 3e-79, Method: Compositional matrix adjust.
Identities = 195/558 (34%), Positives = 274/558 (49%), Gaps = 110/558 (19%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGWND+ +HGS I TPN+D LA G+ L N Y QP+C+PSR LMTG+Y IH G+Q
Sbjct: 13 YGWNDIGYHGSF-IKTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSV 71
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I P G+PL E LP+ L+E+GYST +GKWHLGFFR+EY PL RGF++ +G+L G
Sbjct: 72 ITHDRPHGLPLDEVTLPQKLKEIGYSTHLVGKWHLGFFRQEYLPLRRGFDTFYGFLTGGE 131
Query: 211 SYYDHILSDQYSRTV-ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H + YS E +G D+R D G Y+T LF ++A+ +I +KP+F
Sbjct: 132 DYWSHRRPNVYSTDASEYHGLDLRDQDEPVLDQNGTYSTHLFQRKAIDIIAHHDRNKPMF 191
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL+ A HA L+AP+ I +++ + + R YAAMV +D++VG V AL+ G
Sbjct: 192 LYLSFQAVHAP-----LQAPEGYIEKYKSVQNILIRKYAAMVTAMDEAVGNVTDALKGSG 246
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +N+++IF +DNGA R+ GSN+P RG KNTLWEGG++ + S +++
Sbjct: 247 LWDNTVLIFSTDNGA----------RRSAGSNWPLRGWKNTLWEGGIRGVCFVTSNLLER 296
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
S ++HISDW PTL A G T + +DG D W
Sbjct: 297 KGTKSDALIHISDWFPTLIHLAQGST----------------------HDSKPLDGFDVW 334
Query: 450 SSLLLNTPSRRNSVLINIDEKKRT-----------------AAVRLDSWKLVLGTQENGT 492
+S+ S R +L+NID K AA+R WKL+ G Q+ G
Sbjct: 335 NSISTGAKSTREEILLNIDPLKCKIGFTCASGGGNFNTLTYAAIRSRDWKLLTGFQDQGG 394
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
K R T G
Sbjct: 395 W--------------------------------------------KHRKT-------SGN 403
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
+ T SP N +LFN+ NDP E+ +++ PD+ L LK + RT VP P
Sbjct: 404 HFEKPTDSP--NKHLWLFNIRNDPQERTDLSEKYPDVVQDLLMKLKTYNRTAVP-PFWPP 460
Query: 613 DLVQADPKRFNDTWSPWI 630
+A+P D + PW+
Sbjct: 461 RDPRANPVLHGDVFGPWL 478
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 42/73 (57%), Gaps = 5/73 (6%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF ++A+ +I +KP+FLYL+ A HA L+AP+ I +++ +
Sbjct: 163 DQNGTYSTHLFQRKAIDIIAHHDRNKPMFLYLSFQAVHAP-----LQAPEGYIEKYKSVQ 217
Query: 70 DPNRRTYAALTKS 82
+ R YAA+ +
Sbjct: 218 NILIRKYAAMVTA 230
>gi|410929555|ref|XP_003978165.1| PREDICTED: arylsulfatase B-like [Takifugu rubripes]
Length = 516
Score = 301 bits (771), Expect = 6e-79, Method: Compositional matrix adjust.
Identities = 184/541 (34%), Positives = 281/541 (51%), Gaps = 62/541 (11%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GW D+ +H S EI TP +D L+ G+ L N Y QP+CTPSR LMTG+Y IHTGMQ
Sbjct: 35 FGWYDVGYHLS-EIRTPILDKLSSGGVRLENYYVQPLCTPSRNQLMTGRYQIHTGMQHQI 93
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P VPL E+ LP+ ++E GY+T +GKWHLG ++++ P RGF+S+ GYL G
Sbjct: 94 IWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCLPTRRGFDSYLGYLTGSE 153
Query: 211 SYYDHILSDQYSR-TVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY HI S + D+R + A G Y+T+LF++ AV +IE +PLF
Sbjct: 154 DYYTHIRCHPISALNLTRCALDLREAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLF 213
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LY+A A HA L+ P+ + + +I D +RR+YA MV +D++VG + ALQ++G
Sbjct: 214 LYVAFQAVHA-----PLQVPERYVAPYSFIQDHSRRSYAGMVSAMDEAVGNITLALQKRG 268
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ EN++++F +DNG T+ + GSN+P RG K +LWEGGV+ + SP ++Q
Sbjct: 269 LWENTVLVFSTDNGGQTL---------SGGSNWPLRGRKWSLWEGGVRGVGFVASPLLEQ 319
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+S Q++HISDWLPTL AGG T+ +DG D W
Sbjct: 320 PGTISRQLIHISDWLPTLVGLAGGSTN----------------------GTKPLDGFDVW 357
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+++ S R +L NID A + + + Q G G +R+N ++
Sbjct: 358 NTISKGFASPRLELLHNIDPLYNDIAPCPGTRRPLTLAQVAG------GASRTNSGFNVS 411
Query: 510 FNAIVESKTYQSLQQL-SQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
+A + ++ L +I+ P + ST +A + P
Sbjct: 412 IHAAIRYSNWKLLTGYPGCDIWFPRPGHNGSDSTPSEAELQ---------------KPVR 456
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
L+N+ DP E+N +++ P + +L L +H+++ VP + D + DP W P
Sbjct: 457 LYNIEKDPEERNEVSAQFPAVVDRLLARLYHHQKSAVPINFPDDD-PKCDPGP-TGAWGP 514
Query: 629 W 629
W
Sbjct: 515 W 515
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 46/82 (56%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R + A G Y+T+LF++ AV +IE +PLFLY+A A HA L+ P+
Sbjct: 176 LREAEAVARSYKGTYSTELFSQRAVSIIEKHTSTEPLFLYVAFQAVHA-----PLQVPER 230
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ + +I D +RR+YA + +
Sbjct: 231 YVAPYSFIQDHSRRSYAGMVSA 252
>gi|241596950|ref|XP_002404637.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215500440|gb|EEC09934.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 406
Score = 301 bits (770), Expect = 9e-79, Method: Compositional matrix adjust.
Identities = 157/396 (39%), Positives = 225/396 (56%), Gaps = 33/396 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+S+HGS++IPTPNID LA +GIIL + Y QP+ TP+RA+L+TG YPIHTG Q I
Sbjct: 40 GWHDVSYHGSDQIPTPNIDVLAMDGIILFHNYVQPLSTPTRAALLTGLYPIHTGTQRLDI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGK---WHLGFFRREYTPLYRGFESHFGYLNG 208
A+P G+ LP+ L + ++G WHLGF + E+ P RGF++ +G NG
Sbjct: 100 GSADPIGLSADFTLLPQLSVTLADNFTSLGARSGWHLGFCKDEFKPTKRGFDTFYGIYNG 159
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
Y+ H D ++++GH ++ + G Y T L +AVQLI ++P +KP
Sbjct: 160 DSDYWTHFARDN---NIDVSGHALKDEKRALVEEAGRYLTSLLANQAVQLIHNRPKNKPF 216
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY A A H G + L+AP+E I++F Y+ D +R+ +A + +LD SVG ++ AL
Sbjct: 217 FLYFAPTAVHCGGSNGSLQAPKEYISKFGYLADYDRQLFAGSLAELDKSVGLIVEALYVS 276
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G L N++I F +DNG V + N N+P RG K T+ EGGV+ P LWS +
Sbjct: 277 GQLNNTVIAFSTDNGGAPVGFSA-----NTSPNWPLRGTKGTVAEGGVRGPGFLWSSSLT 331
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
RV+ Q+ H++DW+PT YTAAGG L + IDG+DQ
Sbjct: 332 TRGRVTQQLFHVTDWMPTFYTAAGGQAKDLTM----------------------IDGVDQ 369
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
W SL PS R VL NID +A+R + +K++
Sbjct: 370 WESLTRGAPSPRKEVLYNIDPISDQSAIRGERYKIM 405
Score = 67.8 bits (164), Expect = 2e-08, Method: Compositional matrix adjust.
Identities = 34/86 (39%), Positives = 50/86 (58%), Gaps = 4/86 (4%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
+ G Y T L +AVQLI ++P +KP FLY A A H G + L+AP+E I++F Y+
Sbjct: 189 EEAGRYLTSLLANQAVQLIHNRPKNKPFFLYFAPTAVHCGGSNGSLQAPKEYISKFGYLA 248
Query: 70 DPNRRTYAA----LTKSTTLTLLIVY 91
D +R+ +A L KS L + +Y
Sbjct: 249 DYDRQLFAGSLAELDKSVGLIVEALY 274
>gi|260794561|ref|XP_002592277.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
gi|229277493|gb|EEN48288.1| hypothetical protein BRAFLDRAFT_71008 [Branchiostoma floridae]
Length = 598
Score = 300 bits (769), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 194/588 (32%), Positives = 286/588 (48%), Gaps = 105/588 (17%)
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIV------YGWNDLSFHGSNEIPTPNIDALAY 114
+++ FQ Y+ + +S++ IV YGWND+ +HGS I TPN+D LA
Sbjct: 96 SVSYFQKSVRRQDVVYSDIQESSSGKPNIVFILADDYGWNDIGYHGS-VIRTPNLDRLAA 154
Query: 115 NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELG 174
G+ L N Y QP+C+PSR LMTG+Y I G+Q IW +P G+PL E LP+ L+E G
Sbjct: 155 EGVKLENYYVQPLCSPSRCQLMTGRYQIRYGLQHSLIWPPQPSGLPLDEVTLPQRLKEGG 214
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY-SRTVELNGHDMR 233
YST +GKWHLGF++++YTP +RGF++ +GYL G Y+ H + +G D+R
Sbjct: 215 YSTHIVGKWHLGFYKQDYTPTHRGFDTFYGYLTGAEDYWTHRQKGGLPGQPQTWSGLDLR 274
Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
D G Y+T LF +A+++I Q +KP+FL+L+ A H L+AP+E I
Sbjct: 275 DQNRPVTDQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDI 329
Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
+++ +I+D NRR YAAM +D +VG V AL++ G+ +N+++IF +DNG
Sbjct: 330 SRYSHISDTNRRVYAAMTTIMDQAVGNVTRALKQYGLWDNTVLIFSTDNGG--------- 380
Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
G N+P RG K +LWEGGV+ + SP I+ R S ++HISDW PTL A G
Sbjct: 381 RVDRGGINWPLRGWKGSLWEGGVRGVGFVNSPLIKAKGRTSDALIHISDWFPTLVGLASG 440
Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD------------QWSSLLLNTPSRRN 461
T+ +DG D W ++ PS R + +D QW + NT
Sbjct: 441 STNGT-KPLDGHDVWEAISDGKPSPRREILHNIDPMFHTVPSPRPHQWGDRVFNTSVH-- 497
Query: 462 SVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQS 521
AA+R WKL + GY G T
Sbjct: 498 ------------AAIRSGDWKL---------LTGYPGNT--------------------- 515
Query: 522 LQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNN 581
S + ST+++ P+ +LFN+ DP E+ +
Sbjct: 516 ------------SRVPPPSSTKEE-------------PADTPGKHLWLFNIREDPEERTD 550
Query: 582 IASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
++ P + +L E L + RT VP + D QA+P + W PW
Sbjct: 551 LSQKHPGVVQELLEKLARYNRTAVPVFYPSFD-PQANPALHGNIWGPW 597
Score = 63.5 bits (153), Expect = 4e-07, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 47/71 (66%), Gaps = 5/71 (7%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF +A+++I Q +KP+FL+L+ A H L+AP+E I+++ +I+
Sbjct: 282 DQNGTYSTHLFANKAIEIIAQQDKNKPMFLFLSFQAVH-----DPLQAPEEDISRYSHIS 336
Query: 70 DPNRRTYAALT 80
D NRR YAA+T
Sbjct: 337 DTNRRVYAAMT 347
>gi|350400025|ref|XP_003485710.1| PREDICTED: arylsulfatase B-like, partial [Bombus impatiens]
Length = 301
Score = 300 bits (768), Expect = 1e-78, Method: Compositional matrix adjust.
Identities = 136/217 (62%), Positives = 168/217 (77%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPNIDALAYNGIILNN Y +CTPSR++LMTGK PIH GMQ +
Sbjct: 85 GWNDVSFHGSDQIPTPNIDALAYNGIILNNHYVPALCTPSRSALMTGKNPIHLGMQHSVL 144
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ EPRG+PL+E+ LP+YL+E+GY T A+GKWHLG+F+++YTP YRGF+SHFGY NG+
Sbjct: 145 YPTEPRGLPLSEKLLPQYLQEIGYKTHAVGKWHLGYFKKQYTPTYRGFDSHFGYWNGLED 204
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI + S+ E G DMRRNL+ AWDT G+YATDLFT EAV+LI + ++P+FLY
Sbjct: 205 YYTHIAQEPDSQYNEYKGFDMRRNLTVAWDTAGKYATDLFTNEAVRLINEHDTERPMFLY 264
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
LAHLA H GN + L AP E I +F YI DP RR A
Sbjct: 265 LAHLAVHKGNENQLLRAPDEEIAKFSYILDPERRIQA 301
Score = 102 bits (255), Expect = 5e-19, Method: Compositional matrix adjust.
Identities = 47/77 (61%), Positives = 56/77 (72%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MRRNL+ AWDT G+YATDLFT EAV+LI + ++P+FLYLAHLA H GN + L AP E
Sbjct: 225 MRRNLTVAWDTAGKYATDLFTNEAVRLINEHDTERPMFLYLAHLAVHKGNENQLLRAPDE 284
Query: 61 TINQFQYITDPNRRTYA 77
I +F YI DP RR A
Sbjct: 285 EIAKFSYILDPERRIQA 301
>gi|125656171|ref|NP_033842.3| arylsulfatase B precursor [Mus musculus]
gi|81158036|tpe|CAI84992.1| TPA: arylsulfatase B [Mus musculus]
gi|195934801|gb|AAI68412.1| Arylsulfatase B [synthetic construct]
Length = 534
Score = 299 bits (766), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T+
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+DG + W ++ PS R +L NID+ D + +
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406
Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
+N T + + PL + FN + + Y++ + L+ + F P S
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
+N + + P +LF++ DP E+++++ P I L L+
Sbjct: 458 -------------SNVSEIPPVDPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP SH P + DPK WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534
Score = 55.5 bits (132), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>gi|122065132|sp|P50429.3|ARSB_MOUSE RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|74152170|dbj|BAE32375.1| unnamed protein product [Mus musculus]
Length = 534
Score = 299 bits (765), Expect = 3e-78, Method: Compositional matrix adjust.
Identities = 196/572 (34%), Positives = 289/572 (50%), Gaps = 85/572 (14%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T+
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 368
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+DG + W ++ PS R +L NID+ D + +
Sbjct: 369 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 406
Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQNI-----FLPISNIDK 538
+N T + + PL + FN + + Y++ + L+ + F P S
Sbjct: 407 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 457
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
+N + + P +LF++ DP E+++++ P I L L+
Sbjct: 458 -------------SNVSEIPPVGPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 504
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP SH P + DPK WSPW+
Sbjct: 505 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 534
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>gi|260803290|ref|XP_002596523.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
gi|229281781|gb|EEN52535.1| hypothetical protein BRAFLDRAFT_231623 [Branchiostoma floridae]
Length = 492
Score = 298 bits (763), Expect = 6e-78, Method: Compositional matrix adjust.
Identities = 170/416 (40%), Positives = 234/416 (56%), Gaps = 36/416 (8%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGWND+ +H S+ I TPN+D LA G+ L N Y QP+C+PSRA LMTG+Y I GMQ
Sbjct: 26 YGWNDIGYH-SSLIQTPNLDRLAQEGVKLENYYIQPICSPSRAQLMTGRYQIRYGMQHSV 84
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ P G+PL E LP+ L+E GY+T +GKWHLG F++EY P +RGF++ FG+L G
Sbjct: 85 LMSDRPHGLPLGEVTLPQVLKESGYATHIVGKWHLGHFKKEYLPTWRGFDTFFGFLGGGE 144
Query: 211 SYYDHILSDQYSRTVE-LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H + ++ T E D G Y+T +F ++++ LI DKP+F
Sbjct: 145 DYFTHRIPNEIVETPETYRAFDFWDGSKPCLSENGSYSTHVFARKSIDLISRHDKDKPMF 204
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL A HA LEAP+E IN++ +I N RTYA + +D++VG + AL+ G
Sbjct: 205 LYLPFQAVHAP-----LEAPEEFINKYTHIRSKNMRTYAGVTTAMDEAVGNITRALKDHG 259
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
M EN+++IF +DNGA GSN+P RG KNTLWEGGV+ + SP ++
Sbjct: 260 MWENTVLIFSTDNGASKA-----------GSNWPLRGFKNTLWEGGVRGVGFVSSPLLKV 308
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
R+S ++HISDW PT+ AGG T PL DG D W ++ TPS R + +D
Sbjct: 309 KQRISDALLHISDWFPTIVRIAGGSTFGTKPL--DGYDVWDTISEGTPSPRTEILHNID- 365
Query: 449 WSSLLLNTPSRRNSV----LINIDEKKRTAAVRLDSWKLVLGTQENG-----TMDG 495
L+ T S + + N K AA+R+ WKL+ G NG T DG
Sbjct: 366 --PLIAATNSDASKIHGHSTFNTSVK---AAIRVRDWKLLTGNPGNGDWQLQTFDG 416
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 30/70 (42%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y+T +F ++++ LI DKP+FLYL A HA LEAP+E IN++ +I N
Sbjct: 179 GSYSTHVFARKSIDLISRHDKDKPMFLYLPFQAVHAP-----LEAPEEFINKYTHIRSKN 233
Query: 73 RRTYAALTKS 82
RTYA +T +
Sbjct: 234 MRTYAGVTTA 243
>gi|260813923|ref|XP_002601665.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
gi|229286967|gb|EEN57677.1| hypothetical protein BRAFLDRAFT_228559 [Branchiostoma floridae]
Length = 478
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 76/542 (14%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGW+D+ +HGS I TP +D LA G+ L N Y QP+C+PSR LMTG+Y I G+Q
Sbjct: 11 YGWDDIGYHGSF-IQTPKLDRLAKEGVKLENYYVQPICSPSRCQLMTGRYQIRYGLQHSV 69
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I P G+PL E LP+ L+E GYST +GKWHLGFFR+E+ PL RGF+ +GYL G
Sbjct: 70 ITSDRPHGLPLDEVTLPQKLKENGYSTYVVGKWHLGFFRKEHMPLQRGFDKFYGYLTGGE 129
Query: 211 SYYDHILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H + Y++ + +G D+R D G Y+T LF K+A+++I++ KP+F
Sbjct: 130 DYWTHRRPNLYAKDPLAFHGLDLRDQDKPVLDQNGTYSTHLFAKKAIEIIQNHERSKPMF 189
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL A H+ LEAP E + +++ + +P +TYAAMV +D++VG V AL+R
Sbjct: 190 LYLPFQAVHSP-----LEAPTEYVEKYKDVKNPLVKTYAAMVSAMDEAVGNVTDALKRTR 244
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +N+++IF +DNGA RN GSN+P RG KNTLWEGGV+ + S +++
Sbjct: 245 LWDNTVLIFSTDNGA----------RRNAGSNWPLRGWKNTLWEGGVRGVGFVTSKLLRK 294
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R ++HISDW PTL + + S +DG D W
Sbjct: 295 GKRKCDALIHISDWFPTLLRIS----------------------RSVSNGTKPLDGFDIW 332
Query: 450 SSLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
++ TPS R +L NID +T LDS ++ Y RS LL
Sbjct: 333 DTISSGTPSPRKEILHNIDPLHPKTDLSSLDS------LHDHIFNPSVYAAIRSGDWKLL 386
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
+ + Y ++ +K + H +
Sbjct: 387 TGDQGLGGWKYSG------------TSGNKYEDAVEPPNKHL-----------------W 417
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ +DP E+N+++ PD+ +L E L + +T +P D +A+P D + P
Sbjct: 418 LFNIRSDPQERNDLSVQYPDVVQELLERLAAYNKTALPPFWPSRD-PRANPALHGDLFGP 476
Query: 629 WI 630
W+
Sbjct: 477 WL 478
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/73 (36%), Positives = 45/73 (61%), Gaps = 5/73 (6%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF K+A+++I++ KP+FLYL A H+ LEAP E + +++ +
Sbjct: 161 DQNGTYSTHLFAKKAIEIIQNHERSKPMFLYLPFQAVHSP-----LEAPTEYVEKYKDVK 215
Query: 70 DPNRRTYAALTKS 82
+P +TYAA+ +
Sbjct: 216 NPLVKTYAAMVSA 228
>gi|390360193|ref|XP_788463.2| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 537
Score = 296 bits (757), Expect = 3e-77, Method: Compositional matrix adjust.
Identities = 194/598 (32%), Positives = 287/598 (47%), Gaps = 113/598 (18%)
Query: 73 RRTYAALTKSTTLTLLIV---------------------YGWNDLSFHGSNEIPTPNIDA 111
R + A L ++ T+T+LI YGW D+ +H S I TPN+D
Sbjct: 11 RCSCAFLVEALTVTVLICTGLIKGATGKPPHIVFIVADDYGWFDVGYHNST-IKTPNLDL 69
Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I +P +PL E LP+ L+
Sbjct: 70 LASRGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHFVIIAPQPNCLPLNETTLPQKLK 129
Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN--- 228
E GY+T +GKWHLGF++ E PL RGF+S FGYL+G+ Y+ H S + E N
Sbjct: 130 ESGYATHLVGKWHLGFYKNECMPLQRGFDSSFGYLSGMQDYWTHFRSGSFPGFPEGNHWL 189
Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
G D N AW+ G Y+ +FT+ A ++I+ ++PLFLYL + H L+
Sbjct: 190 GIDFWDNNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQV 244
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
P++ + + + D R+TYA MV +D++VG V+ +LQ G+ +++++F +DNG +
Sbjct: 245 PEKYMKPYAHFQDVGRQTYAGMVATMDEAVGKVVDSLQEAGLWNDTVLVFTTDNGGTPGK 304
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPT 406
G+N+P RG KNTLWEGGV + P I + VS MHISDW PT
Sbjct: 305 S---------GNNWPLRGTKNTLWEGGVHGVGFITGPMIPAGVQGTVSKHFMHISDWFPT 355
Query: 407 LYTA-AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS---NID-----------GLDQWSS 451
L AGG+T+ L L D + W+S+ TPS R NID G D+ +
Sbjct: 356 LIEGVAGGNTAGLAL--DSYNMWNSITKGTPSPRKELLHNIDPYIRADHPFGYGYDEETD 413
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
++ + R AA+R+ WKL + G+ G++ P N +
Sbjct: 414 MIYPLSGLYPKMAAEFSTDMR-AAIRVGEWKL---------LTGFPGRSGWYPPPEWNIH 463
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
I P N +LFN
Sbjct: 464 PI--------------------------------------------DPVEAANKVTWLFN 479
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ DPCE+N+++ P++ ++L L+ + +T VP V+ADP + W PW
Sbjct: 480 ITADPCEKNDLSYQHPEVVTELVGRLEAYYKTSVPVRFPN-QTVKADPALHHGVWGPW 536
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 23/79 (29%), Positives = 42/79 (53%), Gaps = 5/79 (6%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
N AW+ G Y+ +FT+ A ++I+ ++PLFLYL + H L+ P++ +
Sbjct: 196 NNRVAWEYTGNYSQFVFTERAQRVIQQHNPNQPLFLYLPLQSVHG-----PLQVPEKYMK 250
Query: 64 QFQYITDPNRRTYAALTKS 82
+ + D R+TYA + +
Sbjct: 251 PYAHFQDVGRQTYAGMVAT 269
>gi|296194262|ref|XP_002744878.1| PREDICTED: arylsulfatase B [Callithrix jacchus]
Length = 534
Score = 295 bits (755), Expect = 4e-77, Method: Compositional matrix adjust.
Identities = 188/552 (34%), Positives = 280/552 (50%), Gaps = 87/552 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 57 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATTLITNHPPEKPL 233
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 288
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL A G T+ +DG D W ++ +PS R + +D
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTISEGSPSPRVELLHNID- 397
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYYG 498
+ ++P RNS D+ AA+R +WKL+ G G
Sbjct: 398 -PDFVDSSPCPRNSTAPAKDDSSLPEHSAFNTSVHAAIRHRNWKLLTGYPGCGYWFPPPS 456
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
Q +++P SL ++ ++L
Sbjct: 457 QYNVSEIP--------------SLDPPTKTLWL--------------------------- 475
Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
F++ DP E+++++ P I ++L L+++ + VP D + D
Sbjct: 476 -----------FDIDRDPEERHDLSREYPHIVTKLLSRLQFYHKHSVPVYFPAQD-PRCD 523
Query: 619 PKRFNDTWSPWI 630
PK W PW+
Sbjct: 524 PKA-TGVWGPWM 534
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 YYAGMV 271
>gi|344272682|ref|XP_003408160.1| PREDICTED: arylsulfatase B [Loxodonta africana]
Length = 532
Score = 294 bits (753), Expect = 8e-77, Method: Compositional matrix adjust.
Identities = 187/550 (34%), Positives = 287/550 (52%), Gaps = 83/550 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS+ I TP++DALA G+ L+N Y QP+CTPSR+ L++G+Y IHTG+Q I
Sbjct: 55 GWNDVGFHGSS-IRTPHLDALAAGGVRLDNYYVQPLCTPSRSQLLSGRYQIHTGLQHQII 113
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FT+ A LI + P +KPL
Sbjct: 174 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATALIANHPPEKPL 231
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NRR YA MV +D++VG V +AL+++
Sbjct: 232 FLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRRHYAGMVSLMDEAVGNVTAALKKR 286
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ +N++ IF +DNG T+ G+N+P RG K TLWEGGV+ + SP ++
Sbjct: 287 GLWDNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGVGFVASPLLK 337
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD- 447
Q + +++HISDWLPTL AGG T+ +DG D W ++ +PS R + +D
Sbjct: 338 QKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTVSEGSPSPRVELLHNIDP 396
Query: 448 ---QWSSLLLNT--PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
S N+ P++ +S + + AA+R +WKL+ G G Q+
Sbjct: 397 NFVDISPCPGNSTAPAKGDSSFPDYTAFNTSFHAAIRHRNWKLLTGYPGCGYWFPPPSQS 456
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
+++P SL ++ ++L
Sbjct: 457 NVSEIP--------------SLDPPTKTLWL----------------------------- 473
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
F++ DP E+++++ P + QL L+++ + VP + D + DP+
Sbjct: 474 ---------FDIEQDPEERHDLSREYPHVIKQLLSRLQFYHKHSVPVYYPAQD-PRCDPE 523
Query: 621 RFNDTWSPWI 630
W PW+
Sbjct: 524 S-TGAWGPWM 532
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 209 YSTNVFTERATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIEDKNRR 263
Query: 75 TYAALT 80
YA +
Sbjct: 264 HYAGMV 269
>gi|74140818|dbj|BAE34455.1| unnamed protein product [Mus musculus]
Length = 458
Score = 294 bits (752), Expect = 1e-76, Method: Compositional matrix adjust.
Identities = 169/428 (39%), Positives = 242/428 (56%), Gaps = 31/428 (7%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHISDWLPTL AGG T+ +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTK-PLDGFN 376
Query: 427 QWSSLLLNTPSRRNSNIDGLDQ--WSSLLL----NTPSRRNSVLINIDEKKRT--AAVRL 478
W ++ PS R + +DQ + L TP++ +S + + A +R
Sbjct: 377 MWKTISEGHPSPRVELLHNIDQDFFDGLPCPGKNMTPAKDDSFPLEHSAFNTSIHAGIRY 436
Query: 479 DSWKLVLG 486
+WKL+ G
Sbjct: 437 KNWKLLTG 444
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>gi|241834402|ref|XP_002414990.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509202|gb|EEC18655.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 425
Score = 293 bits (750), Expect = 2e-76, Method: Compositional matrix adjust.
Identities = 178/467 (38%), Positives = 247/467 (52%), Gaps = 61/467 (13%)
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
P+CTPSR++LMTG YPIH GMQ I A P G+PL + LP++ ++LGY IGKWHL
Sbjct: 2 PLCTPSRSALMTGLYPIHNGMQSTVILPAAPWGLPLENKILPQHFKDLGYDVNMIGKWHL 61
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
GFF+ Y P+ RGF++ FG G YY+H + ++ G TVG
Sbjct: 62 GFFQTPYVPMKRGFDTFFGLYTGSNDYYNHTSGSVSKKQEQITG--------ALCLTVGG 113
Query: 246 YATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
T L+ D P++ +P F Y +H A H + +AP + +F YI +
Sbjct: 114 -----ITHGVEPLLTDSPLNFYPQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEES 168
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
NR YA M+ LD+S+G V+ AL GMLE++II+F SDNG N G N+
Sbjct: 169 NRTIYAGMLDALDESMGRVVEALSSAGMLEDTIIVFSSDNGGGPYGIES-----NRGFNW 223
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
P RG K T+WEGGV+VPA +WSP+ + RVS Q+MHISDWLPTLYTAAGGD S L I
Sbjct: 224 PLRGAKFTVWEGGVRVPAFVWSPKFLKKSRVSNQLMHISDWLPTLYTAAGGDASALG-TI 282
Query: 423 DGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
DG+D W+SL + R++ S+R +L+N + + +R +K
Sbjct: 283 DGVDMWNSL-----THRSA----------------SKRKEILLNSNSLYNISGLRYKQYK 321
Query: 483 LVLGTQENGTMDGYY---GQTR-SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI--SNI 536
LV+G +G +D +Y G TR +N + L N ++ + + + P+ SN
Sbjct: 322 LVVGGGFDGELDDHYYFPGGTRPTNDIDQLR-NESTTARVLRKFYEKQDRSWSPLAWSNN 380
Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
+ R + T N PM P YLFN+ DPCE NN+A
Sbjct: 381 VAVDCRRNKLT----ENFVPMQPP-------YLFNIEKDPCELNNLA 416
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 18/56 (32%), Positives = 29/56 (51%), Gaps = 3/56 (5%)
Query: 27 LIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
L+ D P++ +P F Y +H A H + +AP + +F YI + NR YA +
Sbjct: 121 LLTDSPLNFYPQPFFCYFSHQAVHRALDSETFQAPARNVLKFPYIEESNRTIYAGM 176
>gi|126317548|ref|XP_001381590.1| PREDICTED: arylsulfatase B [Monodelphis domestica]
Length = 522
Score = 293 bits (749), Expect = 3e-76, Method: Compositional matrix adjust.
Identities = 191/548 (34%), Positives = 275/548 (50%), Gaps = 79/548 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H SN I TP++DAL+ G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 45 GWNDVGYHDSN-IFTPHLDALSAQGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 103
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P +PL E+ LPE LRE GY T +GKWHLG FR+E P RGF++ FGYL G
Sbjct: 104 WPCQPSCIPLDEKLLPELLREAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 163
Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H + V D R A Y+T++FT+ AV LI + P KPLFL
Sbjct: 164 YYTHKRCVHIDALKVTRCALDFRDGEDIAAGYENMYSTNVFTERAVNLIANHPAQKPLFL 223
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA + H + L+ P+E + + +I + NR+ YA MV +D++VG V ALQ+ G+
Sbjct: 224 YLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQHYAGMVTLMDEAVGNVTRALQKYGL 278
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++ IF +DNG T+ G+N+P RG K TLWEGG++ + SP +++
Sbjct: 279 WNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRGVGFVSSPLLKRK 329
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW- 449
S +++HISDWLPTL AGG T+ +DG D W ++ +PS R + +D
Sbjct: 330 RVKSSELIHISDWLPTLVNLAGGRTNGTK-PLDGFDVWKTISEGSPSPRMELLHNIDPGF 388
Query: 450 -SSLLLNTPSRRNSVLINIDEKKRT------AAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
+ + P+ S T AA+R +WKL + GY G
Sbjct: 389 VDTAPCDRPASPPSTRDPFPSAHSTFNVSIHAAIRYGNWKL---------LTGYPGCGHW 439
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
P L SN+ ++ S P P
Sbjct: 440 FPPPSL-------------------------SNVTELPS-----------------PDPP 457
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
T +LF++ DP E+ +++S P I +L + L+Y+ + VP + D DP+
Sbjct: 458 TKN-LWLFDIQQDPEERFDLSSQHPKIVKKLLDRLQYYHKNSVPVFYPDED-PHCDPEAA 515
Query: 623 NDTWSPWI 630
W PW+
Sbjct: 516 G-VWGPWM 522
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ AV LI + P KPLFLYLA + H + L+ P+E + + +I + NR+
Sbjct: 199 YSTNVFTERAVNLIANHPAQKPLFLYLALQSVH-----EPLQVPEEYLQPYDFIQNKNRQ 253
Query: 75 TYAALT 80
YA +
Sbjct: 254 HYAGMV 259
>gi|155372077|ref|NP_001094645.1| arylsulfatase B precursor [Bos taurus]
gi|151554899|gb|AAI48140.1| ARSB protein [Bos taurus]
Length = 533
Score = 292 bits (747), Expect = 4e-76, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 237/408 (58%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP +DALA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 56 GWNDVGFHGS-AIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FT+ A LI + P +KPL
Sbjct: 175 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPL 232
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NRR YA M +D++VG V +AL+R+
Sbjct: 233 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERR 287
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 288 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 338
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ + +++HISDWLPTL AGG T+ +DG D W+++ +PS R + +D
Sbjct: 339 RKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSPSPRMELLHNID- 396
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + P NS+ + DE AAVR +WKL+ G
Sbjct: 397 -PNFVDTAPCPGNSMALAKDESSLLEYSAFNTSIHAAVRHQNWKLLTG 443
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 210 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 YYAGMA 270
>gi|179030|gb|AAA51779.1| arylsulfatase B precursor [Homo sapiens]
Length = 533
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTI 380
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438
Query: 482 KLVLG 486
KL+ G
Sbjct: 439 KLLTG 443
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
>gi|410924964|ref|XP_003975951.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 574
Score = 291 bits (746), Expect = 6e-76, Method: Compositional matrix adjust.
Identities = 184/540 (34%), Positives = 274/540 (50%), Gaps = 73/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H I TP +D LA G+ L N Y QP+CTPSR+ LMTG+Y IHTG+Q I
Sbjct: 57 GFNDIGYHNPT-IKTPTLDKLAAEGVRLENYYVQPICTPSRSQLMTGRYQIHTGLQHSII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
++P +P LPE LR+ GYST +GKWHLGF+R+ P +GF++ FG L G +
Sbjct: 116 RPSQPSCLPSHMDTLPERLRQAGYSTHLVGKWHLGFYRKACLPTRKGFDTFFGSLTGSVD 175
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
+Y+++ D + G+D+ AW G+Y+T LFT+ A +++E P +KPLFL
Sbjct: 176 HYNYLSCDGPG----VCGYDLHDGEGVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFL 231
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+ A H L+ P+ I ++ + + RR AAMV +D++V V AL++ G
Sbjct: 232 LLSLQAVHTP-----LQTPKSYIYPYRDMANIARRKLAAMVSTVDEAVRNVTYALRKYGF 286
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NS+II+ +DNGA GSN+P RG K T WEGG++ A + SP +++
Sbjct: 287 YRNSVIIYSTDNGAQPF---------TGGSNWPLRGRKGTYWEGGIRGVAFVHSPLLKRR 337
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGG+ S+ +DG D W ++ S R + +D
Sbjct: 338 RRVSKALLHITDWFPTLVGLAGGNISQ-SRGLDGFDVWPTISEGRDSPRQEILHNID--- 393
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
P ++ ++ + AA+R+ WKL+ G +G + VP
Sbjct: 394 ------PLHKHVTSESLWDTSIQAAIRVGDWKLLTGYPGHG-----------DWVP---- 432
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
V L +NI P N T +LF
Sbjct: 433 -PQVLPTLPGRWWDLERNILSPYKN--------------------------STYKNIWLF 465
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
N+ DPCE++++A RPD+ QL L Y+ +T VP + PD +ADP + W PW+
Sbjct: 466 NITADPCERHDLADQRPDVVLQLLARLAYYNQTAVP-VYFPPDDPRADPSQHEGAWVPWV 524
Score = 45.4 bits (106), Expect = 0.083, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 42/79 (53%), Gaps = 6/79 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
AW G+Y+T LFT+ A +++E P +KPLFL L+ A H L+ P+ I
Sbjct: 197 GVAWGQEGKYSTTLFTQRARKILESHNPTEKPLFLLLSLQAVHTP-----LQTPKSYIYP 251
Query: 65 FQYITDPNRRTYAALTKST 83
++ + + RR AA+ +
Sbjct: 252 YRDMANIARRKLAAMVSTV 270
>gi|410260404|gb|JAA18168.1| arylsulfatase B [Pan troglodytes]
gi|410260406|gb|JAA18169.1| arylsulfatase B [Pan troglodytes]
gi|410260408|gb|JAA18170.1| arylsulfatase B [Pan troglodytes]
gi|410341765|gb|JAA39829.1| arylsulfatase B [Pan troglodytes]
gi|410341769|gb|JAA39831.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 291 bits (745), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439
Query: 482 KLVLG 486
KL+ G
Sbjct: 440 KLLTG 444
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|410260412|gb|JAA18172.1| arylsulfatase B [Pan troglodytes]
gi|410341771|gb|JAA39832.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 291 bits (745), Expect = 7e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 383 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 440
Query: 482 KLVLG 486
KL+ G
Sbjct: 441 KLLTG 445
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|38569405|ref|NP_000037.2| arylsulfatase B isoform 1 precursor [Homo sapiens]
gi|114223|sp|P15848.1|ARSB_HUMAN RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|179077|gb|AAA51784.1| arylsulfatase B precursor (EC 3.1.6.1) [Homo sapiens]
gi|119616227|gb|EAW95821.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
gi|119616229|gb|EAW95823.1| arylsulfatase B, isoform CRA_a [Homo sapiens]
Length = 533
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438
Query: 482 KLVLG 486
KL+ G
Sbjct: 439 KLLTG 443
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
>gi|410226860|gb|JAA10649.1| arylsulfatase B [Pan troglodytes]
gi|410292332|gb|JAA24766.1| arylsulfatase B [Pan troglodytes]
Length = 535
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 383 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 440
Query: 482 KLVLG 486
KL+ G
Sbjct: 441 KLLTG 445
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|410226852|gb|JAA10645.1| arylsulfatase B [Pan troglodytes]
gi|410226856|gb|JAA10647.1| arylsulfatase B [Pan troglodytes]
gi|410226858|gb|JAA10648.1| arylsulfatase B [Pan troglodytes]
gi|410292328|gb|JAA24764.1| arylsulfatase B [Pan troglodytes]
Length = 534
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439
Query: 482 KLVLG 486
KL+ G
Sbjct: 440 KLLTG 444
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|825628|emb|CAA51272.1| arylsulfatase [Homo sapiens]
gi|189067435|dbj|BAG37417.1| unnamed protein product [Homo sapiens]
Length = 533
Score = 291 bits (744), Expect = 9e-76, Method: Compositional matrix adjust.
Identities = 167/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLMKLARGHTNGTK-PLDGFDVWKTI 380
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 381 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 438
Query: 482 KLVLG 486
KL+ G
Sbjct: 439 KLLTG 443
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
>gi|291190498|ref|NP_001167123.1| Arylsulfatase B precursor [Salmo salar]
gi|223648254|gb|ACN10885.1| Arylsulfatase B precursor [Salmo salar]
Length = 528
Score = 291 bits (744), Expect = 1e-75, Method: Compositional matrix adjust.
Identities = 188/547 (34%), Positives = 274/547 (50%), Gaps = 66/547 (12%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GWND+ +HGS EI TPN+D L+ G+ L N Y QP+CTPSR LMTG+Y I TGMQ
Sbjct: 39 FGWNDVGYHGS-EIKTPNLDKLSAKGVRLENYYVQPLCTPSRNQLMTGRYQIRTGMQHQI 97
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P VPL E+ LP+ +RE GY+T +GKWHLG +R++ P RGF+S+FGYL G
Sbjct: 98 IWPCQPYCVPLDEKLLPQLMREAGYATHMVGKWHLGMYRKDCLPTRRGFDSYFGYLTGSE 157
Query: 211 SYYDHILSDQYSRTVELN----GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
Y+ H + S LN D+R A G Y+T LFT +I Q K
Sbjct: 158 DYFSH---QRCSYVPPLNVTRCAVDLREGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKK 214
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLY+A A HA L+ P+ + + +I DPNRR YA MV +D++VG + ALQ
Sbjct: 215 PLFLYVALQAVHA-----PLQVPERYVAPYSFIKDPNRRLYAGMVSAMDEAVGNITLALQ 269
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
G+ +N++++F +DNG T+ GSN+P RG K +LWEGGV+ + SP
Sbjct: 270 ETGLWDNTVLVFSTDNGGQTLAG---------GSNWPLRGRKWSLWEGGVRGVGFVTSPL 320
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
++Q VS Q++HISDWLPTL AGG T+ +DG + W+++ S R + +
Sbjct: 321 LKQPGTVSRQLIHISDWLPTLVGLAGGSTNGTK-PLDGYNVWNAISKGYASPRLELLHNI 379
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
D + ++I +A+ RL N + G G + SN P
Sbjct: 380 D--------------PLYVDIAPCPGSASQRLPV--------SNAPVSG--GGSTSN--P 413
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS----TRQQATIHCGANPAPMTPSPC 562
N+ + +++ + + D+ + TI NP
Sbjct: 414 WSWSNSGFNVSIHAAIRSMQWKLLTGYPGCDRWFTRPGLENTSGTIR-SVNPLK------ 466
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
P LF++ DP E++ ++ P++ L L ++ R P + D + DP
Sbjct: 467 ---PVMLFDVEADPEERDEVSDQHPNVVDNLLRRLGHYLRRSQPITFPDDD-PRCDPG-- 520
Query: 623 NDTWSPW 629
W PW
Sbjct: 521 TGAWGPW 527
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 41/82 (50%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R A G Y+T LFT +I Q KPLFLY+A A HA L+ P+
Sbjct: 180 LREGEEVATGYTGTYSTQLFTDRVTSIIAKQNSKKPLFLYVALQAVHA-----PLQVPER 234
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ + +I DPNRR YA + +
Sbjct: 235 YVAPYSFIKDPNRRLYAGMVSA 256
>gi|297675538|ref|XP_002815731.1| PREDICTED: arylsulfatase B [Pongo abelii]
Length = 534
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 166/425 (39%), Positives = 243/425 (57%), Gaps = 35/425 (8%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 40 AGASQPPHLVFLLADDLGWNDVGFHGS-RIHTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGG++ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 323 WEGGIRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSW 481
+PS R + +D + + ++P RNS+ D+ AA+R +W
Sbjct: 382 SEGSPSPRIELLHNID--PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNW 439
Query: 482 KLVLG 486
KL+ G
Sbjct: 440 KLLTG 444
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|301609482|ref|XP_002934299.1| PREDICTED: arylsulfatase J-like [Xenopus (Silurana) tropicalis]
Length = 564
Score = 290 bits (741), Expect = 2e-75, Method: Compositional matrix adjust.
Identities = 180/541 (33%), Positives = 277/541 (51%), Gaps = 76/541 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+C+PSR+ +TGKY IHTG+Q I
Sbjct: 58 GYRDIGYHGS-EIRTPTLDKLASEGVRLENYYVQPICSPSRSQFITGKYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
++P +PL LP+ L++ GY T +GKWHLGF+++E P RGF+S FG L G
Sbjct: 117 RPSQPNCLPLDNMTLPQKLKKAGYQTHMVGKWHLGFYKKECMPTQRGFDSFFGSLLGSGD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY+H D + + G+D+ N + AWD G Y+T+++T+ +Q++ +KPLFL
Sbjct: 177 YYNHYKCD----SPGICGYDLYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFL 232
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ + I + NRR YAAM+ LDD+V V +AL++ G
Sbjct: 233 YIAYQAVHS-----PLQAPGNYLDNYNSILNENRRRYAAMLSCLDDAVNNVTTALKKYGF 287
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSIII+ SDNG + G+N+P RG K T WEGG++ + SP ++
Sbjct: 288 YDNSIIIYSSDNGGQPMA---------GGNNWPLRGSKGTYWEGGIRAVGFVHSPILKMK 338
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ +++HI+DW PTL T AGG+ + +DG D W ++ S R + +D
Sbjct: 339 GYICKELVHITDWFPTLVTLAGGEIDE-EIQLDGYDIWETISEGKRSPRQEILHNIDP-- 395
Query: 451 SLLLNTPSRRNS--VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
+ T +R S I +A+R++ WKL+ G G + +P
Sbjct: 396 ---MYTKARNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNPGYG-----------DWIPPQ 441
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
+F SN+ R ++ T+ G + +
Sbjct: 442 SF-----------------------SNVGPNRWHNERVTLASGKS-------------VW 465
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ DP E+ +++ PDI QL L +T VP + P +++PK W P
Sbjct: 466 LFNITADPYERIDLSDRYPDIVKQLLRRLAQFNKTAVPVRYP-PRDPRSNPKLNGGVWGP 524
Query: 629 W 629
W
Sbjct: 525 W 525
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 34/105 (32%), Positives = 56/105 (53%), Gaps = 12/105 (11%)
Query: 1 MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T+++T+ +Q++ +KPLFLY+A+ A H+ L+AP
Sbjct: 193 LYENNNAAWDQDNGIYSTEMYTQRVLQILSSHNPNKPLFLYIAYQAVHS-----PLQAPG 247
Query: 60 ETINQFQYITDPNRRTYAALTK------STTLTLLIVYGWNDLSF 98
++ + I + NRR YAA+ + T L YG+ D S
Sbjct: 248 NYLDNYNSILNENRRRYAAMLSCLDDAVNNVTTALKKYGFYDNSI 292
>gi|332224806|ref|XP_003261559.1| PREDICTED: arylsulfatase B isoform 1 [Nomascus leucogenys]
Length = 535
Score = 289 bits (740), Expect = 3e-75, Method: Compositional matrix adjust.
Identities = 164/408 (40%), Positives = 236/408 (57%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 58 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL A G T+ +DG D W ++ +PS R + +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID- 398
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + ++P RNS D+ AA+R +WKL+ G
Sbjct: 399 -PNFVDSSPCPRNSTAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 445
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|351697185|gb|EHB00104.1| Arylsulfatase B, partial [Heterocephalus glaber]
Length = 503
Score = 288 bits (737), Expect = 6e-75, Method: Compositional matrix adjust.
Identities = 165/415 (39%), Positives = 235/415 (56%), Gaps = 36/415 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS + TP++DALA G+ L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 27 GWNDVGFHGS-RLRTPHLDALAAGGVQLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 85
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +++E P RGF+++FGYL G
Sbjct: 86 WPCQPSCVPLDEKLLPQLLQEAGYATHMVGKWHLGMYQKECLPTRRGFDTYFGYLLGSED 145
Query: 212 YYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
YY H I + +R D R A Y+T++FT A LI + P +K
Sbjct: 146 YYTHEHCVFIKALNVTRCAL----DFRDGEEVATQYKNLYSTNIFTNRATSLIANHPPEK 201
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 202 PLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHLYAGMVSLMDEAVGNVTAALK 256
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
G+ N++ IF +DNG T+ G+N+P RG K TLWEGGV+ + SP
Sbjct: 257 SHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGVGFVASPL 307
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
++Q + +++HISDWLPTL AGGDT +DG D W ++ +PS R + +
Sbjct: 308 LKQKGVKNRELIHISDWLPTLVKLAGGDTHGTK-PLDGFDVWRTISEGSPSPRMELLHNI 366
Query: 447 DQWSSLLLNTPSRRNSVLINIDEK---------KRTAAVRLDSWKLVLGTQENGT 492
D S + +P NS D+ AA+R +WKL+ G G+
Sbjct: 367 D--PSFVDLSPCPGNSTTPAKDDSFPEYSTFNTSVHAAIRHGNWKLLTGFPGCGS 419
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 181 YSTNIFTNRATSLIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 235
Query: 75 TYAALT 80
YA +
Sbjct: 236 LYAGMV 241
>gi|109077718|ref|XP_001108389.1| PREDICTED: arylsulfatase B isoform 6 [Macaca mulatta]
Length = 534
Score = 288 bits (736), Expect = 8e-75, Method: Compositional matrix adjust.
Identities = 163/408 (39%), Positives = 235/408 (57%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 57 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL AGG + +DG D W ++ +PS R + +D
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLAGGHINGTK-PLDGFDVWKTISEGSPSPRVELLHNID- 397
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + ++P NS+ D+ AA+R +WKL+ G
Sbjct: 398 -PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 444
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|214010121|ref|NP_001135731.1| arylsulfatase B precursor [Felis catus]
gi|461542|sp|P33727.1|ARSB_FELCA RecName: Full=Arylsulfatase B; Short=ASB; AltName:
Full=N-acetylgalactosamine-4-sulfatase; Short=G4S;
Flags: Precursor
gi|258856|gb|AAB23941.1| arylsulfatase B [Felis catus]
Length = 535
Score = 288 bits (736), Expect = 9e-75, Method: Compositional matrix adjust.
Identities = 164/414 (39%), Positives = 234/414 (56%), Gaps = 33/414 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGSN I TP++D LA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GWNDVSFHGSN-IRTPHLDELAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D S V D R A Y+T++FT+ A LI P +KPL
Sbjct: 177 YYSHERCALID--SLNVTRCALDFRDGEQVATGYKNMYSTNIFTERATALITSHPPEKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVSLMDEAVGNVTAALKSH 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGG++ + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL A G T +DG D W ++ +PS R + +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGSTKGTK-PLDGFDVWKTISEGSPSPRKELLHNID- 398
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
+ + +P S+ D+ AA+R +WKL+ G G
Sbjct: 399 -PNFVDISPCPGKSLAPAKDDSSHPAYLAFNTSLHAAIRHGNWKLLTGYPGCGC 451
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTERATALITSHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 YYAGMV 272
>gi|114326200|ref|NP_001041598.1| arylsulfatase B precursor [Canis lupus familiaris]
gi|81158050|tpe|CAI84999.1| TPA: arylsulfatase B [Canis lupus familiaris]
Length = 535
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 163/414 (39%), Positives = 236/414 (57%), Gaps = 33/414 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GWHDVGFHGS-RIRTPHLDALAAAGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FT+ A LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALISNHPPEKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NRR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRRYYAGMVSLMDEAVGNVTAALKSH 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ +F +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 290 GLWNNTVFVFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ S +++HISDWLPTL AGG T +DG D W ++ +PS R + +D
Sbjct: 341 RKGVKSRELVHISDWLPTLVGLAGGSTKGTK-PLDGFDVWRTISEGSPSPRMELLHNID- 398
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGT 492
+ + +P S+ D+ AA+R +WKL+ G G
Sbjct: 399 -PNFVDISPCPGQSLAPAKDDSSHPGYFSFNTSLHAAIRHGNWKLLTGYPGCGC 451
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 212 YSTNIFTERATALISNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIHDKNRR 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 YYAGMV 272
>gi|380795845|gb|AFE69798.1| arylsulfatase B isoform 1 precursor, partial [Macaca mulatta]
Length = 506
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 163/408 (39%), Positives = 235/408 (57%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 29 GWNDVGFHGSC-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 87
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 88 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 147
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A LI + P +KPL
Sbjct: 148 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 205
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 206 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 260
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 261 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 311
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL AGG + +DG D W ++ +PS R + +D
Sbjct: 312 QKGVKNRELIHISDWLPTLVKLAGGHINGTK-PLDGFDVWKTISEGSPSPRVELLHNID- 369
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + ++P NS+ D+ AA+R +WKL+ G
Sbjct: 370 -PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 416
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 183 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 237
Query: 75 TYAALT 80
YA +
Sbjct: 238 HYAGMV 243
>gi|156359506|ref|XP_001624809.1| predicted protein [Nematostella vectensis]
gi|156211610|gb|EDO32709.1| predicted protein [Nematostella vectensis]
Length = 488
Score = 287 bits (735), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 190/581 (32%), Positives = 281/581 (48%), Gaps = 102/581 (17%)
Query: 54 HLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALA 113
H+ A + Q +++ N A+ L+ GW DL +HGS I TPNI+ LA
Sbjct: 3 HINALSLVLLSAQILSEAN-----AIPPHIVFILVDDLGWFDLGYHGS-VIRTPNINQLA 56
Query: 114 YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLREL 173
+GIIL+N Y QP+CTP+R++LMTGKYPIH G Q I +P G+PL LPE L++
Sbjct: 57 GDGIILDNYYVQPLCTPTRSALMTGKYPIHLGTQHGVILPGQPMGLPLDSSTLPEQLKQQ 116
Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR 233
GY+T +GKWHLGF++ ++ P RGF+S +GY G ++ H + L D R
Sbjct: 117 GYATHIVGKWHLGFYKEDFVPTKRGFDSFYGYYCGAEDHFTHNV---------LGFLDFR 167
Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
N D G Y T FTK AV I PLFLYL H ++AP E I
Sbjct: 168 DNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPEYI 222
Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
+++ +I D RRT+AAMV +D+++G V SAL+ G+ EN++++F +DNG
Sbjct: 223 DKYSFIKDKTRRTHAAMVDIMDEAIGNVTSALKSAGLWENTLLVFSTDNGG--------- 273
Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
+ G NYP RG KNTLWEGGV+ + P ++ + +MH++DW PTL AGG
Sbjct: 274 IHTAGGYNYPLRGEKNTLWEGGVRGAGFVSGPMAPRHGMIYNGLMHVTDWYPTLVHLAGG 333
Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
++DG+D W +L +T S R +L NID K
Sbjct: 334 SM------------------------QDSLDGVDLWDALKQDTASPRKEILHNIDLKIDV 369
Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
+ +G ++R K+ L NA T+ +L + I
Sbjct: 370 PTAFV--------------FEGVALRSRDMKLLLKVPNA-----TWLVPPELREP---GI 407
Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
+ + + S + Q I L+N+ DP E+++++ PDI +++
Sbjct: 408 TANEGLGSAKMQEVIEVA-----------------LYNITADPTERHDLSGKFPDIVNEM 450
Query: 594 YELLKYHRRTLVPQSHEQPDLVQADPKRF-----NDTWSPW 629
+ + ++R LVP P + + DPK N WSPW
Sbjct: 451 KKRVDFYRGGLVP-----PIIKKNDPKAVLTAIKNGAWSPW 486
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/80 (37%), Positives = 38/80 (47%), Gaps = 5/80 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
R N D G Y T FTK AV I PLFLYL H ++AP E
Sbjct: 166 FRDNDLIVKDQKGTYGTRAFTKRAVDTIHRHNSSSPLFLYLPFQNVHG-----PVQAPPE 220
Query: 61 TINQFQYITDPNRRTYAALT 80
I+++ +I D RRT+AA+
Sbjct: 221 YIDKYSFIKDKTRRTHAAMV 240
>gi|187956367|gb|AAI50662.1| Arsb protein [Mus musculus]
Length = 431
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 154/365 (42%), Positives = 216/365 (59%), Gaps = 23/365 (6%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHISDWLPTL AGG T+ +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVDLAGGSTNGTK-PLDGFN 376
Query: 427 QWSSL 431
W ++
Sbjct: 377 MWKTI 381
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>gi|327263080|ref|XP_003216349.1| PREDICTED: arylsulfatase B-like [Anolis carolinensis]
Length = 521
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 190/550 (34%), Positives = 276/550 (50%), Gaps = 83/550 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ +HGS +I TP +DAL+ G+ L Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 44 GWQDVGWHGS-QIRTPVLDALSAAGVRLERYYIQPLCTPSRSQLLTGRYQIHTGLQHEII 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LPE L+E GY T +GKWHLG +R E P RGF+++FGYL G
Sbjct: 103 WPCQPSCVPLDEKLLPELLKEAGYVTHMVGKWHLGMYRNECLPTRRGFDTYFGYLLGSED 162
Query: 212 YYDHI-LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H S+ V D+R A Y+T++FT+ A LI + +KPLFL
Sbjct: 163 YYSHEHCVPIVSKNVTRCALDLRDGEKIADGFKNMYSTNVFTQRAQDLIANHQPEKPLFL 222
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA + H + L+ P++ + + +I D RR YA MV LD++VG + AL+ +G+
Sbjct: 223 YLALQSVH-----EPLQVPEKYVEPYSFIKDEKRRKYAGMVSILDEAVGNITEALKTRGL 277
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N+++IF +DNG T+ G+N+P RG K TLWEGGV+ + SP ++Q
Sbjct: 278 WNNTVLIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGAGFVASPLLKQR 328
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
S +++HISDWLPTL AGG T+ +DG D W +L PS R + +D
Sbjct: 329 GVESHELIHISDWLPTLVKLAGGSTNDTK-PLDGFDVWKTLSEGEPSPRKELLHNID--P 385
Query: 451 SLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYYGQT 500
+ +P + N + +D AA+R WKL+ G G +
Sbjct: 386 IFVDPSPCKGNRSCLAVDYAAALKYSIFNISIHAAIRQGKWKLLTGNP---------GCS 436
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
P L FN ESKT M+
Sbjct: 437 HWFPPPSL-FN---ESKT--------------------------------------MSSD 454
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
P T +LF++ +D E+N+++ PDI +L L+++ + VP + D + DP+
Sbjct: 455 PLTKK-LWLFDIVHDAEERNDLSDQYPDIVKKLLIRLQHYYKNSVPVFYPDDD-PRCDPE 512
Query: 621 RFNDTWSPWI 630
W PW+
Sbjct: 513 A-TGAWGPWM 521
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/66 (36%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + +KPLFLYLA + H + L+ P++ + + +I D RR
Sbjct: 198 YSTNVFTQRAQDLIANHQPEKPLFLYLALQSVH-----EPLQVPEKYVEPYSFIKDEKRR 252
Query: 75 TYAALT 80
YA +
Sbjct: 253 KYAGMV 258
>gi|149059062|gb|EDM10069.1| arylsulfatase B [Rattus norvegicus]
Length = 517
Score = 287 bits (734), Expect = 1e-74, Method: Compositional matrix adjust.
Identities = 194/573 (33%), Positives = 279/573 (48%), Gaps = 98/573 (17%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + AA L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 30 PARASDAAPPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 88
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L++ GY+T +GKWHLG +R+
Sbjct: 89 SRSQLLTGRYQIHMGLQHYLIMTCQPNCVPLDEKLLPQLLKDAGYATHMVGKWHLGMYRK 148
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 149 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCALDLRDGEEPAKEYTDIY 205
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A LI + P +K + L+ P+E + + +I D +RR
Sbjct: 206 STNIFTKRATTLIANHPPEKSVH--------------DPLQVPEEYMEPYDFIQDKHRRI 251
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV LD++VG V AL+ +G+ N+++IF +DNG T R+ G+N+P RG
Sbjct: 252 YAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---------RSGGNNWPLRG 302
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGL 425
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T PL DG
Sbjct: 303 RKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNLAGGSTHGTKPL--DGF 360
Query: 426 DQWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT---AAVR 477
D W ++ +PS R N + D D NT +N T A +R
Sbjct: 361 DVWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIR 420
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+WKL+ G G + T+ LQ L + ++D
Sbjct: 421 YKNWKLLTGYPGCGGL-----------------------LTWVRLQ-------LQVPSVD 450
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
T +LF++ DP E+++++ P I L L
Sbjct: 451 SPTKT------------------------LWLFDINRDPEERHDVSREHPHIVQNLLSRL 486
Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
+Y+ VP S+ P + DPK WSPW+
Sbjct: 487 QYYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 517
>gi|296483766|tpg|DAA25881.1| TPA: arylsulfatase B [Bos taurus]
Length = 429
Score = 287 bits (734), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 156/381 (40%), Positives = 227/381 (59%), Gaps = 23/381 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP +DALA G++L+N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 56 GWNDVGFHGS-AIRTPRLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 114
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 115 LPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 174
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FT+ A LI + P +KPL
Sbjct: 175 YYSHERCTLIDALN--VTRCALDFRDGEEVATGYKNMYSTNVFTERATTLITNHPPEKPL 232
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NRR YA M +D++VG V +AL+R+
Sbjct: 233 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDEAVGNVTAALERR 287
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 288 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 338
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ + +++HISDWLPTL AGG T+ +DG D W+++ +PS R + +D
Sbjct: 339 RKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSPSPRMELLHNID- 396
Query: 449 WSSLLLNTPSRRNSVLINIDE 469
+ + P NS+ + DE
Sbjct: 397 -PNFVDTAPCPGNSMALAKDE 416
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 210 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 YYAGMA 270
>gi|348535399|ref|XP_003455188.1| PREDICTED: arylsulfatase B [Oreochromis niloticus]
Length = 519
Score = 286 bits (733), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 166/428 (38%), Positives = 239/428 (55%), Gaps = 36/428 (8%)
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
+T AA L +GW D+ +HGS EI TPN+D L+ G+ L N Y QP+CTPSR
Sbjct: 23 QTLAARQPHVVFILADDFGWYDIGYHGS-EIRTPNLDKLSAGGVRLENYYVQPLCTPSRN 81
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
LMTG+Y IHTGMQ IW +P VPL E+ LP+ ++E GY+T +GKWHLG ++++
Sbjct: 82 QLMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLMKEAGYATHMVGKWHLGMYKKDCL 141
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
P RGF+++ GYL G Y+ H Q S + D+R A G Y+T+L +
Sbjct: 142 PTRRGFDTYLGYLTGSEDYFTHFRCYQSPSLNLSRCALDLRDGEEVATGYKGVYSTELLS 201
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+ A+ +IE KPLF+Y+A A HA L+ P+ + + +I D NRR YA MV
Sbjct: 202 QRAISIIERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTNRRLYAGMVS 256
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
+D++VG + ALQ++G+ N++++F +DNG T+ + GSN+P RG K +LW
Sbjct: 257 AMDEAVGNITMALQKEGLWNNTVLVFSTDNGGQTL---------SGGSNWPLRGRKWSLW 307
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGGV+ + SP ++Q V+ +++HISDWLPTL AGG T+ +DG D W+++
Sbjct: 308 EGGVRGVGFVASPLLKQPGTVNRELIHISDWLPTLVGLAGGSTTGTK-PLDGFDVWNAIS 366
Query: 433 LNTPSRRNS---NIDGL-------DQWSSLLLNTPSR----RNSVLINIDEKKRTAAVRL 478
S R NID L Q L SR +S ++I AA+R
Sbjct: 367 KGFASPRLELLHNIDPLYYDTAPCKQHKLTLAQVVSRDMWANSSFNVSIH-----AAIRS 421
Query: 479 DSWKLVLG 486
WKL+ G
Sbjct: 422 SKWKLLTG 429
Score = 53.5 bits (127), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y+T+L ++ A+ +IE KPLF+Y+A A HA L+ P+ + + +I D N
Sbjct: 193 GVYSTELLSQRAISIIERHISQKPLFMYVALQAVHA-----PLQVPERYVTPYSFIKDTN 247
Query: 73 RRTYAALTKS 82
RR YA + +
Sbjct: 248 RRLYAGMVSA 257
>gi|344265150|ref|XP_003404649.1| PREDICTED: arylsulfatase I-like [Loxodonta africana]
Length = 573
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 187/542 (34%), Positives = 271/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 119 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLY 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 289
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NSIIIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 290 NNSIIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 340
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 341 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 396
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 397 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 443
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 444 LAAFPGS----------------WWNLERMASARQA---------------------VWL 466
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L Y+ RT +P + E P +A P W
Sbjct: 467 FNISADPYEREDLAGQRPDVVRALLARLAYYNRTAIPVRYPAENP---RAHPDFNGGAWG 523
Query: 628 PW 629
PW
Sbjct: 524 PW 525
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 200 SVAWGLSGQYSTLLYAQRASHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 254
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 255 RTMGNVARRKYAAMV 269
>gi|395825538|ref|XP_003785985.1| PREDICTED: arylsulfatase B [Otolemur garnettii]
Length = 532
Score = 286 bits (732), Expect = 2e-74, Method: Compositional matrix adjust.
Identities = 159/406 (39%), Positives = 234/406 (57%), Gaps = 29/406 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS+ I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I G+Q I
Sbjct: 55 GWNDVGFHGSS-IRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRMGLQHQII 113
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 114 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 173
Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H + V D R A Y+T++FT+ A LI + P +KPLFL
Sbjct: 174 YYSHERCTLINALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALITNHPPEKPLFL 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA + H + L+ P+E + + +I D NR YA MV +D+++G V +AL+ G+
Sbjct: 234 YLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAIGNVTAALKSSGL 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++Q
Sbjct: 289 WNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQK 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ +++HISDWLPTL AGG T+ +DG D W ++ +PS R + +D
Sbjct: 340 GVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWETISEGSPSPRVELLHNID--P 396
Query: 451 SLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + ++P NS+ D+ AA+R +WKL+ G
Sbjct: 397 NFVDSSPCLGNSITPAKDDSSLPGYSAFNTSVHAAIRHGNWKLLTG 442
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 209 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 263
Query: 75 TYAALT 80
YA +
Sbjct: 264 HYAGMV 269
>gi|26350439|dbj|BAC38859.1| unnamed protein product [Mus musculus]
Length = 431
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 153/365 (41%), Positives = 216/365 (59%), Gaps = 23/365 (6%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 36 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 94
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 95 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 154
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 155 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 211
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 212 STNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRRI 266
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 267 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 317
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T+ +DG +
Sbjct: 318 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNGTK-PLDGFN 376
Query: 427 QWSSL 431
W ++
Sbjct: 377 MWKTI 381
Score = 55.5 bits (132), Expect = 9e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 211 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 IYAGMV 271
>gi|157831133|pdb|1FSU|A Chain A, 4-Sulfatase (Human)
Length = 492
Score = 286 bits (731), Expect = 3e-74, Method: Compositional matrix adjust.
Identities = 163/408 (39%), Positives = 236/408 (57%), Gaps = 33/408 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+ TPSR+ L+TG+Y I TG+Q I
Sbjct: 15 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLXTPSRSQLLTGRYQIRTGLQHQII 73
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 74 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 133
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A+ LI + P +KPL
Sbjct: 134 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 191
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 192 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 246
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 247 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 297
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL A G T+ +DG D W ++ +PS R + +D
Sbjct: 298 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID- 355
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
+ + ++P RNS+ D+ AA+R +WKL+ G
Sbjct: 356 -PNFVDSSPCPRNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 402
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 169 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 223
Query: 75 TYAALT 80
YA +
Sbjct: 224 HYAGMV 229
>gi|326919013|ref|XP_003205778.1| PREDICTED: arylsulfatase J-like [Meleagris gallopavo]
Length = 573
Score = 285 bits (730), Expect = 4e-74, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 272/540 (50%), Gaps = 72/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 61 GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+RRE P RGF++ FG L G
Sbjct: 120 RPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSGD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T++ Q++ KP+FL
Sbjct: 180 YYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 235
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP + ++ I + NRR YAAM+ LD+++ V AL++ G
Sbjct: 236 YIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNVTLALKKYGY 290
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSIII+ SDNG + GSN+P RG K T WEGG++ + SP ++
Sbjct: 291 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 341
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V +++HI+DW PTL T A G + +DG D W ++ + RR+ +D L
Sbjct: 342 GSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEGRRSPRVDILHNID 397
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R++ WKL+ G GY S+ VP F
Sbjct: 398 PIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQAF 446
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
SN+ R ++ + G +LF
Sbjct: 447 -----------------------SNMGPNRWHNERVSWSAGKT-------------VWLF 470
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
N+ DP E+ ++++ PD+ QL L +T VP + D +++PK W PW
Sbjct: 471 NITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPKLNGGVWGPWF 529
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 196 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 250
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 251 KYFEHYRSINNINRRRYAAM 270
>gi|363733898|ref|XP_420639.3| PREDICTED: arylsulfatase J [Gallus gallus]
Length = 573
Score = 285 bits (730), Expect = 5e-74, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 272/540 (50%), Gaps = 72/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 61 GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPMCTPSRSQFITGKYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+RRE P RGF++ FG L G
Sbjct: 120 RPTQPNCLPLDNITLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGFDTFFGSLLGSGD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T++ Q++ KP+FL
Sbjct: 180 YYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 235
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP + ++ I + NRR YAAM+ LD+++ V AL++ G
Sbjct: 236 YIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEAINNVTLALKKYGY 290
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSIII+ SDNG + GSN+P RG K T WEGG++ + SP ++
Sbjct: 291 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 341
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V +++HI+DW PTL T A G + +DG D W ++ + RR+ +D L
Sbjct: 342 GSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEGRRSPRVDILHNID 397
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R++ WKL+ G GY S+ VP F
Sbjct: 398 PIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQAF 446
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
SN+ R ++ + G +LF
Sbjct: 447 -----------------------SNVGPNRWHNERVSWSAGKT-------------VWLF 470
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
N+ DP E+ ++++ PD+ QL L +T VP + D +++PK W PW
Sbjct: 471 NITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPKLNGGVWGPWF 529
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 196 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 250
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 251 KYFEHYRSINNINRRRYAAM 270
>gi|348557289|ref|XP_003464452.1| PREDICTED: arylsulfatase B-like [Cavia porcellus]
Length = 520
Score = 285 bits (728), Expect = 7e-74, Method: Compositional matrix adjust.
Identities = 161/424 (37%), Positives = 236/424 (55%), Gaps = 37/424 (8%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L L GWND+ FHGS + TP++DALA G+ L+N Y QP+CTPSR+ L+TG+Y IH
Sbjct: 35 VLVLADDLGWNDVGFHGS-RLRTPHLDALAAGGVRLDNYYTQPLCTPSRSQLLTGRYQIH 93
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++F
Sbjct: 94 TGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYF 153
Query: 204 GYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
GYL G Y+ H I + +R D R A + Y+ ++F A+ L
Sbjct: 154 GYLLGSEDYFSHEHCVFIKALNVTRCAL----DFRDGEEVATEYKNMYSANIFANRAISL 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + P +KPLFLYLA + H + L+ P+E + + +I D NRR YA MV +D++V
Sbjct: 210 IANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRRLYAGMVSLMDEAV 264
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ G+ N++ IF +DNG T+ G+N+P RG K TLWEGG++
Sbjct: 265 GNVTAALESHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRG 315
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++Q + +++HISDWLPTL AGG T +DG + W ++ +PS
Sbjct: 316 VGFVASPLLKQKGVRTRELIHISDWLPTLVKLAGGSTQGTK-ALDGFNVWETISEGSPSP 374
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDE----------KKRTAAVRLDSWKLVLGTQ 488
R + +D + + +P N D+ AA+R +WKL+ G
Sbjct: 375 RVELLHNIDPY--FVDPSPCPNNGTAPAKDDYSLLEYSTFNTSVHAAIRHRNWKLLTGFP 432
Query: 489 ENGT 492
G+
Sbjct: 433 GCGS 436
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+ ++F A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 197 YSANIFANRAISLIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIRDKNRR 251
Query: 75 TYAALT 80
YA +
Sbjct: 252 LYAGMV 257
>gi|1089794|dbj|BAA08412.1| Arylsulfatase B [Rattus norvegicus]
Length = 473
Score = 284 bits (727), Expect = 9e-74, Method: Compositional matrix adjust.
Identities = 189/545 (34%), Positives = 274/545 (50%), Gaps = 87/545 (15%)
Query: 99 HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRG 158
HGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y IH G+Q I +P
Sbjct: 3 HGS-VIRTPHLDALAAGGVVLDNYYVQPLCTPSRSQLLTGRYQIHMGLQHYLIMTCQPNC 61
Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS 218
VPL E+ LP+ L++ G ST +GKWHLG +R+E P RGF+++FGYL G YY H
Sbjct: 62 VPLDEKLLPQLLKDAGSSTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH--- 118
Query: 219 DQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAH 274
+ + LNG D+R A + Y+T++FTK A LI + P +KPLFLYLA
Sbjct: 119 EACAPIECLNGTRCALDLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAF 178
Query: 275 LAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENS 334
+ H L+ P+E + + +I D +RR YA MV LD++VG V AL+ +G+ N+
Sbjct: 179 QSVH-----DPLQVPEEYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNT 233
Query: 335 IIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS 394
++IF +DNG T R+ G+N+P RG K TLWEGG++ + SP ++Q S
Sbjct: 234 VLIFSTDNGGQT---------RSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKS 284
Query: 395 LQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRR-----NSNIDGLDQ 448
++MHI+DWLPTL AGG T PL DG D W ++ +PS R N + D D
Sbjct: 285 RELMHITDWLPTLVNLAGGSTHGTKPL--DGFDVWETISEGSPSPRVELLLNIDPDFFDG 342
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT---AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
NT +N T A +R +WKL+ G G Q+ ++V
Sbjct: 343 LPCPGKNTTPEKNDSFPLEHSAFNTSIHAGIRYKNWKLLTGYPGCGYWFPPPSQSNISEV 402
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
P S+ ++ ++L
Sbjct: 403 P--------------SVDSPTKTLWL---------------------------------- 414
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
F++ DP E+++++ P I L L+Y+ VP S+ P + DPK
Sbjct: 415 ----FDINRDPEERHDVSREHPHIVQNLLSRLQYYHEHSVP-SYFPPLDPRCDPKG-TGV 468
Query: 626 WSPWI 630
WSPW+
Sbjct: 469 WSPWM 473
Score = 56.6 bits (135), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 150 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 204
Query: 75 TYAALT 80
YA +
Sbjct: 205 IYAGMV 210
>gi|114145559|ref|NP_001041346.1| arylsulfatase I precursor [Rattus norvegicus]
gi|123779983|sp|Q32KJ8.1|ARSI_RAT RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158022|tpe|CAI84985.1| TPA: arylsulfatase I [Rattus norvegicus]
gi|149064375|gb|EDM14578.1| similar to RIKEN cDNA 9330196J05 [Rattus norvegicus]
Length = 573
Score = 284 bits (727), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 185/540 (34%), Positives = 272/540 (50%), Gaps = 76/540 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ KPLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++A RPD+ L L + RT +P + + +A P W PW
Sbjct: 465 FNISADPYEREDLADQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ KPLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQKPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|403285505|ref|XP_003934063.1| PREDICTED: arylsulfatase I [Saimiri boliviensis boliviensis]
Length = 572
Score = 284 bits (726), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 275/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ + +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P+ W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAAYNRTAIPVRYPAENP---RAHPEFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ + +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|426350600|ref|XP_004042858.1| PREDICTED: arylsulfatase I [Gorilla gorilla gorilla]
Length = 569
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDMWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|109077724|ref|XP_001108177.1| PREDICTED: arylsulfatase B isoform 2 [Macaca mulatta]
Length = 414
Score = 283 bits (725), Expect = 1e-73, Method: Compositional matrix adjust.
Identities = 157/380 (41%), Positives = 220/380 (57%), Gaps = 42/380 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 57 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 289 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 339
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
Q + +++HISDWLPTL AGG +I+G +DG D
Sbjct: 340 QKGVKNRELIHISDWLPTLVKLAGG-------HING---------------TKPLDGFDV 377
Query: 449 WSSLLLNTPSRRNSVLINID 468
W ++ +PS R +L NID
Sbjct: 378 WKTISEGSPSPRVELLHNID 397
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|397517762|ref|XP_003829075.1| PREDICTED: arylsulfatase I [Pan paniscus]
gi|410214522|gb|JAA04480.1| arylsulfatase family, member I [Pan troglodytes]
gi|410261150|gb|JAA18541.1| arylsulfatase family, member I [Pan troglodytes]
gi|410300016|gb|JAA28608.1| arylsulfatase family, member I [Pan troglodytes]
gi|410336277|gb|JAA37085.1| arylsulfatase family, member I [Pan troglodytes]
Length = 569
Score = 283 bits (725), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|426384277|ref|XP_004058697.1| PREDICTED: arylsulfatase B-like [Gorilla gorilla gorilla]
Length = 408
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 216/359 (60%), Gaps = 21/359 (5%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 58 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPREKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + +++HISDWLPTL A G T+ +DG D W ++ +PS R + +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID 398
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPREKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|158255166|dbj|BAF83554.1| unnamed protein product [Homo sapiens]
Length = 413
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDMWKTI 380
Query: 432 LLNTPSRRNSNIDGLD 447
+PS R + +D
Sbjct: 381 SEGSPSPRIELLHNID 396
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
>gi|410260410|gb|JAA18171.1| arylsulfatase B [Pan troglodytes]
gi|410341767|gb|JAA39830.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381
Query: 432 LLNTPSRRNSNIDGLD 447
+PS R + +D
Sbjct: 382 SEGSPSPRIELLHNID 397
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|114599506|ref|XP_001140908.1| PREDICTED: arylsulfatase B isoform 2 [Pan troglodytes]
Length = 415
Score = 283 bits (724), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 41 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 99
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 100 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 159
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 160 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 217
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 218 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 272
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 273 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 323
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 324 WEGGVRGVGFVASPLLKQKGMKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 382
Query: 432 LLNTPSRRNSNIDGLD 447
+PS R + +D
Sbjct: 383 SEGSPSPRIELLHNID 398
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|59797060|ref|NP_001012301.1| arylsulfatase I precursor [Homo sapiens]
gi|74722581|sp|Q5FYB1.1|ARSI_HUMAN RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|58201084|gb|AAW66665.1| arylsulfatase I [Homo sapiens]
gi|120538357|gb|AAI29997.1| Arylsulfatase family, member I [Homo sapiens]
gi|120538621|gb|AAI29996.1| Arylsulfatase family, member I [Homo sapiens]
gi|220983388|dbj|BAH11166.1| arylsulfatase I [Homo sapiens]
Length = 569
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|355750329|gb|EHH54667.1| hypothetical protein EGM_15550 [Macaca fascicularis]
Length = 569
Score = 283 bits (723), Expect = 2e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P+ W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPEFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|296193239|ref|XP_002744413.1| PREDICTED: arylsulfatase I [Callithrix jacchus]
Length = 569
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 274/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ + +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ + +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILANHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|441598315|ref|XP_004087449.1| PREDICTED: arylsulfatase B isoform 2 [Nomascus leucogenys]
Length = 415
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 152/359 (42%), Positives = 216/359 (60%), Gaps = 21/359 (5%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 58 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 117 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 176
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A+ LI + P +KPL
Sbjct: 177 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPL 234
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 235 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 289
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++
Sbjct: 290 GLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLK 340
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
Q + +++HISDWLPTL A G T+ +DG D W ++ +PS R + +D
Sbjct: 341 QKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID 398
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 212 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 266
Query: 75 TYAALT 80
YA +
Sbjct: 267 HYAGMV 272
>gi|410226854|gb|JAA10646.1| arylsulfatase B [Pan troglodytes]
gi|410292330|gb|JAA24765.1| arylsulfatase B [Pan troglodytes]
Length = 414
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 40 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 98
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 99 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 158
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 159 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 216
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 217 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 271
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 272 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 322
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 323 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 381
Query: 432 LLNTPSRRNSNIDGLD 447
+PS R + +D
Sbjct: 382 SEGSPSPRIELLHNID 397
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|348514291|ref|XP_003444674.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 570
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 276/542 (50%), Gaps = 77/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+ +TG+Y IHTG+Q I
Sbjct: 59 GFNDIGYH-SSDIRTPVLDKLAADGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P + LP+ L+ELGYST +GKWHLGF+++E P RGF+++FG L G ++
Sbjct: 118 RPCQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D L G D+ S AW G+Y+T L+T+ +++ P +PLF+
Sbjct: 178 YYTYDGCD----GAGLCGFDLHEGESVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFI 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H L+ P E I + + + RR YAAMV +D++V + AL++ G
Sbjct: 234 FLSFQAVHT-----PLQYPDEYIYPYLGLENVARRKYAAMVSAVDEAVRNITYALRKYGY 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NS+IIF +DNG + + GSN+P RG K T WEGG++ + SP +++
Sbjct: 289 YQNSVIIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGIRGLGFVHSPLLRKK 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGGD S L +DG + W ++ S R + +D
Sbjct: 340 RRVSKALVHITDWYPTLVGLAGGDES-LTEGVDGYNVWEAISEGKESPRLEILHNIDP-- 396
Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
L +R S+ I A++R WKL+ G D YG ++ L
Sbjct: 397 ---LYNHARTGSLQKGYGIWNTAIQASIRAGDWKLLTG-------DPGYGDWIPPQI-LP 445
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
F P S + R TR + T+ +
Sbjct: 446 GF---------------------PSSWWKQERHTRPRKTV-------------------W 465
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ DP E+ +++ RPD+ QL L Y+ RT VP + D +ADP W P
Sbjct: 466 LFNISGDPYERYDLSEQRPDVVKQLLARLVYYNRTAVPVRYPSED-PRADPYLNGGAWVP 524
Query: 629 WI 630
W+
Sbjct: 525 WV 526
Score = 43.1 bits (100), Expect = 0.45, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 41/79 (51%), Gaps = 6/79 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AW G+Y+T L+T+ +++ P +PLF++L+ A H L+ P E I
Sbjct: 199 SVAWGQSGKYSTHLYTQRVRKILATHDPQSQPLFIFLSFQAVHT-----PLQYPDEYIYP 253
Query: 65 FQYITDPNRRTYAALTKST 83
+ + + RR YAA+ +
Sbjct: 254 YLGLENVARRKYAAMVSAV 272
>gi|38569407|ref|NP_942002.1| arylsulfatase B isoform 2 precursor [Homo sapiens]
gi|20809799|gb|AAH29051.1| Arylsulfatase B [Homo sapiens]
gi|119616228|gb|EAW95822.1| arylsulfatase B, isoform CRA_b [Homo sapiens]
Length = 413
Score = 283 bits (723), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 155/376 (41%), Positives = 222/376 (59%), Gaps = 23/376 (6%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A ++ L L+ GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+
Sbjct: 39 AGASRPPHLVFLLADDLGWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQ 97
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P
Sbjct: 98 LLTGRYQIRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLP 157
Query: 195 LYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
RGF+++FGYL G YY H L D + V D R A Y+T++F
Sbjct: 158 TRRGFDTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
TK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 216 TKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMV 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D++VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +L
Sbjct: 271 SLMDEAVGNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSL 321
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGGV+ + SP ++Q + +++HISDWLPTL A G T+ +DG D W ++
Sbjct: 322 WEGGVRGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGHTNGTK-PLDGFDVWKTI 380
Query: 432 LLNTPSRRNSNIDGLD 447
+PS R + +D
Sbjct: 381 SEGSPSPRIELLHNID 396
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 210 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 264
Query: 75 TYAALT 80
YA +
Sbjct: 265 HYAGMV 270
>gi|432885639|ref|XP_004074694.1| PREDICTED: arylsulfatase B-like [Oryzias latipes]
Length = 520
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 183/562 (32%), Positives = 281/562 (50%), Gaps = 72/562 (12%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
T AA + +GWND+ +H S EI TPN+D L+ G+ L N Y QP+C+PSR
Sbjct: 23 TLAAPRPHIVFIMADDFGWNDVGYHNS-EIKTPNLDLLSAKGVRLQNYYVQPLCSPSRNQ 81
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
LMTG+Y IHTGMQ IW +P VPL E+ LP+ L+E GY+T +GKWHLG ++++ P
Sbjct: 82 LMTGRYQIHTGMQHQIIWPCQPYCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYKKDCLP 141
Query: 195 LYRGFESHFGYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
+RGF+S+FGY G YY H I + +R D+R A G Y+T+
Sbjct: 142 THRGFDSYFGYYLGSEDYYTHTRCYPITALNLTRCAL----DLRDGEEVATAYKGAYSTE 197
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
LF++ AV +I +PLFLY+A A H + L+ P+ + + +I D +RR YA
Sbjct: 198 LFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVSRRKYAG 252
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV +D++VG + AL++ G+ +N++++F +DNG T R GSN+P RG K
Sbjct: 253 MVSAMDEAVGNISLALRQAGLWDNTVLVFSTDNGGQT---------RAGGSNWPLRGRKW 303
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
LWEGG++ + SP ++ V+ ++MHISDWLPTL AGG TS
Sbjct: 304 ALWEGGIRGVGFVSSPLLENPGTVNNELMHISDWLPTLVRLAGGSTS------------- 350
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
+DG + W+++ S R +L NID + G Q+
Sbjct: 351 ---------GTKPLDGFNMWNTISKGFASPRLELLHNIDTLYEDTSP-------CPGRQQ 394
Query: 490 NGTM-DGYYGQTRSNKVPLLNFNAIVESKT-YQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
T+ G+ G+T + +FN + + Y + + L+ P +T + ++
Sbjct: 395 QLTLAQGFRGETWARS----SFNVSIHAAIRYLNWKLLTGYPGCPTWFPRPEHNTSESSS 450
Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
P+ P LF++ DP E+N +++ P + L L+ H + +P
Sbjct: 451 TSVD----PLKP-------VMLFDIQKDPEERNEVSAQFPAVVDYLLTRLQSHLKNSLPI 499
Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
+ D + DP + W PW
Sbjct: 500 NFPDDD-PRCDPGP-SGAWGPW 519
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y+T+LF++ AV +I +PLFLY+A A H + L+ P+ + + +I D +
Sbjct: 192 GAYSTELFSQRAVSVIAKHNASQPLFLYVAMQAVH-----EPLQVPERYVTPYSFIKDVS 246
Query: 73 RRTYAALTKS 82
RR YA + +
Sbjct: 247 RRKYAGMVSA 256
>gi|332822312|ref|XP_527073.3| PREDICTED: arylsulfatase I isoform 2 [Pan troglodytes]
Length = 569
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 272/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPQQPNCLPLDQVTLPHKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|260788446|ref|XP_002589261.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
gi|229274436|gb|EEN45272.1| hypothetical protein BRAFLDRAFT_213093 [Branchiostoma floridae]
Length = 470
Score = 282 bits (722), Expect = 3e-73, Method: Compositional matrix adjust.
Identities = 178/540 (32%), Positives = 266/540 (49%), Gaps = 80/540 (14%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGW+D+ +H ++ I TPN+D LA G+ L N Y QPVC+PSR LMTG+Y IH G+Q
Sbjct: 11 YGWDDIGYH-NHFIHTPNLDRLASEGVKLENYYVQPVCSPSREQLMTGRYQIHYGLQHGV 69
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I P G+PL E LP+ L++ GY T +GKWHLGF ++EYTPLYRGF+ +G+L G
Sbjct: 70 IRNDRPHGLPLDEVTLPQRLKDNGYRTYMVGKWHLGFCKKEYTPLYRGFDKFYGFLTGSE 129
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y+ H R + G D+R D G Y+T LF ++A +I ++P+FL
Sbjct: 130 DYWTH------RRYKGVRGLDLRDQDEPVLDENGTYSTHLFARKATDMILKHDQNQPMFL 183
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YL A H L+ P++ + ++ +I R YAA V +D++VG V +AL+ G+
Sbjct: 184 YLPFQAVHGP-----LQVPEKYLQEYMHINFTVDRIYAAQVTAMDEAVGNVTNALKNSGL 238
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
EN+++IF +DNGA R+ GSN+P RG KNTLWEGGV+ + S +++
Sbjct: 239 WENTVLIFSTDNGA----------RRHTGSNWPLRGWKNTLWEGGVRGVGFVNSNLLKRK 288
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ S ++HISDW PTL + G T+ +DG D W
Sbjct: 289 GKTSDALIHISDWFPTLLRISDGSTTG----------------------TKPLDGFDVWD 326
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
++ S R +L NID +Y + PL ++
Sbjct: 327 AISKGKTSPRKELLHNIDP--------------------------FYDDQTGSYSPL-SY 359
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
+ I + +Y +L+ + R R+ +P P N +LF
Sbjct: 360 DNIFNTSSYSALRSGDWKLLTGYQGAGWNR--RRPYNHEFEESPDP------PNKRLWLF 411
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
N+ DP E+N+++ PDI L E L + +T VP PD + +P+ D + PW+
Sbjct: 412 NVREDPGERNDLSKMHPDIVQDLLEKLAVYNKTAVPIFWPDPD-PRMNPQLHGDVFGPWL 470
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 36/69 (52%), Gaps = 5/69 (7%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF ++A +I ++P+FLYL A H L+ P++ + ++ +I
Sbjct: 154 DENGTYSTHLFARKATDMILKHDQNQPMFLYLPFQAVHGP-----LQVPEKYLQEYMHIN 208
Query: 70 DPNRRTYAA 78
R YAA
Sbjct: 209 FTVDRIYAA 217
>gi|109079349|ref|XP_001108178.1| PREDICTED: arylsulfatase I-like [Macaca mulatta]
gi|402873074|ref|XP_003900411.1| PREDICTED: arylsulfatase I [Papio anubis]
gi|355691752|gb|EHH26937.1| hypothetical protein EGK_17023 [Macaca mulatta]
Length = 569
Score = 282 bits (721), Expect = 4e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 273/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLATFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLAEYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|395817250|ref|XP_003782086.1| PREDICTED: arylsulfatase I [Otolemur garnettii]
Length = 572
Score = 282 bits (721), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 441
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 442 LAAFPGS----------------WWNLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|326677480|ref|XP_003200848.1| PREDICTED: arylsulfatase B-like, partial [Danio rerio]
Length = 358
Score = 281 bits (720), Expect = 5e-73, Method: Compositional matrix adjust.
Identities = 150/341 (43%), Positives = 205/341 (60%), Gaps = 17/341 (4%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TP++D LA G+ L+N Y QP+CTPSR LMTG+Y I TG+Q I
Sbjct: 34 GWNDVGFHGS-EIKTPHLDRLAAQGVRLDNYYVQPLCTPSRNQLMTGRYQIRTGLQHQII 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ LRE GY T +GKWHLG F+++ P +RGF+S FGYL G
Sbjct: 93 WPCQPYCVPLDEKLLPQVLRERGYHTHMVGKWHLGMFQKDCLPTHRGFQSFFGYLTGSED 152
Query: 212 YYDHI-LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H S V D+R + A + G Y+T+L T+ A +I D+PLFL
Sbjct: 153 YYTHKRCSPIAPLNVTRCALDLRDGDAVALNYSGRYSTELLTERATHIITQHTPDQPLFL 212
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A A HA L+ P I + +I DP+RR YA MV +D++VG + LQ G+
Sbjct: 213 YVALQAVHAP-----LQVPDHYIAPYSFIQDPHRRRYAGMVSAMDEAVGNITHTLQETGL 267
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+N+++IF +DNG T+ G+N+P RG K TLWEGGV+ A + P I++
Sbjct: 268 WDNTVLIFSTDNGGQTLYG---------GNNWPLRGRKWTLWEGGVRGVAFVSGPLIEKP 318
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
V+ +++HISDWLPT+ AG T+ +DG D W S+
Sbjct: 319 GAVNRELIHISDWLPTIAGLAGASTNGTK-PLDGFDVWQSI 358
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 44/82 (53%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R + A + G Y+T+L T+ A +I D+PLFLY+A A HA L+ P
Sbjct: 174 LRDGDAVALNYSGRYSTELLTERATHIITQHTPDQPLFLYVALQAVHAP-----LQVPDH 228
Query: 61 TINQFQYITDPNRRTYAALTKS 82
I + +I DP+RR YA + +
Sbjct: 229 YIAPYSFIQDPHRRRYAGMVSA 250
>gi|327274122|ref|XP_003221827.1| PREDICTED: arylsulfatase J-like [Anolis carolinensis]
Length = 564
Score = 281 bits (720), Expect = 6e-73, Method: Compositional matrix adjust.
Identities = 182/541 (33%), Positives = 272/541 (50%), Gaps = 74/541 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TG+Y IHTG+Q I
Sbjct: 54 GFRDVGYHGS-EIRTPTLDRLAAEGVKLENYYVQPMCTPSRSQFITGRYQIHTGLQHSVI 112
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 113 RPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMPTQRGFDTFFGSLLGSGD 172
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T++ Q++ KP+FL
Sbjct: 173 YYTHYKCD----SPRMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFL 228
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP +++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 229 YIAYQAVHS-----PLQAPGMYYERYRSINNINRRRYAAMLACLDEAINNVTLALKRYGY 283
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NS+II+ SDNG PT G+N+P RG K T WEGG++ + SP ++
Sbjct: 284 YNNSVIIYSSDNGGQPTAG----------GNNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 333
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL T A G L +DG D W ++ + SRR+ +D L
Sbjct: 334 KGCVCKELIHITDWFPTLITLAEGQIDEDTL-LDGYDVWETI---SESRRSPRMDILHNI 389
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R++ WKL+ G GY S+ VP +
Sbjct: 390 DPIYTKAKNGSWAAGFGIWNTAIQSAIRVNHWKLLTGNP------GY-----SDWVPPQS 438
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ + G +L
Sbjct: 439 F-----------------------SNVGPSRWHNERVSWTAGKT-------------VWL 462
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S PD+ QL L +T VP + D +++P W PW
Sbjct: 463 FNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPNFNGGVWGPW 521
Query: 630 I 630
Sbjct: 522 F 522
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 189 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 243
Query: 60 ETINQFQYITDPNRRTYAAL 79
+++ I + NRR YAA+
Sbjct: 244 MYYERYRSINNINRRRYAAM 263
>gi|126291233|ref|XP_001378869.1| PREDICTED: arylsulfatase I [Monodelphis domestica]
Length = 584
Score = 281 bits (719), Expect = 7e-73, Method: Compositional matrix adjust.
Identities = 182/543 (33%), Positives = 280/543 (51%), Gaps = 79/543 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 65 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 123
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E+GYST +GKWHLGF+++ P RGF++ G L G +
Sbjct: 124 RPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNVD 183
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ + AW+ G+Y+T L+ + A Q++ +PLFLY
Sbjct: 184 YYTYDNCD----GPGVCGYDLHEGENVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLY 239
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 240 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGYY 294
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+IIF +DNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 295 DNSVIIFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLLKRKR 345
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL T AGG S +DG + W ++ S R + +D
Sbjct: 346 RTSRALVHITDWYPTLVTLAGGSVSEAE-GLDGYNVWPAISEGQVSPRMEILHNIDP--- 401
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A++R+ WKL+ G + G D QT L N
Sbjct: 402 --LYNHAKHGSLEGGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LAN 451
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F ++ +L++L+ TRQ +L
Sbjct: 452 FPG-----SWWNLERLADG-------------TRQSV---------------------WL 472
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A+ RPD+ L L Y+ RT +P + E P +A P W
Sbjct: 473 FNISADPYEREDLAAQRPDVVRSLLARLVYYNRTAIPVRYPAENP---RAHPDFNGGAWG 529
Query: 628 PWI 630
PW+
Sbjct: 530 PWV 532
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW+ G+Y+T L+ + A Q++ +PLFLY+A A H L++P+E + ++
Sbjct: 205 NVAWEQSGQYSTLLYAQRASQILASHSPHQPLFLYVAFQAVHT-----PLQSPREYLYRY 259
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 260 RTMGNVARRKYAAMV 274
>gi|84370328|ref|NP_001033588.1| arylsulfatase I precursor [Mus musculus]
gi|123779975|sp|Q32KI9.1|ARSI_MOUSE RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
gi|81158040|tpe|CAI84994.1| TPA: arylsulfatase I [Mus musculus]
gi|148677850|gb|EDL09797.1| mCG6034 [Mus musculus]
gi|187954139|gb|AAI38971.1| Arylsulfatase i [Mus musculus]
gi|187954429|gb|AAI41170.1| Arylsulfatase i [Mus musculus]
Length = 573
Score = 281 bits (718), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 184/540 (34%), Positives = 271/540 (50%), Gaps = 76/540 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKKKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG TS +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTTSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASIRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++A RPD+ L L + RT +P + + +A P W PW
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWGPW 523
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ PLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHNPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|61876881|ref|XP_593725.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|297477411|ref|XP_002689338.1| PREDICTED: arylsulfatase I [Bos taurus]
gi|296485188|tpg|DAA27303.1| TPA: arylsulfatase I-like [Bos taurus]
Length = 574
Score = 281 bits (718), Expect = 9e-73, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+ELGYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 180 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 236 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 291 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 342 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 398 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 444
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 445 LAAFPGS----------------WWNLERMASARQA---------------------VWL 467
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 468 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 524
Query: 628 PW 629
PW
Sbjct: 525 PW 526
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 201 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 255
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 256 RTMGNVARRKYAAMV 270
>gi|114326196|ref|NP_001041583.1| arylsulfatase I precursor [Canis lupus familiaris]
gi|81158064|tpe|CAI85006.1| TPA: arylsulfatase I [Canis lupus familiaris]
Length = 575
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 180 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + SAL+R G
Sbjct: 236 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 291 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 342 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 398 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 444
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 445 LAAFPGS----------------WWNLERMASARQA---------------------VWL 467
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 468 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 524
Query: 628 PW 629
PW
Sbjct: 525 PW 526
Score = 45.8 bits (107), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 201 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 255
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 256 RTMGNVARRKYAAMV 270
>gi|311250496|ref|XP_003124150.1| PREDICTED: arylsulfatase I-like [Sus scrofa]
Length = 573
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 183/542 (33%), Positives = 271/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L++LGY+T +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 119 RPRQPNCLPLDQVTLPQRLQQLGYATHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + +++ +PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLY 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRGMGNVARRKYAAMVTCMDEAVRNITGALKRYGFY 289
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 290 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRTR 340
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 341 RTSRALLHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 396
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 397 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 443
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 444 LAAFPGSWW----------------NLERMASARQA---------------------VWL 466
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 467 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 523
Query: 628 PW 629
PW
Sbjct: 524 PW 525
Score = 47.0 bits (110), Expect = 0.035, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + +++ +PLFLY+A A H L++P+E + ++
Sbjct: 200 SVAWGLSGQYSTLLYAQRVSRILAGHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 254
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 255 RGMGNVARRKYAAMV 269
>gi|351713085|gb|EHB16004.1| Arylsulfatase I [Heterocephalus glaber]
Length = 573
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 185/542 (34%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMLGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ +++ +RR YAAMV +D++V + AL+R G
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMSNVSRRKYAAMVTCMDEAVRNITWALKRYGFY 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NSIIIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 NNSIIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRKR 339
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL A G S +DG D W ++ S R + +D
Sbjct: 340 RTSHALVHITDWYPTLVGLARGTVSAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 395
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 396 --LYNHARHGSLEGGFGIWNTAIQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 443 LAAFPGSWW----------------NLERMASARQA---------------------VWL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 522
Query: 628 PW 629
PW
Sbjct: 523 PW 524
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 253
Query: 66 QYITDPNRRTYAALT 80
+ +++ +RR YAA+
Sbjct: 254 RTMSNVSRRKYAAMV 268
>gi|218563492|sp|Q32KH7.2|ARSI_CANFA RecName: Full=Arylsulfatase I; Short=ASI; Flags: Precursor
Length = 573
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 178 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + SAL+R G
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITSALKRYGFY 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522
Query: 628 PW 629
PW
Sbjct: 523 PW 524
Score = 45.8 bits (107), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 199 NVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 254 RTMGNVARRKYAAMV 268
>gi|355669614|gb|AER94587.1| arylsulfatase family, member J [Mustela putorius furo]
Length = 600
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 188/580 (32%), Positives = 280/580 (48%), Gaps = 74/580 (12%)
Query: 52 GKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDA 111
G+ LE +E + Q P T A L G+ D+ +HGS EI TP +D
Sbjct: 47 GQGLEEEEEATFRAQSGERPEPSTAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDR 105
Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I +P +PL LP+ L+
Sbjct: 106 LAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLK 165
Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
E+GYST +GKWHLGF+R+E P RGF++ FG L G YY H D + + G+D
Sbjct: 166 EVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYD 221
Query: 232 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 276
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEY 349
++ I + NRR YAAM+ LD+++ V AL+ G NSII++ SDNG PT
Sbjct: 277 RYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIVYSSDNGGQPTAG- 335
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
GSN+P RG K T WEGG++ + SP ++ V +++HI+DW PTL +
Sbjct: 336 ---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLIS 386
Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
A G + +DG D W ++ + R+ +D L + + + I
Sbjct: 387 LAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWN 442
Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
+A+R+ WKL+ G GY S+ VP +F
Sbjct: 443 TAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF------------------- 472
Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
SN+ R ++ T+ G + +LFN+ DP E+ +++ P I
Sbjct: 473 ----SNLGPNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSHRYPGI 515
Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
QL L +T VP + D +++P+ W PW
Sbjct: 516 VKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|440901665|gb|ELR52564.1| Arylsulfatase I, partial [Bos grunniens mutus]
Length = 565
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 270/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 52 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 110
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+ELGYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 111 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 170
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 171 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 226
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 227 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 281
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 282 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 332
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 333 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 388
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 389 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 435
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 436 LAAFPGS----------------WWNLERMASARQA---------------------VWL 458
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 459 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 515
Query: 628 PW 629
PW
Sbjct: 516 PW 517
Score = 44.7 bits (104), Expect = 0.14, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 192 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 246
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 247 RTMGNVARRKYAAMV 261
>gi|291401248|ref|XP_002717219.1| PREDICTED: arylsulfatase J [Oryctolagus cuniculus]
Length = 601
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 186/557 (33%), Positives = 275/557 (49%), Gaps = 76/557 (13%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
AA T L ++ G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+
Sbjct: 72 AAATSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQ 130
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
+TGKY IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+E P
Sbjct: 131 FITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMP 190
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTK 253
RGF++ FG L G YY H D + + G+D+ N S AWD G Y+T ++T+
Sbjct: 191 TKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDSAAWDHDNGIYSTQMYTQ 246
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
Q++ KP+FLY+A+ A H+ L+AP ++ I + NRR YAAM+
Sbjct: 247 RVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSC 301
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLW 372
LD++V V AL+ G NSI+I+ SDNG PT GSN+P RG K T W
Sbjct: 302 LDEAVSNVTLALKTYGFYNNSILIYSSDNGGQPTAG----------GSNWPLRGSKGTYW 351
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG++ + SP ++ V +++HI+DW PTL + A G + +DG D W ++
Sbjct: 352 EGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI- 409
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
+ R+ +D L + + + I +A+R+ WKL+ G
Sbjct: 410 --SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN----- 462
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
GY S+ VP +F SN+ R ++ T+ G
Sbjct: 463 -PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGK 493
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
+ +LFN+ DP E+ +++S P I QL L +T VP +
Sbjct: 494 S-------------IWLFNITADPYERVDLSSRYPGIVKQLLRRLSQFNKTAVPVRYPPK 540
Query: 613 DLVQADPKRFNDTWSPW 629
D +++P+ W PW
Sbjct: 541 D-PRSNPRLNGGVWGPW 556
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N S AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 224 LYENDSAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 278
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 279 RYFEHYRSIININRRRYAAM 298
>gi|443690889|gb|ELT92899.1| hypothetical protein CAPTEDRAFT_165852 [Capitella teleta]
Length = 484
Score = 281 bits (718), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 174/538 (32%), Positives = 267/538 (49%), Gaps = 96/538 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ + EI TPN+D LA NG+ILN YA C+PSR +L+TG+YP G+Q +
Sbjct: 39 GWNDVGWN-NPEIKTPNLDRLASNGVILNASYALSTCSPSRTALLTGRYPFKLGLQHGVV 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL LP+ L+ LGYST AIGKWHLGF R EYTP +RGF+S +G+ +G
Sbjct: 98 KKGKPYGLPLNITLLPQKLKHLGYSTHAIGKWHLGFCRWEYTPTFRGFDSFYGFYSGSED 157
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY +T + G+D R N + +Y+T + + AV++I+ +PLFLY
Sbjct: 158 YYKR-------KTAAIRGYDFRMNTKVFKPKIKKYSTLDYGRRAVKIIQAHKRTEPLFLY 210
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ A H+ L+ P+ +++ I D NRR ++ MV +D ++G I AL+++ ML
Sbjct: 211 MPFQAVHS-----PLQVPKSFEFKYRNIVDRNRRIFSGMVSAMDSAIGMTIKALKQRNML 265
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQN 390
N++I+F+SDNG G+N+P RG K TLWEGG +VP+ +W +Q+
Sbjct: 266 TNTMIVFLSDNGGTPFFG---------GNNWPLRGSKATLWEGGTRVPSFVWGKMLLQKA 316
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
S +M+H DW PTL + AGG + ++DG+DQW L S+R
Sbjct: 317 GYTSNEMIHAVDWFPTLVSLAGGTPEK---DVDGVDQWQML-----SQRQ---------- 358
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
PSRR + I+ AA+R +KL+ GT G G+Y
Sbjct: 359 ------PSRRKEFVYGINSITGRAAIRRGKYKLIEGTA--GHYSGWYPA----------- 399
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
P S + KM+ +Q H + L+
Sbjct: 400 ---------------------PSSRVVKMKRMKQLNKRHVNS--------------TLLY 424
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
N+ DP E N++++ P ++ + L ++ V + + + + PK + WSP
Sbjct: 425 NIQVDPIESNDLSAKYPHLTQFMLRRLNRYKAQEVEAFNPRKERY-SQPKFWGGNWSP 481
>gi|149698442|ref|XP_001503367.1| PREDICTED: arylsulfatase J [Equus caballus]
Length = 598
Score = 280 bits (717), Expect = 1e-72, Method: Compositional matrix adjust.
Identities = 187/580 (32%), Positives = 282/580 (48%), Gaps = 74/580 (12%)
Query: 52 GKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDA 111
G+ LE +E + Q P T + L G+ D+ +HGS EI TP +D
Sbjct: 47 GQGLEDEEEAAFRAQAGERPEPSTASTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDK 105
Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I +P +PL LP+ L+
Sbjct: 106 LAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLK 165
Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD 231
E+GYST +GKWHLGF+R+E P RGF++ FG L G YY H D + + G+D
Sbjct: 166 EVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYD 221
Query: 232 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPS 276
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEY 349
++ I + NRR YAAM+ LD+++ V AL+ G NSIII+ SDNG PT
Sbjct: 277 RYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG- 335
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
GSN+P RG K T WEGG++ + SP ++ V +++HI+DW PTL +
Sbjct: 336 ---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLIS 386
Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE 469
A G + +DG D W ++ + R+ +D L + + + I
Sbjct: 387 LAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSLAAGYGIWN 442
Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
+A+R+ WKL+ G GY S+ VP +F
Sbjct: 443 TAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF------------------- 472
Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
SN+ R ++ T+ G + +LFN+ DP E+ ++++ P I
Sbjct: 473 ----SNLGPNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSNRYPGI 515
Query: 590 SSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+L L + +T VP + D +++P+ W PW
Sbjct: 516 VKKLLRRLSQYNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPS 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIVNINRRRYAAM 296
>gi|403275516|ref|XP_003929486.1| PREDICTED: arylsulfatase J [Saimiri boliviensis boliviensis]
Length = 601
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 182/540 (33%), Positives = 270/540 (50%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 89 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 147
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 148 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 207
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD+ G Y+T ++T+ Q++ KPLFL
Sbjct: 208 YYTHYKCD----SPGMCGYDLYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFL 263
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 264 YLAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 318
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 319 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 368
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + IDG D W ++ + R+ +D L
Sbjct: 369 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQIDGYDIWETI---SEGLRSPRVDILHNI 424
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 425 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 473
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 474 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 497
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 498 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 556
Score = 52.0 bits (123), Expect = 9e-04, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD+ G Y+T ++T+ Q++ KPLFLYLA+ A H+ L+AP
Sbjct: 224 LYENDNAAWDSDNGIYSTQMYTQRVQQILASHNPTKPLFLYLAYQAVHS-----PLQAPG 278
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 279 RYFEHYRSIININRRRYAAM 298
>gi|301765544|ref|XP_002918191.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Ailuropoda
melanoleuca]
Length = 573
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + ++ +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 339
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 340 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 395
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 396 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 442
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 443 LAAFPGS----------------WWNLERMASARQA---------------------VWL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 522
Query: 628 PW 629
PW
Sbjct: 523 PW 524
Score = 47.4 bits (111), Expect = 0.027, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 253
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 254 RAMGNVARRKYAAMV 268
>gi|291387626|ref|XP_002710353.1| PREDICTED: arylsulfatase I-like [Oryctolagus cuniculus]
Length = 571
Score = 280 bits (716), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 272/542 (50%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 177 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG S +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGSASAAD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 440 ----------QTLASFPGSWW----NLERMASVRQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 SVAWGLSGQYSTMLYAQRASHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RTMGNVARRKYAAMV 267
>gi|410949653|ref|XP_003981535.1| PREDICTED: arylsulfatase I, partial [Felis catus]
Length = 570
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 56 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 114
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 115 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 174
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + ++ +PLFLY
Sbjct: 175 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLY 230
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 231 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 285
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 286 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 336
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 337 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 392
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 393 --LYNHARHGSLEAGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 439
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 440 LAAFPGS----------------WWNLERMASARQA---------------------VWL 462
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 463 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 519
Query: 628 PW 629
PW
Sbjct: 520 PW 521
Score = 47.0 bits (110), Expect = 0.028, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 196 SVAWGLSGQYSTMLYAQRVSHILASHSPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 250
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 251 RTMGNVARRKYAAMV 265
>gi|125820285|ref|XP_692237.2| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 568
Score = 280 bits (715), Expect = 2e-72, Method: Compositional matrix adjust.
Identities = 178/557 (31%), Positives = 265/557 (47%), Gaps = 107/557 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H S EI +P +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 55 GFNDIGYH-SGEIRSPTLDKLASEGVRLENYYVQPLCTPSRSQLITGRYQIHTGLQHSII 113
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF ++FG L G +
Sbjct: 114 RPRQPNCLPLDVVTLPQRLQEIGYSTHMVGKWHLGFYRKDCLPTRRGFHTYFGSLTGSVD 173
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D S L G D+ S AW G+Y+T L+T+ +++ P +PLF+
Sbjct: 174 YYTYGSCDGKS----LCGFDLHEGESVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFI 229
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H L+ P+E I ++ I + RR YAAMV +D++V + AL++ G
Sbjct: 230 FLSLQAVHTP-----LKPPKEYIYPYRNIGNVPRRKYAAMVSIVDEAVRNITYALRKYGF 284
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NS++IF +DNGA + GSN+P RG K T WEGG++ + SP I+
Sbjct: 285 YRNSVVIFSTDNGAQPL---------TGGSNWPLRGCKGTYWEGGIRGVGFVHSPLIRHR 335
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R+S ++HI+DW PTL AGG+ S ++ +DG D W
Sbjct: 336 RRISRDLIHITDWYPTLVGLAGGNVS----------------------QHQGLDGFDVWP 373
Query: 451 SLLLNTPSRRNSVLINIDEKKR-----------------TAAVRLDSWKLVLGTQENGTM 493
++ S R +L NID R AA+R+ WKL+ G NG
Sbjct: 374 TISEGKESPRLEILHNIDPLNRRSRGSLKDGYGLWDTTVQAAIRVGDWKLLTGDPGNGD- 432
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
++P + S+ G
Sbjct: 433 ------------------------------------WVPPHVLTHFPSSWWHLERDIGEK 456
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
+ T PC E++++A RPD+ +L L +H RT +P + PD
Sbjct: 457 RKSVWLFNVTGDPC----------ERHDLAVHRPDVVRELLARLAFHNRTAIPVRYP-PD 505
Query: 614 LVQADPKRFNDTWSPWI 630
+A+P W PW+
Sbjct: 506 DTRANPSANGGAWRPWV 522
Score = 45.8 bits (107), Expect = 0.069, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 42/76 (55%), Gaps = 6/76 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AW G+Y+T L+T+ +++ P +PLF++L+ A H L+ P+E I
Sbjct: 195 SVAWGRAGKYSTHLYTQRVRKILATHDPTSQPLFIFLSLQAVHTP-----LKPPKEYIYP 249
Query: 65 FQYITDPNRRTYAALT 80
++ I + RR YAA+
Sbjct: 250 YRNIGNVPRRKYAAMV 265
>gi|291226838|ref|XP_002733395.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 498
Score = 279 bits (714), Expect = 3e-72, Method: Compositional matrix adjust.
Identities = 172/538 (31%), Positives = 270/538 (50%), Gaps = 87/538 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ + I TP +D LA G+ILN Y P+CTP RASLM+G Y G+Q +
Sbjct: 44 GWNDIGYNNPS-IFTPTLDKLAREGVILNQSYVLPMCTPDRASLMSGYYAYRVGLQHKVL 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP G+PL +P+ ++E GY+T +GKWHLGF + EYTP YRGF+ +G+ N
Sbjct: 103 DHAEPAGLPLNFTLIPQRMKEHGYTTYMLGKWHLGFCKWEYTPTYRGFDHFYGFYNAAED 162
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H S D+R W G Y+T ++ ++A + I P+++Y
Sbjct: 163 YFNHTTSKYL---------DLRNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L + H +EAPQ+ ++ + +I D NRR +AMV +DD+V VI+AL+R +
Sbjct: 214 LPFQSVHG-----VIEAPQKYLDMYTFIHDTNRRIKSAMVTAMDDAVKVVIAALERYELW 268
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N++++F +DNG P +N+P RG K TLWEGG + + +++
Sbjct: 269 DNTLLVFTTDNGGPANPNNAG-------NNWPLRGSKLTLWEGGTRGVGFVHGKMLEKKG 321
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
V+ +MMH++DW PTL AGG + +S++DGL+ W +
Sbjct: 322 YVNNEMMHVTDWYPTLLHIAGG------------------------KADSDMDGLNVWDT 357
Query: 452 LLLNTPSRRNSVLINIDEKKRT-AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
L PS R + NIDE + T AA+R+ +KL++G + G DG+ +P
Sbjct: 358 LNKGYPSPRKEFIYNIDEIEPTGAAIRVGDYKLIVG--KAGHPDGW--------IPAPTI 407
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
+ + E LP++ + + R + YLF
Sbjct: 408 DGVWEE-------------ILPMNEYEGLLDGRLSK-----------------DNDTYLF 437
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
N+ DP E+NN+A P+ ++L L+ ++ LVP PD +DP++F WSP
Sbjct: 438 NIKADPTERNNLAEQMPEKVAELSARLEELKQKLVPAIRPAPDPKGSDPRKFGGAWSP 495
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 22/90 (24%), Positives = 44/90 (48%), Gaps = 5/90 (5%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R W G Y+T ++ ++A + I P+++YL + H +EAPQ+
Sbjct: 174 LRNGKEVDWSKNGTYSTYMYAEKATEYIATHNKSTPMYMYLPFQSVHG-----VIEAPQK 228
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIV 90
++ + +I D NRR +A+ + + +V
Sbjct: 229 YLDMYTFIHDTNRRIKSAMVTAMDDAVKVV 258
>gi|198428954|ref|XP_002125106.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 562
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 174/546 (31%), Positives = 271/546 (49%), Gaps = 75/546 (13%)
Query: 91 YGWNDLSFHGS---NEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
+G+ND+ +H +++ TP +D+LA G+IL N Y QP+C+P+R L+TG+Y IHTG+
Sbjct: 44 FGFNDIGYHAREHYSDMYTPFLDSLAAKGVILENYYVQPICSPTRGQLLTGRYQIHTGLA 103
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I A+P G+PL L + LR+ GY T +GKWHLGFFR EY P RGF++ FG+LN
Sbjct: 104 HGIIRAAQPYGLPLDNILLSQQLRQCGYKTNMVGKWHLGFFREEYLPWNRGFQNFFGFLN 163
Query: 208 GVISYYDHILSDQYSRTVELNGHDM---RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
G ++++ + +T G+DM R + A T GEY+T+LF +++ ++I+
Sbjct: 164 GGVNHFTRYHCEP-KKTRRFCGYDMIDSRYGPTNA--TYGEYSTNLFIRKSKEMIDKHNK 220
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
KP+FLYL+ A H L+ P + + +F++I D NRR YA MV +D + ++
Sbjct: 221 QKPMFLYLSLQAVHGP-----LQVPNQYLKRFKHIRDKNRRIYAGMVYAMDRGIRQLVKH 275
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L+R M +N+I IF +DNG T G+N+P RG K TLWEGG++ +
Sbjct: 276 LKRARMWKNTIFIFSTDNGGQTTR---------GGNNWPLRGKKGTLWEGGIRGVGFVHG 326
Query: 385 PQIQ-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
+Q PRV+ +++H+SDW PT+ +A IDG DQW +L N S+R +
Sbjct: 327 KPLQVTTPRVNKELLHVSDWYPTIMSATHCPYVVGTPPIDGYDQWETLRSNKTSKRTEIL 386
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
+D L + R D R AA+R WKL + GY G
Sbjct: 387 HNIDPLHKKLTESEFRE-----GFDVSVR-AAIRSGDWKL---------LTGYAGMNEWV 431
Query: 504 KVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT 563
P I ++ Q+ + + ++ +R
Sbjct: 432 VPPECTHGNI-------TIGQIRDDFHVYPNSTKNVR----------------------- 461
Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
LF + DP E+ ++ P++ + L LK + + +P + D ++DPKR
Sbjct: 462 -----LFKITEDPHEEREVSDQYPEVVNLLLARLKEYNTSSLPVDYPHND-KKSDPKRHG 515
Query: 624 DTWSPW 629
W PW
Sbjct: 516 GFWLPW 521
Score = 57.8 bits (138), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 26/70 (37%), Positives = 43/70 (61%), Gaps = 5/70 (7%)
Query: 11 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
T GEY+T+LF +++ ++I+ KP+FLYL+ A H L+ P + + +F++I D
Sbjct: 198 TYGEYSTNLFIRKSKEMIDKHNKQKPMFLYLSLQAVHGP-----LQVPNQYLKRFKHIRD 252
Query: 71 PNRRTYAALT 80
NRR YA +
Sbjct: 253 KNRRIYAGMV 262
>gi|126331176|ref|XP_001365999.1| PREDICTED: arylsulfatase J [Monodelphis domestica]
Length = 607
Score = 279 bits (713), Expect = 4e-72, Method: Compositional matrix adjust.
Identities = 178/539 (33%), Positives = 268/539 (49%), Gaps = 72/539 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 95 GFRDVGYHGS-EIKTPTLDKLAAQGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 153
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 154 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 213
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T++ Q++ KP+FL
Sbjct: 214 YYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFL 269
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP + ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 270 YIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKRYGF 324
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSIII+ SDNG + GSN+P RG K T WEGG++ + SP ++
Sbjct: 325 YDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 375
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ +++HI+DW PTL T A G +DG D W ++ + RR+ +D L
Sbjct: 376 GTICKELVHITDWYPTLITLAEGWIDE-DTQLDGYDIWETI---SEGRRSPRVDILHNID 431
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R+ WKL+ G G D Q+ SN P
Sbjct: 432 PIYTKAKNGSWAAGFGIWNTAIQSAIRVQHWKLLTGNP--GYSDWVPPQSFSNAGPNRWH 489
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
N + T +S+ +LF
Sbjct: 490 NERISWSTGKSV---------------------------------------------WLF 504
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
N+ DP E+ +++S P I +L + L +T VP + D +++P+ W PW
Sbjct: 505 NITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 562
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 230 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 284
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 285 KYFEHYRSIININRRRYAAM 304
>gi|148668607|gb|EDL00926.1| arylsulfatase B [Mus musculus]
Length = 556
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 188/572 (32%), Positives = 281/572 (49%), Gaps = 94/572 (16%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P R + A L GWNDL FHGS I TP++DALA G++L+N Y QP+CTP
Sbjct: 67 PARASGATQPPHVVFVLADDLGWNDLGFHGS-VIRTPHLDALAAGGVVLDNYYVQPLCTP 125
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ L+TG+Y IH G+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG +R+
Sbjct: 126 SRSQLLTGRYQIHLGLQHYLIMTCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMYRK 185
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEY 246
E P RGF+++FGYL G YY H + + LNG D+R A + Y
Sbjct: 186 ECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCALDLRDGEEPAKEYNNIY 242
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T++FTK A +I + P +K + L+ P+E + + +I D +RR
Sbjct: 243 STNIFTKRATTVIANHPPEK--------------SVHDPLQVPEEYMEPYGFIQDKHRRI 288
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D++VG V AL+ G+ N++ IF +DNG T R+ G+N+P RG
Sbjct: 289 YAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT---------RSGGNNWPLRG 339
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K TLWEGG++ + SP ++Q S ++MHI+DWLPTL AGG T+
Sbjct: 340 RKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHITDWLPTLVDLAGGSTNG--------- 390
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+DG + W ++ PS R +L NID+ D + +
Sbjct: 391 -------------TKPLDGFNMWKTISEGHPSPRVELLHNIDQ---------DFFDGLPC 428
Query: 487 TQENGTMDGYYGQTRSNKVPLLN--FNAIVESKT-YQSLQQLSQN-----IFLPISNIDK 538
+N T + + PL + FN + + Y++ + L+ + F P S
Sbjct: 429 PGKNMT------PAKDDSFPLEHSAFNTSIHAGIRYKNWKLLTGHPGCGYWFPPPSQ--- 479
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
+N + + P +LF++ DP E+++++ P I L L+
Sbjct: 480 -------------SNVSEIPPVDPPTKTLWLFDINQDPEERHDVSREHPHIVQNLLSRLQ 526
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
Y+ VP SH P + DPK WSPW+
Sbjct: 527 YYHEHSVP-SHFPPLDPRCDPKS-TGVWSPWM 556
>gi|6863176|gb|AAF30402.1|AF109924_1 sulfatase 1 precursor [Helix pomatia]
Length = 503
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 152/403 (37%), Positives = 236/403 (58%), Gaps = 29/403 (7%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+G++D+ +HGS EI TP +DAL+ +G+ L N Y QP+CTP+R+ LM+G+Y IHTG+Q
Sbjct: 44 FGFHDVGYHGS-EIHTPTLDALSASGVRLENYYVQPICTPTRSQLMSGRYQIHTGLQHGI 102
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I +P +P L + L+E GY+T +GKWHLGF+++EY P RGF+++FGYLN
Sbjct: 103 INSCQPNALPNDSPTLADKLKESGYATHMVGKWHLGFYKQEYLPWNRGFDTYFGYLNAAE 162
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y++H + + R + D+R N + G+Y+ LFT +A+ +++ KPLFL
Sbjct: 163 DYFNHNVPWRQVRYL-----DLRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFL 217
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA+ + HA LE P++ ++++ ITD NRRT+A MV LD+ V + AL+ KG+
Sbjct: 218 YLAYQSVHA-----PLEVPEKYEHKYRNITDKNRRTFAGMVSALDEGVANLTQALKDKGL 272
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N+++IF +DNG G+NYP RG K +LWEGG + ++++
Sbjct: 273 WNNTVLIFSTDNGG---------QIHAGGNNYPLRGWKASLWEGGFHGVGFVSGGALKRS 323
Query: 391 PRVSLQMMHISDWLPTLYTAAGGD-TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
VS ++H+SDW PTL T AGG+ PL DG +QW ++ TPS R + +D
Sbjct: 324 GAVSKGLIHVSDWFPTLVTLAGGNLNGTKPL--DGFNQWDTISNETPSPREILLHNID-- 379
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
+ P + + N + + AA+R+ +KL+ G NG+
Sbjct: 380 ----ILYPQKGVPLYSNTWDTRVRAAIRVGDYKLITGDPGNGS 418
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 29/82 (35%), Positives = 49/82 (59%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N + G+Y+ LFT +A+ +++ KPLFLYLA+ + HA LE P++
Sbjct: 179 LRDNNGPVRNETGQYSAHLFTGKAIDVVQSHNTSKPLFLYLAYQSVHA-----PLEVPEK 233
Query: 61 TINQFQYITDPNRRTYAALTKS 82
++++ ITD NRRT+A + +
Sbjct: 234 YEHKYRNITDKNRRTFAGMVSA 255
Score = 44.3 bits (103), Expect = 0.18, Method: Compositional matrix adjust.
Identities = 22/63 (34%), Positives = 33/63 (52%), Gaps = 1/63 (1%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LFN+ DP E N+++S +P +L ++L T VP + PD + DP D W
Sbjct: 441 VWLFNITADPNEHNDLSSEKPLEVLRLLQILVQFNNTAVPPRYPAPD-PRCDPALHGDVW 499
Query: 627 SPW 629
PW
Sbjct: 500 GPW 502
>gi|115947271|ref|XP_790151.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 500
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 179/557 (32%), Positives = 282/557 (50%), Gaps = 89/557 (15%)
Query: 80 TKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
TK + +LI G+ND+ +HGS EI TPNID LA G+ L N Y QP+CTP+R+ L++G
Sbjct: 25 TKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVRLENYYVQPICTPTRSQLLSG 83
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+Y IHTG+Q I A+P +P + LRE GY+T A+GKWHLGF+++E P RG
Sbjct: 84 RYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLGFYKKECLPTQRG 143
Query: 199 FESHFGYLNGVISYYDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
F+S+FGYL G Y+ H + + G D+ N AW+ VG Y+ +FT++
Sbjct: 144 FDSYFGYLTGGEDYWTHHRAGDGLLPNSNHWLGMDLWDNEKVAWEYVGNYSAFVFTEKIQ 203
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
Q++ PV++P LYL + H+ L+ P +++ I + NRR YA M+ LD+
Sbjct: 204 QIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERYKNIKNTNRRIYAGMMTCLDE 258
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG ++ +LQ+ G+ +N+++IF +DNG G+N+P RG K ++WEGG+
Sbjct: 259 AVGKIVHSLQQAGVWDNTVLIFSTDNGGEVAAG---------GNNWPLRGWKRSIWEGGM 309
Query: 377 K----VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
+ V + L +Q V+ Q++H+SDW PTL G +S + +DG + W ++
Sbjct: 310 RGVGFVNSRLLPASVQGT--VNKQLIHVSDWFPTLVQGVAG-SSLDNITLDGFNMWKTIS 366
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
NT S R + +D W N P N+ + +AA+R+ WKL+ G + G+
Sbjct: 367 SNTSSPRQELLHNMDPWP----NQPKEFNANI--------SAAIRVGHWKLLTG--DPGS 412
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
+G Q I +P S I
Sbjct: 413 NNG-------------------------------QWIPVPDSGI---------------- 425
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
P+ P T +LFN+ DP E +++++S P++ +L LK + VP +
Sbjct: 426 --IPIHPLKITGKDTWLFNITADPNEYHDLSTSHPEVVEELMAKLKKYYADSVPVQYPS- 482
Query: 613 DLVQADPKRFNDTWSPW 629
++ +A+P W PW
Sbjct: 483 EVEEANPAYHQGVWGPW 499
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 24/76 (31%), Positives = 41/76 (53%), Gaps = 5/76 (6%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
N AW+ VG Y+ +FT++ Q++ PV++P LYL + H+ L+ P
Sbjct: 182 NEKVAWEYVGNYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEE 236
Query: 64 QFQYITDPNRRTYAAL 79
+++ I + NRR YA +
Sbjct: 237 RYKNIKNTNRRIYAGM 252
>gi|432098813|gb|ELK28308.1| Arylsulfatase I [Myotis davidii]
Length = 571
Score = 278 bits (712), Expect = 5e-72, Method: Compositional matrix adjust.
Identities = 183/542 (33%), Positives = 269/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 58 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 117 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 176
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 177 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 232
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 233 VAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 287
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 288 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 338
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG S + +DG D W ++ S R + +D
Sbjct: 339 RTSRALVHITDWYPTLVGLAGGTVS-VADGLDGYDVWPAISEGRASPRTEILHNIDP--- 394
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 395 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 441
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 442 LAAFPGSWW----------------NLERMASARQA---------------------VWL 464
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 465 FNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP---RAHPDFNGGAWG 521
Query: 628 PW 629
PW
Sbjct: 522 PW 523
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 198 NVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 252
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 253 RSMGNVARRKYAAMV 267
>gi|114145538|ref|NP_001041352.1| arylsulfatase J [Rattus norvegicus]
gi|81158024|tpe|CAI84986.1| TPA: arylsulfatase J [Rattus norvegicus]
gi|149025900|gb|EDL82143.1| arylsulfatase J [Rattus norvegicus]
Length = 597
Score = 278 bits (710), Expect = 7e-72, Method: Compositional matrix adjust.
Identities = 184/556 (33%), Positives = 275/556 (49%), Gaps = 76/556 (13%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
A+T L ++ G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+
Sbjct: 69 AVTSQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQF 127
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TGKY IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R++ P
Sbjct: 128 ITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPT 187
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKE 254
RGF++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T+
Sbjct: 188 KRGFDTFFGSLLGSGDYYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQR 243
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
Q++ KPLFLY+A+ A H+ L+AP ++ I + NRR YAAM+ L
Sbjct: 244 VQQILASHDPTKPLFLYVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCL 298
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
D+++ V AL+R G NSIII+ SDNG PT GSN+P RG K T WE
Sbjct: 299 DEAIHNVTLALKRYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWE 348
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG++ + SP ++ V +++HI+DW PTL + A G + +DG D W ++
Sbjct: 349 GGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI-- 405
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
+ R+ +D L + + + I +A+R+ WKL+ G
Sbjct: 406 -SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------ 458
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
GY S+ VP F SN+ R ++ T+ G +
Sbjct: 459 PGY-----SDWVPPQAF-----------------------SNLGPNRWHNERITLSTGKS 490
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
+LFN+ DP E+ +++S P I +L L +T VP + D
Sbjct: 491 -------------IWLFNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD 537
Query: 614 LVQADPKRFNDTWSPW 629
+++P+ W PW
Sbjct: 538 -PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|119893510|ref|XP_611819.3| PREDICTED: arylsulfatase J [Bos taurus]
gi|297475606|ref|XP_002688145.1| PREDICTED: arylsulfatase J [Bos taurus]
gi|296486797|tpg|DAA28910.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase-like [Bos taurus]
Length = 599
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 88 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 146
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 147 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 206
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 207 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFL 262
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 263 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGF 317
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 318 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAIGFVHSPLLKN 367
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 368 KGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SEGLRSPRVDILHNI 423
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 424 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVKHWKLLTGN------PGY-----SDWVPPQS 472
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 473 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 496
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I QL L +T VP + D +++P+ W PW
Sbjct: 497 FNITADPYERVDLSNRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 555
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 223 LYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 277
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 278 RYFEHYRSIVNINRRRYAAM 297
>gi|449514410|ref|XP_002188440.2| PREDICTED: arylsulfatase B, partial [Taeniopygia guttata]
Length = 491
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 163/420 (38%), Positives = 235/420 (55%), Gaps = 39/420 (9%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L L GW D+ +HGS I TP +DAL G+ L Y QP+CTPSR+ L++G+Y IH
Sbjct: 4 VLVLADDLGWGDVGWHGS-AIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIH 62
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q IW +P +PL E+ LPE L+E GY T +GKWHLG +R+E P +RGF+++F
Sbjct: 63 TGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYRKECLPTHRGFDTYF 122
Query: 204 GYLNGVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
GYL G YY H D+ ++ V D R A Y+T+LFT+ A+ +I
Sbjct: 123 GYLLGSEDYYTH---DRCVFIKAKNVTRCALDFRDGEEVATGFKNVYSTNLFTERAIDVI 179
Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
+ +KPLFLYLA + H + LE P++ + + I D RR YA MV +D++VG
Sbjct: 180 ANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRRHYAGMVTLMDEAVG 234
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
+ AL+ G+ +N++ +F +DNG T+ G+N+P RG K TLWEGGV+
Sbjct: 235 NLTDALKTYGLWDNTVFVFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGV 285
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
+ SP +++ S +++HISDWLPTL AGG T+ +DG D W+++ PS R
Sbjct: 286 GFVASPLLKRKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWNTISEGRPSPR 344
Query: 440 NS---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
NID G+D+ +SL ++ S + NI AA+R WKL+ G
Sbjct: 345 VELLHNIDPMFVDDFPCLGIDKRTSLPQSSSSPWDDTFFNISIH---AAIRHGKWKLLTG 401
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T+LFT+ A+ +I + +KPLFLYLA + H + LE P++ + + I D RR
Sbjct: 166 YSTNLFTERAIDVIANHKTEKPLFLYLAFQSVH-----EPLEVPEKYVKPYSSIKDVKRR 220
Query: 75 TYAALT 80
YA +
Sbjct: 221 HYAGMV 226
>gi|114326206|ref|NP_001041581.1| arylsulfatase J [Canis lupus familiaris]
gi|81158066|tpe|CAI85007.1| TPA: arylsulfatase J [Canis lupus familiaris]
Length = 598
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 204 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 260 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++ P I QL L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|301775027|ref|XP_002922933.1| PREDICTED: arylsulfatase J-like [Ailuropoda melanoleuca]
Length = 600
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 185/561 (32%), Positives = 273/561 (48%), Gaps = 74/561 (13%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P T AA L G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTP
Sbjct: 66 PEPSTTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTP 124
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ +TGKY IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+
Sbjct: 125 SRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRK 184
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATD 249
E P RGF++ FG L G YY H D + + G+D+ N + AWD G Y+T
Sbjct: 185 ECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQ 240
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
++T+ Q++ KPLFLY+A+ A H+ L+AP ++ I + NRR YAA
Sbjct: 241 MYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAA 295
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVK 368
M+ LD+++ V AL+ G NSIII+ SDNG PT GSN+P RG K
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSK 345
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T WEGG++ + SP ++ V +++HI+DW PTL + A G + +DG D W
Sbjct: 346 GTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVW 404
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
++ + R+ +D L + + + I +A+R+ WKL+ G
Sbjct: 405 ETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN- 460
Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
GY S+ VP +F SN+ R ++ ++
Sbjct: 461 -----PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERISL 487
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
G + +LFN+ DP E+ +++ P I QL L +T VP
Sbjct: 488 STGKS-------------VWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVR 534
Query: 609 HEQPDLVQADPKRFNDTWSPW 629
+ D +++P+ W PW
Sbjct: 535 YPPKD-PRSNPRLNGGVWGPW 554
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|148680337|gb|EDL12284.1| arylsulfatase J [Mus musculus]
Length = 572
Score = 278 bits (710), Expect = 8e-72, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 59 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 118 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 178 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 234 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 288
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 289 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 338
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 339 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 394
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 395 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 443
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 444 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 467
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 468 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 526
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 194 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 248
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 249 RYFEHYRSIININRRRYAAM 268
>gi|426231237|ref|XP_004009646.1| PREDICTED: arylsulfatase J [Ovis aries]
Length = 599
Score = 278 bits (710), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 88 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 146
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 147 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 206
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 207 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFL 262
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 263 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAINNVTLALKMYGF 317
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 318 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 367
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 368 KGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SEGLRSPRVDILHNI 423
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 424 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 472
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 473 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 496
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++ P I QL L +T VP + D +++P+ W PW
Sbjct: 497 FNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 555
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 223 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 277
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 278 RYFEHYRSIVNINRRRYAAM 297
>gi|281339106|gb|EFB14690.1| hypothetical protein PANDA_011975 [Ailuropoda melanoleuca]
Length = 595
Score = 278 bits (710), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 185/561 (32%), Positives = 273/561 (48%), Gaps = 74/561 (13%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
P T AA L G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTP
Sbjct: 66 PEPSTTAASQPHLIFILADDQGFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTP 124
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR+ +TGKY IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+
Sbjct: 125 SRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRK 184
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATD 249
E P RGF++ FG L G YY H D + + G+D+ N + AWD G Y+T
Sbjct: 185 ECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQ 240
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
++T+ Q++ KPLFLY+A+ A H+ L+AP ++ I + NRR YAA
Sbjct: 241 MYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAA 295
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVK 368
M+ LD+++ V AL+ G NSIII+ SDNG PT GSN+P RG K
Sbjct: 296 MLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSK 345
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T WEGG++ + SP ++ V +++HI+DW PTL + A G + +DG D W
Sbjct: 346 GTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVW 404
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
++ + R+ +D L + + + I +A+R+ WKL+ G
Sbjct: 405 ETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN- 460
Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
GY S+ VP +F SN+ R ++ ++
Sbjct: 461 -----PGY-----SDWVPPQSF-----------------------SNLGPNRWHNERISL 487
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
G + +LFN+ DP E+ +++ P I QL L +T VP
Sbjct: 488 STGKS-------------VWLFNITADPYERVDLSHRYPGIVKQLLRRLSQFNKTAVPVR 534
Query: 609 HEQPDLVQADPKRFNDTWSPW 629
+ D +++P+ W PW
Sbjct: 535 YPPKD-PRSNPRLNGGVWGPW 554
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPRKPLFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|27734088|ref|NP_775627.1| arylsulfatase J precursor [Mus musculus]
gi|77416378|sp|Q8BM89.1|ARSJ_MOUSE RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|26329953|dbj|BAC28715.1| unnamed protein product [Mus musculus]
gi|81158042|tpe|CAI84995.1| TPA: arylsulfatase J [Mus musculus]
gi|109734872|gb|AAI17814.1| Arylsulfatase J [Mus musculus]
Length = 598
Score = 278 bits (710), Expect = 9e-72, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|26338057|dbj|BAC32714.1| unnamed protein product [Mus musculus]
Length = 570
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|405964464|gb|EKC29946.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 182/554 (32%), Positives = 272/554 (49%), Gaps = 114/554 (20%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L IV YGWND+ F + ++ TPNID LA++G++LN+ Y PVCTPSR S MTG Y
Sbjct: 28 LFIVADDYGWNDVGFR-NPDVLTPNIDKLAHSGMVLNSSYVMPVCTPSRNSFMTGHYAFK 86
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
+G+Q I + PL FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ F
Sbjct: 87 SGLQHLAILPKQAACAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFF 146
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G+ NG YY T+ + G R+ T + GEY+T L+ + A +I++
Sbjct: 147 GFYNGQEDYY----------TLSVAGGKDFRDNRTPVNATGEYSTFLYARRAESIIKEHD 196
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
KP+++YL + H + L+ PQ+ + + + +RRTY MV +D+++G +
Sbjct: 197 ASKPMYMYLPFQSVH-----EPLQVPQKYSDMYSKVHTESRRTYLGMVTAMDEAIGNITK 251
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW----GSNYPYRGVKNTLWEGGVKVP 379
AL+ KGM +++IIF +DNG W G+NYP RG K T++EGG +
Sbjct: 252 ALKTKGMFNDTLIIFTADNGG-------------WITFAGNNYPLRGGKATVFEGGTRAT 298
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
A + IQ++ V M+H DW+PT+ AAGG TP
Sbjct: 299 AFVSGAGIQKSNYVYPGMIHAVDWMPTVLRAAGG---------------------TP--- 334
Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDG 495
+ +DG+DQW SL S+R + NID+ + AA+R+ +KLV G G
Sbjct: 335 DPTLDGIDQWDSLRTAGESKRTEFIYNIDDLRPEICGHAAIRMGDYKLVDG------FPG 388
Query: 496 YY-GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
+Y G + ++V Y + +S N P DK+
Sbjct: 389 FYPGWYKPDQV------------DYGLYEDVSTNYTRPF--FDKVM-------------- 420
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
LFNL +DP E +++ PDI +L L+Y+ + +VP + +
Sbjct: 421 --------------LFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSA 466
Query: 615 VQADPKRFNDTWSP 628
A P + W+P
Sbjct: 467 SSA-PTNYGGFWTP 479
Score = 42.7 bits (99), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 21/80 (26%), Positives = 42/80 (52%), Gaps = 5/80 (6%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
R+ T + GEY+T L+ + A +I++ KP+++YL + H + L+ PQ+
Sbjct: 167 RDNRTPVNATGEYSTFLYARRAESIIKEHDASKPMYMYLPFQSVH-----EPLQVPQKYS 221
Query: 63 NQFQYITDPNRRTYAALTKS 82
+ + + +RRTY + +
Sbjct: 222 DMYSKVHTESRRTYLGMVTA 241
>gi|444723687|gb|ELW64328.1| Arylsulfatase I [Tupaia chinensis]
Length = 613
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 180/542 (33%), Positives = 263/542 (48%), Gaps = 84/542 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 103 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 161
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 162 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 221
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 222 YYTYDNCDGPG----VCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLY 277
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 278 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 332
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 333 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 383
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG S + +DG D W ++ S R + +D
Sbjct: 384 RTSRALVHITDWYPTLVGLAGGVVSAVD-GLDGYDVWPAISEGRASPRTEILHNIDP--- 439
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G NG + +P
Sbjct: 440 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGNG-----------DWIPPQT 486
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S +Q+ L
Sbjct: 487 LAAFPGXXXXPSFRQVVG-----------------------------------------L 505
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 506 FNISADPYERKDLAGQRPDVVRTLLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 562
Query: 628 PW 629
PW
Sbjct: 563 PW 564
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 243 SVAWGLSGQYSTMLYAQRASHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 297
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 298 RTMGNVARRKYAAMV 312
>gi|291221493|ref|XP_002730757.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 585
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 185/569 (32%), Positives = 288/569 (50%), Gaps = 87/569 (15%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ 125
+T R L K + L++ GWND+ ++ ++ +PTP ++ LA NG+ILNN Y+Q
Sbjct: 93 MTTAERPPCGDLGKPPHIVLVLADDVGWNDVGWN-NDFMPTPILNELASNGVILNNTYSQ 151
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
P C+PSRA+L+TGKYP + G+Q + P +PL L L+ELGY AIGKWHL
Sbjct: 152 PACSPSRAALLTGKYPANAGIQHLVVQEQHPYYLPLHNTLLSTKLKELGYMNHAIGKWHL 211
Query: 186 GFFRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
GF +YTPL+RGF+S +G NG +S Y HI+ + +G D+R N G
Sbjct: 212 GFCNWKYTPLWRGFDSFYGIFNGYLSDYSTHIVHSPFIGEPGASGLDLRDNTGVVAHENG 271
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
+ T LFT+ A ++I + PLFLY A L H L+AP N + + D R
Sbjct: 272 THVTYLFTERAERIIRNHNPAAPLFLYTALLVPHG-----PLDAPPGFENSVE-VNDTRR 325
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
R YA M+ +D++VG + +AL+ KGM +++ IF+SDNG G+NYP+
Sbjct: 326 REYAGMISAMDEAVGNITNALKDKGMWNDTLFIFLSDNGGDMFYA---------GNNYPF 376
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
RG K +LWEG VK+P ++ +++ + +++H +D TL +AAGG
Sbjct: 377 RGNKASLWEGSVKIPGFIYGNMLEKKGYSNNELIHFTDLFATLLSAAGG----------- 425
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINID--EKKRTAAVRLDSWK 482
TP + +IDG++QW ++ S+R+ +L+++D E R AA+R++ +K
Sbjct: 426 ----------TP---DDDIDGINQWDTVCRGDASKRSEILLHLDTFEATRGAALRMNDYK 472
Query: 483 LVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
+ G E VP VE+ L Q ++P + D
Sbjct: 473 YIEGITE--------------LVP-----QRVEAHERFHLDQW----YVPAEHPDA---- 505
Query: 543 RQQATIHCGANPAPMTPSPCTN-GPCYLFNLGNDPCEQNNIASS--RPDISSQLYELLKY 599
P P P N +LFNL +DP E N+A+ DI ++ L+Y
Sbjct: 506 -----------PIPDAPLISQNVNNTFLFNLRDDPLETTNLATDPEMQDILEEMQARLQY 554
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
++ P + + ++ ++DPK F+ W+P
Sbjct: 555 YKDNAAPLWYPE-NIDESDPKYFDGAWTP 582
>gi|26343103|dbj|BAC35208.1| unnamed protein product [Mus musculus]
Length = 555
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQA 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|26330047|dbj|BAC28762.1| unnamed protein product [Mus musculus]
Length = 614
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 181/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R++ P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKDCMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KPLFL
Sbjct: 204 YYTHYKCD----SPGVCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YVAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 364
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 365 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 420
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP
Sbjct: 421 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQA 469
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 470 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 493
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L L +T VP + D +++P+ W PW
Sbjct: 494 FNITADPYERVDLSSRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILATHDPTKPLFLYVAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIININRRRYAAM 294
>gi|126697478|gb|ABO26696.1| sulfatase 1B precursor [Haliotis discus discus]
Length = 382
Score = 277 bits (709), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 147/387 (37%), Positives = 219/387 (56%), Gaps = 44/387 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ F + ++ TP++D LA G+ILN+ Y QP+C+PSR M+G +P HTG+Q I
Sbjct: 36 GWNDVGFR-NPQVLTPHLDKLAKAGVILNSSYVQPLCSPSRNCFMSGYFPYHTGLQDGVI 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A P VP+ FLP+ L+ELGYST A+GKWHLGF +YTP YRGF++ GY G
Sbjct: 95 RPASPGFVPIKFTFLPQKLKELGYSTHAVGKWHLGFCNLKYTPTYRGFDTFVGYYIGAED 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H + +Y + +G+D+R N S + G+Y+T +F + AV +I+ D PL+LY
Sbjct: 155 YYKH--TREYDK---FSGYDLRFNTSVYTEAKGKYSTRVFAERAVDIIKSHDTDTPLYLY 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L A HA LE P E N +++I D RRTY M+ LD++V + +AL+ G++
Sbjct: 210 LPFQAVHA-----PLEVPPEYENLYKHIHDLPRRTYCGMISALDEAVANITNALEETGLI 264
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI-QQN 390
+N +++F +DNG P + +N P RG K TLWEGG K ++S + ++
Sbjct: 265 DNLLLVFTTDNGGP---------FNGAANNLPLRGCKATLWEGGTKGAGFIYSKTLLKKT 315
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
++ MMH DW PT AGG+ + + N+DG+ Q+
Sbjct: 316 GYLNTGMMHAVDWYPTFVELAGGENT-----------------------DPNMDGISQYD 352
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVR 477
L+ PS+R + NI +K ++AA+R
Sbjct: 353 MLINGGPSKRKEFVYNIYDKSKSAAIR 379
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 44/82 (53%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N S + G+Y+T +F + AV +I+ D PL+LYL A HA LE P E
Sbjct: 170 LRFNTSVYTEAKGKYSTRVFAERAVDIIKSHDTDTPLYLYLPFQAVHA-----PLEVPPE 224
Query: 61 TINQFQYITDPNRRTYAALTKS 82
N +++I D RRTY + +
Sbjct: 225 YENLYKHIHDLPRRTYCGMISA 246
>gi|344239533|gb|EGV95636.1| Arylsulfatase J [Cricetulus griseus]
Length = 571
Score = 277 bits (708), Expect = 1e-71, Method: Compositional matrix adjust.
Identities = 178/539 (33%), Positives = 267/539 (49%), Gaps = 72/539 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 59 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 118 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 178 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFL 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 234 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKRYGF 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NSIII+ SDNG + GSN+P RG K T WEGG++ + SP ++
Sbjct: 289 YNNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 340 GTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 395
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R+ WKL+ G GY S+ VP F
Sbjct: 396 PIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQAF 444
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
SN+ R ++ T+ G + +LF
Sbjct: 445 -----------------------SNLGPNRWHNERITLSTGKS-------------IWLF 468
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
N+ DP E+ +++ P I +L L +T VP + D +++P+ W PW
Sbjct: 469 NITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 526
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 194 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPG 248
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 249 RYFEHYRSIVNINRRRYAAM 268
>gi|443696989|gb|ELT97571.1| hypothetical protein CAPTEDRAFT_178894 [Capitella teleta]
Length = 503
Score = 277 bits (708), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 152/405 (37%), Positives = 229/405 (56%), Gaps = 29/405 (7%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG++D+ +HGS I TPNID LA+ G+ L N Y QP+CTP+R+ L++G+Y IHTG+Q
Sbjct: 41 YGYHDVGYHGS-AIRTPNIDRLAFEGVRLENYYVQPICTPTRSQLLSGRYQIHTGLQHSI 99
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW A+P +P L + LRE GY+ +GKWHLGF++ EY P RGF+S +GYL G
Sbjct: 100 IWAAQPNALPKELPTLADKLREEGYANHIVGKWHLGFYKEEYVPTNRGFDSFYGYLTGSE 159
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKP 267
YY+ ++ G D R N + ++ EY+T L+ + QL+ D D+P
Sbjct: 160 FYYNKTYCLAQINRSDVCGLDFRENDRSINES--EYSTHLYAERTKQLVADHTSAHPDQP 217
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFLYLA + H LE P + +++I D NR+ YA MV +D++V V A ++
Sbjct: 218 LFLYLALQSVHG-----PLEVPAQYRTPYKHIKDENRQIYAGMVSCMDEAVKNVTDAFKK 272
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ +N+I++F +DNG E G+N+P RG K +LW+GG++ + S +
Sbjct: 273 YGLWDNTILVFSTDNGGQVAE---------GGNNWPLRGWKGSLWDGGMRGVGFVHSTLL 323
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+Q V+ Q+MH+SDW PTL AGG+ S + L DG DQW+++ R + +D
Sbjct: 324 KQKGAVAHQLMHVSDWFPTLLNQAGGNASSVKL--DGFDQWAAISDGAEGPRTEILHNID 381
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
LN +R + + AA+R++ WKL+ G NG+
Sbjct: 382 P-----LN--GKRGKRTEEVFDNTVRAALRMNKWKLLTGEPGNGS 419
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 37/70 (52%), Gaps = 8/70 (11%)
Query: 14 EYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
EY+T L+ + QL+ D D+PLFLYLA + H LE P + +++I D
Sbjct: 192 EYSTHLYAERTKQLVADHTSAHPDQPLFLYLALQSVHG-----PLEVPAQYRTPYKHIKD 246
Query: 71 PNRRTYAALT 80
NR+ YA +
Sbjct: 247 ENRQIYAGMV 256
>gi|354502405|ref|XP_003513277.1| PREDICTED: arylsulfatase J [Cricetulus griseus]
Length = 597
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 178/539 (33%), Positives = 267/539 (49%), Gaps = 72/539 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 85 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 143
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 144 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 203
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 204 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+R G
Sbjct: 260 YIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKRYGF 314
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NSIII+ SDNG + GSN+P RG K T WEGG++ + SP ++
Sbjct: 315 YNNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNK 365
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 366 GTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 421
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R+ WKL+ G GY S+ VP F
Sbjct: 422 PIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQAF 470
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
SN+ R ++ T+ G + +LF
Sbjct: 471 -----------------------SNLGPNRWHNERITLSTGKS-------------IWLF 494
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
N+ DP E+ +++ P I +L L +T VP + D +++P+ W PW
Sbjct: 495 NITADPYERVDLSGRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 552
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 220 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHDPTKPIFLYIAYQAVHS-----PLQAPG 274
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 275 RYFEHYRSIVNINRRRYAAM 294
>gi|100801472|emb|CAJ18095.1| arylsulfatase J [Homo sapiens]
Length = 596
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|441658369|ref|XP_003269374.2| PREDICTED: arylsulfatase J [Nomascus leucogenys]
Length = 597
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 269/540 (49%), Gaps = 75/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 86 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 144
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 145 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 204
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 205 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 260
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 261 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 315
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 316 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 365
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 366 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 421
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 422 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 470
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 471 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 494
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP S P +++P+ W PW
Sbjct: 495 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVS--IPQDPRSNPRLNGGVWGPW 552
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 221 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 275
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 276 RYFEHYRSIININRRRYAAM 295
>gi|301621823|ref|XP_002940244.1| PREDICTED: arylsulfatase B-like [Xenopus (Silurana) tropicalis]
Length = 502
Score = 276 bits (707), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 167/411 (40%), Positives = 232/411 (56%), Gaps = 36/411 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TP +D L+ G+ L Y QP+CTPSR+ L++G+Y IHTG+Q I
Sbjct: 24 GWNDVGFHGS-EILTPTLDFLSGQGVRLAGYYTQPLCTPSRSQLLSGRYQIHTGLQHQII 82
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P PL ++ LPE L+E GY T +GKWHLG ++ + P RGF+S+FGY G
Sbjct: 83 WPCQPHCHPLEDKLLPELLKERGYVTHMVGKWHLGMYKTDCLPTRRGFDSYFGYWTGGED 142
Query: 212 YYDHILSDQYSRTVELN----GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
YY H ++ LN D R A D +Y+T LFT AV LI + +KP
Sbjct: 143 YYSH---ERCYLITTLNITRCALDFRDGEVPATDYQMKYSTHLFTDRAVDLITNHNPEKP 199
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFLYLA+ A H+ L+ P + I + I D NRR YA MV LD++V V +AL++
Sbjct: 200 LFLYLAYQAVHSP-----LQVPDQYIEPYTSIHDKNRRLYAGMVSVLDEAVANVTNALKK 254
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+G+ N++ IF +DNG T+ + G+N+P RG K TLWEGG++ + S +
Sbjct: 255 QGLWNNTVFIFSTDNGGQTL---------SGGNNWPLRGRKWTLWEGGIRGVGFVTSSLL 305
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNS---NI 443
+Q S +++HISDWLPTL AGG T S PL DG D W ++ +PS R NI
Sbjct: 306 KQKGVTSKELIHISDWLPTLVNLAGGSTNSTKPL--DGFDVWKTISEGSPSPRVELLHNI 363
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRT--------AAVRLDSWKLVLG 486
D L + S +P+ R+ + K + AA+R +WKL+ G
Sbjct: 364 DPLFEDSFPCSVSPNTRDQAPTHAATKWQDSPFNVSVHAAIRHGNWKLLTG 414
Score = 53.9 bits (128), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 39/67 (58%), Gaps = 5/67 (7%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y+T LFT AV LI + +KPLFLYLA+ A H+ L+ P + I + I D NR
Sbjct: 177 KYSTHLFTDRAVDLITNHNPEKPLFLYLAYQAVHSP-----LQVPDQYIEPYTSIHDKNR 231
Query: 74 RTYAALT 80
R YA +
Sbjct: 232 RLYAGMV 238
>gi|296195717|ref|XP_002745502.1| PREDICTED: arylsulfatase J [Callithrix jacchus]
Length = 605
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 93 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 151
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 152 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 211
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ +++ KP+FL
Sbjct: 212 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFL 267
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 268 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 322
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 323 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 372
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + IDG D W ++ + R+ +D L
Sbjct: 373 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQIDGYDIWETI---SEGLRSPRVDILHNI 428
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 429 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 477
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 478 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 501
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 502 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 560
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ +++ KP+FLY+A+ A H+ L+AP
Sbjct: 228 LYENDNAAWDYDNGIYSTQMYTQRVQKILASHNPTKPIFLYIAYQAVHS-----PLQAPG 282
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 283 RYFEHYRSIININRRRYAAM 302
>gi|426345299|ref|XP_004040357.1| PREDICTED: arylsulfatase J [Gorilla gorilla gorilla]
Length = 599
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|410038636|ref|XP_526667.3| PREDICTED: arylsulfatase J isoform 2 [Pan troglodytes]
gi|410210212|gb|JAA02325.1| arylsulfatase family, member J [Pan troglodytes]
gi|410253696|gb|JAA14815.1| arylsulfatase family, member J [Pan troglodytes]
gi|410298378|gb|JAA27789.1| arylsulfatase family, member J [Pan troglodytes]
gi|410351985|gb|JAA42596.1| arylsulfatase family, member J [Pan troglodytes]
Length = 598
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|109389362|ref|NP_078866.3| arylsulfatase J precursor [Homo sapiens]
gi|74722580|sp|Q5FYB0.1|ARSJ_HUMAN RecName: Full=Arylsulfatase J; Short=ASJ; Flags: Precursor
gi|58201086|gb|AAW66666.1| arylsulfatase J [Homo sapiens]
gi|124376924|gb|AAI32880.1| ARSJ protein [Homo sapiens]
gi|124376926|gb|AAI32882.1| ARSJ protein [Homo sapiens]
gi|219521550|gb|AAI44266.1| ARSJ protein [Homo sapiens]
Length = 599
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|313232584|emb|CBY19254.1| unnamed protein product [Oikopleura dioica]
Length = 506
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 180/556 (32%), Positives = 273/556 (49%), Gaps = 75/556 (13%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
A+ K + +LI GWND+S H S + TPN+D L + L + Y P+CTP+R+
Sbjct: 13 VAIAKKPNVLILIADDLGWNDISLHNS-YLSTPNVDGLIQESLHLQSYYVNPICTPTRSV 71
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
LM+G+Y IHTG+Q I GA+P G+PLT+ PE ++ GY T +GKWHLGF+ + P
Sbjct: 72 LMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTHMVGKWHLGFYDEKMVP 131
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
RGFESH+GYL G +Y+H Q+ + NG D R ++ + G+Y+ DLF K
Sbjct: 132 ENRGFESHYGYLIGAEGHYNH---SQFMQG--QNGVDFRDGGASTNSSWGQYSADLFAKR 186
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
L+E V++ L++Y+ H LEAPQ ++QF +I D +RR YAAM K +
Sbjct: 187 VEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRDRRVYAAMTKSM 241
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
DD+VG +I++ + KG+ E++I+ F +DNG T+ G+N+P RG+KNTLWEG
Sbjct: 242 DDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTLYG---------GNNWPLRGLKNTLWEG 292
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G++ + P + + R Q+MH+ D PTL LD L +
Sbjct: 293 GIRGIGSIKIPGVSADKRD--QLMHVVDMFPTL-----------------LD-----LTS 328
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
P+ R DG Q L+ N S+ +S LINID ++ +W + +
Sbjct: 329 CPNTRKVKFDGKSQAKMLVDNDESQSDSFLINIDPLRKNKGADNRTWHSDFDVRVQAGL- 387
Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
R + LL S Y + +I+ P D ++++
Sbjct: 388 ------RWQQWKLL--TGAPSSDNYP-----TGHIYPPEWPQDGRKNSQSNFE------- 427
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
+N LF++ NDP E I++ +I ++ +L + T VP PD
Sbjct: 428 -------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLSSYNSTAVPCRWPNPDP 480
Query: 615 VQADPKRFNDTWSPWI 630
P + W PWI
Sbjct: 481 AGL-PDQNGGFWKPWI 495
Score = 60.5 bits (145), Expect = 3e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+ DLF K L+E V++ L++Y+ H LEAPQ ++QF +I D +
Sbjct: 176 GQYSADLFAKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRD 230
Query: 73 RRTYAALTKSTTLTL 87
RR YAA+TKS T+
Sbjct: 231 RRVYAAMTKSMDDTV 245
>gi|348583281|ref|XP_003477401.1| PREDICTED: arylsulfatase I [Cavia porcellus]
Length = 572
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 181/542 (33%), Positives = 267/542 (49%), Gaps = 80/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 118 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 178 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 234 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 288
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLKRKR 339
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
S ++HI+DW PTL AGG S +DG D W ++ S R + +D
Sbjct: 340 WTSHALVHITDWYPTLVGLAGGTVSAAD-GLDGYDVWPAVSEGRASPRTEILHNIDP--- 395
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G +G + +P
Sbjct: 396 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTGDPGSG-----------DWIPPQT 442
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 443 LAAFPGS----------------WWNLERMASVRQA---------------------VWL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 466 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 522
Query: 628 PW 629
PW
Sbjct: 523 PW 524
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 24/75 (32%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 199 SVAWGLSGQYSTMLYAQRASHILASHNPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 253
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 254 RTMGNVARRKYAAMV 268
>gi|344277505|ref|XP_003410541.1| PREDICTED: arylsulfatase J [Loxodonta africana]
Length = 599
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 267/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLACLDEAINNVTLALETYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL T A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLITLAEGQIDE-DVQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKT-------------IWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 48.5 bits (114), Expect = 0.010, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDRAAWDYDNGIYSTQMYTQRVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|402870284|ref|XP_003899162.1| PREDICTED: arylsulfatase J [Papio anubis]
Length = 597
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|355749520|gb|EHH53919.1| hypothetical protein EGM_14634 [Macaca fascicularis]
Length = 599
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|351698063|gb|EHB00982.1| Arylsulfatase J [Heterocephalus glaber]
Length = 593
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 180/540 (33%), Positives = 270/540 (50%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 81 GFRDVGYHGS-EIRTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 139
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 140 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 199
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 200 YYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFL 255
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP + ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 256 YIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAIHNVTLALKAYGF 310
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 311 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 360
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G +DG D W ++ + R+ +D L
Sbjct: 361 KGTVCKELVHITDWYPTLISLAEGQIDE-DTQLDGYDVWETI---SEGLRSPRVDILHNI 416
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 417 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 465
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 466 F-----------------------SNLGPNRWHNERITLSTGKS-------------IWL 489
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++S P I +L + L +T VP + D +++P+ W PW
Sbjct: 490 FNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 548
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 216 LYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 270
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 271 KYFEHYRSIININRRRYAAM 290
>gi|397519903|ref|XP_003830091.1| PREDICTED: arylsulfatase J [Pan paniscus]
Length = 596
Score = 276 bits (706), Expect = 2e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|355687554|gb|EHH26138.1| hypothetical protein EGK_16035 [Macaca mulatta]
Length = 599
Score = 276 bits (706), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|109075466|ref|XP_001096903.1| PREDICTED: arylsulfatase J [Macaca mulatta]
Length = 596
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 269/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + +L
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKS-------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|432911274|ref|XP_004078601.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 187/549 (34%), Positives = 281/549 (51%), Gaps = 82/549 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 55 GFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLLTGRYQIHTGLQHSII 113
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P LPE LR GYST +GKWHLGF+R+ P +GF++ FG L G +
Sbjct: 114 RSRQPSCLPRHMDTLPETLRRAGYSTHMVGKWHLGFYRKSCLPTRKGFDTFFGSLTGSVD 173
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + + + G+D+ N AW G+Y+T LFT+ A ++++ P D+PLFL
Sbjct: 174 YYSYGSCNGPG----VCGYDLHDNERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFL 229
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+ A H L+ P+ + ++ + + +RR +AAMV +D++V V AL++ G
Sbjct: 230 LLSLQAVHGP-----LQPPKSFVYLYRDMANVDRRKFAAMVSTVDEAVRNVTYALRKYGY 284
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+NS+II+ +DNGA P V GSN+P RG K T WEGGV+ A + SP ++
Sbjct: 285 YKNSVIIYSTDNGAQPYVG----------GSNWPLRGRKGTYWEGGVRGVAFVHSPLLKH 334
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
RVS ++HI+DW PTL AGG+ S+ P +DG D W ++ S R + ++
Sbjct: 335 RRRVSTALLHITDWFPTLVGLAGGNISQSP-GLDGFDVWPTISEGKESPRKEILHNINP- 392
Query: 450 SSLLLNTPSRRN--------SVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
L PS+R S L + + AA+R+ WKL+ G +G
Sbjct: 393 ---LHRAPSKRGICNHSMLPSGLSAVWDTSVQAAIRVGDWKLLTGDPGDG---------- 439
Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
+ VP+ Q+ P N+++ ++IH
Sbjct: 440 -DWVPV----------------QMLSTPPRPWWNLERT----LMSSIHKDV--------- 469
Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKR 621
+LFN+ DP E+ +++ RPD+ QL L ++ +T VP + PD QADP R
Sbjct: 470 ------WLFNITADPYERQDLSQQRPDVVLQLLGRLAFYNQTAVP-VYFPPDDPQADPDR 522
Query: 622 FNDTWSPWI 630
W PW+
Sbjct: 523 HQGAWVPWV 531
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 45/81 (55%), Gaps = 6/81 (7%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
N AW G+Y+T LFT+ A ++++ P D+PLFL L+ A H L+ P+ +
Sbjct: 193 NERVAWGHEGKYSTTLFTQRAHKILKSHDPADRPLFLLLSLQAVHGP-----LQPPKSFV 247
Query: 63 NQFQYITDPNRRTYAALTKST 83
++ + + +RR +AA+ +
Sbjct: 248 YLYRDMANVDRRKFAAMVSTV 268
>gi|363744029|ref|XP_003642960.1| PREDICTED: arylsulfatase B [Gallus gallus]
Length = 514
Score = 276 bits (705), Expect = 3e-71, Method: Compositional matrix adjust.
Identities = 165/418 (39%), Positives = 231/418 (55%), Gaps = 35/418 (8%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L L GW D+ +HGS I TP +DAL G+ L Y QP+CTPSR+ L++G+Y IH
Sbjct: 27 VLVLADDLGWGDVGWHGS-AIRTPRLDALGAGGVRLERYYTQPLCTPSRSQLLSGRYQIH 85
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q IW +P +PL E+ LPE L++ GY T +GKWHLG +R+E P RGF+++F
Sbjct: 86 TGLQHQIIWPCQPSCLPLDEKLLPELLKDAGYVTHMVGKWHLGMYRKECLPTRRGFDTYF 145
Query: 204 GYLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
GYL G YY DH + + ++ V D R A Y+T+LFT+ A+ LI +
Sbjct: 146 GYLLGSEDYYSHDHCVLIK-AKNVTRCALDFRDGEEVATGFKNMYSTNLFTERAIDLIAN 204
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+KPLFLYLA + H + LE E + + I D RR YA MV +D++VG +
Sbjct: 205 HKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRRRYAGMVSLMDEAVGNL 259
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
AL+ G+ N++++F +DNG T+ G+N+P RG K TLWEGGV+
Sbjct: 260 TDALKEYGLWNNTVLVFSTDNGGQTMAG---------GNNWPLRGRKWTLWEGGVRGVGF 310
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
+ SP ++Q S +++HISDWLPTL AGG T+ +DG D W ++ PS R
Sbjct: 311 VASPLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWKTISEGRPSPRVE 369
Query: 442 ---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
NID G+D +SL S ++ L NI AA+R WKL+ G
Sbjct: 370 LLHNIDPMFVDDYPCLGIDNRTSLPQINSSPWDNTLFNISVH---AAIRHGKWKLLTG 424
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 37/66 (56%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T+LFT+ A+ LI + +KPLFLYLA + H + LE E + + I D RR
Sbjct: 189 YSTNLFTERAIDLIANHKTEKPLFLYLAFQSVH-----EPLEVSAEYMKPYSSIKDVKRR 243
Query: 75 TYAALT 80
YA +
Sbjct: 244 RYAGMV 249
>gi|395851353|ref|XP_003798225.1| PREDICTED: arylsulfatase J [Otolemur garnettii]
Length = 661
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 177/539 (32%), Positives = 266/539 (49%), Gaps = 72/539 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 149 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 207
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 208 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGD 267
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 268 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 323
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 324 YIAYQAVHS-----PLQAPVRYFEHYRSIVNINRRRYAAMLSCLDEAIHNVTLALKAYGF 378
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NSIII+ SDNG GSN+P RG K T WEGG++ + SP ++
Sbjct: 379 YNNSIIIYSSDNGGQPAA---------GGSNWPLRGSKGTYWEGGIRAIGFVHSPLLKNK 429
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 430 GTVCRELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNID 485
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+ + + I +A+R+ WKL+ G GY S+ VP +F
Sbjct: 486 PIYTKAKNGSLAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQSF 534
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
SN+ R ++ T+ G + +LF
Sbjct: 535 -----------------------SNLGPTRWHNERITLSTGKS-------------VWLF 558
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
N+ DP E+ +++ P I +L L +T VP + D +++P+ W PW
Sbjct: 559 NITADPYERVDLSHRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 616
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 284 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPV 338
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 339 RYFEHYRSIVNINRRRYAAM 358
>gi|301621596|ref|XP_002940132.1| PREDICTED: arylsulfatase I-like [Xenopus (Silurana) tropicalis]
Length = 575
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 175/539 (32%), Positives = 269/539 (49%), Gaps = 73/539 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GFHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+++E P RGF++ G L G +
Sbjct: 119 RPRQPNCLPLHQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AWD G+Y+T L+ + Q++ +P+F+Y
Sbjct: 179 YYSYDNCDGPG----VCGFDLHEGENVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIY 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I ++ + + RR YAAMV +D +V + AL++ G
Sbjct: 235 VAFQAVHT-----PLQSPREYIYPYRGMGNVARRKYAAMVTCMDAAVKNITRALKKYGYY 289
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+I+F SDNG T + GSN+P RG K T WEGGV+ + SP I++
Sbjct: 290 SNSVIVFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLIKKKR 340
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG+ S + +GL DG D W +
Sbjct: 341 RTSRALMHITDWYPTLVKLAGGNIS----SAEGL------------------DGYDVWGA 378
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+ S R +L NID A G+ E G G + + + +F
Sbjct: 379 ISEGKESPRTEILHNIDPLYNLAKS---------GSLEEG--QGVWNTAVQASIRVKDFK 427
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC-GANPAPMTPSPCTNGPCYLF 570
+ Y ++P + + H GA + +LF
Sbjct: 428 LLTGDPGYSD--------WIPPQTLTNFPGSWWNLERHTDGARKS-----------LWLF 468
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
N+ DP E+++++SS+P++ +L L Y+ RT +P + D + +P+ W PW
Sbjct: 469 NITADPFERHDLSSSKPEVVKELLIRLGYYNRTAIPVRYPAED-PRGNPELNGGAWGPW 526
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AWD G+Y+T L+ + Q++ +P+F+Y+A A H L++P+E I +
Sbjct: 200 NVAWDQAGKYSTLLYAQRVNQILASHNPQQPIFIYVAFQAVHT-----PLQSPREYIYPY 254
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 255 RGMGNVARRKYAAMV 269
>gi|397466741|ref|XP_003805104.1| PREDICTED: arylsulfatase B [Pan paniscus]
Length = 513
Score = 275 bits (703), Expect = 5e-71, Method: Compositional matrix adjust.
Identities = 155/394 (39%), Positives = 227/394 (57%), Gaps = 32/394 (8%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q IW +P VPL E+
Sbjct: 49 TPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQIIWPCQPSCVPLDEKL 108
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYS 222
LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G YY H L D +
Sbjct: 109 LPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTLID--A 166
Query: 223 RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA 282
V D R A Y+T++FTK A+ LI + P +KPLFLYLA + H
Sbjct: 167 LNVTRCALDFRDGEEVATGYKNMYSTNIFTKRAIALITNHPPEKPLFLYLALQSVH---- 222
Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
+ L+ P+E + + +I D NR YA MV +D++VG V +AL+ G+ N++ IF +DN
Sbjct: 223 -EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSSGLWNNTVFIFSTDN 281
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
G T+ G+N+P RG K +LWEGGV+ + SP ++Q + +++HISD
Sbjct: 282 GGQTLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKQKGVKNRELIHISD 332
Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
WLPTL A G T+ +DG D W ++ +PS R + +D + + ++P RNS
Sbjct: 333 WLPTLVKLARGHTNGTK-PLDGFDVWKTISEGSPSPRIELLHNID--PNFVDSSPCPRNS 389
Query: 463 VLINIDEKKRT----------AAVRLDSWKLVLG 486
+ D+ AA+R +WKL+ G
Sbjct: 390 MAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 423
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A+ LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 190 YSTNIFTKRAIALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 244
Query: 75 TYAALT 80
YA +
Sbjct: 245 HYAGMV 250
>gi|58477551|gb|AAH89445.1| ARSJ protein [Homo sapiens]
Length = 578
Score = 275 bits (702), Expect = 7e-71, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 268/540 (49%), Gaps = 74/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ + R+ +D L
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNI 422
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ + + I +A+R+ WKL+ G GY S+ VP +
Sbjct: 423 DPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY-----SDWVPPQS 471
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F SN+ R ++ T+ G + P
Sbjct: 472 F-----------------------SNLGPNRWHNERITLSTGKSVWP------------- 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ ++++ P I +L L +T VP + D +++P+ W PW
Sbjct: 496 FNITADPYERVDLSNRYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 554
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|426231093|ref|XP_004009578.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Ovis aries]
Length = 597
Score = 274 bits (700), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 184/542 (33%), Positives = 271/542 (50%), Gaps = 85/542 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 89 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 147
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+ELGYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 148 RPRQPNCLPLDQVTLPQKLQELGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 207
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + ++ +PLFLY
Sbjct: 208 YYTYDNCDGPG----VCGFDLHEGENVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLY 263
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 264 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRHGFY 318
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 319 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 369
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 370 RTSRALVHITDWYPTLVALAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 425
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 426 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPPQT 472
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
A S N+++M S RQ +L
Sbjct: 473 LAAFPGS----------------WWNLERMASARQA---------------------VWL 495
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD + L+ Y+ RT +P + E P +A P W
Sbjct: 496 FNISADPYEREDLAGQRPDXA----RLVDYN-RTAIPVRYPAENP---RAHPDFNGGAWG 547
Query: 628 PW 629
PW
Sbjct: 548 PW 549
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 229 NVAWGLSGQYSTLLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 283
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 284 RTMGNVARRKYAAMV 298
>gi|405964468|gb|EKC29950.1| Arylsulfatase B [Crassostrea gigas]
Length = 483
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 180/543 (33%), Positives = 267/543 (49%), Gaps = 102/543 (18%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGWND+ F + + TPNID LA +G+ILN+ Y PVCTPSR S MTG+Y +G+Q
Sbjct: 35 YGWNDVGFRNPS-VLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGQYAFKSGLQHIV 93
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I + PL FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ +GY NG
Sbjct: 94 ILPQQATCAPLNNTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFYGYYNGAE 153
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY+ ++ G D R N T + GEY+T L+ + A +I+D KP+++
Sbjct: 154 DYYNLSIA---------GGKDFRDN-RTPVNATGEYSTILYARRAESIIKDHDASKPMYM 203
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQ-FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL + H + L+ E+ + + + +RRTY MV +D+++G + AL+ KG
Sbjct: 204 YLPFQSVH-----EPLQVSLESSSDIYSKVHTESRRTYLGMVTAMDEAIGNITKALKTKG 258
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
M +++I+F +DNG + Y G+NYP RG K T++EGG + A + IQ+
Sbjct: 259 MFNDTLIVFTADNGG-WITYA--------GNNYPLRGGKTTVFEGGTRATAFVSGAGIQK 309
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ V M+H DW+PT+ +AAGG TP + +DG+DQW
Sbjct: 310 SNYVYPGMIHAVDWMPTVLSAAGG---------------------TP---DPTLDGIDQW 345
Query: 450 SSLLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
SL S+R + NID+ + AA+R+ +KL+ DGY G +
Sbjct: 346 DSLRTAGESKRTEFIYNIDDLRPEICGHAAIRMGDYKLI---------DGYPGAFQDWYK 396
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
P V S Y+ + F DKM
Sbjct: 397 P-----DQVVSDLYEDVSLNDTRPFF-----DKMM------------------------- 421
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
LFNL +DP E +++ PDI +L L+Y+ + +VP + + A P +
Sbjct: 422 ---LFNLKDDPNEHVDLSKKHPDIVKKLKSRLEYYHKQIVPANFPKNSASSA-PTNYGGF 477
Query: 626 WSP 628
W+P
Sbjct: 478 WTP 480
>gi|292620475|ref|XP_002664306.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 558
Score = 273 bits (699), Expect = 1e-70, Method: Compositional matrix adjust.
Identities = 177/541 (32%), Positives = 271/541 (50%), Gaps = 77/541 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H + +I TP +D LA G+ L N Y QP+CTPSR+ +TG+Y IHTG+Q I
Sbjct: 49 GFNDIGYH-NTDIHTPTLDRLAAAGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P R LP+ L+E GY+T +GKWHLGF++R+ P RGF ++FG L G +
Sbjct: 108 RSRQPSCLPFGLRTLPQRLQEAGYATHMVGKWHLGFYKRDCLPTRRGFNTYFGSLTGSVD 167
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D ++ G D+ AW G Y+T L+T+ +++ P +PLF+
Sbjct: 168 YYTYKSCD----GPKVCGFDLHDGERVAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFI 223
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H L++P+E + ++ + + RR YA MV +D++V V AL++ G
Sbjct: 224 FLSFQAVHT-----PLQSPKEYVYPYRRMGNMFRRKYAGMVSAVDEAVRNVTYALRKYGY 278
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+N++I F +DNG + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 279 YKNTVIFFSTDNGGQPLFG---------GSNWPLRGRKGTYWEGGVRGIGFVHSPLLRRR 329
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGG+ S+ +DG D W ++ S R + +D
Sbjct: 330 RRVSRALIHITDWYPTLMRLAGGNVSQAE-GLDGFDMWDTISDGKESPRFEILHNIDP-- 386
Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
L +R S+ I AAVR+ WKL+ G G + +P
Sbjct: 387 ---LYNAARHGSLKNGYGIWNTAVQAAVRVGDWKLLTGNPGYG-----------DWIPPQ 432
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
ES + SL+ R T + ++ +
Sbjct: 433 MLGNFPES--WWSLE----------------RHTEAKKSL-------------------W 455
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ +DP E+ ++A RPD+ QL L ++ RT VP + D +ADP R W P
Sbjct: 456 LFNVADDPYERFDLAERRPDVVKQLLARLSFYNRTAVPVRYPSED-PRADPSRNGGAWGP 514
Query: 629 W 629
W
Sbjct: 515 W 515
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 7 TAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
AW G Y+T L+T+ +++ P +PLF++L+ A H L++P+E + +
Sbjct: 190 VAWGQGGRYSTHLYTQRVRKILAAHDPSSQPLFIFLSFQAVHT-----PLQSPKEYVYPY 244
Query: 66 QYITDPNRRTYAALTKST 83
+ + + RR YA + +
Sbjct: 245 RRMGNMFRRKYAGMVSAV 262
>gi|326928585|ref|XP_003210457.1| PREDICTED: arylsulfatase I-like [Meleagris gallopavo]
Length = 574
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 177/542 (32%), Positives = 280/542 (51%), Gaps = 79/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+++E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ + AWD G+Y+T L+ + +++ +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I +++ + + RR YAAMV +D++V + AL++ G
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+I+F +DNG T + GSN+P RG K T WEGGV+ + SP I++
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL + A G+ S +P +DG + W ++ S R + +D
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A++R+ WKL+ G + G D QT L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F ++ +L++L+ D +R + +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ +++ RPDI L L ++ RT +P + E P +A P W
Sbjct: 469 FNITADPYERYDLSEQRPDIVRALLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525
Query: 628 PW 629
PW
Sbjct: 526 PW 527
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AWD G+Y+T L+ + +++ +P+F+Y+A A H L++P+E I ++
Sbjct: 201 NVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRY 255
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 256 RSMGNVARRKYAAMV 270
>gi|198415046|ref|XP_002127641.1| PREDICTED: similar to Arylsulfatase B precursor (ASB)
(N-acetylgalactosamine-4-sulfatase) (G4S) [Ciona
intestinalis]
Length = 522
Score = 273 bits (698), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 177/557 (31%), Positives = 283/557 (50%), Gaps = 79/557 (14%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LLIV G+ND+ +H +I TPNI+ LA +G+IL + Y QP+CTPSR+ LMTG+Y IH
Sbjct: 30 LLIVADDLGFNDVGYHNP-DIYTPNINKLAKDGVILESYYVQPICTPSRSQLMTGRYQIH 88
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q I+ +P +P+ E LP+ L+E GY+T A+GKWHLGF+++E P RGF++ +
Sbjct: 89 TGLQHSVIFAPQPNCLPVDEIILPQKLKEAGYTTHAVGKWHLGFYKKECLPTSRGFDTFY 148
Query: 204 GYLNGVISYYDHILSDQY---SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
GY G YY + + ++T ++G D N T W+ G Y++ L+ AV++I+
Sbjct: 149 GYYCGAEDYYTKQVHANFHFGNKTRRVSGFDFHDNSRTEWEANGTYSSYLYRDRAVRIIK 208
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
PLF+YL + H L+ P + I ++++I D RRT++AMV +D+++G+
Sbjct: 209 SHNSSIPLFMYLPFQSVHFP-----LQVPAKYIKRYRHIKDRKRRTFSAMVTAMDEAIGS 263
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V+ AL+ KG+ ++++++F +DNG T+ G+N+P RG K +LWEGGV+
Sbjct: 264 VVDALKWKGIWQDTLVVFTTDNGGQTLF---------GGNNWPLRGRKASLWEGGVRGVG 314
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLL-LNTPSR 438
++ I+ R S +++HISDW PTL AG + S PL D + W ++ + S
Sbjct: 315 LVRGYGIRDKGRSSNELVHISDWFPTLLYIAGANASGTKPL--DSYNVWDAIANKDVVSP 372
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLI------NIDEKKRTAAVRLDSWKLVLGTQENGT 492
R + +D L P LI ++ K AA+R WKL G
Sbjct: 373 RVELLHNIDP----LFTRPYTPTPKLIPYPNKHGVNTTKGHAALRYKHWKLHTGDPGYDK 428
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
D + +K+Y +L + N K +ST ++H
Sbjct: 429 WD------------------LPPTKSYCNLSGMHTN------TNSKYKSTHLVPSVH--- 461
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
LF++ NDP E+ N+A P + L + + ++ VP +
Sbjct: 462 ----------------LFDIENDPQERKNLAQIYPKVVDLLLGKIAMYNKSAVPINFPPY 505
Query: 613 DLVQADPKRFNDTWSPW 629
D +++P W PW
Sbjct: 506 D-PKSNPSLHGGVWEPW 521
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 43/79 (54%), Gaps = 5/79 (6%)
Query: 4 NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
N T W+ G Y++ L+ AV++I+ PLF+YL + H L+ P + I
Sbjct: 183 NSRTEWEANGTYSSYLYRDRAVRIIKSHNSSIPLFMYLPFQSVHFP-----LQVPAKYIK 237
Query: 64 QFQYITDPNRRTYAALTKS 82
++++I D RRT++A+ +
Sbjct: 238 RYRHIKDRKRRTFSAMVTA 256
>gi|126697310|gb|ABO26612.1| arylsulfatase [Haliotis discus discus]
Length = 481
Score = 273 bits (697), Expect = 2e-70, Method: Compositional matrix adjust.
Identities = 179/542 (33%), Positives = 267/542 (49%), Gaps = 105/542 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FH +I TPNID LA G++LN+ Y QP+C+PSRA+ M+G YP TG+Q I
Sbjct: 37 GWNDIGFHNP-DIITPNIDKLAREGLLLNHHYVQPLCSPSRAAFMSGYYPFKTGLQHSVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+ELGY+T +GKWH GF TP YRGF+S FGY +
Sbjct: 96 LENQPVCLPLNITILPQKLKELGYATHIVGKWHNGFCSWNCTPTYRGFDSFFGYYGAMED 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H++ D R N + W G Y+T FT A +IE +PLFLY
Sbjct: 156 YYTHVIRGFL---------DYRNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLY 206
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA+ A + +E P + + I NRR ++ MV LD++VG V L+++G++
Sbjct: 207 LAYQAVYG-----PIEVPAKYEAMYPNIKSENRRKFSGMVSALDEAVGNVTKTLRQRGLM 261
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++++I+F +DNG E G+NYP RG K T++EGG + ++ +Q+
Sbjct: 262 DDTLILFTADNGGGVDES---------GNNYPLRGSKFTVYEGGTRAVGFMYGSGLQKTG 312
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
V M+H DWLPTL AAGG TP S+ DG++ W S
Sbjct: 313 TVFDGMIHAVDWLPTLTAAAGG---------------------TPV---SDRDGINLWPS 348
Query: 452 LLLNTPSRRNSVLINIDEKKRT----AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
L +PS R V+ N D + AA+R+ +KL+ DGY G P
Sbjct: 349 LSTASPSPRTEVVYNYDSHPQPVQGHAAIRVGDYKLI---------DGYPGPFPDWYKP- 398
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
+Q++ ++ R +R A +
Sbjct: 399 ---------------EQVTSSL--------NTRFSRDSANQY------------------ 417
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH-EQPDLVQADPKRFNDTW 626
LFNL +DP E+N++++ RPD+ +L L ++++ VP + E PD + ++P + + W
Sbjct: 418 QLFNLKDDPNERNDLSNFRPDMVKKLAARLAWYKKQAVPPNFPETPDDL-SNPALYGNVW 476
Query: 627 SP 628
SP
Sbjct: 477 SP 478
Score = 43.9 bits (102), Expect = 0.29, Method: Compositional matrix adjust.
Identities = 25/81 (30%), Positives = 38/81 (46%), Gaps = 5/81 (6%)
Query: 2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
R N + W G Y+T FT A +IE +PLFLYLA+ A + +E P +
Sbjct: 168 RNNTTPVWTDNGTYSTLRFTDVATDIIERHNQSQPLFLYLAYQAVYG-----PIEVPAKY 222
Query: 62 INQFQYITDPNRRTYAALTKS 82
+ I NRR ++ + +
Sbjct: 223 EAMYPNIKSENRRKFSGMVSA 243
>gi|21430588|gb|AAM50972.1| RE13542p [Drosophila melanogaster]
Length = 300
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 146/308 (47%), Positives = 192/308 (62%), Gaps = 12/308 (3%)
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+CTPSRA+L+TGKYPI+TGMQ I +P G+PL E + E RE GY T +GKWHLG
Sbjct: 1 MCTPSRAALLTGKYPINTGMQHYVIVNDQPWGLPLNETTMAEIFRENGYRTSLLGKWHLG 60
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+R +TP RGF+ H GYL + YY Q NGHD R +L + D VG Y
Sbjct: 61 LSQRNFTPTERGFDRHLGYLGAYVDYYTQSYEQQ---NKGYNGHDFRDSLKSTHDHVGHY 117
Query: 247 ATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
TDL T AV+ IED + +PLFL L HLA HA N ++AP E +++F+YI++
Sbjct: 118 VTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQAPAEEVSRFEYISNKT 177
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R YAAMV +LD SVG+VI AL R+ ML+NSII+F+SDNG PT T+ SNYP
Sbjct: 178 HRYYAAMVSRLDKSVGSVIDALARQEMLQNSIILFLSDNGGPTQGQHSTT-----ASNYP 232
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
RG KN+ WEG ++ A +WS + ++ V Q ++I D LPTL AA G + L++D
Sbjct: 233 LRGQKNSPWEGALRSSAAIWSTEFERLGSVWKQQIYIGDLLPTL-AAAAGISPDPALHLD 291
Query: 424 GLDQWSSL 431
GL+ WS+L
Sbjct: 292 GLNLWSAL 299
Score = 69.3 bits (168), Expect = 6e-09, Method: Compositional matrix adjust.
Identities = 36/83 (43%), Positives = 50/83 (60%), Gaps = 3/83 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIED---QPVDKPLFLYLAHLAAHAGNAGKHLEA 57
R +L + D VG Y TDL T AV+ IED + +PLFL L HLA HA N ++A
Sbjct: 103 FRDSLKSTHDHVGHYVTDLLTDAAVKEIEDHGSKNSSQPLFLLLNHLAPHAANDDDPMQA 162
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E +++F+YI++ R YAA+
Sbjct: 163 PAEEVSRFEYISNKTHRYYAAMV 185
>gi|443709644|gb|ELU04236.1| hypothetical protein CAPTEDRAFT_53259, partial [Capitella teleta]
Length = 476
Score = 273 bits (697), Expect = 3e-70, Method: Compositional matrix adjust.
Identities = 180/550 (32%), Positives = 274/550 (49%), Gaps = 84/550 (15%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LLIV G +D+ +HGS I TPNID LAY G+ L N Y QP+CTPSR+ LMTG+Y IH
Sbjct: 4 LLIVADDLGHHDVGYHGS-VIKTPNIDHLAYTGVRLENYYVQPICTPSRSQLMTGRYQIH 62
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
TG+Q I +P VPL LPE L++ GYST +GKWHLGF++ E P+ RGF+S++
Sbjct: 63 TGLQHNIINPFQPNAVPLDLPMLPEVLKQNGYSTHMVGKWHLGFYKDEVLPMNRGFDSYY 122
Query: 204 GYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
GYL G Y+ H + G D+R + W+ G+Y+T LF ++A ++
Sbjct: 123 GYLTGSEDYFTHRRCGALPGANKTVCGIDLRNDFEVDWNQTGKYSTQLFAEKAEDVVRKH 182
Query: 263 PV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
V D+PLFLY+A A HA N + P E + + I DP RR A MV +D++VG
Sbjct: 183 AVHQPDQPLFLYVAFQAVHAPN-----QVPNEYLKPYD-IDDPKRRLLAGMVTCMDEAVG 236
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
+ +A + G+ +N++++F +DNG E N G+N+P RG K++LWEGG++
Sbjct: 237 NITAAFKDTGLWDNTVVVFTTDNGGDVNE--------NSGNNWPLRGWKHSLWEGGMRGV 288
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
+ SP ++ +MH+SDW PTL AG +TS + +DG D W S+ S R
Sbjct: 289 GFVNSPLLKSFGYPVNGLMHVSDWFPTLLNLAGANTSSIS-GLDGHDIWKSIAEREDSPR 347
Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQ 499
+ +D L P R S L + + + +A+R+ +K++ G G+
Sbjct: 348 KELLHNIDP-----LYPP--RGSKLNHTYDVRVRSALRVGDFKIITGDPFTGSW------ 394
Query: 500 TRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
VP ++ KT S+N+
Sbjct: 395 -----VPPPEYH----KKTIPDPDPRSKNV------------------------------ 415
Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
+LFN+ DPCE +++ + P + + + + L + T VP D +++P
Sbjct: 416 --------WLFNVHEDPCETTDLSVALPGVLTMMLDKLLAYNATAVPCVFPDDD-PRSNP 466
Query: 620 KRFNDTWSPW 629
+ + PW
Sbjct: 467 EEHGGAFGPW 476
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/83 (33%), Positives = 43/83 (51%), Gaps = 9/83 (10%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
+R + W+ G+Y+T LF ++A ++ V D+PLFLY+A A HA N +
Sbjct: 152 LRNDFEVDWNQTGKYSTQLFAEKAEDVVRKHAVHQPDQPLFLYVAFQAVHAPN-----QV 206
Query: 58 PQETINQFQYITDPNRRTYAALT 80
P E + + I DP RR A +
Sbjct: 207 PNEYLKPYD-IDDPKRRLLAGMV 228
>gi|50755099|ref|XP_425212.1| PREDICTED: arylsulfatase I [Gallus gallus]
Length = 574
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 176/542 (32%), Positives = 280/542 (51%), Gaps = 79/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+++E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ + AWD G+Y+T L+ + +++ +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGENVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I +++ + + RR YAAMV +D++V + AL++ G
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+I+F +DNG T + GSN+P RG K T WEGGV+ + SP I++
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL + A G+ S +P +DG + W ++ S R + +D
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A++R+ WKL+ G + G D QT L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F ++ +L++L+ D +R + +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ +++ RPD+ L L ++ RT +P + E P +A P W
Sbjct: 469 FNITADPYERYDLSEQRPDVVRALLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525
Query: 628 PW 629
PW
Sbjct: 526 PW 527
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AWD G+Y+T L+ + +++ +P+F+Y+A A H L++P+E I ++
Sbjct: 201 NVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRY 255
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 256 RSMGNVARRKYAAMV 270
>gi|338713661|ref|XP_003362935.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Equus
caballus]
Length = 523
Score = 272 bits (696), Expect = 4e-70, Method: Compositional matrix adjust.
Identities = 179/550 (32%), Positives = 273/550 (49%), Gaps = 83/550 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G +D+ H S TP +DALA G++L+N Y QP+C PSR+ L++G+Y IHTG+Q I
Sbjct: 46 GXHDVGLHES-RFSTPRLDALAAGGLLLDNYYTQPLCXPSRSQLLSGRYQIHTGLQHQII 104
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P +PL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 105 WPCQPSCLPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 164
Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H + V D R A Y+T +FT+ A LI + P +KPLFL
Sbjct: 165 YYSHERCTFIDALNVTRCALDFRDGEEVATGYKNMYSTSVFTERATALITNHPPEKPLFL 224
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA + H + L+ P+E + + +I D NR YA MV LD++VG V +AL+ +G+
Sbjct: 225 YLALQSVH-----QPLQVPEEYLKPYDFIQDKNRYHYAGMVSLLDEAVGNVTAALKSRGL 279
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++ IF +DNG T+ G+N+P RG K +LWEGGV+ + SP ++Q
Sbjct: 280 WNNTVFIFSTDNGGQTLA---------GGNNWPLRGRKWSLWEGGVRGVGFVASPLLKQK 330
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ +++HISDWLPTL AGG T+ +DG + W ++ +PS R + +D
Sbjct: 331 GVKNRELIHISDWLPTLVKLAGGSTNGTK-PLDGFNVWKTISEGSPSPRVELLHNID--P 387
Query: 451 SLLLNTP--------SRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
+ + ++P ++ +S L + AA+R +WKL + GY G
Sbjct: 388 NFVDDSPCPGRSMAPAKHDSFLPEYSTFNTSIHAAIRHKNWKL---------LTGYPG-- 436
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
F P S N + + PS
Sbjct: 437 -------------------------CGYWFPPPSQY----------------NGSAIPPS 455
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
+LF++ DP E+++++ P + QL L+++ + VP D + DPK
Sbjct: 456 DPPTKTLWLFDIDRDPEERHDVSREHPHVVRQLLSRLQFYHKHSVPVYFPAQD-PRCDPK 514
Query: 621 RFNDTWSPWI 630
W PW+
Sbjct: 515 S-TGAWGPWM 523
Score = 53.1 bits (126), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T +FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 200 YSTSVFTERATALITNHPPEKPLFLYLALQSVH-----QPLQVPEEYLKPYDFIQDKNRY 254
Query: 75 TYAALT 80
YA +
Sbjct: 255 HYAGMV 260
>gi|405964467|gb|EKC29949.1| Arylsulfatase B [Crassostrea gigas]
Length = 482
Score = 271 bits (694), Expect = 6e-70, Method: Compositional matrix adjust.
Identities = 175/537 (32%), Positives = 257/537 (47%), Gaps = 93/537 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ F ++ TPNID LA +G+ILN+ Y PVCTPSR S MTG Y +G+Q I
Sbjct: 36 GWNDVGFRNP-DVLTPNIDKLARSGMILNSSYVMPVCTPSRNSFMTGHYAFKSGLQHLAI 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ PL FLP+ L+ELGY+T AIGKWHLGF + E TP YRGF++ FG+ NG
Sbjct: 95 NPQQATCAPLNYTFLPQKLKELGYATHAIGKWHLGFCKWECTPTYRGFDTFFGFYNGQED 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY T+ + G R+ + GEY+T L+++ A +I++ KP+++Y
Sbjct: 155 YY----------TLSVAGGKDFRDNKVPVNATGEYSTFLYSRRAESIIKEHDASKPIYMY 204
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L + HA L+ P + + + +RRTY MV +D+++G + AL+ KG+
Sbjct: 205 LPFQSVHA-----PLQVPNRYTDMYTNVHTKSRRTYMGMVTAMDEAIGNITKALKTKGIF 259
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+++IIF +DNG + G+NYP RG K TL+EGG + A + IQ+
Sbjct: 260 NDTLIIFTTDNGGAV---------KFAGNNYPLRGGKATLFEGGTRATAFVTGAGIQKFN 310
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
++H DW+PT+ +AAGG +DG+DQW SL S+RN I +D +
Sbjct: 311 YEYSGLIHAVDWMPTVLSAAGGTADS---KLDGIDQWDSLRTARESKRNEFIYNIDDFRP 367
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
E AA+R+ +KL+ DGY G + P
Sbjct: 368 -----------------EIGGHAAIRIGDYKLI---------DGYPGLHQGWYKP----- 396
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFN 571
+ Y + +S N P DKM LFN
Sbjct: 397 ---DHVGYDLNEDVSTNCMRPF--FDKMM----------------------------LFN 423
Query: 572 LGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
L +DP E +++ PDI +L L+Y+ +VP + + A P + WSP
Sbjct: 424 LQDDPNEHVDLSRKHPDIVKKLKSRLEYYHNQIVPANFPKNSASSA-PTSYGGFWSP 479
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 19/73 (26%), Positives = 38/73 (52%), Gaps = 5/73 (6%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
+ GEY+T L+++ A +I++ KP+++YL + HA L+ P + + +
Sbjct: 174 NATGEYSTFLYSRRAESIIKEHDASKPIYMYLPFQSVHA-----PLQVPNRYTDMYTNVH 228
Query: 70 DPNRRTYAALTKS 82
+RRTY + +
Sbjct: 229 TKSRRTYMGMVTA 241
>gi|68437903|ref|XP_692213.1| PREDICTED: arylsulfatase I-like [Danio rerio]
Length = 562
Score = 271 bits (692), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 181/561 (32%), Positives = 263/561 (46%), Gaps = 111/561 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +HGS EI TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 41 GYNDIGYHGS-EIQTPVLDQLAGEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P LPE L+E GYST +GKWHLGF E P RGF+S G L G
Sbjct: 100 RARQPLCLPPDTPTLPERLQEAGYSTHMVGKWHLGFCHPECLPTSRGFQSFLGSLTGSGD 159
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ S Q E G D+ AW+ G Y+T L+T+ ++ KPLFLY
Sbjct: 160 HF----SFQSCDGTEACGFDLHDGDRPAWELRGNYSTRLYTERVKDILRRHDHRKPLFLY 215
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L+AP + ++Q + + RR YAAMV +D+SVG ++S L+ +G
Sbjct: 216 VALQAVHT-----PLQAPGHLLRRYQALGNRPRRHYAAMVSGVDESVGEIVSELRERGYY 270
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS++I+ SDNG + + G N+P RG K + WEGGV+ + SP +++
Sbjct: 271 DNSVLIYSSDNGGQPL---------SGGCNWPLRGGKGSYWEGGVRAVGFVHSPLLKRKG 321
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
VS ++HISDW PTL + AG S ++++DG D W +
Sbjct: 322 VVSHALIHISDWYPTLLSLAGYRES----------------------DSNHVDGQDVWET 359
Query: 452 LLLNTPSRRNSVLINIDEKKR----------------------TAAVRLDSWKLVLGTQE 489
+ P R +L NID R AA+R WKL+ G
Sbjct: 360 ISSGLPCPRTEILFNIDPVSRRHGDINPKLLTLNGFGIWDTGVRAAIRAGDWKLLTGNIG 419
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
+G D + QT + +Q ++ K R R ++
Sbjct: 420 DG--DWFPPQTMPGG-----------PQQWQGME--------------KRRDQRGKSV-- 450
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
+LFN+ DP E+ ++A +RP++ L L + RT V
Sbjct: 451 ------------------WLFNITADPYERADLAEARPEVVKVLLTRLSEYNRTAV-SPR 491
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
PD ADP+ W+PW+
Sbjct: 492 NPPDDPMADPQLHGGVWTPWL 512
Score = 47.4 bits (111), Expect = 0.026, Method: Compositional matrix adjust.
Identities = 25/76 (32%), Positives = 39/76 (51%), Gaps = 5/76 (6%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
AW+ G Y+T L+T+ ++ KPLFLY+A A H L+AP + ++Q
Sbjct: 183 AWELRGNYSTRLYTERVKDILRRHDHRKPLFLYVALQAVHT-----PLQAPGHLLRRYQA 237
Query: 68 ITDPNRRTYAALTKST 83
+ + RR YAA+
Sbjct: 238 LGNRPRRHYAAMVSGV 253
>gi|291231208|ref|XP_002735557.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 270 bits (691), Expect = 1e-69, Method: Compositional matrix adjust.
Identities = 180/564 (31%), Positives = 274/564 (48%), Gaps = 88/564 (15%)
Query: 75 TYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
++ ++ S T + I+ YG++D+ +H S I TPN+D LA G+ L N Y QP CTP+
Sbjct: 5 SWRYISSSPTHIVFILADDYGFHDIGYHDS-IIKTPNLDRLASEGVKLENYYVQPKCTPT 63
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
R+ LM+G+Y IHTG+Q IW +P +P+ E +P+ L+E GY+T +GKWHLG +++E
Sbjct: 64 RSQLMSGRYQIHTGLQHGIIWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKE 123
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDL 250
P RGF++ FGYL G YY H S Y + +G D R N+ G+Y+T +
Sbjct: 124 CLPTERGFDTFFGYLTGSEDYYTHNRS--YDK---FHGMDFRENMQIVQPQYNGQYSTHV 178
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
F ++A +I+ PLFLYL A H L+ P + + IT+ RRTYA M
Sbjct: 179 FAEKAKNIIKSHDPQIPLFLYLPLHAVHGP-----LQVPDQYEKPYTNITNKQRRTYAGM 233
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V +D++VG + L+ G+ +NS+IIF +DNG G+N+P RG K +
Sbjct: 234 VSCMDEAVGNITQTLEDAGLYDNSVIIFSTDNGGQVTA---------GGNNWPLRGWKGS 284
Query: 371 LWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
LWEGG+ + SP I R S +++H++DW PT+ AGG S
Sbjct: 285 LWEGGMHGVGFVNSPLINHQARGTTSKELIHVTDWFPTIVHLAGGSVSG----------- 333
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
+DG +QW ++ S R +L NID K+ T
Sbjct: 334 -----------TQPLDGYNQWETISNGKKSPRKELLHNIDPKRTTL-------------- 368
Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
QT +N N I + + Q + + K+ + +
Sbjct: 369 ----------QTSNN-------NMIYRKDMFDASIQAAMRV-----GDWKILTGNPGNSS 406
Query: 549 HCGANPAPMTPSPCTNGP---CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
+ +TP T+ P +LFN+ DP E N+++ SRPD ++L E+L+ + +T V
Sbjct: 407 WIPPPESSITPPHPTDIPGKNIWLFNIAEDPNEYNDLSLSRPDKVNELLEILQQYYKTSV 466
Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
P H D DP + + W PW
Sbjct: 467 P-VHYPDDEPACDPDKNDGAWGPW 489
Score = 42.4 bits (98), Expect = 0.85, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+T +F ++A +I+ PLFLYL A H L+ P + + IT+
Sbjct: 172 GQYSTHVFAEKAKNIIKSHDPQIPLFLYLPLHAVHGP-----LQVPDQYEKPYTNITNKQ 226
Query: 73 RRTYAALT 80
RRTYA +
Sbjct: 227 RRTYAGMV 234
>gi|449267146|gb|EMC78112.1| Arylsulfatase I [Columba livia]
Length = 573
Score = 270 bits (690), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 175/542 (32%), Positives = 279/542 (51%), Gaps = 79/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDVGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+++E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ AWD G+Y+T L+ + +++ +P+F+Y
Sbjct: 180 YYTYDNCDGPG----VCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I +++ + + RR YAAMV +D++V + AL++ G
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+I+F +DNG T + GSN+P RG K T WEGGV+ + SP I++
Sbjct: 291 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL + A G+ S +P +DG + W ++ S R + +D
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVP-GLDGYNVWPAISEGKESPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A++R+ WKL+ G + G D QT L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F ++ +L++L+ D +R + +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLRKS------------------------VWL 468
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ +++ RPD+ L L ++ +T +P + E P +A P W
Sbjct: 469 FNITADPYERYDLSDQRPDVVRTLLMRLVHYNQTAIPVRYPAENP---RAHPDFNGGAWG 525
Query: 628 PW 629
PW
Sbjct: 526 PW 527
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 5/74 (6%)
Query: 7 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
AWD G+Y+T L+ + +++ +P+F+Y+A A H L++P+E I +++
Sbjct: 202 VAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYR 256
Query: 67 YITDPNRRTYAALT 80
+ + RR YAA+
Sbjct: 257 SMGNVARRKYAAMV 270
>gi|405977794|gb|EKC42228.1| Arylsulfatase I [Crassostrea gigas]
Length = 545
Score = 270 bits (689), Expect = 2e-69, Method: Compositional matrix adjust.
Identities = 168/551 (30%), Positives = 268/551 (48%), Gaps = 62/551 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I TPN+D +A G+ILN+ Y P+CTPSR S +TG YP G+ G I
Sbjct: 39 GWNDVGWH-NPDIKTPNLDRMAGGGVILNSSYVHPICTPSRNSFLTGVYPFRVGLSGTAI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ R + L LPE L++LGYST IGKWHLGF YTP RGF+S G+ G
Sbjct: 98 TPHQARFMSLKTPTLPEKLKKLGYSTHMIGKWHLGFCNERYTPTRRGFDSFLGFYTGTQD 157
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
YY H + G+D R N + + +Y+T + K AV +I + K PLFL
Sbjct: 158 YYKHTTA---------KGYDFRFNQTVFYPPKKQYSTKTYAKRAVDIITEHKRKKNPLFL 208
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA + H L+ P++ Q+ I + +RR Y+ MV +DD++G V++A ++
Sbjct: 209 YLAFQSVHTP-----LQVPKKYEKQYNNIKNKDRRVYSGMVTAMDDAIGAVMAAFRKHKF 263
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQ 389
+ N I++F +DNG G+N+P RG K T+WEGG + + ++S +++
Sbjct: 264 MGNLIVVFTTDNGGAA---------HIVGNNFPLRGSKTTVWEGGTRAVSFVYSKNLLKK 314
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ H +DW PT+ AGG ++ +DG++QW
Sbjct: 315 TGYTHDGLFHATDWFPTILAVAGGK----------------------PKQEKQLDGVNQW 352
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK-VPLL 508
+ PSRR + ID+ + +A+R+ +KL++G G + +Y +SN P+
Sbjct: 353 PMIRSGKPSRRTEFVYEIDKSRPVSAIRMGKYKLIVGNP--GDYNNWYRIAKSNSFCPIY 410
Query: 509 NFNAIVESK---TYQSLQQLSQN----IFLPISNIDKMRSTRQQATIHCGANPAPMTPSP 561
+ N + Y+SL + +N + S + R R+ +
Sbjct: 411 DENETDSDEDIFIYESLNNIVRNAKGYVKWLSSKLPGYRMVRKWHRYRQLERCRKLKKRD 470
Query: 562 CTN--GPCY-LFNLGNDPCEQNNIASSRPDISSQLYELLK-YHRRTLVPQSHEQPDLVQA 617
P Y L++L DP E+ NIA +P + ++ L+ Y ++T + + A
Sbjct: 471 KAKWRFPRYLLYDLEADPSERKNIAGYKPVLVRKMKARLEHYQKKTKHVAGQSKRKVAAA 530
Query: 618 DPKRFNDTWSP 628
DPK + W+P
Sbjct: 531 DPKHHHGVWTP 541
Score = 41.2 bits (95), Expect = 1.7, Method: Compositional matrix adjust.
Identities = 24/81 (29%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQ 59
R N + + +Y+T + K AV +I + K PLFLYLA + H L+ P+
Sbjct: 169 FRFNQTVFYPPKKQYSTKTYAKRAVDIITEHKRKKNPLFLYLAFQSVHTP-----LQVPK 223
Query: 60 ETINQFQYITDPNRRTYAALT 80
+ Q+ I + +RR Y+ +
Sbjct: 224 KYEKQYNNIKNKDRRVYSGMV 244
>gi|443704600|gb|ELU01579.1| hypothetical protein CAPTEDRAFT_176799 [Capitella teleta]
Length = 476
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 153/405 (37%), Positives = 225/405 (55%), Gaps = 33/405 (8%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGW+D+ +HGS +I TP +D LAYNGI L N Y QP+C+P+R+ M+G Y IHTG+Q
Sbjct: 18 YGWHDIGYHGS-KIRTPVLDDLAYNGIRLENYYVQPICSPTRSQFMSGVYQIHTGLQHNV 76
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW A+ G+PL + + +RE GY+T GKWHLG+++ EY P RGF++++GYLNG
Sbjct: 77 IWPAQANGLPLEFPTIADKMREAGYATHMAGKWHLGYYKEEYLPHNRGFDTYYGYLNGCE 136
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKP 267
YYD Y G+D R N W+ +Y+T L+ +++ + + DKP
Sbjct: 137 DYYDKSYCHPYC------GYDFRLNDDIQWNLT-DYSTYLYVSRVNEILLNHKIYSPDKP 189
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFLYL + H + LE P+E +++ +I D NRRTYA MV +D++VG + ++
Sbjct: 190 LFLYLPLQSVH-----EPLEVPKEYSDKYSHIKDNNRRTYAGMVAAMDEAVGKIRDLFKK 244
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
M + +++ F +DNG G+N+P RG K +LWEGG++ + SP +
Sbjct: 245 YEMWDETVLAFSTDNGG---------QIHRGGNNWPLRGWKISLWEGGMRGVGFVHSPLL 295
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ S +MH+SDWLPTL AGG+T+ + L DG + W L T S R + +D
Sbjct: 296 PNSGGKSKGLMHVSDWLPTLVHVAGGNTTGMKL--DGYNVWDMLTKGTESPRKEILHNID 353
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
L P D + R AA+R WKL+ G NG+
Sbjct: 354 P-----LMEPKGHARPNSTFDTRVR-AAIRYGDWKLITGDPGNGS 392
Score = 46.2 bits (108), Expect = 0.054, Method: Compositional matrix adjust.
Identities = 27/85 (31%), Positives = 45/85 (52%), Gaps = 9/85 (10%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
R N W+ +Y+T L+ +++ + + DKPLFLYL + H + LE
Sbjct: 152 FRLNDDIQWNLT-DYSTYLYVSRVNEILLNHKIYSPDKPLFLYLPLQSVH-----EPLEV 205
Query: 58 PQETINQFQYITDPNRRTYAALTKS 82
P+E +++ +I D NRRTYA + +
Sbjct: 206 PKEYSDKYSHIKDNNRRTYAGMVAA 230
>gi|224067708|ref|XP_002198824.1| PREDICTED: arylsulfatase I [Taeniopygia guttata]
Length = 575
Score = 269 bits (688), Expect = 3e-69, Method: Compositional matrix adjust.
Identities = 174/542 (32%), Positives = 279/542 (51%), Gaps = 79/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 61 GYHDIGYHGS-DIQTPTLDRLAAEGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 119
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+++E P RGF++ G L G +
Sbjct: 120 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRGFDTFLGSLTGNVD 179
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D ++ G+D+ AWD G+Y+T L+ + +++ +P+F+Y
Sbjct: 180 YYTYDNCD----GPDVCGYDLHEGEDVAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIY 235
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I +++ + + RR YAAMV +D++V + AL++ G
Sbjct: 236 VAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 290
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+++F +DNG T + GSN+P RG K T WEGGV+ + SP I++
Sbjct: 291 DNSVMVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKRKR 341
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL + A G+ S + +DG D W ++ S R + +D
Sbjct: 342 RTSWALVHITDWYPTLVSLARGNLSNVQ-GLDGYDVWPAISEGKDSPRTEILHNIDP--- 397
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A++R+ WKL+ G + G D QT L N
Sbjct: 398 --LYNHAKYGSLEDGFGIWNTAVQASIRVGEWKLLTG--DPGYSDWIPPQT------LTN 447
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F ++ +L++L+ D ++ + +L
Sbjct: 448 FPG-----SWWNLERLT----------DGLKKS------------------------MWL 468
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ +++ RPD+ L L ++ RT +P + E P +A P W
Sbjct: 469 FNITADPYERYDLSEQRPDVVRTLLTRLVHYNRTAIPVRYPAENP---RAHPDFNGGAWG 525
Query: 628 PW 629
PW
Sbjct: 526 PW 527
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 42/74 (56%), Gaps = 5/74 (6%)
Query: 7 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
AWD G+Y+T L+ + +++ +P+F+Y+A A H L++P+E I +++
Sbjct: 202 VAWDQSGKYSTFLYAQRVSKILASHSPKEPIFIYVAFQAVHT-----PLQSPKEYIYRYR 256
Query: 67 YITDPNRRTYAALT 80
+ + RR YAA+
Sbjct: 257 SMGNVARRKYAAMV 270
>gi|260788430|ref|XP_002589253.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
gi|229274428|gb|EEN45264.1| hypothetical protein BRAFLDRAFT_213051 [Branchiostoma floridae]
Length = 449
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 178/523 (34%), Positives = 259/523 (49%), Gaps = 79/523 (15%)
Query: 112 LAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR 171
LA G+ L N Y QP+C+PSR LMTG+Y I G+Q I P G+PL E LP+ LR
Sbjct: 2 LASEGVKLENYYIQPICSPSRCQLMTGRYQIRYGLQHSVITSDRPHGLPLDEVTLPQKLR 61
Query: 172 ELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT-VELNGH 230
E GY + +GKWHLGFFR+EY PL RGF+ +GYL G Y+ H + Y+R +G
Sbjct: 62 ENGYRSYIVGKWHLGFFRKEYMPLQRGFDRFYGYLTGGEDYWTHRRPNGYARDPSAFHGL 121
Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
D+R D G Y+T LF ++A++ I KP+FLYL A H+ LEAP
Sbjct: 122 DLRDQDKPVLDQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHSP-----LEAPT 176
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
+ + +++ +T P + YAAMV +D++VG V AL+R G+ +N+++IF +DNGA R
Sbjct: 177 KYVEKYKDVTHPLIKIYAAMVTAMDEAVGNVTDALKRTGLWDNTVLIFSTDNGA-----R 231
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
E N GSN+P RG KNTLWEGGV+ + S +++ R S ++HISDW PTL
Sbjct: 232 E-----NAGSNWPLRGWKNTLWEGGVRGVGFVNSKLLRKGNRKSDALIHISDWFPTLLRI 286
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINI 467
+ S +DG D W ++ TPS R NID L + ++ S N + +I
Sbjct: 287 SRS-VSNGTKPLDGFDIWDTISTGTPSPRKEILHNIDPLHRKTA-----SSSFNLLYDDI 340
Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQ 527
AA+R WKL+ T D +G + + F V+ T
Sbjct: 341 FNISVYAAIRSGDWKLL-------TGDQGHGGWKDSDTSGSKFEYAVDPPTKH------- 386
Query: 528 NIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP 587
+LFN+ NDP E+++++ P
Sbjct: 387 ---------------------------------------LWLFNIRNDPQERSDLSDRYP 407
Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
D+ +L + L + +T +P D +A+P D + PW+
Sbjct: 408 DVVQELLQKLAAYNKTALPPFWPSRD-PRANPALHGDLFGPWL 449
Score = 49.3 bits (116), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 25/73 (34%), Positives = 41/73 (56%), Gaps = 5/73 (6%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF ++A++ I KP+FLYL A H+ LEAP + + +++ +T
Sbjct: 132 DQNGTYSTHLFAQKAIEFILSHERSKPMFLYLPFQAVHSP-----LEAPTKYVEKYKDVT 186
Query: 70 DPNRRTYAALTKS 82
P + YAA+ +
Sbjct: 187 HPLIKIYAAMVTA 199
>gi|241601772|ref|XP_002405049.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500554|gb|EEC10048.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 451
Score = 269 bits (687), Expect = 4e-69, Method: Compositional matrix adjust.
Identities = 179/510 (35%), Positives = 249/510 (48%), Gaps = 81/510 (15%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
I +GMQ I A P G+PL + +P++ ++LGY IGKWHLGFF+ YTP RGF+S
Sbjct: 1 ILSGMQAEVIRDAAPWGLPLELKIMPQHFKDLGYQVNMIGKWHLGFFQTPYTPPKRGFDS 60
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD--TVGEYATDLFTKEAVQLI 259
FGY G YY+H LN + T + +Y+T + L+
Sbjct: 61 FFGYYTGENDYYNHTAG-------TLNRPPLYGTFKTKQTRCVLAKYSTHTWG-----LL 108
Query: 260 EDQPV-----DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
E P+ +P F Y ++ A HA +AP+ + +F YI + NR YA M+ L
Sbjct: 109 EAFPLLLNLHSQPFFCYFSYQAVHAALMNVPFQAPERNVLKFPYIGETNRTLYAGMLDSL 168
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D+S+G V+ L GMLE++II F SDNG + Y SN G N+P RG K TLWEG
Sbjct: 169 DESIGRVVEILGNAGMLEDTIIAFSSDNGG--LPYNAFSNR---GFNWPLRGAKGTLWEG 223
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
GV+VPA +WS + + PRVS Q+MHISDWLPTLY+AAGGD S L
Sbjct: 224 GVRVPAFVWSSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNL---------------- 267
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
++DG++ W L L S R VL+ ID A +R ++KLV+G + +D
Sbjct: 268 ------GSLDGVNMWRYLTLGLGSARKEVLLTIDPMYNLAGLRYKNYKLVVGEGFDEELD 321
Query: 495 GYY----GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
Y G +N VP L +SK + L+ + R + +
Sbjct: 322 DRYAFPGGLRPTNDVPQLR----KDSKVARVLKTFYG------------KQNRNWSPLDW 365
Query: 551 GANPAP------MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
NP+ +T + P YLFNL DPCE NN+A S+ I + L + T
Sbjct: 366 SKNPSVDCRERWLTNNFVARRPPYLFNLARDPCELNNLAGSKRIILHFMMHKLAAYNATA 425
Query: 605 VPQSHEQPDLVQADPKRFND----TWSPWI 630
VP P + DP+ + + WSPW+
Sbjct: 426 VP-----PRFLPEDPRGYPEYHGGVWSPWL 450
>gi|348542810|ref|XP_003458877.1| PREDICTED: arylsulfatase J [Oreochromis niloticus]
Length = 551
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 179/540 (33%), Positives = 273/540 (50%), Gaps = 72/540 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS +I TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 35 GFRDVGYHGS-DIKTPTLDRLAAEGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSVI 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A+P +PL LP+ L+ GYST +GKWHLGF++R P RGF++ FG L G
Sbjct: 94 RAAQPNCLPLENVTLPQKLKNAGYSTHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLF 269
YY H S + G+D+ AW+ G Y+T++FT++AV ++ + P +PLF
Sbjct: 154 YYSHYKCQGPS----MCGYDLYEGEEAAWEQDRGLYSTEMFTQKAVSILANHNPRKQPLF 209
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYLA+ A H+ L+ P + +++ I +P RR YAAMV LD+++ + AL+ G
Sbjct: 210 LYLAYQAVHSP-----LQVPARYLERYKGIPNPYRRKYAAMVSCLDEAIHNLTLALKHYG 264
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+N++I++ SDNG + GSN+P RG K T WEGG++ + SP +
Sbjct: 265 YYDNTVIVYSSDNGGQPLAG---------GSNWPLRGSKATYWEGGIRTVGFVHSPLLMN 315
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
++HI+DW PTL T G T LN+DG D W ++ PS R+ + +D
Sbjct: 316 KGTKCRSLIHITDWFPTLVTLGEG-TLDEDLNLDGYDVWETISEGRPSPRHDILHNIDPI 374
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
N + L N + AA+R+ WKL+ G + GY S+ VP
Sbjct: 375 YIKAKNGSWKAGYGLWNTAIQ---AALRVGHWKLLTG------VPGY-----SDWVP--- 417
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
P + +++ +TR +H P S +L
Sbjct: 418 ----------------------PQTFLNQRLTTR----LHNERVPWDRGKS------IWL 445
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
FN+ DP E+ +++ P I ++ L + +T VP + DL +++P+ W PW
Sbjct: 446 FNITADPYERVDLSQRYPHIVKKMLMRLAQYNKTAVPVCYPTKDL-RSNPQYNGGVWGPW 504
Score = 51.6 bits (122), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 28/76 (36%), Positives = 47/76 (61%), Gaps = 7/76 (9%)
Query: 7 TAWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
AW+ G Y+T++FT++AV ++ + P +PLFLYLA+ A H+ L+ P + +
Sbjct: 176 AAWEQDRGLYSTEMFTQKAVSILANHNPRKQPLFLYLAYQAVHSP-----LQVPARYLER 230
Query: 65 FQYITDPNRRTYAALT 80
++ I +P RR YAA+
Sbjct: 231 YKGIPNPYRRKYAAMV 246
>gi|198420473|ref|XP_002123848.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 517
Score = 268 bits (686), Expect = 5e-69, Method: Compositional matrix adjust.
Identities = 179/561 (31%), Positives = 269/561 (47%), Gaps = 71/561 (12%)
Query: 77 AALTKSTTLTLLIV-YGWNDLSFHG---SNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
A LTK + +L YG+ND+ +H +++ TP +D+LA G+ L N Y QP+C+PSR
Sbjct: 19 AELTKPNIVFILADDYGFNDIGYHAVEHHSDMKTPFLDSLAMAGVRLENYYIQPICSPSR 78
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+ LM+G+Y IHTG+Q I + G+PL LPE L + GY T +GKWHLGF++ EY
Sbjct: 79 SVLMSGRYQIHTGLQHYVISPQQRNGLPLDNIILPEQLHKCGYDTHMVGKWHLGFYKDEY 138
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
P RGF S+FGYL G YY D +L G+DM T G+Y+ +LF
Sbjct: 139 LPWKRGFNSYFGYLTGGEDYYTKWRCDG-----KLCGYDMTSEKGPTNATYGQYSANLFA 193
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+A + I+ KPLFLY+A + H+ +E P+ F YI + NR+ Y MV
Sbjct: 194 NKANEAIDKHDKTKPLFLYVAFQSVHSP-----MEVPESYAKPFDYIKNHNRKMYGGMVA 248
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
+D++V + LQ G+ +N+I++F +DNG T+ + G+N+P RG K TLW
Sbjct: 249 AMDEAVKNITEHLQAAGLWDNTILVFSADNGGQTL---------SGGNNWPLRGRKLTLW 299
Query: 373 EGGVKVPAILWSPQIQ-QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
EGG+K + + NP V+ +M+HISDW PT+ A +DG++QW
Sbjct: 300 EGGIKGVGFVHGKILNVPNPNYIVNNEMIHISDWFPTIMEATQCPYVEGTQKLDGVNQWD 359
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
++ T S R + +D + L + + ++ + D A +R WKL+ G Q
Sbjct: 360 TIFGKTVSARKEILLNIDPLFTPLFYKETPKQNIQMGFDTSVH-AGIRYGEWKLLTGLQ- 417
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
G R + P E K +K +Q+ T H
Sbjct: 418 --------GDDRWIEPP--------EWK-------------------NKAEEEQQEDTEH 442
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
G L+N+ NDP EQ ++ P + + L L + T VP ++
Sbjct: 443 PGF-------QKFATKSVQLYNIVNDPYEQVEVSDLYPSVVNDLLSKLAAYNATAVPVNY 495
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
D +DP W PW+
Sbjct: 496 PPSDPT-SDPSLHGGFWGPWV 515
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 38/72 (52%), Gaps = 5/72 (6%)
Query: 11 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
T G+Y+ +LF +A + I+ KPLFLY+A + H+ +E P+ F YI +
Sbjct: 183 TYGQYSANLFANKANEAIDKHDKTKPLFLYVAFQSVHSP-----MEVPESYAKPFDYIKN 237
Query: 71 PNRRTYAALTKS 82
NR+ Y + +
Sbjct: 238 HNRKMYGGMVAA 249
>gi|395736371|ref|XP_003780537.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I [Pongo abelii]
Length = 481
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 172/502 (34%), Positives = 258/502 (51%), Gaps = 75/502 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 46 GYHDVGYHGS-DIETPTLDRLAAKGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 104
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 105 RPQQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 164
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ +PLFLY
Sbjct: 165 YYTYDNCD----GPGVCGFDLHEGENVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLY 220
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 221 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 275
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 276 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQ 326
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S +MHI+DW PTL AGG TS +DG + W ++ S R + +D
Sbjct: 327 RTSRALMHITDWYPTLVGLAGGTTSAAD-GLDGYNVWPAISEGRASPRTEILHNIDP--- 382
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 383 --LYNHAQHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 427
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 428 ----------QTLATFPGSWW----NLERMASVRQAV---------------------WL 452
Query: 570 FNLGNDPCEQNNIASSRPDISS 591
FN+ P E+ ++A RPD+S+
Sbjct: 453 FNISAXPYEREDLAGQRPDVSA 474
Score = 47.8 bits (112), Expect = 0.019, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 186 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 240
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 241 RTMGNVARRKYAAMV 255
>gi|291237236|ref|XP_002738543.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 514
Score = 268 bits (684), Expect = 8e-69, Method: Compositional matrix adjust.
Identities = 181/546 (33%), Positives = 277/546 (50%), Gaps = 93/546 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H ++ +PTPN++ LA G+IL+NMY+QP+CTPSR +LMTGKYP GMQ +
Sbjct: 50 GWNDVGWH-NDFMPTPNLNTLAREGVILDNMYSQPICTPSRVALMTGKYPAKVGMQHFVV 108
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P +P L E L+E GY+ +GKWHLG +YTP++RGF+SH+G GV S
Sbjct: 109 LPMRPYYLPGNYATLAEKLKEQGYTNHIVGKWHLGSCDWKYTPMWRGFDSHYGCHEGVTS 168
Query: 212 YYD-HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
++ H+L+ + V ++G D+R N G + T LF++ A +++++ + PLFL
Sbjct: 169 NFETHMLT--WPPVVGVSGRDLRDNTGLVTHENGTHNTMLFSERAERIVKNHNPESPLFL 226
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+ ++A H L+APQ Q + D +RRT+A M+ +D VG + AL+ KGM
Sbjct: 227 YVPYMAPHF-----PLQAPQGFEEAVQ-LDDTDRRTFAGMMAAMDQGVGNITQALKDKGM 280
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
++++ +F+SDNG + GSNYPYRG K T+WEGGVKVPA + ++ +
Sbjct: 281 WDDTLFVFLSDNGGDKMFA---------GSNYPYRGNKATMWEGGVKVPAFIRGSMLKTS 331
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
V+ ++ H +D TL AGG +IDG +QW
Sbjct: 332 GTVNNELYHFTDIFATLLKVAGGTPDE---DIDGKNQW---------------------D 367
Query: 451 SLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
S PS+R+ +L+++DE T AA+R+ +K + G N VP+L
Sbjct: 368 SFCKGKPSKRSEILLHLDEHPVTNGAALRMGDYKYMEGVTIN--------------VPML 413
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS--PCTNGP 566
LP ++I+ A NP TP P T
Sbjct: 414 ----------------------LPRNDIEHFDQWFTPA-----ENPDASTPDVPPITENV 446
Query: 567 C--YLFNLGNDPCEQNNIAS--SRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
YLFN+ +DP E NN+ + ++ +I +++ + L R ++ P D +P F
Sbjct: 447 NNRYLFNIKDDPLETNNLYNDPAKQEIINEMKQRLDEFRTSVPPFWFPGED-ESGNPDNF 505
Query: 623 NDTWSP 628
WSP
Sbjct: 506 GGAWSP 511
Score = 39.7 bits (91), Expect = 4.9, Method: Compositional matrix adjust.
Identities = 24/82 (29%), Positives = 43/82 (52%), Gaps = 6/82 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N G + T LF++ A +++++ + PLFLY+ ++A H L+APQ
Sbjct: 188 LRDNTGLVTHENGTHNTMLFSERAERIVKNHNPESPLFLYVPYMAPHF-----PLQAPQG 242
Query: 61 TINQFQYITDPNRRTYAALTKS 82
Q + D +RRT+A + +
Sbjct: 243 FEEAVQ-LDDTDRRTFAGMMAA 263
>gi|391330456|ref|XP_003739676.1| PREDICTED: arylsulfatase B-like [Metaseiulus occidentalis]
Length = 631
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 178/539 (33%), Positives = 271/539 (50%), Gaps = 70/539 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM-QGPP 150
G++D+SFHG+ +IPTPN+DA+A +G+ILN YA TPSR + TGK P+ G+ +GP
Sbjct: 53 GYDDVSFHGNEQIPTPNLDAMAADGVILNRHYAAMSGTPSRGAFFTGKLPLRIGLNEGPI 112
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G G+ L LP Y+R+LGY T IG+W LGF++ P RGF++H+G +
Sbjct: 113 LKGVYGTGLSLEHEVLPFYMRDLGYETHLIGRWGLGFYKESLLPTNRGFDTHYGPYSDSA 172
Query: 211 SYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
SY H+ + +++ + +D+ R+ + G Y TDL+ ++IE + KPLF
Sbjct: 173 SYSSHLSREDAWKSLSVPPAYDLHRDGKPDFSGFGSYVTDLYKGRFERIIEQR--RKPLF 230
Query: 270 LYLAHLAAHAGNAGKHLEAPQET--INQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+ L+H H + G + P T +QF +I D +RR+YA V+ LDDS ++ LQR
Sbjct: 231 IVLSHQTPHGASFGPLHQPPPRTNRASQFLHIKDRSRRSYARTVESLDDSFAHMVDVLQR 290
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+L S+I+F SDNG T + ++ SN+P RG KNT WEGGV+V + +WS QI
Sbjct: 291 NNILNESVILFSSDNGGTTF---KDPYFKTGASNWPLRGQKNTQWEGGVRVSSFVWSSQI 347
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ P VS ++ H DWLPT GGD L +IDG D W S++ T
Sbjct: 348 TE-PYVSEELYHFVDWLPTFRRLGGGDIGDLK-DIDGFDIWDSIVNRT------------ 393
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
T S R +L+ D R V + ++G + + G+ ++N
Sbjct: 394 --------TASPRTKILLAYD---RDFTVDSPGFAAIIGKFKY-VSGKWSGRDQTNHYVP 441
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDK---MRSTRQQATIHC--GAN--------- 553
+F +S+ + L+ S+ ++ + +I + R+QA I C GA+
Sbjct: 442 ASFKYSEDSQVLERLRNDSK-VWKNLRDIGRPGPAADWREQARIKCSWGAHLTCNGVSCD 500
Query: 554 ----------------PAPMTPSPCTN---GPCYLFNLGNDPCEQNNIASSRPDISSQL 593
P+ S + PC LF+L NDPCE NIA+ P I +
Sbjct: 501 KIMETFRGSRGNVEIRVEPVNVSHACDFAHEPC-LFDLENDPCETTNIANDMPQIRDDI 558
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 27/84 (32%), Positives = 43/84 (51%), Gaps = 4/84 (4%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+ R+ + G Y TDL+ ++IE + KPLF+ L+H H + G + P
Sbjct: 195 LHRDGKPDFSGFGSYVTDLYKGRFERIIEQR--RKPLFIVLSHQTPHGASFGPLHQPPPR 252
Query: 61 T--INQFQYITDPNRRTYAALTKS 82
T +QF +I D +RR+YA +S
Sbjct: 253 TNRASQFLHIKDRSRRSYARTVES 276
>gi|241025894|ref|XP_002406215.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215491896|gb|EEC01537.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 437
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 142/400 (35%), Positives = 223/400 (55%), Gaps = 34/400 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS +IP PNIDALA +G+ILN YAQP SR L+TG YP TG+ G +
Sbjct: 33 GWADVGFHGSRQIPVPNIDALAADGVILNKYYAQPWPLSSRIGLLTGIYPYRTGV-GRVM 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P R LP + + LGY T +G W+LGF+++ +TP+ RGF++ + G
Sbjct: 92 LPCQPVALPSVFRILPTFFKSLGYRTHFVGVWNLGFYKKRFTPVNRGFDTAYAKWTGPGD 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA--VQLIEDQPVDKPLF 269
Y+ H + ++ G D+R N W+ G Y+T LFT+ A + + + +PL
Sbjct: 152 YWTHDMQ------TKMQGFDLRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFVLHQPLL 205
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
L L+H A H N L+ P E ++ F +I D RR +A + ++D S+G + AL +
Sbjct: 206 LILSHQALHTANYHGSLQCPLEHLDSFGFIRDRKRRIFAGALTEVDQSLGQIFEALHNRR 265
Query: 330 MLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
+L N+I++F+S +G P ++ ++ N+P RG K T +EGGV+VPA +WSP +
Sbjct: 266 LLNNTILVFVSASGGMPVGDFTNNLSF-----NFPLRGTKGTCFEGGVRVPAFVWSPLLN 320
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ R S Q+MH++DWLPTL++AAGGD +L D +D ++
Sbjct: 321 KTRRTSSQLMHVTDWLPTLFSAAGGDIKKLGAVFD-------------------LDSVNM 361
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
W +L + S R ++ NID A++ ++K++ T+
Sbjct: 362 WEALSGDLQSPRTEIIHNIDLAGHNFALQEGNYKVIAVTK 401
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 30/84 (35%), Positives = 45/84 (53%), Gaps = 3/84 (3%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEA--VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 58
+R N W+ G Y+T LFT+ A + + + +PL L L+H A H N L+ P
Sbjct: 166 LRLNDDLMWNQTGVYSTRLFTERADPTRKLCFFVLHQPLLLILSHQALHTANYHGSLQCP 225
Query: 59 QETINQFQYITDPNRRTYA-ALTK 81
E ++ F +I D RR +A ALT+
Sbjct: 226 LEHLDSFGFIRDRKRRIFAGALTE 249
>gi|432879612|ref|XP_004073512.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 673
Score = 266 bits (681), Expect = 2e-68, Method: Compositional matrix adjust.
Identities = 149/403 (36%), Positives = 232/403 (57%), Gaps = 28/403 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H S++I TP +D LA G+ L N Y QP+CTPSR+ +TG+Y IHTG+Q I
Sbjct: 59 GFNDIGYH-SSDIKTPTLDKLAAKGVKLENYYIQPICTPSRSQFITGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P + LP+ L++LGYST +GKWHLGF+++E P RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQVTLPQRLQQLGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFL 270
YY + D EL G D+ S AWD G+Y+T LFT+ +++ P +PLF+
Sbjct: 178 YYTYSSCD----GPELCGFDLHEGESVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFI 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H+ L++P+E I + + + RR YAAMV +D++V + AL++ G
Sbjct: 234 FLSFQAVHS-----PLQSPKEYICPYSGLENVARRKYAAMVSTVDEAVRNITYALRKYGY 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NS++IF +DNG + + GSN+P RG K T WEGG++ + SP +++
Sbjct: 289 YQNSVLIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGIRGLGFVHSPLLKKR 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGG+ S + +DG + W ++ S R + +D
Sbjct: 340 KRVSKALVHITDWYPTLVGLAGGNES-MTDGLDGYNVWEAISEGKESPRLEILHNIDP-- 396
Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENG 491
L +R S+ I A++R+ WKL+ G +G
Sbjct: 397 ---LFNHARSGSLQQGFGIWNTAVQASIRMGDWKLLTGNPGHG 436
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 26/79 (32%), Positives = 45/79 (56%), Gaps = 6/79 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AWD G+Y+T LFT+ +++ P +PLF++L+ A H+ L++P+E I
Sbjct: 199 SVAWDQGGKYSTHLFTQRVRKILARHDPQSQPLFIFLSFQAVHS-----PLQSPKEYICP 253
Query: 65 FQYITDPNRRTYAALTKST 83
+ + + RR YAA+ +
Sbjct: 254 YSGLENVARRKYAAMVSTV 272
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 34/64 (53%), Gaps = 1/64 (1%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LFN+ DP E+ +++ RPD+ QL L Y+ RT VP D +ADP W
Sbjct: 464 VWLFNISEDPYERFDLSEQRPDVVKQLLARLAYYNRTAVPVRFPSED-PRADPHLNGGAW 522
Query: 627 SPWI 630
PW+
Sbjct: 523 VPWV 526
>gi|260832084|ref|XP_002610988.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
gi|229296357|gb|EEN66998.1| hypothetical protein BRAFLDRAFT_246447 [Branchiostoma floridae]
Length = 494
Score = 265 bits (678), Expect = 4e-68, Method: Compositional matrix adjust.
Identities = 172/542 (31%), Positives = 253/542 (46%), Gaps = 89/542 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + ++ TP +D LA+ G+ILN Y VCTPSR + MTG +P H G Q
Sbjct: 34 GWNDVGWH-NPDVRTPVLDQLAHEGVILNQSYVNYVCTPSRTAFMTGYFPYHVGSQHLVF 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P + FLPE L+ LGY+T +GKWHLGF ++TP +RGF+S++GY +G
Sbjct: 93 RPDQPSAILSNFTFLPEKLKSLGYATHMVGKWHLGFCNWKFTPTFRGFDSYYGYYSGAED 152
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H S + G D N D G Y+ LF++ A ++ + PLFLY
Sbjct: 153 YFTHFRSIRNGT----GGIDFHDNKDVVTDQNGTYSAYLFSQRAADIVNKHDPNTPLFLY 208
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L HA LE P+ + + + D NRR MV LD++VG V A+++ G+
Sbjct: 209 LPFQNVHA-----PLEVPKRFEDMYANVQDENRRKLLGMVSALDEAVGNVTMAMKKTGLW 263
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+++IF +DNG ++ G+NYP RG K TLWEGG + A +Q+
Sbjct: 264 DNTLLIFSTDNGGMVIQS---------GNNYPLRGGKTTLWEGGTRGVAFAHGKMLQKTG 314
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+ +M+H DW PTL AAGG + +DG+ Q +
Sbjct: 315 YTNNEMIHAVDWFPTLLAAAGGTA------------------------DPGMDGVSQLDT 350
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+L PS R + NID+ T
Sbjct: 351 ILSGKPSPRTEFVYNIDDIFPT-------------------------------------- 372
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP-MTPS-PCTNGPC-- 567
V+ T+ Q P +++ + A I G P P MT S P + P
Sbjct: 373 --VQDTTFAGGGHKYQKSMSPPVSVEYYKLIEGFAGIFNGWYPPPNMTASHPNSGDPING 430
Query: 568 -YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LFNL +DP E +N+A S PD ++ L +R+TLVP + + D ++DPK + W
Sbjct: 431 TWLFNLKDDPTEHHNLADSMPDKLKEMQAKLDGYRKTLVPAINPKSD-PKSDPKNWGGAW 489
Query: 627 SP 628
SP
Sbjct: 490 SP 491
>gi|291227280|ref|XP_002733615.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 265 bits (676), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 173/543 (31%), Positives = 267/543 (49%), Gaps = 82/543 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +HGS I TP+IDALA G+ L+N Y +CTPSR+ LMTG+Y IHTG+Q I
Sbjct: 33 GWSDVGYHGS-VIKTPHIDALASEGVKLDNYYTSLLCTPSRSQLMTGRYEIHTGLQHRTI 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P+ E LP+ L++ GY+T +GKWHLGF+R+E P RGF++ G+ +
Sbjct: 92 DMMQPLCLPIDETILPQKLKDRGYATHMVGKWHLGFYRQECLPNNRGFDTFMGFYQAMGD 151
Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H +S T + NG D RR N A G+Y+T +F EA +I D PLFL
Sbjct: 152 YYYHNVS-----TGKFNGWDFRRDNDVIAERYAGQYSTHVFADEARDIISKHNPDVPLFL 206
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYIT--DPNRRTYAAMVKKLDDSVGTVISALQRK 328
+L+ A H L+ P + + + +RRTYA MV +D++VG V++ L+
Sbjct: 207 FLSFQAIHFP-----LQVPSRYADIYNTLIPNSADRRTYAGMVTCMDEAVGGVVATLKHS 261
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ ENS+++F +DNG + G N+P RG K +L+EGGV+ A + SP +
Sbjct: 262 GLWENSVLVFSTDNGG----------IHSLGGNWPLRGGKASLYEGGVRGVAFVTSPLLP 311
Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
R ++ +++H++DW PTL AGG N +DG
Sbjct: 312 SRVRGTINKELLHMTDWFPTLVRLAGGHL----------------------MGNKRLDGY 349
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
+QW ++ TPS RN +L+ I+ KR + N +D Y T S +
Sbjct: 350 NQWDTISQETPSLRNEILLTINPLKR---------------KPNHFVDDPYADTDSFDIL 394
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
+ A + ++ + + N S + H
Sbjct: 395 M---RAAIRVGDWKLITGPTGNGSWVTPAEFGFNSLHPEERTHS---------------- 435
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
L+N+ DP E+++ + RPDI L + + ++ +T VP + ++ +ADP W
Sbjct: 436 VELYNVIADPNERHDYSRDRPDIVGALLDRIGHYYKTSVPVLNPPQEVTRADPALHGGAW 495
Query: 627 SPW 629
SPW
Sbjct: 496 SPW 498
>gi|115644393|ref|XP_781330.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 588
Score = 264 bits (675), Expect = 8e-68, Method: Compositional matrix adjust.
Identities = 186/547 (34%), Positives = 269/547 (49%), Gaps = 69/547 (12%)
Query: 92 GWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
G+ND+ +H G + I TPNID +AY+G+ L N Y QPVCTP+R+ L+TG+Y IHTGMQ
Sbjct: 105 GYNDVGYHAKYGRSMIRTPNIDEMAYSGVRLENYYVQPVCTPTRSQLITGRYQIHTGMQH 164
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
++ P +PL E L + L++ GYST A+GKWHLG+ ++ P RGFES FG + G
Sbjct: 165 LNLFPGRPCCLPLDETTLAQALKKQGYSTHAVGKWHLGYAWKDCLPSRRGFESFFGNIMG 224
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
++ H + + + + G M N W G ++T L+T A QLI QP +KPL
Sbjct: 225 SADHWSHNKTALFGDKLVM-GKSMYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPL 283
Query: 269 FLYLAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FLYL++ A H N + P E I I + RR YA +V LD++V V AL+
Sbjct: 284 FLYLSYEAVHTPLNVPEQYAKPYEGI-----IHNSKRRRYAGLVNILDEAVRNVTEALKY 338
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ +NS+IIF +DNG R+ G+N+P RG K+TLWEGG++ + SP I
Sbjct: 339 NGLYDNSVIIFTTDNGG-------RPKPRSVGNNWPLRGGKSTLWEGGIRGVGFVHSPLI 391
Query: 388 QQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
R V+ Q++H+SDW PT+ G L+ N P +DG
Sbjct: 392 PWELRGTVNRQLIHVSDWFPTIVXGIAG---------------GKLVTNKP------LDG 430
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
QW ++ T S R+ +L NID A W T+EN G
Sbjct: 431 XHQWKTISKGTESNRHEILHNIDPIYPAA-----HW-----TRENERDFGALSN------ 474
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
L FNA + + +LS LP + + + P M +
Sbjct: 475 --LPFNATMRASIRVGNWKLSTG--LPHEDFWEPPKESEM--------PPEMNDIRWST- 521
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
P L+N+ DP E+ N+A + I +L + L+ ++ T V H P + +PK
Sbjct: 522 PVRLYNIKKDPNERQNMAPYQKKIVYRLLKRLQDYQNTAVTPIHLGPKDERGNPKYHGGA 581
Query: 626 WSPWIYR 632
W PW+ R
Sbjct: 582 WVPWMDR 588
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 32/81 (39%), Positives = 40/81 (49%), Gaps = 6/81 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQ 59
M N W G ++T L+T A QLI QP +KPLFLYL++ A H N + P
Sbjct: 247 MYYNERIYWKHEGTFSTTLYTNRARQLIRKQPRNKPLFLYLSYEAVHTPLNVPEQYAKPY 306
Query: 60 ETINQFQYITDPNRRTYAALT 80
E I I + RR YA L
Sbjct: 307 EGI-----IHNSKRRRYAGLV 322
>gi|47215546|emb|CAG06276.1| unnamed protein product [Tetraodon nigroviridis]
Length = 527
Score = 263 bits (672), Expect = 2e-67, Method: Compositional matrix adjust.
Identities = 151/398 (37%), Positives = 227/398 (57%), Gaps = 28/398 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P + LP+ L+ELGYST +GKWHLGF+++E P RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQITLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D + G D+ S AW G+Y+T L+T+ +++ P +PLF+
Sbjct: 178 YYTYDSCD----GPGVCGFDLHEGESVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFI 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H L+ PQE I ++ + + RR YAAMV +D++V + L++ G
Sbjct: 234 FLSFQAVHT-----PLQCPQEYIYPYRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGY 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+NSIIIF +DNG + + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 YQNSIIIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLRKK 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGGD S + +DG D W ++ S R + +D
Sbjct: 340 RRVSKALVHITDWYPTLVGLAGGDESHIE-GVDGYDVWEAISDGKESPRLEILHNIDP-- 396
Query: 451 SLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLG 486
L +R S+ I A++R WKL+ G
Sbjct: 397 ---LYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG 431
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LFN+ DP E+ +++ RPD+ QL L Y+ RT VP + DL +ADP W
Sbjct: 464 VWLFNISGDPYERFDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDL-RADPHLNGGAW 522
Query: 627 SPWI 630
PW+
Sbjct: 523 VPWV 526
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 29/101 (28%), Positives = 52/101 (51%), Gaps = 7/101 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AW G+Y+T L+T+ +++ P +PLF++L+ A H L+ PQE I
Sbjct: 199 SVAWSQRGKYSTHLYTQRVRKILATHDPQSQPLFIFLSFQAVHT-----PLQCPQEYIYP 253
Query: 65 FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFHGSNEI 104
++ + + RR YAA+ + + I YG ++ ++ I
Sbjct: 254 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYYQNSII 294
>gi|340384741|ref|XP_003390869.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 263 bits (671), Expect = 3e-67, Method: Compositional matrix adjust.
Identities = 178/560 (31%), Positives = 277/560 (49%), Gaps = 104/560 (18%)
Query: 82 STTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGK 139
S + L ++ GWND S+ GS +I TPNID LA GI L Y QP+C+PSR++L+ GK
Sbjct: 23 SPNIILFVIDDLGWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGK 81
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
YP H G+ I P G+ L E + ++L++ GYST A+GKW LG + E+TP YRGF
Sbjct: 82 YPYHLGLAHGVITNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGF 141
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQL 258
++ +GY + YY H ++ G+ D R N D G Y+T LFTK A++
Sbjct: 142 DTFYGYYDADEDYYTH----------KVGGYLDFRNNTDPVKDEDGTYSTFLFTK-AIED 190
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ D P F+Y A+ + H+ LEAP + + + PNR+ + M+ LD+ +
Sbjct: 191 AINAKSDSPFFIYGAYQSVHS-----PLEAPDTYLEKC-HSPYPNRKIFCGMMLALDEGI 244
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ S LQ KG+L+++III +DNG T SN+P RG K T++EGGV+
Sbjct: 245 SNITSLLQTKGLLDDTIIILTTDNGGQTALGS---------SNWPLRGNKATVFEGGVRG 295
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ +WS +++++ + MMHI+DW PT+ G T L+ GL
Sbjct: 296 ISFVWSTKLRKSNYDNNAMMHITDWYPTIIEGIAGMT----LDTKGL------------- 338
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK---RT-----AAVRLDSWKLVLGTQEN 490
DG + W ++ +TPS R +L +D K RT AA+R WKL++G
Sbjct: 339 -----DGFNMWDTINDDTPSPRTEILHQLDPPKYDERTQFIGQAALRQGDWKLIIGQPNC 393
Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
G + + ++ N +E
Sbjct: 394 SEKLGPHTVYDLCPIGWIHLNGTIEEPE-------------------------------- 421
Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
NP+ +LFN+ +DP E+N +++S P+I S+L EL++ + T + Q +
Sbjct: 422 -VNPS----------LTWLFNVKDDPNERNELSASHPEIVSKLKELIEAYNATHIVQM-D 469
Query: 611 QPDLVQADPKRFNDTWSPWI 630
P ++DPK+F + W+PW+
Sbjct: 470 PPIDPRSDPKKFGNIWTPWL 489
>gi|291222022|ref|XP_002731018.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 1410
Score = 262 bits (670), Expect = 4e-67, Method: Compositional matrix adjust.
Identities = 145/419 (34%), Positives = 232/419 (55%), Gaps = 66/419 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +HGS+ I TP++D LA G L N Y +C+PSR +TGK+ IH GM+G I
Sbjct: 38 GWNDVGYHGSS-ISTPHMDTLAKEGTKLENYYVAHLCSPSRGMFLTGKHMIHLGMEGGII 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E + +P+ E + + L+ YST AIGKWH+G++++ P RGF++ FG + G
Sbjct: 97 MPFERKCLPVNEATIAQELKLKNYSTHAIGKWHVGYYKKACLPNNRGFDTFFGIIGGCAD 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+Y H + + EL R N+S A + G Y+TDL+ +EA +I + KP+F+Y
Sbjct: 157 HYTH----KNTHWWEL----YRNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMY 208
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA AAH L+AP++ I+ + I DP+RR YAAMV +DD++G V+ L+ G+
Sbjct: 209 LAFQAAHLP-----LQAPRKYIDMYSNIEDPDRRVYAAMVTCMDDAIGNVVDQLREAGLW 263
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
N+++I +DNG + G+N+P +G K +LWEGGV+ A + SP + +
Sbjct: 264 NNTVLIVSTDNGGAKIS----------GNNWPLKGSKASLWEGGVRGVAFVTSPFLADHV 313
Query: 392 R--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R + Q+MH++DW PT+ AG +S ++ ++ +DG++QW
Sbjct: 314 RGTSNHQLMHVTDWFPTMLHVAG--------------VYSDVI------KSMTLDGVNQW 353
Query: 450 SSLLLNTPSRRNSVLINIDEKKRT--------------------AAVRLDSWKLVLGTQ 488
+++ N + RN VLIN++ ++ +A+RL +WKL+ G+Q
Sbjct: 354 NTISENRQTERNDVLINLNVNRKVKHVHYEEPWLDNEHFDVRIKSALRLGNWKLLTGSQ 412
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 33/79 (41%), Positives = 49/79 (62%), Gaps = 5/79 (6%)
Query: 2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
R N+S A + G Y+TDL+ +EA +I + KP+F+YLA AAH L+AP++
Sbjct: 170 RNNISIAQEYQGHYSTDLYAREATNVIRNHDASKPMFMYLAFQAAHLP-----LQAPRKY 224
Query: 62 INQFQYITDPNRRTYAALT 80
I+ + I DP+RR YAA+
Sbjct: 225 IDMYSNIEDPDRRVYAAMV 243
>gi|432904444|ref|XP_004077334.1| PREDICTED: arylsulfatase I-like [Oryzias latipes]
Length = 572
Score = 262 bits (669), Expect = 5e-67, Method: Compositional matrix adjust.
Identities = 173/542 (31%), Positives = 268/542 (49%), Gaps = 74/542 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS ++ TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 55 GYADIGYHGS-DVHTPVLDQLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 113
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P LPE L + GY T +GKWHLGF R P RGF+S FG L G
Sbjct: 114 RPRQPLCLPPDIPTLPECLLKAGYHTHMVGKWHLGFCRPSCLPTRRGFQSFFGTLTGSGD 173
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ + Q E G D+ AW+ G Y+T L+ + Q++ + + PLFLY
Sbjct: 174 HFSY----QSCDGAEACGFDLHDGSRPAWEMRGNYSTLLYIERVKQILRNHDPNTPLFLY 229
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+ AAH L+ P +N ++ + +RR YAAM+ LDD V V+ L+ G+
Sbjct: 230 LSLQAAHT-----PLQVPAHFLNPYKSQNNRHRRHYAAMLSCLDDGVAQVVKELKTTGIY 284
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NSI+I+ SDNG + + GSN+P RG K + WEGG++ + SP +++
Sbjct: 285 DNSILIYSSDNGGQPL---------SGGSNWPLRGGKGSYWEGGIRAVGFVHSPLLKKKG 335
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+S +M+H+SDW PTL AG S +DG D W S+ +PS R+ + +D S
Sbjct: 336 VISKEMIHVSDWYPTLLGLAGALQSH--NGLDGHDVWESISEGSPSPRSEILFNIDP-VS 392
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
P + VL I + AA+R+ W L+ G +G + +PL +
Sbjct: 393 RRPGEPYEKALVLNGFGIWDTAVRAALRVGDWNLLTGNVGDG-----------DWIPLQS 441
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
E +Q L++ R + ++ +L
Sbjct: 442 LTGGPER--WQKLEK---------------RRNKMGKSV-------------------WL 465
Query: 570 FNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
FN+ DP E++++A P++ L L +Y++ ++ ++ PD ADP+ WSP
Sbjct: 466 FNVSGDPYERSDLAELHPEVVKHLLTRLAEYNQTAMI--ANNPPDDPMADPELHGGVWSP 523
Query: 629 WI 630
W+
Sbjct: 524 WL 525
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 40/72 (55%), Gaps = 5/72 (6%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
AW+ G Y+T L+ + Q++ + + PLFLYL+ AAH L+ P +N ++
Sbjct: 197 AWEMRGNYSTLLYIERVKQILRNHDPNTPLFLYLSLQAAHT-----PLQVPAHFLNPYKS 251
Query: 68 ITDPNRRTYAAL 79
+ +RR YAA+
Sbjct: 252 QNNRHRRHYAAM 263
>gi|291239530|ref|XP_002739676.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 507
Score = 261 bits (668), Expect = 6e-67, Method: Compositional matrix adjust.
Identities = 176/550 (32%), Positives = 262/550 (47%), Gaps = 95/550 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +H S I TPNID LA G+ L N Y P+CTP+RA LMTG+Y IHTGMQ +
Sbjct: 40 GWHDVGYHDSI-IRTPNIDKLAAEGVKLENYYVTPLCTPTRAVLMTGRYQIHTGMQHGVL 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPR +P E +P+ L+E GY+T +GKWHLGF++ TP +RGF++ FG
Sbjct: 99 MAQEPRCLPTDEVLMPQKLKESGYTTHMVGKWHLGFYKWACTPNHRGFDTFFGMYLAGGD 158
Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y++H + L D+R + A + VG+Y+T +F ++A ++I+ PLFL
Sbjct: 159 YFNHT---RLCHGRRLAAWDLRDGDQVVAPEYVGKYSTIVFAEKAQEIIKKHDPTNPLFL 215
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL+ A HA L+ P+ IN ++ I D +RR YA M +D+++G V L R+G
Sbjct: 216 YLSFQAVHAP-----LQVPERYINMYKDDIRDESRRIYAGMTTCMDEAIGNVTRTLARRG 270
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+L+N+II+F SDNG + SNYP RG K TL+EGGV+ PA ++SP +
Sbjct: 271 LLDNTIIVFSSDNGGAKL---------YGASNYPLRGQKGTLYEGGVRAPAFVYSPLLSN 321
Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
R + ++M++ DW PT AGG N+ G+ +DG++
Sbjct: 322 RVRGTATHELMYVGDWFPTFLHLAGG-------NLHGVKP---------------LDGVN 359
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
QW ++ PS KRT V N PL
Sbjct: 360 QWHTVSAAMPS------------KRTEIVH-------------------------NIEPL 382
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
V+ + + + N + I + +R + +P P G
Sbjct: 383 RGKPPRVKDEPWLT------NPYFDIRVMASIRVGEWKLITGRKGDPTWTIPPELGGGSV 436
Query: 568 --------YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
L+++ DP E+ +++ RPDI ++L E L+YH T P +DP
Sbjct: 437 DPGRPTIVELYHIKEDPLERTDLSRDRPDIVNKLLERLRYHSDTSRLPETPTPVFDTSDP 496
Query: 620 KRFNDTWSPW 629
N W PW
Sbjct: 497 LHHNMAWGPW 506
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 27/70 (38%), Positives = 42/70 (60%), Gaps = 6/70 (8%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
VG+Y+T +F ++A ++I+ PLFLYL+ A HA L+ P+ IN ++ I D
Sbjct: 188 VGKYSTIVFAEKAQEIIKKHDPTNPLFLYLSFQAVHAP-----LQVPERYINMYKDDIRD 242
Query: 71 PNRRTYAALT 80
+RR YA +T
Sbjct: 243 ESRRIYAGMT 252
>gi|410906623|ref|XP_003966791.1| PREDICTED: arylsulfatase J-like [Takifugu rubripes]
Length = 560
Score = 261 bits (666), Expect = 9e-67, Method: Compositional matrix adjust.
Identities = 172/541 (31%), Positives = 271/541 (50%), Gaps = 74/541 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 49 GFRDVGYHGS-EIKTPTLDRLAAQGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSII 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP L++ GY+T +GKWHLGF++R P RGF++ FG L G
Sbjct: 108 RATQPNCLPLENVTLPLKLKQAGYATHMVGKWHLGFYKRGCLPTQRGFDTFFGSLLGSGD 167
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLI-EDQPVDKPLF 269
+Y H + + G+D+ AW+ G Y+T +FT++A+ ++ + P KPLF
Sbjct: 168 HYSHYKCE----APGMCGYDLYEGEEAAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLF 223
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYLA+ A H+ L+ P + +++ I++ +RR YAAMV LD+++ + AL+R G
Sbjct: 224 LYLAYQAVHSP-----LQVPSRYLERYKGISNVHRRKYAAMVSCLDEAIRNLTLALKRYG 278
Query: 330 MLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
+N+++++ SDNG P + GSN+P RG K + WEGG++ + SP ++
Sbjct: 279 YYDNTVLVYSSDNGGQPLLG----------GSNWPLRGSKASYWEGGIRAVGFVHSPLLR 328
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
++HI+DW PTL + G T LN+DG D W ++ PS R+ + +D
Sbjct: 329 NKGTKCRSLIHITDWFPTL-VSLGEGTLEEHLNLDGYDVWEAISEGRPSPRHDILHNID- 386
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
+ + + I AA+R+ WKL+ G + GY S+ VP
Sbjct: 387 --PIYIKAKNGSWKAGYGIWNTAIQAALRVGHWKLLTG------IPGY-----SDWVPPQ 433
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
F S QQ+ D+ +S +
Sbjct: 434 TF----------SNQQMGSRWHNEHIRFDRGKS-------------------------LW 458
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ DP E+ +++ P I ++ L + +T VP + DL +++P+ W P
Sbjct: 459 LFNITADPYERVDLSQRYPQIVKKMLIRLAQYNKTAVPVRYPAKDL-RSNPQYNGGVWGP 517
Query: 629 W 629
W
Sbjct: 518 W 518
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 27/76 (35%), Positives = 47/76 (61%), Gaps = 7/76 (9%)
Query: 7 TAWDT-VGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
AW+ G Y+T +FT++A+ ++ + P KPLFLYLA+ A H+ L+ P + +
Sbjct: 190 AAWEQDRGLYSTVMFTQKAISILAKHDPHRKPLFLYLAYQAVHSP-----LQVPSRYLER 244
Query: 65 FQYITDPNRRTYAALT 80
++ I++ +RR YAA+
Sbjct: 245 YKGISNVHRRKYAAMV 260
>gi|340373449|ref|XP_003385254.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 491
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 178/551 (32%), Positives = 272/551 (49%), Gaps = 107/551 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND S+ GS +I TPNID LA GI L Y QP+C+PSR++L+ GKYP H G+ I
Sbjct: 35 GWNDTSYQGS-DIQTPNIDKLAEEGIRLKQYYVQPLCSPSRSALLAGKYPYHLGLAHGVI 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+ L E + ++L++ GYST A+GKW LG + E+TP YRGF++ +GY +
Sbjct: 94 TNGHPYGLGLNETTIADHLKKGGYSTHAVGKWDLGMHKWEFTPTYRGFDTFYGYYDADED 153
Query: 212 YYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H ++ G+ D R N D G Y+T LFTK A++ + D P F+
Sbjct: 154 YYTH----------KVGGYLDFRNNTDPVKDEDGTYSTFLFTK-AIEDAINAKSDSPFFI 202
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y A+ + H LEAP +N+ I PNR+ + M+ LD+ + + S LQ KG+
Sbjct: 203 YGAYQSVHG-----PLEAPDIYLNKCN-IPYPNRKIFCGMMLALDEGISNITSLLQTKGL 256
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+++III +DNG T SN+P RG K T++EGGV+ + +WS +++++
Sbjct: 257 LDDTIIILTTDNGGQTALGS---------SNWPLRGNKATVFEGGVRGISFVWSTKLRKS 307
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ MMHI+DW PT+ G T L+ GL DG + W
Sbjct: 308 NYDNNAMMHITDWYPTIIEGIAGMT----LDTKGL------------------DGFNMWD 345
Query: 451 SLLLNTPSRRNSVLINID----EKKRT-----AAVRLDSWKLVLGTQENGTMDGYYGQTR 501
++ +TPS R +L +D + +R AA+R WKL++G G +
Sbjct: 346 TINDDTPSPRTEILHQLDPPRYDNERNPFIGQAALRQGDWKLIIGQPNCSEKIGPH--LV 403
Query: 502 SNKVP--LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
NK P ++ N +E NP+
Sbjct: 404 GNKCPNGWVHLNGTIEEPE---------------------------------VNPS---- 426
Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
+LFN+ +DP E+N +++S P+I S+L E ++ + T + Q D ++DP
Sbjct: 427 ------LTWLFNVKDDPNERNELSASHPEIVSKLKERIEAYNATHIVQMSPPID-PRSDP 479
Query: 620 KRFNDTWSPWI 630
K+F + W+PW+
Sbjct: 480 KKFGNVWTPWL 490
>gi|126697470|gb|ABO26692.1| sulfatase 1A precursor [Haliotis discus discus]
Length = 477
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 170/538 (31%), Positives = 259/538 (48%), Gaps = 97/538 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ + +I TP +D LA +G+ILN+ Y QP+C+PSR M+GK+P HTG+Q +
Sbjct: 35 GWNDVGWINP-DIKTPTLDRLARSGVILNSSYVQPLCSPSRNCFMSGKFPYHTGLQDKVV 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ +P+ +P +P+ L+ LGY T +GKWHLGF +YTP YRGF++ GY G+
Sbjct: 94 FIEQPKYMPANLTTIPQRLKTLGYDTHMVGKWHLGFCNWKYTPTYRGFDTFMGYYAGMED 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H+ + + NG+D R GEY+T +F A+ +I + KP++LY
Sbjct: 154 YFTHVRDE----VADYNGYDFRNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLY 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L A H L+ P + +++ +I D NR+ Y+ MV LDD+V + + L+ G +
Sbjct: 210 LPFQAVHV-----PLQVPTKYSDRYSHIHDLNRKVYSGMVSALDDAVYNITTVLEDLGFM 264
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-IQQN 390
+N +++F +DNG P T NW P RG KNTLWEGG + A ++S +++
Sbjct: 265 DNLLLVFTTDNGGP-----PTRGANNW----PLRGGKNTLWEGGTRGAAFVYSKTLLKKK 315
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
+ H DW PTL AGGD+S+L +IDGL Q
Sbjct: 316 GYTHPGLFHAVDWYPTLLDIAGGDSSQL-----------------------DIDGLSQVD 352
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
+L S RN + NI + AA+R KL++G N G+Y P F
Sbjct: 353 MILNGDSSVRNQFVYNIYDDHYAAALRWGDLKLIVG---NPGYSGWY--------PPPGF 401
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
+ V PI + TP P L+
Sbjct: 402 DLDV-----------------PIMDD---------------------TPLPS----VMLY 419
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
N+ DP E +I+ PD+ + L ++L T VP + + P +++P F+ +WSP
Sbjct: 420 NISTDPTEHEDISKDYPDLVNFLLQMLARKNGTRVP-AMDPPSDPRSNPVNFDGSWSP 476
Score = 45.8 bits (107), Expect = 0.075, Method: Compositional matrix adjust.
Identities = 23/81 (28%), Positives = 43/81 (53%), Gaps = 6/81 (7%)
Query: 3 RNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
RN++ + GEY+T +F A+ +I + KP++LYL A H L+ P +
Sbjct: 171 RNMTDVYKGAQGEYSTYVFANRAIDIIMNHDKSKPMYLYLPFQAVHV-----PLQVPTKY 225
Query: 62 INQFQYITDPNRRTYAALTKS 82
+++ +I D NR+ Y+ + +
Sbjct: 226 SDRYSHIHDLNRKVYSGMVSA 246
>gi|291239534|ref|XP_002739678.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 648
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 169/544 (31%), Positives = 275/544 (50%), Gaps = 83/544 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +H S I TPNID LA G+ L N Y P+CTP+RA LMTG+Y IHT MQ +
Sbjct: 39 GWHDVGYHDSI-IRTPNIDKLAAEGVKLENYYVTPICTPTRAVLMTGRYQIHTTMQHGVL 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R +P E +P+ L+E GYST +GKWHLGF++ + TP +RGF++ FG+
Sbjct: 98 MAQEQRCLPTDEVLMPQKLKESGYSTHMVGKWHLGFYKWDCTPNHRGFDTFFGFYLAGGE 157
Query: 212 YYDHILSDQYSRTVELNGHDMRR-NLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y+ H R L+ D+R + + GEY+T L+ ++A ++I Q + P+FL
Sbjct: 158 YFTHTRKCHGHR---LDAWDLRDGDKMVGPEYTGEYSTMLYARKAQEVIRKQDPNVPMFL 214
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
Y++ A HA LE P + + + I D +R+ YA M +D+++G V L+ +G
Sbjct: 215 YVSFQAVHAP-----LEVPDSYADAYGKDIYDQSRKLYAGMTSCMDEAIGNVTQTLKDRG 269
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
M +N++++F +DNG T +G SNYP RG K +EGGV+ PA + SP ++
Sbjct: 270 MWDNTVVVFSTDNGGAT----------GFGASNYPLRGQKCGHYEGGVRGPAFVSSPLLK 319
Query: 389 QNPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ R + ++M++ DWLPT AGG T +DG+
Sbjct: 320 PHVRGTKNTELMYVGDWLPTFVHLAGGTT----------------------HGTKPLDGV 357
Query: 447 DQWSSLLLNTPSRRNSVLINIDE-KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
+QW ++ PSRR +L NID + ++ ++ + W ++ Y+ +
Sbjct: 358 NQWEAISEGLPSRRKEILHNIDPIRAKSPTLKTEPW----------MLNPYFDIRVMAAI 407
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
+ ++ + K + Q+ + + N R T +P TN
Sbjct: 408 RVGHWKLLTGQKGNANWQEPPE-----LGN----RLT-----------------NPYTNN 441
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDT 625
L+++ DP E+N+I+++RPD+ ++ L+ T V +H P+ +DP + +
Sbjct: 442 ILQLYHIKEDPEERNDISATRPDVVYRILLRLQALNSTAVLPTHVDPEFSLSDPVKHDMA 501
Query: 626 WSPW 629
WSPW
Sbjct: 502 WSPW 505
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 45/81 (55%)
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
H A ++ S +N L+++G DP E+ I+S RPD+ +L L+YH T V
Sbjct: 567 HLPAEIGGISRSSYSNEVLQLYHIGEDPEERYEISSLRPDVVIRLLRRLQYHDSTAVLPV 626
Query: 609 HEQPDLVQADPKRFNDTWSPW 629
+P+ +++DP + + WSPW
Sbjct: 627 ETEPEFIESDPVKHDMAWSPW 647
Score = 42.4 bits (98), Expect = 0.87, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 40/69 (57%), Gaps = 6/69 (8%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDP 71
GEY+T L+ ++A ++I Q + P+FLY++ A HA LE P + + + I D
Sbjct: 188 GEYSTMLYARKAQEVIRKQDPNVPMFLYVSFQAVHAP-----LEVPDSYADAYGKDIYDQ 242
Query: 72 NRRTYAALT 80
+R+ YA +T
Sbjct: 243 SRKLYAGMT 251
>gi|405975641|gb|EKC40195.1| Arylsulfatase B [Crassostrea gigas]
Length = 684
Score = 261 bits (666), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 175/557 (31%), Positives = 266/557 (47%), Gaps = 94/557 (16%)
Query: 73 RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
R+ A + + GWND+ FH + TPNID LA G+ILN Y QP+C+PSR
Sbjct: 218 RKQVEAKSPHILFIVADDLGWNDVGFHNP-AMKTPNIDKLAREGLILNQTYLQPLCSPSR 276
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+LMTG YP H G+Q I +P PL +FLP+ L+++GY+T +GKWHLGF
Sbjct: 277 HALMTGYYPYHAGLQHLVILPWQPVCSPLKMKFLPQRLKDIGYATHMVGKWHLGFCSWNC 336
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
TP YRGF+S FGY N +Y H N D R N + G Y+T F
Sbjct: 337 TPTYRGFDSFFGYYNAQGDHYSHTW---------YNYLDYRDNEKPVKNLNGTYSTFTFV 387
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
A +I+ +PLFLY+A H ++ P++ + + I RR ++ MV
Sbjct: 388 SRAQDIIKKHNSSQPLFLYMAFQNVH-----DPIQVPKQYEDMYPNIKTRGRRQFSGMVS 442
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTL 371
LD+++G + L+ GM ++++IIF SDNG P + +G+NYP RG K T+
Sbjct: 443 ALDEAIGNITDQLRESGMYDDTLIIFTSDNGGWP----------KYYGNNYPLRGGKFTV 492
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+EGG +V + + +Q+ +MH DW TL AAGG + L DG+ QWSS+
Sbjct: 493 YEGGTRVVSFVHGAGLQKTGEAYESIMHAVDWSDTLVEAAGG---KPDLMGDGVSQWSSI 549
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
N P R + D ++ ++ AA+R+ +KL+
Sbjct: 550 ANNFPGPRTELVYNFDS-----------------DLAPEEGHAAIRVGDYKLI------- 585
Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG 551
DGY G L + + P D++ + + +I
Sbjct: 586 --DGYPG--------------------------LYPDWYKP----DQLYKFKPEPSI--- 610
Query: 552 ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
P T + G LFNL +DP E N+++ SRPDI +++ + L+ + VP ++
Sbjct: 611 ----PETQAHA--GRFQLFNLKDDPTEHNDLSKSRPDIVTEMSQRLRLLTQNAVPPNYPL 664
Query: 612 PDLVQADPKRFNDTWSP 628
+++P +F+D WSP
Sbjct: 665 VPDPKSNPSKFDDVWSP 681
>gi|390364061|ref|XP_792027.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 524
Score = 260 bits (665), Expect = 1e-66, Method: Compositional matrix adjust.
Identities = 176/577 (30%), Positives = 267/577 (46%), Gaps = 98/577 (16%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHG---SNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
+YA + L YG+ND+ +HG + I TPN+D LA G+ L N Y QP+C+P+
Sbjct: 23 SYAEQLPNVVFILADDYGFNDVGYHGRSHGSAILTPNLDMLAGEGVKLENYYVQPICSPT 82
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
R+ LM+G+Y IHTG+Q I +P +PL E LP+ L+E GY+T +GKWH+GF+
Sbjct: 83 RSQLMSGRYQIHTGLQHGVIRPPQPNCLPLDEVTLPQKLKENGYATNMVGKWHIGFYLDA 142
Query: 192 YTPLYRGFESHFGYLNGVISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
P RGF+S+F ++ DH N + + G+Y+
Sbjct: 143 CLPTERGFDSYF-------AWEDHFSCLPXXXXXXXXXXXXXXXXXANKTPVFQYEGQYS 195
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
T LFT + + +IE KPLF+YLA+ A H LE P ++ + ITD NRRTY
Sbjct: 196 THLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKNRRTY 250
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
A MV +D+ VG V AL+ G+ +N+IIIF +DNG P ++ +N+P RG
Sbjct: 251 AGMVSCMDEGVGNVTRALKEAGLYDNTIIIFSTDNGGPI---------QSGANNWPLRGS 301
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K +LWEGG+ + SP + + + V+ +++H+SDWLPT+ G +DG
Sbjct: 302 KGSLWEGGIHGVGFVHSPLLPTSVKGTVNHELIHVSDWLPTIVAGIAGGALNGTKPLDGY 361
Query: 426 DQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNS----------VLINIDEKKR 472
+ WSS+ S R NID L +L PS + VL + +
Sbjct: 362 NVWSSISGTASSPRKEILHNIDSLYIAPEIL---PSYFDGEDFLWRFLARVLEDKFDTSI 418
Query: 473 TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLP 532
AA+R+ WK+ G + S +Q + + P
Sbjct: 419 RAALRVGDWKIKTGPDASA------------------------SDVWQPPAESGLHTIYP 454
Query: 533 ISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ 592
I N +LFN+ DP E+ +++ PD+ +
Sbjct: 455 IEK---------------------------PNQKVWLFNITADPLERTDLSGKYPDVVDE 487
Query: 593 LYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ L+ + + VP + PDL +ADP +W PW
Sbjct: 488 FLQRLEEYYKGSVPVRYPDPDL-EADPALHGGSWGPW 523
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 40/70 (57%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+T LFT + + +IE KPLF+YLA+ A H LE P ++ + ITD N
Sbjct: 192 GQYSTHLFTNKTIDVIERHDKTKPLFIYLAYQAVHF-----PLEVPDSYMDPYMNITDKN 246
Query: 73 RRTYAALTKS 82
RRTYA +
Sbjct: 247 RRTYAGMVSC 256
>gi|410913855|ref|XP_003970404.1| PREDICTED: arylsulfatase I-like [Takifugu rubripes]
Length = 570
Score = 260 bits (665), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 147/396 (37%), Positives = 225/396 (56%), Gaps = 24/396 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H S++I TP +D LA +G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 59 GFNDIGYH-SSDIKTPVLDKLAADGVKLENYYIQPICTPSRSQLITGRYQIHTGLQHSII 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P + LP+ L+ELGYST +GKWHLGF+++E P RGF+++FG L G ++
Sbjct: 118 RPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDTYFGSLTGSVN 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D + G D+ S AW G+Y+T L+T+ +++ P +PLF+
Sbjct: 178 YYTYDSCDGPG----MCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFI 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+L+ A H L+ P+E I ++ + + RR YAAMV +D++V + L++ G
Sbjct: 234 FLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGY 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
ENSI+IF +DNG + + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 289 YENSIMIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLGFIHSPLLRKK 339
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS ++HI+DW PTL AGG S + +DG D W ++ + R + ++ L
Sbjct: 340 KRVSKALVHITDWYPTLVGLAGGKESHIE-GVDGYDVWEAI---SDGRESPRLEILHNID 395
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
L + S I A++R WKL+ G
Sbjct: 396 PLYNHARSGSLQKGYGIWNTAVQASIRAGDWKLLTG 431
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/64 (39%), Positives = 36/64 (56%), Gaps = 1/64 (1%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LFN+ DP E+ +++ RPD+ QL L Y+ RT VP + DL +ADP W
Sbjct: 464 VWLFNISGDPYERLDLSEQRPDVVKQLLARLVYYNRTAVPVRYPTEDL-RADPHLNGGAW 522
Query: 627 SPWI 630
PW+
Sbjct: 523 VPWV 526
Score = 45.8 bits (107), Expect = 0.068, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AW G+Y+T L+T+ +++ P +PLF++L+ A H L+ P+E I
Sbjct: 199 SVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYP 253
Query: 65 FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFH 99
++ + + RR YAA+ + + I YG ++
Sbjct: 254 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYY 289
>gi|156364432|ref|XP_001626352.1| predicted protein [Nematostella vectensis]
gi|156213225|gb|EDO34252.1| predicted protein [Nematostella vectensis]
Length = 270
Score = 260 bits (664), Expect = 2e-66, Method: Compositional matrix adjust.
Identities = 124/256 (48%), Positives = 168/256 (65%), Gaps = 9/256 (3%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM---QG 148
GW+D+SFHGS++IPTP ID LA G+ILN+ Y P+CTP+RASLMTGK+P++ GM
Sbjct: 14 GWDDVSFHGSSQIPTPTIDKLASEGVILNSYYVSPICTPTRASLMTGKHPMNLGMLIHTH 73
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
++G +P G+PL E P+Y++ LGY T IGKWHLGFF +EYTP YRGF+S +G+ NG
Sbjct: 74 ATVFGTQPYGLPLGETTTPQYMKSLGYVTHGIGKWHLGFFEKEYTPTYRGFDSFYGFWNG 133
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
Y+DH S ++ G D+R N + G Y T+LF + A Q+I KPL
Sbjct: 134 KEDYWDH------SSQEDVWGTDLRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPL 187
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
+LYLA H+ N + L+AP+ I +F +I+ P RR YAAMV LD+SV TV AL
Sbjct: 188 YLYLAQQGVHSANGNEPLQAPKRLIKKFSHISSPKRRIYAAMVSSLDESVETVHKALSET 247
Query: 329 GMLENSIIIFMSDNGA 344
GML N++++F +DNG
Sbjct: 248 GMLNNTVLVFTTDNGG 263
Score = 68.2 bits (165), Expect = 1e-08, Method: Compositional matrix adjust.
Identities = 32/82 (39%), Positives = 46/82 (56%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N + G Y T+LF + A Q+I KPL+LYLA H+ N + L+AP+
Sbjct: 151 LRDNEKPVRNESGHYGTELFAERAAQIIHLHNQTKPLYLYLAQQGVHSANGNEPLQAPKR 210
Query: 61 TINQFQYITDPNRRTYAALTKS 82
I +F +I+ P RR YAA+ S
Sbjct: 211 LIKKFSHISSPKRRIYAAMVSS 232
>gi|291232535|ref|XP_002736216.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 784
Score = 259 bits (661), Expect = 4e-66, Method: Compositional matrix adjust.
Identities = 158/413 (38%), Positives = 232/413 (56%), Gaps = 44/413 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +HGS I TPNIDALA G+ L+N Y +CTPSR L++G+Y IHTG+Q I
Sbjct: 53 GWSDVGYHGS-VIKTPNIDALASEGVKLDNYYMSLLCTPSRGQLLSGRYEIHTGLQHRTI 111
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P+ E LP+ L+E GY+T +GKWHLGF+R+E P YRGF++ G+ G+
Sbjct: 112 DMMQPLCLPIDETILPQKLKERGYATHMVGKWHLGFYRKECLPNYRGFDTFMGFYQGMAD 171
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H +S T +G D RRN A G+Y+T +F EA +I + PLFL
Sbjct: 172 YYYHNIS-----TGIYHGWDFRRNNDVIAQKYAGQYSTHVFADEAQIIIMKHNPEVPLFL 226
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYIT--DPNRRTYAAMVKKLDDSVGTVISALQRK 328
+L+ A H L+ P + ++ + + +R+ YA MV +D++VG V++AL+
Sbjct: 227 FLSFQAIHLP-----LQVPSRYADMYKTLIPNNADRQKYAGMVTCMDEAVGRVVTALKHT 281
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ ENS+++F +DNG G N+P RG K +L+EGGV+ + SP +
Sbjct: 282 GLWENSVLVFSTDNGGIL------------GGNWPLRGSKASLYEGGVRGVGFVTSPLLP 329
Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGD-TSRLPLNIDGLDQWSSLLLNTPSRRNS---N 442
R V+ +++H++DWLPTL AGGD PL DG +QW ++ TPS R+
Sbjct: 330 SRVRGTVNRELLHMTDWLPTLVRLAGGDLMGTKPL--DGYNQWETISQGTPSVRDEVLLT 387
Query: 443 IDGLDQWSSLLLNTPSRRN---SVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
ID L + + P N +VLI AA+R+ WKL+ G NG+
Sbjct: 388 IDPLIAKAHHFKDDPYADNDSFNVLIR-------AAIRVGDWKLITGPPGNGS 433
Score = 47.4 bits (111), Expect = 0.022, Method: Compositional matrix adjust.
Identities = 23/63 (36%), Positives = 34/63 (53%)
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
L+N+ DP E+N+ + RPDI L + ++ T VP +++ ADP R WSP
Sbjct: 456 LYNVVADPNERNDYSHDRPDIVRFLLARIGHYYNTSVPVLDPSQEILLADPARHGGVWSP 515
Query: 629 WIY 631
W Y
Sbjct: 516 WRY 518
>gi|291236278|ref|XP_002738066.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 258 bits (660), Expect = 6e-66, Method: Compositional matrix adjust.
Identities = 177/569 (31%), Positives = 272/569 (47%), Gaps = 90/569 (15%)
Query: 77 AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
A+ +S+ ++ V GW+D+ +HGS + TP++D LA GI L N Y QP+CTP+R
Sbjct: 15 CAVARSSQPNIVFVLADELGWHDIGYHGS-RVQTPHLDKLASEGIKLENYYVQPMCTPTR 73
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+ LM+G+Y IHTG+Q I + +PL E + + L+E GYST +GKWHLG + +
Sbjct: 74 SQLMSGRYQIHTGLQHTVINPDQRSCLPLDEVTIAQKLKEAGYSTHMVGKWHLGHYTKGC 133
Query: 193 TPLYRGFESHFGYLNGVISYYDH----ILSDQYSRTVELNGHDMRRN--LSTAWDTVGEY 246
P RGF+S FG+ N + YY + + + + G D+ RN A G Y
Sbjct: 134 LPTKRGFDSFFGFYNCAVDYYTYEKGKFCKFENETVLRMRGTDLWRNDEEHVAPYYQGHY 193
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T + KEA +I KPLFLYLA A H LE P+ + + + D +RR
Sbjct: 194 QTHVLAKEAEDVIRKHNPSKPLFLYLAFGAVHVP-----LEVPKVYEDMYADVKDNSRRI 248
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
AM +DDS+ V AL+ G+ +N+I IF +DNGA T G ++P +G
Sbjct: 249 LLAMATCMDDSIADVTGALKDTGLWDNTIFIFSTDNGAATA---------YGGCSWPLKG 299
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
K +L+EGG++ + SP + + + + +++MH+SDW PTL AGG+ S +DG
Sbjct: 300 GKTSLFEGGIRAVGFVTSPLLPPDMQGTENMELMHVSDWFPTLVHVAGGELSGTK-PLDG 358
Query: 425 LDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
+QW ++ L PS+R NID L + +T +++ AA+R+ W
Sbjct: 359 YNQWETISLGRPSQRTEILINIDPLIPCEACANHTKWDHKKFDVHVH-----AAIRMGEW 413
Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
KL+ G + N V Y ++ +NI+L
Sbjct: 414 KLITGNPHH--------------------NYWVTPAEYPNM----KNIYL---------- 439
Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
P M +L+N+ DP EQ++++ + P I + L + L +
Sbjct: 440 ----------NEPGKMV---------WLYNIAADPNEQDDLSDTFPGIVNILLDRLAEYN 480
Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
T VP D ADP R +W PW+
Sbjct: 481 ETAVPVQFPDCD-TGADPARRGGSWGPWV 508
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 35/112 (31%), Positives = 47/112 (41%), Gaps = 16/112 (14%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y T + KEA +I KPLFLYLA A H LE P+ + + + D +
Sbjct: 191 GHYQTHVLAKEAEDVIRKHNPSKPLFLYLAFGAVHVP-----LEVPKVYEDMYADVKDNS 245
Query: 73 RRTYAALTKSTTLTLLIVYG-------WNDLSFHGSNEIPTPNIDALAYNGI 117
RR A+ ++ V G W++ F S T N A AY G
Sbjct: 246 RRILLAMATCMDDSIADVTGALKDTGLWDNTIFIFS----TDNGAATAYGGC 293
>gi|291225027|ref|XP_002732497.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 461
Score = 257 bits (656), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 177/539 (32%), Positives = 258/539 (47%), Gaps = 84/539 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H S +I PN+ LA +G+I N Y P CTPSRA++MTG YP TG Q +
Sbjct: 2 GWNDVHWHNS-DIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P GVPL + LPE L+E+GYST +GKWHLGF + EY P RGF+SH+G +S
Sbjct: 61 FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVS 120
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YD ++ S + G+D R N+ + +Y + A ++ PLF
Sbjct: 121 DYDK-MNGVLSPSA---GYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPLF-- 173
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA KHLE P+E ++ I D R + + +DD +G V+ AL+ +GM
Sbjct: 174 ---LAFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMY 230
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++S+IIF+ DNGA + + GSNYP+RG+ TL+EG +VP+I+ I++
Sbjct: 231 DDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIKKTG 281
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
S ++ I D T+ AG + +DGL W +
Sbjct: 282 YESNELYSIVDIHRTILDVAGAEAE------------------------PELDGLSMWPT 317
Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
PS RN +L NID+ + AA+R+ +KL+ G + L
Sbjct: 318 WSKGKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPD----------------LLYP 361
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ I + Y + P S + ST QQ+ PAP YL
Sbjct: 362 YRDISLTDNYYNYGD------APASG---LPSTDQQSHA-----PAPQNVE-------YL 400
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
FN+ DP E+N+I++ PDI L L HR+ L+P ++ +L A FND WSP
Sbjct: 401 FNIVVDPEERNDISADHPDIVQDLRARLDEHRKLLIPPVNQTTELAGA-AGNFNDVWSP 458
>gi|291231206|ref|XP_002735556.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 516
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 172/545 (31%), Positives = 268/545 (49%), Gaps = 85/545 (15%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG++D+ +H S I TPN+D LA G+ L N Y QP+CTP+R+ LM+G+Y IHTG+Q
Sbjct: 50 YGFHDIGYHDS-IIKTPNLDRLASEGVKLENYYVQPICTPTRSQLMSGRYQIHTGLQHGI 108
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P +P+ E +P+ L+E GY+T +GKWHLG +++E P RGF++ FGYL G
Sbjct: 109 IWPCQPSCLPINEVTIPQKLKESGYATHIVGKWHLGMYKKECLPTERGFDTFFGYLTGSE 168
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLF 269
YY H S Y + +G D R N+ G+Y+T +F ++A +I+ PLF
Sbjct: 169 DYYTHNRS--YDK---FHGMDFRENMQIVQPQYNGQYSTHVFAEKAKNIIKSHDPQIPLF 223
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL A H L+ P + + IT+ RRTYA MV +D++VG + L+ G
Sbjct: 224 LYLPFQAVHGP-----LQVPDQYEKPYANITNKQRRTYAGMVSCMDEAVGNITQTLKDTG 278
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +NS+IIF +DNG + G+N+P RG K +LWEGG+ + SP I
Sbjct: 279 LYDNSVIIFSTDNGGQVIA---------GGNNWPLRGWKGSLWEGGMHGVGFVNSPLINH 329
Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
R S +++H++DW PT+ AGG +++G +DG +
Sbjct: 330 QARGTTSKELIHVTDWFPTIVHLAGG-------SVNG---------------TQPLDGYN 367
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
QW ++ S R +L N +D + L T N + R++
Sbjct: 368 QWETISNGKKSPRKELLHN-----------IDPNYIALQTSNNNMI------YRTDM--- 407
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP- 566
FNA + + L + + + ++ + +P P T+ P
Sbjct: 408 --FNASIRAA-------------LRVGDWKILTGDPGNSSWIPPPESSITSPHP-TDIPG 451
Query: 567 --CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
+LFN+ DP E +++ RPD ++L E L+ + +T VP + D DP + +
Sbjct: 452 KIIWLFNIAEDPNEYKDLSLFRPDKVNELLEKLQQYYKTSVPVRYPDNDPA-CDPGKHSG 510
Query: 625 TWSPW 629
W PW
Sbjct: 511 AWGPW 515
Score = 43.1 bits (100), Expect = 0.46, Method: Compositional matrix adjust.
Identities = 23/68 (33%), Positives = 35/68 (51%), Gaps = 5/68 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+T +F ++A +I+ PLFLYL A H L+ P + + IT+
Sbjct: 198 GQYSTHVFAEKAKNIIKSHDPQIPLFLYLPFQAVHGP-----LQVPDQYEKPYANITNKQ 252
Query: 73 RRTYAALT 80
RRTYA +
Sbjct: 253 RRTYAGMV 260
>gi|327265410|ref|XP_003217501.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase I-like [Anolis
carolinensis]
Length = 580
Score = 256 bits (655), Expect = 2e-65, Method: Compositional matrix adjust.
Identities = 170/542 (31%), Positives = 274/542 (50%), Gaps = 79/542 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D +HGS +I TP +D LA G+ L N Y +P+CT SR+ L+TG+Y IHTG+Q I
Sbjct: 69 GFHDXGYHGS-DIXTPTLDRLAAEGVKLENYYIRPICTLSRSQLITGRYQIHTGLQHSII 127
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P + LP+ L+E GYST +GKWHLGF+R+E P RGF++ G L G +
Sbjct: 128 RPQQPNCLPFNQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRGFDTFLGSLTGNVD 187
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ + AW+ G+Y+T L+ + +++ +P+F+Y
Sbjct: 188 YYTYDNCDGPG----VCGYDLHDGENVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIY 243
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E I +++ + + RR YAAMV +D++V + AL++ G
Sbjct: 244 IAFQAVHT-----PLQSPKEYIYRYRSMGNVARRKYAAMVTCMDEAVKNITWALKKYGYY 298
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+NS+I+F +DNG T + GSN+P RG K T WEGGV+ + SP I+
Sbjct: 299 DNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGIGFVHSPLIKHKR 349
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R+S ++HI+DW PTL T A G+ S ++DG + W ++ S R + +D
Sbjct: 350 RISRALIHITDWYPTLVTLAQGNVSEAD-DLDGYNVWPAISEGKESPRTEILHNIDP--- 405
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L ++ S+ I A+VR+ WKL+ G GY S+ +P
Sbjct: 406 --LYNHAKYGSLESGFGIWNTAVQASVRVGDWKLLTGDP------GY-----SDWIPP-- 450
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M +++ +L
Sbjct: 451 ----------QTLATFPGSWW----NLERMTGGMRKSV--------------------WL 476
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ +++ RPD+ L L + +T +P + E P +A P W
Sbjct: 477 FNITADPYERYDLSEQRPDVVRALLLRLVRYNQTAIPVRYPAENP---RAHPDFNGGAWG 533
Query: 628 PW 629
PW
Sbjct: 534 PW 535
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 21/75 (28%), Positives = 43/75 (57%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW+ G+Y+T L+ + +++ +P+F+Y+A A H L++P+E I ++
Sbjct: 209 NVAWEQSGKYSTFLYAQRVNKILAAHNPKEPIFIYIAFQAVHT-----PLQSPKEYIYRY 263
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 264 RSMGNVARRKYAAMV 278
>gi|291225029|ref|XP_002732496.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 256 bits (653), Expect = 3e-65, Method: Compositional matrix adjust.
Identities = 175/539 (32%), Positives = 260/539 (48%), Gaps = 84/539 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I PN+ LA +G+I N Y P CTPSRA++MTG YP TG Q +
Sbjct: 38 GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P GVPL + LPE L+E+GYST +GKWHLGF + EY P RGF+SH+G +
Sbjct: 97 FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YD ++ S + G+D R N+ + G Y + A ++ PLF
Sbjct: 157 DYDK-MNGVLSPSA---GYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLF-- 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA KHLE P+E ++ I D R + + +DD +G V+ AL+ +GM
Sbjct: 210 ---LAFTLDIPAKHLEIPEEYEEKYSDIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGMY 266
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++S+IIF+ DNGA + + GSNYP+RG+ T++EG +VP+I+ I++
Sbjct: 267 DDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTMFEGATRVPSIVSGKGIKKTG 317
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
S ++ I D T+ AG ++ +DGL W +
Sbjct: 318 YESNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSMWPT 353
Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ PS RN +L NID+ + AA+R+ +KL+ G + L
Sbjct: 354 WSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPDL----------------LYP 397
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ I + Y + P S + ST QQ+ PAP YL
Sbjct: 398 YRDISLTDNYYNYGD------APASG---LPSTDQQSHA-----PAPQNVE-------YL 436
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
FN+ DP E+N+I++ PDI L + L HR+ L+P ++ +L A FND WSP
Sbjct: 437 FNVVVDPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGA-AGNFNDVWSP 494
>gi|348501876|ref|XP_003438495.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 571
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 174/540 (32%), Positives = 252/540 (46%), Gaps = 70/540 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS+ I TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 57 GYGDIGYHGSD-IHTPVLDRLAAEGVKLENYYVQPICSPSRSQLMTGRYQIHTGLQHSII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P LPE L E GY+T +GKWHLGF R P RGF+S G L G
Sbjct: 116 RPRQPLCLPPDSPTLPERLAEAGYATHMVGKWHLGFCRPSCLPTGRGFQSFLGTLTGSGD 175
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ + Q E G D+ AW+ G Y+T L+ Q+++ PLFLY
Sbjct: 176 HFSY----QSCDGAEACGFDLHDGDRPAWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLY 231
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+ AAH L+ P ++ + + RR YAAM+ LD V V+ L+ +G+
Sbjct: 232 LSLQAAHTP-----LQVPDHFLHLYDSHRNRLRRHYAAMLSCLDSGVAQVVQELKTQGLY 286
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
ENS++I+ SDNG + + GSN+P RG K T WEGG++ + SP +++
Sbjct: 287 ENSVLIYSSDNGGQPL---------SGGSNWPLRGGKGTYWEGGIRAVGFVHSPLLKKKG 337
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
VS ++H+SDW PTL AG S + +DG D W S
Sbjct: 338 IVSRALIHVSDWYPTLLGLAGAQQS-----------------------HHGLDGHDVWGS 374
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD-GYYGQTRSNKVPLLNF 510
+ P R +L NID R A D ++ G G D R+ LL
Sbjct: 375 ISEGLPCPRTEILFNIDPVSRKAGEPYDKALILSGF---GIWDTAVRAALRAGDWKLLTG 431
Query: 511 NAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLF 570
N V + Q L ++K R+ ++ +LF
Sbjct: 432 N--VGDGDWIPPQALPGGPER-WKKLEKRRNKLGKSV--------------------WLF 468
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
N+ DP E+ ++A +RP++ L L + +T V + PD ADP+ WSPW+
Sbjct: 469 NVSADPYERYDLAETRPEVVKHLMTRLAEYNQTAV-MARNPPDDPMADPELHGGAWSPWL 527
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 23/72 (31%), Positives = 36/72 (50%), Gaps = 5/72 (6%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 67
AW+ G Y+T L+ Q+++ PLFLYL+ AAH L+ P ++ +
Sbjct: 199 AWEMAGNYSTLLYIDRVKQILKRHDPHTPLFLYLSLQAAHT-----PLQVPDHFLHLYDS 253
Query: 68 ITDPNRRTYAAL 79
+ RR YAA+
Sbjct: 254 HRNRLRRHYAAM 265
>gi|37182416|gb|AAQ89010.1| APRG372 [Homo sapiens]
Length = 515
Score = 256 bits (653), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 143/361 (39%), Positives = 204/361 (56%), Gaps = 23/361 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I
Sbjct: 87 GFRDVGYHGS-EIKTPTLDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSII 145
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+E+GYST +GKWHLGF R+E P RGF++ FG L G
Sbjct: 146 RPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFNRKECMPTRRGFDTFFGSLLGSGD 205
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + + G+D+ N + AWD G Y+T ++T+ Q++ KP+FL
Sbjct: 206 YYTHYKCD----SPGMCGYDLYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFL 261
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y A+ A H+ L+AP ++ I + NRR YAAM+ LD+++ V AL+ G
Sbjct: 262 YTAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGF 316
Query: 331 LENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
NSIII+ SDNG PT GSN+P RG K T WEGG++ + SP ++
Sbjct: 317 YNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKN 366
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V +++HI+DW PTL + A G + +DG D W ++ S R + +D +
Sbjct: 367 KGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDIWETISEGLRSPRVDILHNIDPY 425
Query: 450 S 450
+
Sbjct: 426 T 426
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY A+ A H+ L+AP
Sbjct: 222 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYTAYQAVHS-----PLQAPG 276
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 277 RYFEHYRSIININRRRYAAM 296
>gi|291224485|ref|XP_002732234.1| PREDICTED: jumonji domain containing 2c [Saccoglossus kowalevskii]
Length = 1941
Score = 255 bits (652), Expect = 4e-65, Method: Compositional matrix adjust.
Identities = 164/517 (31%), Positives = 258/517 (49%), Gaps = 84/517 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ +I TP +D LAY G+I N Y QP+CTP+RA+LMTG YP GMQ +
Sbjct: 54 GWNDIGWNNL-QIKTPVLDKLAYEGVIFNQTYVQPLCTPTRAALMTGYYPFRIGMQHQMV 112
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P G+PL + LP+ L++ GY +GKWHLG+ EYTPL RGF+S +G + ++
Sbjct: 113 LPFQPSGLPLHLKILPQKLKQAGYINHIVGKWHLGYCNWEYTPLNRGFDSFYGSFSNSVN 172
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+ + I S + G+D R N + G+ T LFT+ V +I + D P+F+Y
Sbjct: 173 HNNKISQLPISDHSKYKGYDFRDNTGVVQND-GQPLTKLFTQRVVDIISNHHKDYPMFMY 231
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
K E ++ + +TD R + A + +DD+VG VI AL+ +GM
Sbjct: 232 FP-----INQPSKGSEPDEKYTALYPNVTDDQVRQWYATLSLMDDAVGHVIDALKSRGMW 286
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
E +++IF+SDNGA ++ E G+N P RG TL+EGG++VPAI I++
Sbjct: 287 EETLLIFISDNGA--FDFME-------GTNLPLRGAGTTLFEGGMRVPAIAHGKMIKKTG 337
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
++ ++ HI+DW T+ + AG + ++DG++ W +
Sbjct: 338 YINNELNHITDWHSTILSLAG------------------------IQPEPDLDGINIWGT 373
Query: 452 LLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ S RN ++ NIDE + R AA+RL WK++ G G M + + L
Sbjct: 374 VSEGKASARNEIVYNIDEIDECRGAALRLGDWKVITGNP--GLMCRF--------LDLNK 423
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ ++S + Q ++ F ISN+ YL
Sbjct: 424 QDGWLQSGRHSDNQAITD--FQSISNVS------------------------------YL 451
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
FNL DP E++N+A P++ S L + L +R+ +VP
Sbjct: 452 FNLKEDPEERHNVAEKYPEVFSALLDRLNEYRKNVVP 488
>gi|189521775|ref|XP_688265.2| PREDICTED: hypothetical protein LOC559800 [Danio rerio]
Length = 1542
Score = 254 bits (649), Expect = 9e-65, Method: Compositional matrix adjust.
Identities = 172/558 (30%), Positives = 271/558 (48%), Gaps = 109/558 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS EI TP +D LA G+ L N Y QP+C+PSR+ LMTG+Y IHTG+Q I
Sbjct: 1038 GFRDVGYHGS-EIKTPTLDRLAAAGVKLENYYVQPLCSPSRSQLMTGRYQIHTGLQHSII 1096
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL LP+ L+ GYST +GKWHLGF++R P RGF++ FG L G
Sbjct: 1097 RPTQPNCLPLENITLPQKLKNAGYSTHMVGKWHLGFYKRACMPTQRGFDTFFGSLLGSGD 1156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H D + L G+D+ AW+ G Y+T ++T++AV ++ +P+FL
Sbjct: 1157 YYSHYKCD----SPGLCGYDLHEGEEAAWEQDRGVYSTIMYTQKAVNILASHNPKRPIFL 1212
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA A H+ L+ P + +++ I + +RR YAAMV LD++V + AL++ G
Sbjct: 1213 YLAFQAVHSP-----LQVPARYLERYKSIPNLHRRKYAAMVSCLDEAVRNLTLALKQYGY 1267
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+N ++++ SDNG + GSN+P RG K + WEGG++ + SP + +
Sbjct: 1268 YDNMVMVYSSDNGGQPMA---------GGSNWPLRGSKGSYWEGGIRAVGFVHSPLLVKK 1318
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
S ++HI+DW PTL G LD+ N N+DG D W
Sbjct: 1319 GTKSKALIHITDWYPTLVMLGEGT----------LDE------------NQNLDGYDVWE 1356
Query: 451 SLLLNTPSRRNSVLINID------------------EKKRTAAVRLDSWKLVLGTQENGT 492
++ PS R +L NID AA+R+ WKL+ G
Sbjct: 1357 AISEGRPSPRLDILHNIDPIYTKAKNGSWKAGYGIWNTAVQAALRVGDWKLLTG------ 1410
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
+ GY S+ +P F+ ++ + + + + + D+ +S
Sbjct: 1411 VPGY-----SDWIPPQTFSTLLLANRWHNER----------VSWDRGKSV---------- 1445
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ-LYELLKYHRRTLVPQSHEQ 611
+LFN+ DP E+ +++ P + + L++LL+Y + VP +
Sbjct: 1446 ---------------WLFNITADPYERVDLSRQYPHVVKKMLFKLLRYSKNA-VPCRYPP 1489
Query: 612 PDLVQADPKRFNDTWSPW 629
D +++P+ W PW
Sbjct: 1490 KDY-RSNPQYNGGVWGPW 1506
Score = 45.4 bits (106), Expect = 0.095, Method: Compositional matrix adjust.
Identities = 31/95 (32%), Positives = 51/95 (53%), Gaps = 12/95 (12%)
Query: 8 AWDT-VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
AW+ G Y+T ++T++AV ++ +P+FLYLA A H+ L+ P + +++
Sbjct: 1180 AWEQDRGVYSTIMYTQKAVNILASHNPKRPIFLYLAFQAVHSP-----LQVPARYLERYK 1234
Query: 67 YITDPNRRTYAALTKS-----TTLTL-LIVYGWND 95
I + +RR YAA+ LTL L YG+ D
Sbjct: 1235 SIPNLHRRKYAAMVSCLDEAVRNLTLALKQYGYYD 1269
>gi|348520018|ref|XP_003447526.1| PREDICTED: arylsulfatase I-like [Oreochromis niloticus]
Length = 732
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 144/383 (37%), Positives = 210/383 (54%), Gaps = 21/383 (5%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ +H I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GFNDIGYHNPT-IKTPTLDKLAAEGVKLENYYVQPICTPSRSQLITGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +P LPE LRE GY+T +GKWHLGF+R+ P +GF++ FG L G +
Sbjct: 119 RPRQPSCLPSHMDTLPERLREAGYTTHMVGKWHLGFYRKACLPTRKGFDTFFGSLTGSVD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFL 270
YY + D +L G+D+ + AW G+Y+T LFT+ A +++E P ++PLFL
Sbjct: 179 YYSYESCD----GKDLCGYDLHDDEGVAWGQEGKYSTTLFTQRARKILESHDPAERPLFL 234
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L+ A H L P+ I F+ + + RR +A MV +D++V V AL++ G
Sbjct: 235 LLSFQAVHTP-----LNPPESYIYPFRDMVNVARRNFAGMVSTVDEAVRNVTYALRKYGY 289
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
NS+II+ +DNGA G+N+P RG K T WEGGV+ + SP +++
Sbjct: 290 YRNSVIIYSTDNGAQPF---------TGGNNWPLRGRKGTYWEGGVRGVGFVHSPLLRRR 340
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
RVS +MHI+DW PTL A G+ S+ +DG D W ++ S R + +D
Sbjct: 341 RRVSKALMHITDWFPTLVGLARGNISQ-SQGLDGFDVWPAISEGRESPRKEILHNIDPLH 399
Query: 451 SLLLNTPSRRNSVLINIDEKKRT 473
+ ++ N KKRT
Sbjct: 400 KSPIEKKNQGEENSGNYRGKKRT 422
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 25/63 (39%), Positives = 38/63 (60%), Gaps = 1/63 (1%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LFN+ DP E++++A RPDI QL L Y+ +T VP + PD +A+P + W
Sbjct: 621 WLFNITGDPYERHDLADQRPDIVQQLLARLAYYNQTAVP-VYFPPDDPRANPSQHRGAWV 679
Query: 628 PWI 630
PW+
Sbjct: 680 PWV 682
Score = 46.6 bits (109), Expect = 0.044, Method: Compositional matrix adjust.
Identities = 25/78 (32%), Positives = 41/78 (52%), Gaps = 6/78 (7%)
Query: 7 TAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
AW G+Y+T LFT+ A +++E P ++PLFL L+ A H L P+ I F
Sbjct: 201 VAWGQEGKYSTTLFTQRARKILESHDPAERPLFLLLSFQAVHTP-----LNPPESYIYPF 255
Query: 66 QYITDPNRRTYAALTKST 83
+ + + RR +A + +
Sbjct: 256 RDMVNVARRNFAGMVSTV 273
>gi|241176208|ref|XP_002399501.1| arylsulfatase B precursor, putative [Ixodes scapularis]
gi|215495150|gb|EEC04791.1| arylsulfatase B precursor, putative [Ixodes scapularis]
Length = 418
Score = 253 bits (646), Expect = 2e-64, Method: Compositional matrix adjust.
Identities = 166/496 (33%), Positives = 238/496 (47%), Gaps = 88/496 (17%)
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
G+Q I E G+PL +P++ ++LGY+T +GK +G F T R + H
Sbjct: 1 GLQHLTIEEGEAWGLPLRFTLMPQHFKKLGYATHMVGK--VGQFIISDTSCRRYVKRH-- 56
Query: 205 YLNGVISYYDHILSDQYSRTVELN--------GHDMRRNLSTAWDTVGEYATDLFTKEAV 256
S + R ++ G D+ N G Y TD+ T +AV
Sbjct: 57 --------TQKSCSQRCKRERKVKEPCGGDHIGLDLWDNEVAVRTECGNYDTDMLTDKAV 108
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+I KPLFLYL+H +AH+ L+AP++ I +F YI + NR YA MV +DD
Sbjct: 109 SIIASHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGERNRTIYAGMVDSMDD 168
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
+ G ++ AL G+L N+I++F SDNGA P +E + N G N+P RG K TLWEG
Sbjct: 169 AFGRLVEALSGAGVLNNTIVVFSSDNGAVPIIE------FPNRGFNWPLRGAKKTLWEGA 222
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
V+VPA +WSP ++ + RVS QMMHI DWLPT Y+ AGG+ S L
Sbjct: 223 VRVPAFIWSPLLESSGRVSDQMMHIVDWLPTFYSVAGGNLSHL----------------- 265
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
DG D W +L T S R +L+NID T+++R + K+VLGT NG+ D
Sbjct: 266 -----GEQDGFDMWKALSEGTKSPREEMLLNIDPVLNTSSLRYKNHKVVLGTYHNGSFDH 320
Query: 496 YYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG-ANP 554
+ T + R++AT+ CG N
Sbjct: 321 RFKTT------------------------------------GRRPGWRRRATVRCGECNL 344
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
A P + P YLF++ DPCE NN+A P+I QL + L+ + T V +
Sbjct: 345 AKDNVKPSS--PPYLFDVVADPCELNNLAPEHPEILRQLLKRLEDYATTAVDPENVYETD 402
Query: 615 VQADPKRFNDTWSPWI 630
++ P+R N TW+PW+
Sbjct: 403 ARSFPERHNGTWAPWL 418
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 30/71 (42%), Positives = 42/71 (59%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
G Y TD+ T +AV +I KPLFLYL+H +AH+ L+AP++ I +F YI +
Sbjct: 95 CGNYDTDMLTDKAVSIIASHDTTKPLFLYLSHRSAHSAFTSAPLQAPEKNIAKFGYIGER 154
Query: 72 NRRTYAALTKS 82
NR YA + S
Sbjct: 155 NRTIYAGMVDS 165
>gi|390369306|ref|XP_003731620.1| PREDICTED: uncharacterized protein LOC763377 [Strongylocentrotus
purpuratus]
Length = 784
Score = 252 bits (644), Expect = 4e-64, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 215/367 (58%), Gaps = 27/367 (7%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
I++ + ++ LTK + +LI G+ND+ +HGS EI TPNID LA G+ L N Y QP
Sbjct: 13 ISNVCSKKFSQLTKPNIVFILIDDLGYNDVGYHGS-EIYTPNIDKLAREGVRLENYYVQP 71
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+CTP+R+ L++G+Y IHTG+Q I A+P +P + LRE GY+T A+GKWHLG
Sbjct: 72 ICTPTRSQLLSGRYQIHTGLQHSYIRPAQPLCLPTNLPTFADKLREAGYATHAVGKWHLG 131
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
F+++E P RGF+S+FGYL G Y+ H + + L +R AW+ G Y
Sbjct: 132 FYKKECLPTQRGFDSYFGYLTGGEDYWTH-----HRKRPXLA---LRHVDKVAWEYGGYY 183
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+ +FT++ Q++ PV++P LYL + H+ L+ P +++ I + NRR
Sbjct: 184 SAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERYKNIKNTNRRI 238
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA M+ LD++VG ++ +LQ+ G+ +N+++IF +DNG G+N+P RG
Sbjct: 239 YAGMMTCLDEAVGKIVHSLQQAGLWDNTVLIFSTDNGGEVA---------AGGNNWPLRG 289
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
K ++WEGG++ + SP + + + V+ Q++H+SDW PTL G +S + +DG
Sbjct: 290 WKRSIWEGGMRGVGFVNSPLLPASVQGTVNKQLIHVSDWFPTLVQGVAG-SSLDNITLDG 348
Query: 425 LDQWSSL 431
+ W ++
Sbjct: 349 FNMWKTI 355
Score = 44.7 bits (104), Expect = 0.15, Method: Compositional matrix adjust.
Identities = 22/74 (29%), Positives = 39/74 (52%), Gaps = 5/74 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
AW+ G Y+ +FT++ Q++ PV++P LYL + H+ L+ P ++
Sbjct: 174 KVAWEYGGYYSAFVFTEKIQQIVAQHPVEQPFLLYLPFQSVHSP-----LQVPSSYEERY 228
Query: 66 QYITDPNRRTYAAL 79
+ I + NRR YA +
Sbjct: 229 KNIKNTNRRIYAGM 242
>gi|156380740|ref|XP_001631925.1| predicted protein [Nematostella vectensis]
gi|156218974|gb|EDO39862.1| predicted protein [Nematostella vectensis]
Length = 540
Score = 251 bits (640), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 139/364 (38%), Positives = 207/364 (56%), Gaps = 26/364 (7%)
Query: 85 LTLLIVYGWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L+ GW+D+ +H S+ + TPNID LA G+ L + Y+QP+CTPSR +LMTGKYPIH
Sbjct: 39 FILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSRGALMTGKYPIH 98
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
GMQ I P G+P +P+ LR LGY T IGKWHLGFF +YTPL RGF+S
Sbjct: 99 LGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDYTPLRRGFDSFL 158
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G+ G ++ H ++ D RR+ A + G+++TD+FT+EA+ +
Sbjct: 159 GFFAGEQDHWRH---------SKMGFLDFRRDEEPANEYGGQHSTDVFTQEAINIAMRHN 209
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+PLFL L++ A H L+A +N+ ++D +R+ Y M+ D S+G +I
Sbjct: 210 ASQPLFLLLSYAAVHT-----PLQAHPNDVNKIGGVSDKDRQNYLGMMGAADWSIGRLID 264
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
+R G+ N+++I+ SDNGA + G N+P RG K++L+EGGV+VPA +
Sbjct: 265 VYKRNGLWNNTLMIWASDNGA--------QPGKGGGYNWPLRGYKSSLFEGGVRVPAFVH 316
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
+Q+ + H++DW PTL AGG+ +IDG+DQW +L PS+R +
Sbjct: 317 GEMLQRKGGTVNDLFHVTDWYPTLVKLAGGEVEP---DIDGVDQWPTLSEGKPSKREEIL 373
Query: 444 DGLD 447
+D
Sbjct: 374 HNID 377
Score = 45.4 bits (106), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 28/99 (28%), Positives = 51/99 (51%), Gaps = 8/99 (8%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+ A + G+++TD+FT+EA+ + +PLFL L++ A H L+A
Sbjct: 178 FRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHT-----PLQAHPN 232
Query: 61 TINQFQYITDPNRRTYAALTKSTTLT---LLIVYGWNDL 96
+N+ ++D +R+ Y + + + L+ VY N L
Sbjct: 233 DVNKIGGVSDKDRQNYLGMMGAADWSIGRLIDVYKRNGL 271
>gi|291225021|ref|XP_002732506.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 250 bits (639), Expect = 1e-63, Method: Compositional matrix adjust.
Identities = 172/551 (31%), Positives = 256/551 (46%), Gaps = 108/551 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I PN+ LA +G+I N Y P CTPSRA++MTG YP TG Q +
Sbjct: 38 GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQML 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P G+PL + LPE L+E+GYST +GKWHLGF + EY P RGF+SH+G +
Sbjct: 97 FNLHPSGLPLEFKLLPEKLKEIGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGIWTLGVG 156
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YD +LS G+D R N + G Y + A ++ PL
Sbjct: 157 DYDKMNGVLSPS-------KGYDFRDNTGVVQKSNG-YLALMLGDRAEHIVNTHYPGTPL 208
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
F LA KHL P+E N++ I D R + + +DD +G V+ AL+ +
Sbjct: 209 F-----LAFTLDIPAKHLAIPEEYENKYADIEDSRTRHFYGKLTMMDDIIGGVVDALKNR 263
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ ++S+IIF+ DNGA + + GSNYP+RG+ TL+EG +VP+I+ I+
Sbjct: 264 GIYDDSLIIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIK 314
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ S ++ I D T+ AG ++ +DGL
Sbjct: 315 KTGYESNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSM 350
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGT-------QENGTMDGY--Y 497
W + PS RN ++ NID+ + AA+R+ +KL+ G ++ G D Y Y
Sbjct: 351 WPTWSKGKPSPRNEIIYNIDDDPVSPGAAIRIGDYKLITGHPDLLYPYRDIGITDNYYNY 410
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
G ++ +P A+ A M
Sbjct: 411 GDPLASGLPT--------------------------------------------ADQAAM 426
Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
P P YLFN+ DP E+N+I++ PDI L + L HR+ L+P ++ +L +
Sbjct: 427 APPPLDVE--YLFNVIVDPEERNDISADHPDIVQDLRDRLDEHRKLLIPPVNQTTELAGS 484
Query: 618 DPKRFNDTWSP 628
FND WSP
Sbjct: 485 -AANFNDVWSP 494
>gi|313219945|emb|CBY43646.1| unnamed protein product [Oikopleura dioica]
Length = 1683
Score = 250 bits (639), Expect = 2e-63, Method: Compositional matrix adjust.
Identities = 162/512 (31%), Positives = 249/512 (48%), Gaps = 72/512 (14%)
Query: 119 LNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK 178
L + Y P+CTP+R+ LM+G+Y IHTG+Q I GA+P G+PLT+ PE ++ GY T
Sbjct: 1233 LQSYYVNPICTPTRSVLMSGRYQIHTGLQHAVILGAQPNGLPLTDPVQPEIFKDCGYRTH 1292
Query: 179 AIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST 238
+GKWHLGF+ + P RGFESH+GYL G +Y+H Q V+ D + ++
Sbjct: 1293 MVGKWHLGFYDEKMVPENRGFESHYGYLIGAEGHYNHSQFMQGQHGVDF--RDGGASTNS 1350
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
+W G+Y+ DLF K L+E V++ L++Y+ H LEAPQ ++QF +
Sbjct: 1351 SW---GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSW 1402
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
I D +RR YAAM K +DD+VG +I++ + KG+ E++I+ F +DNG T+
Sbjct: 1403 IKDRDRRVYAAMTKSMDDTVGRIINSFKNKGIWEDTIVYFTTDNGGSTLYG--------- 1453
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
G+N+P RG+KNTLWEGG++ + P + + R Q+MH+ D PTL
Sbjct: 1454 GNNWPLRGLKNTLWEGGIRGIGSIKIPGVSADKRD--QLMHVVDMFPTL----------- 1500
Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
LD L + P+ R DG Q L+ N S+ +S LIN+D ++
Sbjct: 1501 ------LD-----LTSCPNTRKVKFDGKSQAKMLVDNDESQSDSFLINVDPLRKNKGADN 1549
Query: 479 DSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDK 538
+W + + R + LL S Y + +I+ P D
Sbjct: 1550 RTWHSEFDVRVQAGL-------RWQQWKLL--TGAPSSDNYP-----TGHIYPPEWPQDG 1595
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLK 598
++++ +N LF++ NDP E I++ +I ++ +L
Sbjct: 1596 RKNSQSNFE--------------SSNLNVRLFDIDNDPLEDFEISNQHTEIVEKMLNMLS 1641
Query: 599 YHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
+ T VP PD P + W PWI
Sbjct: 1642 SYNSTAVPCRWPNPDPAGL-PDQNGGFWKPWI 1672
Score = 60.1 bits (144), Expect = 4e-06, Method: Compositional matrix adjust.
Identities = 30/75 (40%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y+ DLF K L+E V++ L++Y+ H LEAPQ ++QF +I D +
Sbjct: 1353 GQYSADLFVKRVEDLVEAHDVEESLYMYVGLQNVHYP-----LEAPQHYVDQFSWIKDRD 1407
Query: 73 RRTYAALTKSTTLTL 87
RR YAA+TKS T+
Sbjct: 1408 RRVYAAMTKSMDDTV 1422
>gi|323449751|gb|EGB05637.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 533
Score = 249 bits (637), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 174/523 (33%), Positives = 243/523 (46%), Gaps = 87/523 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ FHGS +IPTP +DALA +G+ L N + PVC+PSRAS+++G++ IH G+ P
Sbjct: 67 GFNDVGFHGSKQIPTPRLDALAADGVDLLNYHTHPVCSPSRASMLSGRHAIHHGIYMPFA 126
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G + L LPE LR LGY T A+GKWHLG R P RGF+S GY +G
Sbjct: 127 QGTAYH-LSLEYELLPEALRRLGYETHAVGKWHLGQNTRAALPTGRGFDSFLGYWSGAED 185
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + Y D N +TAW G Y+ FT AV ++ P FLY
Sbjct: 186 YFAHDCAGAY---------DFANNETTAWAYDGVYSAYSFTDRAVDVVAS--ASTPYFLY 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A H LE P + F ++ RR + MV LD++VG V AL+ +
Sbjct: 235 VAWQNVH-----WPLEVPARYLEPFGHVPAGPRRNVSGMVSALDEAVGNVTGALKARRAY 289
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++++++F+SDNG PT T++ +N+P RG KNTL+EGG +V +L +P + ++
Sbjct: 290 DDAVVVFVSDNGGPTNGDEGTAS-----NNWPLRGGKNTLYEGGTRVVGLLKAPGVAKS- 343
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
S +H DW PTLY AGG L+LN P DG+D +
Sbjct: 344 SASRAKLHAVDWFPTLYRFAGG---------------GPLVLNPPLAPG---DGVDSYDV 385
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
L P R DE L+L G+ D +G
Sbjct: 386 LARGAPGR--------DE-------------LLLEAHAPGSNDTVHG------------- 411
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA----TIHCGANPAP----MTPSPCT 563
E+ + L P S R Q+A + CGA P P T C
Sbjct: 412 ---EALIVGDWKFLRIGTVRPDSEAGWTRPPGQEAHPRSVVDCGAAPEPADANATRRQCV 468
Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
PC LF++ DPCE+ + A PD+ + L L +R T VP
Sbjct: 469 AAPC-LFDVAADPCERVDRAVEFPDVLAALDARLDAYRATAVP 510
>gi|241634070|ref|XP_002410502.1| arylsulfatase J, putative [Ixodes scapularis]
gi|215503435|gb|EEC12929.1| arylsulfatase J, putative [Ixodes scapularis]
Length = 480
Score = 249 bits (636), Expect = 3e-63, Method: Compositional matrix adjust.
Identities = 178/551 (32%), Positives = 251/551 (45%), Gaps = 88/551 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+SFHGS +IPTPNID LA +G+IL+N YA P+CTPSRA+LMTG YPIHTGM I
Sbjct: 5 GWGDVSFHGSTQIPTPNIDVLAGDGVILDNYYALPLCTPSRAALMTGLYPIHTGMHAGVI 64
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A P G+ L + +P++ +LGY IGK H + G + +G+
Sbjct: 65 QDAAPWGLTLETKIMPQHFEDLGYEVNMIGKSH-----------HDGCHNFDSTKSGIDL 113
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV------- 264
+ ++S + HD ++ A K +Q + +
Sbjct: 114 LHTPLISSIPGQ------HDNTWMYGKTYNFYRTRAICRLQKRVIQGVVVYLIRFLLDLH 167
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD-DSVGTVIS 323
+P F YL+H A H+ +AP + +F YI + NR YA KK +G +
Sbjct: 168 SQPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWLGLALV 227
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
+L + + +++ SDNG + N G N+P RG K LWEGGV+VPA +W
Sbjct: 228 SLVKTKNYVDILLMGFSDNGGSPFGL-----FSNRGFNWPLRGAKANLWEGGVRVPAFVW 282
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
S + + PRVS Q+MHISDWLPTLY+AAGGD S L ++
Sbjct: 283 SSKFLKKPRVSDQLMHISDWLPTLYSAAGGDVSNL----------------------GSM 320
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY----GQ 499
DG+D W L L S R VL+ ID K AA+R ++KLV+G N +DG Y G
Sbjct: 321 DGIDMWRYLTLGFGSPRKEVLLTIDPKNNLAALRYKNYKLVVGEGFNEELDGRYSFPGGL 380
Query: 500 TRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTP 559
+N VP L +SK + L+ + R + + NP
Sbjct: 381 RPTNDVPQLR----KDSKVARVLKSFYG------------KQNRNWSPLDWSKNPV---- 420
Query: 560 SPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADP 619
C + NN + +P + L + T VP QP +A P
Sbjct: 421 VDC-----------RERWLTNNFVARQPPYFRFMMSKLAAYNATAVPPLF-QPQDPRAYP 468
Query: 620 KRFNDTWSPWI 630
+ WSPW+
Sbjct: 469 EYHGGVWSPWL 479
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 28/54 (51%)
Query: 35 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLL 88
+P F YL+H A H+ +AP + +F YI + NR YA TK T L L
Sbjct: 169 QPFFCYLSHQAVHSALMKDPFQAPARNLLKFPYIGETNRTVYAGETKKTHLPWL 222
>gi|346472067|gb|AEO35878.1| hypothetical protein [Amblyomma maculatum]
Length = 514
Score = 248 bits (634), Expect = 5e-63, Method: Compositional matrix adjust.
Identities = 166/525 (31%), Positives = 255/525 (48%), Gaps = 94/525 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S+H + + +P ++ LA G+IL+ YA P CTP+RA+LMTG+YP G+Q I
Sbjct: 36 GWNDVSWH-NERMESPILEQLAKEGVILDQHYALPTCTPTRAALMTGRYPYKLGIQSHGI 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL L E L+ GY+T A GKWHLG+ + TP RGF++ G+ G
Sbjct: 95 RTLEPNGLPLGVTTLAEELKRTGYTTHAFGKWHLGYCNQSLTPTRRGFDTFRGFYVGGQD 154
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H LS + + G+D R + G Y T L + IE+ DKP+FLY
Sbjct: 155 YFSHTLSGGKT-SATAKGYDYRNGDEVDYSAKGVYTTTLIANHVLSAIEESQPDKPMFLY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A HA L+ P + +P R+ M+ +D +VG ++ L+ K M
Sbjct: 214 VAFQAVHA-----PLQVPTQYRKMCSIYRNPKRKLLCEMLAVMDHAVGLIVKKLKMKDMW 268
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
N++++F+SDNG + G+N+P RG KNTL+EGG +VPA + P I+
Sbjct: 269 RNTLLVFISDNGGQIL---------YGGNNWPLRGNKNTLFEGGTRVPAFVAGPLIRNGG 319
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPL-NIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
R S ++H+ DW PTL +AAGG+++ L ++DG+DQW+++ R+ +DG
Sbjct: 320 RNSSSIIHVVDWFPTLLSAAGGESAESQLKDMDGVDQWAAI-------RDGTLDGT---- 368
Query: 451 SLLLNTPSRRNSVLINIDEKKR--TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
R + N+D KK A+R K +L + T G+Y +
Sbjct: 369 ---------RTEFVYNLDCKKGKLVGAIRDGDLKFILKPSQVRT--GWYEE--------- 408
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
E K ++ ++L G NPA M
Sbjct: 409 ------ELKGSENPEKL-------------------------GRNPARM----------- 426
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
LFN+ DP E++++ +S + +L L+ +R L+ S PD
Sbjct: 427 LFNISADPLEKDDLFASLKTEAQRLEAKLR--KRALLLVSSRNPD 469
>gi|291220870|ref|XP_002730451.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 519
Score = 248 bits (633), Expect = 6e-63, Method: Compositional matrix adjust.
Identities = 151/426 (35%), Positives = 218/426 (51%), Gaps = 69/426 (16%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GWND+ +H S+ I +PNI+AL +G+ L N Y QP CTPSR+ LM+G+Y IHTG+Q
Sbjct: 32 HGWNDIGYH-SHIIRSPNINALCNDGVRLENYYIQPGCTPSRSQLMSGRYQIHTGLQHSV 90
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I +P +PL E L E L+E+GY+T +GKWHLGF+ P RGF+S FGYL G
Sbjct: 91 IRNDQPNCLPLDEVTLAEKLKEVGYATHLVGKWHLGFYTPSCLPTRRGFDSFFGYLIGQE 150
Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY HI G+D++R+ + G+Y T +FT EA +I+ PLF
Sbjct: 151 DYYKHIHD---------GGYDLKRHETDVSKQYQGDYTTHVFTSEAQNIIKSHDPSTPLF 201
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
LY+++ + HA +L+ P+ N + I D +RR A MV +D+ +G + L+
Sbjct: 202 LYMSYQSVHA----NYLQVPEHYSNMYNGVIDDEDRRIVAGMVTCMDEGIGNITQTLKET 257
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
+ N+IIIF SDNG + + G+N+P RG K T WEG + ++S I
Sbjct: 258 ELWNNTIIIFSSDNGG---------DPNDGGNNWPLRGEKGTHWEGAIHGLGFVYSSDIA 308
Query: 389 QNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
++ R V+ +M+H+SDW PT+ AGG SL P +DG
Sbjct: 309 KDVRGTVNTEMIHVSDWFPTIVNLAGG----------------SLNGTKP------LDGF 346
Query: 447 DQWSSLLLNTPSRRNSVLINID--------------------EKKRTAAVRLDSWKLVLG 486
DQW ++ S R +LINID A+R+ WKL+ G
Sbjct: 347 DQWQTISQGAESPRTEILINIDPLVPPPSDMEHFFHGWNGTYNTSMCGAIRVGDWKLITG 406
Query: 487 TQENGT 492
G+
Sbjct: 407 YPGEGS 412
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/69 (33%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
G+Y T +FT EA +I+ PLFLY+++ + HA +L+ P+ N + I D
Sbjct: 176 GDYTTHVFTSEAQNIIKSHDPSTPLFLYMSYQSVHA----NYLQVPEHYSNMYNGVIDDE 231
Query: 72 NRRTYAALT 80
+RR A +
Sbjct: 232 DRRIVAGMV 240
>gi|72159051|ref|XP_791089.1| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 545
Score = 248 bits (633), Expect = 7e-63, Method: Compositional matrix adjust.
Identities = 172/546 (31%), Positives = 260/546 (47%), Gaps = 77/546 (14%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ND+ + + + TPN+D LA GI L+N Y QP+CTPSRA LM+GKY IHTG+Q
Sbjct: 69 YGFNDIGYR-NPAMRTPNLDYLAAEGIKLDNYYVQPICTPSRAQLMSGKYQIHTGLQHSI 127
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P +PL LP+ L+E GY+T GKWHLGF+++E P RGF+S G L G
Sbjct: 128 IWPPQPNCLPLDLPTLPQKLKEAGYATHMAGKWHLGFYKKECWPTNRGFDSFLGILLGKG 187
Query: 211 SYYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
++ H Y T G D R L + G Y+T + + +IE DKP
Sbjct: 188 DHFLHTEEGGGGPYPSTWPWEGLDFRDGLQSTNAYSGIYSTHVIAERVENIIEKHDKDKP 247
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQ 326
LFLY++ A H L+ P+ + F+ I D RR YA M +D++VG + L+
Sbjct: 248 LFLYVSFQAVHT-----PLQVPESYLQPFESSIQDEKRRIYAGMTYCMDEAVGNITKKLK 302
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
++G+ ++++++F SDNG N SN+P RG K TLWEGGV+ + SP
Sbjct: 303 KQGLWDDTVLVFSSDNGG---------NIDQGASNWPLRGSKTTLWEGGVRAVGFVTSPL 353
Query: 387 IQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
+ + + VS +++ ISDW PTL G T +DG + W +L PS R +
Sbjct: 354 LSERMKGTVSRELIDISDWYPTLIEGVAGWTLS-GTKLDGYNIWETLRSGKPSARVELLH 412
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK 504
+D + P+ + N + AA+R WK+V G +G+Y S+K
Sbjct: 413 NIDPLITPPSTWPNESIAAAHNSFSTRTYAALRYKDWKIVTGYXS--INNGWYSPAESSK 470
Query: 505 VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
Q ++ I LP ++
Sbjct: 471 ------------------QSVASEI-LPGKSV---------------------------- 483
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL-VQADPKRFN 623
+LFN+ DP E +++++ P I + L E L+ ++ P + PD+ +A+P
Sbjct: 484 ---WLFNITRDPREFHDLSNQEPAIVNFLLERLESYQSGASPVLY--PDIDTKANPAGHG 538
Query: 624 DTWSPW 629
W W
Sbjct: 539 GIWDSW 544
Score = 42.7 bits (99), Expect = 0.62, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 35/69 (50%), Gaps = 6/69 (8%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
G Y+T + + +IE DKPLFLY++ A H L+ P+ + F+ I D
Sbjct: 224 GIYSTHVIAERVENIIEKHDKDKPLFLYVSFQAVHT-----PLQVPESYLQPFESSIQDE 278
Query: 72 NRRTYAALT 80
RR YA +T
Sbjct: 279 KRRIYAGMT 287
>gi|405379584|ref|ZP_11033433.1| arylsulfatase A family protein [Rhizobium sp. CF142]
gi|397323967|gb|EJJ28356.1| arylsulfatase A family protein [Rhizobium sp. CF142]
Length = 502
Score = 247 bits (630), Expect = 1e-62, Method: Compositional matrix adjust.
Identities = 154/402 (38%), Positives = 209/402 (51%), Gaps = 50/402 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS +I TPNID LA G L Y QP+CTP+RA+ MTG+YP GMQ I
Sbjct: 71 GWKDVGFHGS-DIKTPNIDELAEKGARLEQYYVQPMCTPTRAAFMTGRYPFRYGMQTAVI 129
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ L + LPE L+E GY+T A GKWHLG + + P RGF+S +G L G I
Sbjct: 130 PQGGTYGLALDDHLLPELLKEAGYATAASGKWHLGHAKTAFWPRQRGFDSFYGALLGEID 189
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD--LFTKEAVQLIEDQPVDKPLF 269
++ H ++ N R N D + E D LF EAV++I + KPLF
Sbjct: 190 HFTH-------KSANGNADWYRNN-----DALEEEGFDNVLFADEAVRVINEHDQAKPLF 237
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYLA + H +AP+E + + +I D +RR YAAM+ +DD +G V+ AL+ +G
Sbjct: 238 LYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAAMISVMDDGIGRVVEALEMRG 292
Query: 330 MLENSIIIFMSDNGAPT-VEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
M EN++I+F SDNG + SN + N PYR K T +EGG +V ++ W
Sbjct: 293 MRENTLIVFHSDNGGVVDAFFAGESNVKGKLPADNGPYREGKGTTYEGGTRVVGLVNWPG 352
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+I+ P + MMH+ D PTL AG + +N +DG
Sbjct: 353 KIE--PGTANGMMHVVDMYPTLAALAGAELG----------------------KNKPLDG 388
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
+D W L PS R ++ NI+ TAAVR WKLV T
Sbjct: 389 MDMWPVLGEGKPSPRTEIVYNIE--PMTAAVRQGDWKLVWKT 428
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 36/61 (59%), Gaps = 5/61 (8%)
Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
LF EAV++I + KPLFLYLA + H +AP+E + + +I D +RR YAA
Sbjct: 218 LFADEAVRVINEHDQAKPLFLYLAFTSPHT-----PFQAPKEFLERNSHIADESRRNYAA 272
Query: 79 L 79
+
Sbjct: 273 M 273
>gi|220906870|ref|YP_002482181.1| sulfatase [Cyanothece sp. PCC 7425]
gi|219863481|gb|ACL43820.1| sulfatase [Cyanothece sp. PCC 7425]
Length = 495
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 163/447 (36%), Positives = 236/447 (52%), Gaps = 48/447 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS+ I TPN+D LA G L Y+QP+CTPSRA+L+TG+YP G+Q I
Sbjct: 59 GWKDVGFHGSD-IRTPNLDQLAKTGARLEQYYSQPMCTPSRAALLTGRYPHRYGLQTLVI 117
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A G+P E LP+ L+E GY T +GKWHLG +Y P RGF+ +G L G I
Sbjct: 118 PSAGKYGLPTDEYLLPQALKEAGYETAIVGKWHLGHADPKYWPRQRGFDYQYGPLLGEID 177
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + D RN + Y T L ++AV+LIE PLFLY
Sbjct: 178 YFTHSAHGKV---------DWYRNNQLIKEE--GYVTTLLGQDAVKLIEKHNPKTPLFLY 226
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA A HA +APQ+ ++Q++ I DPNRR YAAM+ +DD +G V++AL+++GM
Sbjct: 227 LAFTAPHA-----PYQAPQKYLDQYKTIADPNRRAYAAMITAMDDQIGQVVAALEKRGMR 281
Query: 332 ENSIIIFMSDNGAP-TVEYR---ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
N++I+F SDNG P + ++ +TS N PYR K +L+EGG +V A+ W +
Sbjct: 282 NNTLIVFQSDNGGPRSAQFTGEVDTSGGTIPADNGPYRDGKASLYEGGTRVVALANWPGK 341
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
IQ V+ +HI D PTL + L + +N +DGL
Sbjct: 342 IQPGTVVN-HPIHIVDMYPTL----------------------TGLASVSVGKNKPLDGL 378
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY-YGQTRSNKV 505
+ W +L PS R+ V+ +I+ + AA+ + WKLV ++ + Q S +
Sbjct: 379 NIWPALSEAKPSPRSQVVYDIEPFR--AALSQEDWKLVWKATLPSRLELFNLSQDVSEQT 436
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLP 532
L N + S+ Q ++ LS++ LP
Sbjct: 437 NLAEQNPEIVSRLKQQIEVLSRDAVLP 463
Score = 58.9 bits (141), Expect = 7e-06, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 41/65 (63%), Gaps = 5/65 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T L ++AV+LIE PLFLYLA A HA +APQ+ ++Q++ I DPNRR
Sbjct: 201 YVTTLLGQDAVKLIEKHNPKTPLFLYLAFTAPHA-----PYQAPQKYLDQYKTIADPNRR 255
Query: 75 TYAAL 79
YAA+
Sbjct: 256 AYAAM 260
>gi|190891646|ref|YP_001978188.1| sulfatase [Rhizobium etli CIAT 652]
gi|190696925|gb|ACE91010.1| putative sulfatase protein [Rhizobium etli CIAT 652]
Length = 498
Score = 247 bits (630), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 158/417 (37%), Positives = 219/417 (52%), Gaps = 54/417 (12%)
Query: 78 ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
AL +ST ++ + GW D+ FHGS +I TPNID LA G L YAQP+CTP+RA
Sbjct: 49 ALAQSTKPNIVFIVADDLGWKDVGFHGS-DIKTPNIDQLAEKGGRLEQFYAQPMCTPTRA 107
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+LMTG+YP GMQ I G+ L + LPE L+E GY+T A GKWHLG +
Sbjct: 108 ALMTGRYPFRYGMQTAVIPQGGTYGLALDDYLLPEMLKEAGYATAASGKWHLGHADTAFW 167
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD--LF 251
P RGF+S +G L G I ++ H ++ N R N + + E D LF
Sbjct: 168 PRQRGFDSFYGALLGEIDHFTH-------KSANGNADWYRNN-----EAIEEAGFDNILF 215
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
EAV++I + KPLFLYLA + H +AP+E +++ +I D +RR YAAM+
Sbjct: 216 ATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADESRRAYAAMI 270
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT-VEYRETSNYRNW--GSNYPYRGVK 368
+DD +G V+ AL+ +GM EN++I+F SDNG + S+ + N PYR K
Sbjct: 271 SVMDDGIGRVVEALETRGMRENTLILFHSDNGGVVDAFFAGESDVKGKLPADNGPYRQGK 330
Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
T +EGG +V ++ W +I+ P + MMH+ D PTL AG
Sbjct: 331 GTTYEGGTRVVGLINWPGKIE--PGSADGMMHVVDMYPTLAALAGA-------------- 374
Query: 428 WSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
TP +N +DG+D W L PS R ++ N++ TAAVR WKLV
Sbjct: 375 -------TPG-KNKPLDGMDMWPMLGEGKPSPRTEIVYNVE--PMTAAVRQGDWKLV 421
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 5/61 (8%)
Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
LF EAV++I + KPLFLYLA + H +AP+E +++ +I D +RR YAA
Sbjct: 214 LFATEAVRVINEHDQSKPLFLYLAFTSPHT-----PFQAPKEYLDRNSHIADESRRAYAA 268
Query: 79 L 79
+
Sbjct: 269 M 269
>gi|291230930|ref|XP_002735418.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 480
Score = 246 bits (629), Expect = 2e-62, Method: Compositional matrix adjust.
Identities = 178/554 (32%), Positives = 267/554 (48%), Gaps = 88/554 (15%)
Query: 81 KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
KS +++++ GWND+ +H + +I PN+ LA +G+I N Y P CTPSRA++M
Sbjct: 6 KSDQPHIILIFVDDMGWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMM 64
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG YP TG Q ++ P GVPL + LPE +E+GYST +GKWHLGF + EY P
Sbjct: 65 TGLYPFKTGNQHQMVFNLHPSGVPLEFKLLPEKFKEVGYSTHMVGKWHLGFCKDEYLPTN 124
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGF+SH+G + YD ++ S + G+D R N+ + G Y + A
Sbjct: 125 RGFDSHYGIWTLGVGDYDK-MNGVLSPSA---GYDFRDNMGVVPKSNG-YLALMLGDRAE 179
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
++ + PLF LA KHLE P+E + I D R + + +DD
Sbjct: 180 HIVNNHYPGTPLF-----LAFTLDIPAKHLEIPEEYEEPYADIEDDRTRQFYGKLTMMDD 234
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+G V+ AL+ +GM ++S+IIF+ DNGA + + GSNYP+RG+ TL+EG
Sbjct: 235 IIGGVVDALKNRGMYDDSLIIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGAT 285
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
+VP+I+ I++ S ++ I D T+ AG
Sbjct: 286 RVPSIVSGKGIKKTGYESNELYSIVDIHRTILDVAG------------------------ 321
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMD 494
++ +DGL W + + PS RN +L NID+ + AA+R+ +KL+ G +
Sbjct: 322 AKAEPELDGLSMWPTWSEDKPSPRNEILYNIDDDPVSPGAAIRIGDYKLITGHPD----- 376
Query: 495 GYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP 554
L + I + Y + P S + ST Q+A P
Sbjct: 377 -----------LLYPYRDISLTDNYYNYGD------PPASG---LPSTDQEANA-----P 411
Query: 555 APMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
AP YLFN+ DP E+N+I++ RPDI +L + L HR L+P ++ +L
Sbjct: 412 APQNVE-------YLFNVVVDPEERNDISADRPDIVQELRDRLDEHRNLLIPPVNQTTEL 464
Query: 615 VQADPKRFNDTWSP 628
A FND WSP
Sbjct: 465 AGA-AGNFNDVWSP 477
>gi|357023853|ref|ZP_09086021.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
gi|355544286|gb|EHH13394.1| sulfatase [Mesorhizobium amorphae CCNWGS0123]
Length = 501
Score = 245 bits (626), Expect = 5e-62, Method: Compositional matrix adjust.
Identities = 153/402 (38%), Positives = 214/402 (53%), Gaps = 57/402 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D F+GS +IPTPN+D LA G L YA P+CTP+RA+LMTG+YP+ G+Q I
Sbjct: 71 GFGDAGFNGS-DIPTPNLDKLAAEGARLEQFYALPMCTPTRAALMTGRYPLRYGLQVGVI 129
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A +P+ E LP+ L++ GY+T +GKWHLG + E+ P RGF+ +G L G I
Sbjct: 130 PAAGTYSLPVDEYLLPQALKDTGYTTAMVGKWHLGHAKPEFWPRQRGFDYFYGALVGEID 189
Query: 212 YYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
++ H + D Y LN T +D LF EAV+++E PL
Sbjct: 190 HFKHSSHGVKDWYRNNKPLN--------ETGFDNT------LFGNEAVRVVERHEGKSPL 235
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA A H +APQ+ +++F+ I D NRR YAAM+ +DD VG V++AL++K
Sbjct: 236 FLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAAMISVMDDGVGKVVAALEKK 290
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWG-----SNYPYRGVKNTLWEGGVKVPAIL- 382
GM +N+II+F SDNG V+ + G SN PYR K TL+EGG +V A+
Sbjct: 291 GMRDNTIIVFHSDNGG--VKNSLFAGDSKVGGELPASNGPYRDGKGTLYEGGTRVAALAN 348
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W +I+ P V M+H+ D PTL A + +N
Sbjct: 349 WPGKIK--PGVVDSMVHVVDMYPTLAAVA----------------------DAKLEKNKP 384
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
+DG++ W +L PS R+ V+ N+D AVR WKLV
Sbjct: 385 LDGMNVWDALSEGKPSPRSEVVYNVD--PLAGAVRKGDWKLV 424
Score = 52.8 bits (125), Expect = 6e-04, Method: Compositional matrix adjust.
Identities = 27/61 (44%), Positives = 37/61 (60%), Gaps = 5/61 (8%)
Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
LF EAV+++E PLFLYLA A H +APQ+ +++F+ I D NRR YAA
Sbjct: 217 LFGNEAVRVVERHEGKSPLFLYLAFTAPHT-----PFQAPQDYLDRFKDIADDNRRKYAA 271
Query: 79 L 79
+
Sbjct: 272 M 272
>gi|405975640|gb|EKC40194.1| Arylsulfatase B [Crassostrea gigas]
Length = 484
Score = 244 bits (624), Expect = 7e-62, Method: Compositional matrix adjust.
Identities = 142/399 (35%), Positives = 209/399 (52%), Gaps = 52/399 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + TPNID LA G+ILN Y QP+C+PSR + MTG YP G+Q I
Sbjct: 37 GWNDVGYHNP-AMKTPNIDKLAREGLILNQTYFQPLCSPSRHAFMTGYYPYRAGLQHLVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P PL +FLP+ L++LGY+T +GKWHLG + TP YRGF+S FG+ +
Sbjct: 96 MPWQPVCSPLNMKFLPQRLKDLGYATHMVGKWHLGMCNWDCTPTYRGFDSFFGFYHAKAD 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI S +Y D R N + G Y+T FT A +I+ +PLFLY
Sbjct: 156 YYSHI-SYKYL--------DYRDNEKPVKNLNGTYSTFTFTSRAQDIIKKHNSSQPLFLY 206
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A H + L+ PQ+ + + RR Y+ MV LD+++G + L+ GM
Sbjct: 207 MAFPIPH-----EPLQVPQQYEDMYPNTMSKGRRQYSGMVSALDEAIGNITDQLRESGMY 261
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++++I+F SDNG ++Y G+NYP RG K T++EGG +V + + +Q+
Sbjct: 262 DDTLIVFTSDNGGSPMDY---------GNNYPLRGAKFTVYEGGTRVVSFVHGAGLQKTG 312
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
MMH DW TL AAGG+ P R+ DG+ QW++
Sbjct: 313 EAYESMMHAVDWSDTLVAAAGGE---------------------PDMRD---DGVSQWTA 348
Query: 452 LLLNTPSRRNSVLINID----EKKRTAAVRLDSWKLVLG 486
+ N+P R ++ N D ++ AA+R+ +KL+ G
Sbjct: 349 IAQNSPGPRTELVYNFDPDSAPEQGHAAIRVGDYKLIHG 387
Score = 45.8 bits (107), Expect = 0.065, Method: Compositional matrix adjust.
Identities = 25/61 (40%), Positives = 37/61 (60%), Gaps = 2/61 (3%)
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
LF + +DP EQN++A S+PDI +L + LL + L P PD ++DP F++ WS
Sbjct: 422 LFYIKDDPTEQNDLAQSKPDIVKKLSQRLLMLTKNALPPNYPLVPD-PKSDPTHFDNVWS 480
Query: 628 P 628
P
Sbjct: 481 P 481
>gi|410956991|ref|XP_003985119.1| PREDICTED: arylsulfatase J [Felis catus]
Length = 621
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 140/377 (37%), Positives = 204/377 (54%), Gaps = 25/377 (6%)
Query: 109 IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPE 168
+D LA G+ L N Y QP+CTPSR+ +TGKY IHTG+Q I +P +PL LP+
Sbjct: 80 LDKLAAEGVKLENYYVQPICTPSRSQFITGKYQIHTGLQHSIIRPTQPNCLPLDNATLPQ 139
Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
L+E+GYST +GKWHLGF+R+E P RGF++ FG L G YY H D + +
Sbjct: 140 KLKEVGYSTHMVGKWHLGFYRKECMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMC 195
Query: 229 GHDMRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
G+D+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+
Sbjct: 196 GYDLYENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQ 250
Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PT 346
AP ++ I + NRR YAAM+ LD+++ V AL+ G NSIII+ SDNG PT
Sbjct: 251 APGRYFEHYRSIININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPT 310
Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
GSN+P RG K T WEGG++ + SP ++ V +++HI+DW PT
Sbjct: 311 AG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPT 360
Query: 407 LYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN 466
L + A G +DG D W ++ + R+ +D L + + +
Sbjct: 361 LISLAEGQIDE-DTQLDGYDVWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYG 416
Query: 467 IDEKKRTAAVRLDSWKL 483
I +A+R+ WKL
Sbjct: 417 IWNTAIQSAIRVQHWKL 433
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 199 LYENDNAAWDYDNGLYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 253
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 254 RYFEHYRSIININRRRYAAM 273
>gi|260794509|ref|XP_002592251.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
gi|229277467|gb|EEN48262.1| hypothetical protein BRAFLDRAFT_206907 [Branchiostoma floridae]
Length = 487
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 143/405 (35%), Positives = 211/405 (52%), Gaps = 40/405 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + ++ TP +D LA+ G+ILN Y VCTPSR + MTG YP H G Q
Sbjct: 40 GWNDVGWH-NPDVKTPVLDKLAHEGVILNQSYVNYVCTPSRTAFMTGYYPYHAGSQHLVF 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ +G+P FLPE L++LGY+T +GKWHLGF +YTP YRGF+S +GY N
Sbjct: 99 LPQQAQGIPYNFTFLPEKLKDLGYATHMVGKWHLGFCNWKYTPTYRGFDSFYGYYNADED 158
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H+++ G D+R + G+Y T FT V +IE P D PLFLY
Sbjct: 159 YYTHVVA---------GGLDLRDDKEVVNTKNGQYGTYFFTDRMVDIIEKHPADTPLFLY 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L H + LE P+ N + + + NRR MV LD++VG V A++ G+
Sbjct: 210 LPFQNVH-----EPLEVPERFENIYMNVQNENRRKLLGMVSALDEAVGNVTMAMKNAGLW 264
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+++IF +DNG + G+NYP RG K TLWEGG + A + +Q+
Sbjct: 265 DNTLVIFTTDNGGWIIAS---------GNNYPLRGGKVTLWEGGTRGVAFVHGKMLQKTG 315
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+ +M+H DW PT+ AA +DG+ Q +++ PS R + +D
Sbjct: 316 YTNNEMIHAVDWFPTILAAA---GGTADDAMDGVSQLDTIMSGQPSPRTEFVYNIDD--- 369
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
P+++ ++ I T +R+ +KL+ G G DG+
Sbjct: 370 ---VFPTKQGAIRIIC-----TLCLRVGDYKLIEGVA--GKPDGW 404
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 24/61 (39%), Positives = 37/61 (60%), Gaps = 1/61 (1%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LFNL +DP E +N+A S PD ++ L +R++LVP + PD ++ PK + WS
Sbjct: 425 WLFNLKDDPTEHHNLADSMPDKLKEMQAKLAEYRKSLVPAINPPPD-PKSFPKNWGGAWS 483
Query: 628 P 628
P
Sbjct: 484 P 484
Score = 45.8 bits (107), Expect = 0.061, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 33/70 (47%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y T FT V +IE P D PLFLYL H + LE P+ N + + + N
Sbjct: 182 GQYGTYFFTDRMVDIIEKHPADTPLFLYLPFQNVH-----EPLEVPERFENIYMNVQNEN 236
Query: 73 RRTYAALTKS 82
RR + +
Sbjct: 237 RRKLLGMVSA 246
>gi|198417507|ref|XP_002121051.1| PREDICTED: similar to arylsulfatase B [Ciona intestinalis]
Length = 518
Score = 244 bits (623), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 139/360 (38%), Positives = 202/360 (56%), Gaps = 33/360 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S+H S + PN+ LA G+IL + YAQ CTPSRA+ +TG+YPI+TGMQ +
Sbjct: 41 GWNDVSWHNS-IVQMPNLQDLAERGVILEHAYAQEKCTPSRAAFLTGRYPINTGMQEEVV 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ G+P+ + LP YL++ GY+T IGKWH+G+ YTP RGF+SH+G+ N IS
Sbjct: 100 VATQMSGLPIEFKLLPSYLKDQGYATHMIGKWHVGYCDEAYTPTRRGFDSHYGFYNSGIS 159
Query: 212 YYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y YS T + G+D R +L+ G+Y T FT +A LI++ P+FL
Sbjct: 160 Y------SNYSSTEGTDVGYDYRDDLALNLAAEGKYTTTDFTDQAKTLIDNHDQTNPMFL 213
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYI-----TDPNRRTYAAMVKKLDDSVGTVISAL 325
Y+A+ A H P E ++ I D NR+TY M+ LD+ VG ++ L
Sbjct: 214 YMAYNAPH---------TPFEVEESYRDIYDGNLRDGNRKTYLGMISALDEQVGQLVDKL 264
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ GM N++ +F SDNG + ++ G+N+P RG K +L+EGG +VPA L +
Sbjct: 265 KEVGMWSNTVFVFYSDNGGTQPQSGQS------GNNFPLRGKKGSLFEGGYRVPAFLIAR 318
Query: 386 QIQQN----PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
N S + HISD T AGGD + +P +IDG++ W ++ NTPS R +
Sbjct: 319 TRAGNLELIASTSSTLFHISDMFATFIALAGGDAA-VPADIDGINAWPAISGNTPSPRTT 377
>gi|345316675|ref|XP_001517879.2| PREDICTED: arylsulfatase B-like [Ornithorhynchus anatinus]
Length = 782
Score = 244 bits (622), Expect = 1e-61, Method: Compositional matrix adjust.
Identities = 165/499 (33%), Positives = 241/499 (48%), Gaps = 82/499 (16%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q IW +P VPL E+ LPE L+E GY+T +GKWHLG +R+E P RGF+S
Sbjct: 354 IHTGLQHQIIWACQPNCVPLDEKLLPELLKEAGYATHMVGKWHLGMYRKECLPTRRGFDS 413
Query: 202 HFGYLNGVISYYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
+FGYL G YY H S V D R A Y+T++F K AV LI
Sbjct: 414 YFGYLLGSEDYYSHERCVLIRSLNVTRCALDFRDGEEVAVGYKNMYSTNVFAKRAVDLIA 473
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
+ P DKPLFLYLA + H + L+ P+E + + +I + RR YA MV +D++VG
Sbjct: 474 NHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSFIQNKKRRNYAGMVSIMDEAVGN 528
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V +AL+R G+ +N+++IF +DNG T+ G+N+P RG K TLWEGGV+
Sbjct: 529 VTAALKRHGLWDNTVLIFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRGIG 579
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+ SP ++Q + +++HISDWLPTL AGG + +DG D W ++ +PS R
Sbjct: 580 FVASPLLKQKGVKNRELIHISDWLPTLVNLAGGHVNGT-KPLDGFDVWKTISEGSPSPRV 638
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEK----------KRTAAVRLDSWKLVLGTQEN 490
+ +D + +P +R+ V + D AA+R +WKL+ G
Sbjct: 639 ELLHNID--PMFVDPSPCQRSKVTPSKDTSFPQDYSTFNTSIHAAIRHGNWKLLTGYPGC 696
Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC 550
G N F P S + ++
Sbjct: 697 G------------------------------------NWFPPPSLL----------SVSS 710
Query: 551 GANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE 610
+P P T S +LF++ DP E+++++ P I +L L+Y+ + VP +
Sbjct: 711 IESPDPPTKS------LWLFDIDRDPEERHDLSREHPSIVKKLLARLRYYSQDSVPIRYP 764
Query: 611 QPDLVQADPKRFNDTWSPW 629
D DPK W PW
Sbjct: 765 DED-PYCDPKS-TGVWGPW 781
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++F K AV LI + P DKPLFLYLA + H + L+ P+E + + +I + RR
Sbjct: 459 YSTNVFAKRAVDLIANHPPDKPLFLYLAFQSVH-----EPLQVPEEYVKPYSFIQNKKRR 513
Query: 75 TYAALT 80
YA +
Sbjct: 514 NYAGMV 519
>gi|171909641|ref|ZP_02925111.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 486
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 148/411 (36%), Positives = 214/411 (52%), Gaps = 48/411 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ F+G +I TP++DALA G Y QP+CTP+RA+LMTG+YP G+Q I
Sbjct: 41 GWQDVGFNGCKDIQTPHLDALAKGGARFTQFYVQPMCTPTRAALMTGRYPFRYGLQTAVI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E LP+ L++ GY+T IGKWHLG +++ P RGFE +G + G +
Sbjct: 101 PSVSTYGLDTGEYLLPQCLQDAGYTTAIIGKWHLGHADKKFWPKQRGFEYQYGAMIGELD 160
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H E D R+ + Y T+L +AV+ +E Q D+P +LY
Sbjct: 161 YYTH---------SEHGVLDWFRDNEPVHEE--GYTTNLLGADAVKYLEKQKADRPFYLY 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
LA A H +APQE I+++ +I DP RRTYA MV LDD++G V++AL +KG+
Sbjct: 210 LAFNAPHT-----PYQAPQEYIDRYTHIADPTRRTYAGMVACLDDNIGRVVAALDKKGLR 264
Query: 332 ENSIIIFMSDNGA---PTVE--YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
EN++I+F SDNG P + S + N PYR K +L+EGGV+V A+ W
Sbjct: 265 ENTLILFHSDNGGTHNPMFAGVMADVSKIKIPCDNSPYRDGKGSLFEGGVRVCALANWPG 324
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+I+ P ++H +D T AG TSR +DG
Sbjct: 325 KIK--PATVDGLIHATDIFTTFTKLAGASTSRC----------------------KPLDG 360
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+D W ++ PS R V+ N++ + AA+R WKL+ T ++D Y
Sbjct: 361 VDAWPTMAEGRPSLREEVVYNVEPYR--AALRQGDWKLIWRTTLPSSVDLY 409
Score = 57.4 bits (137), Expect = 2e-05, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T+L +AV+ +E Q D+P +LYLA A H +APQE I+++ +I DP RR
Sbjct: 184 YTTNLLGADAVKYLEKQKADRPFYLYLAFNAPHT-----PYQAPQEYIDRYTHIADPTRR 238
Query: 75 TYAALT 80
TYA +
Sbjct: 239 TYAGMV 244
>gi|403049780|ref|ZP_10904264.1| sulfatase [SAR86 cluster bacterium SAR86D]
Length = 515
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 143/349 (40%), Positives = 204/349 (58%), Gaps = 25/349 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+S+H + IPTPNID+ NGI LN YA P C+P+RASL+TG + + G+ P +
Sbjct: 29 GWGDVSYH-NGFIPTPNIDSFVSNGIELNRFYANPTCSPTRASLLTGLHIFNHGVIRPFM 87
Query: 152 W-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
AE G+P + +PEY +E GY T GKWHLG + EY P RGF+S +G++ G I
Sbjct: 88 NPSAEQTGLPEHLKIMPEYFKEAGYQTALSGKWHLGMHKEEYLPTNRGFDSSYGHMLGGI 147
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YYDH+ T ++ H R +S D Y+T+L EA+ +I+++ D+PLFL
Sbjct: 148 GYYDHV------HTNRMDWH--RDGVSLNEDG---YSTELIADEAINIIKNKDDDRPLFL 196
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A A H +EAP+E +N F YI D R YAA + KLD +G +I++++ G+
Sbjct: 197 YVAFNAPHT-----PIEAPEEDVNNFLYIEDELDRNYAANISKLDIEIGRIINSIKDLGI 251
Query: 331 LENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
LE +IIIF+SDNG P V+ + GS RG K + +GG++VPA +W
Sbjct: 252 LEETIIIFLSDNGPVFDINPIVKTIAPGLTKAKGSTAGLRGSKTSALDGGIRVPAAIWWK 311
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
+ + + S Q + I D LPTL TA G L IDG+D+WS+L+ N
Sbjct: 312 GVLEKSK-SDQFIFIQDLLPTLLTATGIGYDE-NLLIDGVDRWSNLITN 358
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 40/64 (62%), Gaps = 5/64 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T+L EA+ +I+++ D+PLFLY+A A H +EAP+E +N F YI D R
Sbjct: 172 YSTELIADEAINIIKNKDDDRPLFLYVAFNAPHT-----PIEAPEEDVNNFLYIEDELDR 226
Query: 75 TYAA 78
YAA
Sbjct: 227 NYAA 230
>gi|430746415|ref|YP_007205544.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018135|gb|AGA29849.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 243 bits (619), Expect = 3e-61, Method: Compositional matrix adjust.
Identities = 147/405 (36%), Positives = 213/405 (52%), Gaps = 52/405 (12%)
Query: 82 STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYP 141
+ L L GW D+ +H S EI TP++D LA +G L Y QPVC+P+RA+LMTG+YP
Sbjct: 38 NVVLILADDMGWGDVGWHDS-EIKTPHLDKLAASGTRLEQFYVQPVCSPTRAALMTGRYP 96
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G+Q + G+PL ER LP+ L+E+GY T GKWHLG F+ EY P +RGF+
Sbjct: 97 MRHGLQVGVVRPWAQYGLPLNERTLPQALKEVGYETAICGKWHLGHFQPEYLPTHRGFDH 156
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
+G+ NG + Y+ HI G D R+ D Y+T L +EA ++I
Sbjct: 157 QYGHYNGALDYFTHIRD---------GGFDWHRDDRVNRDE--GYSTHLIGREATRIIGH 205
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
KPLFLY+ A HA + + P++ + + +P RRTYA M+ +D++VG +
Sbjct: 206 HDTSKPLFLYVPFNAVHAPH-----QVPEKYKEPYSKLKEP-RRTYAGMLAAMDEAVGQI 259
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
SA+ KGM N+I +F SDNG P SN P RG K TL+EGGV+VPA
Sbjct: 260 ASAIDGKGMRSNTIFLFSSDNGGPAPG--------QVTSNGPLRGQKGTLYEGGVRVPAF 311
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+ W ++ V+ +HI D PTL T AG P ++
Sbjct: 312 ISWEGHLRPGTVVNAP-LHIVDLFPTLLTLAG----------------------APLEQS 348
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
DG D W+++ + PS +++L+N + A+R+ WKL+L
Sbjct: 349 LPTDGRDAWTAIAHHGPSPHDAILLNTTPD--SGAIRVGDWKLIL 391
>gi|291225017|ref|XP_002732505.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 497
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 168/539 (31%), Positives = 256/539 (47%), Gaps = 84/539 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H +I PN+ LA +G+I Y P CTPSRA++MTG YP TG Q +
Sbjct: 38 GWNDVHWHNP-DIAMPNLMDLADDGVIFEQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P GVPL + LPE L+E+GY+T +GKWHLGF + EY P RGF+SH+G +
Sbjct: 97 FNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YD L+ S + G+D R NL + G Y + A +++ + + PLFL
Sbjct: 157 DYDK-LNGVLSPSA---GYDFRDNLEVVPKSDG-YLALMLGDRAEEIVNNHSPETPLFLV 211
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
KHLE P+E + I D R + + +DD +G V+ AL+ + M
Sbjct: 212 FT-----LDIPAKHLEIPEEYEELYADIEDDRTRQFYGKLTMMDDIIGGVVDALKDRDMY 266
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
++S+ IF+ DNGA + + GSNYP+RG+ TL+EG +VP+I+ I++
Sbjct: 267 DDSLFIFIGDNGALSSQA---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIEKTG 317
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
S ++ I D T+ AG ++ +DGL W +
Sbjct: 318 YKSNELYSIVDIHRTILDVAG------------------------AKAEPELDGLSMWPT 353
Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
+ PS RN ++ NID+ + AA+R+ +KL+ G + L
Sbjct: 354 WSEDEPSPRNEIIYNIDDDPVSPGAAIRIGDYKLITGHPDL----------------LYP 397
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ I + Y + P S + +T Q+A PAP YL
Sbjct: 398 YRDIRLTDNYYNYGD------PPASG---LPTTDQEANA-----PAPQNVE-------YL 436
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
FN+ +DP E+++I++ DI L + L HR+ L P ++ +L + FND WSP
Sbjct: 437 FNIVDDPEERHDISADNQDIVQDLRDRLDEHRKLLHPPVNQTTELAGS-AANFNDVWSP 494
>gi|291227809|ref|XP_002733875.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 505
Score = 242 bits (617), Expect = 5e-61, Method: Compositional matrix adjust.
Identities = 176/557 (31%), Positives = 259/557 (46%), Gaps = 109/557 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ + + + TPNID LA G+ L YAQ C PSR+ LM G+Y IHTG +
Sbjct: 37 GYFDVGYREGSIVKTPNIDKLAAEGVKLERYYAQSSCMPSRSCLMMGRYQIHTGFDYRCL 96
Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G R + LP LR+ GY+T IGKWHLG R+E P ++GF++ FGY
Sbjct: 97 DGQLTRLCMAPDTVTLPMKLRQYGYATHMIGKWHLGHERKECVPTHKGFDTFFGYHGAAE 156
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+YY H + HD+ RN+ + A D G+Y+T +TKEA +I++ KP F
Sbjct: 157 NYYTH------TALGRPRCHDLWRNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFF 210
Query: 270 LYLAHLAAHAGNAGKHLEAPQE-TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
+YL++ A H LE P + + D +R+ YAAM +D+ +G + L+
Sbjct: 211 MYLSYQAVHG-----PLEVPSNYRMPYYTSGVDESRQIYAAMATCMDEGIGNITETLKAS 265
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
GML N+I++F SDNGA + G N+P RG K T+WEGGV+V A + SP ++
Sbjct: 266 GMLNNTILVFSSDNGA---------QFHTGGRNWPLRGGKGTMWEGGVRVLAFVNSPLLK 316
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
++ V+ +M+H SDW PT AGG + L DG+DQW ++ P++R
Sbjct: 317 KSRTVNHEMIHTSDWFPTFVRLAGGTVDDVEL--DGVDQWDTISTGAPTKRQ-------- 366
Query: 449 WSSLLLNTPSRRNSVL-INID--------EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQ 499
LL+ RR ++ I D + R A++ + WKLV GQ
Sbjct: 367 --ELLIQISGRRGALEDIGKDPWITNDYFDITRAASIIVGRWKLVT------------GQ 412
Query: 500 TRSNKV----PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA 555
RS K P LN+ P +I
Sbjct: 413 YRSPKTWVPPPELNY---------------------PEDHI------------------- 432
Query: 556 PMTPSPCTNGP-CYLFNLGNDPCEQNNIASSRPDISSQLYELLK--YHRRTLVPQSHEQP 612
M PS G +LF+L D E+N+++S RP+I +L + L+ Y S P
Sbjct: 433 -MPPSTEILGKMVWLFDLEEDQNEKNDLSSCRPEIVIKLLKRLRDYYSHSVFCGSSSYSP 491
Query: 613 DLVQADPKRFNDTWSPW 629
+ Q + W P+
Sbjct: 492 NCAQR-----GNQWGPY 503
Score = 42.0 bits (97), Expect = 0.91, Method: Compositional matrix adjust.
Identities = 27/79 (34%), Positives = 43/79 (54%), Gaps = 7/79 (8%)
Query: 3 RNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE- 60
RN+ + A D G+Y+T +TKEA +I++ KP F+YL++ A H LE P
Sbjct: 174 RNMENVAEDYDGQYSTLFYTKEAQDIIKNHDKKKPFFMYLSYQAVHG-----PLEVPSNY 228
Query: 61 TINQFQYITDPNRRTYAAL 79
+ + D +R+ YAA+
Sbjct: 229 RMPYYTSGVDESRQIYAAM 247
>gi|158424485|ref|YP_001525777.1| twin-arginine translocation pathway signal [Azorhizobium
caulinodans ORS 571]
gi|158331374|dbj|BAF88859.1| twin-arginine translocation pathway signal precursor [Azorhizobium
caulinodans ORS 571]
Length = 490
Score = 240 bits (613), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 149/401 (37%), Positives = 215/401 (53%), Gaps = 57/401 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ FHGS+ I TPN+D LA G L Y QP+CTP+RA+LMTG+YP+ G+Q I
Sbjct: 58 GFADVGFHGSD-IKTPNLDKLAATGATLGQFYTQPMCTPTRAALMTGRYPLRYGLQTGVI 116
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E LP+ L+ +GYST IGKWHLG ++++ P RGF+ +G L G I
Sbjct: 117 PSGASYGLATDEFLLPQALKSVGYSTALIGKWHLGHAKQDFWPRQRGFDYFYGPLVGEID 176
Query: 212 YYDHILSDQYSRTVELNG-----HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
+Y H E +G D ++ + +D T+LF +A +LI
Sbjct: 177 HYKH----------EAHGVVDWYRDNKQVVEEGYD------TELFGTDAARLIGAHDPKT 220
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLYLA A H +APQ ++++ ITDP RR YAAM+ +DD +G V++AL+
Sbjct: 221 PLFLYLAFTAPH-----TPFQAPQAYVDRYPNITDPARRLYAAMISAMDDQIGKVVAALE 275
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL- 382
++GM EN++I+F SDNG + + SN PYR K TL+EGG +V A+
Sbjct: 276 QRGMRENTLIVFHSDNGGTRSKMFVGEGAFHGELPASNAPYRDGKGTLYEGGTRVAALAN 335
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W +I P + +MH+ D +PTL AG N+D R+
Sbjct: 336 WPGRI--TPGAADGVMHVVDMMPTLARLAGA-------NLD---------------RSKP 371
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
+DG+D WS+L S R+ V+ N++ + AVR +WKL
Sbjct: 372 LDGVDVWSALSSARTSPRDGVVYNVEPTQ--GAVREGNWKL 410
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/68 (41%), Positives = 38/68 (55%), Gaps = 5/68 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T+LF +A +LI PLFLYLA A H +APQ ++++ ITDP RR
Sbjct: 200 YDTELFGTDAARLIGAHDPKTPLFLYLAFTAPH-----TPFQAPQAYVDRYPNITDPARR 254
Query: 75 TYAALTKS 82
YAA+ +
Sbjct: 255 LYAAMISA 262
>gi|219110117|ref|XP_002176810.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
gi|217411345|gb|EEC51273.1| arylsulfatase [Phaeodactylum tricornutum CCAP 1055/1]
Length = 564
Score = 240 bits (612), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 179/566 (31%), Positives = 257/566 (45%), Gaps = 113/566 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G +DL H ++ I TP+ D LA +G+ L+ Y P C+P+RASL++G+YP+HTG +
Sbjct: 75 GSHDLGIHENSGIQTPHADQLARDGLYLDQYYVLPYCSPTRASLLSGRYPLHTGCH-TIV 133
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E +G+PL E LP+ LR GY A+GKWH+G R TP +RGF+S FG+ G
Sbjct: 134 NDWETQGLPLDEETLPQVLRRAGYQAHAVGKWHVGHSRWTQTPTFRGFQSFFGFYLGAQD 193
Query: 212 YYDHILSDQYSRTVELNGHDMR----RNLSTAWDTVGEYATDLFTKEAVQLIEDQPV--D 265
Y HI + E++ D R R+ S D G Y+T +FT+EA+++IE+ P
Sbjct: 194 YNTHIKQGERGNAYEMH-WDARGKCGRDCSRLVDERGNYSTHVFTREAIRVIENHPQRPH 252
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVGTV 321
+PLFLYLAH A H + + P+ ++ T N R+TYA M+ D+S+G V
Sbjct: 253 EPLFLYLAHQAVHWPD-----QVPETYRKFYEGATYSNWTDQRKTYAGMLSAADESIGNV 307
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
ALQ GM EN++++F +DNG PT + SNYP RG K T++EGG
Sbjct: 308 TKALQDAGMWENTLVVFTTDNGGPTAVCAAQGS-----SNYPKRGGKCTVYEGGTTGDGF 362
Query: 382 LWSPQIQQNPR-----------VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+ P + R + ++ H+ DWLPTL G + PL DG++QW S
Sbjct: 363 VSGPAWNKVARSRKKEYSETLELYSKVFHVVDWLPTLARMTGATPNGKPL--DGVNQWDS 420
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN---IDEKKRTAAVRLDSWKLVLGT 487
+L PS P R V + + A+R WKL+ G
Sbjct: 421 MLQREPS------------------APPPREEVFVGYAYFGNQWYGPAIRYKHWKLIQG- 461
Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
+S ++ L FLP
Sbjct: 462 ---------------------------QSGGPETSHDLPPGSFLPAP------------- 481
Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-ELLKYHRRTLVP 606
G P G L++L +DP E NIASS P I L +L++YH + P
Sbjct: 482 ---GGAP----------GEYQLYDLQSDPSETQNIASSYPLIVQILQGKLIEYHASFVPP 528
Query: 607 QSHEQ--PDLVQADPKRFNDTWSPWI 630
S++ P + F TW PW
Sbjct: 529 ISNDPTCPFTGTTNTSTFGPTWLPWC 554
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 32/47 (68%), Gaps = 2/47 (4%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPV--DKPLFLYLAHLAAH 47
R+ S D G Y+T +FT+EA+++IE+ P +PLFLYLAH A H
Sbjct: 219 RDCSRLVDERGNYSTHVFTREAIRVIENHPQRPHEPLFLYLAHQAVH 265
>gi|291236518|ref|XP_002738186.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 473
Score = 239 bits (611), Expect = 2e-60, Method: Compositional matrix adjust.
Identities = 172/546 (31%), Positives = 255/546 (46%), Gaps = 114/546 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +H S EI TPNID LA G+ L N Y P+CTPSRA LMTG+Y IH+GMQ +
Sbjct: 35 GWHDVGYHDS-EIQTPNIDMLAAEGVKLENYYVTPLCTPSRAVLMTGRYLIHSGMQHGVL 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
PR +P E LP+ L++ GYST +GKWHLGF + + TP +RGF++ FG+ N
Sbjct: 94 VAQNPRCLPTDEILLPQMLKDSGYSTHMVGKWHLGFCKFQCTPNHRGFDTFFGWYNAA-- 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++S S G + R + +T P+F+Y
Sbjct: 152 ----VISPFISCRRRSAGCEFRVSHTTV---------------------------PMFIY 180
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
L A HA L+ P + + ++ I D +RRTYA M +D+++G + L+ KG+
Sbjct: 181 LPFQAVHAP-----LQVPDKYSDMYKDTIHDESRRTYAGMTSCMDEAIGNITRTLKDKGI 235
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
N++I+F +DNG R +G SNYP RG K + +EGG++ PA + SP +
Sbjct: 236 WNNTVIVFSTDNGGA----------RTFGASNYPLRGQKASNFEGGIRGPAFVSSPLLDP 285
Query: 390 NPRVSL--QMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
R ++ ++M+I DW PT AGG PL DG++QW ++ PS+R + +
Sbjct: 286 LVRGTINNELMYIGDWFPTFMNLAGGHVIGSKPL--DGINQWETISRGVPSKRVEIVYNI 343
Query: 447 DQ-WSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNK 504
D W S R++ + N + +AA+R+ WKL+ G +TR
Sbjct: 344 DPLWPGC---KSSTRDTWIGNPYFDICVSAAIRVGHWKLLTGHTGCSNW-----ETRPEI 395
Query: 505 VPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTN 564
P +S+T +S +
Sbjct: 396 AP--------KSRTVESSD----------------------------------------D 407
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
G L+++ D E+ N+ RPDI L + L YH T VP V+ADP R
Sbjct: 408 GMVRLYHIKEDAEEKVNLVHYRPDIVFMLLQRLHYHNTTAVPPGEVDKFSVRADPARHGM 467
Query: 625 TWSPWI 630
W+PWI
Sbjct: 468 AWNPWI 473
>gi|328726458|ref|XP_001949756.2| PREDICTED: arylsulfatase B-like [Acyrthosiphon pisum]
Length = 402
Score = 239 bits (609), Expect = 4e-60, Method: Compositional matrix adjust.
Identities = 155/424 (36%), Positives = 223/424 (52%), Gaps = 48/424 (11%)
Query: 227 LNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG 283
++G D RR A + + YATD+FT+EA ++IE + +FL ++HLA H+GN+G
Sbjct: 1 MSGIDARRGFERAGNEMDRDRYATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSG 60
Query: 284 -KHLEAPQETINQ--FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
HLE +T N F YI + NRR YA MV LDDSVG +I +L MLENSI++FMS
Sbjct: 61 PNHLEVLNKTYNDEAFGYIENENRRLYAGMVTSLDDSVGKIIESLHEHEMLENSIVVFMS 120
Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHI 400
DNGAPT + + N+GSN+P RG K ++ EGGV+ A +WSP +++ VS + HI
Sbjct: 121 DNGAPTDD--PLWGHDNFGSNWPLRGKKASVLEGGVRGVAAIWSPWLKEKNTVSNNLFHI 178
Query: 401 SDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRR 460
+DWLPTLYTAAGG+ L IDG++QW SL + R
Sbjct: 179 TDWLPTLYTAAGGNFEDL----------------------GKIDGVNQWKSLTDTSKQVR 216
Query: 461 NSVLINIDEKKRTAAVRLDSWKLVLGTQ-ENGTMDGYYGQTRSNKVPLLNFNAIVESKTY 519
+ +LINIDE + A+ + WK+V + N + Y N P N + +V S
Sbjct: 217 SKILINIDETRGEEALIFNQWKVVKSNRTSNMVFNLKYAGDPGNAGPKYNMSGVVASTVG 276
Query: 520 QSLQQ---LSQNIFLPISNI-DKMRSTRQQATIH---CG---ANPAPMTPSPCTNGPCYL 569
+ L + L+Q+ + S D +R A + C +NP P C +G C L
Sbjct: 277 KRLSKINCLTQDSCMNASTARDVFYVSRSLAKVDGQTCPERVSNPNPDQQYECFDGYC-L 335
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLV---QADPKRFNDTW 626
F++ DPCE N+A P +L+ ++ + Q+ PD + ADP+RF W
Sbjct: 336 FDILQDPCEYRNVAKQNPQTLDATIHMLEQFKKEMTMQA---PDPIIDPDADPRRFAGYW 392
Query: 627 SPWI 630
PW+
Sbjct: 393 EPWL 396
Score = 60.5 bits (145), Expect = 2e-06, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 46/72 (63%), Gaps = 4/72 (5%)
Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAG-KHLEAPQETINQ--FQYITD 70
YATD+FT+EA ++IE + +FL ++HLA H+GN+G HLE +T N F YI +
Sbjct: 22 YATDVFTEEARKIIESSKRENTEMFLMVSHLAVHSGNSGPNHLEVLNKTYNDEAFGYIEN 81
Query: 71 PNRRTYAALTKS 82
NRR YA + S
Sbjct: 82 ENRRLYAGMVTS 93
>gi|390362801|ref|XP_793927.3| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 739
Score = 239 bits (609), Expect = 5e-60, Method: Compositional matrix adjust.
Identities = 156/508 (30%), Positives = 239/508 (47%), Gaps = 107/508 (21%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q I A+P +PL E LP+ L+E GY+T +GKWH+GF+ TP RGF+S
Sbjct: 318 IHTGLQHGVIRHAQPNCLPLDEVTLPQKLKESGYATNMVGKWHIGFYLDACTPTERGFDS 377
Query: 202 HFGYLNGVISYYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
FG+L G +Y H + + T L+GHD+ N + +D G+Y+T LFT + +
Sbjct: 378 FFGFLVGGEDHYTHSGHCEVYNADDGSTKTLSGHDLHANKTPVFDYEGQYSTHLFTNKTI 437
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
IE KPLF+YLA+ A H L+ P ++ ++ I D NRRTYA MV +D+
Sbjct: 438 DAIERHDKTKPLFMYLAYQAVHLP-----LQVPDRYMDPYKNIADENRRTYAGMVSCMDE 492
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+G V AL+ G+ +N+IIIF +DNG P ++ +N+P RG K +WEGG+
Sbjct: 493 GIGNVTRALKDAGLYDNTIIIFSTDNGGPI---------QSGANNWPLRGSKGNMWEGGI 543
Query: 377 KVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
+ SP + + + V+ +MH+SDWLPT+ G ++G
Sbjct: 544 HGVGFVHSPLLPTSVKGTVNHDLMHVSDWLPTIVAGIAGGA------LNG---------- 587
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT-----------AAVRLDSWKL 483
+DG D W+S+ S R +L NID KK+ AA+R+ WK+
Sbjct: 588 -----TKPLDGFDVWNSISGTASSPRTEILHNIDLKKKQPANEKFDTSIRAALRVGDWKI 642
Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL--PISNIDKMRS 541
G + T + N+++ P S +D +
Sbjct: 643 KTGADYDVT---------------------------------THNVWMPPPESGLDTIYP 669
Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
T + N +LFN+ DP EQ +++ PD+ ++L + L+ +
Sbjct: 670 TEK------------------PNQKVWLFNITADPLEQTDLSEKYPDVVNELVQKLEEYY 711
Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
VP + DL +ADP +W PW
Sbjct: 712 EGSVPMRYPDEDL-EADPALRGGSWGPW 738
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 44/80 (55%), Gaps = 5/80 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+ N + +D G+Y+T LFT + + IE KPLF+YLA+ A H L+ P
Sbjct: 413 LHANKTPVFDYEGQYSTHLFTNKTIDAIERHDKTKPLFMYLAYQAVHLP-----LQVPDR 467
Query: 61 TINQFQYITDPNRRTYAALT 80
++ ++ I D NRRTYA +
Sbjct: 468 YMDPYKNIADENRRTYAGMV 487
>gi|291241933|ref|XP_002740864.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 496
Score = 238 bits (607), Expect = 8e-60, Method: Compositional matrix adjust.
Identities = 168/551 (30%), Positives = 256/551 (46%), Gaps = 103/551 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H S I TP ID LA +G+ LNN Y C PSR L++G++ I G+Q I
Sbjct: 36 GWNDVGYHNS-YIKTPTIDMLAKSGVRLNNYYVASHCVPSRNMLISGRHVIDIGLQHGEI 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G PRG+PL E + + L+E+GY+T IGKWH G + P RGF++ FGYL
Sbjct: 95 -GYYPRGLPLDEFTIADKLKEIGYATHLIGKWHCGCYSNHSLPHNRGFDTFFGYLGSSDD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+Y HI+ + LN + + G+Y+T ++ EA +I +KPLFL
Sbjct: 154 HYTHIIMSNGLADLRLN------DECVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLM 207
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
LA A H K ++ P + ++ I D RRTYA M +D+++ + AL+ KGM
Sbjct: 208 LAFSAVH-----KPIQVPDVYMQYYKSTIHDNERRTYAGMASCVDEAIANITEALENKGM 262
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+NS+I+ +DNG +V G+N+P RG KN+ WEGGV+ A ++S + +
Sbjct: 263 LKNSVIVLTTDNGGGSV-----------GNNWPLRGRKNSHWEGGVRGVAFVYSDLLPMD 311
Query: 391 PRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
R + ++MHI+DW PTL GG+ P R N+ G+DQ
Sbjct: 312 VRGTENSELMHITDWFPTLVKLGGGN---------------------PCER-KNLYGVDQ 349
Query: 449 WSSLLLNTPSRRNSVLINID------EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
W + PS R VLI +D E+ RT + D + + + +
Sbjct: 350 WEMIRGMEPSARGEVLIALDTTRKASEEYRTGPFKYDKFDITT-----------FAAIKM 398
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPI--SNIDKMRSTRQQATIHCGANPAPMTPS 560
K LL ++ TY++ + + I + D R R
Sbjct: 399 GKWKLLTGDS-----TYRNFWDMPDPEYGVIDDEDFDDTRMVR----------------- 436
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQ-LYELLKYHRRTLVPQSHEQPDLVQADP 619
L++L +DP E+ +++ R DI + L +L Y + + PQ +P L
Sbjct: 437 --------LYDLVDDPSERTDVSEERQDIVHEMLAKLDGYEKAAVPPQMQTKPRL----D 484
Query: 620 KRFNDTWSPWI 630
+ PWI
Sbjct: 485 NHVLECMGPWI 495
Score = 40.8 bits (94), Expect = 2.4, Method: Compositional matrix adjust.
Identities = 24/79 (30%), Positives = 37/79 (46%), Gaps = 6/79 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ G+Y+T ++ EA +I +KPLFL LA A H K ++ P + +
Sbjct: 173 CVGYKYFGDYSTIMYANEAKNIIAQHDENKPLFLMLAFSAVH-----KPIQVPDVYMQYY 227
Query: 66 QY-ITDPNRRTYAALTKST 83
+ I D RRTYA +
Sbjct: 228 KSTIHDNERRTYAGMASCV 246
>gi|313235834|emb|CBY19819.1| unnamed protein product [Oikopleura dioica]
Length = 518
Score = 238 bits (606), Expect = 9e-60, Method: Compositional matrix adjust.
Identities = 166/577 (28%), Positives = 268/577 (46%), Gaps = 98/577 (16%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
L +S + L++ +GW D HGS ++ TPN+DA+A +GI+L Y Q VC+P+R+SL
Sbjct: 13 CLVQSKNVVYLMIDDFGWGDFGVHGS-KLETPNLDAIARDGILLEKYYTQQVCSPTRSSL 71
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG+YPI GMQ I + G+P LP+ L+ G++T +GKWH G P
Sbjct: 72 LTGRYPIRYGMQHNVILAGQTTGIPKEYALLPQDLKSCGFATHMVGKWHCGHSHEYMLPF 131
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
RGF+S++GYL G +Y I + E G D T G Y T++++ +
Sbjct: 132 NRGFDSYYGYLQGAEDHYSRI----QCQAKEWCGVDFCTENGPTNSTWGTYGTEIYSAQV 187
Query: 256 VQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
Q+++ +KP FLY A H L+AP+ +F +I DP+RRTYAAMV +
Sbjct: 188 AQVLDKVSKEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIEDPDRRTYAAMVSIM 242
Query: 315 D-DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
D ++VG ++ L + + ++++II +DNG T R+ G+N+P R K +L+E
Sbjct: 243 DEETVGDFMTMLHERNLYDDTLIIITADNGGET---------RSGGNNFPLRSQKWSLYE 293
Query: 374 GGVKVPAILWSPQIQQNPRVSLQ-MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
GGV V A+L I+ + + ++H+SDW T+ A ++DG+ QW ++L
Sbjct: 294 GGVHVNAMLSGGLIKDQSKSTFDGLVHVSDWRETILEAMECKPPADKSDLDGISQWQAIL 353
Query: 433 LNTPSRRNSNIDGLD-----------QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
N R S + +D QW L S NI+ + +A+R W
Sbjct: 354 QNASPPRQSIVHNIDPIKTVKGTDERQWVQKLAADAS------FNINSQ---SAIRYGRW 404
Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
KL+ G G DG + P+ + +++ ++S + L++ +
Sbjct: 405 KLLTGDPAFGVPDGNIPPPEDAR-PV--WQKLLDDSMFESSETLTRLV------------ 449
Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
L+++ DP E+++I+ + PDI +LL H
Sbjct: 450 --------------------------QLYDIEEDPFEKHDISDNHPDIVKIGLDLLSRHN 483
Query: 602 RTLVPQSHEQPDLVQADPKR--------FNDTWSPWI 630
T V P+ DP F W PW+
Sbjct: 484 ETAV-----WPNFPHTDPNSNPETRSDGFAGFWWPWV 515
Score = 47.8 bits (112), Expect = 0.020, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 40/71 (56%), Gaps = 6/71 (8%)
Query: 11 TVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
T G Y T++++ + Q+++ +KP FLY A H L+AP+ +F +I
Sbjct: 174 TWGTYGTEIYSAQVAQVLDKVSKEEKPFFLYYAMQNVH-----DPLQAPEHYKIKFDWIE 228
Query: 70 DPNRRTYAALT 80
DP+RRTYAA+
Sbjct: 229 DPDRRTYAAMV 239
>gi|242002216|ref|XP_002435751.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215499087|gb|EEC08581.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 433
Score = 236 bits (601), Expect = 4e-59, Method: Compositional matrix adjust.
Identities = 169/493 (34%), Positives = 233/493 (47%), Gaps = 73/493 (14%)
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+TGMQ + EPRG+PL + LPE+L +LGY K L +E +Y
Sbjct: 1 YTGMQHFVLMEMEPRGLPLNFKLLPEWLGDLGYVHNQTTKQTL--LHKELRFVYA----- 53
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
F YL G D RR L + G Y T+L T+EA+++IE
Sbjct: 54 FPYLR--------------------RGFDFRRGLLGGREFDGRYYTELMTEEAIRIIETH 93
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
P++KPLFLYL+HLA H+GN L+ P++ + + I NR YA MV LD+SVG V+
Sbjct: 94 PLEKPLFLYLSHLAPHSGNLRDPLQVPKKFSDMYHDIGSRNRTIYAGMVSALDESVGAVV 153
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL R+GML +++++F SDNGA N+ S +P++G K T WEGGV+ PAI+
Sbjct: 154 EALGRRGMLGDTVLVFSSDNGA-----NSRGENPNYASAWPFKGQKITPWEGGVRAPAII 208
Query: 383 WSPQIQ-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
WSP + R + HISDWLPTLY AGGD S L
Sbjct: 209 WSPSLSGTQGRDYSNIFHISDWLPTLYQLAGGDPSDL----------------------G 246
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
+IDG+ Q +L T + R VLINID A+ +KLV GT + G +D ++ Q
Sbjct: 247 DIDGISQLEALRHGTEAPRKEVLINIDPIDNVTAIIEGHFKLVSGTVKGGALDDWF-QIP 305
Query: 502 SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA---TIHCGANPAPMT 558
N A +T + L S D + Q + + CG T
Sbjct: 306 GNV-------AWDTDRTRHECETSVAAQVLRNSGHDVTCGSGQGSYAIPVKCGRRD---T 355
Query: 559 PSPCTN--GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
PCT PC LF+L DPCE NN+A ++ +L L ++ T + P+ +
Sbjct: 356 SKPCTPTLAPC-LFDLSEDPCEYNNVAEQHSEVVRRLLRKLAVYKATSWSPGN-VPEDPR 413
Query: 617 ADPKRFNDTWSPW 629
A P N W W
Sbjct: 414 AHPALHNGAWVSW 426
Score = 70.5 bits (171), Expect = 3e-09, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 50/82 (60%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR L + G Y T+L T+EA+++IE P++KPLFLYL+HLA H+GN L+ P++
Sbjct: 63 FRRGLLGGREFDGRYYTELMTEEAIRIIETHPLEKPLFLYLSHLAPHSGNLRDPLQVPKK 122
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+ + I NR YA + +
Sbjct: 123 FSDMYHDIGSRNRTIYAGMVSA 144
>gi|390336076|ref|XP_784530.2| PREDICTED: arylsulfatase J-like [Strongylocentrotus purpuratus]
Length = 545
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 162/516 (31%), Positives = 243/516 (47%), Gaps = 106/516 (20%)
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
RASL Y IHTG+Q I A+P +P+ E LP+ L+E GY+T +GKWH+GF+
Sbjct: 117 RASLC---YWIHTGLQHGVIRHAQPNCLPVDEVTLPQKLKENGYTTNMVGKWHIGFYLDA 173
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQY-----SRTVELNGHDMRRNLSTAWDTVGEY 246
TP RGF+S FG+L G +Y +LS + T L+GHD+ N + +D G+Y
Sbjct: 174 CTPTERGFDSFFGFLIGAEDHYTRMLSCEAYNANDGSTKTLSGHDLHANKTPVFDYEGQY 233
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+T LFT + V +IE KPLF+YLA+ A HA L+ P ++ ++ I D RRT
Sbjct: 234 STHLFTNKTVDVIERHDKTKPLFIYLAYQAVHAP-----LQVPDSYMDPYKNIADKTRRT 288
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YA MV +D+ +G V AL+ G+ +N+IIIF +DNG + G+N+P RG
Sbjct: 289 YAGMVSCMDEGIGNVTQALKDAGLYDNTIIIFSTDNGG---------HIDRGGNNWPLRG 339
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
K ++WEGG+ + SP + + V+ ++MH+SDWLPT+ G
Sbjct: 340 SKGSMWEGGIHGVGFVHSPLLPDAVKGTVNHELMHVSDWLPTIVAGIAG----------- 388
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR-----------T 473
+L P +DG + WSS+ S + +L NI+ KK+
Sbjct: 389 ----GALNGTKP------LDGYNVWSSIRGTASSPQTELLHNINPKKKRPENEKFDTSMK 438
Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
AA+R+ WK+ G + T+ FN + P
Sbjct: 439 AALRVGDWKIKTGANYDVTV----------------FNEWMPP---------------PE 467
Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
S +D + T + N +LFN+ DP EQ ++A P + +L
Sbjct: 468 SGLDTIYPTEKP------------------NQKVWLFNITADPLEQTDLAEKYPGVVDEL 509
Query: 594 YELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
L+ + + VP + DL +ADP +W PW
Sbjct: 510 VHKLEEYYKGSVPMRYPDEDL-EADPALHGGSWGPW 544
Score = 66.6 bits (161), Expect = 4e-08, Method: Compositional matrix adjust.
Identities = 40/99 (40%), Positives = 56/99 (56%), Gaps = 4/99 (4%)
Query: 108 NIDALAYNGIILNNMYAQPVCTPSRASLMT---GKYPIHTGMQGPPIWGAEPRGVPLTER 164
NI A A + + + AQ C + A + T G IHTG+Q I A+P +P+ E
Sbjct: 2 NIPAKAVLHLYADKIKAQ-RCILAAAIVGTPDFGHNMIHTGLQHGVIRHAQPNCLPMDEV 60
Query: 165 FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
LP+ L+E GY+T +GKWH+GF+ TP RGF+S F
Sbjct: 61 TLPQKLKENGYTTNMVGKWHIGFYLDACTPTERGFDSFF 99
Score = 58.2 bits (139), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/82 (36%), Positives = 45/82 (54%), Gaps = 5/82 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+ N + +D G+Y+T LFT + V +IE KPLF+YLA+ A HA L+ P
Sbjct: 219 LHANKTPVFDYEGQYSTHLFTNKTVDVIERHDKTKPLFIYLAYQAVHAP-----LQVPDS 273
Query: 61 TINQFQYITDPNRRTYAALTKS 82
++ ++ I D RRTYA +
Sbjct: 274 YMDPYKNIADKTRRTYAGMVSC 295
>gi|291227815|ref|XP_002733878.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 508
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 162/526 (30%), Positives = 257/526 (48%), Gaps = 77/526 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ + + + TPNID LA G+ L YAQP C PSR+ LM G+Y IHTG
Sbjct: 37 GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFDYRCK 96
Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G + + +P L+E GY+T IGKWHLG R E P +GF++ FGYL+ +
Sbjct: 97 DGKRSQLCMHPDTITMPMKLKENGYATHMIGKWHLGNIRWECLPNAKGFDTFFGYLSAIE 156
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H N HD RN A D G+Y+T LFTKEA +I++ +++P+F
Sbjct: 157 DYFTHYTP------AGANCHDFWRNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMF 210
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQY--ITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+YL++ A H L+ P + + R+TYAAM +D+ +G V S L++
Sbjct: 211 MYLSYQAVHGP-----LQVPPHYYDMYNDGPRVSTTRQTYAAMATCMDEGIGNVTSTLKQ 265
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+GM +N+++IF+SDNGA + G N+P RG K +++EGG++V + + SP I
Sbjct: 266 RGMWDNTVLIFVSDNGA---------QFHTGGRNWPLRGGKGSVFEGGIRVVSFVTSPLI 316
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
++ R+S ++H+ DW PTL AGG + L DG++
Sbjct: 317 ERPRRISEDIVHLMDWFPTLTHLAGGTVDDVML-----------------------DGVN 353
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVR-LDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
W ++ T S R +L++I R LD W NG D TR++ +
Sbjct: 354 VWDTISKGTLSPRRELLVHISATDGVTNYRGLDPW------YNNGMFD----ITRTSAII 403
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
++ + K Y S + + ++I KM P T+
Sbjct: 404 AGDWKLVTSQKRYDSSWAVPPEL---ANSIGKMSINE---------------PKSDTDKL 445
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRRTLVPQSHEQ 611
L++L D EQ+++++ RPDI +L E L KY + ++ ++ ++
Sbjct: 446 VLLYHLTEDKREQHDLSNLRPDIVFKLLEKLRKYQKHSVYCETSKR 491
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 7/80 (8%)
Query: 2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
R + A D G+Y+T LFTKEA +I++ +++P+F+YL++ A H L+ P
Sbjct: 174 RNHDEVADDYKGQYSTHLFTKEAQDIIKNHDINQPMFMYLSYQAVHGP-----LQVPPHY 228
Query: 62 INQFQY--ITDPNRRTYAAL 79
+ + R+TYAA+
Sbjct: 229 YDMYNDGPRVSTTRQTYAAM 248
>gi|291232045|ref|XP_002735970.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 500
Score = 235 bits (600), Expect = 5e-59, Method: Compositional matrix adjust.
Identities = 144/401 (35%), Positives = 217/401 (54%), Gaps = 33/401 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ TP ID LA +G+ L N Y C PSR LMTG++ I G+Q
Sbjct: 34 GWNDVGYNNP-VFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQHDD- 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G PR +PL E + E L+ +GYST +GKWH GF+ P RGF++ FG++ I
Sbjct: 92 YGFHPRSLPLNETTIAEPLKHVGYSTHIVGKWHCGFYSDNCLPHNRGFDTFFGFVGAGID 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y H SD ++ H++R+N A +G+Y+T +F E +I +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL+ A HA LE P + Q++ I D +RRTYAAM +D+++ + AL+ K
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSII+F SDNG V + G+N+P RG K+T WEGGV+ ++ + +
Sbjct: 260 MLKNSIIVFTSDNGGALVN--------SAGNNWPLRGGKHTSWEGGVRAIGFVYGDLLPR 311
Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL- 446
+ R + +MHI+DW PTL T AGG+ + + G+DQW+ + S R+ + L
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMITGKDKSGRDDVLITLQ 370
Query: 447 -DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
D + T +N+ + AA+++ WKL+ G
Sbjct: 371 EDYKAGAHYRTGIYKNNAF----DITTHAAIKMGKWKLLTG 407
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
+G+Y+T +F E +I +KPLFLYL+ A HA LE P + Q++ I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231
Query: 71 PNRRTYAALTKST 83
+RRTYAA+T
Sbjct: 232 EDRRTYAAMTSCV 244
>gi|291221683|ref|XP_002730830.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 235 bits (599), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 166/580 (28%), Positives = 262/580 (45%), Gaps = 116/580 (20%)
Query: 67 YITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
YI+ P + K+ +++++ GWND+ +H +I P + LA +G+I N
Sbjct: 15 YISKP------CIAKAEQPHIVLIFVDDLGWNDVEWHNP-DIKMPVLSKLAADGVIFNQS 67
Query: 123 YAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK 182
Y P CTPSRA++MTG YP TG Q I+ P GVPL + LPE L+E+GY T +GK
Sbjct: 68 YTHPTCTPSRAAMMTGMYPFRTGNQHQMIFNLHPSGVPLEFKLLPEKLKEVGYFTHMVGK 127
Query: 183 WHLGFFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTA 239
WHLGF + EY P RGF+SH+G + +YD +LS G+D R N+
Sbjct: 128 WHLGFCKEEYLPTSRGFDSHYGLWTLGVGHYDKMNGVLSPS-------EGYDFRDNIGVV 180
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+ EY T + + A +I PLFL KHLE P + I
Sbjct: 181 PKS-DEYLTLMLAERAEHIINGHYNKHPLFLQFT-----MDIPAKHLEIPDTFEALYCDI 234
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
D R + + +D +G V+ AL+ + M ++++++F+ DNGA + G
Sbjct: 235 EDERTRKFYGKLSLMDHVIGRVVDALKAREMWDDTVLMFIGDNGALASQA---------G 285
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SNYP+RG+ TL+EG +VP+++ +++ V+ ++ ++ D T+ AG ++ +
Sbjct: 286 SNYPFRGIAGTLFEGATRVPSLIHGSMLRETGYVNNELWNLVDLHRTILALAGAESGSM- 344
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT--AAVR 477
+DG++ W + S R + NID+ + AA+R
Sbjct: 345 ---------------------CELDGMNMWDTFSKGKKSPREEHVYNIDDDPISPGAAIR 383
Query: 478 LDSWKLVLGT-------QENGTMDGY--YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQN 528
+ +KL+ G ++ DG+ YG ++ +P
Sbjct: 384 IGDYKLITGHPDLLYPYRDISLTDGWYNYGDPPASGLP---------------------- 421
Query: 529 IFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD 588
ST Q+A P P+ YLFN+ DP E+NNIA PD
Sbjct: 422 ------------STDQEANA-----PPPLNVE-------YLFNVVVDPSEKNNIADQHPD 457
Query: 589 ISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
I L + L HR+ L+P ++ +L D + WSP
Sbjct: 458 IVQNLRDRLDEHRKLLIPPVNQTTELA-GDAANYCGVWSP 496
>gi|449278684|gb|EMC86475.1| Arylsulfatase B, partial [Columba livia]
Length = 431
Score = 235 bits (599), Expect = 6e-59, Method: Compositional matrix adjust.
Identities = 164/504 (32%), Positives = 241/504 (47%), Gaps = 90/504 (17%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q IW +P +PL E+ LPE L+E GY T +GKWHLG +++E P +RGF++
Sbjct: 1 IHTGLQHQIIWPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMYKKECLPTHRGFDT 60
Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+FGYL G YY H +L ++ + D R A Y+T+LFT+ A+ L
Sbjct: 61 YFGYLLGSEDYYSHDRCVLIK--AKNITRCALDFRDGEEVATGFKNMYSTNLFTERAIDL 118
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I +KPLFLYLA + H + LE P+E + + I D RR YA MV +D++V
Sbjct: 119 IAHHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRRHYAGMVTLMDEAV 173
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G + AL+R G+ N++++F +DNG T+ G+N+P RG K TLWEGGV+
Sbjct: 174 GNLTDALKRYGLWNNTVLVFSTDNGGQTLAG---------GNNWPLRGRKWTLWEGGVRG 224
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++Q S +++HISDWLPTL AGG T+ +DG D W ++ PS
Sbjct: 225 VGFVASPLLKQKGVESHELIHISDWLPTLVHLAGGHTNGTK-PLDGFDVWKAISEGRPSP 283
Query: 439 RNS---NID----------GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
R NID G+D +SL + + L NI AA+R WKL
Sbjct: 284 RVELLHNIDPMFVDDSPCLGIDNRTSLPQSNSFPWDDTLFNI---SMHAAIRHGKWKL-- 338
Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQ 545
+ GY G + P L + ++S
Sbjct: 339 -------LTGYPGCSHWFPPPSLFSESKIQS----------------------------- 362
Query: 546 ATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV 605
P P T +LF++ +DP E+N+++ P + +L L+Y+ + V
Sbjct: 363 -------------PDPPTKR-LWLFDIVHDPEEKNDLSEKYPHVVKKLLSRLQYYYKRSV 408
Query: 606 PQSHEQPDLVQADPKRFNDTWSPW 629
P + D DP+ W PW
Sbjct: 409 PVFYPDED-PSCDPEA-TGAWGPW 430
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 38/66 (57%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T+LFT+ A+ LI +KPLFLYLA + H + LE P+E + + I D RR
Sbjct: 106 YSTNLFTERAIDLIAHHKTEKPLFLYLAFQSVH-----EPLEVPEEYMKPYSSIKDAKRR 160
Query: 75 TYAALT 80
YA +
Sbjct: 161 HYAGMV 166
>gi|291236973|ref|XP_002738412.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 843
Score = 235 bits (599), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 148/401 (36%), Positives = 218/401 (54%), Gaps = 33/401 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ TP ID LA +G+ L N Y C PSR LMTG++ I G+Q
Sbjct: 377 GWNDVGYNNP-VFKTPTIDRLAGSGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIQRHG- 434
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G PR +PL E + + L+++GYST IGKWH GF+ P RGF++ FG++ I
Sbjct: 435 FGYHPRSLPLDETTIAQPLKQVGYSTHIIGKWHCGFYSDNCLPHNRGFDTFFGFVGAGIE 494
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y H SD ++ H++R+N A +G+Y+T ++ E +I +KP FL
Sbjct: 495 HYTH--SDHFNHM-----HNLRKNDDCIAKQYIGKYSTTIYANEGKNIINAHDQNKPFFL 547
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL+ A H LE P + Q++ I D +RRTYAAM +D+++ + AL+ K
Sbjct: 548 YLSFSAVHTP-----LEVPSSYLKQYESTIYDEDRRTYAAMTSCVDEAIANITEALEEKD 602
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSII+F SDNG VE R+ GSN+P RG K+T+WEGGVK ++ +
Sbjct: 603 MLKNSIIVFTSDNGG-AVE-------RSAGSNWPLRGGKHTIWEGGVKAVGFVYGELLPH 654
Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL- 446
+ R + +MHI+DW PTL T AG + + + G+DQW + SRR+ + L
Sbjct: 655 HSRGTENTGLMHITDWFPTLVTLAGAN-PHVGKKLYGVDQWKMVTGEGKSRRDDVLIELV 713
Query: 447 -DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
D +S T +N D R AA+++ WKL+ G
Sbjct: 714 KDNKASTRYRTGIFKNHPF---DITTR-AAMKMGKWKLLTG 750
Score = 48.9 bits (115), Expect = 0.008, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 39/73 (53%), Gaps = 6/73 (8%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
+G+Y+T ++ E +I +KP FLYL+ A H LE P + Q++ I D
Sbjct: 520 IGKYSTTIYANEGKNIINAHDQNKPFFLYLSFSAVHTP-----LEVPSSYLKQYESTIYD 574
Query: 71 PNRRTYAALTKST 83
+RRTYAA+T
Sbjct: 575 EDRRTYAAMTSCV 587
>gi|167515556|ref|XP_001742119.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163778743|gb|EDQ92357.1| predicted protein [Monosiga brevicollis MX1]
Length = 339
Score = 235 bits (599), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 130/316 (41%), Positives = 182/316 (57%), Gaps = 21/316 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+S HGS +IPTP+IDA+A++G+ L N + QPVCTP+R++ ++G++ IHTG+ P
Sbjct: 44 GWNDVSLHGSPQIPTPHIDAIAHSGVHLTNYHVQPVCTPTRSTFLSGRHVIHTGIYMPFA 103
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G R + L+ LP YL++LGY T A+GKWHLG + P RGF+ + GY +G
Sbjct: 104 QGTALR-LNLSYTLLPAYLKKLGYRTAAVGKWHLGQNVEKALPTGRGFDEYLGYWSGAED 162
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY H D + G+D + A Y+T +F + AV I + ++PLFLY
Sbjct: 163 YYTH---DTHG------GYDFQDGTECAIKYNNTYSTYIFAERAVNTILEADPEQPLFLY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
A H LEAP E + +F +I + R+ AAM LDD+VG + AL+R +
Sbjct: 214 TAFQNVH-----WPLEAPAEYVARFSHIPNSERQYVAAMTSILDDAVGNITDALKRSRIA 268
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+I+IF SDNG P + SN NYP RG KNTLW GG +V ++ I+
Sbjct: 269 DNTILIFTSDNGGPVHDENTESN------NYPLRGGKNTLWNGGTQVVGMIAGKGIENPG 322
Query: 392 RVSLQMMHISDWLPTL 407
MMH SDWLP+L
Sbjct: 323 TDCHGMMHASDWLPSL 338
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 36/67 (53%), Gaps = 5/67 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T +F + AV I + ++PLFLY A H LEAP E + +F +I + R+
Sbjct: 188 YSTYIFAERAVNTILEADPEQPLFLYTAFQNVH-----WPLEAPAEYVARFSHIPNSERQ 242
Query: 75 TYAALTK 81
AA+T
Sbjct: 243 YVAAMTS 249
>gi|291227811|ref|XP_002733876.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 539
Score = 234 bits (598), Expect = 7e-59, Method: Compositional matrix adjust.
Identities = 164/527 (31%), Positives = 260/527 (49%), Gaps = 78/527 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ + + + TPNID LA G+ L YAQP C PSR+ LM G+Y IHTG
Sbjct: 72 GYFDVGYRNGSIVKTPNIDKLAAEGVKLERHYAQPSCMPSRSCLMMGRYQIHTGFNYKCT 131
Query: 152 WGAEPRGVPLTERF-LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ + + +P L+E GY+T +GKWHLG R E P +GF++ FGY
Sbjct: 132 DGSGSQLCMHPDTITIPMKLKENGYATHMVGKWHLGNIRWECLPNAKGFDTFFGYHGASE 191
Query: 211 SYYDHIL-SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY H + + R + N D+ A + G+Y+T +FT EA+ +IE+ V KP+F
Sbjct: 192 DYYTHFSPAGRECRDLWRNRDDV------AQEYYGQYSTHIFTNEALNIIENHDVSKPMF 245
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL + A H L+ P++ N ++ D RR YAAM +D+ +G V +AL+++G
Sbjct: 246 LYLPYQAVHGP-----LQVPEKYYNMYERPLDDIRRKYAAMATCMDEGIGNVTNALKQRG 300
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
M +N++ IF+SDNGA + G N+P RG K +++EGG++V + + SP I++
Sbjct: 301 MWDNTVFIFVSDNGA---------QFPTGGRNWPLRGGKGSVFEGGIRVVSFVTSPLIER 351
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R S +M+H++DWLPTL AGG + L DG++ W ++ + + R + +
Sbjct: 352 PQRSSNEMIHLTDWLPTLTHLAGGTVDDVEL--DGVNMWDTISKGSATPRKELLVQISPP 409
Query: 450 SSL--LLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
+ + + P +N + + +A+ + +WKLV T +GYYG + VP
Sbjct: 410 HGVRDIGHEPWYKNGMF----DVTTNSAIIVGNWKLV-------TNEGYYG---GHVVP- 454
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
+L I GA P P
Sbjct: 455 ---------------PELESQI---------------------GAERIP-PPESKRGRLV 477
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDL 614
+LF++G D E++ +++ RPDI +L E L+ ++ V H L
Sbjct: 478 WLFHIGEDTKEKHEMSNERPDIVFRLLERLRGYQSHAVFCRHSYASL 524
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 44/79 (55%), Gaps = 5/79 (6%)
Query: 2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
R A + G+Y+T +FT EA+ +IE+ V KP+FLYL + A H L+ P++
Sbjct: 209 RNRDDVAQEYYGQYSTHIFTNEALNIIENHDVSKPMFLYLPYQAVHGP-----LQVPEKY 263
Query: 62 INQFQYITDPNRRTYAALT 80
N ++ D RR YAA+
Sbjct: 264 YNMYERPLDDIRRKYAAMA 282
>gi|327291280|ref|XP_003230349.1| PREDICTED: arylsulfatase J-like, partial [Anolis carolinensis]
Length = 471
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 158/498 (31%), Positives = 242/498 (48%), Gaps = 73/498 (14%)
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L++ +Y IHTG+Q I +P +PL LP+ L+E GYST +GKWHLGF+R+E P
Sbjct: 3 LVSSRYQIHTGLQHSVIRPTQPNCLPLDNATLPQKLKEAGYSTHMVGKWHLGFYRKECMP 62
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTK 253
RGF++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T+
Sbjct: 63 TQRGFDTFFGSLLGSGDYYTHYKCD----SPRMCGYDLYENDNAAWDHDNGIYSTQMYTQ 118
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
+ Q++ KP+FLY+A+ A H+ L+AP +++ I + NRR YAAM+
Sbjct: 119 KVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPGTYYERYRSINNINRRRYAAMLAC 173
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLW 372
LD+++ V AL+R G NS+II+ SDNG PT G+N+P RG K T W
Sbjct: 174 LDEAINNVTLALKRYGYYNNSVIIYSSDNGGQPTAG----------GNNWPLRGSKGTYW 223
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG++ + SP ++ V +++HI+DW PTL T A G L +DG D W ++
Sbjct: 224 EGGIRAVGFVHSPLLKNKGCVCKELIHITDWFPTLITLAEGQIDEDTL-LDGYDVWETI- 281
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
+ SRR+ +D L + + + I +A+R++ WKL+ G
Sbjct: 282 --SESRRSPRMDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVNHWKLLTGNP---- 335
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
GY S+ VP +F SN+ R ++ + G
Sbjct: 336 --GY-----SDWVPPQSF-----------------------SNVGPSRWHNERVSWTAGK 365
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQP 612
+LFN+ DP E+ +++S PD+ QL L +T VP +
Sbjct: 366 T-------------VWLFNITADPYERVDLSSKYPDVVKQLLRRLSQFNKTAVPVRYPPK 412
Query: 613 DLVQADPKRFNDTWSPWI 630
D +++P W PW
Sbjct: 413 D-PRSNPNFNGGVWGPWF 429
Score = 48.9 bits (115), Expect = 0.007, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 96 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 150
Query: 60 ETINQFQYITDPNRRTYAAL 79
+++ I + NRR YAA+
Sbjct: 151 TYYERYRSINNINRRRYAAM 170
>gi|403256717|ref|XP_003921000.1| PREDICTED: arylsulfatase B [Saimiri boliviensis boliviensis]
Length = 551
Score = 234 bits (597), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 135/358 (37%), Positives = 200/358 (55%), Gaps = 32/358 (8%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
I TG+Q IW +P VPL E+ LP++L+E GY+T +GKWHLG +R+E P RGF++
Sbjct: 123 IRTGLQHQIIWPCQPSCVPLDEKLLPQFLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 182
Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+FGYL G YY H L D + V D R A Y+T++FTK A L
Sbjct: 183 YFGYLLGSEDYYSHERCTLIDALN--VTRCALDFRDGEEVATGYKNMYSTNIFTKRAATL 240
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV +D++V
Sbjct: 241 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 295
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ +G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+
Sbjct: 296 GNVTAALKSRGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 346
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++Q + +++HISDWLPTL AGG T+ +DG D W ++ +PS
Sbjct: 347 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 405
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
R + +D + + ++P RNS+ + D+ AA+R +WKL+ G
Sbjct: 406 RVELLHNID--PNFVDSSPCPRNSMALAKDDSSLPEHSAFNTSVHAAIRHGNWKLLTG 461
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 228 YSTNIFTKRAATLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 282
Query: 75 TYAALT 80
YA +
Sbjct: 283 HYAGMV 288
>gi|260816811|ref|XP_002603281.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
gi|229288599|gb|EEN59292.1| hypothetical protein BRAFLDRAFT_226338 [Branchiostoma floridae]
Length = 357
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 141/405 (34%), Positives = 204/405 (50%), Gaps = 52/405 (12%)
Query: 77 AALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
AA T + LLIV GW+D+S++ N + PN+ LA G+I N Y+Q +CTPSR
Sbjct: 1 AAQTTTKPHILLIVADDLGWSDVSWNNPN-VVMPNLHTLATTGVIFNQTYSQRLCTPSRT 59
Query: 134 SLMTGKYPIHTGMQ-GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+L+TGK+P GMQ ++ G+PL E LP+ L++LGY+T +GKWHLG + EY
Sbjct: 60 ALLTGKFPYRLGMQVQKSMFEKNSHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEY 119
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
TP RGF+S +G+ +G YY H+ G D ++ D G Y+T+ F
Sbjct: 120 TPTERGFDSFYGFHHGGEDYYTHMSE---------RGLDFWDGRTSVSDRNGVYSTESFA 170
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+ A +I + PLFLYL + H + + P + F I D NR++ M
Sbjct: 171 RRAENIISQHDPNTPLFLYLPFQSVHTPH-----QVPSSYLQTFSTIQDDNRKSILGMAT 225
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
LDD++ V AL+ KG+ +N++ IFMSDNG NY SN+P RG K TLW
Sbjct: 226 ALDDAIKRVTDALKGKGLWDNTLTIFMSDNGG---------NYVQGQSNWPLRGAKGTLW 276
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG +VPA + +++ MMH D LPTL + AGG
Sbjct: 277 EGGTRVPAFVHGNMLERTGYTYHGMMHGVDILPTLVSVAGG------------------- 317
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
++ +DG + W S+ S R + +ID + A+R
Sbjct: 318 -----TEDAGLDGKNMWQSISTGADSPRTEFVYDIDSRGERYAIR 357
>gi|168701793|ref|ZP_02734070.1| twin-arginine translocation pathway signal precursor [Gemmata
obscuriglobus UQM 2246]
Length = 459
Score = 234 bits (596), Expect = 1e-58, Method: Compositional matrix adjust.
Identities = 147/422 (34%), Positives = 212/422 (50%), Gaps = 54/422 (12%)
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
R A S L G D F G EI TPNID +A G L+ YAQPVC+P+RA
Sbjct: 18 RAEAPAQPSVVFLLADDLGREDCGFMGGKEIKTPNIDKIAAAGATLDAFYAQPVCSPTRA 77
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+ MTG+YP+ G+Q + G+PL ER + + L++ GY+T IGKWHLG F EY
Sbjct: 78 AFMTGRYPMRHGLQVGVVRPWAQYGLPLDERTVAQGLKDAGYTTAVIGKWHLGHFAPEYL 137
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
P RGF+ +G+ NG + Y+ HI G D R+ D Y+T L K
Sbjct: 138 PTKRGFDHQYGHYNGALDYFTHIRD---------GGFDWHRDDKVNSDE--GYSTHLVAK 186
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
+AVQ ++ KP FLY+ A HA + + P+E + + ++ R+ YA M+
Sbjct: 187 DAVQFVQTHAGKKPFFLYVPFNAVHAPH-----QVPEEYLKPYGHLKG-ERQKYAGMLAA 240
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
+D++VG +++A+++ G+ +N++ +F+SDNG P SN +R K TL+E
Sbjct: 241 MDEAVGQIVAAVEKAGVRKNTLFVFLSDNGGPQPGVVT--------SNGKFRAGKGTLYE 292
Query: 374 GGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
GGV+V A + I+ ++ + +H+ DW PTL GG T L +DGLD W L
Sbjct: 293 GGVRVAACAAFDGHIRPGTTIT-EPLHVVDWYPTLLK-LGGPTGEQQLPVDGLDIWPVLT 350
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
PS ++LLNT T AVR WKLV+ +NGT
Sbjct: 351 GGKPSPHG----------AILLNT-------------TPNTGAVRAGDWKLVV---KNGT 384
Query: 493 MD 494
D
Sbjct: 385 DD 386
>gi|154248610|ref|YP_001419568.1| sulfatase [Xanthobacter autotrophicus Py2]
gi|154162695|gb|ABS69911.1| sulfatase [Xanthobacter autotrophicus Py2]
Length = 491
Score = 234 bits (596), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 147/400 (36%), Positives = 210/400 (52%), Gaps = 54/400 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ FHGS+ I TPN+D LA G L Y QP CTP+RA+ +TG+YP+H G+Q I
Sbjct: 60 GFADVGFHGSD-IKTPNLDHLAAQGARLGQFYTQPFCTPTRAAFLTGRYPLHYGLQVGAI 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E LP+ L+++GY T +GKWHLG +++ P RGF+S +G L G I
Sbjct: 119 PSGAKYGLATDEFLLPQALKDVGYRTALVGKWHLGHADQKFWPRQRGFDSFYGPLVGEID 178
Query: 212 YYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
++ H ++D Y HD + +D T+LF KEAV+LI PL
Sbjct: 179 HFKHEAHGVTDWY--------HDNTQVKEEGYD------TELFGKEAVRLIAAHDPKTPL 224
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA A H +APQ ++Q+ +I P RR YAAM+ +DD +G V++AL +
Sbjct: 225 FLYLAFTAPH-----TPFQAPQSYLDQYAHIAAPQRRAYAAMITAMDDQIGHVVAALTSR 279
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL-WS 384
GM EN++I+F SDNG + SN PYR K +L+EGG +V A+ W
Sbjct: 280 GMRENTLIVFHSDNGGTRSKMFAGEGAVAGDLPASNAPYRDGKGSLYEGGTRVVALANWP 339
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
+I P + +MH+ D LPTL AG +SL + P +D
Sbjct: 340 GRIA--PGAAEGVMHVVDMLPTLAKLAG----------------ASLAKSKP------LD 375
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
G+D W +L + R ++ N++ + AVR WKLV
Sbjct: 376 GVDVWPALAAGQ-AGRAGIVYNVEPTQ--GAVRDGRWKLV 412
Score = 56.2 bits (134), Expect = 5e-05, Method: Compositional matrix adjust.
Identities = 30/68 (44%), Positives = 39/68 (57%), Gaps = 5/68 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T+LF KEAV+LI PLFLYLA A H +APQ ++Q+ +I P RR
Sbjct: 202 YDTELFGKEAVRLIAAHDPKTPLFLYLAFTAPH-----TPFQAPQSYLDQYAHIAAPQRR 256
Query: 75 TYAALTKS 82
YAA+ +
Sbjct: 257 AYAAMITA 264
>gi|300433302|gb|ADK13094.1| arylsulfatase [Dicathais orbita]
Length = 571
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 165/540 (30%), Positives = 256/540 (47%), Gaps = 79/540 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+S+H ++ TPN+ +A NG+IL Y+Q CTPSRAS MTG YP G+Q +
Sbjct: 43 GYNDVSWHNP-QVLTPNLGKMAKNGVILTESYSQAACTPSRASYMTGYYPFRIGVQNSVV 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
VPL FLP+ L+E GY + +GKWHLG RR+ TP RGF++ G LNG
Sbjct: 102 REGMEDYVPLDVDFLPKRLKEAGYVSHLVGKWHLGHCRRDVTPPGRGFDTFLGLLNGYND 161
Query: 212 YYDHILS-----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
YY + + + +D N + +Y TD+FT A++LI+ Q D
Sbjct: 162 YYTKKIRAIASHEDFDPNAPGTIYDFFSNYTLQPSPETDYTTDIFTNRAIELIQ-QSKDT 220
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
P FL L + A H KH P + + + D NRR Y + + +D +G V+ AL+
Sbjct: 221 PFFLALHYTAPH-WPLQKH---PGFSDKDYPDVKDLNRRIYLSSITAMDTGIGKVVDALK 276
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+ +L+N++I+F SDNG T +N P R K TL+EGGVKVP++ + P
Sbjct: 277 QHNLLDNTLIVFQSDNGGDT---------HFSANNGPLRDRKTTLYEGGVKVPSMAYGPG 327
Query: 387 IQQN-PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ N PR S + HISDW T+ +AAG + N+DG+DQW+ LL S+
Sbjct: 328 LLTNTPRTSDDLFHISDWFTTILSAAGLEAG----NVDGIDQWN--LLRYGSK------- 374
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
S R + + NI AA+R +KL+ G + + + R V
Sbjct: 375 ------------SPRQTFIYNI--HGDNAAIRAGRYKLIEGNPK-----AFQARRRRRSV 415
Query: 506 PLLNFN---------------AIVESKTYQSLQQLSQNIFLPISNIDKMRSTR------Q 544
+L+ + V+ + Y+S + F ++ + K+ R
Sbjct: 416 DVLDQDLAVHGSALTRHKRDEEFVDRRPYESDEDKVMKAFSSLTGLKKLEQIRPTGDWTM 475
Query: 545 QATIHCGANPAP---MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
+ + P + C C L+NL DP E+ ++ +P + ++ LL+ ++
Sbjct: 476 DSILSVLEKIRPDQVKVQADCHK--CQLYNLTADPSEKKDLVKEKPAVVLFMHTLLELYK 533
>gi|313247306|emb|CBY15582.1| unnamed protein product [Oikopleura dioica]
Length = 486
Score = 233 bits (595), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 172/545 (31%), Positives = 252/545 (46%), Gaps = 76/545 (13%)
Query: 98 FHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPR 157
+ S+ I TPNIDA++ G+ L N Y QP+CTPSR+ L++G+Y IHTG+Q IW P
Sbjct: 5 YKSSDIIKTPNIDAISAAGVRLENYYVQPICTPSRSQLLSGRYQIHTGLQHQLIWMGMPS 64
Query: 158 GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHIL 217
+PL LPE +R GY T A GKWHLG+ + TP RGF + GYL G YY
Sbjct: 65 ALPLDTELLPETMRNCGYHTMAAGKWHLGYAKTANTPWGRGFHNFTGYLGGSEDYYKKTR 124
Query: 218 S-DQYSRTVELN--GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAH 274
D + ++ N G + A EY+ + ++A I+ + DKP FLYL
Sbjct: 125 CIDHHKCGIDQNTDGEIFGERVYNA--DASEYSAFKYIRQAKNYIDGRDKDKPFFLYLPM 182
Query: 275 LAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
+ HA LEAP E I ++ I D R+ YA MV +D+ +G + S L+ G+ N
Sbjct: 183 QSVHA-----PLEAPAEYIAKYDGIIEDKPRKIYAGMVSVMDEGIGNITSHLKESGLWNN 237
Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK----VPAILWSPQIQQ 389
+I IF +DNG G+N P RG K WEGG+ V + + +
Sbjct: 238 TIFIFSTDNGGQAYV---------GGNNLPLRGNKAGYWEGGIHGNGFVAGGYFEDRRKG 288
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+ V+ +++HISDW PT+ A + ++ ++DG+ QW LL S R+ + +D+
Sbjct: 289 SELVNNELIHISDWYPTIMEATSCEKAQNAPDLDGMSQWKMLLDLEKSPRDEILHNIDE- 347
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
++ + +R NI + AA+R +WKL+ G GY +
Sbjct: 348 MTVAKGSDNRTFKSDFNISVQ---AAIRWKNWKLLTGDP------GY-----------PD 387
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
F + + + L IF P + + P P P P T L
Sbjct: 388 FPITIPPQGNKKL------IFTPENGLQY---------------PIPPRPEPLTR-LVRL 425
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND----T 625
F++ DP E N I+ DI L + L Y T VP + DPK F +
Sbjct: 426 FDVIKDPTESNEISDYHEDIVEFLLDRLSYWNSTQVPI-----NFPAVDPKSFPEYHQGF 480
Query: 626 WSPWI 630
W PW+
Sbjct: 481 WKPWL 485
>gi|260786699|ref|XP_002588394.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
gi|229273556|gb|EEN44405.1| hypothetical protein BRAFLDRAFT_198899 [Branchiostoma floridae]
Length = 353
Score = 233 bits (594), Expect = 2e-58, Method: Compositional matrix adjust.
Identities = 143/394 (36%), Positives = 198/394 (50%), Gaps = 52/394 (13%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LLIV GW+D+S++ N + PN+ LA G+I N Y Q +CTPSR +L+TGK+P
Sbjct: 4 LLIVADDLGWSDVSWNNPNVV-MPNLHTLATTGVIFNQTYCQRLCTPSRTALLTGKFPYR 62
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
GMQ PI + G+PL E LP+ L++LGY+T IGKWHLG + EYTP RGF+S +
Sbjct: 63 LGMQ-RPIRHKKAHGLPLDEELLPQKLKKLGYATHMIGKWHLGCCKWEYTPTERGFDSFY 121
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
GY G YY H +SD G D + D G Y+T+ F A +I
Sbjct: 122 GYHRGSQDYYTH-MSD--------GGLDFWEGKTAISDQNGVYSTESFATRAENIISQHD 172
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+ PLFLYL H + + P + F I D NR++ MV LDD++ V
Sbjct: 173 PNTPLFLYLPLQPVHTPH-----QVPSSYLQTFSTIQDHNRKSILGMVAALDDAIKRVTD 227
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
ALQ KG+ N++ IFMSDNG +Y + SN+P RG K T+WEGG +VPA
Sbjct: 228 ALQGKGLWNNTLTIFMSDNGG---------DYLDGQSNWPLRGAKGTVWEGGTRVPAFAH 278
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
+++ MMH D LPTL + AGG ++ +
Sbjct: 279 GNMLERTGYTYHGMMHGVDILPTLVSVAGG------------------------TEDAGL 314
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
DG + W S+ S R + +ID ++ +R
Sbjct: 315 DGKNMWQSISTGAESPRTEFVYDIDSRQDRYGIR 348
>gi|291238558|ref|XP_002739195.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 495
Score = 233 bits (594), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 142/399 (35%), Positives = 214/399 (53%), Gaps = 29/399 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ TP ID LA NG+ L N Y C PSR LMTG++ I G+
Sbjct: 34 GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPNDG- 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G PR +PL E + E L+ GYS +GKWH G++ P RGF++ FG++ I
Sbjct: 92 FGYHPRSLPLDETTIAEPLKHAGYSNHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGID 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y H SD ++ H++R+N A +G+Y+T +F E +I +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL+ A HA LE P + Q++ I D +RRTYAAM +D+++ + AL+ K
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSII+F SDNG V + G+N+P RG K+T WEGGV+ ++ + +
Sbjct: 260 MLKNSIIVFTSDNGGALV--------NSAGNNWPLRGGKHTSWEGGVRAVGFVYGDLLPR 311
Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ R + +MHI+DW PTL T AGG+ + + G+DQW+ + S R+ + L+
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMITGEDKSGRDDVLITLE 370
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+ + P R N + AA+++ WKL+ G
Sbjct: 371 EDNK--AGAPYRTGIYKNNAFDITTHAAIKMGKWKLLTG 407
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
+G+Y+T +F E +I +KPLFLYL+ A HA LE P + Q++ I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231
Query: 71 PNRRTYAALTKST 83
+RRTYAA+T
Sbjct: 232 EDRRTYAAMTSCV 244
>gi|291227813|ref|XP_002733877.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 490
Score = 233 bits (593), Expect = 3e-58, Method: Compositional matrix adjust.
Identities = 144/421 (34%), Positives = 218/421 (51%), Gaps = 57/421 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ + + I TPNID LA G+ L YAQP C PSRA LM G+Y IHTG + +
Sbjct: 23 GYFDVGYRNRSVIKTPNIDKLAAEGVKLERHYAQPSCLPSRACLMMGRYQIHTGYRDECM 82
Query: 152 WGAE-PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
R + LP +++ GY T IGKWHLG + P +GF+++FGY
Sbjct: 83 NDTRYQRCMNHDIVTLPMKMKQNGYVTHMIGKWHLGNNNWDCLPNAKGFDTYFGYNAAAE 142
Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY H+LS + N D+ R+ + A +G+Y+T +FT+EAV +IE+ + +P+F
Sbjct: 143 DYYTHMLSGRQ------NCSDLWRDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQPMF 196
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
+YLAH A HA L+ PQE + ++ + RR YAAMV +D+ +G V AL+ G
Sbjct: 197 MYLAHQAVHAP-----LQVPQEYYSLYETTLNDVRRKYAAMVTCMDEGIGNVTEALKESG 251
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
M +N++++F+SDNG ++ RNW P RG K +++EGG++ + + S I++
Sbjct: 252 MWDNTVLVFVSDNGGEL-----SAGGRNW----PLRGGKGSVFEGGIRTVSFVTSSLIER 302
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
R+S M+HI+DW PTL AGG ID + +DG++ W
Sbjct: 303 PQRISNAMIHITDWFPTLTHLAGG-------TID----------------DEMVDGINMW 339
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK------------LVLGTQENGTMDGYY 497
++ + R +LI I R V +D W L +G + T + YY
Sbjct: 340 DTISTGSVEPRTELLIKISPPARITEVGMDPWHKYDVFDAWLNSALFVGNWKLVTNEAYY 399
Query: 498 G 498
G
Sbjct: 400 G 400
Score = 62.0 bits (149), Expect = 1e-06, Method: Compositional matrix adjust.
Identities = 30/79 (37%), Positives = 48/79 (60%), Gaps = 5/79 (6%)
Query: 2 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 61
R + A +G+Y+T +FT+EAV +IE+ + +P+F+YLAH A HA L+ PQE
Sbjct: 160 RDRMDVADKYIGQYSTRIFTEEAVNIIENHDISQPMFMYLAHQAVHAP-----LQVPQEY 214
Query: 62 INQFQYITDPNRRTYAALT 80
+ ++ + RR YAA+
Sbjct: 215 YSLYETTLNDVRRKYAAMV 233
>gi|443321855|ref|ZP_21050894.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788399|gb|ELR98093.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 469
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 148/411 (36%), Positives = 220/411 (53%), Gaps = 61/411 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPN+D LA +G+ L Y + VCTP+RA+ +TG++P GM I
Sbjct: 46 GWNDVGFHGS-EIKTPNLDKLAASGVRLERFYVKSVCTPTRAAFLTGRHPFRYGMSTGVI 104
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ G+PL E+ + E L+E GY T +GKWHLG ++ + P RGF+ H+G+ G I
Sbjct: 105 KPWDKVGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESFLPTSRGFDYHYGHYLGGID 164
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD-KPLFL 270
Y+ H +D + L+ H R +L YATDL +EAV+LI + + +PLFL
Sbjct: 165 YFTH--NDDFLGA--LDWHRNRIHLKEEG-----YATDLIGQEAVKLINNHNYEQQPLFL 215
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A A H L A E I + I D RR +AA V+ +D+++G +I +L+ + +
Sbjct: 216 YIAFNAPHTP-----LHAKTEDIEDYLTIDDEKRRVFAAQVQSMDEAIGKIIQSLENQQV 270
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQ 389
+N+ + F+SDNG + N G N P RG KN+L+EGGV+VPAI+ + P++
Sbjct: 271 CDNTFVFFVSDNGGSVMR-------ANRGDNRPLRGGKNSLYEGGVRVPAIVSYPPKLSA 323
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
N ++ Q+ I D PTL AG S +DGLD +
Sbjct: 324 NQEIN-QIFSIVDLYPTLAKLAG----------------------VESVEGKQLDGLDIF 360
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
+ SRR+ +LI +AA+ D++KLVL +G YG++
Sbjct: 361 AK-----HSRRDELLIQY-RSSTSAAIIKDNYKLVL--------NGGYGES 397
Score = 45.1 bits (105), Expect = 0.10, Method: Compositional matrix adjust.
Identities = 29/69 (42%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
YATDL +EAV+LI + + +PLFLY+A A H L A E I + I D R
Sbjct: 190 YATDLIGQEAVKLINNHNYEQQPLFLYIAFNAPHTP-----LHAKTEDIEDYLTIDDEKR 244
Query: 74 RTYAALTKS 82
R +AA +S
Sbjct: 245 RVFAAQVQS 253
>gi|410446533|ref|ZP_11300636.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980205|gb|EKO36956.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 517
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 141/363 (38%), Positives = 203/363 (55%), Gaps = 31/363 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP-- 149
GW D+S+HG + IPTPNIDALA NG+ LN YA PVC+P+RASL+TG + + G+ P
Sbjct: 30 GWGDVSYHGGH-IPTPNIDALAKNGVELNRFYASPVCSPTRASLLTGLHIFNHGIIRPLA 88
Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P AE G+P+ + +P++ +E GY T GKWHLG EY P RGF+ +G++ G
Sbjct: 89 NPT--AEQYGLPVDLKIMPQFFKEAGYQTALSGKWHLGMHLEEYWPTNRGFDQSYGHMLG 146
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
I Y+DH+ S + D RN ++ Y+T+L EAV++IE + ++PL
Sbjct: 147 GIGYFDHVHSSRL---------DWHRNEEPLFED--GYSTELIANEAVRIIETKDPNRPL 195
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY+A A H ++AP + I F YI DP R YAA V LD +G +I A++ +
Sbjct: 196 FLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDRAYAANVNALDAEIGKIIYAIEDQ 250
Query: 329 GMLENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-IL 382
G+LE +II+F SDNG P + GS G K + EGG++VPA I+
Sbjct: 251 GILEETIIVFFSDNGPVFDINPIIAVLAPGLVEARGSTAGLPGSKTSAREGGIRVPASIM 310
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W + + S Q + + D LPTL +AA + S L + DG D+W +L+ N++
Sbjct: 311 WKGVLDNSK--SDQYIFVQDLLPTLLSAAEIEVSDLKV-FDGTDKWQNLVTGALQAPNND 367
Query: 443 IDG 445
G
Sbjct: 368 FVG 370
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/64 (43%), Positives = 39/64 (60%), Gaps = 5/64 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T+L EAV++IE + ++PLFLY+A A H ++AP + I F YI DP R
Sbjct: 173 YSTELIANEAVRIIETKDPNRPLFLYVAFNAPHT-----PIQAPDKNIELFSYIEDPLDR 227
Query: 75 TYAA 78
YAA
Sbjct: 228 AYAA 231
>gi|449265842|gb|EMC76972.1| Arylsulfatase J, partial [Columba livia]
Length = 467
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 155/493 (31%), Positives = 238/493 (48%), Gaps = 71/493 (14%)
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+RRE P RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRG 60
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
F++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T++ Q
Sbjct: 61 FDTFFGSLLGSGDYYTHFKCD----SPGICGYDLYENDNAAWDHDNGIYSTQMYTQKVQQ 116
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
++ KP+FLY+A+ A H+ L+AP + ++ I + NRR YAAM+ LD++
Sbjct: 117 ILASHNPSKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSINNINRRRYAAMLACLDEA 171
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+ V AL++ G +NSIII+ SDNG + GSN+P RG K T WEGG++
Sbjct: 172 INNVTLALKKYGYYDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIR 222
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
+ SP ++ V +++HI+DW PTL T A G + +DG D W ++ +
Sbjct: 223 AVGFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDIWETI---SEG 278
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
RR+ +D L + + + I +A+R++ WKL+ G GY
Sbjct: 279 RRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGNP------GY- 331
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
S+ VP F SN R ++ + G
Sbjct: 332 ----SDWVPPQAF-----------------------SNAGPNRWHNERVSWTAGKT---- 360
Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
+LFN+ DP E+ ++++ PD+ QL L +T VP + D ++
Sbjct: 361 ---------VWLFNITADPYERVDLSAKYPDVVKQLLRRLSQFNKTAVPVRYPPKD-PRS 410
Query: 618 DPKRFNDTWSPWI 630
+PK W PW
Sbjct: 411 NPKLNGGVWGPWF 423
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 90 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPSKPIFLYIAYQAVHS-----PLQAPG 144
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 145 KYFEHYRSINNINRRRYAAM 164
>gi|440902784|gb|ELR53530.1| Arylsulfatase B, partial [Bos grunniens mutus]
Length = 431
Score = 232 bits (592), Expect = 4e-58, Method: Compositional matrix adjust.
Identities = 134/360 (37%), Positives = 200/360 (55%), Gaps = 32/360 (8%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
+ IHTG+Q IW +P +PL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF
Sbjct: 1 FKIHTGLQHQIIWPCQPSCIPLDEKLLPQLLKEAGYATHMVGKWHLGMYRKECLPTRRGF 60
Query: 200 ESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+++FGYL G YY H L D + V D R A Y+T++FT+ A
Sbjct: 61 DTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNVFTERAT 118
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR YA M +D+
Sbjct: 119 TLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRRYYAGMASVMDE 173
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG V +AL+R+G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV
Sbjct: 174 AVGNVTAALERRGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGV 224
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
+ + SP +++ + +++HISDWLPTL AGG T+ +DG D W+++ +P
Sbjct: 225 RGVGFVASPLLKRKGVKTRELIHISDWLPTLVKLAGGSTNGTK-PLDGFDVWNTISEGSP 283
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
S R + +D + + P NS+ + DE AA+R +WKL+ G
Sbjct: 284 SPRMELLHNID--PNFVDTAPCPGNSMSLAKDESSLLEYSAFNISIHAAIRHQNWKLLTG 341
Score = 55.5 bits (132), Expect = 8e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 108 YSTNVFTERATTLITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDRNRR 162
Query: 75 TYAALT 80
YA +
Sbjct: 163 YYAGMA 168
>gi|313233524|emb|CBY09696.1| unnamed protein product [Oikopleura dioica]
Length = 609
Score = 232 bits (591), Expect = 6e-58, Method: Compositional matrix adjust.
Identities = 163/551 (29%), Positives = 256/551 (46%), Gaps = 94/551 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +H + EI TP +D LA G+ L Y QPVCTP+R LMTG+Y I GMQ +
Sbjct: 109 GYHDIGYHQA-EILTPFMDKLATTGVRLEQYYVQPVCTPTRVQLMTGRYQIRYGMQHGVV 167
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P GVPL E+ LPE LR+ GY+T+ IGKWHLG F +Y P RGF+ G+ G
Sbjct: 168 RPPQPDGVPLDEKLLPEALRKCGYNTEMIGKWHLGMFTEDYLPQNRGFDHFMGFYTGSQD 227
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLS-----TAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
+Y H + +S + G+D R + WD Y+T +F E + + +
Sbjct: 228 FYSH--NKCFS---GMCGYDFREATAGQPEVIRWDLNNTYSTGVFADELEKRLSKMNPSE 282
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
P F YL+ A H+ L+AP++ I+Q++ + + RR Y AMV +D ++ V+ A +
Sbjct: 283 PSFTYLSFQAVHS-----PLQAPRKFIDQYRNLKNRKRRNYNAMVTAMDTAIYKVVKAYK 337
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+ +++I+IF SDNG N + SN+P RG K +L+EGGV+ + SP
Sbjct: 338 KFKFWDDTILIFSSDNGG---------NLKGGASNWPLRGAKGSLFEGGVRTIGFVHSP- 387
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR---RNSN 442
++ ++DW PTL AG TS +DG+ Q L + + R+
Sbjct: 388 ----------LLPVTDWFPTLLQLAGCKTSDYGGKPLDGVSQVRQLWMGSNGNDHARDEI 437
Query: 443 IDGLDQWSSLLLNTPSRRNSVLIN----IDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
+ GLD + + R+ ++ N +D + A+R WKL+ G Q
Sbjct: 438 LHGLDPLNKVKNTVDPRQFPIIKNRTFSVDVE---GALRWKQWKLITGKQ---------- 484
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
+ D+ R + T++ GA
Sbjct: 485 ------------------------------------SPDQNRWSTPPTTMYGGAKTIFDK 508
Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
P + L+N+ +DP E+N+I+ P + + + L + VP + DL +AD
Sbjct: 509 VEPKKSKLVRLYNVVDDPFERNDISDEHPQVVNVMLTKLADYYDASVPPRYPAYDL-RAD 567
Query: 619 PKRFNDTWSPW 629
P + W PW
Sbjct: 568 PALGDGVWRPW 578
Score = 40.8 bits (94), Expect = 2.0, Method: Compositional matrix adjust.
Identities = 25/91 (27%), Positives = 43/91 (47%), Gaps = 6/91 (6%)
Query: 9 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
WD Y+T +F E + + +P F YL+ A H+ L+AP++ I+Q++ +
Sbjct: 256 WDLNNTYSTGVFADELEKRLSKMNPSEPSFTYLSFQAVHS-----PLQAPRKFIDQYRNL 310
Query: 69 TDPNRRTY-AALTKSTTLTLLIVYGWNDLSF 98
+ RR Y A +T T +V + F
Sbjct: 311 KNRKRRNYNAMVTAMDTAIYKVVKAYKKFKF 341
>gi|449499600|ref|XP_002193905.2| PREDICTED: arylsulfatase J [Taeniopygia guttata]
Length = 488
Score = 231 bits (590), Expect = 7e-58, Method: Compositional matrix adjust.
Identities = 156/492 (31%), Positives = 236/492 (47%), Gaps = 71/492 (14%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+RRE P RGF
Sbjct: 23 YQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKWHLGFYRRECMPTQRGF 82
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
+S FG L G YY H D + + G+D+ N + AWD G Y+T ++T++ Q+
Sbjct: 83 DSFFGSLLGSGDYYTHFKCD----SPGICGYDLYENNNAAWDHDNGIYSTQMYTQKVQQI 138
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ KP+FLY+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++
Sbjct: 139 LASHNPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSINNINRRRYAAMLACLDEAI 193
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
V AL++ G ENSIII+ SDNG + GSN+P RG K T WEGG++
Sbjct: 194 NNVTLALRKYGYYENSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRA 244
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++ V +++HI+DW PTL T A G + +DG D W ++ + R
Sbjct: 245 VGFVHSPLLKNKGSVCKELVHITDWFPTLITLAEGQIDE-DIQLDGYDVWETI---SEGR 300
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
R+ +D L + + + I +A+R++ WKL+ G GY
Sbjct: 301 RSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVNHWKLLTGN------PGY-- 352
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
S+ VP F SN R ++ + G
Sbjct: 353 ---SDWVPPQAF-----------------------SNAGPNRWHNERVSWSAGKT----- 381
Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
+LFN+ DP E+ ++++ P++ QL L +T VP + D +++
Sbjct: 382 --------VWLFNITADPYERVDLSARHPEVVKQLLRRLSQFNKTAVPVRYPPKD-PRSN 432
Query: 619 PKRFNDTWSPWI 630
PK W PW
Sbjct: 433 PKLNGGVWGPWF 444
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 45/80 (56%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 111 LYENNNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 165
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 166 RYFEHYRSINNINRRRYAAM 185
>gi|291236588|ref|XP_002738221.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 504
Score = 231 bits (589), Expect = 9e-58, Method: Compositional matrix adjust.
Identities = 168/548 (30%), Positives = 260/548 (47%), Gaps = 99/548 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H I TP ID LA NG+ LNN YA +C PSR LM+G++ +H + G
Sbjct: 39 GWNDVGYHNL-YIKTPTIDRLANNGVKLNNYYAANLCVPSRNMLMSGRH-VHGVIMG--- 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
PRG+PL E + L+E GYST +GKW+ GF+ +E+ P RGF++ FG+++
Sbjct: 94 ---YPRGLPLNETTIANKLKEAGYSTHLVGKWNCGFYSKEFLPHNRGFDTFFGFVDSKED 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y H++ D D+RRN L A G Y+T ++ E +I++ +KPLFL
Sbjct: 151 HYTHMVHDI---------SDLRRNDLCVADKYYGNYSTIMYGNEGTTIIDNHDTNKPLFL 201
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY---ITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+++ A H + L+ P ++ + +Y I D +RR YA MV +D+++ +I L+
Sbjct: 202 FMSFSAVH-----EPLQVP--SVYEKEYIPTIDDTDRRIYAGMVSCVDEAIANIIETLEN 254
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
K ML+NSII+F SDNG + G+N+P RG K + WEGGV+ ++S +
Sbjct: 255 KDMLKNSIIVFTSDNGGDPLYS---------GNNWPLRGWKASNWEGGVRALGFVYSEIL 305
Query: 388 QQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
N R + ++MHI+DW PTL +GG+ L + G+ QW + S R +
Sbjct: 306 PTNARGTDNNELMHITDWFPTLVDISGGNIHTEEL-LYGVSQWGVIRGLAKSPREDVLIA 364
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
LD + + R N + AA+++ WKL+ G + G R +
Sbjct: 365 LDTTRKAV--SKYRTGIYGTNDFDITTLAALKMGKWKLLTGQND--------GDNRWFQP 414
Query: 506 PLLN--FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT 563
P +N +I++ ++ D+ R R
Sbjct: 415 PEINPPVKSIID------------------TDFDRKRMVR-------------------- 436
Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDIS-SQLYELLKYHRRTLVPQSHEQPDLVQADPKRF 622
L+NL DP E +++ R DI L L +Y T+ Q P + D K
Sbjct: 437 -----LYNLDEDPTETTDLSLEREDIVLEMLARLTRYASSTVPAQLQTVP---RVDRKTM 488
Query: 623 NDTWSPWI 630
D PWI
Sbjct: 489 TDCLGPWI 496
>gi|323454531|gb|EGB10401.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 530
Score = 231 bits (588), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 147/435 (33%), Positives = 219/435 (50%), Gaps = 43/435 (9%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQP 126
+ +P A T L GWND+ + ++ TP + LA NG+ L Y Q
Sbjct: 14 LPEPEEVASAPETPHIVYVLFDDMGWNDIGYQSTDMAALTPVLSDLAENGVKLTQYYTQS 73
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
CT SRA+L++G P+H G+ I P G+PL + LP+YL+E GY T +GKW +G
Sbjct: 74 TCTVSRAALLSGVLPMHNGISHGTIVMDSPIGLPLKYKLLPQYLQESGYRTYMVGKWDIG 133
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
F EY P RGF+ FG+ I+Y+ HI S Y D+R T + Y
Sbjct: 134 HFNEEYLPHNRGFDHFFGFYGADITYFSHISSRGYCANPNCFP-DLRNEDETMANASMRY 192
Query: 247 ATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYI--- 299
TDLF + AV +E + PLFLYL+ A H + APQE + N+ + +
Sbjct: 193 TTDLFRERAVGFVEGHAANHATDPLFLYLSFNAPHYPTS-----APQEFMRNEAELLAPF 247
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRN 357
T+ RR +AA+V D SVG V+ AL+ G +S+I+F SDNGA T ++ N
Sbjct: 248 TNRERRVFAAVVNYADRSVGRVVEALKATGAYNDSVILFASDNGAVPTTCNQSHCTSESN 307
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHISDWLPTLYTAAGGD 414
GSN+P RG+K T WEGG +VPA + +P+ + + + ++HI+DW+PT+
Sbjct: 308 TGSNWPLRGMKATYWEGGCRVPAFVHAPKYLGDRASGSLYQGIVHITDWIPTI------- 360
Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE----K 470
+ G+ + SS LL + + G D W S+ R +L +D
Sbjct: 361 -------VRGVLRSSSTLL------GAEVSGADHWDSIRKLATPPRTELLYGVDHCVAGG 407
Query: 471 KRTAAVRLDSWKLVL 485
+ T A+R+D+WK ++
Sbjct: 408 QLTGAIRVDNWKFII 422
>gi|395510440|ref|XP_003759483.1| PREDICTED: arylsulfatase B [Sarcophilus harrisii]
Length = 659
Score = 230 bits (587), Expect = 1e-57, Method: Compositional matrix adjust.
Identities = 183/550 (33%), Positives = 259/550 (47%), Gaps = 86/550 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H SN I TP++DALA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 185 GWNDVGYHDSN-IFTPHLDALAAGGVRLENYYTQPLCTPSRSQLLTGRYQIHTGLQHQII 243
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P +PL E+ LPE L+E GY T +GKWHLG FR+E P RGF++ FGYL G
Sbjct: 244 WPCQPSCLPLDEKLLPELLQEAGYVTHMVGKWHLGMFRKECLPTRRGFDTFFGYLLGSED 303
Query: 212 YYDHILSDQY-SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
YY H + V D R A Y+T++FT++A+ LI P K
Sbjct: 304 YYSHKHCVHIDALNVTRCALDFRDGEDVAEGYNNTYSTNIFTEKAIDLIAKHPAQKGTED 363
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+ L + L + +E ++ F + T ++ G++ + +
Sbjct: 364 LIVILDSRGVPT---LVSSEEQVS-FLCKDQSSDHTSLQIIH-----FGSIKNVQTPRAS 414
Query: 331 LENSIIIF-MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ENS F DNG T+ G+N+P RG K TLWEGG++ + SP ++Q
Sbjct: 415 SENSSSAFPFPDNGGQTLAG---------GNNWPLRGRKWTLWEGGIRGVGFVASPLLKQ 465
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRRNS---NID- 444
S +++HISDWLPTL AGG T S PL DG D W ++ PS R NID
Sbjct: 466 TGVQSRELIHISDWLPTLVNLAGGHTNSTKPL--DGFDVWKTISEGNPSPRMELLHNIDP 523
Query: 445 GLDQWSSLLLNT--PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQT 500
G S +T PS+ +S + AA+R +WKL + GY G
Sbjct: 524 GFVDISPCPESTRPPSKSDSFSTTHSTFNVSIHAAIRYGNWKL---------LTGYPGCD 574
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
P L SNI K P+P +P
Sbjct: 575 YWFPPPSL-------------------------SNITK--------------QPSPDSP- 594
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPK 620
N +LF++ DP E+ +++S P I +L + L+Y++++ VP + D DPK
Sbjct: 595 ---NKNLWLFDIQQDPEERYDLSSKHPSIVKKLLDRLQYYQKSSVPVFYPDED-PHCDPK 650
Query: 621 RFNDTWSPWI 630
W PW+
Sbjct: 651 A-TGVWGPWM 659
>gi|345326826|ref|XP_003431088.1| PREDICTED: arylsulfatase J [Ornithorhynchus anatinus]
Length = 504
Score = 229 bits (584), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 154/508 (30%), Positives = 239/508 (47%), Gaps = 73/508 (14%)
Query: 124 AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
A P C + S KY IHTG+Q I +P +PL LP+ L+E+GYST +GKW
Sbjct: 20 ALPGCPQNEGSCFQQKYQIHTGLQHSIIRPTQPNCLPLDNVTLPQKLKEVGYSTHMVGKW 79
Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-T 242
HLGF+R+E P RGF++ FG L G YY H D + + G+D+ N AW+
Sbjct: 80 HLGFYRKECMPTQRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDDAAWEYD 135
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
G Y+T ++T+ Q++ +PLFLY+A+ A H+ L+AP ++ I +
Sbjct: 136 NGIYSTQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYFEHYRSIINI 190
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSN 361
NRR YAAM+ LD+++ V AL+R G +NSI+I+ SDNG PT GSN
Sbjct: 191 NRRRYAAMLSCLDEAIHNVTLALKRYGFYDNSILIYSSDNGGQPTAG----------GSN 240
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
+P RG K T WEGG++ + SP ++ V +++HI+DW PTL T A G +
Sbjct: 241 WPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCRELVHITDWYPTLITLAEGQLDE-GVR 299
Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
+DG D W ++ + RR+ +D L + + + I +A+R+ W
Sbjct: 300 LDGYDVWETI---SEGRRSPRVDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVRHW 356
Query: 482 KLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
KL+ G F+ V + + SN+ R
Sbjct: 357 KLLTGNP--------------------GFSEWVPPQAF--------------SNLGPNRW 382
Query: 542 TRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
++ + G + +LFN+ DP E+ ++++ P + QL L
Sbjct: 383 HNERISWSAGKS-------------VWLFNITADPYERVDLSARYPGVVKQLLRRLSQFN 429
Query: 602 RTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+T VP + P +++P+ W PW
Sbjct: 430 KTAVPVRYP-PRDPRSNPRLNGGVWGPW 456
Score = 47.4 bits (111), Expect = 0.021, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 6/77 (7%)
Query: 4 NLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
N AW+ G Y+T ++T+ Q++ +PLFLY+A+ A H+ L+AP
Sbjct: 127 NDDAAWEYDNGIYSTQMYTQRVQQILASHDPRRPLFLYIAYQAVHS-----PLQAPGRYF 181
Query: 63 NQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 182 EHYRSIININRRRYAAM 198
>gi|421593685|ref|ZP_16038213.1| sulfatase [Rhizobium sp. Pop5]
gi|403700318|gb|EJZ17522.1| sulfatase [Rhizobium sp. Pop5]
Length = 497
Score = 229 bits (583), Expect = 4e-57, Method: Compositional matrix adjust.
Identities = 150/398 (37%), Positives = 208/398 (52%), Gaps = 48/398 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ +HGS +I TPNID LA G L Y QP+CTPSRA+ MTG+YP G+Q I
Sbjct: 66 GWKDVGYHGS-DIRTPNIDRLAAEGARLEQFYVQPMCTPSRAAFMTGRYPFRYGLQTAVI 124
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ G+ L E LP+ L++ GY T GKWHLG + Y P RGF+S +G L G I
Sbjct: 125 PQSGTYGLALDEYPLPQVLKDAGYYTAMSGKWHLGHSKTAYWPRQRGFDSFYGALLGEID 184
Query: 212 YYDHILSDQYSRTVELNGH-DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
++ H + NG+ D RN + A G Y L EAV++I KPLFL
Sbjct: 185 HFTHKAA---------NGNPDWYRN-NKALKEEG-YDNILIGAEAVRVINKHDQQKPLFL 233
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA A H +AP+E +++ +I D +RR YAAM+ +DD +G V++AL+++GM
Sbjct: 234 YLAFTAPHT-----PYQAPKEYLDRNSHIADESRRKYAAMISVMDDEIGKVVAALEKRGM 288
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW---GSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
+N++I+F SDNG + + N PYR K T +EGG +V ++ W +
Sbjct: 289 RDNTLIVFHSDNGGVVNAFFAGESEVKGKLPADNGPYREGKGTTYEGGTRVAGLVNWPGK 348
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
I+ P MH+ D PTL AG L+ +N +DG+
Sbjct: 349 IK--PGTIDGPMHVVDMYPTL---AG-------------------LVGATLGKNKPLDGM 384
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
D W +L PS R ++ NI+ TAAVR WKLV
Sbjct: 385 DMWPTLAEGKPSPRTEIVYNIE--PMTAAVRQGDWKLV 420
Score = 45.4 bits (106), Expect = 0.081, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 5/65 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y L EAV++I KPLFLYLA A H +AP+E +++ +I D +RR
Sbjct: 209 YDNILIGAEAVRVINKHDQQKPLFLYLAFTAPHT-----PYQAPKEYLDRNSHIADESRR 263
Query: 75 TYAAL 79
YAA+
Sbjct: 264 KYAAM 268
>gi|260788444|ref|XP_002589260.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
gi|229274435|gb|EEN45271.1| hypothetical protein BRAFLDRAFT_213058 [Branchiostoma floridae]
Length = 455
Score = 228 bits (581), Expect = 8e-57, Method: Compositional matrix adjust.
Identities = 123/269 (45%), Positives = 165/269 (61%), Gaps = 10/269 (3%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YGWND+ +H S I TPN+D LA G+ L N Y QP+C+PSR LMTG+Y IH G+Q
Sbjct: 11 YGWNDIGYHNSF-IRTPNLDRLASEGVKLENYYVQPICSPSREQLMTGRYQIHYGLQHSV 69
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I P G+PL E LP+ L+E GYST +GKWHLGFFR+EY PL RGFE FGYL G
Sbjct: 70 IMCDRPHGLPLDEVTLPQRLKENGYSTYMVGKWHLGFFRKEYMPLQRGFERFFGYLTGGE 129
Query: 211 SYYDHILSDQYSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ H Q+S+ E +G D+R D G Y+T LF ++A+++I + KP+F
Sbjct: 130 DYWTHRKPSQFSKDPSEFHGLDLRDQDKPVLDQNGTYSTHLFARKAIEMILNHDQSKPMF 189
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LYL A H LEAP+E ++ I + RTYAAMV +D++VG V AL+ +G
Sbjct: 190 LYLPFQAVHGP-----LEAPEEYKRIYEDIDNSLVRTYAAMVTVMDEAVGNVTGALKHRG 244
Query: 330 MLENSIIIFMSDNGAPT---VEYRETSNY 355
M N+I+IF + + A T ++Y T +Y
Sbjct: 245 MWRNTILIFSTASSAFTQSQIKYDGTISY 273
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/71 (38%), Positives = 40/71 (56%), Gaps = 5/71 (7%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y+T LF ++A+++I + KP+FLYL A H LEAP+E ++ I
Sbjct: 161 DQNGTYSTHLFARKAIEMILNHDQSKPMFLYLPFQAVHGP-----LEAPEEYKRIYEDID 215
Query: 70 DPNRRTYAALT 80
+ RTYAA+
Sbjct: 216 NSLVRTYAAMV 226
>gi|428202415|ref|YP_007081004.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
gi|427979847|gb|AFY77447.1| arylsulfatase A family protein [Pleurocapsa sp. PCC 7327]
Length = 538
Score = 228 bits (580), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 134/350 (38%), Positives = 194/350 (55%), Gaps = 25/350 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H S EI TPN+D LA + L+ Y CTP+RA+LMTG++P GM I
Sbjct: 54 GWNDVGYHNS-EIKTPNLDKLAESSTRLDRFYVTSSCTPTRAALMTGRHPSRYGMSSGVI 112
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W + G+PL E+ + + L+E GY T +GKWHLG ++ EY P RGF+ H+G+ G I
Sbjct: 113 WPWDKVGLPLEEKTIAQTLKEAGYYTAIVGKWHLGHYKEEYLPTRRGFDYHYGHYCGSID 172
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
Y+ H L G D RN + YATDL +EAV+LI D +K PLFL
Sbjct: 173 YFTHQLDAGIQ-----GGLDWHRNEQPVEEE--GYATDLLAQEAVKLIRDCDYNKSPLFL 225
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y++ A HA L+A ++ I + I D RR +AA V+ +D++VG +++AL+ K +
Sbjct: 226 YVSFNAPHA-----PLQAKEKDIKNYANIQDEGRRIFAAQVQSMDEAVGRIVAALKEKQV 280
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
+N+++ F SDNG + + W G N P RG K TL++GGV+VP I+ P
Sbjct: 281 WDNTLLFFTSDNGG--------GSDQPWTRGDNRPLRGQKGTLYDGGVRVPTIISFPAQL 332
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ +V Q+ + D PTL AG S +DG+D S+ + R
Sbjct: 333 KGGQVIEQVFSVVDLYPTLAKLAGVKESSQN-QLDGVDILESIATGSTRR 381
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 6/69 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
YATDL +EAV+LI D +K PLFLY++ A HA L+A ++ I + I D R
Sbjct: 200 YATDLLAQEAVKLIRDCDYNKSPLFLYVSFNAPHA-----PLQAKEKDIKNYANIQDEGR 254
Query: 74 RTYAALTKS 82
R +AA +S
Sbjct: 255 RIFAAQVQS 263
>gi|402821074|ref|ZP_10870630.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
gi|402510105|gb|EJW20378.1| hypothetical protein IMCC14465_18640 [alpha proteobacterium
IMCC14465]
Length = 526
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 138/388 (35%), Positives = 215/388 (55%), Gaps = 37/388 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ +HGS +I TP+ID LA G LN YA P C+P+RA+LMTG+ P+ G+ +
Sbjct: 62 GWGDVGYHGS-DIQTPHIDRLAKEGAKLNRFYATPFCSPTRAALMTGRDPLKLGVAYSVL 120
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E GV L E FLP+ + GY+T +GKWHLG ++TP RGF+ +G+++ +S
Sbjct: 121 MPWENGGVSLDEHFLPQSFQAAGYNTAMVGKWHLGHTIEQHTPNARGFDLFYGHMHTQVS 180
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED-QPVDKPLFL 270
Y+DH ++ NGHD + N D GEYATD+ +A + I D + KP L
Sbjct: 181 YFDHQIA---------NGHDFQEN-GKPVDHNGEYATDVHGAQAARFITDLRDKTKPFLL 230
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD---PNRRTYAAMVKKLDDSVGTVISALQR 327
Y+ LA H+ +EAP+ +++ D R+TYAAMV +D ++G V++AL
Sbjct: 231 YVPFLAPHS-----PMEAPEALKDKYSSRLDLPGSTRKTYAAMVDSMDQAIGKVLTALDE 285
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSP 385
+G+ +N+I+ F SDNG + N+GS N PYRG K ++EGG++V A++ W
Sbjct: 286 EGVADNTIVFFFSDNGG----------FENYGSDNGPYRGGKLEVYEGGIRVTAVMRWPE 335
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI-- 443
++ V +++ + D LPTL AAG + IDG+++WS++ S+R +
Sbjct: 336 KLAAGSEVD-EIVSVMDLLPTLTHAAGVENGT-EKKIDGVNRWSTITGEKVSKRKGALYF 393
Query: 444 -DGLDQWSSLLLNTPSRRNSVLINIDEK 470
+ ++ L + ++ NID K
Sbjct: 394 ASNIPVYNKFQLGVIDGKWKLVQNIDHK 421
Score = 40.0 bits (92), Expect = 3.6, Method: Compositional matrix adjust.
Identities = 27/77 (35%), Positives = 40/77 (51%), Gaps = 9/77 (11%)
Query: 10 DTVGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
D GEYATD+ +A + I D + KP LY+ LA H+ +EAP+ +++
Sbjct: 200 DHNGEYATDVHGAQAARFITDLRDKTKPFLLYVPFLAPHS-----PMEAPEALKDKYSSR 254
Query: 69 TD---PNRRTYAALTKS 82
D R+TYAA+ S
Sbjct: 255 LDLPGSTRKTYAAMVDS 271
>gi|291233691|ref|XP_002736785.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 499
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 158/540 (29%), Positives = 251/540 (46%), Gaps = 88/540 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H E+ P ++ LA +G+I N Y QP CTP+RA+LM+G YP TG Q +
Sbjct: 42 GWNDVGWHNP-EVKMPVLNQLAADGVIFNQAYVQPTCTPTRAALMSGYYPFKTGNQHQLL 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+PL + LP+ L+++GY T +GKWHLGF + + P RGF+S +G L S
Sbjct: 101 LNLHPGGLPLRFKTLPQRLKDVGYLTHIVGKWHLGFCKEAFLPTNRGFDSFYGGLTLGTS 160
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ + + + G+D N T +Y + AV++I + PLF+Y
Sbjct: 161 HFSKM-----NGILSTPGYDFYDNSGVVPQT-NDYLAFMLADRAVKIINGHYQEYPLFMY 214
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ K E P E + ITD R + + +DD+VG+V+ AL+++ M
Sbjct: 215 FS-----MDVPAKSPEVPPEYEALYANITDDRTRRFYGKLSVMDDAVGSVVDALKKRDMW 269
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+++++IF+ DNGA + GSN+P RG+ TL+EG +VPA + ++Q
Sbjct: 270 DDTLLIFIGDNGALA---------SSSGSNWPLRGIAATLFEGATRVPAFIHGNMLKQTG 320
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+ ++ HI D T+ + AG ++S IDG+D W D +S
Sbjct: 321 YENNELWHIVDLHKTILSLAGAESS---CEIDGMDMW------------------DTFSK 359
Query: 452 LLLNTPSRRNSVLINIDEKKRT--AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
N S RN ++ NID+ + AA+R+ +KL+ G
Sbjct: 360 ---NESSPRNEIVYNIDDDPISPGAAIRVGDYKLIAG----------------------- 393
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPA-PMTPSPCTNGPCY 568
+ ++F P+ I++ + P P P Y
Sbjct: 394 ----------------NPDLFYPLRLINQSDGWFNYGDVPTSGLPVLPTGDEPPPPNVTY 437
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
LFN+ +DP E+NNIA P+I L + L HR +P ++ D DP F+ +SP
Sbjct: 438 LFNVIDDPEERNNIADDHPEIVQALRQRLDEHRIYYMPAVNQVFD-EAGDPSNFDGVFSP 496
>gi|432104048|gb|ELK30879.1| Arylsulfatase J [Myotis davidii]
Length = 496
Score = 227 bits (579), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 155/492 (31%), Positives = 230/492 (46%), Gaps = 73/492 (14%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y IHTG+Q I A+P +PL LP+ LRE GY+T +GKWHLGF+RRE P +RGF
Sbjct: 31 YQIHTGLQHSVIRPAQPNCLPLGHATLPQKLREAGYATHMVGKWHLGFYRRECMPTHRGF 90
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
++ FG L G YY H D G+D+ N AWD G Y+T ++T+ Q+
Sbjct: 91 DTFFGSLLGSGDYYTHYKCDGPGAC----GYDLYENDRAAWDHDDGVYSTHMYTRRVQQI 146
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ KPLFLY+A+ A H+ L+AP ++ I + +RR YAAM+ LD++V
Sbjct: 147 LAAHDPGKPLFLYIAYQAVHSP-----LQAPSRYFRHYRSIANTHRRRYAAMLSCLDEAV 201
Query: 319 GTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
G V AL+ G+ NSI+I+ SDNG P+ GSN+P RG K T WEGG++
Sbjct: 202 GNVTLALRAHGLYNNSILIYSSDNGGQPSAG----------GSNWPLRGSKGTYWEGGIR 251
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
+ SP ++Q V +++HI+DW PTL A G ++DG D W ++ +
Sbjct: 252 AVGFVHSPLLRQRGTVCRELVHITDWYPTLVALAEGHIEE-DAHLDGYDVWETI---SEG 307
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
R+ +D L + + I +A+R+ WKL+ G G
Sbjct: 308 LRSPRVDILHNIDPIYTKAKHGSWAAGYGIWNTAVQSAIRVQHWKLLTGNPGYG------ 361
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
+ VP F SN+ R ++ T+ G +
Sbjct: 362 -----DWVPPQAF-----------------------SNLGPSRWHSERVTLATGQS---- 389
Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
+LFN+ DP E+ +++ P + QL L RT VP + P ++
Sbjct: 390 ---------VWLFNITADPYERVDLSGRYPGVVRQLLRRLSQFNRTAVPVRYP-PGDPRS 439
Query: 618 DPKRFNDTWSPW 629
+P+ W PW
Sbjct: 440 NPRLNGGVWGPW 451
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N AWD G Y+T ++T+ Q++ KPLFLY+A+ A H+ L+AP
Sbjct: 119 LYENDRAAWDHDDGVYSTHMYTRRVQQILAAHDPGKPLFLYIAYQAVHSP-----LQAPS 173
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + +RR YAA+
Sbjct: 174 RYFRHYRSIANTHRRRYAAM 193
>gi|431918040|gb|ELK17268.1| Arylsulfatase I [Pteropus alecto]
Length = 547
Score = 227 bits (578), Expect = 1e-56, Method: Compositional matrix adjust.
Identities = 167/542 (30%), Positives = 250/542 (46%), Gaps = 105/542 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + R RGF++ G L G +
Sbjct: 119 RPRQPNCLPLDQSXXXXXXR-------------------------RGFDTFLGSLTGNVD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ S AW G+Y+T L+ + ++ +PLFLY
Sbjct: 154 YYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLY 209
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 210 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 264
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
NS+IIF SDNG T + GSN+P RG K T WEGGV+ + SP +++
Sbjct: 265 NNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKR 315
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
R S ++HI+DW PTL AGG T+ +DG D W ++ S R + +D
Sbjct: 316 RTSRALVHITDWYPTLVGLAGG-TASAADGLDGYDVWPAISEGRASPRTEILHNIDP--- 371
Query: 452 LLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L +R S+ I AA+R+ WKL+ G D YG + +P
Sbjct: 372 --LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPGYG----DWIPP-- 416
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
Q+L + + N+++M S RQ +L
Sbjct: 417 ----------QTLAAFPGSWW----NLERMASARQA---------------------VWL 441
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPKRFNDTWS 627
FN+ DP E+ ++A RPD+ L L + RT +P + E P +A P W
Sbjct: 442 FNISADPYEREDLAGQRPDVVRALLARLADYNRTAIPVRYPAENP---RAHPDFNGGAWG 498
Query: 628 PW 629
PW
Sbjct: 499 PW 500
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 175 SVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 229
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 230 RTMGNVARRKYAAMV 244
>gi|171910063|ref|ZP_02925533.1| twin-arginine translocation pathway signal precursor
[Verrucomicrobium spinosum DSM 4136]
Length = 496
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 152/439 (34%), Positives = 220/439 (50%), Gaps = 66/439 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G +D+ + GS EI TP++D LA G L+ Y QPVC+P+RA+L+TG+YP G Q +
Sbjct: 64 GSHDVGWRGS-EIKTPHLDELARAGATLDQFYVQPVCSPTRAALLTGRYPFRYGFQTGVV 122
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+PL ER LP+ L+E GY T GKWHLG F+ Y P RGF+ +G+ NG++
Sbjct: 123 RPWAEYGLPLEERTLPQALKEAGYETAITGKWHLGHFQPAYLPTKRGFDHQYGHYNGMLD 182
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY HI G D RN D Y+T+L KEA + + ++ +PLFLY
Sbjct: 183 YYTHIRH---------GGFDWHRNDQENHDE--GYSTELVGKEAARRVRERDKSRPLFLY 231
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ H+ + + P E + + R+TYA MV LD +VG ++ A++ + ++
Sbjct: 232 VPFNGVHSPH-----QVPVEYETPYTQF-EGKRKTYAGMVTALDAAVGRIVQAVRDEKLV 285
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQN 390
EN++ IF SDNG P N + SN P R K T++EGGV+V A W +I+
Sbjct: 286 ENTLFIFSSDNGGP--------NPKQLTSNGPLRAGKGTVYEGGVRVCAFATWPGKIK-- 335
Query: 391 PRVSLQM-MHISDWLPTLYTAAGGDTSR-LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
P S+Q +HI+DW PTL AG + LPL DG D
Sbjct: 336 PGTSVQAPIHIADWFPTLLGLAGASLKQPLPL-----------------------DGRDV 372
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG----TQENGTMDGYYGQTRSNK 504
W+++ S +++LIN A+R WKLV+G E+G D G+ + +
Sbjct: 373 WATVAEGAASPHDTILINTTPS--GGAIRSGDWKLVIGGANRDDEDGPGDAAKGKGKKGR 430
Query: 505 ------VPLLNFNAIVESK 517
V L N + V K
Sbjct: 431 EQPAQGVELFNLASDVGEK 449
>gi|149726732|ref|XP_001501521.1| PREDICTED: arylsulfatase I-like [Equus caballus]
Length = 476
Score = 226 bits (577), Expect = 2e-56, Method: Compositional matrix adjust.
Identities = 159/495 (32%), Positives = 234/495 (47%), Gaps = 79/495 (15%)
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+Y IHTG+Q I +P +PL + LP+ L+E GYST +GKWHLGF+R+E P RG
Sbjct: 8 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYRKECLPTRRG 67
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
F++ G L G + YY + D + G D+ S AW G+Y+T L+ + +
Sbjct: 68 FDTFLGSLTGNVDYYTYDNCD----GPGVCGFDLHEGESVAWGLSGQYSTMLYAQRVSHI 123
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ +PLFLY+A A H L++P+E + +++ + + RR YAAMV +D++V
Sbjct: 124 LASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAV 178
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ AL+R G NS+IIF SDNG T + GSN+P RG K T WEGGV+
Sbjct: 179 RNITWALKRYGFYNNSVIIFSSDNGGQTF---------SGGSNWPLRGRKGTYWEGGVRG 229
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP +++ R S ++HI+DW PTL AGG T+ +DG D W ++ S
Sbjct: 230 LGFVHSPLLKRKRRTSRALVHITDWYPTLVGLAGG-TASATDGLDGYDVWPAISEGRASP 288
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
R + +D L +R S+ I AA+R+ WKL+ G D
Sbjct: 289 RTEILHNIDP-----LYNHARHGSLEGGFGIWNTAVQAAIRVGEWKLLTG-------DPG 336
Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
YG + +P A S N+++M S RQ
Sbjct: 337 YG----DWIPPQTLAAFPGS----------------WWNLERMASARQA----------- 365
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDL 614
+LFN+ DP E+ ++A RPD+ L L + RT +P + E P
Sbjct: 366 ----------VWLFNISADPYEREDLAGQRPDVVRALLARLVDYNRTAIPVRYPAENP-- 413
Query: 615 VQADPKRFNDTWSPW 629
+A P W PW
Sbjct: 414 -RAHPDFNGGAWGPW 427
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW G+Y+T L+ + ++ +PLFLY+A A H L++P+E + ++
Sbjct: 102 SVAWGLSGQYSTMLYAQRVSHILASHSPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 156
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 157 RTMGNVARRKYAAMV 171
>gi|355750020|gb|EHH54358.1| Arylsulfatase B, partial [Macaca fascicularis]
Length = 430
Score = 226 bits (576), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 134/358 (37%), Positives = 196/358 (54%), Gaps = 32/358 (8%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF++
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+FGYL G YY H L D + V D R A Y+T++FTK A L
Sbjct: 62 YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATAL 119
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV +D++V
Sbjct: 120 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 174
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+
Sbjct: 175 GNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 225
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++Q + +++HISDWLPTL AGG T+ +DG D W ++ +PS
Sbjct: 226 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 284
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
R + +D + + ++P NS+ D+ AA+R +WKL+ G
Sbjct: 285 RMELLHNID--PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 340
Score = 56.2 bits (134), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 107 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 161
Query: 75 TYAALT 80
YA +
Sbjct: 162 HYAGMV 167
>gi|354473078|ref|XP_003498763.1| PREDICTED: arylsulfatase B [Cricetulus griseus]
Length = 460
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 159/501 (31%), Positives = 239/501 (47%), Gaps = 84/501 (16%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q I +P VPL E+ LP+ L+E GY+T +GKWHLG ++E P RGF++
Sbjct: 32 IHTGLQHLLIRVCQPSCVPLDEKLLPQLLKEAGYATHMVGKWHLGMCQKECLPTRRGFDT 91
Query: 202 HFGYLNGVISYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
+FGYL G +YY H + + V D R A Y+T +F++ A LI
Sbjct: 92 YFGYLLGSENYYTHEVCALIEPLNVTRCALDFRDGEEPATKYNNTYSTHIFSQRATTLIA 151
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
+ P +KPLFLYLA + H L+ P+E + + +I D +RR YA MV +D++VG
Sbjct: 152 NHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRRIYAGMVSLMDEAVGN 206
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V +AL+ G+ N++ +F +DNG T+ + G+N+P RG K TLWEGG++
Sbjct: 207 VTAALKSHGLWNNTVFVFSTDNGGQTL---------SGGNNWPLRGRKWTLWEGGIRGVG 257
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSLLLNTPSRR 439
+ SP ++Q S +++HISDWLPTL AGG T PL DG D W ++ TPS R
Sbjct: 258 FVASPLLKQKGVKSRELIHISDWLPTLVNLAGGSTYGTKPL--DGFDVWRTISEGTPSPR 315
Query: 440 NSNIDGLDQWSSLLLNTPSRRNSVLINIDEK----------KRTAAVRLDSWKLVLGTQE 489
+ +DQ ++P S+ ++ A +R +WKL+ G
Sbjct: 316 VELLHNIDQ--DFFDSSPCPGKSMAEAKNDSFPLKYSAFNTSIHAGIRYKNWKLLTGH-- 371
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
P F S+ SNI ++RS +
Sbjct: 372 ----------------PGCGFWFPPPSQ----------------SNISEIRSLDSSSKT- 398
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
+LF++ DP E+++++ P I +L L+Y+ VP S+
Sbjct: 399 -----------------LWLFDINQDPEERHDLSEKHPHIVLKLLSRLQYYHEHSVP-SY 440
Query: 610 EQPDLVQADPKRFNDTWSPWI 630
P + DPK WSPW+
Sbjct: 441 FPPMDPRCDPKD-TGVWSPWM 460
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T +F++ A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 137 YSTHIFSQRATTLIANHPPEKPLFLYLALQSVH-----DPLQVPEEYMKPYDFIQDKHRR 191
Query: 75 TYAALT 80
YA +
Sbjct: 192 IYAGMV 197
>gi|291235057|ref|XP_002737462.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like,
partial [Saccoglossus kowalevskii]
Length = 355
Score = 226 bits (575), Expect = 3e-56, Method: Compositional matrix adjust.
Identities = 131/344 (38%), Positives = 193/344 (56%), Gaps = 27/344 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ ++ TP ID LA NG+ L N Y C PSR LMTG++ I G+
Sbjct: 34 GWNDVGYNNP-VFKTPTIDRLAGNGVKLLNYYVASHCLPSRNMLMTGRHAIQLGIPQDG- 91
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G PR +PL E + E L+ GYST +GKWH G++ P RGF++ FG++ I
Sbjct: 92 FGYHPRSLPLDETTIAEPLKHAGYSTHIVGKWHCGYYADNCLPHNRGFDTFFGFVGAGID 151
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y H SD ++ H++R+N A +G+Y+T +F E +I +KPLFL
Sbjct: 152 HYTH--SDHFNHM-----HNLRKNDDCIAKKYIGKYSTTIFANEGKDIINAHDQNKPLFL 204
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL+ A HA LE P + Q++ I D +RRTYAAM +D+++ + AL+ K
Sbjct: 205 YLSFSAVHAP-----LEVPSSYLKQYESTIHDEDRRTYAAMTSCVDEAIANITEALEEKN 259
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML+NSII+F SDNG V + G+N+P RG K+T WEGGV+ ++ + +
Sbjct: 260 MLKNSIIVFTSDNGGALVN--------SAGNNWPLRGGKHTSWEGGVRAVGFVYGDLLPR 311
Query: 390 NPRVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ R + +MHI+DW PTL T AGG+ + + G+DQW+ +
Sbjct: 312 HSRGTENTGLMHITDWFPTLITLAGGN-PHVGKKLYGVDQWNMI 354
Score = 53.5 bits (127), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/73 (39%), Positives = 41/73 (56%), Gaps = 6/73 (8%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITD 70
+G+Y+T +F E +I +KPLFLYL+ A HA LE P + Q++ I D
Sbjct: 177 IGKYSTTIFANEGKDIINAHDQNKPLFLYLSFSAVHAP-----LEVPSSYLKQYESTIHD 231
Query: 71 PNRRTYAALTKST 83
+RRTYAA+T
Sbjct: 232 EDRRTYAAMTSCV 244
>gi|355691424|gb|EHH26609.1| Arylsulfatase B, partial [Macaca mulatta]
Length = 430
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 134/358 (37%), Positives = 196/358 (54%), Gaps = 32/358 (8%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
I TG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF++
Sbjct: 2 IRTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 61
Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+FGYL G YY H L D + V D R A Y+T++FTK A L
Sbjct: 62 YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATAL 119
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV +D++V
Sbjct: 120 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAV 174
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +LWEGGV+
Sbjct: 175 GNVTAALKSSGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGVRG 225
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++Q + +++HISDWLPTL AGG T+ +DG D W ++ +PS
Sbjct: 226 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLAGGHTNGTK-PLDGFDVWKTISEGSPSP 284
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLG 486
R + +D + + ++P NS+ D+ AA+R +WKL+ G
Sbjct: 285 RVELLHNID--PNFVDSSPCPGNSMAPAKDDSSLPEYSAFNTSVHAAIRHGNWKLLTG 340
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 107 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 161
Query: 75 TYAALT 80
YA +
Sbjct: 162 HYAGMV 167
>gi|254515652|ref|ZP_05127712.1| arylsulfatase B [gamma proteobacterium NOR5-3]
gi|219675374|gb|EED31740.1| arylsulfatase B [gamma proteobacterium NOR5-3]
Length = 507
Score = 226 bits (575), Expect = 4e-56, Method: Compositional matrix adjust.
Identities = 139/390 (35%), Positives = 208/390 (53%), Gaps = 56/390 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +HGS +I TP+ID LA G+ L+ YAQ C+P+RA+L++G+ G+ P +
Sbjct: 52 GWNDVGYHGS-DIHTPHIDQLAAEGLELDRFYAQTACSPTRAALLSGQSSQSLGIYSP-L 109
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+ L ++ +P Y R+ GY T +GKWHLGF+ EY PL RGF+ +G L G +
Sbjct: 110 SKLNPTGLALDQKIMPAYFRDAGYQTFMVGKWHLGFYEPEYRPLARGFDHFYGNLTGGVG 169
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H V G D +RN T Y+T L + E +LI+ + +KPLFLY
Sbjct: 170 YWNH---------VHGGGLDWQRNGKTLRQE--GYSTHLQSAEITRLIQQRDPEKPLFLY 218
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
A A H N EAP +T+ ++ +I +PNRR +AAMV +LD ++G ++ L +GML
Sbjct: 219 AAFNAPHLPN-----EAPADTLARYAHIENPNRRIHAAMVTELDSAIGQLMETLSTEGML 273
Query: 332 ENSIIIFMSDNGA---------------------------PTVEYRETSNYRNWGSNYPY 364
EN++I FMSDNG T+E+ T+ N P+
Sbjct: 274 ENTLIWFMSDNGGLNRTAMPSGLVSMSQRLEDWFGKPLFPKTLEFIRTNALDGGSDNSPH 333
Query: 365 RGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
R K +++EGG +VP+ + W ++ +P QM+ + D LPTL +A D + LP
Sbjct: 334 RKGKQSIYEGGARVPSFVYWKGRL--SPERITQMVTVKDVLPTLLSATDIDANGLPTATT 391
Query: 424 ----GLDQWSSL----LLNTPSRRNSNIDG 445
G+DQW L + P + +DG
Sbjct: 392 TESAGVDQWPGLTRGEFIQAPDYLINGMDG 421
Score = 50.8 bits (120), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T L + E +LI+ + +KPLFLY A A H N EAP +T+ ++ +I +PNRR
Sbjct: 193 YSTHLQSAEITRLIQQRDPEKPLFLYAAFNAPHLPN-----EAPADTLARYAHIENPNRR 247
Query: 75 TYAALT 80
+AA+
Sbjct: 248 IHAAMV 253
>gi|291239589|ref|XP_002739705.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 489
Score = 225 bits (574), Expect = 5e-56, Method: Compositional matrix adjust.
Identities = 163/538 (30%), Positives = 245/538 (45%), Gaps = 86/538 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I P ++ LA +G+I N Y QP CTPSRA+L+TG YP Q +
Sbjct: 34 GWNDVGWHNA-DIKMPILNQLAADGVIFNQSYVQPACTPSRAALLTGYYPFKIQRQHQML 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E G+ L + LPE L+++GY T +GKWHLGF + EY P RGF+S +G+L +
Sbjct: 93 LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDSFYGWLTLGTT 152
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y + G+D R N ++ Y + + AV ++ + PLFL
Sbjct: 153 LYS-------KENIIAPGYDFRDNTGVVQES-DTYLPFMLAERAVDIVMGHYKEYPLFLE 204
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+ +GK LE PQE + + I D +R + + +D+S+G ++ AL+ +GM
Sbjct: 205 FS-----MALSGKFLEVPQEYEDLYSDIEDDRQRKFYGKLSVMDNSIGLLVDALKTRGMW 259
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
E+++ IF SDNGA E GSN+P RG TL+EG +VP ++
Sbjct: 260 EDTLFIFTSDNGALASES---------GSNWPLRGNIATLFEGATRVPTFFHGSMLKNTG 310
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
V+ ++MHI D T+ AG T +IDG+D W ++ PS R + +D
Sbjct: 311 YVNNELMHIVDLHKTIIELAGAKTE---CDIDGMDLWKTISKGKPSPRTEFVYNIDD--- 364
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
+E AA+R+ +KL+ G + L F
Sbjct: 365 ----------------NELSPGAAIRVGDYKLITGHPD----------------LLYPFR 392
Query: 512 AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS-PCTNGPCYLF 570
+ S + + PIS I P P T + P + YLF
Sbjct: 393 MLNRSDDWYNYGD------EPISGI-----------------PIPPTGNEPPPSNVTYLF 429
Query: 571 NLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
N+ +DP E+NN+A P+ +L L +R LVP + D DP F WSP
Sbjct: 430 NIIDDPEERNNLAEEYPNKVEELRHRLDEYREHLVPPVNRVFD-PAGDPVNFGGVWSP 486
>gi|440894324|gb|ELR46807.1| Arylsulfatase J, partial [Bos grunniens mutus]
Length = 466
Score = 225 bits (573), Expect = 7e-56, Method: Compositional matrix adjust.
Identities = 154/493 (31%), Positives = 236/493 (47%), Gaps = 73/493 (14%)
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+E P RG
Sbjct: 1 RYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRG 60
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQ 257
F++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T+ Q
Sbjct: 61 FDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDYDNGVYSTQMYTQRVQQ 116
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
++ KP+FLY+A+ A H+ L+AP ++ I + NRR YAAM+ LD++
Sbjct: 117 ILASHDPRKPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIVNINRRRYAAMLSCLDEA 171
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+ V AL+ G NSIII+ SDNG PT GSN+P RG K T WEGG+
Sbjct: 172 INNVTLALKMYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGI 221
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTP 436
+ + SP ++ V +++HI+DW PTL + A G + +DG D W ++ +
Sbjct: 222 RAIGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-NIQLDGYDVWETI---SE 277
Query: 437 SRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
R+ +D L + + + I +A+R+ WKL+ G GY
Sbjct: 278 GLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVKHWKLLTGNP------GY 331
Query: 497 YGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAP 556
S+ VP +F SN+ R ++ T+ G +
Sbjct: 332 -----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGKS--- 360
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQ 616
+LFN+ DP E+ ++++ P I QL L +T VP + D +
Sbjct: 361 ----------VWLFNITADPYERVDLSNKYPGIVKQLLRRLSQFNKTAVPVRYPPKD-PR 409
Query: 617 ADPKRFNDTWSPW 629
++P+ W PW
Sbjct: 410 SNPRLNGGVWGPW 422
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 90 LYENDNAAWDYDNGVYSTQMYTQRVQQILASHDPRKPIFLYIAYQAVHS-----PLQAPG 144
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 145 RYFEHYRSIVNINRRRYAAM 164
>gi|291233195|ref|XP_002736539.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 513
Score = 224 bits (571), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 137/408 (33%), Positives = 221/408 (54%), Gaps = 31/408 (7%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GW+D+ +HGS + +P +D LA G+ L N Y QP+C+P+RA LM+G+Y I G+Q
Sbjct: 36 FGWHDIGYHGSI-VRSPYMDFLASEGVKLENYYVQPMCSPTRAQLMSGRYQIRYGLQHLV 94
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I + +P E + + ++E GY+T +GKWHLGF+R+E P+ RGF++ FG+LN +I
Sbjct: 95 IQPDQRACLPPDEVTIAQKMKEAGYATHMVGKWHLGFYRKECLPINRGFDTFFGFLNCLI 154
Query: 211 SYYDHILS-----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
+Y + + ++T+ R + A + GEY+T LF +EA ++I + +
Sbjct: 155 YHYTYDFGYWHTPESGNKTIMFGWDLFRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQE 214
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISA 324
P+FLYL A H L+ P+E ++ I D RR AAMV +D+++ + A
Sbjct: 215 TPMFLYLPFAAVH-----NPLDVPREYEAMYEGIIEDEQRRKKAAMVTCMDEAIWNITKA 269
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L GM +N+++IF +DNG T G+N+P RG K +++EGG++ + S
Sbjct: 270 LMDTGMWDNTVLIFSTDNGGAT---------SVGGNNWPLRGGKRSMFEGGIRGVGFVTS 320
Query: 385 PQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS- 441
P + + R + Q++H++DW PT A G+ S +DG +QW ++ S RN
Sbjct: 321 PLLDEAVRGTENNQLIHVTDWFPTFVHLAEGNLSGTK-PLDGFNQWDTISQGMFSPRNEF 379
Query: 442 --NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
NID ++ ++ L R N D R +R+ WKLV G
Sbjct: 380 LINIDPMEICANPLATNQFRPNPYF---DVCVR-GGIRVGKWKLVTGV 423
Score = 41.2 bits (95), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 41/81 (50%), Gaps = 6/81 (7%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
R + A + GEY+T LF +EA ++I + + P+FLYL A H L+ P+E
Sbjct: 181 FRNHDCVAKEHKGEYSTILFAEEAQRVIWNHDQETPMFLYLPFAAVH-----NPLDVPRE 235
Query: 61 TINQFQ-YITDPNRRTYAALT 80
++ I D RR AA+
Sbjct: 236 YEAMYEGIIEDEQRRKKAAMV 256
>gi|443321854|ref|ZP_21050893.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
gi|442788398|gb|ELR98092.1| arylsulfatase A family protein [Gloeocapsa sp. PCC 73106]
Length = 476
Score = 224 bits (570), Expect = 1e-55, Method: Compositional matrix adjust.
Identities = 132/356 (37%), Positives = 194/356 (54%), Gaps = 26/356 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI T N+D LA +G+ L Y + +CTP+RA+ +TG++P GM +
Sbjct: 46 GWNDVGFHGS-EIKTTNLDKLAVSGVRLERFYVKAMCTPTRAAFLTGRHPFRYGMSAINV 104
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+PL E+ + E L+E GY T +GKWHLG ++ Y P RGF+ H+G+ I
Sbjct: 105 TPWSETGLPLEEKTIAETLKEAGYYTAILGKWHLGHYQESYLPTSRGFDYHYGHYLAGID 164
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFL 270
Y+ H D G D RN + + + Y+TDL ++AVQLI + K PLFL
Sbjct: 165 YFTHKSGD---------GLDWHRNNNPVY--IEGYSTDLIAQDAVQLINNHDYHKNPLFL 213
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+A A H L+A E + + I D RR +AA V+ +D+++G++I +LQ K +
Sbjct: 214 YIAFNAPHIP-----LQAKAEDLEDYLTIEDEQRRLFAAQVQSMDEAIGSIIQSLQAKQV 268
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
N++++F SDNG V + G N P RG K L+EGGV+VP I+ P +
Sbjct: 269 WNNTLLVFTSDNGGEIVA--NDIRFTGRGDNRPLRGGKRNLYEGGVRVPTIISYPGHLSS 326
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ QM I D PT AG ++ IDG+D +L + + +N+N D L
Sbjct: 327 GKTVEQMFSIVDLYPTFAKLAGLKINQ-EQQIDGVD-----ILESIAEKNTNRDEL 376
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 28/69 (40%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
Y+TDL ++AVQLI + K PLFLY+A A H L+A E + + I D R
Sbjct: 188 YSTDLIAQDAVQLINNHDYHKNPLFLYIAFNAPHIP-----LQAKAEDLEDYLTIEDEQR 242
Query: 74 RTYAALTKS 82
R +AA +S
Sbjct: 243 RLFAAQVQS 251
>gi|298712440|emb|CBJ33216.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 726
Score = 223 bits (569), Expect = 2e-55, Method: Compositional matrix adjust.
Identities = 182/597 (30%), Positives = 271/597 (45%), Gaps = 143/597 (23%)
Query: 92 GWNDLSFHGSNEI-PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
GWND+ +H ++ TPN+D L+ +G+ ++ Y+ +CTP+RA+LMTG+YP+ G+Q
Sbjct: 123 GWNDIGYHSTDLANVTPNLDRLSASGVKVSQYYSMSICTPARAALMTGRYPVRYGLQYNV 182
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I P G+PLTE+ LPEY+ E GY + +GKWHLG + +TP RGFE++FGYLN
Sbjct: 183 IQPGAPWGLPLTEKLLPEYMNEAGYESHMVGKWHLGSYTHAHTPHRRGFETYFGYLNDEE 242
Query: 211 SYYDHILSDQYSRTVELNG--------------HDMRRNL-------------------S 237
Y+ H + T +NG +D+ S
Sbjct: 243 MYWTH-----QTWTATINGRKFFDFGFGNATGFYDVIERFDPPPGDDDLVSTGPTSSVYS 297
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQ-PVDK-PLFLYLAHLAAH--AGNAGKHLEAPQE-- 291
++ + G+Y+T++FT A++++ + P D+ PLFLYL+H A H G +P+E
Sbjct: 298 SSLEIKGDYSTEIFTDRALEILSQKTPHDENPLFLYLSHQAVHDPLGVPPADAFSPEEWA 357
Query: 292 --TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVE 348
T + + R +A ++ LD S+G ++ L+ +G LENSII+ SDN G P+
Sbjct: 358 VLTALEDESSDASLRVRFAKVLMYLDKSIGRLVDYLETEGWLENSIIVVASDNGGCPSTG 417
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ---MMHISDWLP 405
GSNYP RG+K++ WEGGVKVPA ++SP + +MH++DWLP
Sbjct: 418 ----------GSNYPLRGLKHSNWEGGVKVPAFVYSPSHIPEEHWGTEYRGLMHVTDWLP 467
Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLL---------------------LNTPSRRNSNID 444
TL AA + ++DG+ W ++ N R +
Sbjct: 468 TLADAANFELLGSHGDLDGVSHWKHIVPAPGDDTATVSESPEAAAAVEDNRLKRARTATA 527
Query: 445 GLD----------QWSSLLLNTPS--RRNSVLINID-------------EKKRT---AAV 476
+D SS L S R VL N D E T A
Sbjct: 528 AVDGGGGGGGGGGGGSSYGLGEVSYGARREVLYNFDPYVLGANITDDIGEADFTEVQGAF 587
Query: 477 RLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNI 536
R WK + N GYY NK LL + E T Q L + SQ
Sbjct: 588 RQGKWKFMF----NSWCSGYYA--FDNKT-LLADDLTDEDNTCQELGECSQ--------- 631
Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
CGAN ++ + +LF+L +DP E NN+A+ P++ L
Sbjct: 632 -------------CGANCGSLS----IDYTDWLFDLEDDPRELNNLANEHPEVVESL 671
>gi|350587859|ref|XP_003129285.2| PREDICTED: arylsulfatase J [Sus scrofa]
Length = 539
Score = 223 bits (568), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 152/492 (30%), Positives = 235/492 (47%), Gaps = 73/492 (14%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF
Sbjct: 75 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 134
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T+ Q+
Sbjct: 135 DTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENENAAWDYDNGIYSTQMYTQRVQQI 190
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ +P+FLY+A+ A H+ L+AP ++ I + NRR YAAM+ LD+++
Sbjct: 191 LASHDPKRPIFLYIAYQAVHS-----PLQAPGRYFEHYRSIININRRRYAAMLSCLDEAI 245
Query: 319 GTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
V AL+ G NSIII+ SDNG PT GSN+P RG K T WEGG++
Sbjct: 246 NNVTLALKMYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKGTYWEGGIR 295
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
+ SP ++ V +++HI+DW PTL + A G + +DG D W ++ +
Sbjct: 296 AVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE-DIQLDGYDVWETI---SEG 351
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
R+ +D L + + + I +A+R+ WKL+ G GY
Sbjct: 352 LRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN------PGY- 404
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
S+ VP +F SN+ R ++ T+ G +
Sbjct: 405 ----SDWVPPQSF-----------------------SNLGPNRWHNERITLSTGKS---- 433
Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
+LFN+ DP E+ ++++ P + QL L +T VP + P ++
Sbjct: 434 ---------VWLFNITADPYERVDLSNRYPGVVKQLLRRLSQFNKTAVPVRYP-PKDPRS 483
Query: 618 DPKRFNDTWSPW 629
+P+ W PW
Sbjct: 484 NPRLNGGVWGPW 495
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 26/80 (32%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ +P+FLY+A+ A H+ L+AP
Sbjct: 163 LYENENAAWDYDNGIYSTQMYTQRVQQILASHDPKRPIFLYIAYQAVHS-----PLQAPG 217
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 218 RYFEHYRSIININRRRYAAM 237
>gi|87306948|ref|ZP_01089094.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
gi|87290321|gb|EAQ82209.1| arylsulfatase B precursor [Blastopirellula marina DSM 3645]
Length = 455
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 146/414 (35%), Positives = 212/414 (51%), Gaps = 54/414 (13%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
T+A + L+ G D+S+ GS I TP +DALA +G L Y QPVC+P+R
Sbjct: 21 TFATDAPRPNIVFLLADDLGGADVSWRGS-PIKTPQLDALANSGAKLEQFYVQPVCSPTR 79
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
++L+TG+YP+ G+Q + G+PL ER L E L++ GY T +GKWHLG Y
Sbjct: 80 SALLTGRYPMRYGLQVGVVRPWADYGLPLDERTLAEALQDAGYETAIVGKWHLGHVSPAY 139
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
P+ RGF+ +G+ NG + Y+ H GHD ++ D YAT L
Sbjct: 140 LPMARGFDHQYGHYNGALDYFTHDRD---------GGHDWHKDDHVNRDE--GYATHLIA 188
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+EAV++I+D+ KPLFLY+ A H+ L+ P+ + + R+ YA MV
Sbjct: 189 QEAVRVIQDRDKKKPLFLYVPFNAVHS-----PLQVPESYAAPYGDMKK-RRQAYAGMVA 242
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
LD++VG ++ +QR+ ML+N++ IF SDNG P E + N P RG K+TL+
Sbjct: 243 ALDEAVGQIVDEIQRQEMLDNTLFIFSSDNGGP-----EPGKLTD---NGPLRGGKHTLY 294
Query: 373 EGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
EGGV+V A W +I +V +HI DW PTL AGG SL
Sbjct: 295 EGGVRVCAFASWKGRIAPGSKVEAP-LHIVDWYPTLIELAGG----------------SL 337
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
P +DG + W S+ PS + ++ NI + A+R+ WKLV+
Sbjct: 338 QQAKP------LDGRNIWPSITTGEPSPHDVIVCNITPTE--GAIRVGDWKLVV 383
>gi|395545568|ref|XP_003774672.1| PREDICTED: arylsulfatase J-like, partial [Sarcophilus harrisii]
Length = 466
Score = 223 bits (567), Expect = 3e-55, Method: Compositional matrix adjust.
Identities = 152/491 (30%), Positives = 236/491 (48%), Gaps = 71/491 (14%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+R+E P RGF
Sbjct: 1 YQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYRKECMPTKRGF 60
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDLFTKEAVQL 258
++ FG L G YY H D + + G+D+ N + AWD G Y+T ++T++ Q+
Sbjct: 61 DTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQMYTQKVQQI 116
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ KP+FLY+A+ A H+ L+AP + ++ I + NRR YAAM+ LD+++
Sbjct: 117 LASHNPRKPIFLYIAYQAVHS-----PLQAPGKYFEHYRSIININRRRYAAMLSCLDEAI 171
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
V AL+R G +NSIII+ SDNG + GSN+P RG K T WEGG++
Sbjct: 172 NNVTLALKRYGFYDNSIIIYSSDNGGQPMAG---------GSNWPLRGSKGTYWEGGIRA 222
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+ SP ++ + +++HI+DW PTL T A G + +DG D W ++ +
Sbjct: 223 VGFVHSPLLKNKGTICKELVHITDWYPTLITLAEGWIDE-DIQLDGYDIWETI---SEGL 278
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
R+ +D L + + + I +A+R+ WKL+ G GY
Sbjct: 279 RSPRVDILHNIDPIYTKAKNGSWAAGFGIWNTAIQSAIRVKHWKLLTGNP------GY-- 330
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
S+ VP +F SN R ++ + G +
Sbjct: 331 ---SDWVPPQSF-----------------------SNAGPNRWHNERISWSAGKS----- 359
Query: 559 PSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQAD 618
+LFN+ DP E+ +++S P I +L L +T VP + D +++
Sbjct: 360 --------VWLFNITADPYERVDLSSKYPGIVKKLLRRLSQFNKTAVPVRYPPKD-PRSN 410
Query: 619 PKRFNDTWSPW 629
P+ W PW
Sbjct: 411 PRLNGGVWGPW 421
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 46/80 (57%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T++ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 89 LYENDNAAWDHDNGIYSTQMYTQKVQQILASHNPRKPIFLYIAYQAVHS-----PLQAPG 143
Query: 60 ETINQFQYITDPNRRTYAAL 79
+ ++ I + NRR YAA+
Sbjct: 144 KYFEHYRSIININRRRYAAM 163
>gi|402871949|ref|XP_003899908.1| PREDICTED: arylsulfatase B-like, partial [Papio anubis]
Length = 316
Score = 222 bits (566), Expect = 4e-55, Method: Compositional matrix adjust.
Identities = 113/250 (45%), Positives = 156/250 (62%), Gaps = 11/250 (4%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS I TP++DALA G++L+N Y QP+CTPSR+ L+TG+Y I TG+Q I
Sbjct: 57 GWNDVGFHGS-RIRTPHLDALAAGGVLLDNYYTQPLCTPSRSQLLTGRYQIRTGLQHQII 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
W +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G
Sbjct: 116 WPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSED 175
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY H L D + V D R A Y+T++FTK A LI + P +KPL
Sbjct: 176 YYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTKRATALITNHPPEKPL 233
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLYLA + H + L+ P+E + + +I D NR YA MV +D++VG V +AL+
Sbjct: 234 FLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHHYAGMVSLMDEAVGNVTAALKSS 288
Query: 329 GMLENSIIIF 338
G+ N++ IF
Sbjct: 289 GLWNNTVFIF 298
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 211 YSTNIFTKRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 265
Query: 75 TYAALT 80
YA +
Sbjct: 266 HYAGMV 271
>gi|406832341|ref|ZP_11091935.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 490
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 149/412 (36%), Positives = 212/412 (51%), Gaps = 64/412 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G DL + GS +I TP+IDALA G+ L + Y P+CTP+RA+LMTG+YP+ G+Q I
Sbjct: 37 GNADLGYRGS-KIKTPHIDALAKGGVRLESYYGLPLCTPARAALMTGRYPMRQGLQTLVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ + G+P E+ LP+ L+E+GY T +GKWHLG +++ P RGF+ +G + G +
Sbjct: 96 FPSHRYGLPTDEKTLPQALKEVGYHTAMVGKWHLGHADKKFWPQNRGFDHFYGNVVGEVD 155
Query: 212 YYDHILSDQYSRTV--ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y+ + + V + NG +R + Y DL EAV+LI KPLF
Sbjct: 156 YF----TRERGGVVDWQRNGEFLRED---------GYYVDLIGTEAVKLIAGHDKAKPLF 202
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-RRTYAAMVKKLDDSVGTVISALQRK 328
LY A LA HA +AP+ I+ + I D RTYA M+ LD VG V++AL+
Sbjct: 203 LYFASLAPHA-----PYQAPKADIDAYNDIFDNEMHRTYAGMISNLDRQVGRVVAALETN 257
Query: 329 GMLENSIIIFMSDNGAPTV----------EYRETSNYRNWG-----SNYPYRGVKNTLWE 373
GM EN+II F +DNG T E RE S G SN P+ G K +L +
Sbjct: 258 GMRENTIIFFTTDNGGATSALFATGARAPEEREASGGVALGSKPPCSNLPFSGGKGSLKD 317
Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
GGV +PAI+ W +++ P V + +H D +PTL AGG
Sbjct: 318 GGVHLPAIVNWPGRLK--PSVVNEPLHHVDLMPTLLALAGGK------------------ 357
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
PS+ DG D W +L PS +LIN++ + ++R WKL+
Sbjct: 358 -GDPSK---PFDGKDAWGTLAEGKPSPHEEILINVEAFR--GSIRKGRWKLI 403
Score = 43.5 bits (101), Expect = 0.31, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 34/66 (51%), Gaps = 6/66 (9%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-R 73
Y DL EAV+LI KPLFLY A LA HA +AP+ I+ + I D
Sbjct: 179 YYVDLIGTEAVKLIAGHDKAKPLFLYFASLAPHA-----PYQAPKADIDAYNDIFDNEMH 233
Query: 74 RTYAAL 79
RTYA +
Sbjct: 234 RTYAGM 239
>gi|424863174|ref|ZP_18287087.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
gi|400757795|gb|EJP72006.1| N-acetylgalactosamine-4-sulfatase [SAR86 cluster bacterium SAR86A]
Length = 519
Score = 222 bits (565), Expect = 5e-55, Method: Compositional matrix adjust.
Identities = 133/351 (37%), Positives = 197/351 (56%), Gaps = 30/351 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP-- 149
GW D+S++G I TPNID LA +G+ +N Y+ P C+P+RA+L TG + G+ P
Sbjct: 31 GWGDVSYNGG-PINTPNIDKLADDGLQMNRFYSAPTCSPTRAALFTGINSLKNGIIRPLN 89
Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P AE G+PL + LPEYL+E+GY T GKWHLG F EY P RGFES +G+L G
Sbjct: 90 NPT--AERYGLPLKHKILPEYLKEIGYQTALSGKWHLGMFSDEYLPRNRGFESTYGHLGG 147
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
I Y+DH LS + D RN ++ Y+T L EA+++IE++ + PL
Sbjct: 148 GIGYFDHALSGRL---------DWHRNGEILYED--GYSTTLIADEAIRIIENKNNETPL 196
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY+A A H ++A ++ IN I+D R YAA + LD +G +I A++ K
Sbjct: 197 FLYVAFNAPHT-----PIQAEEKIINNLSDISDKKERVYAANIITLDREIGKIIDAIESK 251
Query: 329 GMLENSIIIFMSDNGA-----PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
G+L +IIIF SDNG P + G+ +G K + +EGG++VPA+++
Sbjct: 252 GILNETIIIFFSDNGPVFDIDPIAATIAPDILDSKGNTLGLKGSKGSAYEGGIRVPAVIY 311
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
I + +S Q + D LPTL+ A + ++ NI G+++W+ L+ N
Sbjct: 312 YKGILEK-SLSNQFFFVDDILPTLFGALNINPNQ--NNITGVNRWNDLIKN 359
Score = 45.8 bits (107), Expect = 0.072, Method: Compositional matrix adjust.
Identities = 24/64 (37%), Positives = 37/64 (57%), Gaps = 5/64 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T L EA+++IE++ + PLFLY+A A H ++A ++ IN I+D R
Sbjct: 174 YSTTLIADEAIRIIENKNNETPLFLYVAFNAPHT-----PIQAEEKIINNLSDISDKKER 228
Query: 75 TYAA 78
YAA
Sbjct: 229 VYAA 232
>gi|281353470|gb|EFB29054.1| hypothetical protein PANDA_009046 [Ailuropoda melanoleuca]
Length = 431
Score = 222 bits (565), Expect = 6e-55, Method: Compositional matrix adjust.
Identities = 134/367 (36%), Positives = 197/367 (53%), Gaps = 34/367 (9%)
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
+ IHTG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF
Sbjct: 1 FKIHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGF 60
Query: 200 ESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+++FGYL G YY H L D + V D R A Y+T++FT+ A
Sbjct: 61 DTYFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERAT 118
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV +D+
Sbjct: 119 ALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDE 173
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +LWEGG+
Sbjct: 174 AVGNVTAALKSHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGI 224
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLLLNT 435
+ + SP ++Q + +++HISDWLPTL A G T+ PL DG D W ++ +
Sbjct: 225 RGVGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPL--DGFDVWETISEGS 282
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVL 485
PS R + +D + + +P S+ D+ AA+R +WKL+
Sbjct: 283 PSPRMELLHNID--PNFVDISPCPGKSLAPAKDDSSHPEYFSFNTSLHAAIRHRNWKLLT 340
Query: 486 GTQENGT 492
G G
Sbjct: 341 GYPGCGC 347
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 108 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRH 162
Query: 75 TYAALT 80
YA +
Sbjct: 163 YYAGMV 168
>gi|301769831|ref|XP_002920339.1| PREDICTED: arylsulfatase B-like [Ailuropoda melanoleuca]
Length = 519
Score = 221 bits (564), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 134/365 (36%), Positives = 196/365 (53%), Gaps = 34/365 (9%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF++
Sbjct: 91 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 150
Query: 202 HFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+FGYL G YY H L D + V D R A Y+T++FT+ A L
Sbjct: 151 YFGYLLGSEDYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATAL 208
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV +D++V
Sbjct: 209 ITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRHYYAGMVSLMDEAV 263
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ G+ N++ IF +DNG T+ G+N+P RG K +LWEGG++
Sbjct: 264 GNVTAALKSHGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRG 314
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLLLNTPS 437
+ SP ++Q + +++HISDWLPTL A G T+ PL DG D W ++ +PS
Sbjct: 315 VGFVASPLLKQKGVKNRELIHISDWLPTLVKLARGSTNGSKPL--DGFDVWETISEGSPS 372
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGT 487
R + +D + + +P S+ D+ AA+R +WKL+ G
Sbjct: 373 PRMELLHNID--PNFVDISPCPGKSLAPAKDDSSHPEYFSFNTSLHAAIRHRNWKLLTGY 430
Query: 488 QENGT 492
G
Sbjct: 431 PGCGC 435
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 196 YSTNIFTERATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYNFIQDKNRH 250
Query: 75 TYAALT 80
YA +
Sbjct: 251 YYAGMV 256
>gi|443706067|gb|ELU02328.1| hypothetical protein CAPTEDRAFT_179702 [Capitella teleta]
Length = 501
Score = 221 bits (564), Expect = 7e-55, Method: Compositional matrix adjust.
Identities = 160/513 (31%), Positives = 251/513 (48%), Gaps = 59/513 (11%)
Query: 94 NDLSFHGSN----EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
+DL +H ++ TP +D LA G+ N Y C+PSR S M+G+YP + MQ
Sbjct: 30 DDLGYHDIGLRNPDLHTPTLDKLATKGVQFKNNYVMHACSPSRHSFMSGRYPFTSQMQKD 89
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
I+ P PL +FLPEYL+ELGY T A+GKWHLG+ R + P RGF+S +G L+G
Sbjct: 90 VIFPVSPDCSPLKLKFLPEYLKELGYGTHAVGKWHLGYCREDCMPTSRGFDSFYGTLDGE 149
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
Y H+ + Y D N + G+++ DL T +IE Q +KP F
Sbjct: 150 GDYLTHMSAGFY---------DWHTNGTVDRSKSGQHSQDLHTAALADIIERQTEEKPFF 200
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY A A N L+A ++ + ++ Q ++D ++ Y ++ LD V T+ L++
Sbjct: 201 LYFA-----AQNPHTPLQAKKKDMKKYKNQNLSDTLKK-YYGLITGLDGMVKTIKDTLEK 254
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
K +L N+ IIF++DNGA GSN P+RG K+TLWEGGVK + ++SP +
Sbjct: 255 KNLLNNTYIIFLADNGADV----------GIGSNEPFRGTKSTLWEGGVKSLSFIYSPLL 304
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
++ + ++HI+DW+PTL + AGG IDG++Q + + S R + LD
Sbjct: 305 KKTGYENDGLIHITDWMPTLVSLAGGSVPESE-GIDGINQVAMVTDARKSERKDMVYNLD 363
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
S L + + AVR + +KL+ G G DG YG T + +
Sbjct: 364 MESQFGLPSSGQ--------------IAVRDERYKLLWGFP--GQNDG-YGLTTTYLAKV 406
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
F I +S T + ++ + P + M + T+ + G
Sbjct: 407 EEFKVIYKSLT--TREKRAGRFTYPEDDQKVMNRLLAELTV-------SVDDIERGTGFM 457
Query: 568 YLFNLGNDPCEQNNIASS-RPDISSQLYELLKY 599
LF+L DP E ++++S P+ L++L+ +
Sbjct: 458 RLFDLIKDPYETYDLSTSDEPEHQEALHKLITF 490
>gi|355669602|gb|AER94582.1| arylsulfatase B [Mustela putorius furo]
Length = 418
Score = 221 bits (564), Expect = 8e-55, Method: Compositional matrix adjust.
Identities = 154/491 (31%), Positives = 234/491 (47%), Gaps = 86/491 (17%)
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW +P VPL E+ LP+ L+E GY+T +GKWHLG FR+E P RGF+++FGYL G
Sbjct: 1 IWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMFRKECLPTRRGFDTYFGYLLGSE 60
Query: 211 SYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
YY H L D + V D R A Y+T++FT+ A LI + P +KP
Sbjct: 61 DYYSHERCTLID--ALNVTRCALDFRDGEEVATGYKNMYSTNIFTERATALIANHPPEKP 118
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFLYLA + H + L+ P+E + +++I D NR YA MV +D++VG V +AL+
Sbjct: 119 LFLYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRHHYAGMVSLMDEAVGNVTAALKS 173
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ N++ IF +DNG T+ G+N+P RG K +LWEGG++ + SP +
Sbjct: 174 HGLWNNTVFIFSTDNGGQTLAG---------GNNWPLRGRKWSLWEGGIRGVGFVASPLL 224
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+Q + +++HISDWLPTL T AGG T+ +DG D W ++ +PS R + +D
Sbjct: 225 KQKGIKNRELIHISDWLPTLVTLAGGSTNGTK-PLDGFDVWKTISEGSPSPRMELLHNID 283
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRT----------AAVRLDSWKLVLGTQENGTMDGYY 497
+ + +P S+ D+ AA+R +WKL+ G G
Sbjct: 284 --PNFVDISPCPGMSLAPPKDDSSGPEYFAFNTSLHAAIRHRNWKLLTGYPGCGCW---- 337
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPM 557
F P S + A++ ++P
Sbjct: 338 --------------------------------FPPPS--------QNNASVIPSSDPPTK 357
Query: 558 TPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQA 617
T +LF++ DP E+++++ P I QL L+++ + VP D +
Sbjct: 358 T--------VWLFDIDQDPEERHDLSRDHPHIVKQLLSRLQFYHKHSVPVHFPAQD-PRC 408
Query: 618 DPKRFNDTWSP 628
DPK W P
Sbjct: 409 DPKD-TGAWGP 418
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 41/66 (62%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + +++I D NR
Sbjct: 97 YSTNIFTERATALIANHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYKFIQDKNRH 151
Query: 75 TYAALT 80
YA +
Sbjct: 152 HYAGMV 157
>gi|449680632|ref|XP_004209636.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 336
Score = 221 bits (562), Expect = 1e-54, Method: Compositional matrix adjust.
Identities = 136/403 (33%), Positives = 207/403 (51%), Gaps = 73/403 (18%)
Query: 232 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 290
M +N ++ G Y+T +++ EA+ I + +P+FLYLA+ A H+ N ++ L+APQ
Sbjct: 1 MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
E +++F +I R+ YAAM+ +D +G + S L ++ ML++SIIIF SDNG P
Sbjct: 61 EWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSELLKQNMLDDSIIIFTSDNGGPANGLN 120
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
NW +NYP RG K T++EGGV+ A +WS ++ +NPRVS +MHI+DWLPTL +A
Sbjct: 121 N-----NWATNYPLRGTKTTVYEGGVRGAACIWSKELSKNPRVSNDLMHITDWLPTLLSA 175
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE- 469
AG DT+ LP N+DG L+ W +L+ PS R VLINID
Sbjct: 176 AGVDTTLLP-NLDG---------------------LNLWDTLINQVPSPRTEVLINIDPL 213
Query: 470 KKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI 529
AA+R+ WK+V ++N DG+Y ++S I
Sbjct: 214 LYMNAALRVGDWKIV---KQNNFYDGWYPPP-----------------------EISNEI 247
Query: 530 FLPISNIDKMRSTRQQATIHCGANPAPMTPSPC--TNGPCYLFNLGNDPCEQNNIASSRP 587
++ +HC P P+ P C +G LFN+ DPCE ++++S P
Sbjct: 248 --------------EEYDVHCS--PPPLLPRSCDKKDGKLCLFNIKYDPCEYIDLSASNP 291
Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
I + E L ++ +++V ADPK +N W W+
Sbjct: 292 HIYQFMLERLNFYNKSMVESRRNTFRDPLADPKLYNGVWKSWV 334
Score = 59.7 bits (143), Expect = 5e-06, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 50/80 (62%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 59
M +N ++ G Y+T +++ EA+ I + +P+FLYLA+ A H+ N ++ L+APQ
Sbjct: 1 MAQNQENIFNQWGNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQ 60
Query: 60 ETINQFQYITDPNRRTYAAL 79
E +++F +I R+ YAA+
Sbjct: 61 EWVDKFSHIKHKGRQNYAAM 80
>gi|326437895|gb|EGD83465.1| hypothetical protein PTSG_04073 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 173/559 (30%), Positives = 253/559 (45%), Gaps = 98/559 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--P 149
G++D+ F S++I TPNID +AS+++G+Y +H G+ P
Sbjct: 46 GFDDVGFK-SHQIKTPNID---------------------QASILSGRYAMHHGIVNWIP 83
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
P + G+PL LP+ L+ GY T AIGKWHLGF++ +YTP +RGF S GY +G
Sbjct: 84 P---KDSYGLPLNHTTLPQLLKNGGYDTHAIGKWHLGFYKWDYTPTFRGFNSFLGYYSGG 140
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+Y+ H Y + + A+D G+Y+T +F+ EAV++I+D KPLF
Sbjct: 141 ENYFTHKNGPAYDMHRDPLPSCGQNCSQIAFDLQGQYSTTIFSDEAVRVIDDHIGPKPLF 200
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
LYLA+ A H + +APQ I+ + I D RRT+A M+ LD+ +G V AL+R
Sbjct: 201 LYLAYQAVH-----EPAQAPQSYIDPYTDLIPDAQRRTFAGMLSCLDEGIGNVTRALKRN 255
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ-- 386
G N++I+F +DNG P + NYP RG K+ ++EGGV+ A + PQ
Sbjct: 256 GFYNNTLIVFTTDNGGPILGGDSVG-----ARNYPMRGGKHAVYEGGVRGIAFVHPPQNT 310
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ +MH +DWLPTL AAG + ++DG+ W ++ NT
Sbjct: 311 FVETGAGYHNLMHAADWLPTLCDAAGVE-CHTQQSLDGVSHWDAMTTNT----------- 358
Query: 447 DQWSSLLLNTPSRRNSVLIN----IDEKKRTAAVRLDSWKLVLGT------------QEN 490
N+ R + VL N D A R WK + GT Q
Sbjct: 359 --------NSTLRGDVVLGNATDLCDATNCGFAYRKGDWKFIRGTGGMPFSWSTPMNQSG 410
Query: 491 GTMDGYYGQ----------TRSNKVPLLNFNAIVESKTYQSL--QQLSQNIFLPISNIDK 538
G + YGQ T + TY + Q S N L S
Sbjct: 411 GNCNLAYGQCVDGASLGRVTAATAADCCYTCTNTTKCTYSTFLANQTSNNCLLFAS---- 466
Query: 539 MRSTRQQATIHC--GANPAPMTPSPCTNGPCY-LFNLGNDPCEQNNIASSRPDISSQLYE 595
ST +C GA+ P N Y LFN+ DP E ++ASS D++ Q+
Sbjct: 467 --STASTLDTNCISGAHDPKQVPYTTLNPLGYQLFNIAQDPNEHTDVASSHADVAKQMNS 524
Query: 596 LLKYHRRTLVPQSHEQPDL 614
L ++ V Q+ P+
Sbjct: 525 ELDSILQSYV-QAQNDPNC 542
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 31/73 (42%), Positives = 46/73 (63%), Gaps = 6/73 (8%)
Query: 8 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-Q 66
A+D G+Y+T +F+ EAV++I+D KPLFLYLA+ A H + +APQ I+ +
Sbjct: 170 AFDLQGQYSTTIFSDEAVRVIDDHIGPKPLFLYLAYQAVH-----EPAQAPQSYIDPYTD 224
Query: 67 YITDPNRRTYAAL 79
I D RRT+A +
Sbjct: 225 LIPDAQRRTFAGM 237
>gi|443706557|gb|ELU02545.1| hypothetical protein CAPTEDRAFT_109345 [Capitella teleta]
Length = 370
Score = 220 bits (561), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 132/366 (36%), Positives = 197/366 (53%), Gaps = 26/366 (7%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
L + + G++DL F + I TPNIDALA G+I N Y Q VCTPSR +L+TG+YP +
Sbjct: 3 LKPVFIPGYHDLGFRNPDVI-TPNIDALATEGVIFTNNYVQSVCTPSRHALLTGRYPHRS 61
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
MQ I + R L +FLPEYL++LGYST A+GKWH+G+ R E P +RGF+S FG
Sbjct: 62 AMQNLVIMSNQARCTGLGYKFLPEYLKDLGYSTHAVGKWHVGYCREECLPTHRGFDSFFG 121
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+G Y++H + TV D + + G ++ DL + +++ Q
Sbjct: 122 LYDGDGYYWNH------TSTVIPGAFDWNNSTGVYLEARGIHSEDLGAERLTAILDGQNA 175
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQ-ETINQFQYITDPN-RRTYAAMVKKLDDSVGTVI 322
+P FLY + H + PQ E +N + P+ RR Y +V LD VG ++
Sbjct: 176 KEPFFLYFSPQNPHTPS------QPQAEFLNLYPENRYPDIRRKYLGLVSGLDRMVGGIV 229
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
L+R G++ N+ IIF+SDNGA E G N P+RG K +L+EGG K + +
Sbjct: 230 DGLKRNGLMNNTYIIFVSDNGADPEE----------GLNDPFRGGKGSLFEGGTKSASFI 279
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
+SP + + + +MHI+DW+PT+ AGG +DG+DQ + S+R
Sbjct: 280 YSPLLNKAGYENNGLMHITDWMPTILKLAGGSIPEGE-QLDGIDQSDMVKNGKASQRKKM 338
Query: 443 IDGLDQ 448
+ +D+
Sbjct: 339 VYSIDR 344
>gi|291230656|ref|XP_002735281.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 522
Score = 220 bits (560), Expect = 2e-54, Method: Compositional matrix adjust.
Identities = 149/435 (34%), Positives = 215/435 (49%), Gaps = 70/435 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +HGS + TP IDALA G+ L N Y +CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 39 GWDDVGYHGS-VMKTPYIDALAAEGVTLENYYMPSLCTPSRSVLLTGRYEIHTGLQHGTI 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL E LP+ L+E GY T +GKWHLGF+R+E P RGF+S G+ +
Sbjct: 98 LMMQPLCLPLDEITLPQKLKEEGYDTHMVGKWHLGFYRKECLPNNRGFDSFLGFYQAMGD 157
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFTKEAVQ------------- 257
++ H +S + NG D RRN D G+Y+T +F +
Sbjct: 158 HFYHNIS---ASPGHFNGFDFRRNNDVVADQYAGKYSTHIFXXXFINTQTLSFVCVNNVK 214
Query: 258 -------LIEDQPV------DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT--DP 302
L P+ +PLFLYL+ A H L+ P + + D
Sbjct: 215 GVYFRGYLSSFTPITSSYNPQQPLFLYLSFQAVHT-----PLQVPSRYAELYNDLIPNDE 269
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
+RR YA MV +D++VG V + L++ + ENS++IF +
Sbjct: 270 DRRIYAGMVTCMDEAVGYVTTVLKQSALWENSVVIFSTX--------------------- 308
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPL 420
RG K TL+EGGV+ + SP + + R ++ +++HISDW PTL AGGD
Sbjct: 309 -LRGGKGTLYEGGVRGVGFVTSPLLHEQVRGTINHELIHISDWFPTLTRLAGGDLMG-GG 366
Query: 421 NIDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
+DG +QW ++ TPS R NID L L + P S N+ + AA+R
Sbjct: 367 RLDGYNQWETISQGTPSLRTEILLNIDPLYTAVRNLRDDP-WAESTCFNVSMR---AAIR 422
Query: 478 LDSWKLVLGTQENGT 492
+ WKL+ G NG+
Sbjct: 423 VGDWKLITGGAGNGS 437
Score = 52.8 bits (125), Expect = 5e-04, Method: Compositional matrix adjust.
Identities = 23/61 (37%), Positives = 37/61 (60%)
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
L+N+ DP E+ + + RPDI+ +L + L ++ RT VP + Q ++ ADP + WSP
Sbjct: 461 LYNVHVDPYERMDYSHERPDIAEELLQRLGHYYRTSVPPNFPQQEVRSADPANYGGVWSP 520
Query: 629 W 629
W
Sbjct: 521 W 521
>gi|348564635|ref|XP_003468110.1| PREDICTED: arylsulfatase J-like [Cavia porcellus]
Length = 543
Score = 219 bits (559), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 155/500 (31%), Positives = 238/500 (47%), Gaps = 74/500 (14%)
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
R L++ Y IHTG+Q I +P +PL LP+ L+E+GYST +GKWHLGF+++E
Sbjct: 71 RCGLLS-LYQIHTGLQHSIIRPTQPNCLPLDNATLPQKLKEVGYSTHMVGKWHLGFYKKE 129
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD-TVGEYATDL 250
P RGF++ FG L G YY H D + + G+D+ N + AWD G Y+T +
Sbjct: 130 CMPTKRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAAWDHDNGIYSTQM 185
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+T+ Q++ KP+FLY+A+ A H+ L+AP ++ I + NRR YAAM
Sbjct: 186 YTQRVQQILASHNPTKPIFLYIAYQAVHSP-----LQAPGRYFEHYRSIININRRRYAAM 240
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKN 369
+ LD+++ V AL+ G NSIII+ SDNG PT GSN+P RG K
Sbjct: 241 LSCLDEAIHNVTLALKAYGFYNNSIIIYSSDNGGQPTAG----------GSNWPLRGSKG 290
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
T WEGG++ + SP ++ V +++HI+DW PTL + A G +DG D W
Sbjct: 291 TYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDEH-TQLDGYDVWE 349
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
++ + R+ +D L + + + I +A+R+ WKL+ G
Sbjct: 350 TI---SEGFRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIRVQHWKLLTGN-- 404
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
GY S+ VP F SN+ R ++ T+
Sbjct: 405 ----PGY-----SDWVPPQAF-----------------------SNLGPNRWHNERITLS 432
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
G + +LFN+ DP E+ +++S P I +L + L +T VP +
Sbjct: 433 TGKS-------------IWLFNITADPYERVDLSSRYPGIVKKLLKRLSQFNKTAVPVRY 479
Query: 610 EQPDLVQADPKRFNDTWSPW 629
D +++P+ W PW
Sbjct: 480 PPKD-PRSNPRLNGGVWGPW 498
Score = 49.3 bits (116), Expect = 0.006, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 166 LYENDNAAWDHDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHSP-----LQAPG 220
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 221 RYFEHYRSIININRRRYAAM 240
>gi|260819479|ref|XP_002605064.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
gi|229290394|gb|EEN61074.1| hypothetical protein BRAFLDRAFT_124136 [Branchiostoma floridae]
Length = 485
Score = 219 bits (558), Expect = 3e-54, Method: Compositional matrix adjust.
Identities = 152/539 (28%), Positives = 241/539 (44%), Gaps = 97/539 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + ++ TP +D LA G+ILN Y VCTPSR + MTG +P H G Q
Sbjct: 39 GWNDVGWH-NPDVKTPVLDQLANEGVILNQCYVNYVCTPSRTAFMTGYFPYHVGSQHTVF 97
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ + +G+P FLPE L++LGY+T +GK
Sbjct: 98 FPQQAQGIPSKFPFLPEKLKQLGYATHMVGK----------------------------- 128
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+ D+ + S G+Y T FT V ++E P D PLF+Y
Sbjct: 129 ----------------DDKDVVKTRS------GQYGTYFFTDRVVDIVEKHPADTPLFMY 166
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L + H + L+ PQ + + + D NRR MV LD++VG V A+++ G+
Sbjct: 167 LPYQNVH-----QPLQVPQRFEDMYASVQDENRRKLLGMVSALDEAVGNVTMAMKKAGLW 221
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+++IF +DNG + S G+NYP RG K TLWEGG + A +Q+
Sbjct: 222 DNTLVIFTTDNGG----WISAS-----GNNYPLRGGKVTLWEGGTRGVAFAHGKMLQKTG 272
Query: 392 RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
+ +M+H DW PT+ AAGG + IDG+ Q +
Sbjct: 273 YTNNEMIHAVDWFPTILAAAGGTA------------------------DEAIDGVSQLDT 308
Query: 452 LLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
++ PS R + NID++ + A+R+ +KL+ G G DG+ L +
Sbjct: 309 IMSGQPSPRTEFVYNIDDEFPTKQGAIRVGDYKLIEGYA--GKPDGWILPPNLTGSQLDS 366
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYL 569
+ ++ + + I + N+ + + + P + P +L
Sbjct: 367 GDPVIGTWLFNLKVSGDPVIGTWLFNLKAGQRRPRHRDLALQPESQPDSGDPVIG--TWL 424
Query: 570 FNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
FNL +DP E +N+A S PD ++ L +R++LVP PD +++P + WSP
Sbjct: 425 FNLKDDPTEHHNLADSMPDKLREMQAKLAEYRKSLVPAIFPAPD-PKSNPDNWGGAWSP 482
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 22/70 (31%), Positives = 34/70 (48%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G+Y T FT V ++E P D PLF+YL + H + L+ PQ + + + D N
Sbjct: 139 GQYGTYFFTDRVVDIVEKHPADTPLFMYLPYQNVH-----QPLQVPQRFEDMYASVQDEN 193
Query: 73 RRTYAALTKS 82
RR + +
Sbjct: 194 RRKLLGMVSA 203
>gi|241680395|ref|XP_002412684.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215506486|gb|EEC15980.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 461
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 141/417 (33%), Positives = 203/417 (48%), Gaps = 50/417 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+SFHGS +IPTPNID LA +G+ L++ Y Q +R++LMTG YPIHTG++
Sbjct: 20 GWGDVSFHGSRQIPTPNIDTLAGDGLALHHYYVQSSGAATRSALMTGLYPIHTGIKHASH 79
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
R +PL+ + ++ + H+GF + + + + NG S
Sbjct: 80 ICISARLIPLSGGIVRCHIVLTIKYAFMVNIPHIGFLFEHISKGAHAVNASW-WTNGYWS 138
Query: 212 Y-YDHILSDQYSRTVEL----NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
+ H Q E+ N R S W + +
Sbjct: 139 KRFIHAFFLQLFMNAEIFFFFNQWPCREMRSPKWKIAPLHNCSV---------------- 182
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PL LYLAH A H GNA LEAP + +F+YI D RR +A MV LD SVG V ALQ
Sbjct: 183 PLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAALDQSVGDVFRALQ 242
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
RKG+LE++I++ SD+G + GSN+P RG K TLWEGGV+ +LWS +
Sbjct: 243 RKGILEDTIVLLTSDSGGAAGGVEQGV-----GSNWPLRGTKKTLWEGGVRAVGLLWSAK 297
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ + RV+ Q++H S + P + GD + L +IDG
Sbjct: 298 LARG-RVASQLIHRSAYFPLTNADSCGDITDL----------------------GSIDGK 334
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
D W+ L+ + PS R +L NID + + AA+R+ +KLVLG + D +Y S+
Sbjct: 335 DLWTVLVDDRPSPRTELLYNIDYRWKLAAIRVGPYKLVLGASSSDQFDDWYEPIESD 391
Score = 52.0 bits (123), Expect = 0.001, Method: Compositional matrix adjust.
Identities = 23/47 (48%), Positives = 29/47 (61%)
Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
PL LYLAH A H GNA LEAP + +F+YI D RR +A + +
Sbjct: 183 PLLLYLAHQAVHVGNAYHPLEAPSSYVKKFRYIKDLRRRIFAGMVAA 229
>gi|198434445|ref|XP_002131042.1| PREDICTED: similar to sulfatase 1 [Ciona intestinalis]
Length = 512
Score = 217 bits (552), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 174/571 (30%), Positives = 262/571 (45%), Gaps = 98/571 (17%)
Query: 59 QETINQFQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNE---IPTPNIDALA 113
+ TI+ F I T A TK + L++ G+ND+ + G N TP +D+LA
Sbjct: 7 KSTISSFLLICVVG--TVAEATKKPHVVLILADDLGYNDVGYWGQNHGSAAKTPFLDSLA 64
Query: 114 YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLREL 173
NG+ L N Y VC+P+R +LMTG+ I G+ I + G+PL LPE L
Sbjct: 65 ENGVRLENYYTHSVCSPTRGALMTGRNRIDIGLAHGIIHTTQIEGLPLDNVLLPEQLSNC 124
Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR 233
GY+T+ IGKWHLGF +Y P RGF +G+L G +Y+ L +R + G D
Sbjct: 125 GYNTQMIGKWHLGFSSSKYAPWNRGFHGFYGFLAGSENYWSKWLP--MARHSNIGGVDFT 182
Query: 234 RNLSTAW-DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQET 292
+ + +T G+Y+ ++ A +I+ KPLFLYL H L AP
Sbjct: 183 DSTTGPTNETWGQYSAHVYASRARYVIQHHDQSKPLFLYLPLQTPHT-----PLGAPSHY 237
Query: 293 INQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRET 352
F+ I D +R Y +MV LD++V V + L+ GM E++++IF +DNG
Sbjct: 238 YEPFKDIEDDDRMKYLSMVSVLDETVRNVTNYLKDAGMWEDTLLIFSTDNGGEV------ 291
Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP---QIQQNPRVSLQMMHISDWLPTLYT 409
++ G+N+PYRG KNTL+EGGVK + + RV+ +++H++DW PT+
Sbjct: 292 ---KHGGNNWPYRGTKNTLYEGGVKAVGFVHGKILGDLSPAQRVNRELIHVTDWYPTIMA 348
Query: 410 AAGGDTSRLPLN----IDGLDQWSSLLLNTPSRRNS---NIDGLDQWSSLLLNTPSRRNS 462
AA D + N + GL+QW +L S+R ID +++ S + P R
Sbjct: 349 AA--DCPYVSGNSRNKLYGLNQWGALRGERGSQRTEIVHTIDPMNRTSEI----PDMRFG 402
Query: 463 VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
IN+ AA+ + WKL+ G +G G+Y A ESK
Sbjct: 403 FNINVK-----AAICVRKWKLITGNPGHG---GWY--------------APPESK----- 435
Query: 523 QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI 582
+I TR I LF+L DP E+N I
Sbjct: 436 ------------DIFTKSGTRGSKNIQ-------------------LFDLSVDPLEENEI 464
Query: 583 ASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
++ P + S+L E L + + P ++ D
Sbjct: 465 SNEHPTVVSELLERLAKYSLSAPPAINQNTD 495
>gi|313236221|emb|CBY11544.1| unnamed protein product [Oikopleura dioica]
Length = 511
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 160/573 (27%), Positives = 261/573 (45%), Gaps = 93/573 (16%)
Query: 62 INQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN 121
+ + YI + +A + + + G +D+S HG+ +I TPN+D LA +G++LNN
Sbjct: 1 MKKLLYIFGLSANIFAEQQPNIVVIVADDLGKHDVSMHGA-DIYTPNLDMLARDGVLLNN 59
Query: 122 MYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
Y QPVC+P+R SLMTG+YP G+Q + G P G+PL E +P+Y++E GY T +G
Sbjct: 60 YYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
KW LGFF+ Y P RGF+ FG L G YY + L G+D+R D
Sbjct: 120 KWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRD 179
Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YI 299
T +Y L+ +A + PL++Y+A + H + L+APQ+ ++ +
Sbjct: 180 TSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQ 234
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+R +A MV LD +VG ++ L+ G+ N++I F +DNG R
Sbjct: 235 MSNGQRVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG------------REDD 282
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSR 417
S++ +G KN W+GG+ P I + R + H+SD LPT+ +AA T
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHT-- 340
Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
LP + IDG D WS L ++ + DEK++ ++
Sbjct: 341 LPPKV--------------------IDGHDLWSRLQGDS--------LGQDEKRKEIIMQ 372
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPL-LNFNAIVESKTYQSLQQLSQNIFLPISNI 536
++ L+ +E+ TR+ V + ++ IV S + +
Sbjct: 373 MNP--LLRLWEEDTRFQNSTWDTRTQGVIIDGDYKLIVGSHS-----------------V 413
Query: 537 DKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYEL 596
+ +++ Q +LFN+ NDP E N+++ S+P+++ +L
Sbjct: 414 LRRKASDYQTV--------------------WLFNIKNDPQEANDLSGSKPNLTHKLLHR 453
Query: 597 LKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
L + H PD +DP D W P+
Sbjct: 454 LYKEMDNVRDPWHPLPDPA-SDPALHGDFWGPY 485
>gi|443705042|gb|ELU01787.1| hypothetical protein CAPTEDRAFT_153777 [Capitella teleta]
Length = 551
Score = 216 bits (551), Expect = 2e-53, Method: Compositional matrix adjust.
Identities = 164/544 (30%), Positives = 254/544 (46%), Gaps = 90/544 (16%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ND+ F N I +PN+DALA +GIIL N Y P C+PSR S M+G+Y + MQ
Sbjct: 36 YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFMSGRYSYKSAMQHGV 94
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I +P+ + L LP YL+ELGY T A GKWHLG+ R E TP +RGF+S G +G
Sbjct: 95 ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 154
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y+H T +D +D +G+++ DL + +++ PLF+
Sbjct: 155 EFYEH-------TTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 207
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISAL 325
Y+A H +P + +F + D +R+ Y +V +D +G ++ +
Sbjct: 208 YVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDKI 258
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ G+ EN+ I+F SDNG E G N P+RG K+TLWEGG K + SP
Sbjct: 259 KEIGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSP 308
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ + +MH++DWLPT+ AGG +P + DGL DG
Sbjct: 309 LLGATGVENNGLMHVTDWLPTIVDLAGG---SVPAS-DGL------------------DG 346
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYG 498
+ Q ++ N S R +++ NID + A R +KL+ G G DGY
Sbjct: 347 VVQTDMVINNGDSARTTMIYNIDRESDIGQPKFGEMACRNLQYKLIWGFP--GQQDGYGA 404
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR------STRQQATIHCGA 552
+ I + Y Q+ +Q + DK R + Q + ++
Sbjct: 405 DAK----------YIYDIPFY---QEAAQAYIDAEGSTDKKRGGPYVLTADQTSVLNNYM 451
Query: 553 NPAPMTPSPCTN--GPCYLFNLGNDPCEQNNIA-SSRPDISSQLYELLKYHRRTL----- 604
+ ++ + G LF+L NDP E +++ S+ PD+ + EL+K + +
Sbjct: 452 SSVHVSDQELADGTGSMLLFDLINDPNETTDLSESTDPDVVAAKAELIKMIQDAVNDGGY 511
Query: 605 VPQS 608
VPQS
Sbjct: 512 VPQS 515
>gi|426233825|ref|XP_004023235.1| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like [Ovis aries]
Length = 475
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 132/391 (33%), Positives = 202/391 (51%), Gaps = 46/391 (11%)
Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFL 166
PN+ +L YN ++ + IHTG+Q IW +P +PL E+ L
Sbjct: 30 PNLGSLVYN-------------VEEQSCPFFRNFQIHTGLQHQIIWPCQPSCIPLDEKLL 76
Query: 167 PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY-SRTV 225
P+ L+E GY+T +GKWHLG +R+E P RGF+++FGYL G YY H + V
Sbjct: 77 PQLLKEAGYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYSHERCTVIDALNV 136
Query: 226 ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
D R A Y+T++FT+ A LI P +KPLFLYLA + H +
Sbjct: 137 TRCALDFRDGEEVATGYKNMYSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EP 191
Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
L+ P+E + + +I D NRR YA M +D++VG V +AL+ +G+ N++ IF +DNG
Sbjct: 192 LQVPEEYLKPYDFIQDKNRRHYAGMASLMDEAVGNVTAALESRGLWNNTVFIFSTDNGGQ 251
Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
T+ G+N+P RG K +LWEGGV+ + SP +++ + +++HISDWLP
Sbjct: 252 TLAG---------GNNWPLRGRKWSLWEGGVRGVGFVASPLLKRKGVKTRELIHISDWLP 302
Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI 465
L G D ++ +DG D W+++ +PS R + +D + P NS+ +
Sbjct: 303 PL---XGTDGTK---PLDGFDVWNTISEGSPSPRMELLHNIDPDFVDISLCPG--NSMAL 354
Query: 466 NIDEKKRT----------AAVRLDSWKLVLG 486
D+ AA+R +WKL+ G
Sbjct: 355 AKDDSSLLEYSAFNTSIHAAIRHQNWKLLTG 385
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI P +KPLFLYLA + H + L+ P+E + + +I D NRR
Sbjct: 157 YSTNVFTERATTLITTHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRR 211
Query: 75 TYAALT 80
YA +
Sbjct: 212 HYAGMA 217
>gi|443691100|gb|ELT93060.1| hypothetical protein CAPTEDRAFT_21969 [Capitella teleta]
Length = 529
Score = 215 bits (548), Expect = 5e-53, Method: Compositional matrix adjust.
Identities = 164/544 (30%), Positives = 254/544 (46%), Gaps = 90/544 (16%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ND+ F N I +PN+DALA +GIIL N Y P C+PSR S ++G+Y + MQ
Sbjct: 14 YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGV 72
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I +P+ + L LP YL+ELGY T A GKWHLG+ R E TP +RGF+S G +G
Sbjct: 73 ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 132
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y+H T +D +D +G+++ DL + +++ PLF+
Sbjct: 133 EFYEH-------TTATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 185
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISAL 325
Y+A H +P + +F + D +R+ Y +V +D +G ++ +
Sbjct: 186 YVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRKKYLGLVSGMDYIIGGIVDKI 236
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ G+ EN+ I+F SDNG E G N P+RG K+TLWEGG K + SP
Sbjct: 237 KEIGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSP 286
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ + +MH++DWLPT+ AGG +P + DGL DG
Sbjct: 287 LLGATGVENNGLMHVTDWLPTIVDLAGG---SVPAS-DGL------------------DG 324
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYG 498
+ Q ++ N S R +++ NID + A R +KL+ G G DGY
Sbjct: 325 VVQTDMVINNGDSARTTMIYNIDRESDIGQPKFGEMACRNLQYKLIWGFP--GQQDGYGA 382
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR------STRQQATIHCGA 552
+ I + Y Q+ +Q + DK R + Q + ++
Sbjct: 383 DAK----------YIYDIPFY---QEAAQAYIDAEGSTDKKRGGPYVLTADQTSVLNNYM 429
Query: 553 NPAPMTPSPCTN--GPCYLFNLGNDPCEQNNIA-SSRPDISSQLYELLKYHRRTL----- 604
+ ++ + G LF+L NDP E +++ S+ PDI + EL+K + +
Sbjct: 430 SSVHVSDQELADGTGSMLLFDLINDPNETTDLSESTDPDIVAAKAELIKMIQDAVNDGGY 489
Query: 605 VPQS 608
VPQS
Sbjct: 490 VPQS 493
>gi|432963754|ref|XP_004086820.1| PREDICTED: uncharacterized protein LOC101172478 [Oryzias latipes]
Length = 873
Score = 215 bits (548), Expect = 6e-53, Method: Compositional matrix adjust.
Identities = 154/498 (30%), Positives = 234/498 (46%), Gaps = 71/498 (14%)
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
S+ Y IHTG+Q I +P +PL LP+ LR+ GYST +GKWHLGF++R
Sbjct: 415 SVFHSGYQIHTGLQHSVIRATQPNCLPLENVTLPQKLRDAGYSTHMVGKWHLGFYKRGCL 474
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-VGEYATDLFT 252
P RGF+S FG L G YY H + + G+D+ AW+ G Y+T +FT
Sbjct: 475 PTQRGFDSFFGSLLGSGDYYSHYKCE----VPGMCGYDLYEGEEAAWEQDRGLYSTVMFT 530
Query: 253 KEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
++AV ++ P PLFLYLA+ A H+ L+ P + +++ I++ +RR YAAMV
Sbjct: 531 RKAVSILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERYKGISNLSRRKYAAMV 585
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
LD+++ + AL+R G +N++I++ SDNG + GSN+P RG K T
Sbjct: 586 SCLDEAIHNLTLALKRYGYYDNTVIVYSSDNGGQPLAG---------GSNWPLRGSKATY 636
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGG++ + SP + + ++HI+DW PTL T G T LN+DG D W ++
Sbjct: 637 WEGGIRAVGFVHSPLLMKKGTKCRSLIHITDWFPTLVTLGEG-TLDEDLNLDGYDVWEAI 695
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
PS R +D L + + + + AA+R+ WKL+ G
Sbjct: 696 SEGRPSPR---LDILHNIDPIYIKAKNGSWKAGFGLWNTAVQAAIRVGHWKLLTG----- 747
Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG 551
+ GY S+ VP F S Q+L+ D +S
Sbjct: 748 -VPGY-----SDWVPPQTF----------SKQRLTNRWHNERVRWDGGKS---------- 781
Query: 552 ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQ 611
+LFN+ DP E+ +++ P I + L + RT VP +
Sbjct: 782 ---------------LWLFNITADPYERVDLSQHYPHIVKTMLLRLAQYNRTAVPVRYPA 826
Query: 612 PDLVQADPKRFNDTWSPW 629
DL +++P+ W PW
Sbjct: 827 KDL-RSNPQYNGGVWGPW 843
Score = 47.0 bits (110), Expect = 0.033, Method: Compositional matrix adjust.
Identities = 34/96 (35%), Positives = 53/96 (55%), Gaps = 13/96 (13%)
Query: 8 AWDT-VGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
AW+ G Y+T +FT++AV ++ P PLFLYLA+ A H+ L+ P + ++
Sbjct: 516 AWEQDRGLYSTVMFTRKAVSILAHHNPRAHPLFLYLAYQAVHS-----PLQVPARYLERY 570
Query: 66 QYITDPNRRTYAALTKST-----TLTL-LIVYGWND 95
+ I++ +RR YAA+ LTL L YG+ D
Sbjct: 571 KGISNLSRRKYAAMVSCLDEAIHNLTLALKRYGYYD 606
>gi|251798133|ref|YP_003012864.1| sulfatase [Paenibacillus sp. JDR-2]
gi|247545759|gb|ACT02778.1| sulfatase [Paenibacillus sp. JDR-2]
Length = 434
Score = 214 bits (546), Expect = 9e-53, Method: Compositional matrix adjust.
Identities = 151/426 (35%), Positives = 222/426 (52%), Gaps = 55/426 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +GS+ + TP++D LA GI N Y+ PVC+PSRASL+TGKYP G+
Sbjct: 15 GYGDLGCYGSDAMKTPHLDQLASEGIRFTNWYSNSPVCSPSRASLLTGKYPAKAGVT--S 72
Query: 151 IWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
I G + +G+ L + L L+E GY T GKWHLG EY P GF+ +G+ G
Sbjct: 73 ILGGKRGTKGLSLEQTTLASALKEHGYHTALFGKWHLGA-SAEYGPNAHGFDQFYGFRAG 131
Query: 209 VISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
I YY HI + + +N HD+ RN + W+ GEY T+ T+EA I+ P D+P
Sbjct: 132 CIDYYSHIF--YWGQGGGVNPVHDLWRNETEVWEN-GEYMTEAITREATSYIDAAPDDEP 188
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
F+Y+A+ A H + AP+ +++F + P+RR AAM+ +DD VG ++ AL++
Sbjct: 189 YFMYVAYNAPH-----YPMHAPKAYLDRFPDLP-PDRRIMAAMIAAVDDGVGEIVKALKQ 242
Query: 328 KGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
KG E++II F SDNG P+ E R T + GS +RG K +L+EGG++ PAIL
Sbjct: 243 KGAYEDTIIFFSSDNG-PSTESRNWLDGTEDLYYGGSAGRFRGHKASLFEGGIREPAILS 301
Query: 384 SPQ--IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
P +Q ++S +M + D PT+ L L+
Sbjct: 302 YPAGLAEQQGQISDEMFAMMDIFPTM-----------------------LELSGIGTEGY 338
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
++DG + +L N S R + E + AVR WKLVL NG +D + +T
Sbjct: 339 SLDGHSVFDALSGNALSPRKQLFW---EYEGQLAVREGKWKLVL----NGKLD--FSRTE 389
Query: 502 SNKVPL 507
++ V L
Sbjct: 390 ADAVHL 395
Score = 46.6 bits (109), Expect = 0.046, Method: Compositional matrix adjust.
Identities = 27/81 (33%), Positives = 45/81 (55%), Gaps = 7/81 (8%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 62
RN + W+ GEY T+ T+EA I+ P D+P F+Y+A+ A H + AP+ +
Sbjct: 156 RNETEVWEN-GEYMTEAITREATSYIDAAPDDEPYFMYVAYNAPH-----YPMHAPKAYL 209
Query: 63 NQFQYITDPNRRTYAALTKST 83
++F + P+RR AA+ +
Sbjct: 210 DRFPDLP-PDRRIMAAMIAAV 229
>gi|313215020|emb|CBY41206.1| unnamed protein product [Oikopleura dioica]
Length = 427
Score = 213 bits (541), Expect = 3e-52, Method: Compositional matrix adjust.
Identities = 127/374 (33%), Positives = 192/374 (51%), Gaps = 23/374 (6%)
Query: 62 INQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN 121
+ + YI + +A + + + G +D+S HG+ +I TPN+D LA +G++LNN
Sbjct: 1 MKKLLYIFGLSANIFAEQQPNIVVIVADDLGKHDVSMHGA-DIYTPNLDMLARDGVLLNN 59
Query: 122 MYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
Y QPVC+P+R SLMTG+YP G+Q + G P G+PL E +P+Y++E GY T +G
Sbjct: 60 YYVQPVCSPTRGSLMTGRYPYRLGLQHENLVGYRPAGLPLDEYIMPQYMKECGYKTYMVG 119
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
KW LGFF+ Y P RGF+ FG L G YY + L G+D+R D
Sbjct: 120 KWQLGFFKDNYLPWKRGFDEFFGQLLGGQDYYSRRKCLKLRNYGNLCGYDLRTEQGPVRD 179
Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYI 299
T +Y L+ +A + PL++Y+A + H + L+APQ+ ++ +
Sbjct: 180 TSMKYQPFLYADKAREKFFAHNKTDPLYMYVAFQSVH-----RPLQAPQQYVSLYNDNVQ 234
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+R +A MV LD +VG ++ L+ G+ N++I F +DNG R
Sbjct: 235 MSNGQRVFAGMVTALDAAVGEIVDGLKDSGLWNNTLIYFSNDNG------------REDD 282
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSR 417
S++ +G KN W+GG+ P I + R + H+SD LPT+ +AA T
Sbjct: 283 SDFVLKGGKNDFWQGGITGYGFFSGPAIPRKMRGKKMNDLTHVSDVLPTMLSAAKCHTLP 342
Query: 418 LPLNIDGLDQWSSL 431
P IDG D WS L
Sbjct: 343 -PKVIDGHDLWSRL 355
>gi|308512479|ref|XP_003118422.1| CRE-SUL-3 protein [Caenorhabditis remanei]
gi|308239068|gb|EFO83020.1| CRE-SUL-3 protein [Caenorhabditis remanei]
Length = 500
Score = 212 bits (540), Expect = 4e-52, Method: Compositional matrix adjust.
Identities = 133/416 (31%), Positives = 202/416 (48%), Gaps = 58/416 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGP 149
G+NDL + S + TPN+ LA+ N +L N Y +CTP+R++ MTG YP G Q
Sbjct: 42 GFNDLDWKDST-LHTPNLRNLAFHKNTALLTNSYVNQLCTPTRSAFMTGYYPFRVGTQNG 100
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
EP GVP FL E +R+L YST +GKWHLG+ ++E+ P RGF+ +G+
Sbjct: 101 VFLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQ 160
Query: 210 ISYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQ 262
Y++H +DQY R ++ + G D+ + + G Y+TDLFT A+ ++++
Sbjct: 161 TGYFNHS-ADQYHRELKRVVKGLDLFEEVGNGKSVPDFSQNGVYSTDLFTDVAMSVLDNH 219
Query: 263 PVDKPLFLYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
KP F++L++ A H H E P+ + D RR Y M+
Sbjct: 220 NTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSFDKYCSRVKDRYRRIYCGML 279
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+D ++G ++ L+ + EN++I+F SDNG TSN+ SN P RG K+T+
Sbjct: 280 TAMDFAIGRLVEYLKASNLYENTVIVFTSDNGG-------TSNF--GASNAPLRGEKDTI 330
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
WEGG K + SP + M H+ DW T+ + G L ID
Sbjct: 331 WEGGTKTTTFVHSPMYIEEGGTREMMFHVVDWHATILSITG-------LEID-------- 375
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
S DG++QW + P RR + NID +A+R +KL++G
Sbjct: 376 ---------SYGDGINQWEYIKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 420
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 21/78 (26%), Positives = 34/78 (43%), Gaps = 11/78 (14%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-----------AGNAGKHLEAPQET 61
G Y+TDLFT A+ ++++ KP F++L++ A H H E P+
Sbjct: 201 GVYSTDLFTDVAMSVLDNHNTTKPFFMFLSYQAVHPPLQIKSEKKLTPKIATHFEVPRSF 260
Query: 62 INQFQYITDPNRRTYAAL 79
+ D RR Y +
Sbjct: 261 DKYCSRVKDRYRRIYCGM 278
>gi|291244839|ref|XP_002742301.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 263
Score = 212 bits (540), Expect = 5e-52, Method: Compositional matrix adjust.
Identities = 110/269 (40%), Positives = 159/269 (59%), Gaps = 26/269 (9%)
Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
G+D+R N++ + G Y T++FT +A Q+I+ PLFLYLAHLA H+GN LEA
Sbjct: 2 GYDLRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEA 61
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
P++ +++F +I D NR+T+A MV LD+SVG + L+ GM N+IIIF +DNG P
Sbjct: 62 PEKYMSRFPHIQDKNRKTFAGMVSALDESVGNLTKVLKDSGMYNNTIIIFTTDNGGPAAG 121
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
+ N SN+P RG+K TLWEGGV+ + SP +++ RVS M+H+ DWLPT+Y
Sbjct: 122 FNN-----NHASNWPLRGIKATLWEGGVRGDGFVHSPLLEKPGRVSTDMIHVCDWLPTIY 176
Query: 409 TAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINID 468
AGG+ S LP N+DG + W +L SR + S R +L NID
Sbjct: 177 HIAGGNVSSLPTNLDGFNVWDTL-----SR----------------GSVSPRTEILHNID 215
Query: 469 EKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
+ AA+R+ +K+++G G DG+Y
Sbjct: 216 PILQIAALRVGKYKIIMGECYRGNWDGWY 244
Score = 77.0 bits (188), Expect = 3e-11, Method: Compositional matrix adjust.
Identities = 34/82 (41%), Positives = 54/82 (65%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+R N++ + G Y T++FT +A Q+I+ PLFLYLAHLA H+GN LEAP++
Sbjct: 5 LRDNVNVTYYMNGSYTTEVFTTKAEQIIKKHNKQNPLFLYLAHLAVHSGNPSNPLEAPEK 64
Query: 61 TINQFQYITDPNRRTYAALTKS 82
+++F +I D NR+T+A + +
Sbjct: 65 YMSRFPHIQDKNRKTFAGMVSA 86
>gi|313234414|emb|CBY24613.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 212 bits (539), Expect = 6e-52, Method: Compositional matrix adjust.
Identities = 169/580 (29%), Positives = 263/580 (45%), Gaps = 99/580 (17%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
YA K + + +GW D+SFH + I TPNID L G+ L N Y Q +CTP+R++
Sbjct: 17 VYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSA 76
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK-AIGKWHLGFFRREYT 193
L+TG+YPIHTG+Q I ++ G+ E LPEYL + +GKWH+G T
Sbjct: 77 LLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMT 136
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS----TAWDTVGEYATD 249
P RGFE+ GYL G +Y + + ++ + +LS ++W G+Y+ D
Sbjct: 137 PWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSW---GKYSGD 193
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
L+ ++ ++++ K F+Y A H L+A + + ++ +I D NRR YAA
Sbjct: 194 LYLQKMSEIVKSIDPTKDSFIYFAPQHVHYP-----LQALERHLVKYSWIKDTNRRNYAA 248
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV LD+ VG VI + +L+N+I F++DNG T R+ G+N+P G K
Sbjct: 249 MVASLDEIVGDVIDLYHKNRLLKNTIGFFVADNGGET---------RDGGNNWPLSGQKW 299
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
TL EGGVK ++ + L H+SD LPT+ A
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL--FHVSDILPTVMEA------------------- 338
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--------RT-------- 473
N P ++ IDG QW +++ + R+ +L NID K RT
Sbjct: 339 ---FNCPIKK--KIDGYAQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVR 393
Query: 474 --AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL 531
AA+R WKL+ G G+ S++VP + + +Q ++ +
Sbjct: 394 VQAALRKGPWKLITGDP------GF----TSDEVPEF---PVSPPEWHQQGPYKTRGLIS 440
Query: 532 PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
I+ +S+ H N L+ + DP E +I+ P I
Sbjct: 441 QITG----KSSDYYFDNH-------------RNKLVRLYKIDEDPTESRDISDQNPSIVE 483
Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQ-ADPKRFNDTWSPWI 630
++ +L + T+VP + PDL ++PK W PWI
Sbjct: 484 EMLNILIKYNSTMVPP--QLPDLDPFSNPKLHEGFWFPWI 521
>gi|405964717|gb|EKC30169.1| Arylsulfatase B [Crassostrea gigas]
Length = 898
Score = 211 bits (538), Expect = 8e-52, Method: Compositional matrix adjust.
Identities = 138/467 (29%), Positives = 221/467 (47%), Gaps = 110/467 (23%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+GKWH+GF+++EY P RGF+S+FGYL G YY H + ++ G D+R N A
Sbjct: 526 VGKWHVGFYKKEYLPTSRGFDSYFGYLTGSEDYYTH-----HRCYGKMCGMDLRENAVPA 580
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+ G Y+T LF ++A +++ D+PLFLYL + H+ L+ P++ I + +I
Sbjct: 581 -NYSGIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHSP-----LQVPEKYIQPYLHI 634
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
D RR YA MV +D+++G + ++KG+ N++++F +DNG +E G
Sbjct: 635 KDKKRRKYAGMVAAMDEAIGNLTEVFKQKGIWNNTLMVFSTDNGGQILE---------GG 685
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+NYP RG K +LWEGG+ + +++ VS +MH++DW PTL + AGG
Sbjct: 686 NNYPLRGWKASLWEGGMHGVGFVHGQMLKRKGMVSKALMHVTDWFPTLVSLAGG------ 739
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLL-NTPSRRNSVLINID---------- 468
N++G +DG+DQW ++ +T ++R+S+L NID
Sbjct: 740 -NLNG---------------TKPLDGVDQWKTISEGSTVNQRSSILHNIDPLFQPVGHRL 783
Query: 469 -----EKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQ 523
+ + AAVR WKL+ G+ NG+ V +SL
Sbjct: 784 PKSPFDNRIRAAVRFGDWKLITGSPGNGSW--------------------VPPPHMKSLH 823
Query: 524 QLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
Q+ N P NI +LFN+ DP E ++A
Sbjct: 824 QMEDNSEPPEKNI-------------------------------WLFNITADPYEHKDVA 852
Query: 584 SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
P + Q+ +LL + +T VP + + D +A+PK W PW+
Sbjct: 853 HINPKVVMQMLDLLAQYNQTAVPCRYPKMD-EKANPKYLGGYWGPWL 898
Score = 68.6 bits (166), Expect = 9e-09, Method: Compositional matrix adjust.
Identities = 33/72 (45%), Positives = 49/72 (68%), Gaps = 5/72 (6%)
Query: 75 TYAALTKSTTLTLLIV----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
T+ KS+ +L V +G+ND+ +HGS EI TPN+D LA +GI L N Y QP+CTP
Sbjct: 17 TFPVQGKSSQPHILFVLADDFGYNDIGYHGS-EIKTPNLDKLAGDGIRLENYYVQPICTP 75
Query: 131 SRASLMTGKYPI 142
+R+ L++G+Y +
Sbjct: 76 TRSQLLSGRYQV 87
Score = 48.1 bits (113), Expect = 0.013, Method: Compositional matrix adjust.
Identities = 23/70 (32%), Positives = 39/70 (55%), Gaps = 5/70 (7%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
G Y+T LF ++A +++ D+PLFLYL + H+ L+ P++ I + +I D
Sbjct: 584 GIYSTYLFAQKAADIVQKHDSDQPLFLYLPFQSVHSP-----LQVPEKYIQPYLHIKDKK 638
Query: 73 RRTYAALTKS 82
RR YA + +
Sbjct: 639 RRKYAGMVAA 648
>gi|283778949|ref|YP_003369704.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437402|gb|ADB15844.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 486
Score = 211 bits (537), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 138/411 (33%), Positives = 202/411 (49%), Gaps = 48/411 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ F+G EI TPNIDALA G + Y Q +CTP+RA LMTG++P G+Q I
Sbjct: 40 GWKDVGFNGCTEIKTPNIDALAKGGAKFSQFYVQNMCTPTRACLMTGRFPYRYGLQTIVI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A G+ +E +P+ L + GY T IGKWHLG ++Y P RGF+ +G + G +
Sbjct: 100 PTAAGYGLDTSEYLMPQCLGDAGYKTAIIGKWHLGHADQKYWPKQRGFDYQYGAMIGELD 159
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H D++ D R+ + Y T L +AV+ I Q KP +LY
Sbjct: 160 YFTH---DEHGVL------DWFRDNKPVHEQ--GYTTTLIGDDAVKYIHGQDGKKPFYLY 208
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L A H +AP+E I ++ I +P RRTYAAMV LD+++G V++AL +KG+
Sbjct: 209 LTFNAPHT-----PYQAPKEYITKYLNIAEPTRRTYAAMVDCLDENIGKVVAALDQKGLR 263
Query: 332 ENSIIIFMSDNGAP-----TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSP 385
EN++I F SDNG + + S N PYR K +L+EGG +V A+ W
Sbjct: 264 ENTLIFFHSDNGGTKDKMFAGQMADMSKVVLPCDNGPYRNGKGSLFEGGSRVCALANWPG 323
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+I+ + M+H D PT AG ++ +DG
Sbjct: 324 KIKA--QTVDGMIHAVDLYPTFAALAGASIAKC----------------------KPLDG 359
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+ W ++ PS R +I+ + A +R WKL+ T ++D Y
Sbjct: 360 TNVWDTIAEGKPSPRTEFFYSIEPFR--AGLRQGDWKLIWRTMLPSSVDLY 408
Score = 46.6 bits (109), Expect = 0.039, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T L +AV+ I Q KP +LYL A H +AP+E I ++ I +P RR
Sbjct: 183 YTTTLIGDDAVKYIHGQDGKKPFYLYLTFNAPHT-----PYQAPKEYITKYLNIAEPTRR 237
Query: 75 TYAALT 80
TYAA+
Sbjct: 238 TYAAMV 243
>gi|298706368|emb|CBJ29377.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 653
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 164/573 (28%), Positives = 263/573 (45%), Gaps = 83/573 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FH + TPN+DA+ G+ L+ Y P CTPSRA LMTG+Y GMQ +
Sbjct: 48 GWKDVGFHDTT-FSTPNLDAMVAEGVELSTFYTAPTCTPSRAQLMTGRYSYRIGMQDSVL 106
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EPRGVPLTE F+ E L+ GYST A+GKWHLG ++ P+ RGF+ +G L G
Sbjct: 107 HTTEPRGVPLTETFVGEKLQAAGYSTAAVGKWHLGMHMPQFLPVERGFDDFYGILTGGGG 166
Query: 212 YYDHI-LSDQYS------RTVELNGHDMRRNLSTAWDT--VGEYATDLFTKEAVQLIEDQ 262
+Y H+ +S +++ +T +G ++ + D ++T+L+TK+A + +E
Sbjct: 167 HYTHMSVSPEFTPRGGSGKTRTFSGPNIVEGSEVSDDNQDTRTHSTELYTKKAAEYVEAM 226
Query: 263 PV--DKPLFLYLAHLAAHA----------GNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
D+P FLYL++ A H G + + + A +I + NRR M
Sbjct: 227 SAKEDQPWFLYLSYQAIHDPIETDDSWINGRSCESISAENSSIGETDETDYDNRRIACGM 286
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKN 369
V ++D+ +G + L+ G +N+++IF SDNG + GS N P+RG K
Sbjct: 287 VAQMDNGLGALRGLLENLGEWDNTVVIFFSDNGG----------LPSHGSVNRPFRGGKG 336
Query: 370 TLWEGGVKVPAILWSPQIQQ-------NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
WEGGV VPA + + P + H++D T+ AG +P
Sbjct: 337 DYWEGGVHVPAFVSGGFVSAALRRTGVEPYRYGHLTHVTDVHATILGLAG-----VPGAD 391
Query: 423 DGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDS 480
DG + +DG+D W SL+ R +V+IN++ + AVR +
Sbjct: 392 DG------------GNGEAELDGIDLWDSLVETKAPARQAVVINLNSPNFASSGAVRWSN 439
Query: 481 WKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL--PISNIDK 538
+KL+ + T+ Y + +S V + ES ++ + I L P +
Sbjct: 440 YKLIRNPEPKETV--IYSRVQSKLVE--EEGVVSESTLASAISTVQAQIHLSEPKMYMFD 495
Query: 539 MRSTRQQATIH-CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
+ + ++ + CG + A C N L+++ P + D +L ++
Sbjct: 496 LEANPSESVVEGCGTDRALGPVESCAN----LYSI---PAFR--------DARKKLEGII 540
Query: 598 -KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
K R ++ P + D ADP + W PW
Sbjct: 541 RKADRESVAPTLRWEDDGPLADPSNWGG-WVPW 572
>gi|313213139|emb|CBY36997.1| unnamed protein product [Oikopleura dioica]
Length = 532
Score = 211 bits (536), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 168/580 (28%), Positives = 263/580 (45%), Gaps = 99/580 (17%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
YA K + + +GW D+SFH + I TPNID L G+ L N Y Q +CTP+R++
Sbjct: 17 VYATQAKHVLVIIADDFGWADVSFHNTGGIQTPNIDRLVGGGLELTNYYTQHICTPTRSA 76
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK-AIGKWHLGFFRREYT 193
L+TG+YPIHTG+Q I ++ G+ E LPEYL + +GKWH+G T
Sbjct: 77 LLTGRYPIHTGLQTNVIAISQSSGLQRDEMLLPEYLESCDIKQRHMVGKWHVGHGHSWMT 136
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS----TAWDTVGEYATD 249
P RGFE+ GYL G +Y + + ++ + +LS ++W G+Y+ D
Sbjct: 137 PWKRGFETFSGYLAGAEDHYTREWCMEQTDWCGVDYSEHSASLSGPTNSSW---GKYSGD 193
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
L+ ++ ++++ K F+Y A H L+A + + ++ +I D NRR YAA
Sbjct: 194 LYLQKMSEIVKSIDPTKDSFIYFAPQHVHYP-----LQALERHLVKYSWIKDTNRRNYAA 248
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV LD+ VG V+ + +L+N+I F++DNG T R+ G+N+P G K
Sbjct: 249 MVASLDEIVGDVVDLYHKNRLLKNTIGFFVADNGGET---------RDGGNNWPLSGQKW 299
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
TL EGGVK ++ + L H+SD LPT+ A
Sbjct: 300 TLLEGGVKATGFMFGGLVPLGKYNGL--FHVSDILPTVMEA------------------- 338
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--------RT-------- 473
N P ++ IDG QW +++ + R+ +L NID K RT
Sbjct: 339 ---FNCPIKK--KIDGYAQWEAIIYSNDFPRSQILHNIDPTKINKVANDSRTWNSDWDVR 393
Query: 474 --AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFL 531
AA+R WKL+ G G+ S++VP + + +Q ++ +
Sbjct: 394 VQAALRKGPWKLITGDP------GF----TSDEVPEF---PVSPPEWHQQGPYKTRGLIS 440
Query: 532 PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
I+ +S+ H N L+ + DP E +I+ P I
Sbjct: 441 QITG----KSSDYYFDNH-------------RNKLVRLYKIDEDPTESRDISDQNPSIVE 483
Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQ-ADPKRFNDTWSPWI 630
++ +L + T+VP + PDL ++PK W PWI
Sbjct: 484 EMLNILIKYNSTMVPP--QLPDLDPFSNPKLHEGFWFPWI 521
>gi|117558328|gb|AAI27451.1| Arsb protein [Rattus norvegicus]
Length = 523
Score = 210 bits (535), Expect = 1e-51, Method: Compositional matrix adjust.
Identities = 148/468 (31%), Positives = 220/468 (47%), Gaps = 84/468 (17%)
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH---- 230
Y+T +GKWHLG +R+E P RGF+++FGYL G YY H + + LNG
Sbjct: 128 YATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIECLNGTRCAL 184
Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
D+R A + Y+T++FTK A LI + P +KPLFLYLA + H L+ P+
Sbjct: 185 DLRDGEEPAKEYTDIYSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPE 239
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
E + + +I D +RR YA MV LD++VG V AL+ +G+ N+++IF +DNG T
Sbjct: 240 EYMEPYDFIQDKHRRIYAGMVSLLDEAVGNVTKALKSRGLWNNTVLIFSTDNGGQT---- 295
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
R+ G+N+P RG K TLWEGG++ + SP ++Q S ++MHI+DWLPTL
Sbjct: 296 -----RSGGNNWPLRGRKGTLWEGGIRGAGFVASPLLKQKGVKSRELMHITDWLPTLVNL 350
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRR-----NSNIDGLDQWSSLLLNTPSRRNSVLI 465
AGG T +DG D W ++ +PS R N + D D NT +N
Sbjct: 351 AGGSTHGT-KPLDGFDVWETISEGSPSPRVELLLNIDPDFFDGLPCPGKNTTPEKNDSFP 409
Query: 466 NIDEKKRT---AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSL 522
T A +R +WKL+ G G Q+ ++VP S+
Sbjct: 410 LEHSAFNTSIHAGIRYKNWKLLTGYPGCGYWFPPPSQSNISEVP--------------SV 455
Query: 523 QQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI 582
++ ++L F++ DP E++++
Sbjct: 456 DSPTKTLWL--------------------------------------FDINRDPEERHDV 477
Query: 583 ASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
+ P I L L+Y+ VP S+ P + DPK WSPW+
Sbjct: 478 SREHPHIVQNLLSRLQYYHEHSVP-SYFPPLDPRCDPKG-TGVWSPWM 523
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A LI + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 200 YSTNIFTKRATTLIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYDFIQDKHRR 254
Query: 75 TYAALT 80
YA +
Sbjct: 255 IYAGMV 260
>gi|291243527|ref|XP_002741646.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 506
Score = 209 bits (532), Expect = 4e-51, Method: Compositional matrix adjust.
Identities = 159/555 (28%), Positives = 246/555 (44%), Gaps = 103/555 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H ++ P ++ LA +G+I N Y QP CTPSR++L TG YP Q +
Sbjct: 34 GWNDVGWHNP-DLKMPILNQLAADGVIFNQSYVQPACTPSRSALFTGYYPFKIKRQHQML 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E G+ L + LPE L+++GY T +GKWHLGF + EY P RGF+S +G+L
Sbjct: 93 LNLEADGLSLDLKTLPEMLKDVGYLTHLVGKWHLGFCKEEYLPNKRGFDSFYGWLT---- 148
Query: 212 YYDHILSDQYSR-TVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+ +D Y++ V G+D R N ++ Y + + AV ++ + PLFL
Sbjct: 149 ----LGTDLYTKENVLAPGYDFRDNTGVVQES-DTYLPFMLAERAVDIVMGHYKEYPLFL 203
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM-----------------VKK 313
+ GK LE PQ+ + + I D R + + +
Sbjct: 204 EFS-----MALPGKFLEVPQDYEDLYSDIDDDRTRKFYGICNLNIVVSKNTKLPNRKLSA 258
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
+D+S+G ++ AL+ +GM ++++ IF SDNGA E GSN+P RG TL+E
Sbjct: 259 MDNSIGLLVEALKTRGMWDDTLFIFTSDNGAFATES---------GSNWPLRGNIATLFE 309
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG +VP + +++ V+ +++HI D T+ AG T +IDG+D W ++
Sbjct: 310 GGTRVPTFFYGSMLKKAGYVNNELIHIVDLHKTIIELAGATTES---DIDGMDMWKTISK 366
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
PS R + +D +E AA+R+ +KL+ G + M
Sbjct: 367 GEPSPRTEFVYNIDD-------------------NELSPGAAIRVGDYKLITGHPD--LM 405
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
+ RS+ Y + I LP + G
Sbjct: 406 YPFRLLNRSD-------------DWYNYGDEPISGILLPPT----------------GNE 436
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
P P + YLFN+ +DP E+NN+A P+ +L L +R LVP D
Sbjct: 437 PPPANVT-------YLFNIIDDPEERNNLADVYPNKVEELRHRLDEYREHLVPPESRIFD 489
Query: 614 LVQADPKRFNDTWSP 628
D F WSP
Sbjct: 490 -PSGDAVNFGGVWSP 503
>gi|221640917|ref|YP_002527179.1| twin-arginine translocation pathway signal protein [Rhodobacter
sphaeroides KD131]
gi|221161698|gb|ACM02678.1| Twin-arginine translocation pathway signal [Rhodobacter sphaeroides
KD131]
Length = 509
Score = 207 bits (528), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 145/459 (31%), Positives = 218/459 (47%), Gaps = 69/459 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS+ + TPN+D LA G L Y QP+CTP+RA+LMTG+YP+ G+Q I
Sbjct: 75 GYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGRYPMRYGLQTGVI 133
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E LP+ L+E GY T +GKWHLG ++Y P RG + +G L G I
Sbjct: 134 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGVDYFYGPLVGEID 193
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ H + + T ++M + Y T+LF +A++LIE+ PL++Y
Sbjct: 194 HFKH---EAHGITDWYRDNEMVKEPG--------YDTELFGADAIRLIEEHDSATPLYMY 242
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+ A H +AP + + + I D R+ YAAM+ +DD VG V+ AL+R+GM
Sbjct: 243 LSFTAPHT-----PYQAPDKYKDLYPDIADEGRKAYAAMISCMDDQVGLVLQALERRGMR 297
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGS--------NYPYRGVKNTLWEGGVKVPAIL- 382
E++++IF SDNG + + G+ N P R K TL+EGG +V A+
Sbjct: 298 EDTLVIFHSDNGG-----TRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALAN 352
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W +I + MMH+ D LPTL A + +
Sbjct: 353 WPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAH----------------------AGQ 388
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
+DG+D W ++ S R V+ NI+ + A+R WKL Y+
Sbjct: 389 LDGMDVWQAISAGKASPREEVVYNIEPTQ--GALRDGKWKL------------YWQPILP 434
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
KV L + A T S ++ Q + ID RS
Sbjct: 435 PKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARS 473
Score = 41.2 bits (95), Expect = 1.8, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 36/65 (55%), Gaps = 5/65 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T+LF +A++LIE+ PL++YL+ A H +AP + + + I D R+
Sbjct: 217 YDTELFGADAIRLIEEHDSATPLYMYLSFTAPHT-----PYQAPDKYKDLYPDIADEGRK 271
Query: 75 TYAAL 79
YAA+
Sbjct: 272 AYAAM 276
>gi|298706923|emb|CBJ29750.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 706
Score = 207 bits (527), Expect = 1e-50, Method: Compositional matrix adjust.
Identities = 167/559 (29%), Positives = 253/559 (45%), Gaps = 105/559 (18%)
Query: 92 GWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
GWND+ + + + TP++D LA G+ + N Y +CTP+RASLMTG+Y + G+Q
Sbjct: 142 GWNDIGYQSVDLQGVTPHLDRLAAGGVKMTNYYTMSICTPARASLMTGRYVMRYGLQYSV 201
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--- 207
I P G+PLTE+ PEY+++ GY T IGKWH+G + + P RGF+++ GYLN
Sbjct: 202 IQPGAPWGLPLTEKIFPEYMKDAGYETHMIGKWHIGSYTSRHIPSQRGFDTYLGYLNDEE 261
Query: 208 -----------------------GVISYYDHI---LSDQYSRTVELNGHDMRRNLSTAWD 241
YYD I + DQ +G D LS
Sbjct: 262 MYWTHQSWTAKYNGRKFFDFGFGNATGYYDVIDREVPDQLRDGTISDGDDDETMLSATSS 321
Query: 242 TV-------GEYATDLFTKEAVQLIEDQPV--DKPLFLYLAHLAAH---AGNAGKHLEAP 289
G Y+T +F A+++I ++ D+PL++YLAH A H +H
Sbjct: 322 YSSSGGFFKGRYSTQVFHNRALEIIHEKTPYDDEPLYMYLAHQAVHDPLGLPPAEHFTPD 381
Query: 290 QETINQFQYI-------TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
Q ++ + I T+ R +A ++ LD ++G ++ L+ G +ENSIII SDN
Sbjct: 382 Q--LDLLERIETGSEEGTEALRARFAKVLMYLDYTIGELVEYLETNGWMENSIIIVASDN 439
Query: 343 GA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ---MM 398
G P+ GSNYP RG+K++ WEGG KVP+ ++S R + +M
Sbjct: 440 GGCPSCG----------GSNYPLRGIKHSYWEGGTKVPSFVYSKSHIPEERWGTEYDGLM 489
Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
H++DWLPT+ AA D DG+DQW L+ + + S
Sbjct: 490 HVTDWLPTIAGAAEIDLDGGAGPFDGVDQW-PLITSVAAEAEFG---------------S 533
Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKT 518
R+ ++ N D + +DS +G E G + RS K L T
Sbjct: 534 ARDEMMYNFDPY----ILWVDS-SDEIGDPEFNLAQGAF---RSGKYKYLANEWCTGWYT 585
Query: 519 YQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCT--NGPCYLFNLGNDP 576
+ L++ P++N+ TI C +P CT N YLF+L DP
Sbjct: 586 FDK-HILAEE---PLTNL----------TIACDGSPCSDCGQGCTSYNYTDYLFDLEADP 631
Query: 577 CEQNNIASSRPDISSQLYE 595
E+ N+ PD++ + E
Sbjct: 632 REETNLIYDLPDVAHNMRE 650
>gi|442749327|gb|JAA66823.1| Putative arylsulfatase b precursor [Ixodes ricinus]
Length = 274
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 103/218 (47%), Positives = 139/218 (63%), Gaps = 5/218 (2%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+SFHGS +IPTPN+DALA +GIILN YAQ +CTPSRA+LMTG+YPI+TGMQ I
Sbjct: 59 GWDDVSFHGSPQIPTPNMDALAADGIILNQHYAQALCTPSRAALMTGRYPIYTGMQHFVI 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL R +PE+ +LGY T +GKWHLG F++++ P+ RGF+S +G+ N
Sbjct: 119 QPGEPWGLPLEYRLMPEFFSDLGYKTHMVGKWHLGSFKKDFIPVRRGFDSFYGFYNADQD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY+ L++ E G+D N Y+T +T+ AV LI +PLFLY
Sbjct: 179 YYNKTLTEG-----EHTGYDFWLNEDIHIYPNNRYSTHHYTERAVSLIRSHNPSQPLFLY 233
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
L++ G LEAP+E +N+F YI + NR YA
Sbjct: 234 LSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRTIYAG 271
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/64 (42%), Positives = 37/64 (57%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T +T+ AV LI +PLFLYL++ G LEAP+E +N+F YI + NR
Sbjct: 208 YSTHHYTERAVSLIRSHNPSQPLFLYLSYXXXXVGTGPSLLEAPEENVNKFLYIPEKNRT 267
Query: 75 TYAA 78
YA
Sbjct: 268 IYAG 271
>gi|429206655|ref|ZP_19197919.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
gi|428190241|gb|EKX58789.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodobacter sp.
AKP1]
Length = 498
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 146/459 (31%), Positives = 217/459 (47%), Gaps = 69/459 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ +HGS+ + TPN+D LA G L Y QP+CTP+RA+LMTG YP+ G+Q I
Sbjct: 64 GYADVGYHGSD-VKTPNVDRLAAEGARLMQFYTQPLCTPTRAALMTGCYPMRYGLQTGVI 122
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E LP+ L+E GY T +GKWHLG ++Y P RGF+ +G L G I
Sbjct: 123 PSGGRYGLDTAEVLLPQVLKEAGYKTALVGKWHLGHADQKYWPRQRGFDYFYGPLVGEID 182
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ H + + T ++M + Y T+LF +A++LIE+ PL++Y
Sbjct: 183 HFKH---EAHGITDWYRDNEMVKEPG--------YDTELFGADAIRLIEEHDSATPLYMY 231
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+ A H +AP + + + I D R YAAM+ +DD VG V+ AL+R+GM
Sbjct: 232 LSFTAPHT-----PYQAPDKYKDLYPDIADEGRNAYAAMISCMDDQVGPVLQALERRGMR 286
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGS--------NYPYRGVKNTLWEGGVKVPAIL- 382
E++++IF SDNG + + G+ N P R K TL+EGG +V A+
Sbjct: 287 EDTLVIFHSDNGG-----TRSKMFAGEGAVAGELPPRNDPLREGKGTLYEGGTRVVALAN 341
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W +I + MMH+ D LPTL A + +
Sbjct: 342 WPGRIPAGE--THGMMHVVDMLPTLAGLAQAEIAH----------------------AGQ 377
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
+DG+D W ++ S R V+ NI+ + A+R WKL Y+
Sbjct: 378 LDGMDVWQAISAGKASPREEVVYNIEPTQ--GALRDGKWKL------------YWQPILP 423
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRS 541
KV L + A T S ++ Q + ID RS
Sbjct: 424 PKVELFDLEADPSETTDLSAKEPEQLARMQARVIDLARS 462
Score = 39.7 bits (91), Expect = 4.5, Method: Compositional matrix adjust.
Identities = 22/65 (33%), Positives = 35/65 (53%), Gaps = 5/65 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T+LF +A++LIE+ PL++YL+ A H +AP + + + I D R
Sbjct: 206 YDTELFGADAIRLIEEHDSATPLYMYLSFTAPH-----TPYQAPDKYKDLYPDIADEGRN 260
Query: 75 TYAAL 79
YAA+
Sbjct: 261 AYAAM 265
>gi|443693750|gb|ELT95037.1| hypothetical protein CAPTEDRAFT_126817, partial [Capitella teleta]
Length = 318
Score = 207 bits (526), Expect = 2e-50, Method: Compositional matrix adjust.
Identities = 118/323 (36%), Positives = 169/323 (52%), Gaps = 24/323 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ + I TPN+DALA G+IL N Y Q +C+PSR +LMTG+YP + MQ +
Sbjct: 14 GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCSPSRHALMTGRYPYKSAMQSFVV 72
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E + L + LP+YL+ELGY IGKWHLG+ R E P RGF+S +G L+G
Sbjct: 73 LPFEAKCTGLEYKLLPQYLKELGYENHLIGKWHLGYCREECLPTSRGFDSFYGLLDGAGD 132
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H S Y D N + G ++ DL +L + PLFLY
Sbjct: 133 YWEHTTSGVY---------DWHLNDEVFHEAYGNHSQDLELDRLDKLFAEHDNKDPLFLY 183
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
A H + + + F + RR Y A+ LD VG ++ L+ M+
Sbjct: 184 FAPQNPHVPSQPTEEYLSMYSADDFSDV----RRRYLALTSALDSMVGKIVQHLKDNCMM 239
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+N+ +IFMSDNGA E G N P+RG K +L+EGG K + ++S +++
Sbjct: 240 DNTYLIFMSDNGADPGE----------GQNTPFRGGKTSLFEGGTKSNSFIYSQLLKKTE 289
Query: 392 RVSLQMMHISDWLPTLYTAAGGD 414
+ +MHI+DWLPTL AGGD
Sbjct: 290 YENDGLMHITDWLPTLVKLAGGD 312
>gi|346464573|gb|AEO32131.1| hypothetical protein [Amblyomma maculatum]
Length = 418
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 147/438 (33%), Positives = 207/438 (47%), Gaps = 54/438 (12%)
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
Y P RGF++ +GY N Y+ L+ + G D N + D G Y T L
Sbjct: 1 YLPTRRGFDTFYGYYNTAKYYFTQKLN-----FYNICGRDYWDNETLVTDKDGTYDTHLI 55
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
T AV +I DKPLF+Y+A LAAH ++APQ + +F YI D NR A V
Sbjct: 56 TDRAVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIGDHNRTLLAGAV 115
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
LD+SVG +++AL ++ ML NS+++F SDNG ++ N GSN+P RG K T+
Sbjct: 116 DALDESVGRIVAALHQRRMLANSVLVFTSDNGG-----IPWGDFSNTGSNWPLRGTKGTM 170
Query: 372 WEGGVKVPAILWSPQIQQNPRVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
WEGGV++PA++WS + RV + MH+ DWLPTLY+AAGG L NIDG+D W
Sbjct: 171 WEGGVRLPAVVWSSMLAPGSRVVVPHPMHLVDWLPTLYSAAGGHIGDLG-NIDGIDLWPV 229
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL----- 485
L S +++ W R LIN D +A R +KLV
Sbjct: 230 LSSAKTSSKST------SWP---------RTEFLINYDTDTGLSAYRDGDYKLVAVTTPK 274
Query: 486 -----------GTQENGTMDGYYGQTRSNKVPLLNFNAIVESK-TYQSLQQLSQNIFLPI 533
G Q N G R K + N ++ S +++LQ F
Sbjct: 275 SKDARYPFGDPGLQRNVPTPGGIPPDRCRKGAIKRLNTLMASSLAWRTLQG-----FYGD 329
Query: 534 SNIDKMRST----RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDI 589
I + S R QA++ C + YLF++ DPCE NN+A S P +
Sbjct: 330 GEIGRAESLAPEWRVQASVRCNPRKETVPGLEQLREGNYLFDIKRDPCELNNLAHSLPSV 389
Query: 590 SSQLYELLK-YHRRTLVP 606
++L + L+ Y T+ P
Sbjct: 390 VAKLSKKLQSYVAITMAP 407
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 37/69 (53%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D G Y T L T AV +I DKPLF+Y+A LAAH ++APQ + +F YI
Sbjct: 45 DKDGTYDTHLITDRAVDIISSHDPDKPLFIYMAPLAAHGQTTNLTVDAPQRNLAKFPYIG 104
Query: 70 DPNRRTYAA 78
D NR A
Sbjct: 105 DHNRTLLAG 113
>gi|283779108|ref|YP_003369863.1| sulfatase [Pirellula staleyi DSM 6068]
gi|283437561|gb|ADB16003.1| sulfatase [Pirellula staleyi DSM 6068]
Length = 468
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 121/320 (37%), Positives = 180/320 (56%), Gaps = 32/320 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G++DL HG +IPTP++DALA +G+ + Y P C+P+RA L+TG+Y G + P
Sbjct: 41 GYHDLGVHGCKDIPTPHLDALATSGVRCTSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 100
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
G P G +PL+E L + L+++GY T +GKWHLG + + PL RGF+ FG+L
Sbjct: 101 --GPTPTGEIGLPLSETTLADRLKKVGYKTGMVGKWHLGNDEKRH-PLSRGFDEFFGFLG 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G +Y+ + + T L G ++ EY TD F +EAV I D+ P
Sbjct: 158 GARTYF-ATPGNASAGTKLLRGREVVDE--------KEYLTDAFAREAVAYI-DRSKASP 207
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FLYL A H +EA Q+ +++F ++DP R+ Y AM+ +DD+VG V++ L+R
Sbjct: 208 FFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKRQKYCAMMSAMDDAVGQVVAKLER 262
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
+ +LEN++I F+SDNG PT N G N P RG K T WEGG++VP + W +
Sbjct: 263 EKLLENTLIFFVSDNGGPTAA--------NTGDNTPLRGFKATTWEGGIRVPYFVSWKGK 314
Query: 387 IQQNPRVSLQMMHISDWLPT 406
I ++ I D++PT
Sbjct: 315 IPAGKTYDQPVIQI-DFVPT 333
Score = 48.1 bits (113), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY TD F +EAV I D+ P FLYL A H +EA Q+ +++F ++DP R
Sbjct: 186 EYLTDAFAREAVAYI-DRSKASPFFLYLTFNAVHT-----PMEASQKYLDRFTAVSDPKR 239
Query: 74 RTYAALTKS 82
+ Y A+ +
Sbjct: 240 QKYCAMMSA 248
>gi|390341601|ref|XP_796347.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 497
Score = 206 bits (523), Expect = 4e-50, Method: Compositional matrix adjust.
Identities = 105/251 (41%), Positives = 151/251 (60%), Gaps = 9/251 (3%)
Query: 91 YGWNDLSFHGSNE---IPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
+G+ND+ +HG + TPN+D LA G+ L Y QP+C+P+R+ L++G+Y IHTG+Q
Sbjct: 45 FGYNDVGYHGREHGSMVLTPNLDGLAGEGVKLEKYYVQPICSPTRSQLLSGRYQIHTGLQ 104
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I A+P +PL E LP+ L+E Y+T +GKWH+GF++ TP RGF+S+FGYL+
Sbjct: 105 YGVIRPAQPHCLPLDEVTLPQKLKERDYATHMVGKWHIGFYKDACTPTERGFDSYFGYLS 164
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G YY H S Q L G D+ N + A+ G+Y+T LFT +A+ +I + KP
Sbjct: 165 GAEDYYSHSRSFQIGSKT-LKGLDLMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSKP 223
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFLYLA+ A H+ L+ P + + IT RR YA MV +D+ +G V AL
Sbjct: 224 LFLYLAYQAVHS-----PLQVPSKYEEPYANITSSARRAYAGMVSCMDEGIGNVTRALVD 278
Query: 328 KGMLENSIIIF 338
G+ N+IIIF
Sbjct: 279 AGLYNNTIIIF 289
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 24/62 (38%), Positives = 37/62 (59%), Gaps = 1/62 (1%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LFN+ DP E+N+++ + PD+ +QL L+Y+ + VP + PD V ADP W
Sbjct: 436 WLFNITADPLEENDLSDTYPDVVAQLLHRLQYYYKDSVPPRYPPPD-VNADPALHGGAWG 494
Query: 628 PW 629
PW
Sbjct: 495 PW 496
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 28/80 (35%), Positives = 43/80 (53%), Gaps = 5/80 (6%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+ N + A+ G+Y+T LFT +A+ +I + KPLFLYLA+ A H+ L+ P +
Sbjct: 188 LMANKTPAFQYKGQYSTHLFTSKAIDVINNHERSKPLFLYLAYQAVHS-----PLQVPSK 242
Query: 61 TINQFQYITDPNRRTYAALT 80
+ IT RR YA +
Sbjct: 243 YEEPYANITSSARRAYAGMV 262
>gi|443702858|gb|ELU00682.1| hypothetical protein CAPTEDRAFT_125641 [Capitella teleta]
Length = 370
Score = 205 bits (522), Expect = 5e-50, Method: Compositional matrix adjust.
Identities = 131/358 (36%), Positives = 188/358 (52%), Gaps = 25/358 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ F +I TPNID LA G+++ N Y+ VCTPSR +LMTG+YP TGMQ I
Sbjct: 22 GYNDVGFRNP-DIITPNIDKLARKGVVMTNSYSTHVCTPSRHALMTGRYPYKTGMQNFVI 80
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G P L +FLP+YL+ LGY+T A+GKWHLG R E P RGF+S +G L G
Sbjct: 81 PGDAPVCSGLEYKFLPQYLKSLGYNTHAVGKWHLGDCRDECLPTERGFDSFYGLLLGGGG 140
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H + +D N G ++ DL + ++P+FLY
Sbjct: 141 YWNHTYT-------LFGAYDWFNNKDLDLSANGTHSQDLMVDRLSAVFASHNREEPMFLY 193
Query: 272 LAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
A H + EA + + + + RR Y +V LD VG ++ + M
Sbjct: 194 FAPQNPHTPS-----EATENFLRHYPSSMYTEIRRQYLGLVSGLDWMVGELLELVIANNM 248
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
L+N+ IIF SDNG VE N PYRG K +L+EGG KV ++SP +++
Sbjct: 249 LDNTYIIFQSDNGGDAVE----------ALNGPYRGGKGSLFEGGSKVVNFIYSPLLKKV 298
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQ 448
+ MHI+DW+PT+ + AGG+ L IDG++Q S +L S+R S + +D+
Sbjct: 299 GYENNGWMHITDWMPTIISLAGGEVP-LVEGIDGINQMSMVLKGAESKRTSMVYNIDR 355
>gi|323452769|gb|EGB08642.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1517
Score = 205 bits (521), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 128/361 (35%), Positives = 188/361 (52%), Gaps = 44/361 (12%)
Query: 77 AALTKSTTLTLLIVYGWNDLSF-HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
AA L+ G++D+ + + + TP+IDALA G+ L+ Y+ CTP+R +L
Sbjct: 67 AASYPHVVFALMDDLGFDDVGYGNADGAVATPHIDALAKEGVTLSRYYSAFSCTPARGAL 126
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG P G+Q ++ +P G+P LP++L +LGY + +GKWHLG F E P
Sbjct: 127 LTGLSPHRLGLQHGQVFPEQPWGLPSKFSILPQHLAKLGYRSHLVGKWHLGHFSAERLPT 186
Query: 196 YRGFESHFGYLNGVISYYDHILS-------DQYSRTVELNGH--------------DMRR 234
RGF+S FG L+G Y HI + D R E+ + D+R
Sbjct: 187 ARGFDSFFGGLDGAQYYATHIDAMDCKLPGDVLYRGFEVGDYDSLKAVTAEHGCYFDLRE 246
Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQ-----PVDKPLFLYLAHLAAHAGNAGKHLEAP 289
N D G Y+T LF ++A +LI+ +KPLFL L+ A HA + AP
Sbjct: 247 NNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PVWAP 301
Query: 290 QETIN---QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
++T +T+ NRR +AA ++ +DD+VG V++AL+ M NSI+IF SDNGA
Sbjct: 302 EDTYETHPDLLNVTNGNRRKFAAALRLVDDAVGNVVAALEEARMDANSILIFASDNGA-- 359
Query: 347 VEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
N + GSN P RG K L+EGGV+VPA + +P+ H++DW+PT
Sbjct: 360 -------NPEHGGSNAPLRGSKGYLFEGGVRVPAFVRAPKYLPRGATYDHPFHVTDWVPT 412
Query: 407 L 407
L
Sbjct: 413 L 413
Score = 39.7 bits (91), Expect = 5.1, Method: Compositional matrix adjust.
Identities = 29/89 (32%), Positives = 44/89 (49%), Gaps = 13/89 (14%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-----PVDKPLFLYLAHLAAHAGNAGKHL 55
+R N D G Y+T LF ++A +LI+ +KPLFL L+ A HA +
Sbjct: 244 LRENNDRVEDLFGSYSTQLFGRKAEELIDAHSKRADAAEKPLFLLLSFNAVHA-----PV 298
Query: 56 EAPQETIN---QFQYITDPNRRTYAALTK 81
AP++T +T+ NRR +AA +
Sbjct: 299 WAPEDTYETHPDLLNVTNGNRRKFAAALR 327
>gi|241789348|ref|XP_002400616.1| sulfatase, putative [Ixodes scapularis]
gi|215510801|gb|EEC20254.1| sulfatase, putative [Ixodes scapularis]
Length = 224
Score = 205 bits (521), Expect = 7e-50, Method: Compositional matrix adjust.
Identities = 105/218 (48%), Positives = 136/218 (62%), Gaps = 6/218 (2%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+SF G +IPTPN+D LA GIILNN Y +CTPSR +LM+G YPIHTG+Q
Sbjct: 13 GWADVSFRGDPQIPTPNLDVLASQGIILNNYYVLHLCTPSRGALMSGLYPIHTGLQHYVQ 72
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
AEP G+PL +PE+L+ LGY+T IGKW+LG+++ YTP RGF+S +G+LNG
Sbjct: 73 LPAEPHGLPLNVTIMPEHLKNLGYTTHMIGKWNLGYYKESYTPTRRGFDSFYGFLNGGED 132
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YYDH + + +G RN S Y+TDLFTK+A+ LI+D KP+FLY
Sbjct: 133 YYDHTILFVSTGLDFWDGTTPVRNQSH------HYSTDLFTKKALALIKDHDQAKPMFLY 186
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+H A H+G+ LEAP I F YI + NR A
Sbjct: 187 FSHQAVHSGDFKVELEAPASAIAHFPYIKEQNRSIIAG 224
Score = 65.9 bits (159), Expect = 7e-08, Method: Compositional matrix adjust.
Identities = 31/65 (47%), Positives = 40/65 (61%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
Y+TDLFTK+A+ LI+D KP+FLY +H A H+G+ LEAP I F YI + NR
Sbjct: 160 HYSTDLFTKKALALIKDHDQAKPMFLYFSHQAVHSGDFKVELEAPASAIAHFPYIKEQNR 219
Query: 74 RTYAA 78
A
Sbjct: 220 SIIAG 224
>gi|410932865|ref|XP_003979813.1| PREDICTED: arylsulfatase I-like, partial [Takifugu rubripes]
Length = 316
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/291 (37%), Positives = 168/291 (57%), Gaps = 20/291 (6%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q I +P +P + LP+ L+ELGYST +GKWHLGF+++E P RGF++
Sbjct: 2 IHTGLQHSIIRPRQPNCLPFDQVTLPQRLQELGYSTHMVGKWHLGFYKKECLPTRRGFDT 61
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
+FG L G ++YY + D + G D+ S AW G+Y+T L+T+ +++
Sbjct: 62 YFGSLTGSVNYYTYDSCD----GPGMCGFDLHEGESVAWSQKGKYSTHLYTQRVRKILAT 117
Query: 262 Q-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
P +PLF++L+ A H L+ P+E I ++ + + RR YAAMV +D++V
Sbjct: 118 HDPRSQPLFIFLSFQAVHT-----PLQCPREYIYPYRGLENIARRKYAAMVSAVDEAVRN 172
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
+ L++ G ENSI+IF +DNG + + GSN+P RG K T WEGGV+
Sbjct: 173 ITYGLRKYGYYENSIMIFSTDNGGQPL---------SGGSNWPLRGRKGTYWEGGVRGLG 223
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ SP +++ RVS ++HI+DW PTL AGG S + +DG D W ++
Sbjct: 224 FIHSPLLRKKKRVSKALVHITDWYPTLVGLAGGKESHIE-GVDGYDVWEAI 273
Score = 46.6 bits (109), Expect = 0.036, Method: Compositional matrix adjust.
Identities = 27/96 (28%), Positives = 49/96 (51%), Gaps = 7/96 (7%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 64
S AW G+Y+T L+T+ +++ P +PLF++L+ A H L+ P+E I
Sbjct: 93 SVAWSQKGKYSTHLYTQRVRKILATHDPRSQPLFIFLSFQAVHT-----PLQCPREYIYP 147
Query: 65 FQYITDPNRRTYAALTKSTTLTLL-IVYGWNDLSFH 99
++ + + RR YAA+ + + I YG ++
Sbjct: 148 YRGLENIARRKYAAMVSAVDEAVRNITYGLRKYGYY 183
>gi|443703066|gb|ELU00815.1| hypothetical protein CAPTEDRAFT_95989 [Capitella teleta]
Length = 382
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 123/365 (33%), Positives = 190/365 (52%), Gaps = 27/365 (7%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ND+ F N I +PN+DALA +GIIL N Y P C+PSR S ++G+Y + MQ
Sbjct: 31 YGFNDVGFRNPNVI-SPNMDALAQSGIILTNAYTAPQCSPSRGSFLSGRYSYKSAMQHGV 89
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I +P+ + L LP YL+ELGY T A GKWHLG+ R E TP +RGF+S G +G
Sbjct: 90 ILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRDECTPTHRGFDSFSGGFSGEG 149
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y+H T +D +D +G+++ DL + +++ PLF+
Sbjct: 150 EFYEHT-------TATGGYYDWHLGTEVDYDAIGKHSEDLIGYYVNKTLDEYDQSSPLFM 202
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPNRRTYAAMVKKLDDSVGTVISALQR 327
Y+A H+ L+ E + + + TD + ++ +V +D +G ++ ++
Sbjct: 203 YVAFHNVHS-----PLDPKPEFLALYDDLDASTDYSGFFFSGLVSGMDYIIGGIVDKIKE 257
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ EN+ I+F SDNG E G N P+RG K+TLWEGG K + SP +
Sbjct: 258 IGIYENTYILFSSDNGGDVGE----------GDNSPHRGGKSTLWEGGCKANSWTHSPLL 307
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
+ + +MH++DWLPT+ AGG +DG Q ++ N S R + I +D
Sbjct: 308 GASGVENNGLMHVTDWLPTIVDLAGGSVPASD-GLDGFVQTDMVINNGDSARTTMIYNID 366
Query: 448 QWSSL 452
+ S +
Sbjct: 367 RESDI 371
>gi|340619607|ref|YP_004738060.1| sulfatase [Zobellia galactanivorans]
gi|339734404|emb|CAZ97781.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 463
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 126/351 (35%), Positives = 190/351 (54%), Gaps = 34/351 (9%)
Query: 75 TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
+++L + +++V GW D+ F+G+ +IPTPN+D LA GI+ +N Y + P C+
Sbjct: 16 AFSSLAQEKKPNIIVVLTDDQGWADVGFNGATDIPTPNLDRLASEGIVFDNAYVSHPYCS 75
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAE---PRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSRA L+TG+Y G + +E G PL+E+ +PE L+E GY T AIGKWHLG
Sbjct: 76 PSRAGLLTGRYQARFGHDCNMPYDSENDDTVGTPLSEKMIPEALKEHGYRTSAIGKWHLG 135
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
P+++GF+ FG+ G ++Y+ + D +T+ NG + +N Y
Sbjct: 136 D-HPSLHPIHQGFDHWFGFAGGGMNYWG--IPDGPIKTIVRNGEPVPQNEL-------RY 185
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
TD FT EA+ I + DKP F+YLA+ A HA + +A ++ + Q ++I R
Sbjct: 186 LTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHAPD-----QATKQYLEQTKHIEYGGRSI 239
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
YAAMV +D +VG + S L GM EN+I++F+SDNG N PYRG
Sbjct: 240 YAAMVNAVDANVGRIDSTLVANGMKENTILVFLSDNGGRGEH----------ADNRPYRG 289
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
K L+EGG+KVP + P+ + + +M+ D PT AAGG +
Sbjct: 290 HKGMLFEGGIKVPFFITWPKKLKGGQHYEKMVSSLDLFPTFLEAAGGKVKK 340
Score = 42.4 bits (98), Expect = 0.75, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD FT EA+ I + DKP F+YLA+ A HA + +A ++ + Q ++I R
Sbjct: 185 YLTDDFTDEAIDFITKKD-DKPFFMYLAYNAPHAPD-----QATKQYLEQTKHIEYGGRS 238
Query: 75 TYAALTKST 83
YAA+ +
Sbjct: 239 IYAAMVNAV 247
>gi|354488427|ref|XP_003506371.1| PREDICTED: arylsulfatase I [Cricetulus griseus]
Length = 560
Score = 204 bits (519), Expect = 1e-49, Method: Compositional matrix adjust.
Identities = 110/272 (40%), Positives = 160/272 (58%), Gaps = 19/272 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 113 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 171
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGFFR+E P RGF++ G L G +
Sbjct: 172 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNVD 231
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ PLFLY
Sbjct: 232 YYTYDNCD----GPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 287
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 288 VAFQAVH-----TPLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 342
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
NS+IIF SDNG T + GSN+P
Sbjct: 343 NNSVIIFSSDNGGQTF---------SGGSNWP 365
Score = 46.6 bits (109), Expect = 0.040, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ PLFLY+A A H L++P+E + ++
Sbjct: 253 TVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLYVAFQAVH-----TPLQSPREYLYRY 307
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 308 RTMGNVARRKYAAMV 322
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LFN+ DP E+ ++A RPD+ L L + RT +P + + +A P W
Sbjct: 453 WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 511
Query: 628 PW 629
PW
Sbjct: 512 PW 513
>gi|323454261|gb|EGB10131.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 635
Score = 203 bits (517), Expect = 2e-49, Method: Compositional matrix adjust.
Identities = 160/531 (30%), Positives = 242/531 (45%), Gaps = 86/531 (16%)
Query: 92 GWNDLSF----HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
G+ND+ + H +N++ TP +D LA G+ L Y Q C+PSR +L+TG YP TG+
Sbjct: 129 GYNDIGYNRAPHQTNQVSTPFLDELASEGVTLTRYYTQCDCSPSRGALLTGLYPASTGLY 188
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I G+PL +P++L Y + AIGKW +G + + P RGF S+ G+
Sbjct: 189 HGVIVTQSHWGLPLEYHLIPQFLPSR-YRSHAIGKWDVGHYTWNHVPTGRGFHSYVGFYG 247
Query: 208 GVISYYDHIL-----SDQYSRTVELNGHDMRRNLSTAWDTVGE-----YATDLFTKEAVQ 257
I YY H + S S ++ +D +L+ G+ Y+TD+FT AV+
Sbjct: 248 TDIDYYTHEIGAGCNSYNCSSAIKRCMNDSITDLNYDGAATGDEYYNRYSTDIFTDRAVE 307
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----YITDPNRRTYAAMVKK 313
L+ + PLFLY+A A H LEA TI++F Y R T+AA
Sbjct: 308 LLRTESARNPLFLYVAFNAVHGP-----LEADDATIDRFNASFLYEARTERATFAAAALL 362
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
+D +V +++ + G +NSI++F SDNGA + GSN+P RG K T +E
Sbjct: 363 MDRAVESIVKTMHAVGAYDNSIVLFASDNGATLAQ-------TGGGSNWPLRGSKFTPYE 415
Query: 374 GGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
GGV+VPA L SP + R + H++D LPTL AAGG P +IDG++Q+ +L
Sbjct: 416 GGVRVPAFLHSPLLGSGRRGITHAGLFHVTDILPTLVHAAGGSP---PSDIDGVNQYHAL 472
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
G D +S S R VL+++D ++ L G G
Sbjct: 473 F------------GSDATAS------SPRADVLVHVD----VYGYETEAMGLATGAYIEG 510
Query: 492 TMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQ--QATIH 549
+ T + +S+TY+ + LP + DK+ + A H
Sbjct: 511 DLKVVVNATEATWCA-------PDSETYRYADECWD---LPATYCDKVAENKSGYAAYFH 560
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH 600
YLF+L DP EQ+++ + P +LY + H
Sbjct: 561 ----------------KTYLFDLSVDPEEQHDLKDAMPAKFRKLYAAFQAH 595
Score = 43.9 bits (102), Expect = 0.30, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 9/68 (13%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----YITD 70
Y+TD+FT AV+L+ + PLFLY+A A H LEA TI++F Y
Sbjct: 296 YSTDIFTDRAVELLRTESARNPLFLYVAFNAVHGP-----LEADDATIDRFNASFLYEAR 350
Query: 71 PNRRTYAA 78
R T+AA
Sbjct: 351 TERATFAA 358
>gi|298710054|emb|CBJ31771.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 588
Score = 203 bits (516), Expect = 3e-49, Method: Compositional matrix adjust.
Identities = 177/619 (28%), Positives = 272/619 (43%), Gaps = 126/619 (20%)
Query: 92 GWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ-GP 149
GW+D+ + + + TP +D LA G+ + N Y CTP+RASLMTG+Y + GMQ
Sbjct: 6 GWDDIGYQSVDLKGVTPVLDKLAAGGVKITNYYTMNTCTPARASLMTGRYTVRYGMQYNV 65
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
I EP GVPL+E+ LPEY +E GY T +GKWHLG E+ P RGF+++ GY+ G
Sbjct: 66 AINPGEPWGVPLSEKMLPEYFKEAGYGTHLVGKWHLGSHSPEHIPSQRGFDTYMGYVGGF 125
Query: 210 ISYYDH----ILSD-------------------QYSRTVELNGHDMRRNLSTAW----DT 242
+Y+ H ++SD + ++ G D R + A D
Sbjct: 126 EAYWTHETVGVISDGRHVCDFGFGNASGYYDIIDRPSSQDIYGSDHREDQKEAEENRDDG 185
Query: 243 VGE---------------------------YATDLFTKEAVQLIEDQ-PVD-KPLFLYLA 273
GE Y+T +F + A++++E + P D +P++++LA
Sbjct: 186 DGEGVHSTTPTKTPWASSFRSNSGVMKGQQYSTQMFQERAIEILETKSPFDEEPIYMHLA 245
Query: 274 HLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
H A H + + L+ + I Q RR ++ LD SVG +I
Sbjct: 246 HQAVHSPLGLPPADSFSEDETLLLDEIEGKIGDVQGAQ--TRRHLTEVLMFLDKSVGHLI 303
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
L+ +G +ENSI++ SDNG + E GSNYP RG K + WEGG KVPA +
Sbjct: 304 HYLEDEGWMENSIVVVASDNGGDSNE---------GGSNYPLRGKKASYWEGGSKVPAFV 354
Query: 383 WS----PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
+S P+ ++ +MH++DWLPTL AG + S +DG++QW ++ + +
Sbjct: 355 YSTSHIPEARRGSEYD-GLMHVTDWLPTLAAGAGLEMSGSAGPLDGVNQWEHIVGSAAA- 412
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYG 498
+DG + LL N R L+ L +++ + G G +
Sbjct: 413 ----VDGGRPRTELLYNYDPYR---LLPNSNHCYCCWCSLCHYRVY----DTGYAQGAF- 460
Query: 499 QTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCG--ANPAP 556
RS K ++ FN L ++ +S + CG N
Sbjct: 461 --RSGKYKMM-FNVSCHGWYTFDGDMLEEDHLTDLSKV-------------CGGTCNDGG 504
Query: 557 MTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL----YELLKYHRRTLVPQSHEQP 612
T+ YLF+L DP E NN+ PD++ ++ YE+ R E
Sbjct: 505 TCSDEATSD--YLFDLDQDPRESNNLIDELPDVAQEMRDRWYEVT---FREYTDTVFEPL 559
Query: 613 DLVQADPKRFNDTWS-PWI 630
DL D N+ W PW+
Sbjct: 560 DLKAYDVWAENNWWMIPWL 578
>gi|443705385|gb|ELU01963.1| hypothetical protein CAPTEDRAFT_143986, partial [Capitella teleta]
Length = 345
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 116/327 (35%), Positives = 172/327 (52%), Gaps = 34/327 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ + I TPN+DALA G+IL N Y Q +CTPSR +LM+G+YP + MQ I
Sbjct: 37 GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMSGRYPSASAMQSMVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E R L +FLP+YL+ELGY +GKWHLG+ R E P RGF++ +G G
Sbjct: 96 QPMEARCAGLEYKFLPQYLKELGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGAGD 155
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + +Y D N ++ G ++ DL + ++ ++ PLFLY
Sbjct: 156 YWSHEIFGKY---------DWHINGDIHFEANGTHSQDLEMEGLDKVFDEHDSKDPLFLY 206
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----TDPNRRTYAAMVKKLDDSVGTVISALQ 326
A H P + ++F + D RR Y + LD VG ++ L+
Sbjct: 207 FAPQNPH---------TPSQPTDEFLNLYPEDKFDDIRRKYLGLTSGLDAMVGKIVEKLE 257
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
G+++N+ +IF+SDNGA E G N P+RG K +L+EGG K + ++SP
Sbjct: 258 NNGLMDNTYVIFVSDNGADPPE----------GLNTPFRGGKGSLFEGGTKSNSFVYSPL 307
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
+ + + +MHI+DWLPTL AGG
Sbjct: 308 LNKTGYENDGLMHITDWLPTLVKLAGG 334
>gi|410642189|ref|ZP_11352707.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
gi|410648635|ref|ZP_11359039.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410131832|dbj|GAC07438.1| arylsulfatase I/J [Glaciecola agarilytica NO2]
gi|410138506|dbj|GAC10894.1| arylsulfatase I/J [Glaciecola chathamensis S18K6]
Length = 473
Score = 202 bits (515), Expect = 4e-49, Method: Compositional matrix adjust.
Identities = 125/348 (35%), Positives = 186/348 (53%), Gaps = 34/348 (9%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
LL G+ D+ + GS +I TPNID LA G+ L + YA P+C+P+RA+LMTGK P++
Sbjct: 47 FILLDDLGYGDVGYLGS-QIQTPNIDNLASQGVTLKHGYAYPICSPTRAALMTGKNPLNF 105
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
G+ GP A +P +PE +E GY T +GKWHLG +R P RGF+ +G
Sbjct: 106 GIDGPMENDAM---LPEDLTTMPERFQEAGYQTWMVGKWHLGMAKRSAMPHNRGFDDFYG 162
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+L G + YY H V G D + N T+ G + T+L T +A+ I
Sbjct: 163 FLGGFVDYYTH---------VYFGGLDWQNN-DTSLREEG-FVTELLTAKAIDKITHFKG 211
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
DKP F+YL++ A H L+ IN + +R+ +A M +D +G V++A
Sbjct: 212 DKPFFMYLSYSAPHT-----PLQYVPNAINDYSGFESKDRQVFADMTSDVDQDIGEVLAA 266
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
L ++G+ EN+I++FMSDNG N +N +G K++++EGGV+VP ++ W
Sbjct: 267 LDKRGIAENTIVVFMSDNGG---------NLEAGANNGSLQGGKSSVYEGGVRVPVLIRW 317
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
Q+Q + Q M DW PTL AA D P + DG+ Q+ +L
Sbjct: 318 PAQLQS--KTMTQPMFTQDWAPTLLEAANIDYQ--PNDFDGVSQFVNL 361
>gi|443692244|gb|ELT93884.1| hypothetical protein CAPTEDRAFT_107177, partial [Capitella teleta]
Length = 328
Score = 202 bits (514), Expect = 5e-49, Method: Compositional matrix adjust.
Identities = 120/327 (36%), Positives = 174/327 (53%), Gaps = 22/327 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ F ++ TPN+D LA G+IL N Y Q VCTPSR +LMTG+YP + MQ I
Sbjct: 12 GYNDIGFRNP-DVQTPNLDYLANKGVILTNNYVQAVCTPSRHALMTGRYPYRSAMQNFVI 70
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ + L +FLP+YL+ELGY IGKW+LG+ R E P RGF+S FG L+G
Sbjct: 71 NPDQAKCTALEYKFLPQYLKELGYQNHLIGKWNLGYCREECLPTSRGFDSFFGLLDGAGD 130
Query: 212 YYDHILSDQYSRTVE-LNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
Y++H Y T + L G + + +S + DL ++ + +
Sbjct: 131 YWEHTTYGLYDCTGQSLAGMACLCEFTQKISILMPVIC--FQDLELDRLDKIFTEHDNKQ 188
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLY A H + + + + F+ + RR Y A+ LD VG V+ L
Sbjct: 189 PLFLYFAPQNPHTPSQPTDEFLNKYSADDFEDV----RRRYLALTSGLDSMVGKVVELLN 244
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
M++N+ +IF+SDNGA E G N P+RG K +L+EGG K + ++SP
Sbjct: 245 DNHMMDNTYLIFLSDNGADPAE----------GLNAPFRGGKGSLFEGGTKSSSFIYSPL 294
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
+++ + +MHI+DWLPTL AGG
Sbjct: 295 LKKTEYENDGLMHITDWLPTLVKLAGG 321
>gi|395504882|ref|XP_003756775.1| PREDICTED: arylsulfatase I [Sarcophilus harrisii]
Length = 598
Score = 201 bits (512), Expect = 7e-49, Method: Compositional matrix adjust.
Identities = 102/251 (40%), Positives = 157/251 (62%), Gaps = 10/251 (3%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS+ I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 225 GYHDVGYHGSD-IQTPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 283
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E+GYST +GKWHLGF+++ P RGF++ G L G +
Sbjct: 284 RPRQPSCLPLDQVTLPQKLQEVGYSTHMVGKWHLGFYKKACLPTRRGFDTFLGSLTGNVD 343
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G+D+ S AW+ G+Y+T L+ + A Q++ +PLFLY
Sbjct: 344 YYTYDNCDGPG----VCGYDLHEGESVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLY 399
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 400 VAFQAVHT-----PLQSPREYLYRYRSMGNVARRKYAAMVTCMDEAVRNITWALKRYGYY 454
Query: 332 ENSIIIFMSDN 342
+NS+IIF +DN
Sbjct: 455 DNSVIIFSTDN 465
Score = 50.1 bits (118), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
S AW+ G+Y+T L+ + A Q++ +PLFLY+A A H L++P+E + ++
Sbjct: 365 SVAWEQSGQYSTLLYAQRASQILASHNPRQPLFLYVAFQAVHT-----PLQSPREYLYRY 419
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 420 RSMGNVARRKYAAMV 434
Score = 47.0 bits (110), Expect = 0.029, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 36/70 (51%), Gaps = 5/70 (7%)
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH--EQPDLVQADPK 620
T +LFN+ DP E+ ++A+ RPDI L L Y+ RT +P + E P +A P
Sbjct: 484 TRQSVWLFNISADPYEREDLAAQRPDIVRSLLARLAYYNRTAIPVRYPAENP---RAHPD 540
Query: 621 RFNDTWSPWI 630
W PW+
Sbjct: 541 FNGGAWGPWV 550
>gi|291225025|ref|XP_002732499.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 307
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 119/324 (36%), Positives = 173/324 (53%), Gaps = 26/324 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I PN+ LA +G+I N Y P CTPSRA++MTG YP TG Q
Sbjct: 2 GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMA 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG-YLNGVI 210
+ P GVPL + LPE L+E+GYST +GKWHLGF + EY P RGF+SH+G + GV
Sbjct: 61 FNLHPSGVPLEFKLLPEKLKEVGYSTHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 120
Query: 211 SY--YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
Y D +LS G+D R N+ + +Y + A ++ PL
Sbjct: 121 DYDKMDGVLSPS-------AGYDFRDNMGVVPKS-DDYLALMLGDRAEHIVNTHYPGTPL 172
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
F LA KHLE P+E ++ I D R + + +DD +G V+ A++ +
Sbjct: 173 F-----LAFTLDIPAKHLEIPEEYEEKYAEIEDDRTRQFYGKLTMMDDIIGGVVDAMKNR 227
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
GM ++S+IIF+ DN + + GSNYP+RG+ TL+EG +VP+I+ I+
Sbjct: 228 GMYDDSLIIFIGDNEGLSSQS---------GSNYPFRGIAGTLFEGATRVPSIVSGKGIK 278
Query: 389 QNPRVSLQMMHISDWLPTLYTAAG 412
+ S ++ I D T+ AG
Sbjct: 279 KTGYESNELYSIVDIHRTILDVAG 302
>gi|354581367|ref|ZP_09000271.1| sulfatase [Paenibacillus lactis 154]
gi|353201695|gb|EHB67148.1| sulfatase [Paenibacillus lactis 154]
Length = 446
Score = 201 bits (511), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 143/412 (34%), Positives = 214/412 (51%), Gaps = 47/412 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +GS+ + TPN+D+LA GI N Y+ PVC+PSRASL+TGKYP G+
Sbjct: 22 GYGDLGCYGSDTVTTPNLDSLAGEGIRFTNWYSNSPVCSPSRASLLTGKYPARAGVG--E 79
Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
I GA+ G+P TE L + L+ GY T GKWHLG E +P GF+ FG+ G
Sbjct: 80 ILGAKRGLDGLPSTEVTLAKALKPAGYRTALYGKWHLGV-SEETSPNAHGFDEFFGFKAG 138
Query: 209 VISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIE-DQPVDK 266
I +Y HI + + +N HD+ N + W+ G Y T+L T+ +V I+ + +
Sbjct: 139 CIDFYSHIF--YWGQGHGVNPLHDLWENETEVWEN-GRYMTELITERSVDFIKRSREQED 195
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
P FL++++ A H + AP+E +++F ++ +R+ AAM+ +DD VG ++ AL+
Sbjct: 196 PFFLFVSYNAPH-----YPMHAPKEYMDRFAHLP-WDRQVMAAMIAAVDDGVGEIVKALK 249
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
G EN++I F SDNG P+ E R T + G+ +RG K +L+EGG++ PAIL
Sbjct: 250 AAGCYENTVIFFSSDNG-PSSESRNWLDGTEDVYYGGTAGIFRGHKASLFEGGIREPAIL 308
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
P + +V ++ + D +PT AG D + PL +
Sbjct: 309 SWPNGLEGGQVRDEVGVMMDIVPTFLDLAGVDPATRPL------------------EGAE 350
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
+DG LL+ PS + E + AVR WKLVL NG +D
Sbjct: 351 LDGSSLKEMLLMGEPSPHKQLFW---EYQGQLAVREGDWKLVL----NGKLD 395
>gi|291225031|ref|XP_002732500.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 286
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 114/299 (38%), Positives = 166/299 (55%), Gaps = 22/299 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +H + +I PN+ LA +G+I N Y P CTPSRA++MTG YP TG Q +
Sbjct: 2 GWNDVHWH-NPDIAMPNLMDLAADGVIFNQSYTHPTCTPSRAAMMTGLYPFKTGNQHQMV 60
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ P GVPL + LPE L+E+GY+T +GKWHLGF + EY P RGF+SH+G +
Sbjct: 61 FNLHPSGVPLNFKLLPEKLKEVGYATHMVGKWHLGFCKDEYLPTNRGFDSHYGLWTLGVG 120
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL- 270
YD L+ S + G+D R N+ + G Y + A ++ PLFL
Sbjct: 121 DYDK-LNGVLSPSA---GYDFRDNMGVVPKSDG-YLALMLGDRAEHIVNTHYPGTPLFLT 175
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+ + A KHLE P+E + I D R + + +DD +G V+ AL+ +GM
Sbjct: 176 FTLDIPA------KHLEIPEEYEEAYADIEDDRTRQFYGKLTMMDDIIGGVVDALKNRGM 229
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
++S+IIF+ DNGA + + GSNYP+RG+ TL+EG VP+I+ I++
Sbjct: 230 YDDSLIIFIGDNGALSSQS---------GSNYPFRGIAGTLFEGATCVPSIVSGKGIKK 279
>gi|443700441|gb|ELT99395.1| hypothetical protein CAPTEDRAFT_208054 [Capitella teleta]
Length = 558
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 153/513 (29%), Positives = 236/513 (46%), Gaps = 77/513 (15%)
Query: 88 LIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
L + G+ D + S I TPNID L +GI N Y+ C+PSR+S ++G+YP +GMQ
Sbjct: 78 LALPGFQDAGYRNS-AIHTPNIDKLVGDGISFTNAYSSQQCSPSRSSFLSGRYPYKSGMQ 136
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I P + L +FL +YL++L Y+T A+GKWHLG+ ++ TP YRGF++ G +
Sbjct: 137 HGVISDEGPNCMDLKFKFLSDYLKDLNYNTHAVGKWHLGYCNKKCTPTYRGFDTFSGGYS 196
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G YY+H T +D T G ++ DL + ++ + D+ P
Sbjct: 197 GEGDYYEH--------TTHGGWYDWHNGTETDLTAAGVHSQDLVEADMMKNL-DENDGTP 247
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
F Y A H+ K P+ P+R+ Y +V +D +G V++ L+
Sbjct: 248 FFYYTAFHNTHSPLQPK----PEHKALYNDMDATPSRKKYLGLVSGMDAVIGNVVAKLKA 303
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
KGM +N+ I F SDNG N G N P+RG K++L+EGG + SP I
Sbjct: 304 KGMFDNTYIFFSSDNGGDV----------NEGDNSPWRGAKSSLFEGGCHAHSWASSPLI 353
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
++ + ++H++DWLPT+ T AGG +P IDG++Q L+ N GL
Sbjct: 354 RKKGVEADGLLHMTDWLPTILTLAGG---TVPDGIDGIEQ-VDLITNA---------GL- 399
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYYGQT 500
S+R +++ NID + T A R +KL+ G + G DGY
Sbjct: 400 ----------SKRTNMIYNIDMELETGPPNFGCMAARDLRYKLIWGFE--GRSDGY---- 443
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR-----STRQQATI--HCGAN 553
+ + N + +V+ L + + N+DK S RQ +
Sbjct: 444 GVDADFIYNHDRVVD---------LVEEEEFIVENVDKRGGPYPLSHRQHRIFSEYVSLT 494
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR 586
M GP LF+L DP E ++ S+
Sbjct: 495 TVTMADVQAGTGPMMLFDLQEDPNETTDLIESQ 527
>gi|119582144|gb|EAW61740.1| arylsulfatase family, member I [Homo sapiens]
Length = 426
Score = 201 bits (510), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 142/454 (31%), Positives = 212/454 (46%), Gaps = 79/454 (17%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+GKWHLGF+R+E P RGF++ G L G + YY + D + G D+ + A
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCDGPG----VCGFDLHEGENVA 57
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
W G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + +++ +
Sbjct: 58 WGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ RR YAAMV +D++V + AL+R G NS+IIF SDNG T G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN+P RG K T WEGGV+ + SP +++ R S +MHI+DW PTL AGG TS
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
+DG D W ++ S R + +D L ++ S+ I AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+ WKL+ G D YG + +P Q+L + + N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
+M S RQ +LFN+ DP E+ ++A RPD+ L L
Sbjct: 311 RMASVRQA---------------------VWLFNISADPYEREDLAGQRPDVVRTLLARL 349
Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
+ RT +P + E P +A P W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 55 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 110 RTMGNVARRKYAAMV 124
>gi|410039868|ref|XP_003950702.1| PREDICTED: arylsulfatase I isoform 1 [Pan troglodytes]
Length = 426
Score = 200 bits (509), Expect = 1e-48, Method: Compositional matrix adjust.
Identities = 142/454 (31%), Positives = 212/454 (46%), Gaps = 79/454 (17%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+GKWHLGF+R+E P RGF++ G L G + YY + D + G D+ + A
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSLTGNVDYYTYDNCD----GPGVCGFDLHEGENVA 57
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
W G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + +++ +
Sbjct: 58 WGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ RR YAAMV +D++V + AL+R G NS+IIF SDNG T G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN+P RG K T WEGGV+ + SP +++ R S +MHI+DW PTL AGG TS
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
+DG D W ++ S R + +D L ++ S+ I AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+ WKL+ G D YG + +P Q+L + + N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
+M S RQ +LFN+ DP E+ ++A RPD+ L L
Sbjct: 311 RMASVRQAV---------------------WLFNISADPYEREDLAGQRPDVVRTLLARL 349
Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
+ RT +P + E P +A P W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380
Score = 48.1 bits (113), Expect = 0.014, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 42/75 (56%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 55 NVAWGLSGQYSTMLYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 110 RTMGNVARRKYAAMV 124
>gi|344250866|gb|EGW06970.1| Arylsulfatase I [Cricetulus griseus]
Length = 484
Score = 199 bits (507), Expect = 3e-48, Method: Compositional matrix adjust.
Identities = 104/251 (41%), Positives = 152/251 (60%), Gaps = 10/251 (3%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ +HGS +I TP +D LA G+ L N Y QP+CTPSR+ L+TG+Y IHTG+Q I
Sbjct: 60 GYHDVGYHGS-DIETPTLDRLAAEGVKLENYYIQPICTPSRSQLLTGRYQIHTGLQHSII 118
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL + LP+ L+E GYST +GKWHLGFFR+E P RGF++ G L G +
Sbjct: 119 RPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFFRKECLPSCRGFDTFLGSLTGNVD 178
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YY + D + G D+ + AW G+Y+T L+ + A ++ PLFLY
Sbjct: 179 YYTYDNCD----GPGVCGFDLHEGETVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLY 234
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
+A A H L++P+E + +++ + + RR YAAMV +D++V + AL+R G
Sbjct: 235 VAFQAVHT-----PLQSPREYLYRYRTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFY 289
Query: 332 ENSIIIFMSDN 342
NS+IIF SDN
Sbjct: 290 NNSVIIFSSDN 300
Score = 46.6 bits (109), Expect = 0.043, Method: Compositional matrix adjust.
Identities = 35/116 (30%), Positives = 56/116 (48%), Gaps = 15/116 (12%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T L+ + A ++ PLFLY+A A H L++P+E + ++
Sbjct: 200 TVAWGLSGQYSTMLYAQRASHILASHSPQNPLFLYVAFQAVHT-----PLQSPREYLYRY 254
Query: 66 QYITDPNRRTYAALTKSTTLTL------LIVYGWNDLS---FHGSNEIPTPNIDAL 112
+ + + RR YAA+ + L YG+ + S F NEI NID L
Sbjct: 255 RTMGNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNEI-LHNIDPL 309
Score = 40.4 bits (93), Expect = 3.0, Method: Compositional matrix adjust.
Identities = 20/62 (32%), Positives = 31/62 (50%), Gaps = 1/62 (1%)
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWS 627
+LFN+ DP E+ ++A RPD+ L L + RT +P + + +A P W
Sbjct: 377 WLFNISADPYEREDLAGQRPDVVRTLLARLADYNRTAIPVRYPAAN-PRAHPDFNGGAWG 435
Query: 628 PW 629
PW
Sbjct: 436 PW 437
>gi|443692243|gb|ELT93883.1| hypothetical protein CAPTEDRAFT_107171, partial [Capitella teleta]
Length = 330
Score = 199 bits (506), Expect = 4e-48, Method: Compositional matrix adjust.
Identities = 115/327 (35%), Positives = 168/327 (51%), Gaps = 25/327 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+NDL + I TPN+DALA G+IL N Y Q VC+PSR +LMTG+YP + MQ I
Sbjct: 14 GYNDLGLRDPDVI-TPNMDALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVI 72
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E + L +FLP+YL++LGY IGKWHLG+ R E P RGF++ +G
Sbjct: 73 QPMEAKCSGLKYKFLPQYLKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGD 132
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y++H + Y E R + + DL + ++ ++ PLFLY
Sbjct: 133 YWEHGIMGMYDWHTEAGVDFEARGTHAQVGSRYWHIYDLEMERLDKVFDEHDNKDPLFLY 192
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----RRTYAAMVKKLDDSVGTVISALQ 326
A +H P + ++F + + RR Y + LD VG ++ L
Sbjct: 193 FAPQNSH---------TPHQPTDEFINLYSEDKFSDIRRKYLGLTSGLDAMVGKIVDKLT 243
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
M+ N+ +IF+SDNGA +E G N P+RG K +L+EGG K + ++SP
Sbjct: 244 ENDMMNNTYVIFVSDNGADPIE----------GRNAPFRGGKGSLFEGGTKSSSFIYSPL 293
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGG 413
+++ + +MHI+DWLPTL AGG
Sbjct: 294 LKKTEYENDGLMHITDWLPTLVKLAGG 320
>gi|298706912|emb|CBJ29739.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 781
Score = 199 bits (505), Expect = 5e-48, Method: Compositional matrix adjust.
Identities = 165/565 (29%), Positives = 258/565 (45%), Gaps = 118/565 (20%)
Query: 87 LLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
L+ G+ D+ + ++ EI TPN+DALA G+ L+N Y +CTP+RAS+MTG+YP+
Sbjct: 223 LIDDMGFGDMGYQSTDLSEI-TPNLDALAAGGVKLSNYYTMTLCTPARASIMTGRYPVRY 281
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
GMQ I P G+P +E+ LPEY+ E GY + +GKWHLG +R E P RGF++ G
Sbjct: 282 GMQYSVIMPGSPWGLPTSEKILPEYMNEAGYESHMVGKWHLGSYRDESLPSQRGFKTFLG 341
Query: 205 YLNGVISYYDH-----ILSDQY-----------SRTVELNGHDMR-----RNLSTAWDTV 243
YLNG+ +YY H + QY V L HD + W V
Sbjct: 342 YLNGIETYYSHKNPEASVDGQYFFDFGYGNATGYHDVTLQNHDENVGGPCTDGGPRWGDV 401
Query: 244 ------------GEYATDLFTKEAVQLIEDQ-PVDK-PLFLYLAHLAAHA--GNAGKHLE 287
G Y+TD F A Q+++ + P D+ PLF+Y+AH + H+ G A
Sbjct: 402 MENEDPADVCFTGTYSTDAFVGRAKQIVKSKAPFDEDPLFMYIAHQSVHSPTGPAPYEEF 461
Query: 288 APQE--TINQFQYITDPNRRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
+P+E + + T+ +RT +A ++ LD +G + L+++G LENS+I+ SDNGA
Sbjct: 462 SPEELALLEGVKDATESEKRTAFAGVLLYLDKRIGEFVDMLEQEGWLENSVIVVSSDNGA 521
Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV-----PAILWSPQIQ----QNPRVSL 395
+ GSNYP RG K +++EGG + PA+L Q +PR
Sbjct: 522 CP---------DDGGSNYPLRGSKQSVFEGGSRTGYQPSPALLGKRQPAGLNTSSPRYLC 572
Query: 396 QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN-------IDGLDQ 448
++ D +AG +DG+DQW L N S + L
Sbjct: 573 NPCNLPD-------SAGA--------LDGVDQWECLKSNGVRGAGSGNTSKPPRTELLYN 617
Query: 449 WSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLL 508
W S LL++ + D + A R WKL++ N GY+
Sbjct: 618 WDSYLLSSADG-----LKEDLEFSQGAFRSGDWKLLV----NVFCSGYFSHDL------- 661
Query: 509 NFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCY 568
A++ES L ++ + N++ + + ++ G P
Sbjct: 662 ---AVIESDDL-----LDSDVACGVGNME-CQDCCDRCQLYSGDYPVSDR---------- 702
Query: 569 LFNLGNDPCEQNNIASSRPDISSQL 593
L+N+ DP E++++ P+ + +L
Sbjct: 703 LYNVREDPREEHDLYDQYPEFAKEL 727
>gi|315644664|ref|ZP_07897795.1| sulfatase [Paenibacillus vortex V453]
gi|315279923|gb|EFU43222.1| sulfatase [Paenibacillus vortex V453]
Length = 439
Score = 199 bits (505), Expect = 6e-48, Method: Compositional matrix adjust.
Identities = 142/420 (33%), Positives = 213/420 (50%), Gaps = 49/420 (11%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
+IVY G+ DL +GS+ + TP++D LA G+ N Y+ PVC+PSRASL+TGKYP
Sbjct: 6 FVIVYCDDLGYGDLGCYGSDSVRTPHLDGLADEGVRFTNWYSNSPVCSPSRASLLTGKYP 65
Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
+ G+ I GA+ G+P E L + L+ GY T GKWHLG +E +P GF
Sbjct: 66 VRAGVG--EILGAKRGSHGLPAAEVTLAKALKPAGYRTALYGKWHLGL-SKETSPNAHGF 122
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
+ FG+ G + +Y HI V HD+ N + W+ G Y T+L T+ +V I
Sbjct: 123 DEFFGFKAGCVDFYSHIFYWGQGHGVNPL-HDLWENETEVWEN-GRYMTELITERSVDFI 180
Query: 260 E-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ + + P FL+ ++ A H + APQE +++F ++ +R+ AAM+ +DD V
Sbjct: 181 KRSREQEAPFFLFASYNAPH-----YPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDGV 234
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWEG 374
G ++ AL+ G E+++I F SDNG P+ E R T + GS +RG K +L+EG
Sbjct: 235 GEIVEALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDIYYGGSAGIFRGHKASLFEG 293
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G++ PAIL P + +V ++ + D +PT AG D + PL
Sbjct: 294 GIREPAILNWPNGWEGGQVRDEVAAMMDIVPTFLDLAGVDLAAGPL-------------- 339
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
+ +DG L PS + E + AVR +WKLVL NG +D
Sbjct: 340 ----QGITLDGSSLKEMLFRGKPSPHKQLFW---EYQGQLAVREGNWKLVL----NGKLD 388
>gi|323452295|gb|EGB08169.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 614
Score = 198 bits (503), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 136/427 (31%), Positives = 224/427 (52%), Gaps = 50/427 (11%)
Query: 83 TTLTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+L G+ND+ + S+ E+ TP +D L +G+ ++ +Y Q VCTPSRA+++TGK
Sbjct: 65 VVFVMLDDVGFNDVGYASSDLGEM-TPFLDGLMADGVRVDRLYGQQVCTPSRAAMLTGKL 123
Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
PIH +Q + +EP G+P E L +YL+ LGYST +GKWHLG + TPL RGF+
Sbjct: 124 PIHLELQHWQVAPSEPWGLPTREATLAQYLKALGYSTHMVGKWHLGHYNNASTPLNRGFD 183
Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
S +G+ +G + Y H S Y + D R ++ D G++ T L + A+ ++E
Sbjct: 184 SFYGFYSGGVDYLTHDPSTGYVWRCYRDLWDDERPVT---DAHGQHQTSLMNERAIAVLE 240
Query: 261 DQPVDK---PLFLYLAHLAAHAGNAGKHLEAPQETINQ----FQYITDPNRRTYAAMVKK 313
V+K P+F Y+++ AH L+ P E + + I + +R+ +AA++
Sbjct: 241 RHAVEKKSEPVFAYVSYPNAHL-----PLQPPTELLERRNATLLDIPNHDRKNFAALMMN 295
Query: 314 LDDSVGTVIS-ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
D+S+ + + L+ G+ + ++I+ SDNG + GSNYP RG K L+
Sbjct: 296 ADESLANLTNRGLKATGLYDTAVIVIASDNGGQV-------SAMGGGSNYPLRGEKKYLF 348
Query: 373 EGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
EGGV+ A++ SP + + R S ++MH+SDWLPTL A G ++ D+ +
Sbjct: 349 EGGVRAHAVIHSPLLPRAARGSSYGKLMHMSDWLPTLLQGALGASA---------DELAG 399
Query: 431 LLLNTPSRRN-SNIDGLDQWSSLLLNTPSR----RNSVLINIDEKKRT--------AAVR 477
L L + IDG+D W+ ++ + R+ +L+NID + AA+
Sbjct: 400 LKLPRGVVNDYGAIDGVDHWAHVVGAVDAAAAAPRDELLLNIDYLDGSGDYLGYFRAALI 459
Query: 478 LDSWKLV 484
+ WK +
Sbjct: 460 IGDWKFI 466
>gi|325286699|ref|YP_004262489.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322153|gb|ADY29618.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 494
Score = 198 bits (503), Expect = 8e-48, Method: Compositional matrix adjust.
Identities = 143/440 (32%), Positives = 215/440 (48%), Gaps = 63/440 (14%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L G++D+ F+GS +I TP +D LA G I + Y A P C PSRA+L+TGKYP
Sbjct: 46 LVLCDDLGYSDVGFNGSTDIKTPELDKLANAGTIFTSAYVAHPFCGPSRAALLTGKYPHT 105
Query: 144 TGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
G Q P + +G+ E+F+ + L+E GY T AIGKWHLG E+ P RGF
Sbjct: 106 IGSQFNLPANGESLGKGIDTNEQFIAKTLQESGYYTGAIGKWHLGA-TEEFHPNQRGFTD 164
Query: 202 HFGYLNGVISYY---------------DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+G+L G +Y+ I+ D Y +E NG +++ EY
Sbjct: 165 FYGFLGGGHNYFPEQYQAQYQKQKKAKKKIIRD-YILPLEHNGKEVKET---------EY 214
Query: 247 ATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
TD F++EA + +++ KP FLYLA+ A H LEA +E + +F I D +RR
Sbjct: 215 LTDAFSREASRFVKEASNKKKPFFLYLAYNAPHVP-----LEAKEEDLEKFSVIKDKDRR 269
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
TYAAMV +D + +++ L+ LEN++IIF+SDNG N + +N P
Sbjct: 270 TYAAMVYAVDRGIKQIVNTLKETNQLENTLIIFLSDNGG---------NTDHGATNNPLH 320
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
G K WEGG +VP ++ P+ + + D PTL A ++LP N +
Sbjct: 321 GRKGDTWEGGYRVPMFMYCPKNIAKGHTNNFPITSLDLYPTL---AHIGKAKLPDNKVLH 377
Query: 424 GLDQWSSLLLNTPSRRNSNIDGL-------------DQWSSLLLNT-PSRRNSVLINIDE 469
G + W S+L N + + + D+W +L ++ P + + +I E
Sbjct: 378 GKNVWESILNNKNIHKKETLFAMRHREGYTDVAARKDEWKALKVSKQPWKLFKITEDIGE 437
Query: 470 KKRTAAVRLDSWKLVLGTQE 489
KK +A D K ++ E
Sbjct: 438 KKDLSATHPDILKGLVNDAE 457
Score = 53.9 bits (128), Expect = 3e-04, Method: Compositional matrix adjust.
Identities = 29/67 (43%), Positives = 41/67 (61%), Gaps = 6/67 (8%)
Query: 14 EYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
EY TD F++EA + +++ KP FLYLA+ A H LEA +E + +F I D +
Sbjct: 213 EYLTDAFSREASRFVKEASNKKKPFFLYLAYNAPHVP-----LEAKEEDLEKFSVIKDKD 267
Query: 73 RRTYAAL 79
RRTYAA+
Sbjct: 268 RRTYAAM 274
>gi|323451705|gb|EGB07581.1| hypothetical protein AURANDRAFT_27261 [Aureococcus anophagefferens]
Length = 614
Score = 198 bits (503), Expect = 9e-48, Method: Compositional matrix adjust.
Identities = 166/557 (29%), Positives = 249/557 (44%), Gaps = 94/557 (16%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
LT++ +G D+S++ + + TP + ++ +G +L+N YA P CTPSRA LMTG+Y I
Sbjct: 48 LTVVDDWGSGDVSYNDA-ALHTPELQRMSEHGFVLDNFYAAPTCTPSRAMLMTGRYNIRN 106
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
GMQ I EPRGVPL ERFL + L + GY T AIGKWHLG R YTPL RGF+ +G
Sbjct: 107 GMQDSVIHSTEPRGVPLDERFLSQKLSDAGYRTAAIGKWHLGMHRDAYTPLKRGFDLFYG 166
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
L G S+ H +S+ + G D + G + + +A
Sbjct: 167 ILTGGGSHTGH-----FSKGDDKKGDDKKDAKKDDAKKGGAHRRLAASGKA--------- 212
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
P F+YL++ A H +++ I+ NR T M+ ++DD + T+ +
Sbjct: 213 --PWFVYLSYQAVH----DPVTVGDDRYVSETAAISTTNRPTLCGMIAEIDDGLKTLRLS 266
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAILW 383
L+ +I + +SDNG R GS NYP RG K +EGGVKVPA+L
Sbjct: 267 LENAQAWAQTIFVVLSDNG----------GVRAHGSANYPLRGEKAQYYEGGVKVPAVLA 316
Query: 384 SPQIQQNPR--------VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
+++ R S + HI+D T + AG + +DG+ +S L+
Sbjct: 317 GGYVEEALRRAGGAERLRSASLAHITDVHATFLSLAGYEGGDDDKPLDGVSLFSHLV--- 373
Query: 436 PSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEK--KRTAAVRLDSWKLVLGTQENGTM 493
Q +S + R+ VLINI+ + A+R+ KL++ + N +
Sbjct: 374 ------------QSASGVAGAAPPRSDVLININSALFAGSGAIRVGDHKLMVNPEPNES- 420
Query: 494 DGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGAN 553
Y + R L V + T+QS+ A H A
Sbjct: 421 -SIYSKVRK---ALAAQKGAVPTDTFQSI----------------------VAQAHKDAL 454
Query: 554 PAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHE-QP 612
AP LF+L +P E+ + A+S P S LY++ Y + P + +
Sbjct: 455 GAP---------SISLFDLKKNPYERTDCATSDPAESCNLYDVAAYADQEAQPSTFAWED 505
Query: 613 DLVQADPKRFNDTWSPW 629
D ADP F W+PW
Sbjct: 506 DGPLADPALFGSVWAPW 522
>gi|323456975|gb|EGB12841.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 536
Score = 197 bits (502), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 127/364 (34%), Positives = 186/364 (51%), Gaps = 31/364 (8%)
Query: 92 GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
G+ D+S++G +N + TP +D LA +GI L Y+Q CTP+RA+L+TG+YP +TGMQ
Sbjct: 44 GYGDVSYNGDGSLTNAVATPYLDRLAADGITLTRFYSQCDCTPARAALLTGRYPSNTGMQ 103
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ +P LP L E GY AIGKW +G R TP RGF+SH GY
Sbjct: 104 HEVVTAQSQWSLPHEFALLPSALPE-GYRKHAIGKWDVGHARAADTPTARGFDSHLGYYG 162
Query: 208 GVISYYDHILSDQYSRTVELNG--HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
I+Y +H S NG DM + +T T Y+T LF A+ L++ + +
Sbjct: 163 AEITYDEHAALRSCS-----NGTIRDMNHDGATLAATEDRYSTHLFADHAMALVDREADE 217
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----TDPNRRTYAAMVKKLDDSVGTV 321
LFLYL A H + L A + +F +D R T+AA+ +LD +V
Sbjct: 218 YKLFLYLCFQAVH-----QPLAADAALVKRFADAFLDESDAARSTFAAVALELDRAVERF 272
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ + +G ++++ FMSDNGA + GSN+P RG K T +EGGV+VPA
Sbjct: 273 VEFTKARGAYDDAVFFFMSDNGATLAQ-------SGGGSNWPLRGSKFTAYEGGVRVPAF 325
Query: 382 LWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
L S ++ R + + H+ D LPTL A R+P +DG+DQ ++L + R
Sbjct: 326 LHSARVPPARRGAAHAGLFHVVDVLPTL-AAFAESADRVPAGVDGVDQRDAVLHGAAAPR 384
Query: 440 NSNI 443
+
Sbjct: 385 TEAL 388
>gi|329928435|ref|ZP_08282305.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
gi|328937871|gb|EGG34277.1| putative cerebroside-sulfatase [Paenibacillus sp. HGF5]
Length = 443
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 141/421 (33%), Positives = 214/421 (50%), Gaps = 51/421 (12%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
+++Y G+ DL +GS+ + TP++D LA GI N Y+ PVC+PSRASL+TGKYP
Sbjct: 10 FIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYP 69
Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
G+ I GA+ G+P E L + L+ GY T GKWHLG E +P GF
Sbjct: 70 ARAGVG--EILGAKRGSHGLPADEVTLAKALKPAGYRTALFGKWHLGL-SEETSPNAHGF 126
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+ FG+ G + +Y HI + + +N HD+ N + W+ G Y T+L T+ +V
Sbjct: 127 DEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWEN-GRYMTELITERSVDF 183
Query: 259 IE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I+ + + P FL+ ++ A H + APQE +++F ++ +R+ AAM+ +DD
Sbjct: 184 IQRSREQEAPFFLFASYNAPH-----YPMHAPQEYMDRFAHLP-WDRQVMAAMIAAVDDG 237
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWE 373
VG ++ AL+ G E+++I F SDNG P+ E R T + GS +RG K +L+E
Sbjct: 238 VGEIVKALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 296
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG++ PAIL P + +V +++ + D PT AG D + PL
Sbjct: 297 GGIREPAILSWPNGWEGGQVRDEVVAMMDIAPTFLDLAGVDPAAGPL------------- 343
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
+ +DG L + PS + E + AVR WKLVL NG +
Sbjct: 344 -----QGVALDGSSLKEMLQMRMPSPHKQLFW---EYQGQLAVREGDWKLVL----NGKL 391
Query: 494 D 494
D
Sbjct: 392 D 392
>gi|406832516|ref|ZP_11092110.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 453
Score = 197 bits (501), Expect = 1e-47, Method: Compositional matrix adjust.
Identities = 125/343 (36%), Positives = 184/343 (53%), Gaps = 39/343 (11%)
Query: 76 YAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPS 131
+AA S +LIV G+ DL F G +IPTP ID LA +G+ ++ Y P C+P+
Sbjct: 25 WAADASSKPNIVLIVADDLGYGDLGFQGGRDIPTPRIDGLARSGVTCSSGYVSGPYCSPT 84
Query: 132 RASLMTGKYPIHTGMQGPPIWGA---EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
RA L+TG+Y G + P A E G+ L ER LP+ L++ GY+T +GKWHLGF
Sbjct: 85 RAGLLTGRYQQRFGHEFNPGNAARVTETFGLSLEERTLPQRLKQAGYATGIVGKWHLGF- 143
Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-MRRNLSTAWDTVGEYA 247
++ PL RGF+ FG+L G Y+ + N +D +RR ++ EY
Sbjct: 144 APQFQPLERGFDEFFGFLGGAHPYFP-----------DANSNDPIRRGREAVVES--EYL 190
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
TD F +E+V I D+ ++P FLYLA A H A E + +F+ I D RRTY
Sbjct: 191 TDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFKSIADERRRTY 244
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
A M+ +DD+VG V+ L + + + ++++F+SDNG P N +N P G
Sbjct: 245 AGMLTAMDDAVGQVLDKLAAEKLEQKTLVVFISDNGGPPA---------NGSTNTPLNGQ 295
Query: 368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYT 409
K T WEGGV+VP ++ W Q+ + ++ + D LPT+ T
Sbjct: 296 KATTWEGGVRVPFVVSWPGQLAAGKKYEEPVIQL-DLLPTILT 337
Score = 45.8 bits (107), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 27/66 (40%), Positives = 36/66 (54%), Gaps = 6/66 (9%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY TD F +E+V I D+ ++P FLYLA A H A E + +F+ I D R
Sbjct: 188 EYLTDAFARESVAYI-DRNKNRPFFLYLAFNAVH-----NPQHAKPEHLERFKSIADERR 241
Query: 74 RTYAAL 79
RTYA +
Sbjct: 242 RTYAGM 247
>gi|29348898|ref|NP_812401.1| arylsulfatase [Bacteroides thetaiotaomicron VPI-5482]
gi|383125065|ref|ZP_09945724.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
gi|29340804|gb|AAO78595.1| arylsulfatase B precursor [Bacteroides thetaiotaomicron VPI-5482]
gi|251837419|gb|EES65514.1| hypothetical protein BSIG_5384 [Bacteroides sp. 1_1_6]
Length = 458
Score = 197 bits (501), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 127/342 (37%), Positives = 189/342 (55%), Gaps = 35/342 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP ++ALA G++L+ Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLNALAAEGVVLDRFYTAPISTPTRAGLMTGRYPNRFGIRTTVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E L + L GYS +A IGKWHLG R+ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETLADMLARNGYSNRAIIGKWHLGHTRKVHYPINRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y+DH+ R EL+ H+ + T +D Y+T+L T+EAV+ I + P L
Sbjct: 156 DYFDHM------REGELDWHN---DWETCYDK--GYSTELITQEAVRCINTYEKEGPFLL 204
Query: 271 YLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVISA 324
Y+A+ A H L+A ++ I + F +T + R TY AMV +D +GT++ A
Sbjct: 205 YVAYNAPHT-----PLQAQEKDIELYCDDFGSLTPKEQKRVTYQAMVSCMDRGIGTIVDA 259
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L++KG+++N+ +IF SDNG V S+ RG K W+GGV+VPA+ +
Sbjct: 260 LKKKGIMDNTFLIFFSDNGPAGVP---------GSSSGKLRGRKFDEWDGGVRVPAVFYW 310
Query: 385 PQIQQNPR-VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+ + N + +S Q+ D +PTL GD +R DG+
Sbjct: 311 KRAESNYKNLSSQVTGFVDIVPTLKELV-GDKNRPERAYDGI 351
>gi|109897214|ref|YP_660469.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699495|gb|ABG39415.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 500
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 183/369 (49%), Gaps = 43/369 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND+ F+GS +I TPN+D LA NG+ + Y A P C PSRA++MTG+YP G Q
Sbjct: 51 GYNDVGFNGSTDIKTPNLDGLAKNGMTFDAAYVAHPFCGPSRAAIMTGRYPHKIGAQFNL 110
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
GV E F+ + ++ GY T A+GKWHLG EY P GF+ +G+L G
Sbjct: 111 PEDNSNVGVSADELFIAQTMKSAGYFTGAMGKWHLGE-ASEYHPNKHGFDEFYGFLGGGH 169
Query: 211 SYYDHILSDQYSRTV--------------ELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+Y+ Y++ V E NG ++R EY TD ++EAV
Sbjct: 170 NYFPEQFEAAYNKRVAQGMTNINMYLTPLEHNGKEVRET---------EYITDGLSREAV 220
Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++ K P FLYLA+ A H L+A +E + F I D RRTYA MV +D
Sbjct: 221 NFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKKRRTYAGMVYAVD 275
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
VG ++ L++ G +N++I+F SDNG + +NYP + K ++ EGG
Sbjct: 276 RGVGRIVEQLKKNGQFDNTVIVFTSDNGGKLGQ---------GANNYPLKEGKGSVQEGG 326
Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
+ P ++ W ++ R S ++ + D PT + GG +DG D W+ + N
Sbjct: 327 FRTPMLVHWPKHMKAGSRFSHPVLAL-DLYPT-FAGLGGAVLPEDKKLDGKDIWADIQAN 384
Query: 435 TPSRRNSNI 443
T ++ I
Sbjct: 385 TAPHKDEFI 393
Score = 48.5 bits (114), Expect = 0.011, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
EY TD ++EAV ++ K P FLYLA+ A H L+A +E + F I D
Sbjct: 209 EYITDGLSREAVNFVDKAAAKKKPFFLYLAYNAPHV-----PLQAKEEDMAMFSQIKDKK 263
Query: 73 RRTYAALT 80
RRTYA +
Sbjct: 264 RRTYAGMV 271
>gi|319952005|ref|YP_004163272.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420665|gb|ADV47774.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 484
Score = 197 bits (500), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 193/368 (52%), Gaps = 28/368 (7%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L G+ND+ F+GS +I TPN+D LA +G I + Y A P C PSRA+L+TG+YP
Sbjct: 35 LILCDDLGYNDVGFNGSTDITTPNLDQLAQDGTIFTSAYVAHPFCGPSRAALLTGRYPHT 94
Query: 144 TGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
G Q P + +G+ + E+F+ +++ GY T AIGKWHLG EY P RGF
Sbjct: 95 LGSQFNLPANGASTGKGISVEEKFMGVPMQKAGYYTGAIGKWHLGE-TAEYHPNKRGFND 153
Query: 202 HFGYLNGVISYYDHILSDQYSRTVEL-----NGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+G+L G Y+ QY E+ N + + + A +Y TD+ ++E +
Sbjct: 154 FYGFLGGGHKYFPEEYKLQYKHQKEMGTKNINDYVLPLEHNGAIVEENDYLTDVLSREGI 213
Query: 257 QLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ E KP FLYLA+ A H LEA ++ + +F+ I D +RRTYAAMV +D
Sbjct: 214 RFTKEAHDKKKPFFLYLAYNAPHVP-----LEAKEKDLEKFKDIEDIDRRTYAAMVYAVD 268
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
VG ++++L++ G +N++IIF+SDNG T + +NYP G K WEGG
Sbjct: 269 RGVGEIVASLKKTGQFDNTLIIFLSDNGGHT---------GHGANNYPLTGRKGDTWEGG 319
Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-L 433
+VP W +I++ + + + D PT+ + +DG + W +++ L
Sbjct: 320 FRVPMFFHWPKKIKKGQKFDYPVSAL-DLYPTIAHIGQAKIGSDKI-LDGKNIWEAIISL 377
Query: 434 NTPSRRNS 441
P + +
Sbjct: 378 TNPHDKET 385
Score = 47.8 bits (112), Expect = 0.018, Method: Compositional matrix adjust.
Identities = 26/68 (38%), Positives = 40/68 (58%), Gaps = 6/68 (8%)
Query: 14 EYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
+Y TD+ ++E ++ E KP FLYLA+ A H LEA ++ + +F+ I D +
Sbjct: 202 DYLTDVLSREGIRFTKEAHDKKKPFFLYLAYNAPHVP-----LEAKEKDLEKFKDIEDID 256
Query: 73 RRTYAALT 80
RRTYAA+
Sbjct: 257 RRTYAAMV 264
>gi|340619110|ref|YP_004737563.1| sulfatase [Zobellia galactanivorans]
gi|339733907|emb|CAZ97284.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 511
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 143/448 (31%), Positives = 220/448 (49%), Gaps = 57/448 (12%)
Query: 78 ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSR 132
L + T +L+V G+ D+ F+GS +I TP +D LA NG I + Y A P C PSR
Sbjct: 41 GLAQETRPNILVVLCDDLGYADVGFNGSTDILTPELDNLAQNGSIFTSAYVAHPFCGPSR 100
Query: 133 ASLMTGKYPIHTG-----MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
++++TG+YP TG + GVP+ E ++ + L+ GY T AIGKWHLG
Sbjct: 101 SAILTGRYPHLTGTAYNLFHNSSEDDKDNMGVPVEETYMSKVLQNAGYYTSAIGKWHLGA 160
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMR-------RNLSTAW 240
+ + P RGF+ +G+L G Y+ Y + ++R N A
Sbjct: 161 APK-FHPNKRGFDDFYGFLGGGHDYFPSEYQKTYKAQKKAGNPNIRDYVFPMEHNGKPAN 219
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+T EY TD F++EA++ I+ K P F+YLA+ A H L+A E + +F +I
Sbjct: 220 ET--EYITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHVP-----LQAKAEDVAKFAHI 272
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
D +RRTYAAMV +D VG ++ L+ +N++I+F+SDNG N+ +
Sbjct: 273 KDKDRRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGG---------NFNHGA 323
Query: 360 SNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
+NYP +G K WEGG +VP W +I+++ R + + D PT G ++L
Sbjct: 324 NNYPLKGTKGDTWEGGYRVPMFFHWPKKIKKDQRFDFPVSSL-DLYPTF---TGLAEAKL 379
Query: 419 PLN--IDGLDQWSSLLLNTPSRRNSNIDGL--------------DQWSSLLLNTPSRRNS 462
P +DG + +L NT ++ I L D + + N P R ++
Sbjct: 380 PKGKQLDGKNIMDDVLKNTEPYKDEMIYSLRYREGYNDVGARMGDWKITRMGNEPWRLHN 439
Query: 463 VLINIDEKKRTAAVRLDSWK-LVLGTQE 489
+ +I EKK A D K ++ TQE
Sbjct: 440 ITQDIGEKKNLAGRYPDRLKEMIAKTQE 467
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 38/111 (34%), Positives = 55/111 (49%), Gaps = 24/111 (21%)
Query: 14 EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
EY TD F++EA++ I+ K P F+YLA+ A H L+A E + +F +I D +
Sbjct: 222 EYITDGFSREAIKNIKIAAAKKQPFFIYLAYNAPHVP-----LQAKAEDVAKFAHIKDKD 276
Query: 73 RRTYAAL-----------------TKSTTLTLLIVYGWNDLSF-HGSNEIP 105
RRTYAA+ TK TL++ N +F HG+N P
Sbjct: 277 RRTYAAMVYAVDRGVGKIVQTLKETKQFDNTLIVFLSDNGGNFNHGANNYP 327
>gi|390361685|ref|XP_789654.2| PREDICTED: LOW QUALITY PROTEIN: arylsulfatase B-like
[Strongylocentrotus purpuratus]
Length = 401
Score = 196 bits (499), Expect = 2e-47, Method: Compositional matrix adjust.
Identities = 149/484 (30%), Positives = 220/484 (45%), Gaps = 103/484 (21%)
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
+ + LR+ GY+T A+GKWH+G ++ P RGF+S FG L G +Y H + V
Sbjct: 1 MADKLRKAGYATHAVGKWHMGLYKTCCWPTNRGFDSFFGLLTGRGGFYTHKHYGGHPGLV 60
Query: 226 E---LNGHDMRRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGN 281
+ +G+D+R NL A D G Y+T LFT+++ +I KPLFLY + A H
Sbjct: 61 DSKNWSGYDLRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHYP- 119
Query: 282 AGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSD 341
LE P + F YI D RRTYA MVK +D++VG + L++ G+ N+IIIF SD
Sbjct: 120 ----LEVPPRYMEDFNYIADERRRTYAGMVKCMDEAVGNLTRTLKKTGLWNNTIIIFSSD 175
Query: 342 NGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMH 399
NGA N+ GSN+P RG+K +LWEGG+K + SP + + R V+ + H
Sbjct: 176 NGA---------NFNYGGSNWPLRGMKRSLWEGGIKSVGFIASPLLPKLVRGTVNNNLFH 226
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
++DW PTL + + ++DG + W LL +R G D W
Sbjct: 227 VTDWFPTLVRGVARGSLK-GTHLDGHNLWKHLLNKKKTR------GKDSWP--------- 270
Query: 460 RNSVLINID-------EKKR------TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVP 506
R +L NID E R AA+R ++WKL+
Sbjct: 271 RKEILHNIDPYSPIFSELPRWFSINFPAAIRYENWKLI---------------------- 308
Query: 507 LLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGP 566
Q+ ++P + TIH P
Sbjct: 309 ---------------TSQIGNRSWVPPPE-------SGENTIHSTDTPEKR--------- 337
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTW 626
+LF++ DP E N+++ R DI+ L L + R VP + PD ++A+P N ++
Sbjct: 338 VWLFDISEDPKEMNDLSLERTDITYFLLRRLGSYLRGSVPIRYPGPD-IRANPSLHNGSF 396
Query: 627 SPWI 630
S WI
Sbjct: 397 SDWI 400
Score = 54.7 bits (130), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 33/82 (40%), Positives = 42/82 (51%), Gaps = 6/82 (7%)
Query: 1 MRRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+R NL A D G Y+T LFT+++ +I KPLFLY + A H LE P
Sbjct: 70 LRDNLEQVAQDYQGVYSTHLFTQKSQNIIRRHNRSKPLFLYHSFQAVHYP-----LEVPP 124
Query: 60 ETINQFQYITDPNRRTYAALTK 81
+ F YI D RRTYA + K
Sbjct: 125 RYMEDFNYIADERRRTYAGMVK 146
>gi|193784090|dbj|BAG53634.1| unnamed protein product [Homo sapiens]
Length = 426
Score = 196 bits (499), Expect = 3e-47, Method: Compositional matrix adjust.
Identities = 140/454 (30%), Positives = 210/454 (46%), Gaps = 79/454 (17%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+GKWHLGF+R+E P RGF++ G G + YY + D + G D+ + A
Sbjct: 2 VGKWHLGFYRKECLPTRRGFDTFLGSFTGNVDYYTYDNCD----GPGVCGFDLHEGENVA 57
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
W G+Y+T + + A ++ +PLFLY+A A H L++P+E + +++ +
Sbjct: 58 WGLSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRYRTM 112
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ RR YAAMV +D++V + AL+R G NS+IIF SDNG T G
Sbjct: 113 GNVARRKYAAMVTCMDEAVRNITWALKRYGFYNNSVIIFSSDNGGQTFSG---------G 163
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN+P RG K T WEGGV+ + SP +++ R S +MHI+DW PTL AGG TS
Sbjct: 164 SNWPLRGRKGTYWEGGVRGLGFVHSPLLKRKQRTSRALMHITDWYPTLVGLAGGTTSAAD 223
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL--INIDEKKRTAAVR 477
+DG D W ++ S R + +D L ++ S+ I AA+R
Sbjct: 224 -GLDGYDVWPAISEGRASPRTEILHNIDP-----LYNHAQHGSLEGGFGIWNTAVQAAIR 277
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+ WKL+ G D YG + +P Q+L + + N++
Sbjct: 278 VGEWKLLTG-------DPGYG----DWIPP------------QTLATFPGSWW----NLE 310
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
+M S RQ +LFN+ DP E+ ++A RPD+ L L
Sbjct: 311 RMASVRQA---------------------VWLFNISADPYEREDLAGQRPDVVRTLLARL 349
Query: 598 KYHRRTLVPQSH--EQPDLVQADPKRFNDTWSPW 629
+ RT +P + E P +A P W PW
Sbjct: 350 AEYNRTAIPVRYPAENP---RAHPDFNGGAWGPW 380
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 22/75 (29%), Positives = 41/75 (54%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW G+Y+T + + A ++ +PLFLY+A A H L++P+E + ++
Sbjct: 55 NVAWGLSGQYSTMPYAQRASHILASHSPQRPLFLYVAFQAVHT-----PLQSPREYLYRY 109
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 110 RTMGNVARRKYAAMV 124
>gi|291513548|emb|CBK62758.1| Arylsulfatase A and related enzymes [Alistipes shahii WAL 8301]
Length = 467
Score = 196 bits (497), Expect = 4e-47, Method: Compositional matrix adjust.
Identities = 130/357 (36%), Positives = 182/357 (50%), Gaps = 39/357 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ +HGS IPTPNIDALA GI +N Y PV +P+RA LMTG+YP G++ I
Sbjct: 43 GWGDVGYHGSV-IPTPNIDALAARGIEMNRFYTAPVSSPTRAGLMTGRYPSRFGIRKTVI 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E+ L + L GY+ +AI GKWHLG RR Y PL RGF +G LNG +
Sbjct: 102 PPWRDYGLDPEEQTLADMLAANGYAHRAIVGKWHLGHGRRAYYPLNRGFTHFYGCLNGAL 161
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ H R EL+ H+ W++ + Y+TDL EAV+ I + P
Sbjct: 162 DYFTH------EREGELDWHN-------DWESCRDEGYSTDLIADEAVRCIGGYASEGPF 208
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI------NQFQYIT--DPNRRTYAAMVKKLDDSVGT 320
FLY+A A H +AP++ I +F +T + + TY AMV ++D VG
Sbjct: 209 FLYVAFNAPHT-----PFQAPEDEIAEHISPEKFAALTPREKDAYTYRAMVTRMDKGVGR 263
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
++ L+R G++EN++++FMSDNG N S P RG K W+GG++VPA
Sbjct: 264 ILEKLERSGLVENTLVLFMSDNGG-------IPNLPGGSSCAPLRGHKGREWDGGLRVPA 316
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
+L S Q+ D PT+ G P DG++ +L T S
Sbjct: 317 VLCRKGRFAEGVRSEQLAAFVDMAPTIAALVG--AGEPPRPYDGVNLLPALTGETAS 371
>gi|115533418|ref|NP_001041232.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
gi|351060348|emb|CCD68016.1| Protein SUL-3, isoform b [Caenorhabditis elegans]
Length = 452
Score = 196 bits (497), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 202/408 (49%), Gaps = 61/408 (14%)
Query: 93 WNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
W D + H TPN+ LA+ N +L+N Y +CTP+R++ MTG YP G Q
Sbjct: 12 WKDSTLH------TPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGV 65
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
EP GVP FL E +R+L YST +GKWHLG+ ++E+ P RGF+ +G+
Sbjct: 66 FLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQT 125
Query: 211 SYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQP 263
Y++H +DQY R ++ + G D+ + + + G Y+TDLFT A+ ++++
Sbjct: 126 GYFNHS-ADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 184
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAA--MVKKLDDSVG 319
KP F++L++ A H L+ Q+ TI Q + T R + M+ +D ++G
Sbjct: 185 NSKPFFMFLSYQAVHPP-----LQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIG 239
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
++ L+ + EN++I+F SDNG T+N+ SN P RG K+T+WEGG K
Sbjct: 240 RLVEYLKASNLYENTVIVFTSDNGG-------TANF--GASNAPLRGEKDTIWEGGTKTT 290
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
+ SP + M H+ DW T+ + G L ID
Sbjct: 291 TFVHSPMYIEEGGTRDMMFHVVDWHATILSITG-------LEID---------------- 327
Query: 440 NSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
S DG++QW L P RR + NID +A+R +KL++G
Sbjct: 328 -SYGDGINQWEYLKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 372
>gi|115533416|ref|NP_001041231.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
gi|351060347|emb|CCD68015.1| Protein SUL-3, isoform a [Caenorhabditis elegans]
Length = 488
Score = 195 bits (496), Expect = 5e-47, Method: Compositional matrix adjust.
Identities = 131/408 (32%), Positives = 202/408 (49%), Gaps = 61/408 (14%)
Query: 93 WNDLSFHGSNEIPTPNIDALAY--NGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
W D + H TPN+ LA+ N +L+N Y +CTP+R++ MTG YP G Q
Sbjct: 48 WKDSTLH------TPNLRHLAFHKNTALLSNSYVNQLCTPTRSAFMTGYYPFRVGTQNGV 101
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
EP GVP FL E +R+L YST +GKWHLG+ ++E+ P RGF+ +G+
Sbjct: 102 FLHMEPAGVPTMFPFLSENMRQLDYSTYLVGKWHLGYCKKEFLPTNRGFDYFYGFYGPQT 161
Query: 211 SYYDHILSDQYSRTVE--LNGHDMRRNLSTA-----WDTVGEYATDLFTKEAVQLIEDQP 263
Y++H +DQY R ++ + G D+ + + + G Y+TDLFT A+ ++++
Sbjct: 162 GYFNHS-ADQYHRELKRVVKGLDLFEEVGSGKSVPDFSQNGVYSTDLFTDVAMSVLDNHN 220
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAA--MVKKLDDSVG 319
KP F++L++ A H L+ Q+ TI Q + T R + M+ +D ++G
Sbjct: 221 NSKPFFMFLSYQAVHPP-----LQVSQQSKTIGQGKEATFILRSHAHSTRMLTAMDFAIG 275
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
++ L+ + EN++I+F SDNG T+N+ SN P RG K+T+WEGG K
Sbjct: 276 RLVEYLKASNLYENTVIVFTSDNGG-------TANF--GASNAPLRGEKDTIWEGGTKTT 326
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
+ SP + M H+ DW T+ + G L ID
Sbjct: 327 TFVHSPMYIEEGGTRDMMFHVVDWHATILSITG-------LEID---------------- 363
Query: 440 NSNIDGLDQWSSLLLNTPS-RRNSVLINIDEKKRTAAVRLDSWKLVLG 486
S DG++QW L P RR + NID +A+R +KL++G
Sbjct: 364 -SYGDGINQWEYLKTGRPKFRRFQFVYNIDN--HGSAIRDGDYKLIVG 408
>gi|291235506|ref|XP_002737685.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 658
Score = 195 bits (495), Expect = 6e-47, Method: Compositional matrix adjust.
Identities = 137/416 (32%), Positives = 205/416 (49%), Gaps = 56/416 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +HGS I TPNID LA G+ L N Y C PSR +L+TG+Y I TG+
Sbjct: 215 GWHDVGYHGSI-IDTPNIDHLAAEGVKLENYYVSSWCAPSRVNLLTGRYRIRTGL----- 268
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL-GFFRREYTPLYRGFESHFGYLNGVI 210
+G + + E L + L E GY T +GKWHL GF RE P +RGF++ G+ G
Sbjct: 269 YGDVCDFMGIHEITLADKLYEAGYYTAMVGKWHLSGFQHRECYPAHRGFQTFLGFHGGSQ 328
Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+Y+ H S +D N S + G Y+T ++ +EA ++I ++PLF
Sbjct: 329 NYFTHRRGGSNSE------YDFWANDTSIGREYDGRYSTMVYAEEAQRIIRHHRTEQPLF 382
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
LYL+ A H+ L P ++++ I D RR YAAM +D+++G V L+
Sbjct: 383 LYLSFQAVHSP-----LLVPSAYEDKYRTGIEDDKRRVYAAMTSCMDEAIGNVTDTLKET 437
Query: 329 GMLENSIIIFMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ ++ IF SDNG PT GSN+P RG K +EGG++ + S +
Sbjct: 438 GLWNKTVFIFSSDNGGTPTFG----------GSNWPLRGRKGLWYEGGIRAVGFVNSYLL 487
Query: 388 QQNPRVSL--QMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLLNTPSRRNSNID 444
+ S +++HI+DW PTL A D RL PL DG+DQW ++ + R
Sbjct: 488 SNHVTGSTNHELIHITDWFPTLLRLAKTDLHRLKPL--DGIDQWETISQGKKTNR----- 540
Query: 445 GLDQWSSLLLN----------TPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
+LLN +R + + AA+R+ WKL+ G +++
Sbjct: 541 -----KEVLLNFLPGNISGNINENREGEATADYFDTMPFAAIRVGDWKLLTGNKDS 591
Score = 42.0 bits (97), Expect = 1.1, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDP 71
G Y+T ++ +EA ++I ++PLFLYL+ A H+ L P ++++ I D
Sbjct: 357 GRYSTMVYAEEAQRIIRHHRTEQPLFLYLSFQAVHSP-----LLVPSAYEDKYRTGIEDD 411
Query: 72 NRRTYAALT 80
RR YAA+T
Sbjct: 412 KRRVYAAMT 420
>gi|1030057|emb|CAA63067.1| murine arylsulfatase B [Mus musculus]
Length = 252
Score = 194 bits (494), Expect = 8e-47, Method: Compositional matrix adjust.
Identities = 107/268 (39%), Positives = 154/268 (57%), Gaps = 22/268 (8%)
Query: 174 GYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH--- 230
GY+T +GKWHLG +R+E P RGF+++FGYL G YY H + + LNG
Sbjct: 2 GYATHMVGKWHLGMYRKECLPTRRGFDTYFGYLLGSEDYYTH---EACAPIESLNGTRCA 58
Query: 231 -DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
D+R A + Y+T++FTK A +I + P +KPLFLYLA + H L+ P
Sbjct: 59 LDLRDGEEPAKEYNNIYSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVP 113
Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
+E + + +I D +RR YA MV +D++VG V AL+ G+ N++ IF +DNG T
Sbjct: 114 EEYMEPYGFIQDKHRRIYAGMVSLMDEAVGNVTKALKSHGLWNNTVFIFSTDNGGQT--- 170
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
R+ G+N+P RG K TLWEGG++ + SP ++Q S ++MHISDWLPTL
Sbjct: 171 ------RSGGNNWPLRGRKGTLWEGGIRGTGFVASPLLKQKGVKSRELMHISDWLPTLVD 224
Query: 410 AAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
AGG T+ +DG + W ++ PS
Sbjct: 225 LAGGSTNGTK-PLDGFNMWKTISEGHPS 251
Score = 55.8 bits (133), Expect = 7e-05, Method: Compositional matrix adjust.
Identities = 26/66 (39%), Positives = 40/66 (60%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FTK A +I + P +KPLFLYLA + H L+ P+E + + +I D +RR
Sbjct: 75 YSTNIFTKRATTVIANHPPEKPLFLYLAFQSVH-----DPLQVPEEYMEPYGFIQDKHRR 129
Query: 75 TYAALT 80
YA +
Sbjct: 130 IYAGMV 135
>gi|261404208|ref|YP_003240449.1| sulfatase [Paenibacillus sp. Y412MC10]
gi|261280671|gb|ACX62642.1| sulfatase [Paenibacillus sp. Y412MC10]
Length = 452
Score = 194 bits (494), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 140/421 (33%), Positives = 213/421 (50%), Gaps = 51/421 (12%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
+++Y G+ DL +GS+ + TP++D LA GI N Y+ PVC+PSRASL+TGKYP
Sbjct: 19 FIVIYCDDLGYGDLGCYGSDTVKTPHLDGLADEGIRFTNWYSNSPVCSPSRASLLTGKYP 78
Query: 142 IHTGMQGPPIWGAE--PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
G+ I GA+ G+P E L + L+ GY T GKWHLG E +P GF
Sbjct: 79 ARAGVG--EILGAKRGSHGLPADEVTLAKALKPAGYRTALYGKWHLGL-SEETSPNAHGF 135
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+ FG+ G + +Y HI + + +N HD+ N + W+ G Y T+L T+ +V
Sbjct: 136 DEFFGFKAGCVDFYSHIF--YWGQAHGVNPLHDLWENETEVWEN-GRYMTELITERSVDF 192
Query: 259 IE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I+ + + P FL+ ++ A H + APQ+ +++F ++ +R+ AAM+ +DD
Sbjct: 193 IQRSREQEAPFFLFASYNAPH-----YPMHAPQKYMDRFAHLP-WDRQVMAAMIAAVDDG 246
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE----TSNYRNWGSNYPYRGVKNTLWE 373
VG ++ AL+ G E+++I F SDNG P+ E R T + GS +RG K +L+E
Sbjct: 247 VGKIVKALKEAGCYEDTVIFFSSDNG-PSSESRNWLDGTEDVYYGGSAGIFRGHKASLFE 305
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG++ PAIL P + +V ++ + D PT AG D + PL
Sbjct: 306 GGIREPAILSWPNGWEGGQVRDEVAAMMDLAPTFLDLAGVDPAAGPL------------- 352
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
+ +DG L + PS + E + AVR WKLVL NG +
Sbjct: 353 -----QGVALDGSSLKEMLQMREPSPHQQLFW---EYQGQLAVREGDWKLVL----NGKL 400
Query: 494 D 494
D
Sbjct: 401 D 401
>gi|160885330|ref|ZP_02066333.1| hypothetical protein BACOVA_03329 [Bacteroides ovatus ATCC 8483]
gi|156108952|gb|EDO10697.1| arylsulfatase [Bacteroides ovatus ATCC 8483]
Length = 458
Score = 194 bits (493), Expect = 1e-46, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I N F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTNNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|149197407|ref|ZP_01874458.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149139425|gb|EDM27827.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 194 bits (492), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 122/331 (36%), Positives = 178/331 (53%), Gaps = 38/331 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ +HG EIPTPNID +A G+ + Y+ +C P+RA+LM+G Y G +G
Sbjct: 31 GYADVGYHGLEEIPTPNIDRIANEGVQFSAGYSNGSICGPTRAALMSGVYQQRIGCEG-- 88
Query: 151 IWGAEPR------GVPLTERFLPEYLRELGYSTKAIGKWHLG---FFRREYTPLYRGFES 201
I G G+P + L +Y +E GY+T GKWHLG F + P RGF+
Sbjct: 89 ICGGRKLNEHVVVGMPREVKTLAQYFQEAGYATGLFGKWHLGGERLFDKTLMPTSRGFDE 148
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
FG L G Y D TV +R++ T D GEY TD +EAV I
Sbjct: 149 FFGILEGASLYDD---------TVNRERKYIRQD--TVIDYEGEYFTDAIGREAVSFITR 197
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ DKP FLYL A HA ++A ++ + +F +I DPNRR +AAM+ +DD++G V
Sbjct: 198 KG-DKPFFLYLPFTAVHA-----PMQASEKYMQRFAHIADPNRRVFAAMLSAMDDNIGRV 251
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
AL+ +G+L+N++I+F SDNG + N N+ N+P +G K +EGG++VPA
Sbjct: 252 FDALEHQGILDNTLIVFWSDNGG------KPDN--NYSLNHPLKGQKTQFYEGGIRVPAC 303
Query: 382 LWSPQIQQNPRVSL-QMMHISDWLPTLYTAA 411
+ P+ Q +L Q + + D P+ AA
Sbjct: 304 VRWPKGQIPAGKTLDQPVFLMDIFPSALEAA 334
Score = 57.0 bits (136), Expect = 3e-05, Method: Compositional matrix adjust.
Identities = 38/116 (32%), Positives = 55/116 (47%), Gaps = 24/116 (20%)
Query: 7 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
T D GEY TD +EAV I + DKP FLYL A HA ++A ++ + +F
Sbjct: 174 TVIDYEGEYFTDAIGREAVSFITRKG-DKPFFLYLPFTAVHA-----PMQASEKYMQRFA 227
Query: 67 YITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
+I DPNRR +AA+ + + V+ DAL + GI+ N +
Sbjct: 228 HIADPNRRVFAAMLSAMDDNIGRVF------------------DALEHQGILDNTL 265
>gi|372210595|ref|ZP_09498397.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 472
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 123/332 (37%), Positives = 177/332 (53%), Gaps = 18/332 (5%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F G+ +I TPN+D LA NG YA C PSRA+L++G+Y G + P
Sbjct: 37 GYADTGFTGATDIQTPNLDNLAKNGAFFKQGYANHAYCGPSRAALLSGRYQHRFGFETNP 96
Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ A P G+ + E+ P+ L+E GY T AIGKWHLG E+ PL RGF+ +G+L G
Sbjct: 97 AYDPANPHMGIDVGEKLFPKRLQEAGYKTGAIGKWHLGA-AAEFHPLNRGFDYFYGFLGG 155
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+D+ D + E + RN D Y T + +A Q ++D + P
Sbjct: 156 G---HDYFRIDGTKKVWEAYLQPLVRNKRA--DNFEGYLTTALSNDAAQFVKDNK-ENPF 209
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FLY+A+ A H L+AP+E I ++ +I D RR YAAMV +D +GTVI +L+
Sbjct: 210 FLYVAYNAPHM-----PLQAPKEDIARYAHIKDNKRRVYAAMVDVMDRGIGTVIQSLKDS 264
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
G+ EN++I F+SDNG P + T + N N P+RG K L++GGV VP W +I
Sbjct: 265 GIYENTLIFFLSDNGGPQSKGGSTKGW-NGSDNQPFRGGKGNLYDGGVHVPFFASWPNKI 323
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+ V+ + + D T AGGD + P
Sbjct: 324 KPGTVVNYPVNSL-DIGRTAVALAGGDATAAP 354
Score = 45.8 bits (107), Expect = 0.070, Method: Compositional matrix adjust.
Identities = 25/71 (35%), Positives = 38/71 (53%), Gaps = 6/71 (8%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
D Y T + +A Q ++D + P FLY+A+ A H L+AP+E I ++ +I
Sbjct: 183 DNFEGYLTTALSNDAAQFVKDNK-ENPFFLYVAYNAPHM-----PLQAPKEDIARYAHIK 236
Query: 70 DPNRRTYAALT 80
D RR YAA+
Sbjct: 237 DNKRRVYAAMV 247
>gi|241844556|ref|XP_002415496.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509708|gb|EEC19161.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 328
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 184/365 (50%), Gaps = 41/365 (11%)
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+FLY +H A H G+ LEAP + F YI NR +A V +LD SVG V+ AL +
Sbjct: 1 MFLYFSHQAVHCGDYKVKLEAPASAVAHFPYIKSQNRSIHAGAVYELDKSVGLVVEALNK 60
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+GML NSI++F +DNG S Y N+P RG K TLWEGGV+ A +WSP +
Sbjct: 61 RGMLSNSIVVFSTDNGGLPWGIDPNSGY-----NWPLRGAKATLWEGGVRGAAFVWSPLL 115
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLD 447
++ R+S Q+MHI+DWLPTLY+AAGG+ S L + DG D
Sbjct: 116 FKSGRLSNQLMHITDWLPTLYSAAGGNLSTL----------------------GDTDGKD 153
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
W +L + S R +LINID T+A+ + K+VLGT + G D + K P
Sbjct: 154 MWEALSEDLESPRQEILINIDPTGNTSALIVGRRKVVLGTFQGGQHD------QRMKAP- 206
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR---STRQQATIHCGANPAPMTPSPCTN 564
+ V+ L + + N+ ++R + R + + C AP +
Sbjct: 207 -GGSRTVDGLDQMMLSSRTGKVLKDFYNVRQLRVRPNWRNETVVRCDQY-APRD-NFVAA 263
Query: 565 GPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFND 624
P Y F+L +DPCE NN+A+S +L + ++ + + +VP ++ D + P+ +
Sbjct: 264 SPPYYFDLEHDPCELNNLAASNVTDLKELIKKIEEYGKGMVPPGNKPLD-PRGLPQNNHG 322
Query: 625 TWSPW 629
W PW
Sbjct: 323 LWGPW 327
>gi|430741545|ref|YP_007200674.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013265|gb|AGA24979.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 474
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 127/340 (37%), Positives = 179/340 (52%), Gaps = 35/340 (10%)
Query: 85 LTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
+ LL+ G+ DL F G+ +IPTP++DALA G+ + Y P C+P+RA L+TG+Y
Sbjct: 39 VVLLVADDLGYGDLGFQGARDIPTPHLDALAQGGVRCTSGYVSGPYCSPTRAGLLTGRYQ 98
Query: 142 IHTGMQGPPIWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G + P G G+P+TE L + L+ GY+T +GKWHLG ++ P
Sbjct: 99 QRFGHEFNPGGGGGAAAAKNVGLPVTETTLADRLKAAGYATGLVGKWHLGS-EAKFHPQK 157
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
RGF+ FG+L G +Y+ D Y R E+ + Y TD F++EA+
Sbjct: 158 RGFDEFFGFLGGQHTYFASKSGDVY-RGTEVVKEEA-------------YLTDAFSREAL 203
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
I D+ D P FL L+ A H ++A ++ + +F I DP RRTYAAM+ LD+
Sbjct: 204 SFI-DRHKDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRRTYAAMLSALDE 257
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+ G VI L+ G+ E ++IIF SDNG PT+ T N N P RG K T EGGV
Sbjct: 258 AAGNVIERLRAAGLEEKTLIIFFSDNGGPTMAGTTT----NGSKNDPLRGSKRTTLEGGV 313
Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
+VP +L W + +V Q + D LPT AAG D
Sbjct: 314 RVPFVLSWKGTLPAG-KVYDQPIIQLDILPTALAAAGVDA 352
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 38/65 (58%), Gaps = 6/65 (9%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD F++EA+ I D+ D P FL L+ A H ++A ++ + +F I DP RR
Sbjct: 193 YLTDAFSREALSFI-DRHKDHPFFLQLSFNAVHT-----PMDATEDRVARFSSIEDPKRR 246
Query: 75 TYAAL 79
TYAA+
Sbjct: 247 TYAAM 251
>gi|313242955|emb|CBY39683.1| unnamed protein product [Oikopleura dioica]
Length = 581
Score = 193 bits (491), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 155/573 (27%), Positives = 247/573 (43%), Gaps = 98/573 (17%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
L K + ++V G++DL + + ++ +PNIDALA + + L Y QP CTPSRA+
Sbjct: 19 VLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNIDALAKDALHLKKHYVQPSCTPSRAAF 77
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG+Y I GMQ I EP G+PL E L E ++ GY T GKWHLGF+ ++ P
Sbjct: 78 LTGRYNIRMGMQSGVIRATEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQ 137
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
RGF+ +G+ G +Y H + NG + T G Y+T LF +
Sbjct: 138 IRGFDRFYGFYLGSQDFYFH--DSGRLKAYPGNGDVENDTILDDLHTNGTYSTKLFVDDF 195
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY-------A 308
+ + PLF Y++ H P +T+N+F+ + T+ +
Sbjct: 196 INDLAKHDPAVPLFNYVSFQDVH---------GPLQTVNKFKKHYENKTDTWTHERILIS 246
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
+ LD +G +++AL+ K N++++F SDNG E SN+P RG K
Sbjct: 247 TKITTLDHHIGKMVTALKEKNYWNNTVLVFTSDNGGQPKE---------GASNWPLRGSK 297
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQ----MMHISDWLPTLYTAAGGDTSRLPLN-ID 423
T+++GG+K A + SP +Q R+ Q + H+SDW PTL T +G D R +D
Sbjct: 298 GTIYDGGLKSRAFIASPLLQN--RMKGQNYNYLFHVSDWFPTLLTLSGCDVPRKEGRPLD 355
Query: 424 GLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
G + L + P + RN + LD S P + A+ R WK
Sbjct: 356 G--KIHELFVYKPKKARNKMLHYLDPLSQFDYGDPRHFEVLDDRAFNTTIKASYRQGRWK 413
Query: 483 LVLGTQENGTMDGYYGQTR-----SNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
++ G + G GY R NK+ L++ +P
Sbjct: 414 IITG-KPCGNGCGYIKNERPWNYAKNKMKLID---------------------MPKEK-- 449
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY-EL 596
N +L+++ D E+ + ++ +P + ++L +L
Sbjct: 450 --------------------------NKNVWLYDMHKDKSEKFDRSAQKPWVVNRLLTKL 483
Query: 597 LKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+ ++PQ E D A+PK W PW
Sbjct: 484 ADLYDEQVMPQRREASD--AANPKHHGGVWMPW 514
>gi|323452003|gb|EGB07878.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1818
Score = 193 bits (490), Expect = 2e-46, Method: Compositional matrix adjust.
Identities = 133/395 (33%), Positives = 197/395 (49%), Gaps = 55/395 (13%)
Query: 92 GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
G+ D+ ++G +N + TP IDALA G+ L+ Y QP CTPSRA+L++GKYP TG
Sbjct: 55 GFGDVGYNGDPTLTNRVSTPVIDALADAGVKLSRYYTQPDCTPSRAALLSGKYPATTGTY 114
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--- 204
+ G+PL LPE L Y + A+GKW +G + P RGF+S G
Sbjct: 115 HGVLNPQSTWGLPLEHALLPEALPG-AYRSHAVGKWDVGHSSAKRLPEARGFDSFLGFYL 173
Query: 205 -YLNGVISYYDHILSD-------------------QYSRTVELNGHDMRRNLSTAWDTVG 244
+ +I Y H + D + S + +G +R D G
Sbjct: 174 CFYGPMIDYSTHEIHDHDLACAGDACAAALAKCQVRGSTVADFSGLGGQRR-----DYDG 228
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----T 300
Y TD+F AV LIE + D PLFLY+A A H LEA + + F+ +
Sbjct: 229 MYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAFEATFLPES 283
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
D R T+AA+ +D ++G V+ ALQ G EN++++F SDNGA GS
Sbjct: 284 DSKRATFAAVALAMDRAIGRVVDALQATGAYENALVVFASDNGA-------IPGQMGGGS 336
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRL 418
N+P RG K + WEGGV+VPA + SP + + R ++ + H++DWLPT+ AAG
Sbjct: 337 NWPLRGGKFSAWEGGVRVPAFVHSPLLPAHMRGAVYDGLFHVADWLPTIAAAAGAAVD-- 394
Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL 453
+ DG+ + +L + P+ R+ + +D + L
Sbjct: 395 --SGDGVSHYRALFADGPAPRSELLVHVDVYGVEL 427
Score = 185 bits (469), Expect = 7e-44, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 187/373 (50%), Gaps = 42/373 (11%)
Query: 92 GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ 147
G++D+ ++ +N++ TP +D+LA G+ L Y QP CTPSRA+L++G YP +GM
Sbjct: 630 GFDDVGYNSDPSKTNQVQTPFLDSLAAGGVKLARYYTQPDCTPSRAALLSGMYPASSGMY 689
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I G+ L +P+ L Y + A+GKW +G + + P +RGF S G+ +
Sbjct: 690 HKMITAQSNWGLDLDLELIPQRL-PAAYRSHAVGKWDVGHYTWSHVPQFRGFRSFLGFYS 748
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV---------------GEYATDLFT 252
+I YY H D E+ + L++ + G Y+TD+F
Sbjct: 749 PIIDYYTHETFDTLQCLEEMELTECEARLASECSSSIKDFNFDGDPLPLADGTYSTDVFA 808
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPN--RRTYA 308
A LI + PLFLY+A A HA LEA ++ I F ++ + R ++A
Sbjct: 809 ARARDLIRKEAPKHPLFLYVAFNAVHA-----PLEADEDVIESFASSFLEEARDARASFA 863
Query: 309 AMVKKLDDSVGTVISALQR-KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
A+ +D ++ V+ L+ G ENS+I+F SDNG GSN+P RG
Sbjct: 864 AVALGMDRAMEAVVDELKAVDGAYENSVIMFASDNGG-------LPGQIAGGSNWPLRGQ 916
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K + +EGGV+VPA + SP + R + H++DWLPTL AGG P N+DG+
Sbjct: 917 KFSPFEGGVRVPAFVHSPLLPAARRGGAYGGLFHVTDWLPTLVRLAGGAP---PRNVDGV 973
Query: 426 DQWSSLLLNTPSR 438
DQ+ ++ P+
Sbjct: 974 DQYEAIFGEHPAE 986
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 29/75 (38%), Positives = 38/75 (50%), Gaps = 9/75 (12%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI- 68
D G Y TD+F AV LIE + D PLFLY+A A H LEA + + F+
Sbjct: 225 DYDGMYTTDVFADRAVDLIEAEAADHPLFLYVAFNAVHG-----PLEADEADVAAFEATF 279
Query: 69 ---TDPNRRTYAALT 80
+D R T+AA+
Sbjct: 280 LPESDSKRATFAAVA 294
>gi|291241212|ref|XP_002740506.1| PREDICTED: arylsulfatase A-like [Saccoglossus kowalevskii]
Length = 534
Score = 193 bits (490), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 143/423 (33%), Positives = 206/423 (48%), Gaps = 67/423 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +H S I TPNID LA G+ L N Y C PSR +LMTG+Y I TG+
Sbjct: 80 GWHDVGYHDS-VIKTPNIDQLAAEGVKLENYYVSSWCAPSRVNLMTGRYRIRTGL----- 133
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL-GFFRREYTPLYRGFESHFGYLNGVI 210
+G + + E L + L E GY T +GKWHL GF E P +RGF++ GY G
Sbjct: 134 YGDVCDFMGIHETTLADKLYEAGYYTAMVGKWHLSGFEHAECYPTHRGFQTFLGYHGGSQ 193
Query: 211 SYYDHILSDQYSRTVELNGHDMRRN-LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+Y+ H ++ +D N S A G+Y+T +F EA ++I +PLF
Sbjct: 194 NYFTHRRGGPHAP------YDFWANDTSIAVKYEGQYSTMIFADEAQRIIRQHNTKQPLF 247
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
LYL+ A H L P +Q++ I D RR YAAM +D+++G V + L+ K
Sbjct: 248 LYLSFQAVHVP-----LLVPPSYEDQYRSLIEDDKRRVYAAMASCMDEAIGNVTATLKEK 302
Query: 329 GMLENSIIIFMS-----------DNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
G+ N++ IF S DNGA PT + GSN+P RG K +EGG+
Sbjct: 303 GLWNNTVFIFSSARLNFKHDFCLDNGATPT----------DGGSNWPLRGRKGLWYEGGI 352
Query: 377 KVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAGGDTSRL-PLNIDGLDQWSSLLL 433
+ + S + + + +++H++DW PTL A D L PL DG+DQW ++
Sbjct: 353 RAVGFVNSHLLSDHVTGTTNHELIHVTDWFPTLLRLAKADLHGLKPL--DGIDQWETISR 410
Query: 434 NTPSRRNSNIDGLDQWSSLLLN-TPSRRNSVLINIDEKKRTA---------AVRLDSWKL 483
+ R +LLN P N V+ E + TA A+R+ WKL
Sbjct: 411 GNVTNR----------KEVLLNFIPGDTNGVVNENHEGEATADYFDTMPFSAIRVGDWKL 460
Query: 484 VLG 486
+ G
Sbjct: 461 LTG 463
Score = 42.0 bits (97), Expect = 0.88, Method: Compositional matrix adjust.
Identities = 31/94 (32%), Positives = 44/94 (46%), Gaps = 13/94 (13%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDP 71
G+Y+T +F EA ++I +PLFLYL+ A H L P +Q++ I D
Sbjct: 222 GQYSTMIFADEAQRIIRQHNTKQPLFLYLSFQAVHVP-----LLVPPSYEDQYRSLIEDD 276
Query: 72 NRRTYAALTK-------STTLTLLIVYGWNDLSF 98
RR YAA+ + T TL WN+ F
Sbjct: 277 KRRVYAAMASCMDEAIGNVTATLKEKGLWNNTVF 310
>gi|383110963|ref|ZP_09931781.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
gi|313694533|gb|EFS31368.1| hypothetical protein BSGG_2068 [Bacteroides sp. D2]
Length = 458
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 183/344 (53%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYCDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG+ V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGSAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 CWKKAEKQYKNLSSQVTGFVDLVPTLKELV-GDHSRPKREYDGI 351
>gi|299147176|ref|ZP_07040243.1| arylsulfatase B [Bacteroides sp. 3_1_23]
gi|298515061|gb|EFI38943.1| arylsulfatase B [Bacteroides sp. 3_1_23]
Length = 458
Score = 192 bits (489), Expect = 3e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|423214938|ref|ZP_17201466.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692201|gb|EIY85439.1| hypothetical protein HMPREF1074_02998 [Bacteroides xylanisolvens
CL03T12C04]
Length = 458
Score = 192 bits (489), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|402820941|ref|ZP_10870501.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
gi|402510173|gb|EJW20442.1| hypothetical protein IMCC14465_17350 [alpha proteobacterium
IMCC14465]
Length = 496
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 121/323 (37%), Positives = 172/323 (53%), Gaps = 29/323 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ DL F GS +I TPN+D LA GI+LN Y+ P+CTP+R++LMT + PI G +
Sbjct: 34 GYADLGFRGS-DIQTPNLDRLAAEGIVLNRFYSLPICTPTRSALMTARDPIKLGTAYAGL 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
E GV E F+PE ++ GY T IGKWH+G P +RGF+ FG+LN +
Sbjct: 93 QPWENGGVSPDEHFMPESFKKAGYQTAMIGKWHIGRQYESLVPHHRGFDHFFGHLNTQVD 152
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ-LIEDQPVDKPLFL 270
YY H GHD++ N + YATD+ E+V+ L E + KP L
Sbjct: 153 YYTH---------ASAGGHDLQENGKSLKRDA--YATDIHGDESVRYLKEIRDPSKPFLL 201
Query: 271 YLAHLAAHA-GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
Y+ LA H+ A + LEA ++ F +RTYAAMV +D ++G ++ L R+G
Sbjct: 202 YVPFLAPHSPMQAPEALEAKYKSRLNFPV----PKRTYAAMVDSMDQAIGRIMDELDRQG 257
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
+ +N+II+F SDNG + +G N PYRG K +EGG++V AI+ P
Sbjct: 258 VSDNTIILFFSDNGG----------FAGFGGVNKPYRGGKLETFEGGIRVNAIMRWPGAI 307
Query: 389 QNPRVSLQMMHISDWLPTLYTAA 411
+ Q + + D PTL AA
Sbjct: 308 PAGSKTAQAISVMDVFPTLAAAA 330
>gi|326428402|gb|EGD73972.1| arylsulfatase B [Salpingoeca sp. ATCC 50818]
Length = 545
Score = 192 bits (488), Expect = 4e-46, Method: Compositional matrix adjust.
Identities = 161/571 (28%), Positives = 248/571 (43%), Gaps = 109/571 (19%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT------ 144
+G+++ E TPN+D LA +G++L+ Y+ CTPSR+S ++G+ P+H
Sbjct: 51 WGFHNFGIRNQTEAKTPNMDKLARDGLLLDQAYSYFWCTPSRSSFLSGRLPLHVFHSNRV 110
Query: 145 -----GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Q P G G+P +P ++R+ GY T +GKW G +++PL RGF
Sbjct: 111 SSASWDSQHPDTAGV---GIPRNMTTIPAFMRKAGYKTHMVGKWDAGIATPQHSPLGRGF 167
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGH---------DMRRNLSTAWDTVGEYATDL 250
+S Y N +YY + +D S + D+ + A +G Y +
Sbjct: 168 DSSLHYFNHDNNYYAYNYTDTVSVQFPVKCQLLKYVTGFVDLWNSTEPADAPIGTYEEHV 227
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
F A+ +I PLFLY A AHA L+ PQ +++FQ+I DP RRTY AM
Sbjct: 228 FRDHALDVISKHDASTPLFLYYASHIAHAP-----LQVPQAYLDRFQHIPDPIRRTYQAM 282
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG--SNYPYRGVK 368
V DD VG + +AL+ KGM EN++++ SDNG + + + G SN+P RG K
Sbjct: 283 VLAADDVVGNITAALKAKGMWENTLLVVHSDNGGAIYQCLDGMDASLCGGASNWPLRGGK 342
Query: 369 NTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ ++GG++V A + + + R V ++ ++DW+ T AG
Sbjct: 343 LSPFQGGIRVNAFVNGGFLPSHRRGQVETGLVVLADWMATFCELAG-------------- 388
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWS--SLL--LNTPSRRNSVLINIDEKKRTAAVRLDSWK 482
+ P + +N+ +D S SLL N+ S R I D+ + + WK
Sbjct: 389 ----VACEDPIAKKANLPAIDSQSLWSLLSGANSTSPRVEYPITFDQ---SPLIVRPPWK 441
Query: 483 LVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
L++G YGQ K P
Sbjct: 442 LIVGDNLQ------YGQWTGPKYP------------------------------------ 459
Query: 543 RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY--- 599
AT G PA CT G C LFN+ DP E +++A+ PD+ + L + L Y
Sbjct: 460 --NATTTGGGVPAGAV--NCTAG-C-LFNIEQDPTEHHDVAAQHPDVVADLKQRLAYWQS 513
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSPWI 630
H T P P +A R+N + P+I
Sbjct: 514 HTLTWKP-GWPIPQACEAYIHRYNGYYGPFI 543
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 37/69 (53%), Gaps = 5/69 (7%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
+G Y +F A+ +I PLFLY A AHA L+ PQ +++FQ+I DP
Sbjct: 220 IGTYEEHVFRDHALDVISKHDASTPLFLYYASHIAHAP-----LQVPQAYLDRFQHIPDP 274
Query: 72 NRRTYAALT 80
RRTY A+
Sbjct: 275 IRRTYQAMV 283
>gi|336415441|ref|ZP_08595781.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
gi|335941037|gb|EGN02899.1| hypothetical protein HMPREF1017_02889 [Bacteroides ovatus
3_8_47FAA]
Length = 458
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|149197521|ref|ZP_01874572.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139539|gb|EDM27941.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 193/348 (55%), Gaps = 32/348 (9%)
Query: 77 AALTKSTTLTLLIV-----YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMY-AQPVCT 129
A+L+ S +IV G+ D+S+HG+ E TP+ID++A +G N Y A PVC
Sbjct: 14 ASLSASAAKPNIIVILADDLGYGDVSYHGTLKETTTPHIDSIAQSGAWFQNGYSAAPVCG 73
Query: 130 PSRASLMTGKYPIHTGMQ---GPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHL 185
PSRA L++G+Y G GP + G+PL+++ +PE L + GY+T +GKWH
Sbjct: 74 PSRAGLLSGRYQQRFGYYDNIGPFTLNKDVEAGLPLSQKLIPEILVKEGYATGMVGKWHD 133
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G + ++ P RGF+ +G+ NG I+ + +L + + TV+ G R N + GE
Sbjct: 134 GD-QHKFWPYNRGFQEFYGFNNGAINNW--VLKGE-NHTVDEWGAVHRENKRV--ENSGE 187
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
Y T+ F +EAV+ I D+ +P FLYL+ A H L+AP+ NQF++I NR
Sbjct: 188 YMTEAFGREAVEFI-DRHKTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIKPENRA 241
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
AM+K +DD++G V+ L+++G+ EN+II F SDNG N+ N YR
Sbjct: 242 LCLAMLKSMDDNIGLVLEKLRKEGLEENTIIFFTSDNGGKLK--------GNYSFNGKYR 293
Query: 366 GVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
G KNT+++GG+ VP A+ W QI + +H D T++ AAG
Sbjct: 294 GEKNTVFDGGLHVPYAVQWKAQIPAQTKALEAPVHSIDLAHTIFAAAG 341
Score = 50.4 bits (119), Expect = 0.003, Method: Compositional matrix adjust.
Identities = 30/81 (37%), Positives = 44/81 (54%), Gaps = 6/81 (7%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
+ GEY T+ F +EAV+ I D+ +P FLYL+ A H L+AP+ NQF++I
Sbjct: 183 ENSGEYMTEAFGREAVEFI-DRHKTEPFFLYLSFNAVHG-----PLQAPKSYTNQFKHIK 236
Query: 70 DPNRRTYAALTKSTTLTLLIV 90
NR A+ KS + +V
Sbjct: 237 PENRALCLAMLKSMDDNIGLV 257
>gi|293372058|ref|ZP_06618453.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
gi|292632962|gb|EFF51547.1| arylsulfatase [Bacteroides ovatus SD CMC 3f]
Length = 458
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|423290535|ref|ZP_17269384.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
gi|392665922|gb|EIY59445.1| hypothetical protein HMPREF1069_04427 [Bacteroides ovatus
CL02T12C04]
Length = 458
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|221119831|ref|XP_002168522.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 223
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 97/229 (42%), Positives = 143/229 (62%), Gaps = 7/229 (3%)
Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
MQ I+ A GV L E+FLP+YL+ +GY T AIGKWHLGFF +EYTP YRGF+S +GY
Sbjct: 1 MQSDTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGY 60
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
G Y+DH L+ ++L+ +D + ++ G Y+T +++ EA+ I +
Sbjct: 61 YGGQADYWDHSLASNGWWGLDLH-YDTPSSSKNIFNQWGNYSTAMYSMEAIDRIRNHNST 119
Query: 266 KPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
+P+FLYLA+ A H+ N ++ L+APQE +++F +I R+ YAAM+ +D +G + S
Sbjct: 120 QPMFLYLAYQAVHSANLREYPLQAPQEWVDKFSHIKHKGRQNYAAMLGVMDHWIGQITSE 179
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
L ++ ML++SIIIF SDNG P NW +NYP RG K T++E
Sbjct: 180 LLKQNMLDDSIIIFTSDNGGPANGLN-----NNWATNYPLRGTKTTVYE 223
Score = 58.5 bits (140), Expect = 1e-05, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 49/78 (62%), Gaps = 4/78 (5%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQET 61
+N+ W G Y+T +++ EA+ I + +P+FLYLA+ A H+ N ++ L+APQE
Sbjct: 91 KNIFNQW---GNYSTAMYSMEAIDRIRNHNSTQPMFLYLAYQAVHSANLREYPLQAPQEW 147
Query: 62 INQFQYITDPNRRTYAAL 79
+++F +I R+ YAA+
Sbjct: 148 VDKFSHIKHKGRQNYAAM 165
>gi|313212736|emb|CBY36668.1| unnamed protein product [Oikopleura dioica]
Length = 602
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 161/595 (27%), Positives = 255/595 (42%), Gaps = 92/595 (15%)
Query: 54 HLEAPQETINQFQYITDPNRRTYA--ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNI 109
HLE Q ++ F + T++ L K + ++V G++DL + + ++ +PNI
Sbjct: 14 HLEKEQGAMSSFFFYLFLFCSTFSQKVLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNI 72
Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
DALA + + L Y QP CTPSRA+ +TG+Y I GMQ I EP G+PL E L E
Sbjct: 73 DALAKDALHLKKHYVQPSCTPSRAAFLTGRYNIRMGMQSGVIRAPEPEGIPLRETLLSEA 132
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
++ GY T GKWHLGF+ ++ P RGF+ +G+ G +Y H NG
Sbjct: 133 FKQCGYRTSLQGKWHLGFYTYKHCPQNRGFDRFYGFYLGSQDFYFH--DSGRLEAYPGNG 190
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
+ + T G Y+T LF + + + PLF Y++ H P
Sbjct: 191 DVENDTILDDFHTNGTYSTKLFVDDFINDLAKHDPAVPLFNYVSFQDVH---------GP 241
Query: 290 QETINQFQYITDPNRRTY-------AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
+T+N+F+ + T+ + + LD +G +++AL+ K N++++F SDN
Sbjct: 242 LQTVNKFKKHYENKTDTWTHERILISTKITTLDHHIGRMVTALKEKNYWNNTVLVFTSDN 301
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ----MM 398
G E SN+P RG K T+++GG+K A + SP +Q R+ Q +
Sbjct: 302 GGQPNE---------GASNWPLRGSKGTIYDGGLKSRAFVASPLLQN--RMKGQNYNYLF 350
Query: 399 HISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNT 456
H+SDW PTL T +G D R +DG + L + P + RN + LD S +
Sbjct: 351 HVSDWFPTLLTLSGCDVPRKEGRPLDG--KIHELFVYKPKKARNEMLHYLDPLSQFDYDD 408
Query: 457 PSRRNSVLINIDEKKRTAAVRLDSWKLVLGT-QENGTMDGYYGQTRSNKVPLLNFNAIVE 515
P + A+ R WK++ G NG GY + R N I
Sbjct: 409 PRHFEVLDDRAFNTTIKASYRQGRWKIITGKPCANGC--GYIKKERPWNYAKNNMKLIDM 466
Query: 516 SKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGND 575
K N +L+++ D
Sbjct: 467 PKE--------------------------------------------QNKNVWLYDMHKD 482
Query: 576 PCEQNNIASSRPDISSQLY-ELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
E+ + ++ +P + ++L +L + ++PQ E D A+PK W PW
Sbjct: 483 KSEKFDRSAQKPWVVNRLLTKLADLYDEQVMPQRREASD--AANPKHHGGVWMPW 535
>gi|294053963|ref|YP_003547621.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613296|gb|ADE53451.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 478
Score = 192 bits (488), Expect = 5e-46, Method: Compositional matrix adjust.
Identities = 125/332 (37%), Positives = 177/332 (53%), Gaps = 28/332 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ-GP 149
G+ D F G+ +I TPN+D LA +G+I N Y C PSRA + G+Y G +
Sbjct: 33 GYADAGFTGATDILTPNLDKLAESGVIFNQGYVTHAFCGPSRAGFLAGRYQHRFGFEHNT 92
Query: 150 PIWGAEP-RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P A P G+ + E P L+++GY+T IGKWHLG + PL RGF+ +G+L G
Sbjct: 93 PYDPANPLAGIDVRETLFPARLQDVGYTTGIIGKWHLGA-SSPFYPLNRGFDYFYGFLTG 151
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
Y++ I Q ++ L G + RN A Y T +++AVQ + D + P
Sbjct: 152 GHDYFE-IDVTQPVKSAYLQG--LFRNKRVA--NFEGYLTTALSRDAVQFVNDNK-ENPF 205
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL+L++ A H + L+APQE I ++ +I D RR YAAMV +D +G V++AL R
Sbjct: 206 FLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRRVYAAMVDVMDRGIGEVVTALDRN 260
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
G+ EN++I F+SDNG P V ++ N SN P+RG K +EGGV VP I W +I
Sbjct: 261 GLRENTLIFFLSDNGGP-VSSKKHPGKGNGSSNGPFRGGKTDFYEGGVHVPFIASWPAKI 319
Query: 388 Q-----QNPRVSLQMMHISDWLPTLYTAAGGD 414
Q P +SL + T AGGD
Sbjct: 320 QPGQVYDKPVISLDIGR------TAVELAGGD 345
Score = 48.5 bits (114), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 25/66 (37%), Positives = 39/66 (59%), Gaps = 6/66 (9%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T +++AVQ + D + P FL+L++ A H + L+APQE I ++ +I D RR
Sbjct: 184 YLTTALSRDAVQFVNDNK-ENPFFLFLSYNAPH-----QPLQAPQEDIARYAHIKDKKRR 237
Query: 75 TYAALT 80
YAA+
Sbjct: 238 VYAAMV 243
>gi|443698985|gb|ELT98690.1| hypothetical protein CAPTEDRAFT_103525, partial [Capitella teleta]
gi|443734460|gb|ELU18442.1| hypothetical protein CAPTEDRAFT_129771, partial [Capitella teleta]
Length = 333
Score = 192 bits (487), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 111/330 (33%), Positives = 172/330 (52%), Gaps = 29/330 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++D+ + I TPN+DALA G+IL N Y Q +CTPSR +LMTG+YP + MQ I
Sbjct: 14 GYHDIGLRNPDVI-TPNLDALASKGVILTNNYVQALCTPSRHALMTGRYPSASAMQTSVI 72
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ L + LP+YL++LGY +GKWHLG+ R E P RGF++ +G G
Sbjct: 73 LPMRAKCTGLEYKLLPQYLKDLGYKNHMVGKWHLGYCRDECLPTSRGFDTFYGLYAGTGD 132
Query: 212 YYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
Y+ H +Y + ++ + + +++ + DL + ++ ++ P
Sbjct: 133 YWSHTFFGKYDWHTNADIDFEANSTHSQVRSSY--MNFVLQDLEMERLDKVFDEHDSKDP 190
Query: 268 LFLYLA----HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
LFLY A H + + +L P++ N RR Y + LD VG ++
Sbjct: 191 LFLYFAPQNPHTPSQPTDEFLNL-YPEDKFNDI-------RRKYLGLTSGLDAMVGKIVE 242
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L GM+ N+ +IF+SDNGA E G N P+RG K++L+EGG K + ++
Sbjct: 243 KLIENGMMNNTYVIFVSDNGADPPE----------GLNTPFRGGKSSLFEGGTKSNSFIY 292
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
SP +++ + +MHI+DWLPTL AGG
Sbjct: 293 SPLLKKTEYENDGLMHITDWLPTLVRLAGG 322
>gi|319954018|ref|YP_004165285.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422678|gb|ADV49787.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 434
Score = 192 bits (487), Expect = 6e-46, Method: Compositional matrix adjust.
Identities = 138/407 (33%), Positives = 204/407 (50%), Gaps = 54/407 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GP 149
GW D+ F+G+ +IPTPN+D LA G+I +N Y + P C+PSRA L+TG+Y G
Sbjct: 8 GWADVGFNGATDIPTPNLDRLASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCNM 67
Query: 150 PIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
P G G PL+E+ + E L+E GY T AIGKWHLG Y P +GF+ FG+
Sbjct: 68 PYDGKNDASVGTPLSEKMISEALKEQGYRTSAIGKWHLGDHPDLYPPA-QGFDHWFGFPG 126
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G ++Y+ S +T+ RN + Y TD FT EA++ I Q +KP
Sbjct: 127 GGMNYWGE--SKNEIQTI-------YRNRKVVPEEELTYLTDDFTTEAIRFIT-QKDEKP 176
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
F+YLA+ A HA + +A +E + + ++I R YAAMV +D +VG + S L
Sbjct: 177 FFMYLAYNAPHAPD-----QATKEYLEKTKHIEYAGRSVYAAMVNAVDANVGKIDSTLIA 231
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQ 386
G+ EN+I++F+SDNG +E+ + N PYRG K L+EGG+KVP I W +
Sbjct: 232 NGLKENTILVFLSDNGG-RIEHAD---------NRPYRGHKGMLFEGGIKVPFFITWPTK 281
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
I++ + + + D PT AAGG +++ +DG+
Sbjct: 282 IKEKYAYNEPISSL-DLFPTFLNAAGGK----------------------AKKERQLDGV 318
Query: 447 DQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTM 493
D +L T + L AVR ++KL +N T+
Sbjct: 319 DLLPFILQKTQETPHETLFWRSSGNFEYAVRKGNYKLYKSAYKNKTL 365
Score = 42.4 bits (98), Expect = 0.72, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 39/69 (56%), Gaps = 6/69 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD FT EA++ I Q +KP F+YLA+ A HA + +A +E + + ++I R
Sbjct: 156 YLTDDFTTEAIRFIT-QKDEKPFFMYLAYNAPHAPD-----QATKEYLEKTKHIEYAGRS 209
Query: 75 TYAALTKST 83
YAA+ +
Sbjct: 210 VYAAMVNAV 218
>gi|423294191|ref|ZP_17272318.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
gi|392676448|gb|EIY69884.1| hypothetical protein HMPREF1070_00983 [Bacteroides ovatus
CL03T12C18]
Length = 458
Score = 191 bits (486), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 181/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP +DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPCLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETVADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L TKEA+ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITKEAIHCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKVTYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|86141258|ref|ZP_01059804.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
gi|85831817|gb|EAQ50272.1| arylsulfatase B precursor [Leeuwenhoekiella blandensis MED217]
Length = 461
Score = 191 bits (486), Expect = 8e-46, Method: Compositional matrix adjust.
Identities = 125/359 (34%), Positives = 193/359 (53%), Gaps = 43/359 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND SFHGS EI TPN+D LA G+ L+ Y P C+P+RASL+TG+ G+ P I
Sbjct: 50 GWNDFSFHGS-EIQTPNLDQLAGKGLTLDRFYTYPTCSPARASLLTGRPASRMGIVAP-I 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G +P + LP+ L +L Y T +GKWHLG + E P GF+ +G+L+G +
Sbjct: 108 SGRSELNLPDSITTLPQALSKLNYKTALMGKWHLGL-KPESGPEVYGFDFSYGFLHGQL- 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY------ATDLFTKEAVQLIEDQPVD 265
DQY+ T +N + W G++ TDL T+ AV I+ D
Sbjct: 166 -------DQYAHTY--------KNGDSTWYRNGKFISEKGHVTDLLTQSAVHYIDTLQTD 210
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+ +L +A+ A H L+ PQE + ++ I D +RR YAA + +D +G ++ L
Sbjct: 211 QNFYLQVAYSAPHIP-----LQEPQEWLEKYTGIKDSSRRAYAAAMTHMDAGIGEILQKL 265
Query: 326 QRKGMLENSIIIFMSDNGA-----PTVEY--RETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ K + +N++++F+SDNGA P +Y + NY + GSN P R K + +EG ++V
Sbjct: 266 KDKDLEKNTVVLFVSDNGAQEKWVPNTQYDGKYGPNY-SLGSNLPLRDFKTSNYEGALRV 324
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
PAI+ P+ N S ++++DW+PT A + LP ++G++ + LL T S
Sbjct: 325 PAIISWPE-NLNSGTSTNYINVTDWMPTFLNWANAE--ELPSTVEGVN--AQHLLKTAS 378
Score = 40.4 bits (93), Expect = 2.6, Method: Compositional matrix adjust.
Identities = 23/64 (35%), Positives = 35/64 (54%), Gaps = 5/64 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
+ TDL T+ AV I+ D+ +L +A+ A H L+ PQE + ++ I D +RR
Sbjct: 191 HVTDLLTQSAVHYIDTLQTDQNFYLQVAYSAPHIP-----LQEPQEWLEKYTGIKDSSRR 245
Query: 75 TYAA 78
YAA
Sbjct: 246 AYAA 249
>gi|325285341|ref|YP_004261131.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
gi|324320795|gb|ADY28260.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga lytica DSM 7489]
Length = 460
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 185/346 (53%), Gaps = 33/346 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW D+ F+G+ +IPTPN+D +A G+I +N Y + P C+PSRA L+TG+Y G
Sbjct: 37 GWADVGFNGATDIPTPNLDRIASEGVIFSNGYVSHPYCSPSRAGLLTGRYQARFGHDCNM 96
Query: 151 IWGAE---PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ E G PL+E+ + E L+E GY T AIGKWH+G + P +GF+ FG+
Sbjct: 97 PYEGENDATVGTPLSEKLISEALKEQGYRTSAIGKWHIGDHPNLHPPA-QGFDHWFGFPG 155
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G ++Y+ S + + RN + Y TD FT EA+ I + + P
Sbjct: 156 GSMNYWGKATSKIQT---------IYRNTKPVAEEELTYLTDDFTNEAINFINKKDKN-P 205
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
F+YLA+ A HA + A ++ + Q ++I R YAAMV +D ++G + S L
Sbjct: 206 FFIYLAYNAPHAPD-----HATKKYLEQTKHIEYAGRSVYAAMVNAVDANIGKIDSTLVA 260
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
G+ +N+I++F+SDNG N PYRG K L+EGG+KVP + W +
Sbjct: 261 NGIKDNTILVFLSDNGGRIQH----------ADNRPYRGHKGMLFEGGIKVPFFISWPSK 310
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
I+++ + + + + D PTL AAGG+ + + +DG+D LL
Sbjct: 311 IKEDSKYTKPISSL-DLFPTLLNAAGGNAA-IEKQLDGVDLMPYLL 354
>gi|119626708|gb|EAX06303.1| hCG20717, isoform CRA_a [Homo sapiens]
Length = 427
Score = 191 bits (485), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 135/452 (29%), Positives = 211/452 (46%), Gaps = 73/452 (16%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+GKWHLGF+R+E P RGF++ FG L G YY H D + + G+D+ N + A
Sbjct: 2 VGKWHLGFYRKECMPTRRGFDTFFGSLLGSGDYYTHYKCD----SPGMCGYDLYENDNAA 57
Query: 240 WD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
WD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP ++
Sbjct: 58 WDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPGRYFEHYRS 112
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRN 357
I + NRR YAAM+ LD+++ V AL+ G NSIII+ SDNG PT
Sbjct: 113 IININRRRYAAMLSCLDEAINNVTLALKTYGFYNNSIIIYSSDNGGQPTAG--------- 163
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
GSN+P RG K T WEGG++ + SP ++ V +++HI+DW PTL + A G
Sbjct: 164 -GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNKGTVCKELVHITDWYPTLISLAEGQIDE 222
Query: 418 LPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVR 477
+ +DG D W ++ + R+ +D L + + + I +A+R
Sbjct: 223 -DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAGYGIWNTAIQSAIR 278
Query: 478 LDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNID 537
+ WKL+ G GY S+ VP +F SN+
Sbjct: 279 VQHWKLLTGNP------GY-----SDWVPPQSF-----------------------SNLG 304
Query: 538 KMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
R ++ T+ G + +LFN+ DP E+ ++++ P I +L L
Sbjct: 305 PNRWHNERITLSTGKS-------------VWLFNITADPYERVDLSNRYPGIVKKLLRRL 351
Query: 598 KYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
+T VP + D +++P+ W PW
Sbjct: 352 SQFNKTAVPVRYPPKD-PRSNPRLNGGVWGPW 382
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 44/80 (55%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWD-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N + AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 50 LYENDNAAWDYDNGIYSTQMYTQRVQQILASHNPTKPIFLYIAYQAVHS-----PLQAPG 104
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 105 RYFEHYRSIININRRRYAAM 124
>gi|374620849|ref|ZP_09693383.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374304076|gb|EHQ58260.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 551
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 130/426 (30%), Positives = 199/426 (46%), Gaps = 72/426 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ +HG N I TP++D LA G+ LN Y P+C+P+RA+LMTG+ P+ G+ I
Sbjct: 47 GWNDVGYHGGN-IDTPSLDKLAEQGVQLNRFYTTPICSPTRAALMTGRDPMRLGIAYGVI 105
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ GV E F+P+ + GY T +GKWHLG + Y P RGFE +G+L+ +
Sbjct: 106 LPWDNIGVNPAEHFMPQSFQAAGYQTAMVGKWHLGHAQMTYHPNQRGFEHFYGHLHTEVG 165
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+Y + G D + N + D G Y T L E + I D+ +KP F+Y
Sbjct: 166 FYPPF--------ANVGGKDFQEN-GVSIDDEG-YETYLLADEVSRYIRDRDEEKPFFIY 215
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----------TDPNRR------------TYA 308
+ +A H L+APQE +++ I TD RR YA
Sbjct: 216 MPFIAPHT-----PLDAPQELQEKYKDIETDLAPARSNQTDSTRRMAKLMMQPSARPMYA 270
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
A+V +D ++G V+ L +G+ +N+I++F SDNG Y N P RG K
Sbjct: 271 AVVDAMDQAIGQVLDTLDEEGLADNTIVLFFSDNGGAAYSY-------GGADNAPLRGGK 323
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
+EGG++V +++ P + + + Q+M + D PTL A G PLN
Sbjct: 324 GETFEGGIRVVSLMRWPGVLEPAQSFDQIMTVMDVFPTLAEATG----VTPLN------- 372
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL----DSWKLV 484
N DG WSS+ R+ +++ E + +L + WKLV
Sbjct: 373 -----------NFPFDGSSLWSSIKEGEIHERDELVMFASEIPIYGSFKLTAFDEEWKLV 421
Query: 485 LGTQEN 490
+++
Sbjct: 422 QEMEQD 427
>gi|296121469|ref|YP_003629247.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013809|gb|ADG67048.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 487
Score = 191 bits (484), Expect = 1e-45, Method: Compositional matrix adjust.
Identities = 118/320 (36%), Positives = 181/320 (56%), Gaps = 38/320 (11%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPI 142
LLIV G+ D+ FHG +IPTPN+DALA +G+ + Y P C+P+RA L+TG+Y
Sbjct: 40 LLIVGDDMGYADVGFHGCKDIPTPNLDALAKSGVQFTSGYVTGPYCSPTRAGLLTGRYQQ 99
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
G + P GA G+PLTE + + L+++GY+T +GKWHLG + P RGFE
Sbjct: 100 RFGHEFNPS-GAN-TGLPLTEVTIADRLKQVGYTTGLVGKWHLGS-QPAMHPQERGFEEF 156
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
G+L G S++D ++ + L GH+ + + +Y TDLF +EAV IE
Sbjct: 157 IGFLGGAHSFFD-------AQGI-LRGHEPVKTI--------DYTTDLFGREAVSFIEKH 200
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
DKP FLYL+ A H + A ++ + + I+D RRTYAAM+ +D+++G V+
Sbjct: 201 R-DKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQERRTYAAMMLAMDEAIGKVL 254
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
+ L+ G + ++++F+SDNG PT+ N N P RG K T EGG++VP ++
Sbjct: 255 TQLETTGQKQKTLVMFISDNGGPTMPGVTI----NGSINTPLRGSKRTTLEGGIRVPFVV 310
Query: 383 -W----SPQIQQNPRVSLQM 397
W +P + +P + L +
Sbjct: 311 SWPGKIAPAVFDSPVIQLDL 330
Score = 49.7 bits (117), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 28/66 (42%), Positives = 38/66 (57%), Gaps = 6/66 (9%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y TDLF +EAV IE DKP FLYL+ A H + A ++ + + I+D R
Sbjct: 183 DYTTDLFGREAVSFIEKH-RDKPWFLYLSFNAVHT-----PMHATEDRMAKLASISDQER 236
Query: 74 RTYAAL 79
RTYAA+
Sbjct: 237 RTYAAM 242
>gi|406830958|ref|ZP_11090552.1| sulfatase [Schlesneria paludicola DSM 18645]
Length = 441
Score = 190 bits (483), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 120/323 (37%), Positives = 173/323 (53%), Gaps = 30/323 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ FHG +IPTP+ID+LA +G ++ Y P C+P+RA L+TG+Y G + P
Sbjct: 40 GYADVGFHGGKDIPTPHIDSLAASGTRFSSGYVSGPYCSPTRAGLLTGRYQQRFGHEFNP 99
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G +G+PLTE + + L+ GY+T +GKWHLG ++ PL RGF FG+L G
Sbjct: 100 --GGANKGLPLTETTIADRLQAAGYATGLVGKWHLGT-DPKFHPLKRGFGEFFGFLAGHH 156
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+Y+D +D T ++ T Y TD F +EAV IE + P FL
Sbjct: 157 TYFDKQEADIQRGTTKV--------------TEPGYLTDAFGREAVSFIERHQ-NHPFFL 201
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLA A H LEA + +N F I RRTYAAM+ +DD++G V+++L+
Sbjct: 202 YLAFNAVHT-----PLEADDDRLNAFSSIDATPRRTYAAMLSAMDDAIGKVLASLRETAQ 256
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQ 389
EN++I F+SDNG PT+ N N P RG K T EGG++VP ++ W +
Sbjct: 257 DENTLIFFISDNGGPTMP----GTTINSSINAPLRGSKRTTLEGGIRVPFVVRWPGHVPA 312
Query: 390 NPRVSLQMMHISDWLPTLYTAAG 412
+ ++ + D PT AAG
Sbjct: 313 GAVYAHPVIQL-DIQPTALAAAG 334
Score = 45.4 bits (106), Expect = 0.091, Method: Compositional matrix adjust.
Identities = 29/65 (44%), Positives = 34/65 (52%), Gaps = 6/65 (9%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD F +EAV IE + P FLYLA A H LEA + +N F I RR
Sbjct: 178 YLTDAFGREAVSFIERHQ-NHPFFLYLAFNAVHT-----PLEADDDRLNAFSSIDATPRR 231
Query: 75 TYAAL 79
TYAA+
Sbjct: 232 TYAAM 236
>gi|326431091|gb|EGD76661.1| hypothetical protein PTSG_08011 [Salpingoeca sp. ATCC 50818]
Length = 511
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 115/333 (34%), Positives = 173/333 (51%), Gaps = 28/333 (8%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
+GWND F G+ + TP ID L GI LN Y Q VC+P+RA+LMTG+YP G+Q P
Sbjct: 24 HGWNDCGFAGT-RVKTPTIDTLRSEGIALNQHYVQKVCSPTRAALMTGRYPHRYGLQFP- 81
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G + E LP+Y++ GY+T+A+GKWHLGF ++TP +RGF+S +G+ +
Sbjct: 82 FCGGAAMALNSNETLLPQYMKSAGYTTRAVGKWHLGFTEWQFTPTFRGFDSFYGFYSCAE 141
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNL---------STAWDTVG----EYATDLFTKEAVQ 257
Y+ H L + S + D + A++ VG Y+T LF V
Sbjct: 142 DYFFHGLGFKNSSGAHVKSLDFHDDARPSCGADCSKAAFEAVGTDWQHYSTTLFAGRIVD 201
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDD 316
+++ +PLFLY A H N G E PQ +++ + D RR AA + +D
Sbjct: 202 IVDGHDPSQPLFLYFASQDTH--NPG---EVPQCYKDRYNTTVPDQVRRKLAAKLTTVDS 256
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAP-TVEYRETSNYRN---WGSNYPYRGVKNTLW 372
++ + A +R GM +N +I++ +DNG P TV+ + SNYP RG K+ +
Sbjct: 257 ALANITEAYKRNGMYDNLLIVYTADNGGPITVQSGPNKGKQEDSIGSSNYPLRGGKHNAY 316
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQM---MHISD 402
EGGV+ A L P ++ R+ + +HI D
Sbjct: 317 EGGVRSTAFLSGPALKTLARLVRKYQYKLHIGD 349
>gi|414070343|ref|ZP_11406329.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807260|gb|EKS13240.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 469
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 121/331 (36%), Positives = 177/331 (53%), Gaps = 20/331 (6%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL F GS EI TPNIDALA NG N Y P C PSR L+TG+Y GM+
Sbjct: 28 GYGDLGFTGSKEIKTPNIDALASNGTRFKNAYVTHPYCGPSRVGLLTGRYQARLGMENNV 87
Query: 151 IWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ + + G+PL+E L+++GY T GKWHLG + P RGF+ +G+L+G
Sbjct: 88 SYMPQDKYMGLPLSENTFANRLQDVGYHTSVFGKWHLGG-APHFQPNKRGFDYFYGFLDG 146
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+Y + DQ TV +G+ + +T EY T +++AV+ I Q + P
Sbjct: 147 GHNY----MPDQV--TVGGDGYSLPLMRNTQVTEFDEYLTTALSRDAVKYIHRQQ-ESPF 199
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
F+YL++ A H L+AP E I ++++I D +RR YAAM+ +D+ +G V+ AL+
Sbjct: 200 FMYLSYNAPHTP-----LQAPAEYIEKYKHIEDEDRRVYAAMIDVMDEGIGMVVDALKDI 254
Query: 329 GMLENSIIIFMSDNGAPTVE-YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
L+N++IIF+SDNG E ++ TS +W N P+R K L EGG+ VP I P +
Sbjct: 255 NQLDNTLIIFLSDNGGVFPESWQPTS---DWADNSPFRRGKAALLEGGIHVPFIAHWPGV 311
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
++ D T AG D +L
Sbjct: 312 IPKGEEFTGLVSSLDIAATSVALAGADNKKL 342
Score = 48.1 bits (113), Expect = 0.012, Method: Compositional matrix adjust.
Identities = 29/79 (36%), Positives = 45/79 (56%), Gaps = 9/79 (11%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR T +D EY T +++AV+ I Q + P F+YL++ A H L+AP E
Sbjct: 167 MRNTQVTEFD---EYLTTALSRDAVKYIHRQQ-ESPFFMYLSYNAPHTP-----LQAPAE 217
Query: 61 TINQFQYITDPNRRTYAAL 79
I ++++I D +RR YAA+
Sbjct: 218 YIEKYKHIEDEDRRVYAAM 236
>gi|345510992|ref|ZP_08790548.1| arylsulfatase B [Bacteroides sp. D1]
gi|229442597|gb|EEO48388.1| arylsulfatase B [Bacteroides sp. D1]
Length = 498
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP++DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 77 GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 135
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 136 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 195
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L T+EA++ I+ + P
Sbjct: 196 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 242
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 243 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 297
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GG PA+L
Sbjct: 298 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 348
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 349 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 391
>gi|295086308|emb|CBK67831.1| Arylsulfatase A and related enzymes [Bacteroides xylanisolvens
XB1A]
Length = 458
Score = 190 bits (482), Expect = 2e-45, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP++DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L T+EA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GGV PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGVHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +P L GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPMLKDLV-GDHSRPKREYDGI 351
>gi|390361328|ref|XP_780209.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 469
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 170/345 (49%), Gaps = 66/345 (19%)
Query: 92 GWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND+ +H + I T NIDALA G+ L + Y P+CTPSR+ ++GKY IH GMQ
Sbjct: 40 GYNDVGYHSDGSAIETDNIDALAAGGLKLESYYVAPLCTPSRSQFLSGKYLIHNGMQHLV 99
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I PR +PL + + L + GY+T +GKWHLGF+++E PL RGF+S FG L G
Sbjct: 100 IDPRVPRCLPLGDDTMANKLTDAGYATHLVGKWHLGFYKQECWPLNRGFQSFFGMLLG-- 157
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Q DQ KP+F+
Sbjct: 158 ----------------------------------------------QSTVDQGGTKPMFM 171
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
YL++ A H E P E ++ I + NRRTYA MV LD+S+G + L+ +G
Sbjct: 172 YLSYQAPHL-----PFEVPDEYFVSYRGKINNRNRRTYAGMVTMLDESIGKLTDTLKEEG 226
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +++ IF +DNG +N G+N+P RGVK +EGG++ + P +
Sbjct: 227 LWNDTVFIFSTDNGG--------VGKKNAGNNWPLRGVKGNYFEGGIRGVGFVAGPLLST 278
Query: 390 NPR--VSLQMMHISDWLPTLYTAAGGDT-SRLPLNIDGLDQWSSL 431
N + +S +MHISDW PTL T + L +DG++ W +
Sbjct: 279 NVQGTISTDLMHISDWYPTLVEGVAKVTLNHTELGLDGVNMWDVI 323
>gi|75910438|ref|YP_324734.1| twin-arginine translocation pathway signal protein [Anabaena
variabilis ATCC 29413]
gi|75704163|gb|ABA23839.1| Twin-arginine translocation pathway signal [Anabaena variabilis
ATCC 29413]
Length = 457
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 122/348 (35%), Positives = 186/348 (53%), Gaps = 30/348 (8%)
Query: 72 NRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
+R T + + L+ GW DLS +G + TPN+D LA G+ N YA Q VCTP
Sbjct: 33 SRATAQSSRPNVVFILVDDMGWGDLSIYGRTDYETPNLDRLARQGVRFTNAYANQTVCTP 92
Query: 131 SRASLMTGKYP--IHTGMQGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWH 184
+R + +TG+Y + G++ P ++P G+P + + L+ GY T +GKWH
Sbjct: 93 TRIAFLTGRYQARLPVGLREPLGARSQPASNNIGIPANQPTIASLLKANGYETALVGKWH 152
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
G+ + PL +GF+ +FG+L+G I Y+ H +D R ++L +D+ S
Sbjct: 153 AGY-PPNFGPLQKGFDEYFGHLSGGIEYFTHTGTD---RILDLYENDVPVQRS------- 201
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
Y TDLFT AV+ I+ +P +P +L L + A H G + +A Y ++
Sbjct: 202 GYVTDLFTDRAVEFIQ-RPHSRPFYLSLHYNAPHWPWQGPNDQASTAFYLTNGYTVGGSQ 260
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
TYAAMVK LDD VG V+ AL+ G +N+++IF SDNG + N+G P+
Sbjct: 261 ATYAAMVKSLDDGVGRVLDALEASGQADNTLVIFTSDNGG--------ERFSNFG---PF 309
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
RG K +L+EGG++VPAI+ P + Q +VS Q++ D T+ A G
Sbjct: 310 RGQKASLYEGGIRVPAIIRYPGVTQANQVSNQVIITFDLTATILAATG 357
>gi|313232487|emb|CBY24155.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 189 bits (481), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 126/399 (31%), Positives = 200/399 (50%), Gaps = 28/399 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++DL + ++++ +PNID LA N + + N Y QP CTPSRA+ MTG+Y I GMQ I
Sbjct: 35 GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP +PL+E LP+ ++ GY+T GKWHLGF+ ++ P RGF+ FG+ G
Sbjct: 94 KPDEPEAIPLSETLLPQAFKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H S E NG + R D G Y+T ++ + +++ + PLF +
Sbjct: 154 YFYHD-SGNNCYLYEPNGTEKVR-----LDLNGTYSTKAIAEDFIAKLDEYDPETPLFEF 207
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGML 331
L+ H G P E + R+ + + +D +G V+ L+ KG
Sbjct: 208 LSFQEVH----GPLQWLPAEVLKMAGGWEGWERKMLSRKIVIVDHFIGEVVKKLKEKGFW 263
Query: 332 ENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNP 391
+++I++ SDNG T R SN+P RG K ++EGG++ A + SP++ +
Sbjct: 264 KDTILVITSDNGGQT---------REGASNWPLRGRKGDVFEGGIRSRAFIHSPKLPDSL 314
Query: 392 RVS--LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS-RRNSNIDGLDQ 448
+ S + H++DW PTL +G + N+DG Q + N P +R ++ LD
Sbjct: 315 KGSSFQHVFHVTDWFPTLLRFSG--CQQPDSNLDGKAQ--DIFHNKPEVKRTHLLNFLDP 370
Query: 449 WSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLG 486
+ R SVL N E + +R ++WKL+ G
Sbjct: 371 FKVAKRKLDDRMFSVLQNRTFEVTVKSVIRTENWKLITG 409
>gi|149197396|ref|ZP_01874447.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139414|gb|EDM27816.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 465
Score = 189 bits (480), Expect = 3e-45, Method: Compositional matrix adjust.
Identities = 116/319 (36%), Positives = 174/319 (54%), Gaps = 28/319 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND+ F+G EIPTP ID++A NG+ N Y VC PSRA +TG+Y G + P
Sbjct: 32 GYNDVGFNGCTEIPTPGIDSIAQNGVKFTNGYTSYSVCGPSRAGFITGRYQQRFGFERNP 91
Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
W +P +P +E + E L ++GY IGKWHLG P RGF+ FG+L G
Sbjct: 92 QWNLTDPNSALPKSEMTIAESLTQVGYHCGIIGKWHLGA-EPSLRPNKRGFDEFFGHLGG 150
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT---VGEYATDLFTKEAVQLIEDQPVD 265
+ L Q++ V+ N D R+ T DT +Y T+ F+ EAV I+
Sbjct: 151 GHRFMPEDLVIQHTEEVK-NELDSYRSWITRNDTPVKTTKYLTEEFSDEAVSFIKRNH-Q 208
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
KP FL+L++ A H L+A ++ + +F +I DP R+TYAAMV +DD V V+ +L
Sbjct: 209 KPFFLFLSYNAPHLP-----LQATEKYLARFPHIKDPKRKTYAAMVSAVDDGVSQVMQSL 263
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV------P 379
+ + +N+I+ F+SDNG P+ ++N N+P +G K+ +WEGG +V P
Sbjct: 264 KETNIADNTIVFFLSDNGGPS--------HKNKSDNFPLKGQKSDVWEGGFRVPFAMQYP 315
Query: 380 AILWSPQIQQNPRVSLQMM 398
A + + Q+ +P SL +
Sbjct: 316 AAIQAKQVYDHPVSSLDIF 334
Score = 45.1 bits (105), Expect = 0.11, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y T+ F+ EAV I+ KP FL+L++ A H L+A ++ + +F +I DP R
Sbjct: 189 KYLTEEFSDEAVSFIKRNH-QKPFFLFLSYNAPHLP-----LQATEKYLARFPHIKDPKR 242
Query: 74 RTYAALTKST 83
+TYAA+ +
Sbjct: 243 KTYAAMVSAV 252
>gi|262405390|ref|ZP_06081940.1| arylsulfatase B [Bacteroides sp. 2_1_22]
gi|262356265|gb|EEZ05355.1| arylsulfatase B [Bacteroides sp. 2_1_22]
Length = 458
Score = 189 bits (480), Expect = 4e-45, Method: Compositional matrix adjust.
Identities = 122/344 (35%), Positives = 182/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP++DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RGF +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGFSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L T+EA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GG PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|350414673|ref|XP_003490384.1| PREDICTED: hypothetical protein LOC100748821, partial [Bombus
impatiens]
Length = 413
Score = 189 bits (479), Expect = 6e-45, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 175/346 (50%), Gaps = 55/346 (15%)
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
+V KLD SVG V+ AL+ +GMLENSI++FMSDNGAPT N GSNYP RG+KN
Sbjct: 1 VVSKLDQSVGDVMDALRSRGMLENSIVVFMSDNGAPT-----NGILSNQGSNYPLRGIKN 55
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
+ WEGG + A +WSP I+++ RVS QMM +SDWLPTL +AAG D +L
Sbjct: 56 SPWEGGTRGVAAIWSPLIRKSKRVSNQMMFMSDWLPTLLSAAGVDRIQL----------- 104
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
NIDG D W SL+ S R+ ++INID+ AA+R +K ++G E
Sbjct: 105 -----------GNIDGFDLWPSLVSGKISPRSEIVINIDDLADYAAIRRGDFKYIIGRTE 153
Query: 490 NGTM-DGYYGQTRSNKVPLLNFNAIVESKTYQSL-------------QQLSQNIFL---- 531
G G G P + + ++ SKT ++ ++ +NI +
Sbjct: 154 TGIAWLGASGDPSEGVSPRYDPHKVLYSKTGVAISGIITAKQAMELNKKKKENIRIVYDT 213
Query: 532 --------PISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIA 583
I + + R++A I C P PC LFN+ DPCE N+A
Sbjct: 214 SSKTNFQEKILTSEDIIEMRKKAQIKCNVTEKDKIPCEPMIAPC-LFNIEQDPCEMVNLA 272
Query: 584 SSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
P I + +L HR T++P S+ D +++P +N+TW+ W
Sbjct: 273 ERNPVILTIFETILIRHRLTVIPPSNLDGD-PRSNPSLWNNTWTSW 317
>gi|410617068|ref|ZP_11328044.1| sulfatase [Glaciecola polaris LMG 21857]
gi|410163337|dbj|GAC32182.1| sulfatase [Glaciecola polaris LMG 21857]
Length = 488
Score = 188 bits (478), Expect = 7e-45, Method: Compositional matrix adjust.
Identities = 117/318 (36%), Positives = 173/318 (54%), Gaps = 30/318 (9%)
Query: 75 TYAALTKSTTLTLLIV-----YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
YA TK+ + +IV G+ DL F GS EI TPNIDALA+ G++ +N Y P C
Sbjct: 24 AYAGQTKADSRPNIIVILADDLGYGDLGFTGSREIKTPNIDALAHKGVVFSNAYVTHPYC 83
Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSRA L+TG+Y GM+ + G+P+ E + +++ GY T IGKWH+G
Sbjct: 84 GPSRAGLLTGRYQARFGMEINAAHSPDDPFMGLPVDEPTFAKRMQKAGYKTAVIGKWHMG 143
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTV 243
+ P RGF+ +G+L G Y+ + +++YS + NG + N
Sbjct: 144 S-HPNFHPNNRGFDYFYGFLGGGHDYFPESVKVSNEEYSIPLSRNGKPAQLN-------- 194
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
EY T +KEA + +P +Y+A+ A H + LEA Q+ + ++Q+I D N
Sbjct: 195 -EYLTTAISKEAAEFA--MTTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDIN 246
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
RRTYAAMV +D +G +++ L++ G EN++I F+SDNG E ++ NW SN P
Sbjct: 247 RRTYAAMVDSMDQGIGRLVNGLKQSGKFENTLIFFLSDNGGVYPESWMPNS--NWASNKP 304
Query: 364 YRGVKNTLWEGGVKVPAI 381
+R K +L EGGV VP I
Sbjct: 305 FRRGKVSLTEGGVHVPFI 322
Score = 45.4 bits (106), Expect = 0.085, Method: Compositional matrix adjust.
Identities = 26/71 (36%), Positives = 40/71 (56%), Gaps = 7/71 (9%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
+ EY T +KEA + +P +Y+A+ A H + LEA Q+ + ++Q+I D
Sbjct: 193 LNEYLTTAISKEAAEFA--MTTSQPFMMYVAYNAPH-----QPLEATQKDLAKYQHIEDI 245
Query: 72 NRRTYAALTKS 82
NRRTYAA+ S
Sbjct: 246 NRRTYAAMVDS 256
>gi|388257120|ref|ZP_10134300.1| sulfatase [Cellvibrio sp. BR]
gi|387939324|gb|EIK45875.1| sulfatase [Cellvibrio sp. BR]
Length = 474
Score = 188 bits (477), Expect = 8e-45, Method: Compositional matrix adjust.
Identities = 122/347 (35%), Positives = 179/347 (51%), Gaps = 37/347 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS EIPTPN+D LA G++ Y + VC PSRA L+TGKYP G +
Sbjct: 47 GYADFGFQGSTEIPTPNLDQLAQEGVVFKQAYVSASVCGPSRAGLLTGKYPQRFGFEENN 106
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+ L + + YL E GY T IGKWH G R + PL RGF+
Sbjct: 107 VPGYMSSSGATGDDMGMRLDQLTMANYLAERGYRTSLIGKWHQGNEDR-FHPLKRGFDEF 165
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---YATDLFTKEAVQLI 259
FG+ G SY+ + SR + + RN ++ GE Y TD E ++ I
Sbjct: 166 FGFRGGARSYFPFTQAHPSSRREDF----LERN----FNNYGESPLYLTDALANETIEFI 217
Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
+ +P F +L+ A H +EA + ++QF ++T R+ YAAM+ +D ++G
Sbjct: 218 KRNK-HQPFFTFLSLSAPH-----YPMEAEKSDLDQFPHLTG-ERKIYAAMMLNMDRAIG 270
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
V+ LQ++G+ EN+++IF +DNG P+ N N P G K L EGG++VP
Sbjct: 271 RVLQTLQQEGLSENTLVIFTNDNGGPSDH--------NGSINLPLSGTKANLLEGGIRVP 322
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
I+ P + + M+ D LPT Y AAGGD ++ + DG+D
Sbjct: 323 MIMRWPGVTKPGSSYDPMVSTLDLLPTFYAAAGGDVRQV-TDWDGVD 368
>gi|323453557|gb|EGB09428.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 1605
Score = 188 bits (477), Expect = 9e-45, Method: Compositional matrix adjust.
Identities = 153/547 (27%), Positives = 240/547 (43%), Gaps = 69/547 (12%)
Query: 85 LTLLIVYGWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L+ G ND+ + + + TP ID LA G+ L Y+ +CTP+RA+LM+G YP+
Sbjct: 96 FVLVDDMGSNDVGYQSHDMVGVTPFIDGLAEQGVRLKEYYSMHMCTPARAALMSGHYPMR 155
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
GMQ I P G+P + E L+ LGY+T +GKW LG + + P+ RGF++ +
Sbjct: 156 IGMQLENIKPDSPWGMPRELTTMAETLKNLGYNTHGVGKWGLGHSQHGFLPVNRGFDTWY 215
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRR-------------NLSTAWDTVGEYATDL 250
GYL+ I YY H + TVE M +L D G ++++L
Sbjct: 216 GYLSDEIDYYSHEYPAPFE-TVEDGATVMASFTDYVFMERSRPYDLQYMPDLNGTHSSEL 274
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQY------ITDPN 303
+T+ Q+++ + PLF+Y A H G P+ + ++ I
Sbjct: 275 YTQRVQQIVKSANASREPLFVYYASQMTH----GPLDVPPRSAFSDAEWARIGADIKGTV 330
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
RR +A+++ LD SVG V+ ++R + +++ +F SDNG + G+NYP
Sbjct: 331 RRKFASLLLALDQSVGRVVEEMKRSDVWRDTLFVFASDNGG---------CWAQGGTNYP 381
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
RG K+ LWEGGVKVPA ++S I + R + H++DW PTL S L
Sbjct: 382 LRGGKHFLWEGGVKVPAFVYSELIPSSYRNRAHRGLFHVTDWYPTLVKLVASSNSVLAST 441
Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
D D+ ++ L + ++DG QW +L+ R V IN+ T
Sbjct: 442 NDDDDETTTAL-----EKVDDLDGFFQWEALMGLADPPRVEVPINVQSYAFTC------- 489
Query: 482 KLVLGTQENGTM---DGYYGQTRSNKVP----LLNFNAIVESKTYQSLQQLSQNIF---- 530
T NGT+ G P + NA + Y + L
Sbjct: 490 -----TGPNGTLLQDADSLGAVSDCADPWHILVRQGNATRAALRYGDWKLLVNEFAEPWY 544
Query: 531 -LPISNIDKMRSTRQQATIHCGANPAPMT---PSPCTNGPCYLFNLGNDPCEQNNIASSR 586
+P D + + G + PM + T YLF+L NDP E +++ +
Sbjct: 545 GVPAEGADALGLDLADGSADLGDDEGPMDNCGTAVGTEAHTYLFDLKNDPSETMDLSGEK 604
Query: 587 PDISSQL 593
P++ L
Sbjct: 605 PEVRDAL 611
>gi|21483362|gb|AAM52656.1| HL07902p [Drosophila melanogaster]
Length = 365
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 127/366 (34%), Positives = 182/366 (49%), Gaps = 39/366 (10%)
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
+ L+HLA H GN ++AP+E + +F +I DP RRTYA M+ LD SV I AL+ G
Sbjct: 1 MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISSLDKSVAQTIGALKDNG 60
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
ML NSII+ SDNGAPT+ + N GSNYPYRG K + WEGG++ LWSP +++
Sbjct: 61 MLNNSIILLYSDNGAPTIGI-----HSNAGSNYPYRGQKESPWEGGIRSAGALWSPLLKE 115
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
VS Q +H DWLPTL AAG SL + P +DG++ W
Sbjct: 116 RGYVSNQAIHAVDWLPTLAGAAG----------------VSLPQDLP------LDGINLW 153
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L N + +++ +DE ++ D+ K V G+ G D + G+ +N+ L
Sbjct: 154 PMLSGNEEPKPRTMIHVLDEVFGYSSYMRDTLKYVNGSSFKGRYDQWLGELETNEDDPLG 213
Query: 510 FNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHC----GANP--APMTPSPCT 563
+ + +Q L N L D++R R +AT C G NP + P
Sbjct: 214 -ESYEQHVLASDVQSLLGNRGL---TKDRIRQMRSEATETCPPIEGQNPLESHFKCEPL- 268
Query: 564 NGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFN 623
PC+ F+L DPCE+ N+A P QL + L+ R+T +P + +A+P N
Sbjct: 269 KAPCF-FDLAKDPCERYNLAQMYPLQLQQLADELEQIRKTAIPSARVPHSDSRANPTFHN 327
Query: 624 DTWSPW 629
W W
Sbjct: 328 GNWEWW 333
Score = 49.7 bits (117), Expect = 0.005, Method: Compositional matrix adjust.
Identities = 20/44 (45%), Positives = 29/44 (65%)
Query: 39 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKS 82
+ L+HLA H GN ++AP+E + +F +I DP RRTYA + S
Sbjct: 1 MVLSHLAVHTGNEDSPMQAPEEEVAKFPHIRDPKRRTYAGMISS 44
>gi|256421633|ref|YP_003122286.1| sulfatase [Chitinophaga pinensis DSM 2588]
gi|256036541|gb|ACU60085.1| sulfatase [Chitinophaga pinensis DSM 2588]
Length = 452
Score = 187 bits (476), Expect = 1e-44, Method: Compositional matrix adjust.
Identities = 127/348 (36%), Positives = 183/348 (52%), Gaps = 29/348 (8%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
+LI+Y G D++ +G+ ++ TPNID LA G++ + YA PVC+PSRASL+TG+YP
Sbjct: 30 VLIIYTDDQGTLDVNCYGAKDLHTPNIDRLAKEGVLFSQFYAAAPVCSPSRASLLTGRYP 89
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G+P ++ + E ++ GY+T IGKWH+G+ E P +GF+
Sbjct: 90 QRAQLDNNAPSEEGHAGMPGSQYTMAEMFKDGGYTTAHIGKWHIGY-SPETMPNQQGFDY 148
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE- 260
FG++ G I Y H Y N HD+ RN W+ G++ DL +E +E
Sbjct: 149 SFGFMGGCIDNYSH-----YFYWAGPNRHDLWRNGQEIWED-GKFFADLTVQEVNGFLEK 202
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
++ DKP FLY A H G QE Q+ RR YAA V +D+ +G
Sbjct: 203 NKRADKPFFLYWAINMPHYPLQG------QEKWRQYYKDLPAPRRMYAAAVSTMDEKIGQ 256
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V+ L R G+ EN+I++F SD G T E ++ G PYRG K +L+EGG++VPA
Sbjct: 257 VLQQLDRLGLAENTIVVFQSDQGHST----EDRSFGGGGFTGPYRGAKFSLFEGGIRVPA 312
Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLD 426
I+ W+ + +N V Q+ DW PTL AG LP IDG D
Sbjct: 313 IIRWTGHLPKN-EVRDQLCVNIDWYPTL---AGLCKVALPQRKIDGKD 356
>gi|388257121|ref|ZP_10134301.1| sulfatase [Cellvibrio sp. BR]
gi|387939325|gb|EIK45876.1| sulfatase [Cellvibrio sp. BR]
Length = 484
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 125/344 (36%), Positives = 185/344 (53%), Gaps = 29/344 (8%)
Query: 92 GWNDLSF-HGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGP 149
G+ND+ F +G EI TP +DALA G++ N Y P C PSRA L+TG+Y GM+
Sbjct: 39 GYNDVGFTNGQTEIKTPRLDALANEGVVFENGYVTHPYCGPSRAGLITGRYQARFGMENN 98
Query: 150 PIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ + + G+PLTE+ P L+E+GY T GKWHLG + P RGF+ +G+L+
Sbjct: 99 VTYSPDDKYMGLPLTEKTFPARLQEVGYKTAIFGKWHLGG-APHFQPNERGFDYFYGFLD 157
Query: 208 GVISYYD---HILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
G +Y H+ + Y + MR +D EY T +++A + IE
Sbjct: 158 GGHNYMPGEVHLGAGGYLLPI------MRNKGVAEFD---EYLTTALSRDAARYIERTSK 208
Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
++ P F+Y+++ A HA L+APQ + ++ +I D RRTYAAMV +D+ +G +I
Sbjct: 209 EQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEKRRTYAAMVDAMDEGIGMLID 263
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
AL + G L+N++I F+SDNG Y +Y W N P+R K L EGGV VP I
Sbjct: 264 ALDKSGELDNTLIFFLSDNGG-LYPYSWWPDY-TWADNSPFRFGKVALTEGGVHVPFIAH 321
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
W +I + R++ ++ D T AG +T L DG+D
Sbjct: 322 WPKKITKGKRLN-GLVSALDIAATSLALAGANTKDAKL--DGVD 362
Score = 45.1 bits (105), Expect = 0.12, Method: Compositional matrix adjust.
Identities = 24/70 (34%), Positives = 41/70 (58%), Gaps = 6/70 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
EY T +++A + IE ++ P F+Y+++ A HA L+APQ + ++ +I D
Sbjct: 189 EYLTTALSRDAARYIERTSKEQAPFFIYMSYNAPHA-----PLQAPQNYLEKYAHIKDEK 243
Query: 73 RRTYAALTKS 82
RRTYAA+ +
Sbjct: 244 RRTYAAMVDA 253
>gi|325109241|ref|YP_004270309.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969509|gb|ADY60287.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 485
Score = 187 bits (475), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 139/387 (35%), Positives = 197/387 (50%), Gaps = 65/387 (16%)
Query: 80 TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV-CTPSRASLM 136
T + LL+ G+ +L G+ +IPTP+ID+LA NG+ Y C+PSRA L+
Sbjct: 39 TDRPNIVLLLADDLGYGELGCQGNPQIPTPHIDSLAANGVRFRCGYVTAAYCSPSRAGLL 98
Query: 137 TGKYPIHTGMQGPPIWG--AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
TG+Y G + P +P G+PL E+ L L + GY+T +GKWHLG R +
Sbjct: 99 TGRYQSRFGYEQNPTGARNEDPELGLPLEEKTLARRLHDAGYATGLVGKWHLGGTAR-FH 157
Query: 194 PLYRGFESHFGYLNG----VISYYDHILSDQYSRTVELNGH-----DMRRNLST------ 238
PL RGF+ FG+L+ V Y+ + S R NG D R LST
Sbjct: 158 PLRRGFDEFFGFLHEGHFFVPPPYEGV-STFLRRRALPNGKTGRWGDERLMLSTHMGHDE 216
Query: 239 -AWDT------------VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
A+D Y TD FT+EA I + D+P FLYLA+ A H+
Sbjct: 217 PAYDANNPILRGGQPVEEAAYLTDAFTREACDFIA-RNQDRPFFLYLAYNAVHS-----P 270
Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
L+ + QF +I D RR +AAM+ LDDSVG V+ L++ + N+I+IF+SDNG P
Sbjct: 271 LQGADAYMQQFAHIADQQRRIFAAMLANLDDSVGEVLQTLEQHQLTSNTIVIFLSDNGGP 330
Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI-----QQNPRVSLQMMH 399
T RE ++ SN P RG K ++EGG++VP ++ W QI ++ P +SL
Sbjct: 331 T---RELTS-----SNAPLRGEKGDMYEGGIRVPVLMQWPGQIPAGRVEERPIISL---- 378
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLD 426
D PT AA + LP +DG++
Sbjct: 379 --DLFPTCLAAA---EAELPTELDGVN 400
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 26/65 (40%), Positives = 36/65 (55%), Gaps = 6/65 (9%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD FT+EA I + D+P FLYLA+ A H+ L+ + QF +I D RR
Sbjct: 237 YLTDAFTREACDFIA-RNQDRPFFLYLAYNAVHS-----PLQGADAYMQQFAHIADQQRR 290
Query: 75 TYAAL 79
+AA+
Sbjct: 291 IFAAM 295
>gi|294647729|ref|ZP_06725288.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294809280|ref|ZP_06767994.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
gi|292636934|gb|EFF55393.1| arylsulfatase [Bacteroides ovatus SD CC 2a]
gi|294443524|gb|EFG12277.1| arylsulfatase [Bacteroides xylanisolvens SD CC 1b]
Length = 458
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 121/344 (35%), Positives = 181/344 (52%), Gaps = 39/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHGS EI TP++DAL G+ L Y P+ TP+RA LMTG+YP G++ I
Sbjct: 37 GWGDVGFHGS-EIKTPSLDALVGEGVELERFYTSPISTPTRAGLMTGRYPNRFGVRSAVI 95
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKA-IGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+ E + + L GY +A IGKWHLG ++ + P+ RG +G+LNG I
Sbjct: 96 PPWREDGLDENEETMADMLARNGYKNRAIIGKWHLGHTKKVHYPMNRGLSHFYGHLNGAI 155
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+D +R EL+ H+ W+T + Y+T+L T+EA++ I+ + P
Sbjct: 156 DYFD------LTREGELDWHN-------DWETCHDKGYSTELITQEAIRCIDAYEKEGPF 202
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETI----NQFQYIT--DPNRRTYAAMVKKLDDSVGTVI 322
LY+A+ A H L+A ++ I + F +T + + TY+AMV +D +G ++
Sbjct: 203 MLYVAYNAPHT-----PLQAQEKDIKLYTDNFDSLTPKEQKKATYSAMVSCMDRGIGAIV 257
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++KG+++N+ IF SDNG V S+ P RG K W+GG PA+L
Sbjct: 258 DALKKKGIMDNTFFIFFSDNGTAGVP---------GSSSGPLRGHKFDEWDGGGHAPAVL 308
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +Q +S Q+ D +PTL GD SR DG+
Sbjct: 309 YWKKAEKQYKNLSSQVTGFVDLVPTLKDLV-GDHSRPKREYDGI 351
>gi|241244123|ref|XP_002402253.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215496292|gb|EEC05932.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 196
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 100/233 (42%), Positives = 132/233 (56%), Gaps = 52/233 (22%)
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
+G WHLG+++ YTP RGFES +GY N YY+H L
Sbjct: 11 LGAWHLGYYKEAYTPTERGFESFYGYYNFGEDYYNHTL---------------------- 48
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
D PLFLYL+HLA HAG ++AP+E + +F YI
Sbjct: 49 -------------------------DLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYI 83
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
NR YA V LD SVG V AL++K ML+N++++F+SDNGA + + E SN G
Sbjct: 84 GVKNRSLYAGAVDALDQSVGIVFQALEKKDMLQNTVLVFLSDNGA--IPWGEWSNA---G 138
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
SN+P RG K TLWEGGV+VPA +WSP +++ RVS Q+MH+SDW+PTLY+ AG
Sbjct: 139 SNWPLRGAKFTLWEGGVRVPAFVWSPLLRKPQRVSNQLMHVSDWVPTLYSLAG 191
Score = 54.3 bits (129), Expect = 2e-04, Method: Compositional matrix adjust.
Identities = 25/62 (40%), Positives = 37/62 (59%)
Query: 30 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLI 89
+ +D PLFLYL+HLA HAG ++AP+E + +F YI NR YA + ++ I
Sbjct: 45 NHTLDLPLFLYLSHLAVHAGTQYGPIQAPEENVQKFDYIGVKNRSLYAGAVDALDQSVGI 104
Query: 90 VY 91
V+
Sbjct: 105 VF 106
>gi|149197772|ref|ZP_01874821.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149138993|gb|EDM27397.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 441
Score = 187 bits (474), Expect = 2e-44, Method: Compositional matrix adjust.
Identities = 131/361 (36%), Positives = 193/361 (53%), Gaps = 42/361 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTG-MQGP 149
G +D S +GS ++ TP+ID++A+NGI Y A VC+PSRA L+TG+Y G +
Sbjct: 32 GSSDFSCYGSKQLLTPHIDSIAHNGIKFTQAYTASSVCSPSRAGLLTGRYQQTFGHLANI 91
Query: 150 P-----IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
P E G+P+TE L + L+ELGYST IGKWHLG + P RGF++ +G
Sbjct: 92 PHSKHSANDPELLGLPVTEITLADSLKELGYSTHCIGKWHLGE-ADHFHPNARGFDNFYG 150
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMR--RNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+L+G +Y+ EL G R RN A + G Y T++FT+EA+++I+++
Sbjct: 151 FLSGARTYF---------LGGELRGDMDRIMRNKEFAEPSSG-YTTEVFTQEAIRIIQEE 200
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
DKP F+YL+H A H ++A E I + + +P R+ Y+ ++K LDD G ++
Sbjct: 201 Q-DKPFFIYLSHNAVHGP-----MDAKDEDIMSYDF-KNPLRKKYSGLMKNLDDQTGLLL 253
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL+ EN++I FMSDNG PT N SN+P RG K + +EGG + P +L
Sbjct: 254 QALKDSKQYENTLIFFMSDNGGPTT--------HNGSSNWPLRGFKGSEFEGGNRTPFLL 305
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLDQWSSLLLNTPSRRN 440
P+ S + + D T AAGG+ T R ID L ++N P N
Sbjct: 306 QWPEKISAGLSSDKPIIAYDVFATCIQAAGGELVTDRTYHGIDLLP-----VINKPQETN 360
Query: 441 S 441
+
Sbjct: 361 A 361
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 28/82 (34%), Positives = 49/82 (59%), Gaps = 8/82 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
+ RN A + G Y T++FT+EA+++I+++ DKP F+YL+H A H ++A E
Sbjct: 171 IMRNKEFAEPSSG-YTTEVFTQEAIRIIQEEQ-DKPFFIYLSHNAVHGP-----MDAKDE 223
Query: 61 TINQFQYITDPNRRTYAALTKS 82
I + + +P R+ Y+ L K+
Sbjct: 224 DIMSYDF-KNPLRKKYSGLMKN 244
>gi|227540472|ref|ZP_03970521.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
gi|227239796|gb|EEI89811.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Sphingobacterium
spiritivorum ATCC 33300]
Length = 466
Score = 186 bits (472), Expect = 3e-44, Method: Compositional matrix adjust.
Identities = 118/377 (31%), Positives = 198/377 (52%), Gaps = 35/377 (9%)
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNG 116
T+NQ ++ T +++ ++++ G+ D +GS +IPTP+IDALA G
Sbjct: 3 TLNQIVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGG 62
Query: 117 IILNNMYAQP-VCTPSRASLMTGKYPIHTGMQ----GPPIWGAEPRGVPLTE--RFLPEY 169
I N Y VC PSRA L+ G+Y +G + P G + + + L++ R + +
Sbjct: 63 IRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQIQDIGLSDTVRTIADQ 122
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
++ GY T AIGKWH G ++ PL++GF FG++ G S++ RT
Sbjct: 123 MQSNGYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFFP-------IRTAIKQE 174
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
+ + + + Y TD+FT +A+ + Q DKP F+YL++ A H +EA
Sbjct: 175 EKILNDYTEVDEKDVYYLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEAT 228
Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
+ + QF + + +RR+YA ++ +DD +G ++ L++ G LEN+++IF++DNGA T
Sbjct: 229 PQKLAQFARLKNAHRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT--- 285
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
N N P RG+K + WEGG++VP I+ P+ Q + ++ D +PT
Sbjct: 286 ------NNGADNGPLRGLKGSKWEGGIRVPMIMHWPKQLQAGKDYKGLVSSLDLVPTCVA 339
Query: 410 AAGGDTSRLPLNIDGLD 426
AA G T R + +DG++
Sbjct: 340 AAQG-TVRSDMQLDGVN 355
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 39/68 (57%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD+FT +A+ + Q DKP F+YL++ A H +EA + + QF + + +RR
Sbjct: 191 YLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEATPQKLAQFARLKNAHRR 244
Query: 75 TYAALTKS 82
+YA L +
Sbjct: 245 SYAGLMSA 252
>gi|239626021|ref|ZP_04669052.1| arylsulfatase E [Clostridiales bacterium 1_7_47_FAA]
gi|239520251|gb|EEQ60117.1| arylsulfatase E [Clostridiales bacterium 1_7_47FAA]
Length = 483
Score = 186 bits (472), Expect = 4e-44, Method: Compositional matrix adjust.
Identities = 123/347 (35%), Positives = 187/347 (53%), Gaps = 27/347 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DLS GS ++ TPN+D LA G + YA VC+PSRA L+TG+YP TG++
Sbjct: 28 GYGDLSCMGSTDVCTPNLDILAAGGARFTDFYAGSAVCSPSRACLLTGRYPYMTGVRSIL 87
Query: 149 ---PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
G P G+P L++LGY+T +GKWHLG E P + GF+ G+
Sbjct: 88 GGIKTTTGLNP-GIPT----FASALKDLGYTTGMVGKWHLGAVP-ECRPTHMGFDYFCGF 141
Query: 206 LNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-P 263
L+GV Y+ HI + + +N HD+ N GEY+T+LF ++ ++ I +Q
Sbjct: 142 LSGVNDYFSHIHYTEANSHPGINPNHDLWENDERCLKYTGEYSTELFARKGLEFIREQVE 201
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
D P LY A A H + AP + + +F+++ + +R+ AAM+ +DD VG +++
Sbjct: 202 KDMPFALYCAFNAPH-----YPMHAPYKYLERFKHLPE-DRQIMAAMLSAVDDGVGEIMN 255
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYR----ETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
L+R+G+ ++II F SDNG P+ E R E +Y GS +G K +L++GG++VP
Sbjct: 256 YLKRRGIFNDTIIYFQSDNG-PSKESRNWLDERKDYYYGGSTGGLKGHKFSLFDGGIRVP 314
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
AI P + +V + +D PT AAGG+ S I G D
Sbjct: 315 AIFSWPAMVPAGQVISEPCMGTDIFPTFINAAGGNASD--YEISGCD 359
>gi|300771261|ref|ZP_07081137.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
gi|300761931|gb|EFK58751.1| N-acetylgalactosamine-4-sulfatase [Sphingobacterium spiritivorum
ATCC 33861]
Length = 466
Score = 186 bits (471), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 117/377 (31%), Positives = 197/377 (52%), Gaps = 35/377 (9%)
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNG 116
T+NQ ++ T +++ ++++ G+ D +GS +IPTP+IDALA G
Sbjct: 3 TLNQIVFLFTLFMLTTGSVSAQQRPNIILILSDDGGYEDFGCYGSQDIPTPHIDALAKGG 62
Query: 117 IILNNMYAQP-VCTPSRASLMTGKYPIHTGMQ----GPPIWGAEPRGVPLTE--RFLPEY 169
+ N Y VC PSRA L+ G+Y +G + P G + + + L++ R + +
Sbjct: 63 VRFTNSYVTASVCAPSRAGLLMGQYQQRSGFEHNVSDLPADGYQMQDIGLSDTVRTIADQ 122
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
++ GY T AIGKWH G ++ PL++GF FG++ G S++ RT
Sbjct: 123 MQSNGYETMAIGKWHQGN-ETKHHPLHKGFNHFFGFIGGHRSFFP-------IRTAIKQE 174
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
+ + + + Y TD+FT +A+ + Q DKP F+YL++ A H +EA
Sbjct: 175 EKILNDYTEVDEKDVYYLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEAT 228
Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
+ + QF ++ D RR+YA ++ +DD +G ++ L++ G LEN+++IF++DNGA T
Sbjct: 229 PQKLAQFAHLKDAQRRSYAGLMSAMDDGIGRIVEELKKTGQLENTLLIFLNDNGAAT--- 285
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
N N P RG+K + WEGG++VP I+ P+ + ++ D +PT
Sbjct: 286 ------NNGADNGPLRGLKGSKWEGGIRVPMIMHWPKQLPAGKDYKGLVSSLDLVPTCVA 339
Query: 410 AAGGDTSRLPLNIDGLD 426
AA G T R + +DG++
Sbjct: 340 AAQG-TVRSDMQLDGVN 355
Score = 47.4 bits (111), Expect = 0.024, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 39/68 (57%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD+FT +A+ + Q DKP F+YL++ A H +EA + + QF ++ D RR
Sbjct: 191 YLTDMFTDKAISYMR-QKRDKPYFIYLSYNAVHT-----PVEATPQKLAQFAHLKDAQRR 244
Query: 75 TYAALTKS 82
+YA L +
Sbjct: 245 SYAGLMSA 252
>gi|443704179|gb|ELU01354.1| hypothetical protein CAPTEDRAFT_182406 [Capitella teleta]
Length = 548
Score = 185 bits (470), Expect = 5e-44, Method: Compositional matrix adjust.
Identities = 149/535 (27%), Positives = 239/535 (44%), Gaps = 88/535 (16%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG++D + S +I TPNID L +GI N Y+ C+PSR+S ++G+Y +GMQ
Sbjct: 39 YGYHDAGYRNS-DIHTPNIDQLVADGISFTNAYSAQQCSPSRSSFLSGRYAYTSGMQHGV 97
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I + L FL +YL+EL Y+T A GKWHLG+ +E TP YRGF++ G +G
Sbjct: 98 ISDTAAHCMDLKYNFLSDYLKELNYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEG 157
Query: 211 SYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY+H Y NG + R+ S G ++ DL + + + D+ PLF
Sbjct: 158 KYYEHTTFQGMYDWH---NGTKLDRSAS------GIHSQDLIEADLLNQL-DENDGTPLF 207
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISA 324
Y A H +P + + + + D R+ Y +V +D +G V++
Sbjct: 208 YYAAFHNTH---------SPMQPKPEHEALYDDLDAPSTRKKYLGLVSGMDAVIGKVVAK 258
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L+ K M +N+ I F SDNG E G N P RG K++L++GG + S
Sbjct: 259 LKEKDMFDNTYIFFSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASS 308
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
P +++ + ++HI+DWLPT+ AGG+ + ID
Sbjct: 309 PLLKKKGVEADGLLHITDWLPTIVKLAGGEVP----------------------ESDGID 346
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTA-------AVRLDSWKLVLGTQENGTMDGYY 497
G++Q + PS+R ++ NID + A R +KL+ G + G DGY
Sbjct: 347 GIEQVDLITNAGPSKRTDMIYNIDMELEIGTPDFGAMAARDLRYKLIWGFE--GRSDGYG 404
Query: 498 GQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKM-----RSTRQQATIHCGA 552
+ I E + + + + + N+DK +T+Q +
Sbjct: 405 ----------MGAQFIYEVDRVLDVVKEADAV---VENVDKRGGPYPMNTQQHHILRTYM 451
Query: 553 NPAPMTPS--PCTNGPCYLFNLGNDPCEQNNI-ASSRPDISSQLYELLKYHRRTL 604
+T GP LF+L DP E ++ AS+ P+ + +L++ R +
Sbjct: 452 QTKLVTLDDIEAGTGPMMLFDLQEDPNETTDLSASTDPEHQAAKLKLIEMLRSAI 506
>gi|449680623|ref|XP_004209633.1| PREDICTED: arylsulfatase B-like [Hydra magnipapillata]
Length = 327
Score = 185 bits (470), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 124/382 (32%), Positives = 196/382 (51%), Gaps = 68/382 (17%)
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYA 308
++T +A+ I + +PLFLYLA A H+ N+ L+APQE I++F YI RR YA
Sbjct: 1 MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
AM+ +D +G V AL K ML+NS+IIF +DNG P+ + +N+P RGVK
Sbjct: 61 AMLAYMDYGIGRVHEALIEKKMLDNSVIIFTTDNGGPSNGFNNNWA-----NNFPLRGVK 115
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
TL+EGGV+ ++S I+++ RVS ++H++DWLPTL + AGG+ S + +DG DQW
Sbjct: 116 ATLFEGGVRGVGFVYSKLIEKS-RVSHDLIHVTDWLPTLVSLAGGNVSDISF-LDGFDQW 173
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK-RTAAVRLDSWKLVLGT 487
++L + S S+RN +L+NIDE + A+R+ SWKL+
Sbjct: 174 ATL-----QNKQS----------------SQRNEILLNIDEHVWKNEALRVGSWKLIKEG 212
Query: 488 QENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQAT 547
+N DG+Y P S + M + +
Sbjct: 213 LKN--WDGWYP---------------------------------PPSYNESMDESLNSSI 237
Query: 548 IHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQ 607
+ CG + P+ + C + C LF++ DPCE +++ PD+ + + L +++++VP
Sbjct: 238 LKCGHD-FPVVINHCDDY-C-LFHVVEDPCEIVDLSKKFPDVLATMMYRLNIYKQSMVPP 294
Query: 608 SHEQPDLVQADPKRFNDTWSPW 629
+ ++DPK N W PW
Sbjct: 295 RNNMTIDSRSDPKLHNGVWQPW 316
Score = 55.8 bits (133), Expect = 6e-05, Method: Compositional matrix adjust.
Identities = 27/62 (43%), Positives = 39/62 (62%), Gaps = 1/62 (1%)
Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITDPNRRTYA 77
++T +A+ I + +PLFLYLA A H+ N+ L+APQE I++F YI RR YA
Sbjct: 1 MYTSKAISYITNHNCSEPLFLYLAFQAVHSSNSEVEPLQAPQEWIDKFSYIKHEQRRKYA 60
Query: 78 AL 79
A+
Sbjct: 61 AM 62
>gi|449137658|ref|ZP_21772978.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883711|gb|EMB14224.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 810
Score = 185 bits (470), Expect = 6e-44, Method: Compositional matrix adjust.
Identities = 122/338 (36%), Positives = 177/338 (52%), Gaps = 42/338 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G++D+ F+G EIPTP +D LA G++ N YA P C+PSRA L+TG+Y G +G
Sbjct: 40 GYSDVGFNGCKEIPTPRLDELAGEGVVFTNGYASHPYCSPSRAGLLTGRYQQRFGHEGNP 99
Query: 149 --PPIW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
P W G + G+PL+E L + L+E GY T AIGKWHLG + + P RGF+ FG+
Sbjct: 100 EPDPQWHGDDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRRGFDEWFGF 158
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
G +SY+ + V + R T Y TD F+ EAV+ I+ D
Sbjct: 159 SGGGLSYWGDLGRMDPLLGVHRGDEPVDRKSLT-------YLTDDFSTEAVKFIQRHETD 211
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
P FLYLA+ A HA + +A + + + +I R Y AMV +D+ +G V+ +
Sbjct: 212 -PFFLYLAYNAPHAPD-----QATRAHLEKTAHIEYGGRAVYGAMVAGMDEGIGRVVDQI 265
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ G+ +N++I+F SDNG R N PYRG K L+EGG++VP ++ P
Sbjct: 266 RESGLDDNTLIVFYSDNGG----------RREHAMNLPYRGHKGMLFEGGIRVPFLVSWP 315
Query: 386 QI------QQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
+Q+P +L D PT+ AAG D S+
Sbjct: 316 GTLPGGVREQSPVTAL------DVFPTVLAAAGLDPSK 347
>gi|422293430|gb|EKU20730.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
gi|422295486|gb|EKU22785.1| arylsulfatase B [Nannochloropsis gaditana CCMP526]
Length = 703
Score = 185 bits (469), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 141/424 (33%), Positives = 213/424 (50%), Gaps = 62/424 (14%)
Query: 86 TLLIVY---GWNDLSFHGSNEIP----TPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
L+IV G D+ ++ S E P TP +D+LA + L Y PVCTP+RA+L+TG
Sbjct: 64 VLMIVVDDAGVQDVGYNASPESPLRGKTPVLDSLAAESVRLKEYYVHPVCTPTRAALLTG 123
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLR-ELGYSTKAIGKWHLGFFRREYTPLYR 197
+Y ++ GM P I G G+ + LPE L+ E YST +GKWHLG + + PL R
Sbjct: 124 RYAVNVGMPFPLI-GDAISGLDGSIPTLPEMLKSEANYSTHLVGKWHLGAAKAKNRPLAR 182
Query: 198 GFESHFGYLNGVISYYDHILS---DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
GF+S +G L +Y + D + E+ ++ E+AT LF++E
Sbjct: 183 GFDSFYGLLGASFDHYTKKMGEVRDLWKNEAEVPAKEVDEK---------EHATTLFSRE 233
Query: 255 AVQLIEDQPVD----------KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
AV++IE+ PLFLYLA+ A HA L+A ++ + + + +R
Sbjct: 234 AVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQADEKFMKLCSDVPNRHR 288
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
RT+ AM+ +LD+ +G V ++L+ GM E+++++F SDNG + G NY Y
Sbjct: 289 RTFCAMMAQLDEGIGNVTASLKAHGMWEDTVVVFTSDNGG---------YFLVGGFNYNY 339
Query: 365 RGVKNTLWEGGVKVPAILWSPQ-IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
RG K+ WEGGV+VPA + +P+ PR M H++DW+PTL A G D
Sbjct: 340 RGGKSGGWEGGVRVPAFIRAPKRYNFAPRDFRGMAHVADWMPTLMGMATGREEGRERMGD 399
Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
G D + R G ++ S RN L+ +D + T+A R +KL
Sbjct: 400 GYDLRKAF-------RGDEEGGKEK---------SPRNEALLQLDIFQNTSAYRSGCFKL 443
Query: 484 VLGT 487
+LG
Sbjct: 444 LLGC 447
Score = 41.6 bits (96), Expect = 1.2, Method: Compositional matrix adjust.
Identities = 26/76 (34%), Positives = 42/76 (55%), Gaps = 15/76 (19%)
Query: 14 EYATDLFTKEAVQLIEDQPVD----------KPLFLYLAHLAAHAGNAGKHLEAPQETIN 63
E+AT LF++EAV++IE+ PLFLYLA+ A HA L+A ++ +
Sbjct: 224 EHATTLFSREAVKVIEEHSARGHAGAKDGDMDPLFLYLAYSAPHA-----PLQADEKFMK 278
Query: 64 QFQYITDPNRRTYAAL 79
+ + +RRT+ A+
Sbjct: 279 LCSDVPNRHRRTFCAM 294
>gi|241708194|ref|XP_002413315.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215507129|gb|EEC16623.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 294
Score = 185 bits (469), Expect = 8e-44, Method: Compositional matrix adjust.
Identities = 123/328 (37%), Positives = 163/328 (49%), Gaps = 42/328 (12%)
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLYLAH A H+GN G+ LE P+ Q++ I NR YA MV LD+SVG V AL
Sbjct: 1 PLFLYLAHQAPHSGNFGR-LEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVFEALD 59
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
++GML NSI++F SDNG E ET N GS +P RG K +L EGGV+ PA+LWSP
Sbjct: 60 KRGMLSNSIVVFSSDNGGAPDE--ETYNS---GSPWPLRGQKMSLSEGGVRAPALLWSPL 114
Query: 387 IQQNP-RVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ + + HI+DWLPT Y AGG S L IDG
Sbjct: 115 LSATEGSIFDGLFHITDWLPTFYELAGGSPSSL----------------------GAIDG 152
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
+ Q +L T + R +LINID ++ +A+ ++KLVLG E G + +Y
Sbjct: 153 VSQLGALTRQTTAPRKEILINIDPEENCSAIIEGNFKLVLGVAEQGLYENWY-------- 204
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQA---TIHCGANPAPMTPSPC 562
PL K Q ++ L + + +R A + CGA P
Sbjct: 205 PLSGHVDWESDKPRQECEKSVVARVLEHHGVRPVCGSRTGAYATPVQCGARDPSRECVP- 263
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDIS 590
T C LFNL DPCE NN+A P+++
Sbjct: 264 TKSLC-LFNLSKDPCEYNNVADQHPEVT 290
Score = 46.2 bits (108), Expect = 0.059, Method: Compositional matrix adjust.
Identities = 23/56 (41%), Positives = 33/56 (58%), Gaps = 1/56 (1%)
Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
PLFLYLAH A H+GN G+ LE P+ Q++ I NR YA + + ++ V+
Sbjct: 1 PLFLYLAHQAPHSGNFGR-LEVPETYTEQYRDIGSKNRTLYAGMVSALDESVGAVF 55
>gi|313219585|emb|CBY30507.1| unnamed protein product [Oikopleura dioica]
Length = 617
Score = 184 bits (468), Expect = 9e-44, Method: Compositional matrix adjust.
Identities = 137/421 (32%), Positives = 194/421 (46%), Gaps = 57/421 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ ++ S E TPNI+ LA NGIIL++ Y+QPVC+PSR+ +TG+Y GMQ I
Sbjct: 169 GYNDIGYN-SIEAFTPNINYLAKNGIILDSHYSQPVCSPSRSQFLTGRYSFRYGMQHRNI 227
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P GVPLTE+ LPE +E GYST GKWH G F Y P RGF+ G +G
Sbjct: 228 LPTQPHGVPLTEKMLPEVFKECGYSTFGTGKWHQGMFHESYLPTSRGFDKFVGSYSGSSQ 287
Query: 212 YYDH---ILSDQYSRTVELNGHDMRRN--LSTAWDTVGEYATDLFTKEAVQLI------E 260
+ H S Q G+DMR + + G +D +E I
Sbjct: 288 HSTHEKCFNSPQRDSPFSFCGYDMREAEVVEARQNLTG---SDFLLREICIRITVHGPLT 344
Query: 261 DQPVDKPLF--LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
P D+ ++ LY H ETI + R AMV LD ++
Sbjct: 345 TDPRDEKIYEELYERHR--------------NETIYK--------RTKMLAMVTGLDRAI 382
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G +I + ++ G +N+I F SDNG ++ YR +N P+RG KN L EGGV+
Sbjct: 383 GRIIESFKKYGKYKNTIFAFASDNGG-SLTYR--------ANNMPFRGGKNDLLEGGVRT 433
Query: 379 PAILWS---PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LN 434
+ L S P + R S + I+DW PTL A LP +DG + +
Sbjct: 434 LSFLHSSLIPNDKIGSRYS-NLFDITDWFPTLLDAGKCSQKNLPNQLDGKSHFENFFSAG 492
Query: 435 TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI---NIDEKKRTAAVRLDSWKLVLGTQENG 491
S R+ + LD R V + D + +T A+R+ +KL+ G +
Sbjct: 493 NASVRDEIVHHLDPLKKADAEVEDPRPFVKVYGFKYDIRMKT-AIRVGKYKLITGAEATK 551
Query: 492 T 492
T
Sbjct: 552 T 552
>gi|323452297|gb|EGB08171.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 889
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 159/587 (27%), Positives = 251/587 (42%), Gaps = 118/587 (20%)
Query: 109 IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPE 168
++ LA G+ L + YA P CT SRA+L TGKYP+H G+Q I +EPRGVPL E FL +
Sbjct: 1 MNKLASGGVTLTSFYASPTCTASRAALFTGKYPMHLGLQDSVIHPSEPRGVPLKETFLSQ 60
Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
L++ GY T +GKWHLGF + YTP RGF+ FG L G + H ++ ++ +
Sbjct: 61 KLKDAGYGTVFVGKWHLGFHQAAYTPSQRGFDEFFGILTGGGDHVAHASTESFA----MR 116
Query: 229 GHDMRRNLSTAWDT--------VGE--------YATDLFTKEAVQLIED--QPVDKPLFL 270
G D + ++T + VGE ++T L+T ++ + + D PLFL
Sbjct: 117 GPDYKSKITTMTGSNLWHNGRPVGEDEEGVRDTHSTHLYTAKSRAYVAKYAEMADVPLFL 176
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------RRTYAAMVKKLDDSV 318
+++ A HA ++ P E ++ Y RR MV +D+S+
Sbjct: 177 TVSYQAVHA-----PIQVPDEYVDGSAYANGCESLRAGSASATRLRRQLCGMVSMVDESL 231
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
++ L+ + M EN+++ F+SDNG R+ SN RG K + +EGG++V
Sbjct: 232 ASIHDDLKAEAMWENTLLFFLSDNGGI---------LRHGSSNLDLRGEKGSYYEGGIRV 282
Query: 379 PAIL---WSPQIQQNPRVSLQMMHISDWLPTLYTAAG------------------GDTSR 417
PA + + ++ RV ++HI+D T AG G R
Sbjct: 283 PAFVSGGYLAKLLSAGRVFEGLVHITDLHATALRVAGVSSKEDKRAAPAVAAAKVGHLRR 342
Query: 418 LPLNIDGLDQWSSLLLNTPSRRN--SNIDGLDQWSSLLLN---TPSRRNSVLINIDEKKR 472
L + L + S + ++DG+DQW + + S+R V NI
Sbjct: 343 LEPGQQPQSYAETALDDVLSHEHLTDSLDGVDQWDAFVSGASAAASQRTHVAHNI----- 397
Query: 473 TAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLP 532
N + G G R + L+ + ES+ Y+ Q + Q+
Sbjct: 398 -----------------NSELFGAAGALRVGEYKLIVEARVTESEIYEYGQHMLQDDNWD 440
Query: 533 ISNIDK------MRSTRQQATIHCGANPAPMTPSPCTNG-PCY-LFNLGNDPCEQNNIAS 584
S + + +R+ + ANP + C + C L++L + A
Sbjct: 441 QSELSQVIHQKLLRTPGTTSLFDVVANPNELDEEDCDDARSCRNLYDL-------DEFAD 493
Query: 585 SRPDISSQLYELLKYHRRTLVPQSHE--QPDLVQADPKRFNDTWSPW 629
+ I + EL R P+S E D ADP F W+PW
Sbjct: 494 VKATILEKWAEL-----RDSTPESTEIWLDDGPLADPALFGGVWTPW 535
>gi|241999506|ref|XP_002434396.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215497726|gb|EEC07220.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 458
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 152/500 (30%), Positives = 234/500 (46%), Gaps = 67/500 (13%)
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES-HF 203
GMQ I+ EPRG+ L + LP++L + GY+T IGK + + Y+ Y + H
Sbjct: 1 GMQHSIIFNEEPRGLSLRFKLLPQWLADRGYTTHMIGKVAMHLSLQHYS--YEQIKHLHC 58
Query: 204 GYLNGVISYYDHILSDQYSRTVEL---------NGHDMRRNLSTAWDTVGEYATDLFTKE 254
+LN L++ + R L G D RR+LS + + G Y T L T+E
Sbjct: 59 RHLNSK-------LAEVFFRPQTLVLLLQPGGDTGLDFRRDLSISPNESGTYITQLLTRE 111
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN-QFQYITDPNRRTYAAMVKK 313
A +I + P+DK + YLA + A + ++I F P MV
Sbjct: 112 AEAVIANHPIDK-VCHYLASDRETSVFALDTVVICTKSIAISFHKCLLP------GMVSA 164
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD+SVG+V +L +GML+N++ +F SDNGA T S N S++P++G K T WE
Sbjct: 165 LDESVGSVFESLNHRGMLDNTVFVFSSDNGADT-----DSANANTASSWPFKGQKYTPWE 219
Query: 374 GGVKVPAILWSPQIQQN-PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
GGV+ PA++WSP V M HISDWLPTLY AGG +
Sbjct: 220 GGVRAPAVIWSPLFSGLLGSVYNNMFHISDWLPTLYQLAGGLAEGMA------------- 266
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
+IDGL SL T R+ +LINID + +A+ +KLV G
Sbjct: 267 --------DDIDGLSHLHSLTGGTEPPRHELLINIDPVENYSAIIEGHFKLVKGRASGSA 318
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQS--LQQLSQNIFL-PISNIDKMRSTRQQATIH 549
D +Y + + +++ + K +S + ++ +NI L P+ + TR +
Sbjct: 319 FDKWYSIPGN-----VTWDSDMPRKQCESSVVVRVLRNIGLSPVCGAGR---TRYAVPVD 370
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSH 609
CG P S LF+L DPCE N+++ P++ +L + L ++ T + ++
Sbjct: 371 CGQR-DPREKSCAPERSACLFDLSKDPCEYNDVSGQHPEVVERLLKKLAGYKATALKPAN 429
Query: 610 EQPDLVQADPKRFNDTWSPW 629
PD +++P N+ W W
Sbjct: 430 -LPDDPRSNPGLHNNVWVAW 448
>gi|325109725|ref|YP_004270793.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969993|gb|ADY60771.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 471
Score = 184 bits (468), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 115/310 (37%), Positives = 165/310 (53%), Gaps = 35/310 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
GW+D+ F+G EIPTP++DALA +G+ + YA P C+PSRA L+TG+Y G + P
Sbjct: 39 GWSDVGFNGCKEIPTPHLDALAKSGVAFDCGYASHPYCSPSRAGLLTGRYQQRFGHECNP 98
Query: 151 IWGA-------EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
GA E G+PL+E L R GY T AIGKWHLG ++ P RGFE F
Sbjct: 99 --GAHGNDDAIEMEGLPLSETLLSTVFRNAGYRTGAIGKWHLG-DEPQFWPTERGFEEWF 155
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G+ G +SY+ + V NG + ++ T Y TD F+ EAV+ +E+
Sbjct: 156 GFSGGGLSYWGDLGKKPPLHGVLRNGDVVPKDELT-------YLTDDFSTEAVKFVEENR 208
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+P FLYLA+ A HA + + +E + + +I R Y AMV +D+ +G V+
Sbjct: 209 A-RPFFLYLAYNAPHAPD-----QVTREHLKKTSHIEYGGRAVYGAMVAAMDEGIGRVVQ 262
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
AL + + + ++I F SDNG R N+PYRG K L+EGG++VP +
Sbjct: 263 ALDDQNLRDETVIFFYSDNGGRATLAR----------NFPYRGHKGMLFEGGIRVPFCVS 312
Query: 383 WSPQIQQNPR 392
W QI+ R
Sbjct: 313 WPEQIEGGRR 322
>gi|313219878|emb|CBY30794.1| unnamed protein product [Oikopleura dioica]
Length = 481
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 120/404 (29%), Positives = 202/404 (50%), Gaps = 38/404 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G++DL + ++++ +PNID LA N + + N Y QP CTPSRA+ MTG+Y I GMQ I
Sbjct: 35 GFDDLGY-VNDDVISPNIDFLAKNALHIENYYNQPSCTPSRAAFMTGRYNIRYGMQSGVI 93
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP +PL+E LP+ L++ GY+T GKWHLGF+ ++ P RGF+ FG+ G
Sbjct: 94 KPDEPEAIPLSETLLPQALKKCGYNTSMHGKWHLGFYTEKHCPQNRGFDRFFGFYLGSQD 153
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + +R +L+ G Y+T ++ + +++ + PLF +
Sbjct: 154 YFYHDSGNNCYLYEPCYREKVRLDLN------GTYSTKAIAEDFIAKLDEYDPETPLFEF 207
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----RRTYAAMVKKLDDSVGTVISALQ 326
L+ H P + ++++ + R+ + + +D +G V+ L+
Sbjct: 208 LSFQEVH---------GPLQWLSEYSTLFKEKDWTHYRKMLSRKIVIVDHFIGEVVKKLK 258
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
KG +++I++ SDNG T R SN+P RG K ++EGG++ A + SP+
Sbjct: 259 EKGFWKDTILVITSDNGGQT---------REGASNWPLRGRKGDVFEGGIRSRAFIHSPK 309
Query: 387 IQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS-RRNSNI 443
+ + + S + H++DW PTL +G + N+DG Q + P +R +
Sbjct: 310 LPNSLKGSSFPHVFHVTDWFPTLLRFSG--CQQPDSNLDGEAQ--DIFHQKPEVKRTHLL 365
Query: 444 DGLDQWSSLLLNTPSRRNSVLIN-IDEKKRTAAVRLDSWKLVLG 486
+ LD + R SVL N E + +R ++WKL+ G
Sbjct: 366 NFLDPFKVAKRKLDDRMFSVLQNRTFEVTVKSVIRTENWKLITG 409
>gi|311748319|ref|ZP_07722104.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
gi|126576822|gb|EAZ81070.1| N-acetylgalactosamine-6-sulfate sulfatase [Algoriphagus sp. PR1]
Length = 472
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 125/367 (34%), Positives = 187/367 (50%), Gaps = 38/367 (10%)
Query: 81 KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
K L L++ G+ DL F GS +I TP++D LA NG+ Y + VC+PSRA +T
Sbjct: 33 KHYNLVLIVADDLGYGDLGFTGSTQIKTPHLDQLATNGVTFTQGYVSSAVCSPSRAGFIT 92
Query: 138 GKYPIHTGMQGPPIWGAEP------RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
G + G + G EP G+PL+++ + ++L +LGY IGKWHLG +
Sbjct: 93 GINQVEFGHDNN-LAGVEPGFDIAYNGMPLSQKTIADHLNKLGYVNGLIGKWHLGK-EPQ 150
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--EYATD 249
+ PL RGF+ +GY G Y++ + NG + L + + T Y TD
Sbjct: 151 FHPLKRGFDEFWGYTGGGHDYFESLP----------NGKGYKEPLESNFKTPDPITYITD 200
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
E+V IE D+P FL+ A A H ++A +E + +Q+I D RRTYAA
Sbjct: 201 DVGNESVDFIERHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRRTYAA 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV +LD +VG ++++L+ +G+ EN++++F SDNG PT N N PYRG K
Sbjct: 255 MVHRLDLNVGKIMTSLEEQGLSENTLVVFFSDNGGPTDS--------NASLNAPYRGQKG 306
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
L EGG+ VP ++ P + + + + D +PT + A GDT G+D
Sbjct: 307 ILLEGGIHVPFVMNLPGLLPEGLIYQEQVTSLDVVPT-FLALAGDTETSMDMFSGVDLIP 365
Query: 430 SLLLNTP 436
L TP
Sbjct: 366 HLTGKTP 372
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 26/73 (35%), Positives = 37/73 (50%), Gaps = 6/73 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD E+V IE D+P FL+ A A H ++A +E + +Q+I D RR
Sbjct: 197 YITDDVGNESVDFIERHK-DEPFFLFAAFNAPHT-----PMQALEEDLALYQHIEDKKRR 250
Query: 75 TYAALTKSTTLTL 87
TYAA+ L +
Sbjct: 251 TYAAMVHRLDLNV 263
>gi|417303628|ref|ZP_12090677.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
gi|327540049|gb|EGF26644.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
WH47]
Length = 826
Score = 184 bits (467), Expect = 1e-43, Method: Compositional matrix adjust.
Identities = 124/341 (36%), Positives = 181/341 (53%), Gaps = 35/341 (10%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
+LIV G++D+ F+G EIPTP++D LA +G++ N YA P C+PSRA L+TG++
Sbjct: 48 VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107
Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
G + P W G + G+PLTE L + L+E GY T AIGKWHLG + + P R
Sbjct: 108 RFGHESNPEPDTQWHGEDTPGMPLTETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNRR 166
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GF+ FG+ G SY+ D + L H + T + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGKKDPLLGVHRGDEPVDPKTLT---HLTDDFSTEAVK 219
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I+ + P FLYLA+ A HA + A + + + +I R Y AMV +D+
Sbjct: 220 FIQRHETE-PFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+G V+ ++ G+ EN++IIF SDNG R N+PYRG K L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323
Query: 378 VPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
VP ++ W ++ + L + + D PT AAG D SR
Sbjct: 324 VPFLVSWPGTVRSGMKEELPITAL-DLFPTALAAAGMDPSR 363
>gi|319951998|ref|YP_004163265.1| n-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
gi|319420658|gb|ADV47767.1| N-acetylgalactosamine-6-sulfatase [Cellulophaga algicola DSM 14237]
Length = 471
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 121/347 (34%), Positives = 178/347 (51%), Gaps = 37/347 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS E+ TPN+D LA +G+ Y C PSRA L+TGKY G +
Sbjct: 33 GFADFGFQGSTEMKTPNLDKLANSGVKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 92
Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G A+ G+PL + + +YL++LGY+T GKWHLG R + P+ RGF+
Sbjct: 93 VPGYMSENSKFLADDMGLPLDQLTIGDYLKKLGYNTAMYGKWHLGNADR-FHPMNRGFDE 151
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
+G+ G SY+ + ++ + M R ++ EY TD EA+ IE
Sbjct: 152 FYGFRGGARSYFGYDVASSAHHDTK-----MERGFGN-FEEPQEYVTDALADEAISFIEK 205
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ P F+YLA A H +EA +E +NQF +T R+ AAM LD + G V
Sbjct: 206 NKKN-PFFIYLAFNAVHT-----PMEATKEDLNQFPNLTG-KRKELAAMTLALDRACGRV 258
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ L++ G+ +N+II+F +DNG PT +N N P G K+ EGG++VP +
Sbjct: 259 LDKLKQLGLDKNTIIVFSNDNGGPTD--------KNASLNLPLSGTKSNHLEGGIRVPFL 310
Query: 382 L-WSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ W IQ P+ + + D LPT Y A GG+T L IDG++
Sbjct: 311 MSWPKHIQ--PKTTYTFPISTMDLLPTFYAAGGGNTKDLQ-EIDGIN 354
Score = 41.2 bits (95), Expect = 1.9, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 35/67 (52%), Gaps = 7/67 (10%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY TD EA+ IE + P F+YLA A H +EA +E +NQF +T R
Sbjct: 189 EYVTDALADEAISFIEKNKKN-PFFIYLAFNAVHT-----PMEATKEDLNQFPNLTG-KR 241
Query: 74 RTYAALT 80
+ AA+T
Sbjct: 242 KELAAMT 248
>gi|241601300|ref|XP_002405277.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215502504|gb|EEC11998.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 234
Score = 183 bits (465), Expect = 2e-43, Method: Compositional matrix adjust.
Identities = 95/234 (40%), Positives = 139/234 (59%), Gaps = 14/234 (5%)
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAW 240
+WHLG++ +Y P+ RGF++ G+ + + YY + D ++ GHD W
Sbjct: 12 QWHLGYYDSKYVPVNRGFDTFLGFYSPALDYYTQNFTFDNHT------GHDFWNGDKIYW 65
Query: 241 --DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
+Y+T +T++ VQLI+ PLFL L+H A H L+ P++ I F Y
Sbjct: 66 VDKEKKQYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSY 125
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
I + NR +A MV LD SVG+V+ ALQ+ ML ++I++F +DNGA ++ N
Sbjct: 126 IQEENRTLFAGMVDALDQSVGSVVDALQKADMLTDTILVFCTDNGALPWGFKS-----NR 180
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
GSN+P RG K TL+EGGV+ A +WSP + + RVS Q+MHI+DWLPTLY+AAG
Sbjct: 181 GSNWPLRGTKFTLYEGGVRSTAFIWSPLVNNSKRVSSQLMHITDWLPTLYSAAG 234
Score = 52.4 bits (124), Expect = 7e-04, Method: Compositional matrix adjust.
Identities = 24/69 (34%), Positives = 38/69 (55%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y+T +T++ VQLI+ PLFL L+H A H L+ P++ I F YI + NR
Sbjct: 72 QYSTHYYTRKTVQLIQVHNKSTPLFLLLSHQAPHTSGGPTLLQVPKKGIRDFSYIQEENR 131
Query: 74 RTYAALTKS 82
+A + +
Sbjct: 132 TLFAGMVDA 140
>gi|443734654|gb|ELU18562.1| hypothetical protein CAPTEDRAFT_195389, partial [Capitella teleta]
Length = 330
Score = 183 bits (464), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 114/342 (33%), Positives = 167/342 (48%), Gaps = 28/342 (8%)
Query: 50 NAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNI 109
NA + E +E F +I + A G+NDL + I TPN+
Sbjct: 8 NAHRSHEGRKEKKPNFVFILADDLGGMLASKLKNFQACSFSVGYNDLGLRDPDVI-TPNM 66
Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
DALA G+IL N Y Q VC+PSR +LMTG+YP + MQ I E + L +FLP+Y
Sbjct: 67 DALASKGVILTNNYVQAVCSPSRHALMTGRYPSASAMQSIVIQPMEAKCSGLKYKFLPQY 126
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
L++LGY IGKWHLG+ R E P RGF++ +G Y++H + Y
Sbjct: 127 LKDLGYKNHMIGKWHLGYCREECLPTSRGFDTFYGLYASSGDYWEHGIMGMY-------- 178
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
D ++ G +A DL + ++ ++ PLFLY A +H L+
Sbjct: 179 -DWHTEAGVDFEARGTHAQDLEIERLDKVFDEHDNKDPLFLYFAPQNSHT-----PLQPT 232
Query: 290 QETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
E IN + +D RR Y + LD VG ++ L M+ N+ +IF+SDNGA +
Sbjct: 233 DEFINLYPEDKFSDI-RRKYLGLTSGLDAMVGKIVDKLTENDMMNNTYVIFVSDNGADPI 291
Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
E G N P+RG K +L+EGG K + ++SP +++
Sbjct: 292 E----------GRNAPFRGGKGSLFEGGTKSSSFIYSPLLKK 323
>gi|372210445|ref|ZP_09498247.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 474
Score = 182 bits (463), Expect = 3e-43, Method: Compositional matrix adjust.
Identities = 122/359 (33%), Positives = 188/359 (52%), Gaps = 40/359 (11%)
Query: 81 KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
KS +LI+ GW D+ F+G+ +IPTPN++ALA +G+I + Y+ P C+PSRA L
Sbjct: 23 KSKQPNILIILSDDQGWGDVGFNGATDIPTPNLNALAKDGVIFSQGYSSHPYCSPSRAGL 82
Query: 136 MTGKYPIHTGMQGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
+TG+Y G + P + G+PL E + E L++ Y T AIGKWHLG +
Sbjct: 83 LTGRYQQKFGHENNPENEKQNEDTVIGLPLNELMISEVLQQNNYHTCAIGKWHLGNAHK- 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
+ P RGF+ FG+ G +Y+ V NG + N T Y TD F
Sbjct: 142 FLPNQRGFKDWFGFSGGGFNYWGKTTPKNKELGVMKNGKPVPENTLT-------YLTDDF 194
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+ +A+ I+ ++P F+YLA+ A HA ++A +E N +I + R YAAM
Sbjct: 195 SNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEYTNLVTHIENGERAAYAAM 249
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V +D +G VI L+ G+ N++I+F SDNG + + S+ PYRG K
Sbjct: 250 VAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGG----------HSHGASSAPYRGHKGM 299
Query: 371 LWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL--NIDGLD 426
L+EGG++ P ++ W +I+ N R + + D PT+ A +++P NIDG++
Sbjct: 300 LFEGGIREPFVISWPNKIKGN-RTYKEPIIALDIFPTILAAT---NTQVPASKNIDGVN 354
Score = 41.2 bits (95), Expect = 1.6, Method: Compositional matrix adjust.
Identities = 33/122 (27%), Positives = 54/122 (44%), Gaps = 23/122 (18%)
Query: 15 YATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
Y TD F+ +A+ I+ ++P F+YLA+ A HA ++A +E N +I + R
Sbjct: 189 YLTDDFSNQAINYIDQYSKTEQPFFMYLAYNAPHA-----PIQATKEYTNLVTHIENGER 243
Query: 74 RTYAALT--------------KSTTL---TLLIVYGWNDLSFHGSNEIPTPNIDALAYNG 116
YAA+ K T + TL++ Y N HG++ P + + G
Sbjct: 244 AAYAAMVAGMDAGIGKVIQKLKETGVYNNTLIVFYSDNGGHSHGASSAPYRGHKGMLFEG 303
Query: 117 II 118
I
Sbjct: 304 GI 305
>gi|372210513|ref|ZP_09498315.1| N-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 465
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 117/346 (33%), Positives = 181/346 (52%), Gaps = 38/346 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS + TPN+D LA +G+ Y + P C PSRA +MTGKY G +
Sbjct: 34 GYMDFGFQGSKVMKTPNLDKLAKSGVTFTQGYVSDPTCGPSRAGMMTGKYQARFGYEEIN 93
Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G + G+PL ++ + YL++LGY T GKWHLG R + PL RGF+
Sbjct: 94 VPGYMSSHSALKGDEMGLPLDQKLMSNYLKDLGYKTAVYGKWHLGNADR-FHPLNRGFDE 152
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
+G+ G SY+ + + R +E N ++ YATD+F ++AV IE
Sbjct: 153 FYGFRGGARSYFAYK-NPTGDRKMETN--------FGQYEEPNHYATDVFAEKAVHFIER 203
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ P F+YL+ A H +EA + + QF +T R+ AAM LD + G V
Sbjct: 204 NK-EHPFFIYLSFNAVHT-----PMEATEADLAQFPNLTG-KRQQLAAMTLALDRACGKV 256
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
++ L+ G+ N+I++F +DNG P+ +N +N P G K+ EGG++VP +
Sbjct: 257 LNKLKELGIDRNTIVVFSNDNGGPSD--------KNASNNKPLSGTKSNHLEGGIRVPFL 308
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ W +I++ R + ++ D LPT Y AAGGD L +I+G++
Sbjct: 309 MSWPKEIKKKSRFNYPVVTF-DLLPTFYAAAGGDLKDLK-DINGVN 352
>gi|410628682|ref|ZP_11339400.1| arylsulfatase B [Glaciecola mesophila KMM 241]
gi|410151686|dbj|GAC26169.1| arylsulfatase B [Glaciecola mesophila KMM 241]
Length = 510
Score = 182 bits (463), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 112/318 (35%), Positives = 173/318 (54%), Gaps = 28/318 (8%)
Query: 73 RRTYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
++T+ + K ++I+ G+ DL F GS EI TPNIDALA NG++ N Y P C
Sbjct: 46 KKTHKLVAKERPNIVVILADDLGYADLGFTGSKEIFTPNIDALANNGVVFKNGYVTHPYC 105
Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSRA L+TG+Y GM+ + G+P+ E + +++ GY T +GKWH+G
Sbjct: 106 GPSRAGLLTGRYQARFGMEVNAAHSPDDPYMGLPVEELTFAKRMQQAGYKTAVMGKWHMG 165
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH---ILSDQYSRTVELNGHDMRRNLSTAWDTV 243
+ P RGF+ FG+L G Y+ + S +YS + NG + N
Sbjct: 166 S-HPNFHPNNRGFDEFFGFLGGGHDYFPESVKVSSAEYSIALSRNGKPAQLN-------- 216
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
EY T +KEA + + ++P +Y+A+ A H+ L+A ++ + ++Q+I D +
Sbjct: 217 -EYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHSP-----LQATEQDLAKYQHIADLD 268
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
RRTYAAM+ +D VG +++AL++ L N++I F+SDNG V E +W +N P
Sbjct: 269 RRTYAAMIDSMDQGVGRIVAALKQSEKLHNTLIFFLSDNGG--VYPEEWMPDSDWANNAP 326
Query: 364 YRGVKNTLWEGGVKVPAI 381
+R K +L EGGV VP I
Sbjct: 327 FRRGKVSLTEGGVHVPFI 344
Score = 43.5 bits (101), Expect = 0.38, Method: Compositional matrix adjust.
Identities = 23/71 (32%), Positives = 42/71 (59%), Gaps = 7/71 (9%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
+ EY T +KEA + + ++P +Y+A+ A H+ L+A ++ + ++Q+I D
Sbjct: 215 LNEYLTTAISKEAARFVS--ATEQPFMMYVAYNAPHSP-----LQATEQDLAKYQHIADL 267
Query: 72 NRRTYAALTKS 82
+RRTYAA+ S
Sbjct: 268 DRRTYAAMIDS 278
>gi|421613320|ref|ZP_16054406.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495914|gb|EKK00487.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 826
Score = 182 bits (462), Expect = 4e-43, Method: Compositional matrix adjust.
Identities = 124/352 (35%), Positives = 181/352 (51%), Gaps = 33/352 (9%)
Query: 75 TYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
T A K +LIV G++D+ F+G EIPTP++D LA +G++ N YA P C+P
Sbjct: 36 TSVAGAKDRPNIVLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSP 95
Query: 131 SRASLMTGKYPIHTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
SRA L+TG++ G + P W G + G+PL+E L + L+E GY T AIGKWHL
Sbjct: 96 SRAGLLTGRHQQRFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHL 155
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G + + P +RGF+ FG+ G SY+ D + L H R
Sbjct: 156 GD-AKPFWPNHRGFDEWFGFSGGGFSYW----GDLGKKDPLLGVH---RGDEPVEPKTLT 207
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
+ TD F+ EAV+ I+ + P FLYLA+ A HA + A + + + +I R
Sbjct: 208 HLTDDFSTEAVKFIQRNETE-PFFLYLAYNAPHAPD-----HATRAHLQKTTHIEYGGRA 261
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
Y AMV +D+ +G V+ ++ G+ EN++IIF SDNG R N+PYR
Sbjct: 262 VYGAMVAGMDEGIGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYR 311
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
G K L+EGG++VP ++ P ++ + D PT AAG D S+
Sbjct: 312 GHKGMLFEGGIRVPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363
>gi|431796835|ref|YP_007223739.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430787600|gb|AGA77729.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 470
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 117/341 (34%), Positives = 181/341 (53%), Gaps = 30/341 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTG----- 145
G+ DLSF GS ++ TP+ID LA +G+ Y + VC+PSRA L+TG+ + G
Sbjct: 46 GYGDLSFTGSTQVKTPHIDELAASGVFFPEGYVSSAVCSPSRAGLLTGRNQVSFGYDNNL 105
Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
P + G+P+ + + ++L++LGY T +GKWHLG+ +++PL RGF+ +GY
Sbjct: 106 ANSQPGFDPAFLGLPVNVKTVGDHLKKLGYVTGLVGKWHLGY-EDQFSPLNRGFDEFWGY 164
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
L G Y++ + + R + ++ N T + Y TD E + I+ D
Sbjct: 165 LGGGHDYFE---ASEAKRGYKAK---IKCNYKTPQEIT--YITDDKGDECINFIQRHK-D 215
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+P FLY + A H ++A E + +Q+I D RRTYAAMV +LD +VG ++ A+
Sbjct: 216 EPFFLYASFNAPHT-----PMQATAEDLAIYQHIEDRKRRTYAAMVHRLDINVGRIVEAV 270
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ +LEN+I +F+SDNG P + N N PY G K TL EGG++VP + P
Sbjct: 271 EEADLLENTIFVFISDNGGPVM--------TNGSINAPYNGKKGTLLEGGIRVPFFISWP 322
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ +V + D PT AGG+ S+ DGL+
Sbjct: 323 GHLEPGKVFEHPVSSLDLTPTFVALAGGE-SKEEDQFDGLN 362
>gi|414068777|ref|ZP_11404774.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410808616|gb|EKS14585.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 480
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 130/427 (30%), Positives = 204/427 (47%), Gaps = 73/427 (17%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
+LIV G+ D+ F+GS +I TPNID LA +G ++ Y A P C PSRA+LMTG+YP
Sbjct: 33 ILIVADDLGYADVGFNGSKDIITPNIDDLAKSGTSFSDAYVAHPFCGPSRAALMTGRYPH 92
Query: 143 HTGMQ-GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
G Q P G+ GVP +F+ + L E Y T A+GKWH+G +Y P RGF+
Sbjct: 93 KIGSQFNLPTRGSN-VGVPTDAKFISKLLNENNYFTGALGKWHMGD-APQYHPNKRGFDE 150
Query: 202 HFGYLNGVISYYDHILSD--------------QYSRTVELNGHDMRRNLSTAWDTVGEYA 247
++G+L G +Y+ +Y +E NG +++ +Y
Sbjct: 151 YYGFLGGGHNYFPDQYQPQYKKQQAQGLKNIFEYITPLEHNGKEVKET---------QYI 201
Query: 248 TDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
TD ++EAV + D+ V+K P FLYLA+ A H L+A E + F I + +R+
Sbjct: 202 TDALSREAVNFV-DKAVNKKNPFFLYLAYNAPHVP-----LQAKDEDMAMFPNIKNKDRK 255
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
TYA MV +D VG ++ L++ +N++I+FMSDNG + +N+P +
Sbjct: 256 TYAGMVYAVDRGVGKLVETLKKNNQYDNTLIVFMSDNGGKLSK---------GANNFPLK 306
Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
K + EGG +VP + W + R S + + D PT AG
Sbjct: 307 AGKGSTQEGGFRVPMLFHWPKHVPAGKRFSHPVSAL-DLYPTFAALAGAKV--------- 356
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK--RTAAVRLDSWK 482
N ++DG + W + + N ++ + + +K AA+R++ WK
Sbjct: 357 -------------EENQHLDGTNMWPAFIKNENPHKDEPIYALRHRKGYSDAAIRMNQWK 403
Query: 483 LVLGTQE 489
+ Q+
Sbjct: 404 ALKVNQQ 410
Score = 43.1 bits (100), Expect = 0.41, Method: Compositional matrix adjust.
Identities = 26/69 (37%), Positives = 39/69 (56%), Gaps = 8/69 (11%)
Query: 14 EYATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 71
+Y TD ++EAV + D+ V+K P FLYLA+ A H L+A E + F I +
Sbjct: 199 QYITDALSREAVNFV-DKAVNKKNPFFLYLAYNAPHVP-----LQAKDEDMAMFPNIKNK 252
Query: 72 NRRTYAALT 80
+R+TYA +
Sbjct: 253 DRKTYAGMV 261
>gi|296121201|ref|YP_003628979.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296013541|gb|ADG66780.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 479
Score = 182 bits (462), Expect = 5e-43, Method: Compositional matrix adjust.
Identities = 135/367 (36%), Positives = 191/367 (52%), Gaps = 43/367 (11%)
Query: 80 TKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
+KS +L++ G+ DL G EIPTP++D LA +GI N Y + P C+PSRA
Sbjct: 33 SKSGRPNILVIMADDLGYADLGVQGGCEIPTPHLDQLAASGIRCTNAYVSAPYCSPSRAG 92
Query: 135 LMTGKYPIHTGMQGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+TGKY G + P G E + G+PL E + L+ GY T IGKWH GF +++
Sbjct: 93 FLTGKYQTRFGHEFNPHVGEEAKLGLPLEEVTIANLLQTEGYRTALIGKWHQGF-SKDHH 151
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
P RGF+ FG+L G +Y H + +R + HDM G YATDLFT
Sbjct: 152 PQSRGFDEFFGFLVGGHNYLLH--KEVKARFGTAHSHDMIYRGREVEPQEG-YATDLFTN 208
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE-APQETINQFQYITDPNRRTYAAMVK 312
EA++ + P +KP FLYL++ A H LE AP + + P RR Y +++
Sbjct: 209 EALRWMSGPP-NKPWFLYLSYNAVHT-----PLEIAPHLQKRIPESVKLPARRGYLSLLA 262
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNG----APTVEYRETSNYRNWGSNYPYRGVK 368
LDDS+G + L + G+ E ++IIF+SDNG AP + Y N G N+P RG K
Sbjct: 263 GLDDSIGRITQHLSQHGLREKTLIIFLSDNGGSGRAPILAY-------NSGLNHPLRGDK 315
Query: 369 NTLWEGGVKVPAIL-WSPQ-----IQQNPRVSLQMMHISDWLPTLYTAAGGDTSR---LP 419
EGG++VP + W Q I + P +SL D LPT+ A + ++ LP
Sbjct: 316 GQTLEGGIRVPFFVSWPGQLPARTIYEQPIISL------DLLPTVCQLAANNPAKPQPLP 369
Query: 420 LNIDGLD 426
IDG++
Sbjct: 370 QGIDGVN 376
>gi|326428223|gb|EGD73793.1| arylsulfatase [Salpingoeca sp. ATCC 50818]
Length = 697
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 126/402 (31%), Positives = 192/402 (47%), Gaps = 47/402 (11%)
Query: 101 SNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--PPIWGAEPRG 158
++E TP+ID LA NG +L Y C+P+R+SL++G+ PIH + PP+ G G
Sbjct: 94 NDEPLTPHIDELARNGTMLRRFYTYKYCSPTRSSLLSGRLPIHVNQENRPPPVPGG---G 150
Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILS 218
+ L E L+ GY+T +GKWH G + P++RGF S FGYL+G +Y I
Sbjct: 151 IDQNMTTLAEVLQRAGYATHQVGKWHCGMSSPDRLPVHRGFNSSFGYLSGAEHHYSQIRQ 210
Query: 219 DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
D+ R+ A+ G Y T ++ +EA ++I PLFLYLA H
Sbjct: 211 GYV---------DLWRDTQPAYGENGTYGTFMYAREAERIIHAHDPTTPLFLYLAFQNVH 261
Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
+ LE P ++++ + R+ AMV +D+++ V AL+R G+ ++++IIF
Sbjct: 262 S-----PLEVPDRFLDKYPDVDYRPRKHCLAMVSAVDEAINNVTLALRRAGLFDDTLIIF 316
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS--LQ 396
SDNG P + +NYP+RG K +EGGV+ A + + +N +
Sbjct: 317 SSDNGGP----------HDHANNYPFRGAKTADFEGGVRAVAFVSGGVVPKNMVGTNVHG 366
Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLL-LN 455
+MH+ DW TL AG D + + N P IDG D W L N
Sbjct: 367 LMHVCDWYSTLARLAGEDPT----------DHRAARSNLPP-----IDGFDMWPMLTGAN 411
Query: 456 TPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYY 497
+ S R V ++ +A+ +KLV G Q G G +
Sbjct: 412 STSPRVDVPLSGGIPVAASALISGQYKLVRGKQPYGFYPGVH 453
>gi|325106503|ref|YP_004276157.1| sulfatase [Pedobacter saltans DSM 12145]
gi|324975351|gb|ADY54335.1| sulfatase [Pedobacter saltans DSM 12145]
Length = 470
Score = 182 bits (461), Expect = 6e-43, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 177/330 (53%), Gaps = 28/330 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP- 149
G+ D +G EIPTPNIDALA NG + + Y + VC PSRA ++TG Y G +
Sbjct: 38 GYADFGCYGGKEIPTPNIDALAKNGTLFTDAYVSASVCAPSRAGILTGMYQQRFGFEHNI 97
Query: 150 ---PI--WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
P+ + G+ + + + ++ GY T AIGKWH G +Y PL RGF+ +G
Sbjct: 98 SELPVKPYTLNDVGMDPKIKTIGDQMKHNGYRTIAIGKWHQGDLP-QYFPLKRGFDEFYG 156
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
++ G S++ + S + L +D +T+G Y TD+FT +A+ +++
Sbjct: 157 FVGGHRSFFGYPGGKAPSHELALFDNDK----IVPENTIG-YLTDMFTDKAISFVKENK- 210
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
KP F+YLA+ A H + A +E +++F ITDP R+ YAAM+ LDD VG ++
Sbjct: 211 SKPFFMYLAYNAVHV-----PMNAKKELMDRFPNITDPGRKAYAAMMVSLDDGVGELVKT 265
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L+ + + +N++I+F++DNGA T N+ N RG+K + WEGG++V I+
Sbjct: 266 LKEQNLYDNTLIVFVNDNGAAT---------GNYADNGKLRGLKGSKWEGGIRVAYIMQY 316
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
P +V +M+ D +PT A G+
Sbjct: 317 PDQIPAGKVYSKMVSGLDIMPTAVGAGKGE 346
Score = 53.1 bits (126), Expect = 4e-04, Method: Compositional matrix adjust.
Identities = 28/73 (38%), Positives = 45/73 (61%), Gaps = 7/73 (9%)
Query: 10 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 69
+T+G Y TD+FT +A+ +++ KP F+YLA+ A H + A +E +++F IT
Sbjct: 189 NTIG-YLTDMFTDKAISFVKENK-SKPFFMYLAYNAVHV-----PMNAKKELMDRFPNIT 241
Query: 70 DPNRRTYAALTKS 82
DP R+ YAA+ S
Sbjct: 242 DPGRKAYAAMMVS 254
>gi|196231555|ref|ZP_03130413.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224408|gb|EDY18920.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 467
Score = 181 bits (458), Expect = 1e-42, Method: Compositional matrix adjust.
Identities = 140/447 (31%), Positives = 206/447 (46%), Gaps = 64/447 (14%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
A L + L GW D+ FH N +PTPN+D LA G+ L Y PVC+P+R + +
Sbjct: 37 APLRPNFIFILADDLGWGDVGFHHGN-VPTPNLDHLAGEGLELMQHYVYPVCSPTRCAFL 95
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+G+Y + P PR L L+ +GY T GKWHLG + E+ P
Sbjct: 96 SGRYASRFSVTTP----QNPRAFRWDTVTLARALKSVGYDTALCGKWHLGS-KPEWGPQK 150
Query: 197 RGFESHFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
GF+ +G L G + +DH ++++T +G + + TDL TKEA
Sbjct: 151 FGFDHSYGSLAGGVGPWDHHYKIGEFTQTWHRDGKLIEEQ---------GHVTDLITKEA 201
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKL 314
V+ +E + DKP FLY+ A H + P E + + IT P+ R Y A V L
Sbjct: 202 VEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSLRHYGANVMHL 255
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRE----TSNY---RNWGSNYPYRG 366
DDSVG ++ AL++ G N+++IF SDNGA P VE + +Y GSN P G
Sbjct: 256 DDSVGKILVALEKTGKAGNTLVIFGSDNGAIPGVENNDPLYPPDHYPPGPAGGSNEPLHG 315
Query: 367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+K ++EGG+ A+ W Q++ P L + HI+DW+PT AG
Sbjct: 316 MKGEVYEGGIHTAAVARWPGQLK--PGKFLGLAHITDWMPTFCALAG------------- 360
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
++ DG + W L P + ++ + R+ A+R WKLVL
Sbjct: 361 ---------YKPEKDLKWDGQNIWPQLTGAEPVKPRTIYV-AGPGFRSKALRDGDWKLVL 410
Query: 486 GTQENGTMDGYYGQTRSNKVPLLNFNA 512
+Q G+ + KV L N A
Sbjct: 411 -SQTKGS------KNSPPKVELFNIGA 430
Score = 39.7 bits (91), Expect = 5.2, Method: Compositional matrix adjust.
Identities = 25/65 (38%), Positives = 34/65 (52%), Gaps = 7/65 (10%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNR 73
+ TDL TKEAV+ +E + DKP FLY+ A H + P E + + IT P+
Sbjct: 192 HVTDLITKEAVEWLESR-TDKPFFLYVPFTAVHI-----PIREPDEILQRVPASITKPSL 245
Query: 74 RTYAA 78
R Y A
Sbjct: 246 RHYGA 250
>gi|374619563|ref|ZP_09692097.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
gi|374302790|gb|EHQ56974.1| arylsulfatase A family protein [gamma proteobacterium HIMB55]
Length = 539
Score = 181 bits (458), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 123/396 (31%), Positives = 184/396 (46%), Gaps = 61/396 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ FHG+ I TP++D +A G LN Y P+C+P+RA+LMTG+ PI G+ I
Sbjct: 37 GWADVGFHGNQIIETPSLDRIAAEGTQLNRFYTTPICSPTRAALMTGRDPIRLGVAYSTI 96
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ E FLPE GY T +GKWHLG ++ Y P RGFE +G+L+ +
Sbjct: 97 MPWHNNGIHPEETFLPELFAGAGYQTAMVGKWHLGHAQQTYHPNARGFEHFYGHLHTEVG 156
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
++ S L G D +RN + D G Y + L E + I ++ KP F+Y
Sbjct: 157 FFPPFAS--------LGGKDFQRN-GVSIDDQG-YESYLLADEVSRYIRERDAAKPFFIY 206
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYITDPN-----------------------RRTYA 308
+ +A H L+AP++ ++ + D R YA
Sbjct: 207 MPFIAPHT-----PLDAPEDLKAKYADMEDDRGKSRSKMADNTRFMAKVTRRESARPMYA 261
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
A+V +D S+G V+ L +G+ +N+I++F SDNG N P RG K
Sbjct: 262 AVVDGMDQSIGRVLDTLDAEGLTDNTIVLFFSDNGGAVYAI-------GGADNAPLRGGK 314
Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR---------L 418
+EGG++V A + W +I +V +M + D PTL AAG + + L
Sbjct: 315 GDTFEGGIRVVATMRWPEKIAPGGKVD-SIMSVMDVFPTLLAAAGIEPATRHRLDGRNLL 373
Query: 419 PLNIDGLDQWSSLLL----NTPSRRNSNIDGL-DQW 449
P +DG D LL +P + N+ D+W
Sbjct: 374 PAIVDGEDVAREDLLFFIAESPFKNTVNVTAFNDEW 409
>gi|392966318|ref|ZP_10331737.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387845382|emb|CCH53783.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 461
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 124/351 (35%), Positives = 182/351 (51%), Gaps = 55/351 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DLS GS ++ TP+ID+L G+ N YA VC+PSRA+L+TG+YP G+ G
Sbjct: 47 GYGDLSCFGSTDLKTPHIDSLIGAGMRFTNFYANSSVCSPSRAALLTGRYPERVGVPGVI 106
Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
WG + + LP YLR+ GY + IGKWHLG P RGF+ +G
Sbjct: 107 RDEVQDSWGY----LASSATLLPTYLRKQGYHSANIGKWHLGL-ESPNLPNERGFQEFYG 161
Query: 205 YLNGVISYYDHILSDQYSRTVEL-NGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
L G++ Y V+L +G + R+ D G +ATD+FT AV+ + D+
Sbjct: 162 LLEGMMDDY----------VVKLRHGQNFLRHNGQVIDPPG-HATDVFTDAAVRYLNDRK 210
Query: 264 VDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPNRRTYAAMVKKLDDSVG 319
K P FLYLA+ A H L+ P +N+ Q DP R A+++ LDD++G
Sbjct: 211 AKKDPFFLYLAYTAPH-----DPLQPPVAYLNRVQKRQPGIDPQRAKLVALIEHLDDNIG 265
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG----SNYPYRGVKNTLWEGG 375
V++ L+ G L N++++F SDNG WG +N +RGVK +EGG
Sbjct: 266 RVLATLRANGQLANTLVVFTSDNGG-------------WGPGKANNGQFRGVKGQFYEGG 312
Query: 376 VKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+++P +WS I+ S ++ + DW PTL AAG + +P I+GL
Sbjct: 313 IRIPTGAMWSGHIEAGSE-SGALLQLMDWFPTLLQAAG---ASVPDGINGL 359
>gi|294053769|ref|YP_003547427.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613102|gb|ADE53257.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 490
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 118/358 (32%), Positives = 181/358 (50%), Gaps = 35/358 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL F+GS + TP +D LA+NG+I N Y P C PSRA L+TG+Y GM+
Sbjct: 34 GYGDLGFNGSKVVETPTLDRLAHNGVIFQNGYVTHPYCGPSRAGLITGRYQARFGMEINL 93
Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ +G+PL E+ E +R GY T IGKWHLG + P RGF+ +G+L+G
Sbjct: 94 TYSPFDLHQGLPLDEKTFAERVRPAGYRTGMIGKWHLGA-SEPFHPNSRGFDYFYGFLSG 152
Query: 209 VISYYDHILSDQYSRTVELNG-----------HDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
Y+ ++ + + + + NG + RN + A EY T +K+A +
Sbjct: 153 GHDYFPEMV-NTHPKLLLPNGKPHYSHNEGCTQPLLRNKNAA--EFNEYLTTALSKDAAR 209
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAMVKKLDD 316
+ D +KP LYLA+ A H L AP+E + ++ + RR TY AM+ +D
Sbjct: 210 FVSDS--EKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKRRATYLAMIDSMDQ 262
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+G ++ AL++ G L+N++I FMSDNG V+ + Y W N P+R K ++ EGG
Sbjct: 263 GIGLIVEALEQSGKLDNTLIFFMSDNGG--VQSKPGHEYETWADNGPFRNGKGSMREGGS 320
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-------IDGLDQ 427
VP I P+ + D T +GGD + PL+ ++G+D+
Sbjct: 321 HVPFIAHWPKGFPQGITYPHPVSSLDLTATAVELSGGDATGKPLDGVNLTPFVNGIDK 378
Score = 38.9 bits (89), Expect = 8.9, Method: Compositional matrix adjust.
Identities = 24/78 (30%), Positives = 39/78 (50%), Gaps = 8/78 (10%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY T +K+A + + D +KP LYLA+ A H L AP+E + ++ + R
Sbjct: 197 EYLTTALSKDAARFVSDS--EKPFLLYLAYNAPHG-----PLHAPKELVEKYSKLEKDKR 249
Query: 74 R-TYAALTKSTTLTLLIV 90
R TY A+ S + ++
Sbjct: 250 RATYLAMIDSMDQGIGLI 267
>gi|440715767|ref|ZP_20896296.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436439253|gb|ELP32723.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 826
Score = 180 bits (457), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 120/340 (35%), Positives = 179/340 (52%), Gaps = 33/340 (9%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
+LIV G++D+ F+G EIPTP++D LA +G++ N YA P C+PSRA L+TG++
Sbjct: 48 VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107
Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
G + P W G + G+PL+E L + L+E GY T AIGKWHLG + + P +R
Sbjct: 108 RFGHESNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLG-DAKPFWPNHR 166
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GF+ FG+ G SY+ D + L H + T + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGKKDPLLGVHRGDEPVDPKTLT---HLTDDFSTEAVK 219
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I+ +P FLYLA+ A HA + A + + + +I R Y AMV +D+
Sbjct: 220 FIQRHE-SEPFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+G V+ ++ G+ EN+++IF SDNG R N+PYRG K L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMVIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
VP ++ P ++ + D PT AAG D S+
Sbjct: 324 VPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363
>gi|294053770|ref|YP_003547428.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613103|gb|ADE53258.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 491
Score = 180 bits (456), Expect = 2e-42, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 191/364 (52%), Gaps = 34/364 (9%)
Query: 77 AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
A L+ +LI+ G++DL + GS EI +P ID LA NG+I N Y P C PS
Sbjct: 18 AVLSADERPNILIILADDLGYSDLGYTGSTEIESPVIDKLANNGVIFANGYVTHPYCGPS 77
Query: 132 RASLMTGKYPIHTGMQGPPIWGA--EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
RA L+TG++ GM+ + + G+P+ E + L+ GY T IGKWHLG
Sbjct: 78 RAGLITGRHQARFGMEINATYSPFDQHMGLPVDEPTFAKRLQPAGYRTGIIGKWHLGA-A 136
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-----------MRRNLST 238
++ P RGF+ +G+L+G Y+ + + + V NG + RN +
Sbjct: 137 PQFHPNNRGFDYFYGFLSGGHDYFPESV-NTHLELVLPNGKPNYGANEGTLLPLLRNKNA 195
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
A +Y T +K+A + + ++P LYLA+ A H L+AP+ETI ++ +
Sbjct: 196 A--EFDDYLTTALSKDAARFVTSS--EQPFCLYLAYNAPHT-----PLQAPKETIAKYSH 246
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
I DP RR YAAM+ ++D +G V+ AL++ G L+N++I F+SDNG + ++ Y ++
Sbjct: 247 IKDPKRRIYAAMIDEMDAGIGLVVDALEQSGKLDNTLIFFLSDNGGCVPQAWDS--YSDF 304
Query: 359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
N P++ K + +EGG VP I W +++ P+ ++ D T AGGD S
Sbjct: 305 ADNGPFKDGKTSFYEGGTHVPFIAHWPAKLK--PQTYDNIISSLDLAATAVALAGGDDSG 362
Query: 418 LPLN 421
PL+
Sbjct: 363 KPLD 366
Score = 47.0 bits (110), Expect = 0.030, Method: Compositional matrix adjust.
Identities = 26/77 (33%), Positives = 42/77 (54%), Gaps = 7/77 (9%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y T +K+A + + ++P LYLA+ A H L+AP+ETI ++ +I DP R
Sbjct: 200 DYLTTALSKDAARFVTSS--EQPFCLYLAYNAPHT-----PLQAPKETIAKYSHIKDPKR 252
Query: 74 RTYAALTKSTTLTLLIV 90
R YAA+ + +V
Sbjct: 253 RIYAAMIDEMDAGIGLV 269
>gi|403182689|gb|EJY57565.1| AAEL017303-PA [Aedes aegypti]
Length = 176
Score = 180 bits (456), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 81/126 (64%), Positives = 98/126 (77%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTPNIDALAY GIILN Y P+CTPSRASLM+GK+PI+ GMQ I
Sbjct: 41 GWNDVSFHGSSQIPTPNIDALAYQGIILNRHYTPPLCTPSRASLMSGKHPINVGMQHHVI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ L ++ +PEY RE GY T+ +GKWHLGFFR+ YTP RGF+SHFGY+ I
Sbjct: 101 ESNEPWGLGLDQKLMPEYFREAGYRTRLVGKWHLGFFRKAYTPTRRGFDSHFGYIGPYID 160
Query: 212 YYDHIL 217
Y+DH L
Sbjct: 161 YWDHSL 166
>gi|340368073|ref|XP_003382577.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 157/569 (27%), Positives = 252/569 (44%), Gaps = 110/569 (19%)
Query: 84 TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
L L+ +GW ++ +H + E+ TPNID L G+ L+ YA C+PSR+ LM+G+
Sbjct: 26 VLMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELDQHYAYKFCSPSRSCLMSGRL 85
Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
PIH P+ G +P + E ++E GY+T +GKW G +
Sbjct: 86 PIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIAEKMKEAGYATHQVGKWDAGMATPD 143
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRT--VEL--NGHDMRRNLSTAWDTVGEYA 247
+TP RGF++ FGY + YY ++ Q S + V+L H T D +Y
Sbjct: 144 HTPKGRGFDTSFGYYHHDNDYYTEVVGPQCSGSPIVDLWDTDHPAHGINGTGPD---KYE 200
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
LF + + ++ + PLFLY A AH L+ P + +N+F +I D +R+ Y
Sbjct: 201 EGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDRKYY 255
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
AMV LDD VG +++AL++KGM +N + + SDNG P Y G+ NYP +G
Sbjct: 256 HAMVNYLDDVVGDLVAALKQKGMWDNLLYVTSSDNGGPI--------YAGGGANNYPLKG 307
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
K T W+GG++V A + + + R + +H++DW T AG +D
Sbjct: 308 GKLTDWQGGIRVNAFVSGGYLPEKMRGQKTDGYVHLADWYGTFCAIAG---------VDP 358
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRTAAVRLDSWKL 483
D+ ++ P +D + W + T S R + +I+ A+ +K+
Sbjct: 359 TDEKAAKAKLPP------VDSYNMWPFISGETDTSPRTDIPASIN------ALISGDYKI 406
Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
++GT + G+ G N+ P I
Sbjct: 407 LIGTVDQA---GWTGPQYPNQTN-------------------------PAGGI------- 431
Query: 544 QQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRT 603
+AT CG S C L+N+ +DP E++N+ASS P+ + E LK + T
Sbjct: 432 -KATERCGE-------SGC------LYNIKDDPEERHNLASSMPEQLKTMQEKLKKYLAT 477
Query: 604 LV--PQSHEQPDLVQADPKRFNDTWSPWI 630
+ H P QA ++ W P++
Sbjct: 478 YFNPDRGHVSPLACQAAVNKYGGFWGPFV 506
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 20/67 (29%), Positives = 34/67 (50%), Gaps = 5/67 (7%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y LF + + ++ + PLFLY A AH L+ P + +N+F +I D +R
Sbjct: 198 KYEEGLFKERLMDVVSKHDPNTPLFLYYAPHIAHT-----PLQVPDDYLNKFSFIDDSDR 252
Query: 74 RTYAALT 80
+ Y A+
Sbjct: 253 KYYHAMV 259
>gi|149178145|ref|ZP_01856740.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148843065|gb|EDL57433.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 460
Score = 179 bits (455), Expect = 3e-42, Method: Compositional matrix adjust.
Identities = 129/378 (34%), Positives = 191/378 (50%), Gaps = 60/378 (15%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
+LI++ G ND+ +GS EIPTP+ID LA G++ Y A +CTPSR ++TG+ P
Sbjct: 30 ILIIFTDDQGINDVGCYGS-EIPTPHIDQLAKEGLLFRQYYSASAICTPSRFGILTGRNP 88
Query: 142 IHTGMQGPPIWGA--------EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+ Q + GA + RG+ E + + L++ GY T +GKWHLG +
Sbjct: 89 TRSQDQ---LLGALMFMSDIDQNRGIQPGETTIADVLQQNGYQTALLGKWHLGHGTESFL 145
Query: 194 PLYRGFESHFGYLNGVISYYDHI---LSDQY--SRTVELNGHDMRRNLSTAWDTVGEYAT 248
P GF+ G+ G I Y+ + D Y R V NG YAT
Sbjct: 146 PTAHGFDLFRGHTGGCIDYFTMTYGNIPDWYHNQRHVSENG----------------YAT 189
Query: 249 DLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNA---GKH-----LEAPQETINQFQYI 299
DL T+EA ++DQ DKP FL+L++ A H G G ++A + + + I
Sbjct: 190 DLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRVGTI 249
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
D RR +AAM LDD +G V+S+L+ G+ +N+++IFM+D+G +Y G
Sbjct: 250 KDKVRREFAAMTVSLDDGIGRVMSSLKNNGLDQNTLVIFMTDHGG---------DYVYGG 300
Query: 360 SNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
+N P+RG K TL+EGG++VP I+ W +I+ + ++ D PT+ A DT L
Sbjct: 301 NNQPFRGAKATLFEGGIRVPCIIRWPGKIKAGTETN-EVAWALDLFPTICHFANVDTDGL 359
Query: 419 PLNIDGLDQWSSLLLNTP 436
L DG D L TP
Sbjct: 360 TL--DGKDISGLLTRQTP 375
Score = 46.6 bits (109), Expect = 0.041, Method: Compositional matrix adjust.
Identities = 29/77 (37%), Positives = 42/77 (54%), Gaps = 9/77 (11%)
Query: 15 YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNA---GKH-----LEAPQETINQF 65
YATDL T+EA ++DQ DKP FL+L++ A H G G ++A + + +
Sbjct: 187 YATDLITEEAEHFLKDQQTTDKPFFLFLSYNAPHFGKGWSPGDQSPVNIMQARGDDLKRV 246
Query: 66 QYITDPNRRTYAALTKS 82
I D RR +AA+T S
Sbjct: 247 GTIKDKVRREFAAMTVS 263
>gi|440713713|ref|ZP_20894310.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436441429|gb|ELP34656.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 472
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 146/465 (31%), Positives = 215/465 (46%), Gaps = 98/465 (21%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPNID LA + L+ Y P+C+P+RA ++TG YP G G +
Sbjct: 40 GWNDVGFHGS-EIRTPNIDRLANESVTLDRFYVTPICSPTRAGVLTGLYPFRFGFWGGVV 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G+P PE+L +LGY +A+ GKWHLG + PL G +G+ NG I
Sbjct: 99 SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLQHGMTEFYGHYNGAI 158
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ R +L+ H RN +D+V E Y+T+L V I+ P+
Sbjct: 159 DYFSR------ERFGQLDWH---RN----FDSVHEEGYSTELVGNAVVDFIDRNANAGPV 205
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
+ Y+A A H+ L+A + ++++ + DPN
Sbjct: 206 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLAPNTDRKIAKREKALDYGKRGK 258
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
R+T+AAM +D +G ++ A+ R GM EN++++F SDNGA + ++ G
Sbjct: 259 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 309
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN P RG K T WEGGV+V A++ P + D LPT+ AAGG P
Sbjct: 310 SNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPTMVAAAGGPP---P 366
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
DG++ LL S + S PS R ++L++ + V D
Sbjct: 367 EETDGIN-----LLPFLSGKAS---------------PSER-TILLDAE------TVVSD 399
Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQ 524
WKL G + T D + PL N V S+ Q+LQ+
Sbjct: 400 RWKLKTGESFDLTND------PNETTPLEVANQNVRSRLEQALQR 438
>gi|156340112|ref|XP_001620356.1| hypothetical protein NEMVEDRAFT_v1g148421 [Nematostella vectensis]
gi|156205165|gb|EDO28256.1| predicted protein [Nematostella vectensis]
Length = 260
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 100/257 (38%), Positives = 144/257 (56%), Gaps = 15/257 (5%)
Query: 74 RTYAALTKSTTLTLLIVYGWNDLSFHG-SNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
RT A L+ GW+D+ +H S+ + TPNID LA G+ L + Y+QP+CTPSR
Sbjct: 18 RTSATKPPHILFILMDDLGWSDVGYHNISHAVKTPNIDKLASQGVKLMSYYSQPMCTPSR 77
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+LMTGKYPIH GMQ I P G+P +P+ LR LGY T IGKWHLGFF +Y
Sbjct: 78 GALMTGKYPIHLGMQHFVINITSPWGMPRRFPTIPQKLRTLGYRTSMIGKWHLGFFDWDY 137
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
TPL RGF+S G+ G ++ H ++ D RR+ A + G+++TD+FT
Sbjct: 138 TPLRRGFDSFLGFFAGEQDHWRH---------SKMGFLDFRRDEEPANEYGGQHSTDVFT 188
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+EA+ + +PLFL L++ A H L+A +N+ ++D +R+ Y M+
Sbjct: 189 QEAINIAMRHNASQPLFLLLSYAAVHTP-----LQAHPNDVNKIGGVSDKDRQNYLGMMG 243
Query: 313 KLDDSVGTVISALQRKG 329
D S+G +I +R G
Sbjct: 244 AADWSIGRLIDVYKRNG 260
Score = 43.9 bits (102), Expect = 0.26, Method: Compositional matrix adjust.
Identities = 23/87 (26%), Positives = 46/87 (52%), Gaps = 5/87 (5%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
RR+ A + G+++TD+FT+EA+ + +PLFL L++ A H L+A
Sbjct: 168 FRRDEEPANEYGGQHSTDVFTQEAINIAMRHNASQPLFLLLSYAAVHTP-----LQAHPN 222
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTL 87
+N+ ++D +R+ Y + + ++
Sbjct: 223 DVNKIGGVSDKDRQNYLGMMGAADWSI 249
>gi|119504674|ref|ZP_01626753.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
gi|119459696|gb|EAW40792.1| arylsulfatase B precursor [marine gamma proteobacterium HTCC2080]
Length = 545
Score = 179 bits (454), Expect = 4e-42, Method: Compositional matrix adjust.
Identities = 118/363 (32%), Positives = 183/363 (50%), Gaps = 47/363 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ +HG +I TP++D LA G+ LN Y P+C+P+RA+LMTG+ PI G+ I
Sbjct: 44 GWADVGYHGG-DIDTPSLDRLAQQGVRLNRFYTTPICSPTRAALMTGRDPIRLGVTYGVI 102
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ + GV E F+PE + GY T IGKWHLG + Y P RGFE +G+L+ +
Sbjct: 103 FPWDNIGVHPDEHFMPETFQAAGYQTAIIGKWHLGHAQMTYHPNNRGFEHFYGHLHTEVG 162
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
+Y S+Q G D +RN + D G Y T L E + I ++ D+P +Y
Sbjct: 163 FYPP-FSNQ-------GGKDFQRN-GVSIDDQG-YETYLLADEVSRYIRERDRDRPFLVY 212
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYI-----------TDPNRRT------------YA 308
+ +A H L+AP E ++++ I TD R YA
Sbjct: 213 MPFIAPHT-----PLDAPVELQDKYKDIETDLPMARSRQTDDTRLISRVMLQPSARPMYA 267
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
A+V +D ++G V+ L ++G+ +N+I++F SDNG Y +N P RG K
Sbjct: 268 AVVDAMDQAIGRVLDTLDQEGISDNTIVLFFSDNGGAAYSY-------GGANNAPLRGGK 320
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
+EGG++V +++ P + + ++ Q+M + D PTL AA +DG W
Sbjct: 321 GETFEGGIRVTSLMRWPAMLEPGQIFEQIMSVMDVFPTLVDAADVRPGN-NFALDGRSMW 379
Query: 429 SSL 431
++L
Sbjct: 380 TAL 382
>gi|294053962|ref|YP_003547620.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613295|gb|ADE53450.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 494
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 123/346 (35%), Positives = 170/346 (49%), Gaps = 33/346 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ-GP 149
G+ D+ F GS EI TP +D LA G+I NN Y C PSRA L+TG+Y G++
Sbjct: 34 GYADVGFTGSTEIQTPVLDRLAAGGVIFNNGYVTHAYCGPSRAGLITGRYQARFGVEVNF 93
Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P +P G+P E+ L++ GY T IGKWHLG Y P RGF+ +G+L G
Sbjct: 94 PYAPFDPHSGLPTDEKTFATRLKQSGYRTAMIGKWHLGA-AYPYHPNNRGFDYFYGFLGG 152
Query: 209 VISYYDHILSDQYSRTVELNGHD------------MRRNLSTAWDTVGEYATDLFTKEAV 256
Y S T+E NG MR N++ +D EY T +++A
Sbjct: 153 AHDYMPENTSTTVPLTLE-NGKVNHMANAGSYLPLMRNNVNAEFD---EYLTTALSRDAA 208
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+ IE + P +YL++ A H L+AP+ I ++ +I RRTYAAM+ +D
Sbjct: 209 RFIEK--TEGPFCVYLSYNAPHT-----PLQAPKALIEKYAHIESQKRRTYAAMIDSMDQ 261
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY--RNWGSNYPYRGVKNTLWEG 374
+G V+ AL++ G LEN++I F SDNG P R Y ++ N PYRG K + EG
Sbjct: 262 GIGRVVDALEKSGKLENTLIFFFSDNGGP----RPKPGYEGEDFADNTPYRGGKGAMLEG 317
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
G VP I + P+ + D T GDTS PL
Sbjct: 318 GTHVPFIAYWPKGIPAGSTYDYPVSALDVAATFVELGEGDTSGHPL 363
Score = 44.7 bits (104), Expect = 0.16, Method: Compositional matrix adjust.
Identities = 27/82 (32%), Positives = 44/82 (53%), Gaps = 10/82 (12%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
MR N++ +D EY T +++A + IE + P +YL++ A H L+AP+
Sbjct: 187 MRNNVNAEFD---EYLTTALSRDAARFIEK--TEGPFCVYLSYNAPHT-----PLQAPKA 236
Query: 61 TINQFQYITDPNRRTYAALTKS 82
I ++ +I RRTYAA+ S
Sbjct: 237 LIEKYAHIESQKRRTYAAMIDS 258
>gi|392390175|ref|YP_006426778.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390521253|gb|AFL96984.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 467
Score = 177 bits (450), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/366 (33%), Positives = 196/366 (53%), Gaps = 41/366 (11%)
Query: 76 YAALTKSTTLTLLIVY-----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
Y ++TT +IV G+ D +G+ EIPTPNI+ LA G + + Y + VC
Sbjct: 18 YTTQAQTTTKPNIIVILSDDAGYADFECYGNKEIPTPNINRLAKEGTLFSKAYVSASVCA 77
Query: 130 PSRASLMTGKYPIHTGMQGPPI------WGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
PSRA L+TG+Y G + P + E G+ L+E+ + + ++E GY T A+GKW
Sbjct: 78 PSRAGLLTGRYQQRFGFENNPTGKPREGFKKEDMGLALSEKTIGDRMKEEGYRTLAVGKW 137
Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
HLG ++ PL RGF+ +G+ G ++ S + R + D + + T
Sbjct: 138 HLGN-DAKFFPLKRGFDEFYGFQEGHRDFF----SFKKKRAEKYALWDNDKIIPEEEIT- 191
Query: 244 GEYATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
Y TD+FT +A++ I D+ DK P F+YLA+ A H L+A + +++F +
Sbjct: 192 --YLTDMFTDKALKFI-DENADKKQPFFIYLAYNAVHT-----PLQAKKNDLDKFAQVGS 243
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
R+TY AM+ +D ++G V+ L+ KG+ +N+++IF++DNG T N+ N
Sbjct: 244 EGRQTYDAMLSNMDYNIGRVMQELKNKGIDDNTLVIFLNDNGGATT---------NYSDN 294
Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
RG+K ++WEGGV+V I+ W+ +I N V + + D LPT AAG +R
Sbjct: 295 GQLRGMKGSVWEGGVRVGYIMRWNGKIPANV-VYDKAVSSLDILPTSLAAAG--NTRKDK 351
Query: 421 NIDGLD 426
++DG++
Sbjct: 352 HLDGVN 357
Score = 42.4 bits (98), Expect = 0.70, Method: Compositional matrix adjust.
Identities = 24/67 (35%), Positives = 39/67 (58%), Gaps = 8/67 (11%)
Query: 15 YATDLFTKEAVQLIEDQPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
Y TD+FT +A++ I D+ DK P F+YLA+ A H L+A + +++F +
Sbjct: 192 YLTDMFTDKALKFI-DENADKKQPFFIYLAYNAVHT-----PLQAKKNDLDKFAQVGSEG 245
Query: 73 RRTYAAL 79
R+TY A+
Sbjct: 246 RQTYDAM 252
>gi|32473617|ref|NP_866611.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
gi|32398297|emb|CAD78392.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Rhodopirellula baltica
SH 1]
Length = 543
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 121/340 (35%), Positives = 176/340 (51%), Gaps = 33/340 (9%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
+LIV G++D+ F+G EIPTP++D LA +G++ N YA P C+PSRA L+TG++
Sbjct: 48 VLIVADDLGYSDVGFNGCKEIPTPHLDELAASGVVFTNGYASHPYCSPSRAGLLTGRHQQ 107
Query: 143 HTGMQGPP----IW-GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
G P W G + G+PL+E L + L+E GY T AIGKWHLG + + P R
Sbjct: 108 RFGHGSNPEPDTQWHGEDTPGMPLSETTLADALKEAGYVTGAIGKWHLGD-AKPFWPNRR 166
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GF+ FG+ G SY+ D + L H R + TD F+ EAV+
Sbjct: 167 GFDEWFGFSGGGFSYW----GDLGMKDPLLGVH---RGDEPVDPKTLTHLTDDFSTEAVK 219
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I+ + P FLYLA+ A HA + A + + + +I R Y AMV +D+
Sbjct: 220 FIQRHETE-PFFLYLAYNAPHAPD-----HATRAHLQKTAHIEYGGRAVYGAMVAGMDEG 273
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+G V+ ++ G+ EN++IIF SDNG R N+PYRG K L+EGG++
Sbjct: 274 IGRVVDQIRESGLGENTMIIFYSDNGG----------RREHAVNFPYRGHKGMLFEGGIR 323
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
VP ++ P ++ + D PT AAG D S+
Sbjct: 324 VPFLVSWPGTVRSGMKEESPITALDLFPTALAAAGMDPSQ 363
>gi|296124181|ref|YP_003631959.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296016521|gb|ADG69760.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 470
Score = 177 bits (449), Expect = 1e-41, Method: Compositional matrix adjust.
Identities = 136/412 (33%), Positives = 200/412 (48%), Gaps = 65/412 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW + G+ +IPTP+ID++A NG+ A C+PSRA L+TG+YP G +
Sbjct: 52 GWGETGIQGNPQIPTPHIDSIAKNGVRCTQGFVAATYCSPSRAGLLTGRYPTRFGHEFNR 111
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I A G+ L E L + L LGY T +GKWHLG EY P RGF+ FG L
Sbjct: 112 I--ANVSGLDLQETTLADRLHGLGYKTACVGKWHLGD-GPEYRPTKRGFDEFFGTLANTP 168
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
++ D SR N Y TD + K +V+ I Q P FL
Sbjct: 169 FFHPTKFVD--SRVSNDVAEVSDENF---------YTTDEYAKRSVEWIGQQQ-QSPWFL 216
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YL A HA L+APQ+ +++F+ I DP R+ +AAM+ +DD++G V+ ++ G
Sbjct: 217 YLPFNAQHA-----PLQAPQKYLDRFESIADPKRKLFAAMMSAMDDAIGQVLGKVRELGQ 271
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQ- 388
EN+++ F+SDNG PT + T++ N P RG K T +EGG +VP ++ W ++
Sbjct: 272 EENTLVFFISDNGGPT---QGTTS-----QNGPLRGFKMTTFEGGTRVPFLVQWKGKLPA 323
Query: 389 ----QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
NP ++L D LPT+ TAAG ID P+ + +D
Sbjct: 324 GKTYDNPVINL------DVLPTVLTAAGS-------KID------------PAWKLDGVD 358
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+ ++S + N P ++ E+ AVR WKLV+ +G + Y
Sbjct: 359 LVPYFTSSIANKP--HETLYWRFGEQ---WAVRQGDWKLVVARGGSGQPELY 405
Score = 46.6 bits (109), Expect = 0.037, Method: Compositional matrix adjust.
Identities = 28/78 (35%), Positives = 42/78 (53%), Gaps = 6/78 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD + K +V+ I Q P FLYL A HA L+APQ+ +++F+ I DP R+
Sbjct: 193 YTTDEYAKRSVEWIGQQQ-QSPWFLYLPFNAQHA-----PLQAPQKYLDRFESIADPKRK 246
Query: 75 TYAALTKSTTLTLLIVYG 92
+AA+ + + V G
Sbjct: 247 LFAAMMSAMDDAIGQVLG 264
>gi|414072362|ref|ZP_11408307.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410805226|gb|EKS11247.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 473
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 116/346 (33%), Positives = 177/346 (51%), Gaps = 31/346 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS ++ TPN+D LA +G+ Y + C PSRA LMTGKY G +
Sbjct: 40 GFGDFGFQGSTQLKTPNLDKLAQSGVRFTQGYVSDSTCGPSRAGLMTGKYQQRFGYEEIN 99
Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G G+PL ++ + +YL+E GY T GKWHLG R + PL RGF++
Sbjct: 100 VPGFMSDNSALKGADMGLPLDQKTMGDYLKEQGYKTAVFGKWHLGDADR-FHPLKRGFDT 158
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
G+ G SY+++ + + + R+ ++ EY TD+ KEA + IE
Sbjct: 159 FLGFRGGDRSYFNYSEQEMKNGNKHFFDKKLERDFGN-YEEPKEYLTDVLGKEAAKYIE- 216
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
Q D+P F+YLA A H LE+ + + +F +T R+ AAM LD + G V
Sbjct: 217 QNKDEPFFIYLAFNAVHT-----PLESDPKDLAKFPNLTG-KRKELAAMTLGLDRASGYV 270
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ L+ G+ +N+I++F +DNG P+ +N +N P G K+ EGG++VP +
Sbjct: 271 LDKLKELGLDDNTIVVFSNDNGGPSD--------KNASNNAPLAGTKSNQLEGGIRVPFL 322
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ W I+ + + D LPT Y+AA G L +IDG+D
Sbjct: 323 ISWPKHIKPGSTYDYPVSTL-DLLPTFYSAAKGKA--LGSDIDGVD 365
>gi|149177349|ref|ZP_01855954.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
gi|148843874|gb|EDL58232.1| N-acetylgalactosamine-4-sulfatase precursor [Planctomyces maris DSM
8797]
Length = 472
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 127/368 (34%), Positives = 193/368 (52%), Gaps = 57/368 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ +L G+ +IPTP+ID+LA +GI Y P C+PSRA L+TG+ P G + P
Sbjct: 37 GYGELGCQGNPQIPTPHIDSLASHGIRFTQAYVTAPNCSPSRAGLLTGRIPTRFGYEFNP 96
Query: 151 IWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
I GA G+P E+ + E L + GY+T IGKWHLG +Y P GF+ FG++
Sbjct: 97 I-GARNEDSGTGLPPDEQTIAERLHDQGYTTCLIGKWHLGG-TADYHPFRHGFDEFFGFM 154
Query: 207 N-----------GVISYYDH--ILSDQYSRTVELN-------GHD---------MRRNLS 237
+ GV + + Q R + N G+D + R
Sbjct: 155 HEGHYFVPPPYHGVTTMLRRKTLPGRQKGRWISENLIYSTHMGYDEPDYDANNPIIRGGQ 214
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
+T EY TD FT+EAV I ++ DKP FLYLA+ A H+ GK ++ I F
Sbjct: 215 PVNET--EYLTDAFTREAVSFI-NRHQDKPFFLYLAYNAVHSPLQGK-----KKDIQHFT 266
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
I D +R+ +AAM+ +D S+G ++ +Q+ G+ E ++I+F+SDNG PT RE ++
Sbjct: 267 QIEDIHRQIFAAMLSSMDQSIGKILKQVQQSGLDEKTLIVFLSDNGGPT---RELTS--- 320
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
SN P RG K +++EGG++VP ++ W+ + + + + + D PT AG +
Sbjct: 321 --SNLPLRGEKGSMYEGGLRVPFLMRWTGTLAPKQTIDVPVSSL-DIFPTSVALAG---A 374
Query: 417 RLPLNIDG 424
LP N+DG
Sbjct: 375 SLPQNLDG 382
Score = 51.2 bits (121), Expect = 0.002, Method: Compositional matrix adjust.
Identities = 30/69 (43%), Positives = 41/69 (59%), Gaps = 6/69 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY TD FT+EAV I ++ DKP FLYLA+ A H+ GK ++ I F I D +R
Sbjct: 220 EYLTDAFTREAVSFI-NRHQDKPFFLYLAYNAVHSPLQGK-----KKDIQHFTQIEDIHR 273
Query: 74 RTYAALTKS 82
+ +AA+ S
Sbjct: 274 QIFAAMLSS 282
>gi|325286704|ref|YP_004262494.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
gi|324322158|gb|ADY29623.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga lytica DSM 7489]
Length = 484
Score = 177 bits (449), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 120/347 (34%), Positives = 177/347 (51%), Gaps = 37/347 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS + TPN+D LA +G Y C PSRA L+TGKY G +
Sbjct: 34 GYADFGFQGSKIMKTPNLDKLAKSGAKFTQGYVTDATCGPSRAGLITGKYQQRFGYEEIN 93
Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G A+ G+PL + + ++L++LGY T GKWHLG R Y P RGF+
Sbjct: 94 VPGYMSANSKFLADDMGLPLDQLTIADHLKKLGYKTAMYGKWHLGDADR-YHPTKRGFDE 152
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
+G+ G +Y+ Y+ + N + M R + EY TD KEAV IE
Sbjct: 153 FYGFRGGARNYFG------YNDVSKANLDNRMERGFGN-YQEPTEYVTDALAKEAVSFIE 205
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
+ P F+YLA A H ++A +E +N+F +T R+ AAM LD + GT
Sbjct: 206 KNKGN-PFFIYLAFNAVHT-----PMQATKEDLNKFPNLTG-KRKELAAMTLALDRACGT 258
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V++ L+ G+ +N+I++F +DNG PT +N N P G K+ EGG++VP
Sbjct: 259 VLNKLKELGLDKNTIVVFSNDNGGPTD--------KNASLNLPLSGTKSNHLEGGIRVPF 310
Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
++ W QI+ + + + D LPT Y A GG+ + L +IDG+D
Sbjct: 311 LISWPKQIKSKTVYNFPVSTL-DLLPTFYAAGGGNVADLK-DIDGVD 355
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 26/67 (38%), Positives = 36/67 (53%), Gaps = 7/67 (10%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
EY TD KEAV IE + P F+YLA A H ++A +E +N+F +T R
Sbjct: 190 EYVTDALAKEAVSFIEKNKGN-PFFIYLAFNAVHT-----PMQATKEDLNKFPNLTG-KR 242
Query: 74 RTYAALT 80
+ AA+T
Sbjct: 243 KELAAMT 249
>gi|449138178|ref|ZP_21773473.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448883202|gb|EMB13740.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 489
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 179/361 (49%), Gaps = 53/361 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS+ I TPNID LA ++L+ Y P+C+P+RA ++TG YP G+ G +
Sbjct: 57 GWNDVGFHGSD-IRTPNIDRLARESVVLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G+P PE+L +LGY +A+ GKWHLG + PL G +G+ NG I
Sbjct: 116 SPTKKHGLPSELETTPEHLAKLGYDHRAMFGKWHLGLASTLFHPLRHGMTEFYGHYNGAI 175
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ R +L+ H RN D+V E Y+T+L V I+ +PL
Sbjct: 176 DYFSR------ERFGQLDWH---RN----HDSVHEEGYSTELVGNAVVDFIDRHAGQQPL 222
Query: 269 FLYLAHLAAH---------------------AGNAGKHLEAPQETINQFQYITDPN--RR 305
+ Y+A A H A N + + A +E ++ N R+
Sbjct: 223 YAYVAFNAPHSPLQALRSDLDDYGFDPTGKLAPNTDRKI-AKRENALEYGKRGKGNSIRQ 281
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
T+AAM +D +G ++ A+ R GM EN++++F SDNGA + ++ GSN P R
Sbjct: 282 TFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGGSNEPLR 332
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
G K T WEGGV+V A + P+ + D LPT+ AAGG P DG+
Sbjct: 333 GNKFTTWEGGVRVVATMRWPREFPAGVTYESVASYVDLLPTMVAAAGGQP---PERTDGV 389
Query: 426 D 426
+
Sbjct: 390 N 390
>gi|421614608|ref|ZP_16055661.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408494617|gb|EKJ99222.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 472
Score = 177 bits (448), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 119/367 (32%), Positives = 180/367 (49%), Gaps = 65/367 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPNID LA + L+ Y P+C+P+RA ++TG YP G+ G +
Sbjct: 40 GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 98
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G+P PE+L +LGY +A+ GKWHLG + PL+ G +G+ NG I
Sbjct: 99 SPTKKHGLPPQLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 158
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ R +L+ H +D+V E Y+T+L V I+ P+
Sbjct: 159 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGPV 205
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
+ Y+A A H+ L+A + ++++ + DP
Sbjct: 206 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPTGKLAPNTDRKIAKREKALDYGKRGK 258
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
R+T+AAM +D +G ++ A+ R GM EN++++F SDNGA + ++ G
Sbjct: 259 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 309
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN P RG K T WEGGV+V A++ P+ + D LPT+ AAGG P
Sbjct: 310 SNEPLRGNKFTTWEGGVRVVAMMRWPRELPAGIAYDSVTSYVDLLPTMVAAAGGPP---P 366
Query: 420 LNIDGLD 426
DG++
Sbjct: 367 EETDGIN 373
>gi|241143299|ref|XP_002404958.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215493668|gb|EEC03309.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 166
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 89/171 (52%), Positives = 120/171 (70%), Gaps = 6/171 (3%)
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
PLFLYL+HLA HAG ++AP+E + +F YI NR YA V LD SVG V AL+
Sbjct: 1 PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVFQALE 60
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
+K ML+N++++F+SDNGA + + E SN GSN+P RG K TLWEGGV+VPA +W P
Sbjct: 61 KKDMLQNTMLVFISDNGA--IPWGEWSNA---GSNWPLRGAKFTLWEGGVRVPAFVWGPL 115
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
+++ RVS Q+MH+SDW+PTLY+ AGG+ + L + IDGL+ W +L N PS
Sbjct: 116 LRKPQRVSNQLMHVSDWVPTLYSLAGGNIADLGV-IDGLNMWQALSRNLPS 165
Score = 52.4 bits (124), Expect = 8e-04, Method: Compositional matrix adjust.
Identities = 24/56 (42%), Positives = 34/56 (60%)
Query: 36 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY 91
PLFLYL+HLA HAG ++AP+E + +F YI NR YA + ++ IV+
Sbjct: 1 PLFLYLSHLAVHAGTQYGPIQAPKENLRKFDYIGVKNRSLYAGAVDALDQSVGIVF 56
>gi|407938758|ref|YP_006854399.1| sulfatase [Acidovorax sp. KKS102]
gi|407896552|gb|AFU45761.1| sulfatase [Acidovorax sp. KKS102]
Length = 452
Score = 176 bits (447), Expect = 2e-41, Method: Compositional matrix adjust.
Identities = 118/344 (34%), Positives = 172/344 (50%), Gaps = 31/344 (9%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRA 133
+A T++ + ++ GW DLS +G + TP++DALA G+ YA VC+ +R
Sbjct: 10 SASTQAPNIVFILADDLGWADLSVYGQADFTTPHLDALAGEGVRFTQAYANSAVCSATRF 69
Query: 134 SLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
+L+TG+Y + G++ P + A G+P LP LR+ GY T IGKWHLG
Sbjct: 70 ALITGRYQYRLRGGLEEPLVRKAHVHGLPPEHPTLPSLLRDAGYDTALIGKWHLGSLP-T 128
Query: 192 YTPLYRGFESHFGYLNGVISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
+ PL G++ FG G I Y+ H + D +R + + R VG Y T
Sbjct: 129 FGPLKSGYDRFFGNYGGAIDYFTHKPGVGDAVARDLYEGEVPVER--------VGYY-TQ 179
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
L EA + I ++ KP FL L A H G E + + N TYA
Sbjct: 180 LLADEATRWIGERSAAKPFFLSLHFTAPHWPWVGPEDEEISLGLKDLFHYDGGNLATYAR 239
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV+ LD +VG V+ AL+ +G+ +N+I++F SDNG + +P+ G K
Sbjct: 240 MVRSLDKAVGQVLEALKAQGLADNTIVVFTSDNGG-----------ERFSKTWPFTGQKT 288
Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
L EGG++VP +L W +I P+VS Q+ DWLPTL AAG
Sbjct: 289 ELLEGGIRVPTLLRWPARIA--PQVSDQVTATMDWLPTLLAAAG 330
>gi|149198444|ref|ZP_01875489.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149138450|gb|EDM26858.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 458
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 114/321 (35%), Positives = 177/321 (55%), Gaps = 27/321 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGI-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ F+G +IPTP++D++A NG+ ++ + PVC PSRA L+TG+Y G P
Sbjct: 31 GYQDVGFNGCKDIPTPHLDSIAQNGVNCIDAHVSYPVCGPSRAGLLTGRYQDRFGFTTNP 90
Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
E G+PL E+ + E L+E+GYS+ IGKWH+G + PL RGF+ FG+L+G
Sbjct: 91 TVNPENPIAGLPLEEKNIAEVLKEVGYSSSIIGKWHMGTHPIHH-PLNRGFDHFFGFLSG 149
Query: 209 VISYY--DHILSD--QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
Y+ + L D + R + + R+ + G Y TD+ T AV I+ +
Sbjct: 150 GHDYFPAKYNLKDLSEVKRIWDWYRTHLIRDRERIQVSEG-YLTDILTDAAVDFIDKKAS 208
Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+K P LYL++ A H L+A ++ + +F +I D RRTYAAMV +DD VG V++
Sbjct: 209 EKKPFMLYLSYNAPHTP-----LQASEKYLKRFTHIKDSKRRTYAAMVSAVDDGVGRVMA 263
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
L++ + E++++ F+SDNG P + N P+ K +L EGG +VP A+
Sbjct: 264 RLKKHKIEEDTLVFFLSDNGGP------LKGPMPFTDNGPFSKGKGSLHEGGTRVPFAVQ 317
Query: 383 W-----SPQIQQNPRVSLQMM 398
W S QI + P SL ++
Sbjct: 318 WKGTIPSGQIYKKPMSSLDIL 338
Score = 45.8 bits (107), Expect = 0.077, Method: Compositional matrix adjust.
Identities = 25/70 (35%), Positives = 39/70 (55%), Gaps = 6/70 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
Y TD+ T AV I+ + +K P LYL++ A H L+A ++ + +F +I D R
Sbjct: 190 YLTDILTDAAVDFIDKKASEKKPFMLYLSYNAPHTP-----LQASEKYLKRFTHIKDSKR 244
Query: 74 RTYAALTKST 83
RTYAA+ +
Sbjct: 245 RTYAAMVSAV 254
>gi|325108643|ref|YP_004269711.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324968911|gb|ADY59689.1| N-acetylgalactosamine-4-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 484
Score = 176 bits (446), Expect = 3e-41, Method: Compositional matrix adjust.
Identities = 131/401 (32%), Positives = 198/401 (49%), Gaps = 70/401 (17%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
LLIV G+NDL S+ I TP++D LA G L + Y A P CTPSRASL+TG+YP
Sbjct: 31 LLIVSDDQGYNDLGVLNSDLI-TPHLDRLAAEGTRLTDFYVAWPACTPSRASLLTGRYPQ 89
Query: 143 HTGM-----QGPPIWGAEPR------------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
G+ P +G + + G+ E+ LPEYL++LGY++ GKW L
Sbjct: 90 RNGIYDMIRNEAPDYGYKYKPAEYEVSFERIGGMDQREKLLPEYLKKLGYTSAIFGKWDL 149
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G +R + P RGF+ +G++N I Y+ H ++Y M R S + GE
Sbjct: 150 GSLKR-FLPTNRGFDEFYGFVNTGIDYFTH---ERYGVP------SMFRQTSLTEEDRGE 199
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQE--------- 291
YAT+LF +EA+ ++ +P LYL A H ++ ++AP+E
Sbjct: 200 YATELFKREALAFLDRAEASEPFLLYLPFNAPHNSSSLDPRIRSTVQAPEEYKAMYPAVD 259
Query: 292 ----TINQFQY-----ITDP--NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
T+N+++Y + P +R Y A V +D ++G V+ L+ KG+L+N+I+IF S
Sbjct: 260 PEYRTVNRYRYGNPAQVVTPVARQRDYRAAVTCMDAAIGEVLGKLEDKGVLDNTIVIFFS 319
Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
DNG N P RG K +WEGG++VP ++ W + +
Sbjct: 320 DNGG-----------SGGADNSPLRGHKAQVWEGGIRVPCLVRWPDGGVPAGATNDSFLS 368
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+ +P+L AAGG T + +DG + W +L S RN
Sbjct: 369 SLELVPSLVKAAGG-TIPEDIVMDGRNWWPALTEAASSPRN 408
>gi|323455793|gb|EGB11661.1| hypothetical protein AURANDRAFT_70842 [Aureococcus anophagefferens]
Length = 1489
Score = 176 bits (445), Expect = 4e-41, Method: Compositional matrix adjust.
Identities = 125/395 (31%), Positives = 198/395 (50%), Gaps = 74/395 (18%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TP +D LA +GI L+ Y+Q +CTPSR++L++G YPIHTGMQ I G+P +
Sbjct: 136 TPFMDLLADDGIKLDKYYSQQLCTPSRSALLSGYYPIHTGMQHDVIQPESMFGMPAGHKL 195
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
+P +L++ W+ Y H +G YYD
Sbjct: 196 MPSFLKDDQL-------WY-----------YNHKSPH--ACDGSNCYYD----------- 224
Query: 226 ELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
M+RN + A D++G Y+T LF ++E++ PLF+YL+ HA
Sbjct: 225 ------MQRNGAIAEDSIGVYSTFLFVDAFTDVLENEDASTPLFMYLSWQNVHA-----P 273
Query: 286 LEAP------QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
L+ P + ++ I DP+RRT+A++ LD+++ V+ A+++ G +NSI++
Sbjct: 274 LDPPPASFYTKTELSLVNSIEDPHRRTFASITIVLDNAMKKVVGAMKKSGFYDNSILVVA 333
Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR-VSLQ-M 397
SDNG + + G NYPYRG K L+EGG+ V A + SP I ++ R S Q +
Sbjct: 334 SDNGGCS---------GSGGYNYPYRGGKQYLYEGGIHVNAFIHSPLIPKSARGTSYQGL 384
Query: 398 MHISDWLPTLYTAA-GGDTSRLPLNIDGLDQWSSLL---LNTPSR--RNSNIDGLDQWSS 451
H+SDWLPTL T A + + LP ++DG+DQW ++L P+ R + +D WS
Sbjct: 385 FHVSDWLPTLVTGAMKTEPTMLPADLDGVDQWHAILGSGRGAPAAYPRTEILHNIDLWSL 444
Query: 452 LLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLG 486
+ ++ + + A+R+ KLV+G
Sbjct: 445 EIYGNVTKLRTPI---------QAIRVGDMKLVMG 470
Score = 47.4 bits (111), Expect = 0.023, Method: Compositional matrix adjust.
Identities = 28/86 (32%), Positives = 47/86 (54%), Gaps = 11/86 (12%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP-- 58
M+RN + A D++G Y+T LF ++E++ PLF+YL+ HA L+ P
Sbjct: 225 MQRNGAIAEDSIGVYSTFLFVDAFTDVLENEDASTPLFMYLSWQNVHA-----PLDPPPA 279
Query: 59 ----QETINQFQYITDPNRRTYAALT 80
+ ++ I DP+RRT+A++T
Sbjct: 280 SFYTKTELSLVNSIEDPHRRTFASIT 305
>gi|187735676|ref|YP_001877788.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
gi|187425728|gb|ACD05007.1| sulfatase [Akkermansia muciniphila ATCC BAA-835]
Length = 465
Score = 176 bits (445), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 120/346 (34%), Positives = 180/346 (52%), Gaps = 38/346 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL GS +I TP++D LA G+ + Y P+C+PSR L+TG++P G+ P
Sbjct: 40 GYGDLGCTGSKQIKTPSLDRLAREGVFCSRAYVTAPMCSPSRMGLLTGRFPKRYGITTNP 99
Query: 151 IWGAE--PR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
+ P G+P TE+ +PEYL GY + GKWHLG + YTP RGF +G+
Sbjct: 100 NIQMDYLPESHYGLPQTEKLIPEYLAPCGYRSAVFGKWHLGH-TKGYTPPERGFTHWWGF 158
Query: 206 LNGVISYYDHILSDQYSRTVE-LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
L G Y+ + E LN + N + D Y TD T AV+ +++
Sbjct: 159 LGGSRHYF------PVKKEAEGLNPSMIVSNFTDKTDIT--YLTDDITDRAVEFLQEAGK 210
Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
DK P F+++++ A H N EA E I +F+ + + RR Y AMV +D +G ++
Sbjct: 211 DKKPFFMFVSYNAPHWPN-----EAKPEDIAKFRNVQNGERRVYCAMVYAMDRGIGRILD 265
Query: 324 ALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL+ G+ +++I++F+SDN GAP E S+ N P+RG K +EGGV+VP I+
Sbjct: 266 ALKADGLEKDTIVVFLSDNGGAP-----EASS-----CNAPFRGAKRQHFEGGVRVPFII 315
Query: 383 WSPQIQQ--NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P ++ V Q + D LP L A G +P +DG+D
Sbjct: 316 RYPADKRLVPGSVCRQPVSSVDLLPALLKANG---RHIPRKLDGMD 358
>gi|399033016|ref|ZP_10732099.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
gi|398068627|gb|EJL60037.1| arylsulfatase A family protein [Flavobacterium sp. CF136]
Length = 460
Score = 175 bits (444), Expect = 5e-41, Method: Compositional matrix adjust.
Identities = 129/374 (34%), Positives = 193/374 (51%), Gaps = 47/374 (12%)
Query: 79 LTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLM 136
LT+ + L+I GWND+ +HGS I TPNID LA NG+ LN YA P C+PSRASL
Sbjct: 31 LTEKPNIILIIADDAGWNDVEYHGS-VIQTPNIDFLAKNGVELNRFYANPTCSPSRASLF 89
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG+ G+ PI +P + LP+ L + Y T IGKWHLG ++ P
Sbjct: 90 TGRPASRMGIVA-PISDKSQFKLPDSIATLPKLLHQNNYQTALIGKWHLG-LQQSSGPKA 147
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY------ATDL 250
GF+ +G+L+G I Y H+ +N +W GE+ ATDL
Sbjct: 148 YGFDYSYGFLHGQIDQYTHLY----------------KNGDKSWYRNGEFIDEQGHATDL 191
Query: 251 FTKEAVQ-LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
T EA+ L E + +K FL +A+ A H L+ Q+ + + I D +RR +A
Sbjct: 192 ITNEAIHWLSEKRDSNKNFFLEVAYSAPHFP-----LQEEQKWKDPYMNSIKDSSRRDFA 246
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN--------WGS 360
A + LD S+G ++ L+++ + +N+++IFMSDNGA +E ++ N N G
Sbjct: 247 AAMSHLDYSIGILLEKLKQENLDKNTLVIFMSDNGA--MENWDSRNEYNGVHPSNTVLGD 304
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
N P R K + +EGG++VP I + +N + S + I D LP++ AG LP
Sbjct: 305 NSPLRDWKTSNYEGGIRVPCIFYWKDHLKNYK-SESYISIIDLLPSILFLAG--DKNLPQ 361
Query: 421 NIDGLDQWSSLLLN 434
+I+G + W ++ N
Sbjct: 362 SIEGKNVWPAIAEN 375
>gi|32471439|ref|NP_864432.1| arylsulfatase B [precursor] [Rhodopirellula baltica SH 1]
gi|32443280|emb|CAD72111.1| Arylsulfatase B [Precursor] [Rhodopirellula baltica SH 1]
Length = 579
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 181/367 (49%), Gaps = 65/367 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPNID LA + L+ Y P+C+P+RA ++TG YP G+ G +
Sbjct: 147 GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 205
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++ G+P PE+L +LGY +A+ GKWHLG + PL+ G +G+ NG I
Sbjct: 206 SPSKKHGLPPQLETAPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 265
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ R +L+ H +D+V E Y+T+L V I+ P+
Sbjct: 266 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGPV 312
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
+ Y+A A H+ L+A + ++++ + DPN
Sbjct: 313 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLAPNTDRKIAKREKALDYGKRGK 365
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
R+T+AAM +D +G ++ A+ R GM EN++++F SDNGA + ++ G
Sbjct: 366 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 416
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+N P RG K T WEGGV+V A++ P + D LP++ AAG S P
Sbjct: 417 NNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPSMVGAAG---SPPP 473
Query: 420 LNIDGLD 426
DG++
Sbjct: 474 EETDGIN 480
>gi|449671825|ref|XP_002165184.2| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 160
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 80/144 (55%), Positives = 102/144 (70%), Gaps = 6/144 (4%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPNID LA NG++L+N Y P+CTPSR+++MTG+YPIHTGMQ I
Sbjct: 21 GWNDIGFHGSKEISTPNIDRLATNGVVLDNYYVLPICTPSRSAIMTGRYPIHTGMQQDTI 80
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+G P GV L E+FLP+YL++ GY T +GKWHLGFF +EYTP YRGF+S++G G
Sbjct: 81 YGPNPYGVSLNEKFLPQYLKQQGYKTYGVGKWHLGFFAKEYTPTYRGFDSYYGSYLGKGD 140
Query: 212 YYDHILSDQYSRTVELNGHDMRRN 235
Y++H S T G D+ N
Sbjct: 141 YWNH------SNTETFTGLDLHDN 158
>gi|149199999|ref|ZP_01877025.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149136872|gb|EDM25299.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 512
Score = 175 bits (444), Expect = 6e-41, Method: Compositional matrix adjust.
Identities = 126/384 (32%), Positives = 190/384 (49%), Gaps = 40/384 (10%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
T+A + L G++D+ +HG+ I TPNID++A G+ + Y + VC PSRA
Sbjct: 15 TFADKQPNIILIFADDMGYDDVGYHGNKRIITPNIDSIAEQGVQFSQGYVSASVCGPSRA 74
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
L+TG Y G P P G+P ++ + E L+ LGY+ IGKWH+G
Sbjct: 75 GLLTGVYQQRFGCGENPNGSGYPNQMKYPMAGLPQSQSMISEELKTLGYTNGMIGKWHMG 134
Query: 187 FFRREYTPLYRGFESHFGYLNGVISY--YDHILSDQYSRTVELNGHDMR-RNLSTAWDTV 243
F P RG++ +G++NG Y + + SR +M N + D
Sbjct: 135 F-DMSLRPNQRGYDFFYGFINGSHDYTEWTQEFAKGKSRWPIFRNEEMEPANKAQYIDVF 193
Query: 244 GE---------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETIN 294
E Y TDLFT EAV I D+ DKP FLYLA+ A H + Q ++
Sbjct: 194 KEKGVKVVDENYLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH-----HPWQTTQHALD 247
Query: 295 QFQYITD-PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT---VEYR 350
+ ++ D N +A+MV +D+ +G V+ L+ K + +N+IIIF+SDNG+P +E+
Sbjct: 248 KTAHLKDDKNYHVFASMVYAMDEGIGKVMKKLKEKNIDDNTIIIFLSDNGSPQGQGIEHS 307
Query: 351 ETSNYRNWG-----SNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWL 404
R+ G S +RG K +EGG++VP I W QIQ+ + + + + D
Sbjct: 308 PKDPNRHRGGFTMSSTGIFRGYKGDTYEGGIRVPFCIKWPQQIQKGTKYDMPISAL-DLQ 366
Query: 405 PTLYTAAGGDTSR--LPLNIDGLD 426
PTL AAGG+ + DG+D
Sbjct: 367 PTLVKAAGGNDKKPQKGFAYDGVD 390
Score = 42.4 bits (98), Expect = 0.81, Method: Compositional matrix adjust.
Identities = 21/33 (63%), Positives = 23/33 (69%), Gaps = 1/33 (3%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
Y TDLFT EAV I D+ DKP FLYLA+ A H
Sbjct: 205 YLTDLFTDEAVNFI-DRNADKPFFLYLAYNAVH 236
>gi|417301514|ref|ZP_12088666.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542201|gb|EGF28693.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 489
Score = 175 bits (443), Expect = 7e-41, Method: Compositional matrix adjust.
Identities = 118/367 (32%), Positives = 178/367 (48%), Gaps = 65/367 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+ FHGS EI TPNID LA + L+ Y P+C+P+RA ++TG YP G+ G +
Sbjct: 57 GWNDVGFHGS-EIRTPNIDRLASESVTLDRFYVTPICSPTRAGVLTGLYPFRFGIWGGVV 115
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAI-GKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G+P PE+L +LGY +A+ GKWHLG + PL+ G +G+ NG I
Sbjct: 116 SPTKKHGLPTLLETTPEHLSKLGYDHRAMFGKWHLGLASTLFHPLHHGMTEFYGHYNGAI 175
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE--YATDLFTKEAVQLIEDQPVDKPL 268
Y+ R +L+ H +D+V E Y+T+L V I+ +
Sbjct: 176 DYFSR------ERFGQLDWH-------RDFDSVHEEGYSTELVGNAVVDFIDRNANAGAV 222
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN------------------------- 303
+ Y+A A H+ L+A + ++++ + DPN
Sbjct: 223 YAYVAFNAPHS-----PLQALRSDLDEYGF--DPNNKLSPNTDRKIAKREKALDYGKRGK 275
Query: 304 ----RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
R+T+AAM +D +G ++ A+ R GM EN++++F SDNGA + ++ G
Sbjct: 276 GNSIRQTFAAMTTAMDRQIGRILDAIDRNGMRENTLVVFHSDNGA---------DPKHGG 326
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN P RG K T WEGGV+V A++ P + D LP + AAGG P
Sbjct: 327 SNEPLRGNKFTTWEGGVRVVAMMRWPNELPAGITYDSVTSYVDLLPIMVAAAGGPP---P 383
Query: 420 LNIDGLD 426
DG++
Sbjct: 384 EETDGIN 390
>gi|109897220|ref|YP_660475.1| sulfatase [Pseudoalteromonas atlantica T6c]
gi|109699501|gb|ABG39421.1| sulfatase [Pseudoalteromonas atlantica T6c]
Length = 471
Score = 174 bits (442), Expect = 1e-40, Method: Compositional matrix adjust.
Identities = 120/377 (31%), Positives = 186/377 (49%), Gaps = 44/377 (11%)
Query: 74 RTYAALTKSTTLTLL-------IVY------GWNDLSFHGSNEIPTPNIDALAYNGIILN 120
R AAL+ S T L IV+ G+ D F GS + TPN+D LA G+
Sbjct: 7 RVVAALSISVACTSLSYAKQPNIVFLFSDDAGYADFGFQGSETMKTPNLDQLASEGVRFT 66
Query: 121 NMY-AQPVCTPSRASLMTGKYPIHTGMQG----------PPIWGAEPRGVPLTERFLPEY 169
Y + C PSRA +MTG+Y G + I GAE G+PL E + +Y
Sbjct: 67 QGYVSDSTCGPSRAGIMTGRYQQKFGYEEINVPGYMSEHSAIKGAE-MGIPLDEVTMGDY 125
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNG 229
++ LGY T GKWHLG E P++RGF+ +G+ G SY+ + ++ ++
Sbjct: 126 MKSLGYRTAFYGKWHLGG-TDELHPMHRGFDEFYGFRGGDRSYWAYEVNAPERKSAVFTD 184
Query: 230 HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP 289
+ + + G Y TD+ ++A Q IE P DKP F++L+ A H +EA
Sbjct: 185 KKLEHGIDQFQEHEG-YLTDVLAEKANQFIEKAP-DKPFFIFLSFNAVHT-----PMEAT 237
Query: 290 QETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEY 349
E + +F + R+ AAM LD + G V++ L+ G+ ++++++F +DNG PT
Sbjct: 238 PEDLAKFPQLKG-KRKEVAAMTLALDRASGAVLNKLKELGLEDDTLVVFSNDNGGPTD-- 294
Query: 350 RETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYT 409
+N SNYP G K+ EGG++VP ++ P +V + + D LPT +
Sbjct: 295 ------KNASSNYPLAGTKSNFLEGGIRVPFLVKWPAKLAAGKVYDKPVSTLDLLPTFFK 348
Query: 410 AAGGDTSRLPLNIDGLD 426
A GG+ + +DG+D
Sbjct: 349 AGGGE--EVMSELDGVD 363
>gi|406834549|ref|ZP_11094143.1| Cerebroside-sulfatase [Schlesneria paludicola DSM 18645]
Length = 470
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 115/337 (34%), Positives = 176/337 (52%), Gaps = 29/337 (8%)
Query: 95 DLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
D + G+ ++ TP+ID LA G+ L YA PVC+PSRA L+TG+YP+ G+ G
Sbjct: 61 DANCFGAKDLETPHIDGLAARGVRLTQFYAAAPVCSPSRAGLLTGRYPVRGGVPGNVSST 120
Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
A G+P T+ + E + GY+T IGKWHLG + + PL +GF++ FG++ G I +
Sbjct: 121 AGVAGMPTTQVTMAETFKAAGYATAHIGKWHLG-YSEDTMPLAQGFQTSFGHMGGCIDNF 179
Query: 214 DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLA 273
H Q N HD+ ++ G + DL +EA Q I ++P F+Y A
Sbjct: 180 SHFFYWQGP-----NRHDLHKDGREV-QRPGRFFPDLMVEEASQFIGTN-RERPFFVYFA 232
Query: 274 HLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLEN 333
H G + +++ + P R YAA V +D+ +G ++ + G+ E
Sbjct: 233 LNGPHYPYQGD-----PRWLERYKDLPYP-RNLYAAFVSSIDERIGGLLKRVDELGLREK 286
Query: 334 SIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPR 392
+I++F SD+G T E + GS YRG K +L+EGG++VPAI+ W ++ Q
Sbjct: 287 TIVVFQSDHGHST----EERAHLGGGSAGIYRGAKFSLFEGGIRVPAIISWPGRLPQGEA 342
Query: 393 VSLQMMHISDWLPTLYTAAGGDTSRLPL---NIDGLD 426
S Q+ H DWLPTL A +PL +IDG++
Sbjct: 343 RS-QLAHGCDWLPTLAALAA-----VPLVEPDIDGVN 373
>gi|254444367|ref|ZP_05057843.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
gi|198258675|gb|EDY82983.1| sulfatase, putative [Verrucomicrobiae bacterium DG1235]
Length = 462
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 125/375 (33%), Positives = 185/375 (49%), Gaps = 39/375 (10%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQP 126
+T P+ + A + G+NDLS +G+ +I TP ID+L GI + Y A P
Sbjct: 22 LTPPSAASSAEKPPNIVFIFADDLGYNDLSSYGATDIATPAIDSLGEQGIRFTDFYSASP 81
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
VC+PSRA+L+TG+YPI G+ G W G+ E + E L+E GY T +GKWHLG
Sbjct: 82 VCSPSRAALLTGRYPIRQGITG-VFWPQSFDGIDPAETTIAELLQENGYRTGLVGKWHLG 140
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
++ + PL GF S+FG I Y S+ V + G+D+ +++ Y
Sbjct: 141 HHQK-HLPLQNGFHSYFG-----IPY-----SNDMDMVVYMRGNDVE-----SYEVDQHY 184
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T +T+EAVQ IE Q D+P FLYLAH H ++ R
Sbjct: 185 TTRRYTEEAVQFIE-QNKDQPFFLYLAHSMPHV-----------PIYASENFVGTSKRGL 232
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y ++++LD SV ++ L + + EN++++F SDNG P + GS P R
Sbjct: 233 YGDVIQELDWSVAQILDTLDKHQLSENTLVVFTSDNG-PWTALKHLG-----GSAAPLRE 286
Query: 367 VKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K ++GG++VP ++ W QI + S M ++ DW PT A DT + +IDGL
Sbjct: 287 GKMFTFDGGMRVPCLVRWPAQIPAG-QTSHAMANMMDWFPTFSRIANLDTPK-SRSIDGL 344
Query: 426 DQWSSLLLNTPSRRN 440
D L + P N
Sbjct: 345 DITDVLTGSGPRADN 359
Score = 40.4 bits (93), Expect = 2.7, Method: Compositional matrix adjust.
Identities = 22/47 (46%), Positives = 28/47 (59%), Gaps = 1/47 (2%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
MR N +++ Y T +T+EAVQ IE Q D+P FLYLAH H
Sbjct: 170 MRGNDVESYEVDQHYTTRRYTEEAVQFIE-QNKDQPFFLYLAHSMPH 215
>gi|87306602|ref|ZP_01088749.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87290781|gb|EAQ82668.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 468
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 182/363 (50%), Gaps = 53/363 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM---- 146
G+ DLS G N TP +D LA +G L + Y P CTPSRASLMTG+YP G
Sbjct: 43 GFADLSCIGDNGCRTPRLDQLAASGTRLTSFYVSWPACTPSRASLMTGRYPQRNGTYDMI 102
Query: 147 --QGP-----------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+ P + G L E FL + L++ GY + GKW G +R Y
Sbjct: 103 RNEAPDYDYLYTPEEYAVTAERILGTDLQEVFLADVLKQAGYVSAVFGKWDGGQLKR-YL 161
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
PL RGF+ ++G+ N + Y+ H ++Y M R+ + G Y TDLF +
Sbjct: 162 PLQRGFDQYYGFANTGVDYFTH---ERYGVP------SMFRDNQPTEEDKGTYLTDLFER 212
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHL----EAPQETINQF---QYITDPNRR 305
EA++ I D+ D+P FLYL A H A N + + +APQE ++ F + + R+
Sbjct: 213 EAIRFI-DENHDRPFFLYLPFNAPHSASNLDRSIRGFAQAPQEYLDHFPGGESKQEKRRQ 271
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
Y A V+++D+++G V+ LQ+ + +N++IIF+SDNG N P R
Sbjct: 272 AYLAAVERMDEAIGKVVDQLQQHQIADNTLIIFLSDNGGGGG-----------ADNSPLR 320
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI--D 423
G K ++EGG +VP I+ P +VS Q + + PT+ A GG +LP ++ D
Sbjct: 321 GGKAKMFEGGNRVPCIVHWPGKVPAGKVSNQFLTSLEVFPTVIAAIGG---KLPDDVIYD 377
Query: 424 GLD 426
G D
Sbjct: 378 GFD 380
Score = 42.7 bits (99), Expect = 0.60, Method: Compositional matrix adjust.
Identities = 29/70 (41%), Positives = 39/70 (55%), Gaps = 7/70 (10%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHL---- 55
R N T D G Y TDLF +EA++ I D+ D+P FLYL A H A N + +
Sbjct: 192 FRDNQPTEEDK-GTYLTDLFEREAIRFI-DENHDRPFFLYLPFNAPHSASNLDRSIRGFA 249
Query: 56 EAPQETINQF 65
+APQE ++ F
Sbjct: 250 QAPQEYLDHF 259
>gi|149177363|ref|ZP_01855968.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843888|gb|EDL58246.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 466
Score = 174 bits (440), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 129/367 (35%), Positives = 181/367 (49%), Gaps = 53/367 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---- 146
G+ DL +G+ + TP +D LA G+ L + Y A P CT SRA+L+TG+YP G+
Sbjct: 46 GYGDLGCYGNPVMKTPMLDQLASEGVRLTDFYTASPTCTVSRATLLTGRYPQRIGLNHQL 105
Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
+G G+ +E +PEYL++ GY T GKW++GF P RGF+ FG+
Sbjct: 106 SADENYGD---GLRKSEVLIPEYLKQQGYRTACFGKWNVGF-SPGSRPTERGFDEFFGFA 161
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
G I YY H + + HD+ R L + V Y+TDLF A Q I + D+
Sbjct: 162 AGNIDYYHHYYAGR---------HDLWRGLKEVF--VEGYSTDLFADAACQYISAE-SDQ 209
Query: 267 PLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQY---ITDPNRRTYAAMVKKLDD 316
P F+YL A H G +AP ++ Y +P R Y A+V LD
Sbjct: 210 PFFIYLPFNAPHFPSQRNKQPGQGNEWQAPDLAFEKYGYDPQTKNPQER-YRAVVTALDS 268
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
++G V+ L G+ + +I+I+ SDNGA ++ R SN P R TLWEGG+
Sbjct: 269 AIGRVLKQLDTSGLRDQTIVIWYSDNGAFMLKERGLEV----ASNKPLRDGGVTLWEGGI 324
Query: 377 KVPAILWSP------QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQW 428
+VPAI+ P + Q+P +SL D LPTL T AGG LP +DG D
Sbjct: 325 RVPAIIRYPGHLKAGTVNQSPLISL------DILPTLITLAGGP---LPAERILDGQDML 375
Query: 429 SSLLLNT 435
+L T
Sbjct: 376 PALAAQT 382
>gi|325108489|ref|YP_004269557.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324968757|gb|ADY59535.1| Cerebroside-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 455
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 119/345 (34%), Positives = 175/345 (50%), Gaps = 25/345 (7%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
+L++Y G D +GS+++ TP ID LA GI MYA +C+ SRA L+TG+ P
Sbjct: 34 VLVIYSDDQGTIDAGCYGSDDLTTPAIDHLAETGIRFTQMYAPSCICSASRAGLLTGRLP 93
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G+ G+P +E L + ++ GY T +GKWHLG+ PL +GF+S
Sbjct: 94 VRAGVPANVSSKKGAEGMPASEVTLGDVFKQAGYRTAHVGKWHLGY-TPATMPLGQGFDS 152
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
FG++ G I Y H N HD+ R+ + G++ DL E +L+ +
Sbjct: 153 SFGHMGGCIDNYSHFFYWNGP-----NRHDLWRDGKEVFHD-GQFFPDLLVDEMQRLLTE 206
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ ++PLF+Y A H G + N + + P RR YAA V LD+ +G V
Sbjct: 207 KS-EQPLFVYWAINVPHYPYQGT-----DKWRNYYADLETP-RREYAAFVSTLDEKLGEV 259
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
++ L+ G EN+I++F SD+G T E + GS PYRG K L+EGG++VP++
Sbjct: 260 LATLEETGQRENTIVVFQSDHGHST----EERAFFGGGSAGPYRGAKGCLFEGGIRVPSV 315
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ P V QM DWLPTL D LPL +DG D
Sbjct: 316 ISWPAGLPQGEVRDQMAVGCDWLPTLVELTKVDVD-LPL-LDGKD 358
>gi|149174645|ref|ZP_01853271.1| sulfatase [Planctomyces maris DSM 8797]
gi|148846755|gb|EDL61092.1| sulfatase [Planctomyces maris DSM 8797]
Length = 405
Score = 173 bits (439), Expect = 2e-40, Method: Compositional matrix adjust.
Identities = 110/322 (34%), Positives = 164/322 (50%), Gaps = 25/322 (7%)
Query: 95 DLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPPIWG 153
DL+ +G+ ++ TP++D++A GI YA PVC+PSRA ++TG++P G+ G
Sbjct: 23 DLNCYGAKDLITPHMDSIARRGIRFTQFYASAPVCSPSRAGMLTGRFPARAGVPGNVSSH 82
Query: 154 AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY 213
G+P + + E +++ GY T IGKWHLG+ E P +GFE+ FG++ G I
Sbjct: 83 HGKSGMPTEQITIAEMMQQAGYQTAHIGKWHLGY-TPETMPHGQGFETSFGHMGGCI--- 138
Query: 214 DHILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
D YS NG HD+ N W G + DL ++ I + DKP FL
Sbjct: 139 -----DNYSHFFYWNGPNRHDLWENGKEVWRD-GAFFPDLMVEQCQDYIR-KAGDKPFFL 191
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y A H L+ ++ + +++ P R YAA V +DD +G V++ L +
Sbjct: 192 YWAINVPH-----YPLQGKEKWRKTYAHLSSP-RDKYAAFVSTMDDCIGEVLATLDACQL 245
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
E +IIIF SD+G E + GS PYRG K +L+EGG++VPA++ P
Sbjct: 246 REKTIIIFQSDHG----HSHEERTFGGGGSAGPYRGAKFSLFEGGIRVPAMISWPGTIAE 301
Query: 391 PRVSLQMMHISDWLPTLYTAAG 412
V Q+ DWLPT+ G
Sbjct: 302 GEVRDQLATGCDWLPTISALTG 323
>gi|372209242|ref|ZP_09497044.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 479
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 117/326 (35%), Positives = 165/326 (50%), Gaps = 40/326 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ND+ F+GS +I TPN+D LA NG+I+ Y A P C PSR S+MTGKY G Q
Sbjct: 39 GYNDVGFNGSKDIKTPNLDKLADNGMIMTAGYVAHPFCGPSRTSIMTGKYAHTMGAQFNI 98
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G+PL +F+ + L+E GY T A GKWHLG + P RGF+ +G+L G
Sbjct: 99 PSESEGTGYGIPLNNKFISKELQEAGYYTGAFGKWHLGA-DTPFHPNKRGFDEFYGFLGG 157
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNL----------STAWDTVGEYATDLFTKEAVQL 258
Y + +QY E +N+ T D EY TD ++EAV
Sbjct: 158 GHDY----IPEQYKPKYEFLKQRGSKNIRDYIKPLEHNGTEVDEK-EYITDGLSREAVNF 212
Query: 259 IEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
+ K P F+YLA+ A H L+A +E + F+ I D RRTYAAMV +D
Sbjct: 213 VYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEKRRTYAAMVYAVDRG 267
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
VG ++ AL+ LEN++++F SDNG + +N+P K + +EGG +
Sbjct: 268 VGKLVEALKANQQLENTLLVFFSDNGGKLGK---------GANNFPLTEGKGSAYEGGHR 318
Query: 378 VPAIL-WSPQIQ-----QNPRVSLQM 397
VP W +I+ NP +L +
Sbjct: 319 VPMFFHWPNRIKAGQKFHNPVSALDL 344
Score = 46.2 bits (108), Expect = 0.050, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 72
EY TD ++EAV + K P F+YLA+ A H L+A +E + F+ I D
Sbjct: 199 EYITDGLSREAVNFVYKASEKKQPFFMYLAYNAPHV-----PLQAKKEDMAVFKSIKDEK 253
Query: 73 RRTYAALT 80
RRTYAA+
Sbjct: 254 RRTYAAMV 261
>gi|6863178|gb|AAF30403.1|AF109925_1 sulfatase 2 precursor [Helix pomatia]
Length = 266
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 93/235 (39%), Positives = 134/235 (57%), Gaps = 32/235 (13%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ D+ +HG+ E TPN+D LA G+ L N Y QP+C+P+R+ LMTG+Y IHTG+Q
Sbjct: 38 YGYRDIGYHGA-EFATPNLDKLAAEGVKLENYYVQPICSPTRSQLMTGRYQIHTGLQHDI 96
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
IW ++P G+PL + + L+ +GYST AIGKWHLG +++EYTPLYRGF+S++GYL G
Sbjct: 97 IWPSQPYGLPLQFPTIADMLKSVGYSTHAIGKWHLGLYKKEYTPLYRGFDSYYGYLEGGE 156
Query: 211 SYYDHILSDQY-SRTVELN------------------------GHDMRRNLSTAWDTVGE 245
YY + D + +RT + G+D+R D G
Sbjct: 157 DYYTYYNCDTFHNRTTPADTSILESYSPKNILLGKHEDENKWCGYDLRDMNEPVTDMNGT 216
Query: 246 YATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
Y+T L+TK+A+ +I KP LYLA+ A H+ +E P E + +I
Sbjct: 217 YSTHLYTKKAIDIINGASTGGKPFLLYLAYQAVHSP-----MEVPAEYTKPYTFI 266
>gi|340377481|ref|XP_003387258.1| PREDICTED: arylsulfatase I-like [Amphimedon queenslandica]
Length = 507
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 112/363 (30%), Positives = 173/363 (47%), Gaps = 54/363 (14%)
Query: 79 LTKSTTLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
L + L+ +GW ++ +H + E+ TPNID L G+ LN YA C+PSR+SL
Sbjct: 22 LKPHIVMMLVDDWGWANVGYHRNPPTREVVTPNIDDLVKQGLELNQHYAYRCCSPSRSSL 81
Query: 136 MTGKYPIHTGMQG---------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
++G+ PIH Q PI G +P + E ++E GY+T +GKW G
Sbjct: 82 ISGRLPIHVSDQNIAPTNYNPNDPISGFS--AIPRNMTGIAEKMKEAGYATHQVGKWDAG 139
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHIL-------------SDQYSRTVELNGHDMR 233
++TP RGF++ FGY + YY ++ +DQ + + G D
Sbjct: 140 MATPDHTPKGRGFDTSFGYFHHYNDYYTEVVDSCNGTGVVDLWNTDQPAHGINGTGPD-- 197
Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI 293
+Y LF + + ++ PLFLY A H L+ P E +
Sbjct: 198 -----------KYEEALFRERLLDIVSKHDPSTPLFLYYAPHIVHT-----PLQVPDEYL 241
Query: 294 NQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS 353
N+F +I D +R Y AMV LDD VG ++SAL++KGM +N + + SDNG P +
Sbjct: 242 NKFSFIDDKDRMYYHAMVNYLDDVVGDLVSALKKKGMWDNLLYVTSSDNGGPVYPF---- 297
Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAA 411
+NYP +G K + W+GG++V A + + + R + +H++DW T A
Sbjct: 298 ---GGANNYPLKGGKLSDWQGGIRVNAFVSGGHLPEKMRGQKTDGYIHLADWYGTFCAIA 354
Query: 412 GGD 414
G D
Sbjct: 355 GVD 357
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 22/71 (30%), Positives = 36/71 (50%), Gaps = 3/71 (4%)
Query: 562 CTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLV--PQSHEQPDLVQADP 619
C + C L+N+ +DP E+ N+ASS PD ++ E K + T + H P QA
Sbjct: 437 CGDSGC-LYNIKDDPEERKNLASSMPDKLKEMQEKSKKYLATYFNPDRGHASPLACQAAV 495
Query: 620 KRFNDTWSPWI 630
++ W P++
Sbjct: 496 NKYGGFWGPFV 506
>gi|298706913|emb|CBJ29740.1| Formylglycine-dependent sulfatase, C-terminal fragment [Ectocarpus
siliculosus]
Length = 597
Score = 173 bits (438), Expect = 3e-40, Method: Compositional matrix adjust.
Identities = 141/429 (32%), Positives = 207/429 (48%), Gaps = 56/429 (13%)
Query: 92 GWNDLSFHGSNEIP-TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G ND+ + ++ TP ++ LA G+IL++ Y+ +CTPSRASLMTG+ TGMQ
Sbjct: 85 GTNDVGYESTDLWQLTPFMNTLAAEGVILDDYYSNEICTPSRASLMTGRDSFRTGMQFGV 144
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ + G+P+ E L E + GYST GKWHLG + P RGF++ GY G
Sbjct: 145 VEDSAAWGLPIDEVTLAERFQAAGYSTHMTGKWHLGVYSNANYPFSRGFDTFLGYTGGGE 204
Query: 211 SYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDT-------VGEYATDLFTKEAV 256
YY H QY+ + D ++ +T G+Y+T + T A+
Sbjct: 205 GYYTHRECVTPEFEGGQYTCYQDFGYGDKDGYINFTTNTTRKGPEMTGKYSTTVITDRAI 264
Query: 257 QL----IEDQPVDKPLFLYLAHLAAH----AGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +E P D PLFLY+AH A H + G + E + Y + RR A
Sbjct: 265 EVAREHVEKSPSD-PLFLYVAHQAVHEPLDSPPPGSFTDEEVELLEGLDY-SHRERRALA 322
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
++V LD V + +AL G+L+N++I+F SDNGA GSNYP RG K
Sbjct: 323 SIVYYLDKEVRRLHTALDALGVLDNAVIVFASDNGACP---------SKGGSNYPLRGFK 373
Query: 369 NTLWEGGVKVPAILWS------PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN- 421
+T ++GGV+VP ++S P+ + R S MMH DW PTL + G LPL+
Sbjct: 374 HTTFQGGVRVPGFVYSKSADLIPEEARGTRYS-GMMHSMDWTPTLGSVVPG----LPLDE 428
Query: 422 ----IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN-IDEKKRTAAV 476
+ G DQW +++ R S +G + +L T +S + E A
Sbjct: 429 MGKELSGFDQWDAIV-----GRRSAEEGPVRTELVLGRTSYDFDSDAGEMVKEPLPKGAF 483
Query: 477 RLDSWKLVL 485
D WKL+L
Sbjct: 484 IHDGWKLIL 492
>gi|313228866|emb|CBY18017.1| unnamed protein product [Oikopleura dioica]
Length = 482
Score = 172 bits (437), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 110/356 (30%), Positives = 172/356 (48%), Gaps = 33/356 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ND+ ++ I PN+ +LA NG IL+N Y QPVCTPSR++LMT +YPI G+Q I
Sbjct: 94 GFNDMPWNNP-AIIAPNLHSLAKNGTILSNFYVQPVCTPSRSALMTSRYPIRLGLQTDVI 152
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+P +PL E + + GY+T +GKWHLG + + P RGF++ GYL G
Sbjct: 153 TAPQPSCLPLDEVTIGNEFQSAGYTTHIVGKWHLGHYCPQCLPNNRGFDTFRGYLTGAED 212
Query: 212 YYDH-----ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL----------FTKEAV 256
YY ++ +Q + +D + A T Y + F +
Sbjct: 213 YYKKTFCIPLVPNQRPAACGFDFYDNENRMPKANGTYSTYQVLIYLFIHTIILKFADASR 272
Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
++I+ K P FLYL + H + P+ + + + R+ Y MV +D
Sbjct: 273 EVIKSHEGSKTPFFLYLPFQSVHYP-----VMVPKNYSDLYPNEKNHTRKEYMGMVTAMD 327
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
++VG ++ L+ G EN++ + +DNG G N P+RG K TLW+GG
Sbjct: 328 EAVGHIVDELKAAGAYENTLFFYATDNGGLVGA---------GGRNAPFRGQKATLWQGG 378
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
++VPA + P + MMHI D T+ A G + ++DG+ WS++
Sbjct: 379 LRVPAFISGPGVPSASNYK-NMMHIVDVQMTVLDAIGHQRTG-EKHVDGVSHWSAI 432
>gi|445495948|ref|ZP_21462992.1| sulfatase [Janthinobacterium sp. HH01]
gi|444792109|gb|ELX13656.1| sulfatase [Janthinobacterium sp. HH01]
Length = 471
Score = 172 bits (436), Expect = 4e-40, Method: Compositional matrix adjust.
Identities = 114/326 (34%), Positives = 159/326 (48%), Gaps = 36/326 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
G DL +G +I TPN+D LA GI YA VC+ +R +L+TG+Y + G++
Sbjct: 44 GHADLGVYGQTDIRTPNLDKLAGQGIRFTQAYANSAVCSATRFALITGRYQYRLRGGLEE 103
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P ++ G+P T LP L++ GY T IGKWHLG+ + PL G++S FG G
Sbjct: 104 PIAGASDTLGLPRTHPTLPSLLKKQGYGTALIGKWHLGYLP-TFGPLKSGYDSFFGNYGG 162
Query: 209 VISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
I Y+ H + D Y V ++ +G Y TDL AV ++
Sbjct: 163 AIDYFTHKPGVGPQVKEDLYEGEVPVH-------------QIGYY-TDLLGARAVDFVQK 208
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
Q KP L L + A H G EA I+ + + TY MV+ LD S+G V
Sbjct: 209 QQAGKPFLLSLHYTAPHWPWEGPGDEAISRDIDNIFHYDGGDLATYGRMVESLDASIGRV 268
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ KG+ EN+I+IF SDNG + +P G K L EGG++VPAI
Sbjct: 269 LKALEDKGLAENTIVIFTSDNGG-----------ERFSKTWPLTGQKTELLEGGIRVPAI 317
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTL 407
+ P +VS Q+ DWLPTL
Sbjct: 318 VRWPAKIAAGQVSEQVAISMDWLPTL 343
>gi|381205411|ref|ZP_09912482.1| sulfatase [SAR324 cluster bacterium JCVI-SC AAA005]
Length = 448
Score = 172 bits (435), Expect = 6e-40, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 206/427 (48%), Gaps = 60/427 (14%)
Query: 79 LTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMT 137
+ K+ + + G+ DLS G+ + TPN+D LA G Y+ PVC+PSRA+L+T
Sbjct: 1 MKKNVIVIMTDDQGYGDLSCMGATDFKTPNLDRLANEGARFKCWYSNSPVCSPSRAALLT 60
Query: 138 GKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRE-LGYSTKAIGKWHLGFFRREYTPLY 196
G+YP + G++ + G+P LP+ LR+ LGY T GKWHLG P
Sbjct: 61 GRYPANAGVRAILDGHRKATGLPKETLTLPKILRDRLGYHTAMSGKWHLG-LAEGCRPQD 119
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNG----HDMRRNLSTAWDTVGEYATDLFT 252
GFE FG+L G + ++ HI +R E G HD+ N W GEY T++ T
Sbjct: 120 HGFEQWFGFLAGCVDFFSHIFYWGMNR--EGPGMNPTHDLWVNNKEVWKN-GEYLTEVIT 176
Query: 253 KEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMV 311
+ ++Q + D K FLY+ + A H + APQ+ +++F + +R+ AAMV
Sbjct: 177 EYSIQALRRAKKDSKSFFLYIPYNAPHY-----PMHAPQKYMDRFGGLP-WDRQVMAAMV 230
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW----------GSN 361
+DD +G + + L R +E++I F SDNG RE+ RNW GS+
Sbjct: 231 SAVDDGIGEIFNELARLDFIEDTITFFTSDNGPS----RES---RNWLDGTLDPYYGGSS 283
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
++G K +L+EGGV+VP I+ P + ++ + D LPT+ L L+
Sbjct: 284 GGFKGHKFSLFEGGVRVPGIVHYPGVVPASKIVEEPFCSFDMLPTI-------CEWLHLD 336
Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
I ++ +DG+ Q L P+ + +D++ ++R W
Sbjct: 337 ISDME----------------LDGISQAGFLEGKQPALERDLFWEMDDQ---TSIRRGDW 377
Query: 482 KLVLGTQ 488
KLVL Q
Sbjct: 378 KLVLNGQ 384
>gi|326433715|gb|EGD79285.1| hypothetical protein PTSG_12912 [Salpingoeca sp. ATCC 50818]
Length = 562
Score = 172 bits (435), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 152/579 (26%), Positives = 249/579 (43%), Gaps = 110/579 (18%)
Query: 85 LTLLIVYGWNDLSFHGS----NEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+ L+ GW D+ +H S ++I TP ID L GI L Y VC+P+RAS +G+
Sbjct: 57 MVLVDDLGWADVGYHRSGPHKSDIQTPTIDKLVSQGIALERHYVHKVCSPTRASFQSGRL 116
Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
P+H G+ G + A G+P + ++L + GY++ +GKW +G +TP RG+
Sbjct: 117 PVH-GIDGQVVLCAPRAGIPENMTTVAQHLNKAGYASHFVGKWDVGMATPSHTPHGRGYN 175
Query: 201 S---HFGYLNGVISYYDHILS-DQYSRTVELNGHDMRRNLSTAWDTVGE--------YAT 248
+ +FG+ N + + + S + S D ++ WDT Y
Sbjct: 176 TSLNYFGHANWMWNQDEWQGSQNNVSHRPPCKAPDCFKDF---WDTDRPAHNLSGTLYEE 232
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
LF + +IE +PLFL A AH L+AP E QF I P+RR Y
Sbjct: 233 QLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEPPSRRVYH 287
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS------------NYR 356
AMVK LDD++ + + +Q+KGM N++++F SDNG P + + N
Sbjct: 288 AMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCYNGE 347
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAGGD 414
+NYP RG K + +EGG++V A + I + R S +MHI+DW T AG
Sbjct: 348 AGANNYPLRGGKYSFFEGGIRVNAFVSGGLIPADQRGSNRTGIMHIADWYATFCALAG-- 405
Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRT 473
+D D ++ N P ID + W L S R++VL++ D
Sbjct: 406 -------VDPTDHRAA-AANLPP-----IDSHNMWPYLAGQARFSPRDTVLVDRD----- 447
Query: 474 AAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
+ +KL+LG P+ N++ + +S
Sbjct: 448 -VLVFGDYKLLLGASN----------------PMANWSGPAYPNATTATSPIS------- 483
Query: 534 SNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQL 593
+ T++C + P LFN+ DP E++++A +PDI +
Sbjct: 484 -----------RYTLNC-------------SDPGCLFNVVEDPEERHDLAEEKPDILKLM 519
Query: 594 YELLKYHRRTLVPQSHE--QPDLVQADPKRFNDTWSPWI 630
L +++ P + P ++ +++ + PW+
Sbjct: 520 KFTLTELAKSIFPYNPGPVDPQCMKTAETKYDGFYGPWL 558
Score = 42.0 bits (97), Expect = 0.94, Method: Compositional matrix adjust.
Identities = 39/129 (30%), Positives = 52/129 (40%), Gaps = 25/129 (19%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y LF + +IE +PLFL A AH L+AP E QF I P+RR
Sbjct: 230 YEEQLFVQRITDIIEAHDPSQPLFLTYASKVAH-----YPLQAPIEYQQQFANIEPPSRR 284
Query: 75 TYAALTKS--------TTL---------TLLIVYGWND---LSFHGSNEIPTPNIDALAY 114
Y A+ K TTL TL++ N ++F G + P+ + Y
Sbjct: 285 VYHAMVKYLDDNLYYITTLMQKKGMWNNTLMVFSSDNGGPVMAFDGDCDASHPSRGYMCY 344
Query: 115 NGIILNNMY 123
NG N Y
Sbjct: 345 NGEAGANNY 353
>gi|298715187|emb|CBJ27859.1| Formylglycine-dependent sulfatase, C-terminal fragment
Formylglycine-dependent sulfatase, N-terminal
[Ectocarpus siliculosus]
Length = 610
Score = 172 bits (435), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 144/445 (32%), Positives = 214/445 (48%), Gaps = 73/445 (16%)
Query: 85 LTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
L+ G ND+ + ++ E+ TP +D+LA +G++L+N Y +CTPSRASLMTG+
Sbjct: 74 FILIDDVGTNDMGYRSTDLWEL-TPFLDSLASSGVVLDNYYTNQLCTPSRASLMTGRDSF 132
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
TGMQ + + P G+P E L + + GYST GKWHLG F P RGF++
Sbjct: 133 RTGMQHGIVDYSAPWGLPFEEVTLADRFKAAGYSTHMTGKWHLGVFSDASYPFSRGFDTF 192
Query: 203 FGYLNGVISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGE-------YAT 248
GY G YY+H +YS + D + +T E Y+T
Sbjct: 193 LGYTGGGEGYYNHSTCFTPTFEGGEYSCLKDFGYGDEDGYIDYTTNTTKEGPAMVDNYST 252
Query: 249 DLFTKEAVQLIEDQ----PVDKPLFLYLAHLAAHAGNAGKHLEAP-------QETINQFQ 297
+ T A+ + + D PLFLY+A+ AAH L+AP +E +
Sbjct: 253 TIMTDRAIDVAREHTGTASSDDPLFLYVAYQAAHT-----PLDAPPPGTFTEEEEETLAR 307
Query: 298 YITD-PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
++D P+R +A ++ LD V + AL+ G+L+N++I+ SDNGA +
Sbjct: 308 VVSDEPDRHVFARIIYYLDKEVRRLHDALEALGVLDNAVIVLASDNGACS---------S 358
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL-----QMMHISDWLPTLYTAA 411
+ GSN+P RG KNT ++GGV+VPA ++S + P + MMH +DW TL
Sbjct: 359 SGGSNHPLRGFKNTTFQGGVRVPAFVYSKSTELIPEEARGTTYSGMMHSTDWTQTLGRVV 418
Query: 412 GGDTSRLPL-----NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN 466
G LPL + G DQW +++ R S +G + +L RNS +
Sbjct: 419 PG----LPLAGMGKELSGFDQWDAIV-----GRRSAEEGPVRTDMVL-----GRNSFTFD 464
Query: 467 IDEK---KRTA---AVRLDSWKLVL 485
D KR + A D WKL+L
Sbjct: 465 TDTAEMVKRASPWGAYIHDGWKLIL 489
>gi|294011191|ref|YP_003544651.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
gi|292674521|dbj|BAI96039.1| putative arylsulfatase A [Sphingobium japonicum UT26S]
Length = 468
Score = 171 bits (434), Expect = 7e-40, Method: Compositional matrix adjust.
Identities = 116/341 (34%), Positives = 172/341 (50%), Gaps = 28/341 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYP--IHTGMQG 148
G DL +GS +I TP IDA+A G+ N YA VC+P+R +L+TG+Y G++
Sbjct: 49 GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + + +P R LP LR+LGY+T +GKWH+G +PL G++ FG +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASG 167
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTVGEYATDLFTKEAVQLIEDQPV-D 265
Y+ H +NGH+M + T G Y TDL +A+ ++ D
Sbjct: 168 GTDYFAH--------ATTINGHEMGKLFENRTEIQRPG-YLTDLLGAKAIDRMQQAARQD 218
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+P F+ L A H G A + + N R YAAM++ +D +VG V++ L
Sbjct: 219 RPFFISLHFTAPHWPWQGPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLAEL 278
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
G+ E++I+IF SDNG + ++P G+K L EGG++VP I+ P
Sbjct: 279 AALGLEEDTIVIFTSDNGG-----------ERFSDSWPLTGMKGELLEGGIRVPLIVSWP 327
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
R S Q+ D LPTL +AAGGD R+ +DG+D
Sbjct: 328 AHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVE-GLDGID 367
>gi|196231680|ref|ZP_03130537.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224152|gb|EDY18665.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 474
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 123/370 (33%), Positives = 188/370 (50%), Gaps = 54/370 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ + +G +IPTPNID L +G+ ++ Y + P C SRA+LMTG+Y G + P
Sbjct: 39 GYGEPGCYGGKDIPTPNIDKLVASGVRFSSGYVSAPFCAASRAALMTGRYQTRFGFEYNP 98
Query: 151 IWG--AEP-RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
I A+P G+P+ E+ + + LR++GY+T +GKWHLG + P RGF+ FG+L+
Sbjct: 99 IGAKNADPGTGLPVNEKTVADRLRDVGYATGLVGKWHLG-GTAPFHPQRRGFDEFFGFLH 157
Query: 208 -----------GVISYYDH--ILSDQYSRTVELNGH-----DMRRNLSTAWDT------- 242
G ++ + R +GH D+ N A+D
Sbjct: 158 EGHFYLPPPWSGATTWLRRKALPDGSQGRWTSPDGHTVWSTDLHEN-EPAYDADNPLLRN 216
Query: 243 ---VGEYA--TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
V E A TD FT+EA I D+ +P FLYLA+ A H+ L+ + +F
Sbjct: 217 SQPVEEKANLTDAFTREACSFI-DRHQAQPWFLYLAYNAVHS-----PLQGEDTYMEKFS 270
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
+I D RR +AA++ LD+ +G V + L+ G+ EN++++F+SDNG PT E
Sbjct: 271 HIGDIQRRIFAAVLAHLDEDIGKVRAQLRADGLEENTLVVFLSDNGGPTKELTS------ 324
Query: 358 WGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
SN P RG K LW+GG+++P A+ W QI + + + D T AG +T
Sbjct: 325 --SNLPLRGGKGDLWDGGIRIPFAVSWKGQIPAGHTIDAPAISM-DLTATALKLAGAETE 381
Query: 417 RLPLNIDGLD 426
+ L DG+D
Sbjct: 382 QAKL--DGVD 389
>gi|430741674|ref|YP_007200803.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430013394|gb|AGA25108.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 454
Score = 171 bits (434), Expect = 8e-40, Method: Compositional matrix adjust.
Identities = 117/353 (33%), Positives = 183/353 (51%), Gaps = 48/353 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW D+ F+G E TPN+D LA G Y A VC PSRA+LMTG+Y IH G+
Sbjct: 51 GWGDVGFNGRTEWATPNLDRLAARGTTFKRFYTAAVVCAPSRAALMTGRYTIHDGVS--- 107
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR---REYT-PLYRGFESHFGYL 206
+P E L E + GY T GKWH G R + Y P+ +GF+ FG+
Sbjct: 108 ---RNNDDLPAREVTLAEAFKTHGYDTALFGKWHHGQPRDGSKTYVHPMDQGFDEFFGFT 164
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE---DQP 263
+ ++ ++Y + +G +++ V Y+ D+F A+ ++ ++P
Sbjct: 165 DAKHAW------EKYPEQL-WHGRELK--------PVSGYSDDMFADHAIDFLKRHKEKP 209
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETI----NQFQYI--TDPNRRTYAAMVKKLDDS 317
P FLY+ + H ++EAP E + +F + + P R TYAAMV +LD +
Sbjct: 210 T--PFFLYVPFINTHF-----NIEAPAEEVALHQGKFTEVDPSKPIRATYAAMVTQLDKN 262
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE--TSNYRNWGSNYPYRGVKNTLWEGG 375
VG +++AL+ G+ +++++F+SD+GA T E S+Y + SN P+RG K TLWEGG
Sbjct: 263 VGRIMTALEGLGLAADTLVVFVSDHGA-TFESGNLGASDYHD--SNRPFRGQKRTLWEGG 319
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
++VP + P +S +++H++D PTL A G ++DG D W
Sbjct: 320 IRVPGVACWPGHVPAGVISNEVVHMTDLFPTLLAATGAH-PEAAWHVDGADLW 371
>gi|323451422|gb|EGB07299.1| putative arylsulfatase [Aureococcus anophagefferens]
Length = 512
Score = 171 bits (434), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 154/548 (28%), Positives = 231/548 (42%), Gaps = 78/548 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWN+ ++ ++ TP+ DAL G+ L N Y C+P+R+S ++G+ PIH +
Sbjct: 32 GWNNGFYN--EDVLTPHTDALVAAGVKLTNHYVYKFCSPTRSSFLSGRLPIHVNQENSAT 89
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
A GVPL + E +R GY T +GKWH G +TP RGF+S G+ N
Sbjct: 90 EQAG-AGVPLGMSTIGETMRAAGYRTAHVGKWHCGMATPAHTPYGRGFDSSIGFFNFGED 148
Query: 212 YYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
+Y + Q + + +G D+ R+ + A G Y FT+ AV +E KP
Sbjct: 149 HYTQVRGGQALEEGGASGQCSGVDLWRDGAPARGENGTYGGYTFTRAAVAAVEANDT-KP 207
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LFL+ A H LE P E + +Y + R T M LDDSVG V AL++
Sbjct: 208 LFLFAAFQNLH-----PPLEVPPEYVA--RYANETLRTTINGMAAFLDDSVGNVTKALKK 260
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+ M ++S+I++ DNG + G + P+RG K + ++GGV+V A + +
Sbjct: 261 QNMYDSSLIVYTPDNGGYLTQ---------GGDDAPFRGGKFSDFQGGVRVAAFISGGLV 311
Query: 388 QQNPR--VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDG 445
R S ++H+ DW+ T GG +P + + L P D
Sbjct: 312 PAGLRGTASAALVHVCDWMATFAALGGG---AIPAD-------AVAGLPAP-------DS 354
Query: 446 LDQWSSLLLNTPSRRNSVLINI---DEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
LD W +L PS R V ++I D R A V D +G + G
Sbjct: 355 LDVWPALSRGRPSPRTEVPLSILPADGAAR-AFVDFDGVGYFVGGE---------GLIVG 404
Query: 503 NKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPC 562
+ LL YQ I DK+ + A P P C
Sbjct: 405 DMKLLLG---------YQH-----------IGPFDKVTNATCAGVAGPWAAGRPGVPCDC 444
Query: 563 TNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL-VPQSHEQPDLVQADPKR 621
C L+N+ DP E +++A++RP + L L+ R T+ P + D A
Sbjct: 445 ATSGC-LYNVTADPTESDDLAAARPGAVAALRTRLEALRATVYAPDRGAKDDAACAQIAT 503
Query: 622 FNDTWSPW 629
W PW
Sbjct: 504 NGGFWGPW 511
>gi|320102821|ref|YP_004178412.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319750103|gb|ADV61863.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 480
Score = 171 bits (434), Expect = 9e-40, Method: Compositional matrix adjust.
Identities = 117/337 (34%), Positives = 175/337 (51%), Gaps = 32/337 (9%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L GW DLS++G + TP++D LA G + Y A VC PSRA+L+TGK PI
Sbjct: 39 VLILADDLGWGDLSYNGRIQWSTPHLDDLAARGARFDRFYTAGVVCAPSRAALLTGKNPI 98
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE----YTPLYRG 198
H G W + +P E + E L+ GY T GKWH G RE PL +G
Sbjct: 99 HCGT-----WRND-HDLPAEEVTIAEALKAKGYHTALYGKWHQGRPTREGGTRTHPLDQG 152
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
F+ FG+L+ ++ +++ + NG D R + G YA D FT + V+
Sbjct: 153 FDETFGFLSATHAW------EKFPNHL-FNGRD-RVEVPQ-----GRYADDWFTDKGVEF 199
Query: 259 IEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-YITDPNRRTYAAMVKKLDD 316
I D P FLYL + H H+EAP++ I + + +DP + YAA+V ++D
Sbjct: 200 IARHKDDPTPFFLYLPLTSTHF-----HIEAPEDEIAKHRSTFSDPIQTAYAAIVTRMDA 254
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
VG ++ A++ G+ ++++I+F SD+GA T E + SN+P+RG K T++EGG+
Sbjct: 255 QVGRIVQAVRDAGLEDSTLIVFTSDHGA-TFEGGSKEASSSLDSNHPFRGQKRTVYEGGI 313
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
+VP I+ P + R+ + D LPT AGG
Sbjct: 314 RVPGIMTYPGVIPAGRIVSTPVQTIDLLPTFLELAGG 350
>gi|343084004|ref|YP_004773299.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342352538|gb|AEL25068.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 445
Score = 171 bits (433), Expect = 1e-39, Method: Compositional matrix adjust.
Identities = 119/353 (33%), Positives = 181/353 (51%), Gaps = 42/353 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPN+D LA G++ + ++ VC+P+RA+LMTGKY TG++G
Sbjct: 39 GYGDLSCYGNEYINTPNLDLLASEGVLFTDYHSNGSVCSPTRAALMTGKYQQRTGVEGVV 98
Query: 151 IWGAE-PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
+ G+ L E L E L++LGY+T GKWHLG+ + + P +GF+ G+++G
Sbjct: 99 TAKSHRDVGLALAEVTLAEELKQLGYNTGMFGKWHLGY-DKAFNPTLQGFDEFVGFVSGN 157
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG-----EYATDLFTKEAVQLIEDQ-P 263
+ Y+ HI + Y WD V Y TDL ++ V+ I++ P
Sbjct: 158 VDYHGHIDQEGYL---------------DWWDGVKIKNEKGYTTDLISEYGVKFIQEHNP 202
Query: 264 VDK--PLFLYLAHLAAHAGNAGKHLEAPQE--TINQFQYITDPNRRTYAAMVKKLDDSVG 319
K P FLYL H A H+ + + +E T + I D Y MV+ +D+ VG
Sbjct: 203 EVKRAPFFLYLPHEAPHSPYQRRIDKVLREIGTAGTQEVIQDSISSIYKEMVEVMDEGVG 262
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
++ +L+ G EN+I+IF+SDNGA ++G N RG K +EGG +VP
Sbjct: 263 RIMQSLKETGQYENTIVIFISDNGA-----------NHYGDNGGLRGFKAGPYEGGSRVP 311
Query: 380 AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
AI P+ + V+ Q + D LPTL G S ++DG+ +LL
Sbjct: 312 AIFSFPKEVKGGSVNNQTVLSMDLLPTLLDFIGRKPST---SVDGISIKENLL 361
>gi|347528892|ref|YP_004835639.1| putative arylsulfatase [Sphingobium sp. SYK-6]
gi|345137573|dbj|BAK67182.1| putative arylsulfatase [Sphingobium sp. SYK-6]
Length = 458
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 175/347 (50%), Gaps = 27/347 (7%)
Query: 74 RTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
R AA + + ++ G+ DLSF G E TP+ID++A G ++ YA VC+P
Sbjct: 19 RVEAAPARPPHIVFIMADDLGYADLSFTGRTEYATPHIDSIAEQGCFFSDAYANSSVCSP 78
Query: 131 SRASLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
+R +L TG+Y + G+ P G EP G+P + LRE GY T +GKWHLG
Sbjct: 79 TRMALATGRYQYRLRGGLDEPIGGGPEPVGLPPEHPTIASRLREAGYRTVLVGKWHLGSP 138
Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
R ++PL G++ FG G Y+ H S++++ G + S+ D G Y T
Sbjct: 139 PR-FSPLLSGYDRFFGIHGGGADYFTH------SQSMQGGGSSDLYDGSSVTDRHG-YLT 190
Query: 249 DLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
D+ T+ AV I+ KPLFL L + A H G EA I + T+
Sbjct: 191 DILTERAVAEIDLAARTRKPLFLSLHYTAPHWPWEGPEDEALAAQIKSLLNYDGGSDSTF 250
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
A MV+ LD VG V++AL+ +GM +++I++F SDNG + +P+ G+
Sbjct: 251 AKMVQALDKGVGKVLAALEAQGMTQDTIVVFTSDNGG-----------ERFSKVWPFDGM 299
Query: 368 KNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
K L EGG++VP ++ W +I R S Q+ D++PT AGG
Sbjct: 300 KGDLLEGGIRVPMMMRWPGKIPAGSR-SAQIAMTMDYMPTFLATAGG 345
>gi|126304968|ref|XP_001376926.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Monodelphis
domestica]
Length = 520
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 162/569 (28%), Positives = 243/569 (42%), Gaps = 111/569 (19%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
A ++ + L++ GW DL G TP++D +A G++ N Y A P+C+PSRA
Sbjct: 22 GATSQPPNIVFLLMDDMGWGDLGVFGEPSRETPHLDQMAAEGMLFPNFYTANPLCSPSRA 81
Query: 134 SLMTGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+L+TG+ PI G G P+ G+P +E LPE L++ GY K +GKWHLG
Sbjct: 82 ALLTGRLPIRNGFYTTNGHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLG 141
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
R ++ PL GF+ FG N YD+ + + R E+ G + T
Sbjct: 142 H-RPQFHPLKHGFDEWFGSPNCHFGPYDNKAMPNIPVYRNWEMVGR-FYEDFPINHKTGE 199
Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
T ++ KEAV I+ Q +P FLY A A HA ++
Sbjct: 200 ANLTQIYLKEAVDFIKKQQAHQQPFFLYWAIDATHA-----------PVYASKSFLGTSQ 248
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R Y V+++DDS+G ++ LQ G+ EN+ + F SDNGA + S + GSN P
Sbjct: 249 RGRYGDAVREIDDSIGKILKLLQDLGIKENTFVFFTSDNGAALI-----SAPKEGGSNGP 303
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
+ K T +EGG++ PAI W P +VS Q+ ++ D T + AG
Sbjct: 304 FLCGKQTTFEGGLREPAIAWWPGHIPAGQVSHQLGNVMDLFTTGLSLAG----------- 352
Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
L PS R IDG+D +++ R +E AVR+ +K
Sbjct: 353 ---------LKPPSDR--EIDGIDLLPAIVQGKVIDRPIFYYRGNE---MMAVRVGQYKA 398
Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDKMRST 542
T N S ++ SQ I F P N+ + +
Sbjct: 399 HYWTWSN------------------------------SWEEFSQGIDFCPGQNVSGVTTH 428
Query: 543 RQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSR---PDISSQLYELLKY 599
Q+ TN P LF+LG DP E+ I S D +++ +++
Sbjct: 429 SQEEH---------------TNLPL-LFHLGRDPGERFPIRVSSLEYKDALLRIFPIVEE 472
Query: 600 HRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
H+RTL+P +P L +P N W+P
Sbjct: 473 HKRTLIPG---EPQLNVCNPAVMN--WAP 496
>gi|390167238|ref|ZP_10219235.1| putative arylsulfatase A [Sphingobium indicum B90A]
gi|389590183|gb|EIM68184.1| putative arylsulfatase A [Sphingobium indicum B90A]
Length = 468
Score = 170 bits (431), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 118/343 (34%), Positives = 173/343 (50%), Gaps = 32/343 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYP--IHTGMQG 148
G DL +GS +I TP IDA+A G+ N YA VC+P+R +L+TG+Y G++
Sbjct: 49 GHADLGCYGSRDIRTPAIDAIAARGVKFGNAYANSCVCSPTRIALLTGRYQGRFRIGLEE 108
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + + +P R LP LR+LGY+T +GKWH+G +PL G++ FG +G
Sbjct: 109 PIAFNGDELSLPRGTRTLPGLLRDLGYATSLVGKWHVGELPAS-SPLDHGYDYFFGIASG 167
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTVGEYATDLFTKEAV---QLIEDQP 263
Y+ H +NGH+M + T G Y TDL +A+ +L Q
Sbjct: 168 GTDYFAH--------ATTINGHEMGKLFENRTEIQRPG-YLTDLLGAKAIDRMRLAARQ- 217
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
D+P F+ L A H G A + + N R YAAM++ +D +VG V++
Sbjct: 218 -DRPFFISLHFTAPHWPWQGPGDAAQGSAAADPRQMDGGNVRAYAAMMEGMDANVGRVLA 276
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L G+ E++I+IF SDNG + ++P G+K L EGG++VP I+
Sbjct: 277 ELAALGLQEDTIVIFTSDNGG-----------ERFSDSWPLTGMKGELLEGGIRVPLIVS 325
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P R S Q+ D LPTL +AAGGD R+ +DG+D
Sbjct: 326 WPAHLPKGRDSAQVTMSMDALPTLLSAAGGDPGRVE-GLDGID 367
>gi|336424342|ref|ZP_08604383.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
gi|336003446|gb|EGN33530.1| hypothetical protein HMPREF0994_00389 [Lachnospiraceae bacterium
3_1_57FAA_CT1]
Length = 460
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 148/512 (28%), Positives = 224/512 (43%), Gaps = 95/512 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ D TPN+D L G +++ YA PVC P+RA+L+TG+YP TG
Sbjct: 19 GYGDFGIFSDGSARTPNLDRLVRQGCAMSHCYAASPVCAPARAALLTGRYPHRTGAVDTY 78
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
I G + L E L + R GY T IGKWHLG +EY P RGF++ G+ G
Sbjct: 79 EAIGGDR---MALREVTLADVYRANGYRTGLIGKWHLGLIGKEYHPCRRGFDTFIGFRGG 135
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY + L R + A D G Y TD+ T+E+++ I + ++P
Sbjct: 136 WSDYYQYKLD--------------RNGILEASD--GTYMTDVITEESIRFIREN-REQPF 178
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL+ A+ A H + P+E F+ +P T M+ +D+ +G ++ L+
Sbjct: 179 FLHAAYNAPHFP-----FQCPEEYAAPFRKRFNPTLATLYGMIACMDEGIGKLLDVLEET 233
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
G+ +N+I+IF SDNG P + Y +T+ Y N G K +EGG++VPA++ W I
Sbjct: 234 GLSDNTIVIFASDNG-PQL-YGDTNRY-----NCYLNGQKGEAFEGGIRVPAVIRWPGHI 286
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDGLDQWSSLLLNTPSRRNSNIDG 445
+ R H DW PTL +A LP +++DG ++ +LL + S
Sbjct: 287 APDSRCH-AFFHGVDWFPTLLSAC---RLELPDGVSLDGKNRLDEILLGSSSPSEPY--- 339
Query: 446 LDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKV 505
QW+ TP K AAVR WKLV E +V
Sbjct: 340 YWQWNRF---TPV-----------SKCNAAVRCGKWKLVWPPVEEAM-----------QV 374
Query: 506 PLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNG 565
P +VE ++ + P D R PAP P
Sbjct: 375 PPY----VVEIDERIKQEETVPDTLYPFD--DSYRRI-----------PAPHPPR----- 412
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELL 597
L+NL +DP E+++++ P+I+ L ++
Sbjct: 413 ---LYNLEDDPLERHDVSIDYPEITESLSQVF 441
>gi|340616348|ref|YP_004734801.1| sulfatase [Zobellia galactanivorans]
gi|339731145|emb|CAZ94409.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 489
Score = 170 bits (430), Expect = 2e-39, Method: Compositional matrix adjust.
Identities = 138/467 (29%), Positives = 213/467 (45%), Gaps = 85/467 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI------HT 144
G+ DL F GS+ TPN+D LA + + Y+ P C PSR S+MTGKYP H
Sbjct: 50 GFADLGFTGSDTHLTPNLDKLAKESVYFDRAYSSHPTCAPSRMSIMTGKYPARLGAVSHG 109
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
+ G G G+P+TE + E L++ GY+T IGKWH+G + E P RGF+
Sbjct: 110 KLGGVAHPGPNDNGLPMTETTIGEALKKEGYTTAHIGKWHIG--KGENNPGTRGFDVD-- 165
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD----TVGEYATDLFTKEAVQLIE 260
I+ + Y E N ++ +R S D G++ TD +EAV+ I
Sbjct: 166 -----IASNEFCCPGSYMYPFESN-NEKQRVASKIPDLEDRKPGDFLTDALAEEAVKFIH 219
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMVKKLDD 316
+KP FL ++ A H + A E + +++ + P+ R TYA +V+ LDD
Sbjct: 220 STD-EKPFFLNMSFYAVHTP-----ITAIPEKVEKYKRLIGPDARQKNPTYAGLVEHLDD 273
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG ++ AL+ KG+++N+II+F SDNG + N+P R K + +EGG
Sbjct: 274 AVGAILKALEEKGIIDNTIIVFTSDNGGEIL--------HGITDNFPLRDGKGSSYEGGT 325
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN-- 434
+VP ++ P + Q VS + + D+ PT + AG + +IDG D +S LL N
Sbjct: 326 RVPLLVKWPGVTQVNTVSHERIIGFDYYPTFLSMAGANPDL--HDIDGRD-FSPLLKNPK 382
Query: 435 --------------------TPSRRNSNI------------------DGLDQWSSL--LL 454
P N DG Q L L
Sbjct: 383 EKLEERDLHWLKYLSLIHYRIPIADNKRCFETIVSGDWKLHEYFQMPDGYKQHFELYNLK 442
Query: 455 NTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
+ PS +N++ EK + ++++SWK +G M+ +YG +
Sbjct: 443 DDPSEKNNLANKFPEKVQELKLKMESWKKDIGAPVY-DMEKFYGHVK 488
>gi|386821789|ref|ZP_10109005.1| arylsulfatase A family protein [Joostella marina DSM 19592]
gi|386426895|gb|EIJ40725.1| arylsulfatase A family protein [Joostella marina DSM 19592]
Length = 474
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 112/345 (32%), Positives = 175/345 (50%), Gaps = 33/345 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS+E TP++D LA I + Y + VC PSRA ++TGKY G +
Sbjct: 47 GYADFGFQGSSEFKTPHLDQLASQSIRFSQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 106
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+PL ++ LPEYL+E GY T GKWH+G + + P RGF++
Sbjct: 107 VPGYMSASATTGDEMGLPLDQKLLPEYLKEQGYKTALFGKWHMGNADK-FHPTKRGFDTF 165
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+G+ G SYY+ +++ +R + + R ++ Y TD + IE
Sbjct: 166 YGFRGGARSYYEFNENNKNNRQED----RLERGFGN-FEESKLYLTDALAEATTDFIEKN 220
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+P F+YL+ A H +EA + + QF + R+T AAM LD + G V+
Sbjct: 221 Q-KQPFFVYLSFNAVHT-----PMEARPDDLKQFPNLKG-KRKTLAAMTLGLDRACGNVL 273
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
+ L G+ EN++I+F +DNG P+ N SN P G K EGG++VP ++
Sbjct: 274 AKLDELGVRENTLIVFTNDNGGPSGT--------NASSNKPLGGTKANHLEGGIRVPFLM 325
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
W Q+++N + + D LPT Y AGG+ + +IDG++
Sbjct: 326 SWPKQLKENTVYDYPISTL-DLLPTFYQIAGGNEKNIN-DIDGVN 368
>gi|417301111|ref|ZP_12088281.1| arylsulfatase B [Rhodopirellula baltica WH47]
gi|327542540|gb|EGF29014.1| arylsulfatase B [Rhodopirellula baltica WH47]
Length = 498
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 128/412 (31%), Positives = 200/412 (48%), Gaps = 49/412 (11%)
Query: 65 FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
F + T+ +R AA + + +++ G+ D+ GS + TPN+D LA +G++ +
Sbjct: 19 FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75
Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
Y A VC+PSRA L+TG+ P G +G + P G+P +E+ L ++L G
Sbjct: 76 YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPKSEKTLADHLGAAG 135
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
Y+T IGKWHLG + P RGF+ G L G Y+ ++ H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMN-----------HVIER 183
Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
N + EY TD FT E ++ I+ D+P F++ ++ A H + A +
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHATEA 238
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
+ +F I + RRTYAAM+ LD VG + L+ G EN++++F SDNG T
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 293
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
N +W N P RGVK ++ EGG++VP I W + R + + D LPT +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVISSL-DLLPTFCSA 348
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
AG + L + D + +N R + DG+D L+ + P+RR
Sbjct: 349 AGAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLVDGSEPPNRR 397
>gi|149196020|ref|ZP_01873076.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149140867|gb|EDM29264.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 462
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 105/301 (34%), Positives = 162/301 (53%), Gaps = 22/301 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ F+G +IPTP ID++A NG+ ++ Y VC PSRA +TG+Y G + P
Sbjct: 34 GYADVGFNGCKDIPTPGIDSIANNGVKFSSGYTSYSVCGPSRAGFITGRYQQRFGFERNP 93
Query: 151 IWG-AEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
W +P +P +E + E L+++GY IGKWHLG P RGF FG+L G
Sbjct: 94 QWNLTDPNSALPKSEMTIAESLQQVGYHCGIIGKWHLGA-EPSLRPNQRGFNEFFGHLGG 152
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT---VGEYATDLFTKEAVQLIEDQPVD 265
+Y+ L + V+ N D + T DT +Y TD F+ EA++ +E +
Sbjct: 153 GHAYFPEKLRIIKTEDVK-NEMDSYASYITRNDTPVKTTKYLTDEFSDEAIRFVEKN-YE 210
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+P FL+L++ A H L+A Q+ +++F +I D NRRTY AMV +DD + ++ L
Sbjct: 211 QPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNRRTYCAMVSAMDDGIVRLMKKL 265
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
+ +++I+ F+SDNG PT +N N RG K ++EGG +VP + P
Sbjct: 266 DALKIKDDTIVFFLSDNGGPTT--------KNNSDNGDLRGKKGDVFEGGFRVPFAMQYP 317
Query: 386 Q 386
+
Sbjct: 318 R 318
Score = 50.1 bits (118), Expect = 0.004, Method: Compositional matrix adjust.
Identities = 25/69 (36%), Positives = 43/69 (62%), Gaps = 6/69 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y TD F+ EA++ +E ++P FL+L++ A H L+A Q+ +++F +I D NR
Sbjct: 191 KYLTDEFSDEAIRFVEKN-YEQPFFLFLSYNAPHT-----PLQATQKYLDRFPHIEDQNR 244
Query: 74 RTYAALTKS 82
RTY A+ +
Sbjct: 245 RTYCAMVSA 253
>gi|410446790|ref|ZP_11300893.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409980462|gb|EKO37213.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 540
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 184/386 (47%), Gaps = 55/386 (14%)
Query: 75 TYAALT----KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVC 128
T+ AL K + + +V GW D+S G+ I TP ID+L G+ L+ Y P+C
Sbjct: 12 TFCALANAEEKRPNIIIFLVDDLGWADISLRGA-PIDTPAIDSLFSEGLTLDRFYTTPIC 70
Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
+P+RA+LMTG+ P+ G+ + GV E F+PE + GY T +GKWHLG
Sbjct: 71 SPTRAALMTGRDPLRLGISYSVVMPWMNNGVHPDEHFMPESFKAAGYQTAMVGKWHLGHS 130
Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
+ + P RGF+ +G+L+ + Y+ + G D +RN T D Y T
Sbjct: 131 QEIFHPNARGFDDFYGHLHTEVGYFLPFANQ--------GGVDFQRNGVTIADE--GYET 180
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT-- 306
L EA + I+ + DKP FLY+ +A H+ LEAP + + +++ + D T
Sbjct: 181 FLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTRELTRS 235
Query: 307 -------------------YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
YAA+V LD ++G V+ L ++G+ E +II+F SDNG
Sbjct: 236 AAIDRTRTLNGFSPSARPMYAAVVDGLDQAIGQVLDTLTKEGIEEETIILFSSDNGGAAY 295
Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPT 406
N+P RG K +EGG++V A + W + +M + D PT
Sbjct: 296 AGGGA-------DNFPLRGGKGDTYEGGIRVVAAMRWKGTLDAGKSFE-SIMTVMDVFPT 347
Query: 407 LYTAAGGDTSRL-PLNIDGLDQWSSL 431
L +AAG S L I G D WS++
Sbjct: 348 LASAAG--VSMLNTKEIWGRDLWSAI 371
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 24/72 (33%), Positives = 39/72 (54%), Gaps = 6/72 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y T L EA + I+ + DKP FLY+ +A H+ LEAP + + +++ + D
Sbjct: 178 YETFLLADEASRWIKARDKDKPFFLYMPFIAPHS-----PLEAPDDLVKKYENLEDTREL 232
Query: 75 TY-AALTKSTTL 85
T AA+ ++ TL
Sbjct: 233 TRSAAIDRTRTL 244
>gi|449677596|ref|XP_004208885.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 193
Score = 169 bits (429), Expect = 3e-39, Method: Compositional matrix adjust.
Identities = 77/130 (59%), Positives = 98/130 (75%), Gaps = 3/130 (2%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM---QG 148
G+ND+SFHGS +IPTPNID +A G+ILNN Y P+CTPSR+++MTG+YPIHTG+
Sbjct: 46 GFNDVSFHGSKQIPTPNIDKIAKEGVILNNYYVLPICTPSRSAIMTGRYPIHTGIFYHTY 105
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
I+ A GV L E+FLP+YL+ +GY T AIGKWHLGFF +EYTP YRGF+S +GY G
Sbjct: 106 DTIFAANAWGVGLDEKFLPQYLKNVGYQTHAIGKWHLGFFSKEYTPTYRGFDSFYGYYGG 165
Query: 209 VISYYDHILS 218
Y+DH L+
Sbjct: 166 QADYWDHSLA 175
>gi|426242290|ref|XP_004015007.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Ovis aries]
Length = 522
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 178/362 (49%), Gaps = 36/362 (9%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGVYGEPSRETPNLDQMATEGMLFPNFYTANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+G G P+ G+P +E LP L+ GY++K +GKWHLG R ++ PL
Sbjct: 94 SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 152
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 208
Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA++ I+ Q +P FLY A A HA ++ R Y
Sbjct: 209 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDA 257
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
+++LDDSVG ++ LQ G+ EN+ + F SDNGA + S R GSN P+ K T
Sbjct: 258 IRELDDSVGRILRLLQDLGIAENTFVFFTSDNGAALI-----SAPRQGGSNGPFLCGKQT 312
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P +VS Q+ I D T + AG + R IDGLD +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 371
Query: 431 LL 432
+L
Sbjct: 372 ML 373
>gi|449138580|ref|ZP_21773837.1| arylsulfatase B [Rhodopirellula europaea 6C]
gi|448882842|gb|EMB13399.1| arylsulfatase B [Rhodopirellula europaea 6C]
Length = 498
Score = 169 bits (427), Expect = 5e-39, Method: Compositional matrix adjust.
Identities = 123/383 (32%), Positives = 186/383 (48%), Gaps = 44/383 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ D+ GS + TPN+D LA +G++ + Y A VC+PSRA L+TG+ P G +G
Sbjct: 45 GYGDMGCMGSQTLQTPNLDRLAESGVLCSQAYVASAVCSPSRAGLLTGRDPRRFGYEGNL 104
Query: 149 ---PPIWGAEPR--GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
+ P G+P++E+ L ++L GY+T IGKWHLG + P RGF+
Sbjct: 105 NASDERYATRPELLGLPVSEKTLGDHLGAAGYATALIGKWHLGMGEMHH-PNRRGFDHFC 163
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G L G Y+ ++ H + RN D EY TD FT E ++ I+
Sbjct: 164 GMLTGSHHYFPTTMN-----------HVIERNGQRVEDFSNEYLTDFFTDEGLRFIDQHE 212
Query: 264 V---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
D+P F++ ++ A H + A + + +F I + RRTYAAM+ LD VG
Sbjct: 213 AAKPDQPWFVFYSYNAPHT-----PMHATEADLARFANIQNKKRRTYAAMMFALDRGVGR 267
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
+ L+ G +N++++F SDNG T N +W N P RGVK ++ EGG++VP
Sbjct: 268 IRKHLEETGQWKNTLLVFFSDNGGAT-------NNGSW--NGPLRGVKGSMREGGIRVPM 318
Query: 381 I-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
I W + R + + D LPT +AAG L + D + +N R
Sbjct: 319 IWTWPAKFPAGKRYDGVISSL-DLLPTFCSAAGAQPLALAEPMSHEDASNRKRMN---RL 374
Query: 440 NSNIDGLDQWSSLLLNT--PSRR 460
DG++ L+ + PSRR
Sbjct: 375 AGTHDGIEMTPHLVDGSEPPSRR 397
Score = 42.0 bits (97), Expect = 0.98, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
+ RN D EY TD FT E ++ I+ D+P F++ ++ A H + A
Sbjct: 181 IERNGQRVEDFSNEYLTDFFTDEGLRFIDQHEAAKPDQPWFVFYSYNAPHT-----PMHA 235
Query: 58 PQETINQFQYITDPNRRTYAAL 79
+ + +F I + RRTYAA+
Sbjct: 236 TEADLARFANIQNKKRRTYAAM 257
>gi|313236789|emb|CBY12041.1| unnamed protein product [Oikopleura dioica]
gi|313242643|emb|CBY39450.1| unnamed protein product [Oikopleura dioica]
Length = 622
Score = 168 bits (426), Expect = 7e-39, Method: Compositional matrix adjust.
Identities = 147/544 (27%), Positives = 236/544 (43%), Gaps = 98/544 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW D+ ++ TP +D L NG MY+ C+PSRA +TG+Y +GM PI
Sbjct: 40 GWADVGWNNKGLESTPFMDKLVKNGTQFTQMYSSHRCSPSRAMALTGRYAFRSGMGSFPI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
P G+ ++ LPEYL+E+GY T A+GKWHLG Y P RGF++ +G+ +G +
Sbjct: 100 AREVPFGMNTQDKTLPEYLKEVGYDTHAVGKWHLGVCNSSYLPTSRGFDTFYGHYSGAVD 159
Query: 212 YYDHILS-------DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
Y H + D + T+E + D+ + G++ TDLF + ++++
Sbjct: 160 YRGHFIKRSKNFYHDFFDNTIEQHKLDLESD--------GQWTTDLFRDRTIDILKEAKR 211
Query: 265 DK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVG 319
K P ++YLA A H + AP + I + PN R + A V ++D +V
Sbjct: 212 SKTPAYVYLAFNAPH-----EPTRAPADLIARI-LEKHPNLPYTRAEHLAAVNQIDTAVR 265
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG------------SNYPYRGV 367
+ +K + ++I+FMSDNG ++ + N G NYP++G
Sbjct: 266 QIYQQ-AKKDVKRETLIVFMSDNGGAVKKFGGGRSPDNTGHEYDDEESLPRACNYPFKGF 324
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLP-LNIDGL 425
KN+L+EGG+ P+ L S ++ R HI DW PT AG +P + DG+
Sbjct: 325 KNSLFEGGILSPSFLIS-TARKFSRTRFNFPFHIIDWAPTFLDFAG----IVPDADFDGI 379
Query: 426 DQWSSLLLN-TPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT----AAVRLDS 480
+ S L S+R + + G VL +EKK AVR
Sbjct: 380 SRKSELESRWIKSQRTNFVYG-----------------VLDFYNEKKNIWEDQIAVRFGK 422
Query: 481 WKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMR 540
WK +NFN V + + + + + +
Sbjct: 423 WK------------------------FMNFNMAVNTFKCDEHENQPVTVSRNMQDFLNVP 458
Query: 541 STRQQATIHCGANPA-PMTPSPCTNGPCYLFNLGNDPCEQNNI---ASSRPDISSQLY-E 595
++ + H A P T +G L+NL NDP E +N+ S P+ +++ E
Sbjct: 459 TSYWERNFHNSITVAEPYTKKTLYDGFS-LYNLENDPFELDNLIDPISGNPEAYNEIIEE 517
Query: 596 LLKY 599
++K+
Sbjct: 518 IMKF 521
>gi|355689580|gb|AER98880.1| galactosamine -6-sulfate sulfatase [Mustela putorius furo]
Length = 503
Score = 168 bits (426), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 127/370 (34%), Positives = 183/370 (49%), Gaps = 42/370 (11%)
Query: 81 KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
K L LL++ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+T
Sbjct: 10 KPPNLVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLT 69
Query: 138 GKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
G+ PI G G P+ G+P ER LPE L+ GY++K +GKWHLG R
Sbjct: 70 GRLPIRNGFYTTNGHARNAYTPQEIVGGIPAEERLLPELLKGAGYASKIVGKWHLG-HRP 128
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGE 245
++ PL RGF+ FG N YD+ R E+ G + NL T +
Sbjct: 129 QFHPLKRGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL-- 186
Query: 246 YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
T L+T+EA+ ++ Q +P FLY A A HA ++ R
Sbjct: 187 --TQLYTQEALDFVQRQHAARRPFFLYWAIDATHA-----------PVYASRPFLGTSRR 233
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPY 364
Y V+++DDSVG ++S L+ + E++ + F SDNGA + S + GSN P+
Sbjct: 234 GRYGDAVREVDDSVGKILSLLRDLRIAEDTFVFFTSDNGAALI-----SAPKQGGSNGPF 288
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNI 422
K T +EGG++ PAI W P +VS Q+ I D T + AG + R+ I
Sbjct: 289 LCGKQTTFEGGMREPAIAWWPGRVPAGQVSHQLGSIMDLFTTSLSLAGLAPPSDRV---I 345
Query: 423 DGLDQWSSLL 432
DGLD ++L
Sbjct: 346 DGLDLLPAML 355
>gi|340620621|ref|YP_004739074.1| sulfatase [Zobellia galactanivorans]
gi|339735418|emb|CAZ98795.1| Sulfatase, family S1-19 [Zobellia galactanivorans]
Length = 462
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 126/403 (31%), Positives = 189/403 (46%), Gaps = 56/403 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D FHGS E TP +D A N + + Y + VC PSRA L+TGKY G +
Sbjct: 37 GYADFGFHGSKEFKTPELDKFAKNAVRFSQAYVSAAVCGPSRAGLLTGKYQQKFGFEENN 96
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+PL ++ + +YL+E GY T GKWH G R + P RGF+
Sbjct: 97 VPGLMSKNGLTGDDMGLPLDQKTIADYLKEQGYRTALFGKWHQGNADR-FHPTKRGFDEF 155
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+G+ G SY ++ +R N + R + G Y T+ EA+ IE
Sbjct: 156 YGFRGGARSYMPFGADNELTR----NEDRLERGFGGFQEHEG-YLTEELADEAIAFIERN 210
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+ P F+YLA A H +EA +E + + + ++ R+T AAM +D + G V+
Sbjct: 211 QKN-PFFVYLAFNAVHT-----PMEAKEEDLKKVKGLSG-KRKTLAAMTIAMDRACGKVL 263
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
++ G+ +N++I+F +DNG P+ N SN P G K EGG++VP ++
Sbjct: 264 DRIKALGLDKNTLIVFTNDNGGPSDA--------NESSNLPLSGTKANHLEGGIRVPFLM 315
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
W +++ N V + D LPT TAAGGD + IDGL
Sbjct: 316 SWPGKLKPNT-VYENPISTLDLLPTFVTAAGGDVN----AIDGL---------------- 354
Query: 442 NIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
DG+D L R + L +K+ A+R + WKLV
Sbjct: 355 --DGVDLLPYLQGKKQGRPHETLYW--KKEARGAIRHNDWKLV 393
>gi|408793054|ref|ZP_11204664.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
gi|408464464|gb|EKJ88189.1| type I phosphodiesterase/nucleotide pyrophosphatase [Leptospira
meyeri serovar Hardjo str. Went 5]
Length = 583
Score = 168 bits (425), Expect = 8e-39, Method: Compositional matrix adjust.
Identities = 130/384 (33%), Positives = 196/384 (51%), Gaps = 70/384 (18%)
Query: 92 GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G+ND++ +G +PTPNID++A +G+ ++ Y+ VCT SRA+LMTG+YP G+
Sbjct: 108 GFNDVTTYGGGYADLGVPTPNIDSIAKDGVRFDSGYSGSAVCTVSRAALMTGRYPSRFGV 167
Query: 147 QGPPIWGAEPR-------------------------------GVPLTERFLPEYLRELGY 175
+ P GA R G+P +E + E L+E GY
Sbjct: 168 EFTPTPGALARVGADLYADPNRLYPVVIDKEKAEKSKSFNELGMPGSEITIAEVLKERGY 227
Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD---- 231
+ IGKWHLG E P +GF+ +G+ D + Y+ + + D
Sbjct: 228 HSVHIGKWHLGS-TEEMRPNKQGFDESLFMESGLYLPVDD--PNVYNAKQDFDPIDRFLW 284
Query: 232 --MRRNLS---TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHL 286
MR +S W Y TD FT EAV++I+ +P FL+LAH A H L
Sbjct: 285 PNMRFGVSYNGGKWFEPSRYLTDYFTDEAVKVIKTNK-HRPFFLFLAHWAVHT-----PL 338
Query: 287 EAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT 346
+A +E + +I D +R Y +M++ LD SVG ++++L+ +G+ EN+I+IF SDNGAP
Sbjct: 339 QASKEDYDALSHIKDHRKRVYLSMIRSLDRSVGKILASLKEEGLDENTIVIFTSDNGAP- 397
Query: 347 VEYRETSNYRNW-GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWL 404
NY N PYRG K TL++GG++VP + W I+ + + +I D L
Sbjct: 398 -------NYIGLPDVNSPYRGWKLTLFQGGIRVPYVAKWPGHIKPGTKYQNAITNI-DIL 449
Query: 405 PTLYTAAGGDTSRLPLN--IDGLD 426
PT+ AAG +LPL+ IDG++
Sbjct: 450 PTVANAAG---IKLPLDREIDGVN 470
Score = 47.4 bits (111), Expect = 0.025, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 39/74 (52%), Gaps = 6/74 (8%)
Query: 9 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
W Y TD FT EAV++I+ +P FL+LAH A H L+A +E + +I
Sbjct: 298 WFEPSRYLTDYFTDEAVKVIKTNK-HRPFFLFLAHWAVHT-----PLQASKEDYDALSHI 351
Query: 69 TDPNRRTYAALTKS 82
D +R Y ++ +S
Sbjct: 352 KDHRKRVYLSMIRS 365
>gi|354465430|ref|XP_003495183.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cricetulus
griseus]
Length = 493
Score = 168 bits (425), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 124/359 (34%), Positives = 178/359 (49%), Gaps = 40/359 (11%)
Query: 90 VYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM-- 146
V GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 10 VMGWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYT 69
Query: 147 -QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
G P+ G+P +E LPE L++ GY+ K +GKWHLG R ++ PL GF+
Sbjct: 70 SNGHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDE 128
Query: 202 HFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAV 256
FG N YD+ + R E+ G + NL T + T L+ +EA+
Sbjct: 129 WFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQLYLQEAL 184
Query: 257 QLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
I Q + P FLY A A HA ++ +R Y V+++D
Sbjct: 185 DFIRTQHARQSPFFLYWAIDATHA-----------PVYASRPFLGTSHRGRYGDAVREID 233
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DSVG ++S LQ G+ +N+ + F SDNGA + S + GSN P+ K T +EGG
Sbjct: 234 DSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLCGKQTTFEGG 288
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
++ PAI W P +VS Q+ I D T + AG + R+ IDGLD ++L
Sbjct: 289 MREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRI---IDGLDLLPTML 344
>gi|334139745|ref|YP_004532943.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937767|emb|CCA91125.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 472
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 107/303 (35%), Positives = 159/303 (52%), Gaps = 26/303 (8%)
Query: 93 WNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GPP 150
W D+S +G ++PTPNID +A G+ ++ Y A VC SRA LMTG+ P G
Sbjct: 39 WADVSTYGRTDVPTPNIDRIAKTGVAFSSGYVAASVCAVSRAGLMTGRMPQRFGFTYNIN 98
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G G+P+ ++ + + L+ LGY T A GKWHLG R++ P RGF+ FG+L G
Sbjct: 99 DKGDVGAGLPVGQKTIADRLQPLGYRTAAFGKWHLGA-DRQFYPTNRGFDEFFGFLAGET 157
Query: 211 SYYD----HILS-----DQYSRTVELNGHDMRRNL-STAWDTVGEYATDLFTKEAVQLI- 259
+Y D I++ D+Y H M + D +Y T+ T AV I
Sbjct: 158 NYVDPKTPGIVTTPTKVDKYEIGPGEGNHAMVEGPDARPADDFSKYLTNQITDRAVDFIN 217
Query: 260 EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
+P F Y+A+ A H L+ PQ ++F + DP RRTY AM+ +DD VG
Sbjct: 218 RSADAKQPFFSYVAYNAPHW-----PLQVPQAYYDRFANVKDPVRRTYIAMIAAMDDGVG 272
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
++ L+ +G+ EN+I++F+SDNG P E + N +N+P+ K T EGG +VP
Sbjct: 273 QILDTLEERGLRENTIVVFLSDNGCP-----ERFGFCN--TNHPWGSGKFTYLEGGTRVP 325
Query: 380 AIL 382
++
Sbjct: 326 FMM 328
>gi|421611065|ref|ZP_16052220.1| arylsulfatase B [Rhodopirellula baltica SH28]
gi|408498167|gb|EKK02671.1| arylsulfatase B [Rhodopirellula baltica SH28]
Length = 498
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 129/412 (31%), Positives = 197/412 (47%), Gaps = 49/412 (11%)
Query: 65 FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
F + T+ +R AA + + +++ G+ D+ GS + TPN+D LA +G++ +
Sbjct: 19 FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75
Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
Y A VC+PSRA L+TG+ P G +G + P G+P +E+ L ++L G
Sbjct: 76 YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 135
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
Y+T IGKWHLG + P RGF+ G L G Y+ + H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPTTMK-----------HVIER 183
Query: 235 NLSTAWDTVGEYATDLFTKEAVQLI---EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
N EY TD FT E ++ I E D+P F++ ++ A H + A +
Sbjct: 184 NGKRVDGFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PMHATEA 238
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
+ +F I + RRTYAAM+ LD VG + L+ G EN++++F SDNG T
Sbjct: 239 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 293
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
N +W N P RGVK ++ EGG++VP I W + R + + D LPT +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVISSL-DLLPTFCSA 348
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
AG + L + D + +N R + DG+D L + P+RR
Sbjct: 349 AGAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 397
>gi|116620007|ref|YP_822163.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116223169|gb|ABJ81878.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 443
Score = 167 bits (424), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 123/377 (32%), Positives = 188/377 (49%), Gaps = 40/377 (10%)
Query: 73 RRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPS 131
R AA + + +L G +DL + G+ ++ TP+IDALA G+ N Y+ PVC P+
Sbjct: 19 RAAPAADRPNVLVVVLDDLGCHDLGYLGAADLKTPHIDALAARGLKFRNWYSNAPVCAPA 78
Query: 132 RASLMTGKYPIHTGM--QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
R++++TG++P G+ GP A G+P L L+ GY T GKWHLG
Sbjct: 79 RSAILTGRFPASAGVPDNGP----ALAHGIPT----LASVLKGSGYQTGCFGKWHLGS-T 129
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL--SDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
E P GF+S +G+ +G + YY H D Y HD+ N + ++ G Y
Sbjct: 130 DETAPTGHGFDSFYGFHSGCVDYYSHRFYWGDNY--------HDLWHNRTEIFED-GRYL 180
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
T+ EA I ++P Y+A A H + AP + +F + P R+TY
Sbjct: 181 TERIADEAAGFIGR---NRPFLGYVAFNAPH-----YPMHAPAQYKARFPNLA-PERQTY 231
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN--YRNWGSNYPYR 365
AAM+ +DD +G + AL+ G EN+++ F+ DNGA T E R N + G N ++
Sbjct: 232 AAMIAAVDDGIGQIQRALETTGAAENTLMFFIGDNGA-TTEKRAGLNGDFATAGDNGVFK 290
Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
G K +L++GG+ VP + W I++ M + D LPT+ A G + LP +DG
Sbjct: 291 GYKFSLFDGGMHVPGFVSWPAGIRKGGWTDELAMSM-DILPTICRATG---APLPPRVDG 346
Query: 425 LDQWSSLLLNTPSRRNS 441
D +++ N PS S
Sbjct: 347 SDLLNTIASNAPSPHKS 363
>gi|323451693|gb|EGB07569.1| hypothetical protein AURANDRAFT_2707, partial [Aureococcus
anophagefferens]
Length = 351
Score = 167 bits (423), Expect = 1e-38, Method: Compositional matrix adjust.
Identities = 122/350 (34%), Positives = 169/350 (48%), Gaps = 36/350 (10%)
Query: 88 LIVY-----GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
LIV+ G ND+ + G I +P IDALA ++L+ YAQP+CTP+RA+LMTG
Sbjct: 13 LIVFLADDLGRNDVGYAHRDGGPGRIASPRIDALAAESLVLDRFYAQPMCTPTRAALMTG 72
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
++ TG+ + + G+P E + E LR+ GYST IGKWHLGF ++ P RG
Sbjct: 73 RHAYRTGLAYFVLLANQGTGLPAAEVTVAERLRDAGYSTHMIGKWHLGFAKKAMLPTSRG 132
Query: 199 FESHFGYLNGVISYYDH--------ILSDQYSRTVELN-----GHDM---RRNLSTAWDT 242
F+ FGY G Y+ H + S + E GHD+ L T
Sbjct: 133 FDRFFGYCLGSSDYWLHQSPEWVPGVPSTDRATGAEPPTTGGMGHDLWDGATPLPRTPKT 192
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
++ DLF A + P DKPLFLY A A HA G DP
Sbjct: 193 ENVHSADLFAARATETFATAPRDKPLFLYYASQAPHAHFVGAPPRHVAAAAAMGFAAGDP 252
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R AA+V LD+ VG V+ ++ EN+++ FM DNG S++
Sbjct: 253 -RADVAALVAALDEQVGNVVDGVRASRGWENTVLWFMGDNG---------PEPGTGASSW 302
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
P RG K TL+EGGV+ P+ +++P + P S +H+ D PTL AG
Sbjct: 303 PLRGAKRTLFEGGVREPSFVYAPG-RVAPGASSAPVHVIDVAPTLLGLAG 351
>gi|326426859|gb|EGD72429.1| hypothetical protein PTSG_00448 [Salpingoeca sp. ATCC 50818]
Length = 540
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 117/365 (32%), Positives = 178/365 (48%), Gaps = 44/365 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG--- 148
GWN SFH +NEI TP + L NG+ L + Y C+P+RAS +TG++P M
Sbjct: 41 GWNAPSFH-NNEIITPTLHHLHANGVELYSHYTYMFCSPTRASFLTGRFPYKHEMTNTNL 99
Query: 149 -PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
PP G+ L+ L + L++ YST IGKWHLG +++EYTP YRGF++ FG+L
Sbjct: 100 LPP---TRMLGLDLSYTTLADKLKQANYSTHHIGKWHLGMYKKEYTPRYRGFDTTFGFLT 156
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK- 266
G ++Y V D+ + A+ G + +FT A+ +I + +
Sbjct: 157 GGENHYTQ------RAFVSPPAVDLWDEEAPAYGMNGTWTGKMFTDAALDIIRNNAQLRN 210
Query: 267 ------PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN--RRTYAAMVKKLDDSV 318
PLF+Y A HA ++P + + D + T+ M+ +D++V
Sbjct: 211 ATGDAPPLFIYFALHDVHAPT-----QSPVRLAQLYNFTGDDAGLKSTFYGMISGVDEAV 265
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPT----------VEYRETSNYRNWGSNYPYRGVK 368
V AL+ GM ++++++ SDNG+PT V GSNYP RG K
Sbjct: 266 RNVTDALKAAGMWNDTLLVWTSDNGSPTMGSFNGGCKNVVPDSRGGACMCGSNYPLRGSK 325
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPLNI---D 423
+T +EGGV+ P + P + + R +MH+ DW TL + AG D + P I D
Sbjct: 326 HTNFEGGVRTPTFVSGPVLPAHMRGKRLDGLMHVCDWYSTLCSVAGVDPAD-PGGIAPLD 384
Query: 424 GLDQW 428
+D W
Sbjct: 385 AIDMW 389
>gi|390356461|ref|XP_793612.3| PREDICTED: arylsulfatase B-like [Strongylocentrotus purpuratus]
Length = 181
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 71/124 (57%), Positives = 93/124 (75%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GWND+SFHGS++IPTP+IDALA G+IL N Y P+CTP+R+++MTGK+PIHTG+Q I
Sbjct: 40 GWNDVSFHGSSQIPTPHIDALAQEGVILTNYYVSPICTPTRSAIMTGKHPIHTGLQYSVI 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+ E +P+YLR LGY T +GKWHLGFF+ TP +RGFES++GY G+
Sbjct: 100 IADEPYGLGTNETIMPQYLRSLGYRTHMVGKWHLGFFKESLTPSHRGFESYYGYYGGMQD 159
Query: 212 YYDH 215
Y+ H
Sbjct: 160 YFTH 163
>gi|345317091|ref|XP_001516952.2| PREDICTED: arylsulfatase I-like, partial [Ornithorhynchus anatinus]
Length = 264
Score = 166 bits (421), Expect = 2e-38, Method: Compositional matrix adjust.
Identities = 93/249 (37%), Positives = 144/249 (57%), Gaps = 20/249 (8%)
Query: 139 KYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+Y IHTG+Q I +P +PL + LP+ L+E GYST +GKWHLGF+++E P RG
Sbjct: 6 RYQIHTGLQHSIIRPRQPNCLPLDQVTLPQKLQEAGYSTHMVGKWHLGFYKKECLPTRRG 65
Query: 199 FESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
F++ G L G + YY + D + G+D+ + AW+ G+Y+T L+ + A ++
Sbjct: 66 FDTFLGSLTGNVDYYTYDNCD----GPGVCGYDLHEGENVAWEQSGKYSTLLYAQRAGKI 121
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
+ +PLFLY+A A H L++P+E + +++ + + RR YAAMV +D++V
Sbjct: 122 LASHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRYRAMGNVARRKYAAMVTCMDEAV 176
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
+ AL++ G +NS+I+F +DNG T + GSN+P RG K T WEGG
Sbjct: 177 RNITRALKKYGYYDNSVIVFSTDNGGQTF---------SGGSNWPLRGRKGTYWEGGTP- 226
Query: 379 PAILWSPQI 387
PA L SP +
Sbjct: 227 PATL-SPSL 234
Score = 47.8 bits (112), Expect = 0.017, Method: Compositional matrix adjust.
Identities = 23/75 (30%), Positives = 44/75 (58%), Gaps = 5/75 (6%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+ AW+ G+Y+T L+ + A +++ +PLFLY+A A H L++P+E + ++
Sbjct: 100 NVAWEQSGKYSTLLYAQRAGKILASHDPRRPLFLYVAFQAVHT-----PLQSPREYLYRY 154
Query: 66 QYITDPNRRTYAALT 80
+ + + RR YAA+
Sbjct: 155 RAMGNVARRKYAAMV 169
>gi|351729940|ref|ZP_08947631.1| sulfatase [Acidovorax radicis N35]
Length = 446
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 115/333 (34%), Positives = 158/333 (47%), Gaps = 40/333 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DLS +G + TP++DALA G+ YA VC+ +R +L+TG+Y + G++
Sbjct: 20 GWADLSVYGQADFSTPHLDALAGEGVRFTQAYANSAVCSATRFALITGRYQYRLRGGLEE 79
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + A G+P LP LR GY T IGKWHLG + PL G+ FG G
Sbjct: 80 PLVRKAHVHGLPPEHPTLPSLLRNAGYDTALIGKWHLGSLP-TFGPLKSGYNRFFGNYGG 138
Query: 209 VISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
I Y+ H + D Y V + + VG Y T L EA +
Sbjct: 139 AIDYFTHKPGVGEAVPRDLYEGEVPV-------------ERVG-YYTQLLADEASGWLRG 184
Query: 262 QPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
+ KP FL L A H G E + + N TY MV+ LD +VG
Sbjct: 185 RSAGSKPFFLSLHFTAPHWPWVGPEDEEVSLGLKDLFHYDGGNLHTYGKMVRSLDGAVGQ 244
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V+ ALQ +G+ +N+I++F SDNG + +P+ G K L EGG++VP
Sbjct: 245 VLHALQEQGLADNTIVVFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPT 293
Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+L W +I P+VS Q+ DWLPTL AAG
Sbjct: 294 LLRWPARIA--PQVSDQVTATMDWLPTLLAAAG 324
>gi|344237970|gb|EGV94073.1| N-acetylgalactosamine-6-sulfatase [Cricetulus griseus]
Length = 483
Score = 166 bits (421), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 123/357 (34%), Positives = 177/357 (49%), Gaps = 40/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---Q 147
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 2 GWGDLGVYGEPSRETPNLDQMALEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFYTSN 61
Query: 148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
G P+ G+P +E LPE L++ GY+ K +GKWHLG R ++ PL GF+ F
Sbjct: 62 GHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQFHPLKHGFDEWF 120
Query: 204 GYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
G N YD+ + R E+ G + NL T + T L+ +EA+
Sbjct: 121 GSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQLYLQEALDF 176
Query: 259 IEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I Q + P FLY A A HA ++ +R Y V+++DDS
Sbjct: 177 IRTQHARQSPFFLYWAIDATHA-----------PVYASRPFLGTSHRGRYGDAVREIDDS 225
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
VG ++S LQ G+ +N+ + F SDNGA + S + GSN P+ K T +EGG++
Sbjct: 226 VGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLCGKQTTFEGGMR 280
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQWSSLL 432
PAI W P +VS Q+ I D T + AG + R+ IDGLD ++L
Sbjct: 281 EPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSIAGLKPPSDRI---IDGLDLLPTML 334
>gi|196229618|ref|ZP_03128482.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196225944|gb|EDY20450.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 490
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/301 (36%), Positives = 156/301 (51%), Gaps = 36/301 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D SF GS +I TPN+DALA +G+ Y PVC+PSRA LMTG+Y G
Sbjct: 50 GYADASFQGSKDILTPNLDALAKSGVRCTRGYVTAPVCSPSRAGLMTGRYQERFGHHNNI 109
Query: 151 IW-GAEPRG-VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ A P +P E LP+ L + GY T +GKWHLG + P RGF+ FG + G
Sbjct: 110 VAEAALPIAHLPSNETLLPQVLAKAGYYTAMVGKWHLGL-QDGCRPYERGFDEFFGIITG 168
Query: 209 VISYY-----DHILSDQ-YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
Y+ + + DQ Y +E NG + V Y TD F +AV++I +
Sbjct: 169 GHDYFVNHPEERAVGDQSYKARIERNG--------PVGEAVPGYLTDAFGADAVRIIRES 220
Query: 263 PV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSV 318
D+PLFLYLA A H +AP++ ++ + +RRTYAA + +D SV
Sbjct: 221 HTKRPDQPLFLYLAFNAPHTPT-----QAPKDLVDTMPATLESKDRRTYAAQITSMDASV 275
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V +AL+ GM +++ I+F SDNG Y + N P R K +L+EGG++V
Sbjct: 276 GKVRAALKENGMEKDTFIVFFSDNGGANHPYYD---------NTPLRDHKGSLYEGGIRV 326
Query: 379 P 379
P
Sbjct: 327 P 327
>gi|32470862|ref|NP_863855.1| arylsulfatase B [Rhodopirellula baltica SH 1]
gi|32443007|emb|CAD71528.1| arylsulfatase B [Rhodopirellula baltica SH 1]
Length = 520
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 127/411 (30%), Positives = 196/411 (47%), Gaps = 47/411 (11%)
Query: 65 FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
F + T+ +R AA + + +++ G+ D+ GS + TPN+D LA +G++ +
Sbjct: 41 FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 97
Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
Y A VC+PSRA L+T + P G +G + P G+P +E+ L ++L G
Sbjct: 98 YVASAVCSPSRAGLLTSRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 157
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
Y+T IGKWHLG + P RGF+ G L G Y+ + H + R
Sbjct: 158 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGSHHYFPATMK-----------HVIER 205
Query: 235 NLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
N D EY TD FT E ++ I+ D+P F++ ++ A H + A +
Sbjct: 206 NGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHATEA 260
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
+ +F I + RRTYAAM+ LD VG + L+ G EN++++F SDNG T
Sbjct: 261 DLARFANIQNQKRRTYAAMMYALDRGVGRIREHLEETGQWENTLLVFFSDNGGAT----- 315
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
N +W N P RGVK ++ EGG++VP I P + ++ D LPT +AA
Sbjct: 316 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVLYDGVVSSLDLLPTFCSAA 371
Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
G + L + D + +N R + DG+D L + P+RR
Sbjct: 372 GAEPLALADPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 419
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 25/82 (30%), Positives = 40/82 (48%), Gaps = 8/82 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEA 57
+ RN D EY TD FT E ++ I+ D+P F++ ++ A H + A
Sbjct: 203 IERNGKRVDDFSSEYLTDFFTDEGLRFIDQHKSANPDQPWFVFFSYNAPHT-----PMHA 257
Query: 58 PQETINQFQYITDPNRRTYAAL 79
+ + +F I + RRTYAA+
Sbjct: 258 TEADLARFANIQNQKRRTYAAM 279
>gi|149701806|ref|XP_001488119.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Equus caballus]
Length = 491
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 121/355 (34%), Positives = 174/355 (49%), Gaps = 36/355 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---Q 147
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 10 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIRNGFYTTS 69
Query: 148 GPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
G P+ G+P +ER LPE L+E GY +K +GKWHLG R ++ PL GF+ F
Sbjct: 70 GHARNAYTPQEIVGGIPDSERLLPELLKEAGYVSKIVGKWHLG-HRPQFHPLKHGFDEWF 128
Query: 204 GYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
G N YD+ R E+ G + NL T + T ++ +EA+
Sbjct: 129 GSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEALDF 184
Query: 259 I-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I Q +P FLY A A HA ++ R Y V+++DDS
Sbjct: 185 IRRQQAARRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREIDDS 233
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
VG ++ +LQ G+ E++ + F SDNGA + S + GSN P+ K T +EGG++
Sbjct: 234 VGKILHSLQDLGIAESTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQTTFEGGMR 288
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
PA+ W P +VS Q+ I D T + AG + R IDGL ++L
Sbjct: 289 EPAVAWWPGRIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLSLLPAML 342
>gi|319954036|ref|YP_004165303.1| n-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
gi|319422696|gb|ADV49805.1| N-acetylgalactosamine-4-sulfatase [Cellulophaga algicola DSM 14237]
Length = 467
Score = 166 bits (420), Expect = 3e-38, Method: Compositional matrix adjust.
Identities = 111/345 (32%), Positives = 172/345 (49%), Gaps = 33/345 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS E TP +D LA I + Y + VC PSRA ++TGKY G +
Sbjct: 38 GYADFGFQGSKEFKTPELDKLAKKSIKFSQAYVSAAVCGPSRAGILTGKYQQKFGFEENN 97
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+PL + + YL++LGY T GKWH G R + P RGF+
Sbjct: 98 VPGYMSTSGLVGDEMGLPLDQITIANYLQDLGYKTALFGKWHQGNADR-FHPTKRGFDEF 156
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+G+ G SY + S+ S+ N + R + G Y TD EA+ I ++
Sbjct: 157 YGFRGGARSYMPYDDSNPLSK----NEDRLERGFGNFLEHEG-YLTDELAHEAISFI-NR 210
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
P F+YL+ A H +EA E + QF ++ R+T AAM +D ++GTV+
Sbjct: 211 NKKHPFFIYLSFNAVHTP-----MEATAEDLEQFPHLKG-KRKTLAAMTLAMDRAIGTVL 264
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
L+ +G+ N++I+F +DNG P+ N N P G K EGG++VP ++
Sbjct: 265 KELETQGLSNNTLIVFTNDNGGPSDA--------NDSDNSPLSGTKANHLEGGIRVPFLM 316
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
W Q+Q+N + + + D PT +AA G++S + +DG++
Sbjct: 317 SWPKQLQKNKNYNYPISTL-DLFPTFLSAASGNSSAME-ELDGVN 359
>gi|149197416|ref|ZP_01874467.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139434|gb|EDM27836.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 166 bits (420), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 117/353 (33%), Positives = 177/353 (50%), Gaps = 30/353 (8%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
AA + L L G+ DL F G+ +I TP+IDALA +G+ Y + VC PSRA L
Sbjct: 18 AAQKPNIILILADDLGYEDLGFLGAPDIKTPHIDALARSGMNFTQGYQSASVCGPSRAGL 77
Query: 136 MTGKYPIHTGM-QGPPIWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
+TG+Y G + PP G + G+PL E+ + + L+ Y+T IGKWH+G
Sbjct: 78 LTGRYQQLFGSGENPPETGELSKRFPDAGIPLDEQMIFDLLKPAAYTTGVIGKWHMGL-S 136
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
E P R + ++G+LNG SY + + + + R N + Y T+
Sbjct: 137 HEQRPTQRSVDYYYGFLNGAHSYREAKMD---MKGAPMTWPIFRNNEPVPFSG---YTTE 190
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+F E V I+ + DKP FLY+++ + H EA + + + +I RR Y+A
Sbjct: 191 VFNDEGVNFIK-RNKDKPFFLYMSYNSVHG-----PWEAQPKDLQRSDHIKKKWRRIYSA 244
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
M+ +DD VG +I L+ +G+ EN+++IFMSDNGAP + SN RG K
Sbjct: 245 MLISMDDGVGRLIQTLKDEGIYENTLVIFMSDNGAPNNLHEAERAGDYLASNGSLRGRKG 304
Query: 370 TLWEGGVKVPAILWSPQI------QQNPRVSLQ----MMHISDWLPTLYTAAG 412
+EGG++VP I+ PQ+ Q+P L ++HIS P +G
Sbjct: 305 DTYEGGIRVPYIMSWPQVIPKQSTYQHPVSGLDIVPTLIHISQAAPAKKELSG 357
>gi|348550278|ref|XP_003460959.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Cavia porcellus]
Length = 502
Score = 166 bits (419), Expect = 4e-38, Method: Compositional matrix adjust.
Identities = 127/375 (33%), Positives = 179/375 (47%), Gaps = 62/375 (16%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 14 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 73
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P +ER LP+ L+E GY+TK +GKWHLG R ++ PL
Sbjct: 74 NGFYTTNGHARNAYTPQEIVGGIPDSERLLPQLLKEAGYATKIVGKWHLG-HRPQFHPLK 132
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEY-------- 246
GF+ FG N YD + R N+ WD VG +
Sbjct: 133 HGFDEWFGSPNCHFGPYD---------------NKARPNIPVYRNWDMVGRFYEEFPINV 177
Query: 247 ------ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
T ++ +EA+ I + Q P FLY A A H AP F I
Sbjct: 178 KTGESNLTQIYLQEALDFIRQQQAAQHPFFLYWAVDATH---------APVYASRPFLGI 228
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ R Y V+++DDSVG ++S L+ + ENS + F SDNGA + S G
Sbjct: 229 S--QRGRYGDAVREIDDSVGKILSLLRTLSIAENSFVFFTSDNGAALI-----SAPSQGG 281
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSR 417
SN P+ K T +EGG++ PAI W P+ +VS Q+ I D T AG + R
Sbjct: 282 SNGPFLCGKQTTFEGGMREPAIAWWPKHIPAGQVSHQLGSIMDLFTTSLALAGLKPPSDR 341
Query: 418 LPLNIDGLDQWSSLL 432
+ IDGL+ ++L
Sbjct: 342 V---IDGLNLLPTML 353
>gi|170726755|ref|YP_001760781.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169812102|gb|ACA86686.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 548
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 136/391 (34%), Positives = 190/391 (48%), Gaps = 71/391 (18%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPI 142
L L G ND+S G I TPNID LA G + N Y+ C PSRA+L+TG+
Sbjct: 66 VLILADDMGINDVSTFGGGMIETPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGRDAT 125
Query: 143 HTGMQGPPI-----------------------WGAEP---------RGVPLTERFLPEYL 170
TG PI + AE RG+P +E +PE L
Sbjct: 126 RTGYDTTPIPDGMSRIIAAIENNEDNGRPEMSYSAEADATNPTYDNRGLPGSEILIPEIL 185
Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
+E GY T IGKWHLG E P +GF+ +G+ DH + + VE +G
Sbjct: 186 KESGYHTMHIGKWHLGR-SPEMMPNAQGFDESLMMDSGLYLPVDH--PESVNAPVESSGL 242
Query: 231 D------MRRNLSTAWDTVGE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
D MR S W+ GE Y TD FT+EA + IE ++P FLYLAH H
Sbjct: 243 DRFIWATMR--YSVNWNG-GEIFKPNGYLTDYFTEEAEKAIEAN-ANRPFFLYLAHWGPH 298
Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
N + A E + Q N+R YAAM++ +D SV V++ L+++G+ +N+I+I
Sbjct: 299 --NPVQAKRADYEAVGDIQ---PHNKRVYAAMLRSIDRSVERVMAKLEKQGIADNTIVIL 353
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQM 397
SDNG +Y ++ N PYRG KNT +EGG++VP ++ W I ++ + +
Sbjct: 354 SSDNGG--ADYVAINDL-----NKPYRGWKNTFFEGGIRVPFSVTWPNVIDESTVIEEPV 406
Query: 398 MHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
HI D +PT+ A D LP + IDG+D
Sbjct: 407 NHI-DLMPTIINMANAD---LPQDREIDGVD 433
Score = 41.6 bits (96), Expect = 1.3, Method: Compositional matrix adjust.
Identities = 27/68 (39%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD FT+EA + IE ++P FLYLAH H N + A E + Q N+R
Sbjct: 267 YLTDYFTEEAEKAIEAN-ANRPFFLYLAHWGPH--NPVQAKRADYEAVGDIQ---PHNKR 320
Query: 75 TYAALTKS 82
YAA+ +S
Sbjct: 321 VYAAMLRS 328
>gi|443716207|gb|ELU07840.1| hypothetical protein CAPTEDRAFT_136127, partial [Capitella teleta]
Length = 311
Score = 166 bits (419), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 102/327 (31%), Positives = 161/327 (49%), Gaps = 32/327 (9%)
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SR S ++G+Y + MQ I +P+ + L LP YL+ELGY T A GKWHLG+ R
Sbjct: 1 SRGSFLSGRYSYKSAMQHGVILDNKPQCLGLDYTILPGYLKELGYETHAFGKWHLGYCRD 60
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
E TP +RGF+S G +G +Y+H T +D +D +G+++ DL
Sbjct: 61 ECTPTHRGFDSFSGGFSGEGEFYEHT-------TATGGYYDWHLGTEVDYDAIGKHSEDL 113
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRR 305
+ +++ PLF+Y+A H +P + +F + D +R+
Sbjct: 114 IGYYVNKTLDEYDQSSPLFMYVAFHNVH---------SPLDPKPEFLALYDDLDVTDDRK 164
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
Y +V +D +G ++ ++ G+ EN+ I+F SDNG E G N P+R
Sbjct: 165 KYLGLVSGMDYIIGGIVDKIKEIGIYENTYILFSSDNGGDVGE----------GDNSPHR 214
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
G K+TLWEGG K + SP + + + +MH++DWLPT+ AGG +DG
Sbjct: 215 GGKSTLWEGGCKANSWTHSPLLGASGVENNGLMHVTDWLPTIVDLAGGSVPASD-GLDGF 273
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSL 452
Q ++ N S R + I +D+ S +
Sbjct: 274 VQTDMVINNGDSARTTMIYNIDRESDI 300
>gi|114145565|ref|NP_001041316.1| N-acetylgalactosamine-6-sulfatase precursor [Rattus norvegicus]
gi|123779981|sp|Q32KJ6.1|GALNS_RAT RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158026|tpe|CAI84987.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus
norvegicus]
Length = 524
Score = 165 bits (418), Expect = 5e-38, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 95 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
PL GF+ FG N YD+ + R E+ G + NL T +
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206
Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T L+ +EA+ I Q + P FLY A A HA Q++ R
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S + GSN P+
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
K T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367
Query: 425 LDQWSSLL 432
LD ++L
Sbjct: 368 LDLLPTML 375
>gi|149199924|ref|ZP_01876952.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149136993|gb|EDM25418.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 455
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 120/350 (34%), Positives = 181/350 (51%), Gaps = 47/350 (13%)
Query: 76 YAALTKSTTLTLLIV---YGWNDLSFHGSNE--IPTPNIDALAYNGIILNNMY-AQPVCT 129
+ AL S ++I+ G+ D+S++ ++ I TP+ DALA +G+I + Y + VC+
Sbjct: 17 FVALADSKPNIIVILSDDQGYADVSYNPEHDDYISTPHTDALAKSGVIFHRGYTSGSVCS 76
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
+R+ LMTG+Y G+ G G L +F+P YL+E GY + A GKWHLG
Sbjct: 77 TTRSGLMTGRYQQRYGIYTA---GEGGTGTDLNAKFIPNYLKEAGYKSMAFGKWHLG-HE 132
Query: 190 REYTPLYRGFESHFGYLN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
+Y PL+RGF+ +G++ G ++ L +Y G + R L D Y T
Sbjct: 133 MKYHPLHRGFDDFYGFMGRGAHDFFR--LEKEYDGKF---GGPIYRGLEPIDDK--GYLT 185
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
T+E V+ IE+ DKP F Y+A+ A H +AP E I D R
Sbjct: 186 TRITEETVKFIEENK-DKPFFAYVAYNAVHT-----PAQAPAEDIKAVS--GDETRDILV 237
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
AM+K LD VG ++ L++ + EN+III++SDNG +N P RGVK
Sbjct: 238 AMLKHLDLGVGEIVKTLKKHDIYENTIIIYLSDNGGAKSMV---------ANNKPLRGVK 288
Query: 369 NTLWEGGVKVPAIL-WSPQIQ-----QNPRVSLQMMHISDWLPTLYTAAG 412
+ +++GG++VP ++ W QI+ Q+P +SL D LPTL AAG
Sbjct: 289 HDIYDGGIRVPFLMSWPAQIKAGQDTQSPVISL------DILPTLLDAAG 332
>gi|406661802|ref|ZP_11069914.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554347|gb|EKB49446.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 486
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 134/386 (34%), Positives = 190/386 (49%), Gaps = 53/386 (13%)
Query: 77 AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRAS 134
+ +TK + +L G+ DL F G I TPN+D LA GI +N YA VC PSR++
Sbjct: 29 STITKPNIIFILADDLGYGDLGFLGQKIIETPNLDRLAAEGIFFSNHYAGATVCAPSRST 88
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
LMTG + HT ++G + +P G +P T +P +E GY+T A GKW LGF E
Sbjct: 89 LMTGLHTGHTPVRGN--FEIQPEGQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTE 146
Query: 192 YTPLYRGFESHFGYLNGVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
P ++GF+ FGY I+ YY L D + V L G+D + ++ YA D
Sbjct: 147 GDPNHQGFDQFFGYNCQRIAHRYYPEYLWDN-AEKVFLEGNDWTQKIT--------YAAD 197
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHA-----------------GNAGKHLEAPQ-- 290
+ KE+++ I D + P F+++ + HA G H+ AP+
Sbjct: 198 IIHKESLKFIRDN-ANNPFFMFVTSVMPHAELAVPDGELLAYYREKIGPEEPHI-APRGS 255
Query: 291 ----ETINQFQYITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
E IN Y ++P + +YAAMV LD VG ++ L G+ E +IIIF SDNG P
Sbjct: 256 DYGDEPINIPGYQSNPYPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNG-P 314
Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
E ++ N SN +RG K L+EGG++VP IL P + Q S + D+LP
Sbjct: 315 HQEGGNDPDFFN--SNGIFRGYKRDLYEGGIRVPMILKWPGVIQGGAQSGHVSAFWDFLP 372
Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSL 431
T G N+D LD S L
Sbjct: 373 TFADLLGEK------NLDKLDGVSFL 392
>gi|325110321|ref|YP_004271389.1| arylsulfatase [Planctomyces brasiliensis DSM 5305]
gi|324970589|gb|ADY61367.1| Arylsulfatase [Planctomyces brasiliensis DSM 5305]
Length = 980
Score = 165 bits (418), Expect = 6e-38, Method: Compositional matrix adjust.
Identities = 106/303 (34%), Positives = 167/303 (55%), Gaps = 23/303 (7%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP 149
+G+ +G TP++DALA G+ N Y + VCTPSRA ++TG+Y G++
Sbjct: 40 FGFQGGGINGDFANLTPHLDALAEGGVRFTNGYVSAAVCTPSRAGMLTGRYQHRFGVE-- 97
Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
++G P G+P +E + + LR+ GY T AIGKWHLG E+ P RGF+ +G L G
Sbjct: 98 TVYGRIPEAGLPASEITMADTLRKAGYRTYAIGKWHLGEHLHEHLPNQRGFDEFYGALTG 157
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD--- 265
+++ + ++ S+ ++ NG + L Y TDL ++ V I+D +
Sbjct: 158 ARTFFPYRGNNPGSK-LQRNGVFLPEPLDQP------YFTDLLARQTVAYIDDHVANHAN 210
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETIN--QFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
P FLYLA A H LEA + ++ + Q I+ P R+T AAM +DD+VGTV++
Sbjct: 211 APFFLYLAFTAVHT-----PLEADPKRLDDRRIQDISPPQRKTLAAMTIAMDDAVGTVMA 265
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L+ + EN+I++F+SDNG P E + NW N +G K+ +EGG++VP ++
Sbjct: 266 KLRENNLTENTIVVFLSDNGGP--EDNRSLRAPNWSDNGALQGNKSQNFEGGIRVPFVIH 323
Query: 384 SPQ 386
P+
Sbjct: 324 WPR 326
>gi|440716553|ref|ZP_20897058.1| arylsulfatase B [Rhodopirellula baltica SWK14]
gi|436438412|gb|ELP31962.1| arylsulfatase B [Rhodopirellula baltica SWK14]
Length = 498
Score = 165 bits (418), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 128/412 (31%), Positives = 197/412 (47%), Gaps = 49/412 (11%)
Query: 65 FQYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNM 122
F + T+ +R AA + + +++ G+ D+ GS + TPN+D LA +G++ +
Sbjct: 19 FVFATEASR---AAESTPPNIVVILADDMGYGDMGCMGSQTLQTPNLDRLAESGVLCSQA 75
Query: 123 Y-AQPVCTPSRASLMTGKYPIHTGMQG-----PPIWGAEPR--GVPLTERFLPEYLRELG 174
Y A VC+PSRA L+TG+ P G +G + P G+P +E+ L ++L G
Sbjct: 76 YVASAVCSPSRAGLLTGRDPRRFGYEGNLNASDENYATRPELLGLPTSEKTLADHLGAAG 135
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRR 234
Y+T IGKWHLG + P RGF+ G L G Y+ ++ H + R
Sbjct: 136 YATALIGKWHLGMGEMHH-PNRRGFDHFCGMLTGGHHYFPTTMN-----------HVIER 183
Query: 235 NLSTAWDTVGEYATDLFTKEAVQLI---EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
N + EY TD FT E ++ I E D+P F++ ++ A H + A +
Sbjct: 184 NGKRVENFSSEYLTDFFTDEGLRFIDQHESANPDQPWFVFFSYNAPHT-----PMHATEA 238
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
+ +F I RRTYAAM+ LD VG + L+ G N++++F SDNG T
Sbjct: 239 DLARFANIQTQKRRTYAAMMYALDRGVGRIREHLEESGQWGNTLLVFFSDNGGAT----- 293
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
N +W N P RGVK ++ EGG++VP I W + R + + D LPT +A
Sbjct: 294 --NNGSW--NGPLRGVKGSMREGGIRVPMIWTWPAKFPAGVRYDGVVSSL-DLLPTFCSA 348
Query: 411 AGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT--PSRR 460
AG + L + D + +N R + DG+D L + P+RR
Sbjct: 349 AGAEPLALANPMSHEDASNRKRMN---RLSGTHDGIDMAPHLADGSEPPNRR 397
>gi|340373299|ref|XP_003385179.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 508
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 150/569 (26%), Positives = 243/569 (42%), Gaps = 111/569 (19%)
Query: 84 TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+ L+ +GW ++ +H + E+ TPNID+L G+ L+ Y VC+PSR+SLM+G+
Sbjct: 28 VMMLVDDWGWANVGYHRNPPTKEVVTPNIDSLVRQGLELDQHYVFNVCSPSRSSLMSGRL 87
Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
PIH P+ G +P + + ++ GY T +GKW G
Sbjct: 88 PIHVNDLNIEPDYYNPDDPVSGFS--AIPRNMTGIAQKMKLGGYDTHQVGKWDAGMATHT 145
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG----EYA 247
+TP RGF+S FGY + +Y I ++T + D+ + G Y
Sbjct: 146 HTPKGRGFDSSFGYFHHANDFYTEIDGKPCNKTKIV---DIWVTDKPGYGLNGTGPDNYE 202
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
LF ++ ++++ + KPLFLY A HA L+ PQ ++F +I D +R+ Y
Sbjct: 203 EGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----PLQVPQRYQDKFSFIDDHDRQIY 257
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
AMV LDD VG ++ AL++KG+ EN + + SDNG P Y G+ NYP +G
Sbjct: 258 HAMVNYLDDVVGELVDALKKKGLWENLLFVTSSDNGGPV--------YPGGGANNYPLKG 309
Query: 367 VKNTLWEGGVKVPAIL---WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
K + W+GG++ A + + P+ + ++ +H++DW T A N+D
Sbjct: 310 GKASDWQGGIRSNAFVSGGYLPEKMRGKKLD-GYIHLADWYATFCALA---------NVD 359
Query: 424 GLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKL 483
D+ ++ P ID L+ W L+ V I I K + +K+
Sbjct: 360 PTDEAAAKADLPP------IDSLNMW-PLISGETMTSPRVDIPISYKTLISG----EYKI 408
Query: 484 VLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTR 543
+ GT GY G N+ N N + NIF
Sbjct: 409 LTGTNNQA---GYTGPQYPNQT---NPNGGI-------------NIF------------- 436
Query: 544 QQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYE-LLKYHRR 602
QQ C L+N+ DP E+ ++A+ PD ++ + L +Y +
Sbjct: 437 QQCGDGC------------------LYNIMTDPEERVDLAAKMPDKLKEMQQKLAQYQKT 478
Query: 603 TLVP-QSHEQPDLVQADPKRFNDTWSPWI 630
P + P A +N W P++
Sbjct: 479 HFAPDRGSPWPGACDAALNTYNGFWGPFV 507
Score = 42.0 bits (97), Expect = 1.0, Method: Compositional matrix adjust.
Identities = 21/66 (31%), Positives = 35/66 (53%), Gaps = 5/66 (7%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y LF ++ ++++ + KPLFLY A HA L+ PQ ++F +I D +R+
Sbjct: 201 YEEGLFKEQLLKVVNEHDTGKPLFLYYAPHIVHA-----PLQVPQRYQDKFSFIDDHDRQ 255
Query: 75 TYAALT 80
Y A+
Sbjct: 256 IYHAMV 261
>gi|171184398|ref|NP_057931.3| N-acetylgalactosamine-6-sulfatase isoform 1 precursor [Mus
musculus]
gi|124007189|sp|Q571E4.2|GALNS_MOUSE RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|74198064|dbj|BAE35212.1| unnamed protein product [Mus musculus]
gi|148679747|gb|EDL11694.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_b [Mus
musculus]
Length = 520
Score = 165 bits (417), Expect = 7e-38, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 91 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDGLD
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366
Query: 428 WSSLL 432
++L
Sbjct: 367 LPTML 371
>gi|329744562|ref|NP_001193258.1| N-acetylgalactosamine-6-sulfatase precursor [Bos taurus]
gi|296478055|tpg|DAA20170.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Bos taurus]
Length = 522
Score = 164 bits (416), Expect = 9e-38, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 177/362 (48%), Gaps = 36/362 (9%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+G G P+ G+P +E LP L+ GY++K +GKWHLG R ++ PL
Sbjct: 94 SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 152
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 208
Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA++ I+ Q +P FLY A A HA ++ R Y
Sbjct: 209 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PIYASKPFLGTSQRGRYGDA 257
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
+++LDDSVG ++ L+ + EN+ + F SDNGA + S R GSN P+ K T
Sbjct: 258 IRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALI-----SAPRQGGSNGPFLCGKQT 312
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P +VS Q+ I D T + AG + R IDGLD +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 371
Query: 431 LL 432
+L
Sbjct: 372 ML 373
>gi|149038400|gb|EDL92760.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Rattus norvegicus]
Length = 466
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/368 (33%), Positives = 179/368 (48%), Gaps = 46/368 (12%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 VLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 94
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 95 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 150
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
PL GF+ FG N YD+ + R E+ G + NL T +
Sbjct: 151 HPLKHGFDEWFGSPNCHFGPYDNKVKPNIPVYRDWEMVGRFYEEFPINLKTGEANL---- 206
Query: 248 TDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T L+ +EA+ I Q + P FLY A A HA Q++ R
Sbjct: 207 TQLYLQEALDFIRTQHARQSPFFLYWAIDATHA-----------PVYASKQFLGTSLRGR 255
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S + GSN P+
Sbjct: 256 YGDAVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPKEGGSNGPFLC 310
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDG 424
K T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDG
Sbjct: 311 GKQTTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDG 367
Query: 425 LDQWSSLL 432
LD ++L
Sbjct: 368 LDLLPTML 375
>gi|395508489|ref|XP_003758543.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Sarcophilus harrisii]
Length = 492
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 122/364 (33%), Positives = 169/364 (46%), Gaps = 41/364 (11%)
Query: 79 LTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMT 137
LT T + GW DL G TP++D +A G++ N Y A P+C+PSRA+L+T
Sbjct: 21 LTCLRVATFSLQMGWGDLGVFGEPSKETPHLDQMAAEGMLFPNFYTANPLCSPSRAALLT 80
Query: 138 GKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
G+ PI G A R G+P +E LPE L++ GY K +GKWHLG
Sbjct: 81 GRLPIRNGFY---TTNAHARNAYTPQEIVGGIPDSEFLLPELLKKAGYVNKIVGKWHLGH 137
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDT 242
R ++ PL GF+ FG N YD+ R E+ G D NL T
Sbjct: 138 -RPQFHPLKHGFDEWFGAPNCHFGPYDNKARPNIPVYRNWEMVGRFFEDFPINLKTGEAN 196
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
+ T ++ +EAV I+ Q +P FLY A A HA ++
Sbjct: 197 L----TQIYLQEAVDFIKQQAHQQPFFLYWAVDATHA-----------PVYASKSFLGTS 241
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R Y V+++DD VG ++ LQ G+ EN+ + F SDNGA + S GSN
Sbjct: 242 QRGRYGDAVREIDDCVGKILKLLQDLGIQENTFVFFTSDNGAALI-----SAPIQGGSNG 296
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
P+ K T +EGG++ PAI W P RVS Q+ ++ D T + AG I
Sbjct: 297 PFLCGKQTTFEGGMREPAIAWWPGHIPAGRVSHQLGNVMDLFTTSLSLAGLKPPS-DREI 355
Query: 423 DGLD 426
DG+D
Sbjct: 356 DGID 359
>gi|384098332|ref|ZP_09999449.1| chondroitinsulfatase [Imtechella halotolerans K1]
gi|383835828|gb|EID75248.1| chondroitinsulfatase [Imtechella halotolerans K1]
Length = 464
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 104/310 (33%), Positives = 167/310 (53%), Gaps = 30/310 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G D+ +G + I TP+ID LA G+ NN YA VC PSRA+L+TG++ HT ++G
Sbjct: 36 GIGDIGVYGQSIIQTPHIDRLAEEGMKFNNFYAGSTVCAPSRAALLTGQHTGHTQVRGNG 95
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ +P ++ +PE L+ GY+ GKW +G TP RGF+ G+++ V
Sbjct: 96 EFPLDP-----AKKIIPEVLKNAGYTNAIFGKWGMGLNGSGSTPDKRGFDVFAGHIHHVS 150
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
++Y Q +++ H + + DT Y + FT++A+Q I +QP DKP FL
Sbjct: 151 AHY------QKPDSIDAIVHGDLKRIGLPKDT---YVNEYFTEQALQFINNQPKDKPFFL 201
Query: 271 YLAHLAAHA------GNAGKHLEAPQETINQFQ--------YITDP-NRRTYAAMVKKLD 315
++++ HA KHL + +++ + + Y P + YAA+V+ LD
Sbjct: 202 FMSYTIPHAELVVPPKYLQKHLLSAEQSKHDSEKEWPAGRHYGPQPFPKAAYAALVESLD 261
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
D G +++AL+ KG+ EN+I+IF SDNG T R + +GS+ Y+GVK L+ GG
Sbjct: 262 DYTGQILAALKAKGIDENTIVIFTSDNGTHTEGGRTQEDVDYFGSSAHYQGVKRDLYTGG 321
Query: 376 VKVPAILWSP 385
+K P ++ P
Sbjct: 322 IKEPFLIRWP 331
Score = 40.4 bits (93), Expect = 3.3, Method: Compositional matrix adjust.
Identities = 15/34 (44%), Positives = 24/34 (70%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 48
Y + FT++A+Q I +QP DKP FL++++ HA
Sbjct: 177 YVNEYFTEQALQFINNQPKDKPFFLFMSYTIPHA 210
>gi|60359902|dbj|BAD90170.1| mFLJ00319 protein [Mus musculus]
Length = 537
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 175/365 (47%), Gaps = 40/365 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 48 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 107
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 108 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 163
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 164 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 222
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 223 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 271
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 272 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 326
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDGLD
Sbjct: 327 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 383
Query: 428 WSSLL 432
++L
Sbjct: 384 LPTML 388
>gi|313228605|emb|CBY07397.1| unnamed protein product [Oikopleura dioica]
Length = 492
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 141/481 (29%), Positives = 221/481 (45%), Gaps = 92/481 (19%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
L K + ++V G++DL + + ++ +PNIDALA + + L Y QP CTPSRA+
Sbjct: 19 VLEKKPHVIFVLVDDLGFDDLGY-VNRDVISPNIDALAKDALHLKKHYVQPSCTPSRAAF 77
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG+Y I GMQ I +EP G+PL E L E ++ GY T GKWHLGF+ ++ P
Sbjct: 78 LTGRYNIRMGMQSGVIRPSEPEGIPLRETLLSEAFKQCGYRTSLQGKWHLGFYTYKHCPQ 137
Query: 196 YRGFESHFGYLNGVISYYDH----ILSDQYSRTVELNGHDMRR----------------- 234
RGF+ +G+ G +Y H + S ++S E++G + +R
Sbjct: 138 NRGFDRFYGFYLGSQDFYFHDSGRLSSHKFSYEDEVSGVNNKRWLQGPHGRPWGYDFREA 197
Query: 235 ---NLSTAWDTV-------GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
N DT+ G Y+T LF + + + LF Y++ H
Sbjct: 198 YPANGDVENDTILDDFHTNGTYSTKLFVNDFINDLAKHDPAVSLFNYVSFQDVH------ 251
Query: 285 HLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
P +T+N+F+ Y + T+ ++ + T I+ S+++F SDN
Sbjct: 252 ---GPLQTVNKFKKHYKNKTDTWTHERIL------ISTKIT----------SLLVFTSDN 292
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ----MM 398
G E SN+P RG K T+++GG+K A + SP +Q R+ Q +
Sbjct: 293 GGRPKE---------GASNWPLRGSKGTIYDGGLKSRAFVASPLLQN--RMKGQNYNYLF 341
Query: 399 HISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLLNTPSR-RNSNIDGLDQWSSLLLNT 456
H+SDW PTL T +G D R +DG + L + P + RN + LD +
Sbjct: 342 HVSDWFPTLLTLSGCDVPRKEGRPLDG--KIHELFVYKPKKARNKMLHYLDPLTQFDYGD 399
Query: 457 PSRRNSVLINIDEKKRT--AAVRLDSWKLVLGT-QENGTMDGYYGQTR-----SNKVPLL 508
P RN +++ T A+ R WK++ G NG GY + R NK+ L+
Sbjct: 400 P--RNFEVLDDRAFNTTIKASYRQGRWKIITGKPCANGC--GYIKKERPWNYAKNKMKLI 455
Query: 509 N 509
+
Sbjct: 456 D 456
>gi|323454930|gb|EGB10799.1| hypothetical protein AURANDRAFT_22180 [Aureococcus anophagefferens]
Length = 530
Score = 164 bits (416), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 114/380 (30%), Positives = 180/380 (47%), Gaps = 42/380 (11%)
Query: 80 TKSTTLTLLIVYGWNDLSFHGSNEIPT-------PNIDALAYNGIILNNMYAQPVCTPSR 132
+K + ++V +DL F+ + + T P++ ALA + + Y +CTP+R
Sbjct: 17 SKQPHIAFMLV---DDLGFNDAWAVNTSDVAGAWPHVGALANESVRIATYYTTMLCTPTR 73
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+ MTG+ P G+ I G + G+P E + + L+ GY+T +GKWHLG F
Sbjct: 74 GAFMTGRLPQRLGLHHGVIGGFQDYGLPANETTIADKLKGAGYATAHVGKWHLGNFDDAS 133
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS--TAWDTV------- 243
P RGF++ +GY NG D ++R ++ G +L+ ++ TV
Sbjct: 134 EPTRRGFDASYGYQNGE--------EDHFTRILQGLGDFQASSLADGASYYTVDDLLKIN 185
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA---GNAGKHLEAPQETINQFQYIT 300
G + + LF AV ++ PLFLYLA HA AG +A +I ++
Sbjct: 186 GTHNSFLFVDRAVAVVGAHDASIPLFLYLALQDTHAPIQAPAGYADDAACASIGSSRHYP 245
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
D RR + MV+ D+S+ V +A+ NS+I+ DNG + G+
Sbjct: 246 DGARREFCGMVRAADESIANVTAAIDANFGKRNSVIVIAGDNGGIV---------KGGGN 296
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
N+P RG K WEGGV+ A++ +P M+H+ D TL AGG ++
Sbjct: 297 NWPLRGQKGQPWEGGVRNHALIRAPGRLAPGLYDKGMVHVVDLHATLVALAGGASA---Y 353
Query: 421 NIDGLDQWSSLLLNTPSRRN 440
IDGLD W++L +PS R+
Sbjct: 354 EIDGLDVWAALRDGSPSPRS 373
>gi|7527462|gb|AAF63155.1|AF111346_1 N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
gi|7576473|gb|AAF63858.1| N-acetylgalactosamine-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 124/365 (33%), Positives = 174/365 (47%), Gaps = 40/365 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 91 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF FG N YD+ R E+ G T T L
Sbjct: 147 HPLKHGFNEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDGLD
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366
Query: 428 WSSLL 432
++L
Sbjct: 367 LPTML 371
>gi|406661522|ref|ZP_11069640.1| Arylsulfatase [Cecembia lonarensis LW9]
gi|405554671|gb|EKB49747.1| Arylsulfatase [Cecembia lonarensis LW9]
Length = 477
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 190/364 (52%), Gaps = 40/364 (10%)
Query: 69 TDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
++P ++ + L G++D+ +G+++I TP++D LA G+ N Y AQ V
Sbjct: 19 SNPQGNNETSIIPNIILIFTDDQGYHDVGVYGASDIETPHLDQLASEGLQFTNFYVAQAV 78
Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
C+ SRA+L+TG YP G+ G + + G+ E + + L+ LGY+T GKWHLG
Sbjct: 79 CSASRAALLTGTYPNRLGIHGA-LDHSSKHGLHPEEATIADLLKPLGYATAVFGKWHLGH 137
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
E+ P +GF+ +FG + +H + Y + + +D + + T W+ +
Sbjct: 138 -HPEFLPTNQGFDEYFGIPYSNDMWPNHPQTKDYYPPLPIYQND--KVVDTIWNDQSMFT 194
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
T FT++++ IE D+P FLYLAH H P ++F+ NR Y
Sbjct: 195 T-WFTEKSIDFIERNK-DRPFFLYLAHPMPHV---------PLFVSDKFK--GKSNRGLY 241
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
++ ++D SVG ++ L++ + EN+IIIF+SDNG P + Y E S GS YP R
Sbjct: 242 GDVIMEIDWSVGQIMQTLKQNDLDENTIIIFLSDNG-PWLSYSEHS-----GSAYPLREG 295
Query: 368 KNTLWEGGVKVPAIL-W-----SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL- 420
K T W+GGVKVPAI+ W S +Q+NP +++ D LP L G + LP+
Sbjct: 296 KGTSWDGGVKVPAIMRWKGTWPSGLVQKNPAMTI------DILPMLAHLTG---AELPVK 346
Query: 421 NIDG 424
IDG
Sbjct: 347 KIDG 350
>gi|340369111|ref|XP_003383092.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 507
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 112/347 (32%), Positives = 165/347 (47%), Gaps = 33/347 (9%)
Query: 84 TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+ L+ +GW ++ +H + E+ TPNID L G+ LN YA C+PSR+SL++G+
Sbjct: 28 VMMLVDDWGWANVGYHRNPPTREVVTPNIDKLVKEGLELNQHYAFRFCSPSRSSLISGRL 87
Query: 141 PIHTG---------MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
PIH P+ G +P + E +R GY+T +GKW G +
Sbjct: 88 PIHVNDLNLAPTNYNPNDPVSGFS--AIPRNMTGIAEKMRGAGYATHQVGKWDAGMATSD 145
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL--STAWDTVGEYATD 249
+TP RGF++ FGY + YY I V+L D + T D Y
Sbjct: 146 HTPKGRGFQTSFGYFHHANDYYTEIAGKCKIPIVDLWETDKPAHGINGTGPDN---YEEG 202
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
LF + + ++ + PLFLY A HA E P +N+F +I + R+ Y A
Sbjct: 203 LFKERVLSVVSNHDPSTPLFLYYAPHIVHA-----PYEVPDRYLNKFSFIDNHLRQYYHA 257
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV LDD VG + AL+ KGM +N + + SDNG P E +NYP +G K
Sbjct: 258 MVNYLDDVVGELTDALKNKGMWDNLLFVTSSDNGGPVHE-------GTGANNYPLKGGKT 310
Query: 370 TLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD 414
T W+GGV+V A + + + R + +H++DW T AG D
Sbjct: 311 TDWQGGVRVNAFVSGGYLPEKMRGQKTDGYIHLADWYGTFCAIAGVD 357
>gi|299530773|ref|ZP_07044188.1| sulfatase [Comamonas testosteroni S44]
gi|298721289|gb|EFI62231.1| sulfatase [Comamonas testosteroni S44]
Length = 440
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 160/328 (48%), Gaps = 30/328 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +G+ + TP++D LA G+ N YA VC+ +R +L+TG+Y + G++
Sbjct: 15 GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G+P LP L++ GY T IGKWHLG Y P G++ FG +G
Sbjct: 75 PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQLSGYDYFFGNHSG 133
Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
I Y+ H + +Q+S + + R Y T + EA + + E + D
Sbjct: 134 AIDYFTHKPGVGEQFSSDLWQGNEPVERT---------GYYTYILGDEATRYVHERKGQD 184
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
KP FL L A H G E +I + + + Y +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
G +N+I+IF SDNG + +P+ G K L EGG++VP +L WS
Sbjct: 245 DDSGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+IQ P+V Q+ DWLPTL AAG
Sbjct: 294 ARIQ--PQVQEQITASFDWLPTLLAAAG 319
>gi|149196558|ref|ZP_01873612.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140238|gb|EDM28637.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 443
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 106/313 (33%), Positives = 159/313 (50%), Gaps = 32/313 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L G+ D+ F GS++I TP+ID LA +G+I + Y + VC PSRA LMTGK +
Sbjct: 12 VLILADDLGYGDVGFTGSSQIKTPHIDRLAKDGVIFSQGYVSSSVCGPSRAGLMTGKNQV 71
Query: 143 HTGMQGP-----PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
G P + E G+P++E+ L L E GY +GKWHLG + +Y PL R
Sbjct: 72 RFGFDNNLTNYLPQFKDEFHGLPISEKTLATRLAEKGYVNGLVGKWHLGD-KEQYHPLKR 130
Query: 198 GFESHFGYLNGVISYYDHILSDQ-YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
GF +GYL G Y+ + + Y +E N + + +S D G+ D +
Sbjct: 131 GFHEFWGYLGGGHHYFRSKPNGKGYDCPIECN-YKTPQPISYITDDKGDECVDFIRRHK- 188
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
D+P FL+ + A HA + A +E + + +I RR Y AM LD
Sbjct: 189 --------DEPFFLFASFNAPHA-----PMHAKEEDLKLYSHIEGEKRRAYCAMTHNLDL 235
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG +++ L+ + + +N++++F+SDNG PT N N P RG K L EGG+
Sbjct: 236 NVGKIVAELKAQNIYDNTVVVFLSDNGGPTPN--------NASLNAPLRGGKGILLEGGI 287
Query: 377 KVP-AILWSPQIQ 388
+VP + W +IQ
Sbjct: 288 RVPFTMTWPAKIQ 300
>gi|395856891|ref|XP_003800850.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Otolemur garnettii]
Length = 526
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 125/373 (33%), Positives = 176/373 (47%), Gaps = 42/373 (11%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
A + + L L+ GW DL +G TPN+D +A G + N Y A P+C+PSRA+L
Sbjct: 30 APQSPNIVLLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAAL 89
Query: 136 MTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
+TG+ PI G A R G+P +E LPE L+E GY +K +GKWHL
Sbjct: 90 LTGRLPIRNGFY---TTNAHARNAYTPQEIVGGIPRSEHLLPELLKEAGYISKIVGKWHL 146
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAW 240
G R ++ PL GF+ FG N YD+ R E+ G + NL T
Sbjct: 147 G-HRPQFHPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGE 205
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+ T ++ +EA+ I+ Q + P FLY A A HA ++
Sbjct: 206 SNL----TQIYLQEALDFIKRQQAQQHPFFLYWAIDATHA-----------PVYASKAFL 250
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
R Y V+++DDSVG ++ L+ + EN+ + F SDNGA + S R G
Sbjct: 251 GTSQRGRYGDAVREIDDSVGKILDLLRGLSIGENTFVFFTSDNGAALI-----SAPRQGG 305
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
SN P+ K T +EGG++ PAI W P +VS Q+ I D T + AG +
Sbjct: 306 SNGPFLCGKQTTFEGGMREPAIAWWPGHIPAAQVSHQLGSIMDLFTTSLSLAGLEPPS-D 364
Query: 420 LNIDGLDQWSSLL 432
IDGLD +LL
Sbjct: 365 RTIDGLDLLPTLL 377
>gi|255038246|ref|YP_003088867.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254951002|gb|ACT95702.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 481
Score = 164 bits (415), Expect = 1e-37, Method: Compositional matrix adjust.
Identities = 138/419 (32%), Positives = 198/419 (47%), Gaps = 64/419 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ F+G I TPNID LA G+I N YA VC PSR+SL+TG++ HT ++G
Sbjct: 41 GYGDVGFNGQKLIKTPNIDKLAKEGMIFNQFYAGTSVCAPSRSSLLTGQHTGHTYIRGNK 100
Query: 151 IWGAEPRGV-PLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
G EP G P+ + L E L++ GY T A GKW LG E P +GF+ +GY
Sbjct: 101 --GVEPEGQQPIADSVTTLAEVLKKSGYVTAAFGKWGLGPVGSEGDPNKQGFDRFYGYNC 158
Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
++ YY L D + + + N EYA DL K+A+ + Q
Sbjct: 159 QSLAHRYYPEHLWDNSKKILLEGNKGLIHNK--------EYAPDLIQKKALSFVNAQDGK 210
Query: 266 KPLFLYLAHLAAHAG-----------NAGKHLEAPQETI------NQFQYIT-DPNRRTY 307
+P FL+L ++ HA GK E P + N Y + D T+
Sbjct: 211 QPFFLFLPYILPHAELVVPDDSLFRYYKGKFEEKPHKGADYGPGANGGGYASQDFPHATF 270
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
AAMV +LD VG V++AL++KG+ +N+++IF SDNG P VE + N G+ +RGV
Sbjct: 271 AAMVARLDLYVGQVMNALKKKGLDKNTLVIFTSDNG-PHVEGGADPRFFNSGAG--FRGV 327
Query: 368 KNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
K L+EGG++ P A W I+ + S + D LPT
Sbjct: 328 KRDLYEGGIREPFAARWPAAIKPGSK-SDYIGAFWDILPTF------------------- 367
Query: 427 QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLI-NIDEKKRTAAVRLDSWKLV 484
+ L N P+ R NIDG+ +L +++ L E+ AVR +WK V
Sbjct: 368 ---AELANAPAPR--NIDGISFTDALKGKAIQKKHDYLYWEFHEQGGRQAVRQGNWKAV 421
>gi|418530519|ref|ZP_13096442.1| sulfatase [Comamonas testosteroni ATCC 11996]
gi|371452238|gb|EHN65267.1| sulfatase [Comamonas testosteroni ATCC 11996]
Length = 440
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 160/328 (48%), Gaps = 30/328 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +G+ + TP++D LA G+ N YA VC+ +R +L+TG+Y + G++
Sbjct: 15 GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G+P LP L++ GY T IGKWHLG Y P G++ FG +G
Sbjct: 75 PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQLSGYDYFFGNHSG 133
Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
I Y+ H + +Q+S + + R Y T + EA + + E + D
Sbjct: 134 AIDYFTHKPGVGEQFSSDLWQGNEPVERT---------GYYTYILGDEATRYVHERKGQD 184
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
KP FL L A H G E +I + + + Y +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHSIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
G +N+I+IF SDNG + +P+ G K L EGG++VP +L WS
Sbjct: 245 DDSGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+IQ P+V Q+ DWLPTL AAG
Sbjct: 294 ARIQ--PQVQEQVTASFDWLPTLLAAAG 319
>gi|255530697|ref|YP_003091069.1| sulfatase [Pedobacter heparinus DSM 2366]
gi|255343681|gb|ACU03007.1| sulfatase [Pedobacter heparinus DSM 2366]
Length = 472
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 107/323 (33%), Positives = 164/323 (50%), Gaps = 32/323 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ D +G +IPTPNIDA+A G + Y + VC PSRA ++TG+Y G +
Sbjct: 40 GYVDFGCYGGKQIPTPNIDAIAKQGTRFTDAYVSASVCAPSRAGILTGRYQQRFGFEHNT 99
Query: 149 ----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
P + G+ +E+ + ++ GY T AIGKWH G + + PL RGF +G
Sbjct: 100 SNVLAPGYKITDVGMDPSEQTIGNEMQANGYKTIAIGKWHQGDEPKHF-PLNRGFNEFYG 158
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ G ++ + + RT N H + N + Y TD+FT +A I
Sbjct: 159 FTGGHRDFFAY----KGKRT---NEHALYNNKEIVPENEITYLTDMFTDKATSFITANK- 210
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
DKP F+YL++ A H + A ++ + ++ I D RR YAAM+ LDD +G V++
Sbjct: 211 DKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRRAYAAMMTSLDDGIGKVMAT 265
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
L+ + +N++IIF++DNG TV N N P RG+K + WEGG++V ++ W
Sbjct: 266 LKANQLDKNTLIIFINDNGGATV---------NSSDNGPLRGMKGSKWEGGIRVAMMMKW 316
Query: 384 SPQIQQNPRVSLQMMHISDWLPT 406
I N S + + D LPT
Sbjct: 317 PGHIAANKTDSRPVSSL-DILPT 338
Score = 45.1 bits (105), Expect = 0.13, Method: Compositional matrix adjust.
Identities = 24/68 (35%), Positives = 36/68 (52%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD+FT +A I DKP F+YL++ A H + A ++ + ++ I D RR
Sbjct: 193 YLTDMFTDKATSFITANK-DKPFFMYLSYNAVHT-----PMNAKKDLMERYASIADTGRR 246
Query: 75 TYAALTKS 82
YAA+ S
Sbjct: 247 AYAAMMTS 254
>gi|340367653|ref|XP_003382368.1| PREDICTED: arylsulfatase J-like [Amphimedon queenslandica]
Length = 504
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 130/436 (29%), Positives = 201/436 (46%), Gaps = 73/436 (16%)
Query: 76 YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV------ 127
+ A K + ++V +G+ D+ F I +PN D LA G++LN Y
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYLAKTGLVLNR 75
Query: 128 ------CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIG 181
C+PSRAS +TG++P H P ++ G L +P L+ YST +G
Sbjct: 76 HYVFKYCSPSRASFLTGRWPHHAHQWNLP--QSDMAGTNLNMTMIPAKLKAAKYSTHMVG 133
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD 241
KWH GFF Y P+ RGF++ G+L G DH ++ + ++ +D + +D
Sbjct: 134 KWHQGFFDPRYLPINRGFDTSSGFLGGG---EDH-MTQEAECAIDYWKNDAPDPRNGTYD 189
Query: 242 TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-IT 300
Y DL ++ + V+ PLF+YL HA L+AP E +N + T
Sbjct: 190 AY-NYGDDL-----TDIMNNHNVNDPLFIYLPLHNVHA-----PLQAPDEWLNIYSVNST 238
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
NRRTY AMV D+ G ++ L++KGM +N+III +DNG GS
Sbjct: 239 CENRRTYQAMVSVADNVTGYLVQLLKKKGMWDNTIIIISADNGGAACG----------GS 288
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD---- 414
NYP +G K + +EGGV+ A + + ++ R + +HI+DW T AG D
Sbjct: 289 NYPLKGCKGSFFEGGVRALAFVNGGLLPESRRGESTDGFIHIADWYTTFCKLAGVDPDDS 348
Query: 415 -TSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR- 472
T + P +DGLD W I G D+ ++ +++ D R
Sbjct: 349 GTGKFP--VDGLDVWPI------------ITGEDE--------KTQHEDIILGYDFFSRH 386
Query: 473 TAAVRLDSWKLVLGTQ 488
T A+ + +KL++G Q
Sbjct: 387 TGAIIMGDYKLLVGPQ 402
>gi|410628681|ref|ZP_11339399.1| sulfatase [Glaciecola mesophila KMM 241]
gi|410151685|dbj|GAC26168.1| sulfatase [Glaciecola mesophila KMM 241]
Length = 502
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 112/365 (30%), Positives = 176/365 (48%), Gaps = 31/365 (8%)
Query: 69 TDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
+PN R + L L G+ D F GS++I TPN+D LA + Y + V
Sbjct: 48 ANPNPRGSNKDAPNIVLILSDDAGYGDFGFQGSSQIRTPNLDNLAVQSTVFTQAYVSAAV 107
Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWG--------AEPRGVPLTERFLPEYLRELGYSTKA 179
C PSRA ++TGKY G + + G + G+PL +R + +YL GYST
Sbjct: 108 CGPSRAGILTGKYQQRFGFEENNVPGYMSDSGLTGDDMGLPLNQRTIGDYLTHFGYSTAL 167
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA 239
IGKWH G + + P RGFE +G+ G SY++ ++ S + + R +
Sbjct: 168 IGKWHQGNADK-FHPTKRGFEHFYGFRGGARSYFEFGPNNPVSYPED----RLERGFAH- 221
Query: 240 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
+ Y T EA++ I+ Q +P FL+L+ A H ++A +E + QF +
Sbjct: 222 YKESPHYLTQALADEAIKFIK-QNQREPFFLFLSFNAVHT-----PMDANKEDLAQFPQL 275
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ +R AAM +D +G V+++L G+ EN++++F +DNG P+ N
Sbjct: 276 SGKRQRV-AAMTLSMDREIGKVLASLDELGLSENTLLVFTNDNGGPSDT--------NAS 326
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
N P G K EGG++VP ++ P P + + D LPT + AAGG S +
Sbjct: 327 DNTPLSGTKANHLEGGIRVPFLMRWPAGNIGPNTYHEPISTLDLLPTFFAAAGGKKSAVA 386
Query: 420 LNIDG 424
++DG
Sbjct: 387 -DLDG 390
>gi|323456816|gb|EGB12682.1| hypothetical protein AURANDRAFT_60668 [Aureococcus anophagefferens]
Length = 534
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 160/585 (27%), Positives = 245/585 (41%), Gaps = 116/585 (19%)
Query: 75 TYAALTKS--TTLTLLIVYGWNDLSF----HGSNEIPTPNIDALAYNGIILNNMYAQPVC 128
T AL ++ T LL V G +D+ F + S ++ +P +D LA G+ L YA C
Sbjct: 15 TLGALARAADTKPHLLFVLG-DDVGFGDVGYSSPDVISPTLDRLAAEGLKLGRHYAYMWC 73
Query: 129 TPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRE-LGYSTKAIGKWHLGF 187
PSRA+L+TG YP TG+ GA+ +PL LP LR+ GY T A+GKWHLGF
Sbjct: 74 APSRAALLTGYYPSTTGVYS--TSGAQ-NALPLEFALLPGLLRDRAGYRTAAVGKWHLGF 130
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWD--TVGE 245
P RGF+ FG+L+G +Y + + + D RR D G
Sbjct: 131 MSEADLPERRGFDGFFGFLDGGEDHYSRVGAGAPGCDRVFDLWDSRRRGPATDDPSAFGR 190
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA-------------------------- 279
Y+ +L+ + A I +PLFLY A AH+
Sbjct: 191 YSAELYGEAAADAIRGHDAAEPLFLYAAFQVAHSPLEVPPAKYFDAYADAALCANASTKG 250
Query: 280 -GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
A +AP N + +R AAMV LD +V + +AL K M + ++++
Sbjct: 251 FACARPGADAPYRAPN---HDCGCDRLVVAAMVTALDAAVANMTTALASKNM-ADFVLVY 306
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMM 398
DNG P +E +W N RG K T +EGGV+ A + SP + + + +
Sbjct: 307 SGDNGGPEIE-------GHW--NGGLRGGKWTWFEGGVRPAAFVHSPLLTRRGWHN-GTL 356
Query: 399 HISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPS 458
H+ DWLPT AG +R DG DQW+SL R + + G
Sbjct: 357 HLVDWLPTFLALAG--AARDDDAFDGADQWASLAFGAAPAREATLIG------------- 401
Query: 459 RRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN--AIVES 516
VL+ G + VP F ++ +
Sbjct: 402 --PGVLVQ-------------------------------GNYKLYAVPPAGFGECSVFNT 428
Query: 517 KTYQ-SLQQLSQNIFLPISNIDKMRSTRQQATIHCGANP------APMTPSPCT-NGPCY 568
T+ +L + Q + P +++ + + + C A P A + +PC+ + PC
Sbjct: 429 ATWDCTLGRRGQPLAAP-ADVKADGNASRCPRLQCSAGPFDDPADAWLCSAPCSLDKPC- 486
Query: 569 LFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQSHEQPD 613
L+++ +DP E ++AS PD+ + + L+ H T+ P PD
Sbjct: 487 LYDVVSDPAEARDVASRFPDVVAAMAAALRNH--TIRPPYPIVPD 529
>gi|323454643|gb|EGB10513.1| hypothetical protein AURANDRAFT_62515 [Aureococcus anophagefferens]
Length = 981
Score = 164 bits (414), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 132/467 (28%), Positives = 204/467 (43%), Gaps = 71/467 (15%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTP-NIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
L L G NDL S+++P P NI L G+ L Y Q +C+P+RA+L++GK+
Sbjct: 78 VLVTLDDVGMNDL--WQSSDLPVPENIATLVAEGVELTAYYGQSMCSPARAALLSGKFVH 135
Query: 143 HTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
G WG + VPL +PE L+ GY T IGKW++G Y P
Sbjct: 136 KIGFSDK--WGPKREVTAFSNYSVPLGHVLMPEALKRNGYGTHGIGKWNIGHCNEAYLPW 193
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM-------RRNLSTAWDTVGEYAT 248
RGF++ GYL I Y DH+ S + + D+ R G Y T
Sbjct: 194 MRGFDTFVGYLTDGIGYTDHVADGPSSYLYDNDALDLYDFVSHERGVTKNGSAYAGAYTT 253
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP---------QETINQFQYI 299
++F A ++ ++P D PLFL+LAH H NAG A +
Sbjct: 254 EIFNARAETILREEPSDAPLFLWLAHHGVH-DNAGVADAATCVDGDDGDDTAIFALAASM 312
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
T P R + +K +D VG V AL + +++ SDNG T + N G
Sbjct: 313 TRPRFR-FGCALKAVDRGVGAVRRALDDRA--REYVLVVHSDNGGNTC-----GTHCN-G 363
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
+NYP RG K +EGG++VP +++SP + ++ +MH DWL T GGD +
Sbjct: 364 NNYPRRGAKFFEFEGGLRVPGVVYSPTLIPARRRGAAYAGLMHHVDWLATFVALGGGDAA 423
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAV 476
L + D +D W+ + + + P R++++ TA +
Sbjct: 424 DLDPSYDSVDHWAHI------------------AGRVSGAP--RDTIVFA--ASGDTATI 461
Query: 477 RLDSWKLV--------LGTQENGTMDGYYGQTRSNKVPLLNFNAIVE 515
R ++KL+ T NGT + Y + SN++ L ++ + +
Sbjct: 462 RSGNYKLIHHALNATYFHTWFNGTANSYPWKCNSNEMVSLFYDVVAD 508
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 18/41 (43%), Positives = 25/41 (60%), Gaps = 1/41 (2%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAG 52
G Y T++F A ++ ++P D PLFL+LAH H NAG
Sbjct: 248 AGAYTTEIFNARAETILREEPSDAPLFLWLAHHGVH-DNAG 287
>gi|241756057|ref|XP_002401379.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215508427|gb|EEC17881.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 287
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 168/321 (52%), Gaps = 38/321 (11%)
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V ++D S+G VI AL ++ ML NSI+IF +DNG T N+G N+P RG K T
Sbjct: 2 VYEMDQSIGLVIEALHKRNMLGNSIVIFSTDNGG-----LPTGLGSNYGFNWPLRGFKAT 56
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
LWEGGV+ A +WSP ++++ R+S QMMHI+DWLPTLY+AAGGD SRL + IDG+D
Sbjct: 57 LWEGGVRGAAFIWSPLLRRSGRISSQMMHITDWLPTLYSAAGGDVSRLGV-IDGID---- 111
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
W +L + PS R+ +L+NID ++A+ + + K+ L +
Sbjct: 112 -----------------MWEALCEDLPSPRHEILLNIDSAANSSALIVGNRKVFLTSSFP 154
Query: 491 GTMDGYYGQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIH 549
+ + + P+ + +++S+T L+ F + + + R++ ++
Sbjct: 155 DELRYHRSEVPGGTRPVEGLDEMMLKSRTGTVLRD-----FYKVEKLSIRPNWRKEVVVN 209
Query: 550 CGA-NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVPQS 608
C NP + N Y F++ DPCE N+A+ +L E L + T++P +
Sbjct: 210 CSQYNPG---SNFVANSSPYYFDIQRDPCELENLAAVNATEVDELMEKLAAYAATMIPPA 266
Query: 609 HEQPDLVQADPKRFNDTWSPW 629
++ D + PK + W+PW
Sbjct: 267 NKPMD-PRGLPKYNHGLWAPW 286
>gi|404450774|ref|ZP_11015752.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
gi|403763554|gb|EJZ24502.1| arylsulfatase A family protein [Indibacter alkaliphilus LW1]
Length = 469
Score = 163 bits (413), Expect = 2e-37, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 180/365 (49%), Gaps = 48/365 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G D+ +G IPTPN+D +A G+ N+ YA VC PSRASLMTG++ HT ++G
Sbjct: 42 GIGDIGPYGQTIIPTPNLDKMASTGMKFNDFYAGSTVCAPSRASLMTGQHTGHTLVRGNG 101
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ E + LPE L+E GY+ GKW LG +P G++ G+L+ V
Sbjct: 102 EYPLES-----GRKILPERLKEAGYTNAMFGKWGLGLANTHSSPEKVGWDHFVGHLHHVS 156
Query: 211 SYY------DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
++Y D I+ + SR G +YA DLF + +++ IE Q
Sbjct: 157 AHYQKPDSLDCIIEGEISRMPVPKG---------------KYANDLFVENSIEFIETQTD 201
Query: 265 DKPLFLYLAHLAAHA-------------GNAGKHLEAPQETIN--QFQYITDPN-RRTYA 308
++P FLYL+ HA +GK + P E + Y P + YA
Sbjct: 202 EQPFFLYLSLTVPHAELLVPERYFQPFLDESGKSI-FPSEKAHPEGLHYGAQPYPKAAYA 260
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
A+V +DD VG V+SAL++ G+ EN+++IF SDNG R + + S+ +RGVK
Sbjct: 261 ALVSSIDDYVGRVMSALEKIGIAENTLVIFTSDNGTHVEGGRRMEDVDYFKSSAHFRGVK 320
Query: 369 NTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
L++GG+KVP ++ W +I++ + D PTL AG + R + DG+
Sbjct: 321 RDLFDGGIKVPLMVNWPGKIKEGSTSDFRGA-FWDLFPTLSHLAGIEVDR--DDTDGISF 377
Query: 428 WSSLL 432
+LL
Sbjct: 378 LPTLL 382
Score = 38.9 bits (89), Expect = 7.6, Method: Compositional matrix adjust.
Identities = 16/36 (44%), Positives = 24/36 (66%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 48
G+YA DLF + +++ IE Q ++P FLYL+ HA
Sbjct: 181 GKYANDLFVENSIEFIETQTDEQPFFLYLSLTVPHA 216
>gi|346992020|ref|ZP_08860092.1| arylsulfatase [Ruegeria sp. TW15]
Length = 522
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 139/469 (29%), Positives = 213/469 (45%), Gaps = 72/469 (15%)
Query: 78 ALTKSTTLTLLIVY----GWNDL-SFH-GSNEIPTPNIDALAYNGIILNNMYAQPVCTPS 131
A+ +S +L++ GW L S+H G I TPN+D LA G L + YAQP CT
Sbjct: 22 AVAQSDKPNILVIMADDVGWASLGSYHQGVKSIQTPNLDQLAAEGARLTDYYAQPSCTAG 81
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
R++ +TG++P+ TGM + G +P G+ + L L+ +GY+T GK HLG +
Sbjct: 82 RSAFITGQFPVRTGMHTVGLPG-DPVGLSDEDPTLANILKSMGYTTGQFGKNHLGDLNK- 139
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL--STAWDTVGEYATD 249
+ P +GF+ ++G+L ++ ++ + E RN+ S A DTV E
Sbjct: 140 FLPTAKGFDEYWGWLYH-LNAMEYTTDPDWPDDPEFAAQFGPRNIIHSFATDTVDETVDP 198
Query: 250 LFTKEAVQ-LIEDQPV--------------------------DKPLFLYLAHLAAHAGNA 282
+ Q +I+D P + P F+++A AH
Sbjct: 199 RWGPVGKQRIIDDGPAPPERQKTLDDEVTAHTIDFINRAVDSETPFFVWMAPARAHVWT- 257
Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
HL +++ + R ++K+LDD+VG V + L+ G+ +N+I+IF SDN
Sbjct: 258 --HLSP------EYEAMLGDGRGLQDVVMKELDDNVGKVTAHLEELGIADNTIVIFTSDN 309
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
G T+ + + G P+ G K T WEGG +VPAI+ P +V + D
Sbjct: 310 GPETMTWPD-------GGTTPFHGEKGTTWEGGFRVPAIIRWPGKVAEGQVLNGIFDGMD 362
Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
W+PTL AAGG + LP + LL ++DG +Q S L TPS R
Sbjct: 363 WMPTLVAAAGG-SQNLPAD----------LLEGFEGYQVHLDGYNQLSYLTGETPSNRKE 411
Query: 463 VLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFN 511
++ E A+R + WK QE G+ G PLL FN
Sbjct: 412 IVYY--EGTDLQAIRYNDWKAHFTIQEK----GWAGPKEELNAPLL-FN 453
>gi|326433598|gb|EGD79168.1| hypothetical protein PTSG_09899 [Salpingoeca sp. ATCC 50818]
Length = 515
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 182/386 (47%), Gaps = 45/386 (11%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGS--------NEIPTPNIDALAYNGIILNNMYAQP 126
A T + L+I GW + H + +++ TP + L G+ L YA
Sbjct: 22 VARTSQPNIVLIIGDDVGWANTELHCNISDPAQVCSDVKTPFLLQLKDEGVHLLRHYAYK 81
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
C+P+R++LM+G+ PIH P + GV L LPE L +GY+T +GKWHLG
Sbjct: 82 FCSPTRSALMSGRLPIHVNEVNPTY--IKGGGVSLNFTMLPEKLSRVGYATHHVGKWHLG 139
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV---ELNGHDMRRNLSTAWDTV 243
P RGF+S GYL G D + +TV G D+ N A+
Sbjct: 140 MSSHSRVPTGRGFDSSLGYLLGA--------EDHWKQTVTQGNATGVDLWLNEGPAYLRN 191
Query: 244 GEYATDLFTKEAVQLIEDQPV---DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
EY LF + AV++I D D PLFLY+ H+ E P+E +++ I
Sbjct: 192 NEYGDLLFGRRAVEVIHDHAKNHRDSPLFLYVGLQVNHS-----PAEVPKEYQDRYPGI- 245
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
+R M +DD++G + +AL M E+++++++SDNG + N G+
Sbjct: 246 QVDRAIVNGMSSVMDDAIGKITTALWETTMWEDTLVVYISDNGG------ASGINANNGN 299
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAGG----D 414
NYP RG K+T +EGGV+ A++ + + R S + MH DW TL AG D
Sbjct: 300 NYPLRGGKHTDFEGGVRTAALVSGGFLPDDQRGSTRDGYMHACDWYTTLAKLAGASPADD 359
Query: 415 TSRLPLNIDGLDQWSSLLLNTPSRRN 440
LP +DG+D W + N S R+
Sbjct: 360 VEGLP-GVDGMDMWPMIANNEKSPRH 384
>gi|444722183|gb|ELW62881.1| N-acetylgalactosamine-6-sulfatase [Tupaia chinensis]
Length = 764
Score = 163 bits (412), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 174/364 (47%), Gaps = 37/364 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
T T L GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 65 TPTALFFMGWGDLGVYGEPSRETPNLDQMAAEGMLFPNFYSANPLCSPSRAALLTGRLPI 124
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
TG A R G+P +E LPE L+ GY +K +GKWHLG R ++
Sbjct: 125 RTGFY---TTNAHARNAYTPQEIVGGIPSSEHLLPELLKGAGYVSKIVGKWHLG-HRPQF 180
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G + T T +
Sbjct: 181 HPLRHGFDEWFGAPNCHFGPYDNKARPNIPVYRDWEMVGR-FYEEFPISLKTGEANLTQI 239
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+ +EA+ I+ Q +P FL+ A A HA + A Q + Q R Y
Sbjct: 240 YLQEALDFIKRQAGRRPFFLHWAIDATHA-----PVYASQPFLGTSQ------RGRYGDA 288
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG +++ L+ G+ ++ + F SDNGA + GSN P+ K T
Sbjct: 289 VREIDDSVGKILALLRALGIGNSTFVFFTSDNGAALISAPRQGKC---GSNGPFLCGKQT 345
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
+EGG++ PAI W P +VS Q+ + D T + AG + R IDGLD
Sbjct: 346 TFEGGMREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLKPPSDR---EIDGLDLL 402
Query: 429 SSLL 432
++L
Sbjct: 403 PTIL 406
>gi|431899642|gb|ELK07596.1| Arylsulfatase J [Pteropus alecto]
Length = 637
Score = 162 bits (411), Expect = 3e-37, Method: Compositional matrix adjust.
Identities = 102/270 (37%), Positives = 146/270 (54%), Gaps = 22/270 (8%)
Query: 121 NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAI 180
+ + P C S +L +Y IHTG+Q I +P +PL LP+ LRE GY+T +
Sbjct: 94 GVRSSPECQGSPDTL---QYQIHTGLQHSVIRPTQPNCLPLDAATLPQKLREAGYATHMV 150
Query: 181 GKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
GKWHLGF+R E P RGF+S FG L G YY H D + + G+D+ N AW
Sbjct: 151 GKWHLGFYRTECLPTRRGFDSFFGSLLGSGDYYTHYKCD----SPGVCGYDLHENDRAAW 206
Query: 241 D-TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
G Y+T ++T+ +++ KPLFLY+A+ A H+ L+AP + ++ I
Sbjct: 207 GRGGGAYSTQMYTQRVQRILASHDPGKPLFLYIAYQAVHS-----PLQAPGRYLGHYRSI 261
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ RR YAAM+ LD++V V AL+ G +NS+I++ SDNG G
Sbjct: 262 ANARRRRYAAMLSCLDEAVANVTLALKAHGFYDNSVIVYSSDNGGQPAAG---------G 312
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
SN+P RG K T WEGGV+ + SP ++Q
Sbjct: 313 SNWPLRGRKGTYWEGGVRAVGFVHSPLLRQ 342
>gi|160891516|ref|ZP_02072519.1| hypothetical protein BACUNI_03967 [Bacteroides uniformis ATCC 8492]
gi|317478375|ref|ZP_07937539.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858923|gb|EDO52354.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905534|gb|EFV27324.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 525
Score = 162 bits (410), Expect = 4e-37, Method: Compositional matrix adjust.
Identities = 117/355 (32%), Positives = 182/355 (51%), Gaps = 43/355 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM-QGP 149
GW D+ + G+ ++ TPNIDALA G+ + Y + PSRA ++TG Y G
Sbjct: 43 GWGDVGYQGAVDVSTPNIDALARRGVQFSQGYVSCSISGPSRAGILTGVYQQRFGFYNNL 102
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
W P G + L E +R+ GY+T +GKWH+ E +P RGF+ +G+ +
Sbjct: 103 HPWAKIPEG----QSTLGEMVRDCGYATGFVGKWHMAD-SPEQSPNRRGFDQFYGFWSDT 157
Query: 210 ISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
YY D + Y +++ L + GEY TD FT+EAV+ I D+
Sbjct: 158 HDYYRSTDKPGVELYDFCPLYRNGEIQPPLHES----GEYITDCFTREAVEFI-DKHASS 212
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVISA 324
P L L++ A H+ + P+ +N+ + +R+ +AAMV LDD +G V+ +
Sbjct: 213 PFLLCLSYNAVHS-----PWQVPEHYVNRLEGRRFHHEDRKVFAAMVLALDDGIGRVMES 267
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETS---NYRNWG-----SNYPYRGVKNTLWEGGV 376
L++ G+ EN++ I +SDNG+P + E S Y++ G S P+RG K +EGG+
Sbjct: 268 LRKNGLEENTLFILISDNGSPRGQGIECSTGYEYKDRGNTTMSSPGPFRGYKADTYEGGI 327
Query: 377 KVPAIL-WSPQIQQ-----NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+VP I+ W ++ Q NP +SL D PT+ A GG TSR ++DG+
Sbjct: 328 RVPYIMSWPSELPQGMVYDNPVISL------DIFPTVMQAVGG-TSRQKYSLDGV 375
>gi|119587161|gb|EAW66757.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_a [Homo
sapiens]
Length = 708
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 95 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369
Query: 429 SSLL 432
+LL
Sbjct: 370 PTLL 373
>gi|241998988|ref|XP_002434137.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215495896|gb|EEC05537.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 167
Score = 162 bits (410), Expect = 5e-37, Method: Composition-based stats.
Identities = 81/163 (49%), Positives = 105/163 (64%), Gaps = 5/163 (3%)
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
LFTK+A+ LI+D KP+FLY +H A H G+ LEAP I F YI + NR +A
Sbjct: 1 LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V +LD SVG V+ AL ++GML+NSII+F +DNG S Y N+P RG K
Sbjct: 61 AVFELDKSVGLVMEALNKRGMLKNSIIVFSTDNGGLPWGKEPNSGY-----NWPLRGSKF 115
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
TLWEGGV+ A +WSP + + R+S Q+MHI+DWLPT Y+AAG
Sbjct: 116 TLWEGGVRGAAFVWSPLLSKTGRLSNQLMHITDWLPTFYSAAG 158
Score = 60.1 bits (144), Expect = 4e-06, Method: Composition-based stats.
Identities = 32/73 (43%), Positives = 41/73 (56%), Gaps = 4/73 (5%)
Query: 19 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 78
LFTK+A+ LI+D KP+FLY +H A H G+ LEAP I F YI + NR +A
Sbjct: 1 LFTKKALSLIKDHDPAKPMFLYFSHQAVHCGDYKVKLEAPASAIAHFPYIKEQNRSIHAG 60
Query: 79 ----LTKSTTLTL 87
L KS L +
Sbjct: 61 AVFELDKSVGLVM 73
>gi|189069200|dbj|BAG35538.1| unnamed protein product [Homo sapiens]
Length = 522
Score = 162 bits (410), Expect = 5e-37, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAGGLLFPNFYSANPLCSPSRAALLTGRLPIR 94
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 95 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369
Query: 429 SSLL 432
+LL
Sbjct: 370 PTLL 373
>gi|351712929|gb|EHB15848.1| N-acetylgalactosamine-6-sulfatase [Heterocephalus glaber]
Length = 482
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 120/358 (33%), Positives = 172/358 (48%), Gaps = 43/358 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L++G+ PI +G
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLSGRPPIRSGFY--- 58
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +ER LP L+E GY+TK +GKWHLG R ++ PL GF+
Sbjct: 59 TTNAHARNAYTPQEIVGGIPDSERLLPSLLKEAGYATKIVGKWHLG-HRPQFHPLKHGFD 117
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ + E+ G + N+ T + T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKAKPNIPVYKDWEMVGRFYEEFPINVKTGESNL----TQIYLQEA 173
Query: 256 VQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
+ I+ Q +P FLY A A HA ++ R Y V+++
Sbjct: 174 LDFIKRQQAARRPFFLYWAVDATHA-----------PVYASRPFLGSSQRGRYGDAVREI 222
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
DDSVG ++S LQ G+ E++ + F SDNGA V GSN P+ K T +EG
Sbjct: 223 DDSVGKMLSLLQVLGLAESTFVFFTSDNGAALVSAPSQ------GSNGPFLCGKQTTFEG 276
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
G++ PAI W P+ +VS Q+ I D T AG R + IDG +LL
Sbjct: 277 GMREPAIAWWPRHIPPGQVSHQLGSIMDLFTTSLALAGLKPPRDRV-IDGFSLLPALL 333
>gi|372210598|ref|ZP_09498400.1| n-acetylgalactosamine-4-sulfatase [Flavobacteriaceae bacterium S85]
Length = 468
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 107/349 (30%), Positives = 170/349 (48%), Gaps = 41/349 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS PTPN+D LA G++ Y VC PSRA L+TG+Y G +
Sbjct: 37 GYFDFGFQGSKTFPTPNLDQLAKEGMVFKQAYTTAAVCGPSRAGLLTGRYQQRFGFEENN 96
Query: 151 IWG---------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
+ G + G+PL E+ + +YL +LGY + +GKWH+G R Y PL RGF
Sbjct: 97 VPGYMSKSSKLLGDDMGLPLDEKTMADYLGKLGYQSIVLGKWHMGNADR-YHPLKRGFTE 155
Query: 202 HFGYLNGVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQL 258
+G+ G S+Y +D+ +E+ + +Y T A
Sbjct: 156 FYGFRGGARSFYPLTQKQAADKPEDRLEIG--------YKKYQEPKKYLTYDLADAACDF 207
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I D+ KP F+Y++ A H+ ++A QE + + + +T R+ AAM LD +
Sbjct: 208 I-DRNKKKPFFMYVSFNAVHS-----PMQATQEDLKKIKGLTG-KRKVLAAMTLALDKAC 260
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G ++ L+ +G+ ++++I+F +DNG P NYP G K+ EGG++V
Sbjct: 261 GQILEKLKEEGLEDDTLIVFSNDNGGPD---------GTKTCNYPLSGCKSNNLEGGIRV 311
Query: 379 PAILWSPQIQQNPRVSLQM-MHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P ++ P++ Q P S + + D LPT AGG ++ +DG+D
Sbjct: 312 PFVMKLPKVIQ-PGSSYEYPVSTLDLLPTFVNIAGGKADKIA-QLDGVD 358
>gi|4503899|ref|NP_000503.1| N-acetylgalactosamine-6-sulfatase precursor [Homo sapiens]
gi|462148|sp|P34059.1|GALNS_HUMAN RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|618426|gb|AAC51350.1| N-acetylgalactosamine 6-sulphatase [Homo sapiens]
gi|870751|dbj|BAA04535.1| N-acetylgalactosamine 6-sulfate sulfatase [Homo sapiens]
gi|33440495|gb|AAH56151.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|37589093|gb|AAH50684.2| Galactosamine (N-acetyl)-6-sulfate sulfatase [Homo sapiens]
gi|119587163|gb|EAW66759.1| galactosamine (N-acetyl)-6-sulfate sulfatase (Morquio syndrome,
mucopolysaccharidosis type IVA), isoform CRA_c [Homo
sapiens]
Length = 522
Score = 162 bits (409), Expect = 6e-37, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLCSPSRAALLTGRLPIR 94
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 95 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 256 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 311 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369
Query: 429 SSLL 432
+LL
Sbjct: 370 PTLL 373
>gi|440908775|gb|ELR58760.1| N-acetylgalactosamine-6-sulfatase, partial [Bos grunniens mutus]
Length = 525
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 175/362 (48%), Gaps = 37/362 (10%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 38 LLLMDDMGWGDLGVYGEPSRETPNLDRMAVEGMLFPNFYTANPLCSPSRAALLTGRLPIR 97
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+G G P+ G+P +E LP L+ GY++K +GKWHLG R ++ PL
Sbjct: 98 SGFYTTNGHARNAYTPQEIVGGIPDSELLLPALLKGAGYASKIVGKWHLGH-RPQFHPLK 156
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 157 HGFDEWFGSPNCHFGPYDNKARPNIPVYRDQEMVGRFYEEFPINLKTGEANL----TQIY 212
Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA++ I+ Q +P FLY A A HA ++ R Y
Sbjct: 213 LQEALEFIQRQQAAHRPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDA 261
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
+++LDDSVG ++ L+ + EN+ + F SDNGA + GSN P+ K T
Sbjct: 262 IRELDDSVGRILRLLRDLSIAENTFVFFTSDNGAALISAPRQ------GSNGPFLCGKQT 315
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P +VS Q+ I D T + AG + R IDGLD +
Sbjct: 316 TFEGGMREPAIAWWPGHIPAGQVSHQLGSIMDLFTTSLSLAGLEPPR-DRAIDGLDLLPA 374
Query: 431 LL 432
+L
Sbjct: 375 ML 376
>gi|264677229|ref|YP_003277135.1| sulfatase [Comamonas testosteroni CNB-2]
gi|262207741|gb|ACY31839.1| sulfatase [Comamonas testosteroni CNB-2]
Length = 440
Score = 161 bits (408), Expect = 9e-37, Method: Compositional matrix adjust.
Identities = 110/328 (33%), Positives = 159/328 (48%), Gaps = 30/328 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +G+ + TP++D LA G+ N YA VC+ +R +L+TG+Y + G++
Sbjct: 15 GWADLGVYGATDFKTPHLDRLAAQGVRFNQAYANSAVCSATRIALITGRYQYRLQAGLEE 74
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G+P LP L++ GY T IGKWHLG Y P G++ FG +G
Sbjct: 75 PLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPSYGPQRSGYDYFFGNHSG 133
Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVD 265
I Y+ H + +Q+S + + R Y T + EA + + E + D
Sbjct: 134 AIDYFTHKPGVGEQFSPDLWQGSEPVERT---------GYYTYILGDEATRYVHERKGQD 184
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
KP FL L A H G E I + + + Y +V+ LD +VG V+ AL
Sbjct: 185 KPFFLSLHFTAPHWPWEGPDDEHVAHNIKDLFHYDGGSLKKYGEIVEALDKAVGQVLQAL 244
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
G +N+I+IF SDNG + +P+ G K L EGG++VP +L WS
Sbjct: 245 DDTGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLEGGIRVPTLLRWS 293
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+IQ P+V Q+ DWLPTL AAG
Sbjct: 294 ARIQ--PQVQEQVTVSFDWLPTLLAAAG 319
>gi|340367645|ref|XP_003382364.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 128/432 (29%), Positives = 200/432 (46%), Gaps = 65/432 (15%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
+ A K + +++ G+ D+ F I +PN D LA G++LN Y C+PSR
Sbjct: 15 CFVATAKLPNIVFVLIDDLGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCSPSR 73
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
ASL+TG++P H P + G + LP L+ YST +GKWHLGFF Y
Sbjct: 74 ASLLTGRWPHHAHQWNPLM--DSTIGTNINMTMLPAKLKAANYSTHMVGKWHLGFFDPRY 131
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATD 249
P+ RGF++ G+ DH ++++ +++ N D R A++ Y D
Sbjct: 132 LPINRGFDTSTGFFGCC---EDH-MNEKSGCSIDYWKNNAPDPRNGTYDAYN----YRDD 183
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
L ++ + + PLFLYL HA +AP+E ++ + T R T
Sbjct: 184 L-----TDVMSNYNTENPLFLYLPLHNVHA-----PFQAPKEWLDIYPANSTCEKRHTLQ 233
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGV 367
AM+ D+ G ++ L++KG+ +N+I++ +DN GAP GSNYP +G
Sbjct: 234 AMISVADNVTGHLVELLKKKGIWDNTIMVISADNGGAPCA-----------GSNYPLKGC 282
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPL 420
K T +EGGV+ A + + ++ R + +HI+DW PT AG D T + P
Sbjct: 283 KTTFFEGGVRSVAFVNGGLLPESRRGQSTDGFIHIADWYPTFCKLAGVDPDDSGTGKFP- 341
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
+DGLD W + ++ I L N RR A+ + +
Sbjct: 342 -VDGLDVWPIITGENEKTQHEEI-------VLGYNFDFRRPG----------QGAIIVGN 383
Query: 481 WKLVLGTQENGT 492
+KLV+G QE G
Sbjct: 384 YKLVVGLQEIGC 395
>gi|410215590|gb|JAA05014.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410254514|gb|JAA15224.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410288780|gb|JAA22990.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
gi|410330541|gb|JAA34217.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Pan troglodytes]
Length = 522
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 173/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 35 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 94
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 95 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 150
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 151 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 206
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 207 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 255
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 256 DAVREIDDSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 310
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 311 QTTFEGGMREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 369
Query: 429 SSLL 432
+LL
Sbjct: 370 PTLL 373
>gi|432452061|ref|ZP_19694315.1| arylsulfatase [Escherichia coli KTE193]
gi|433035723|ref|ZP_20223410.1| arylsulfatase [Escherichia coli KTE112]
gi|430977211|gb|ELC94062.1| arylsulfatase [Escherichia coli KTE193]
gi|431545828|gb|ELI20473.1| arylsulfatase [Escherichia coli KTE112]
Length = 551
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 140/445 (31%), Positives = 214/445 (48%), Gaps = 69/445 (15%)
Query: 82 STTLTLLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTG 138
+ + LL GW D+ F+G PTP+IDA+A G+IL + Y+QP +P+RA+++TG
Sbjct: 87 NVVIFLLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTG 146
Query: 139 KYPIHTGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
+Y IH G+ PP++G +P G+ LT LP+ L + GY T+AIGKWH+G +E P
Sbjct: 147 QYSIHHGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNV 202
Query: 198 GFESHFGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-W 240
GF+ G+ N V Y H+ LS S ++ + H MR A
Sbjct: 203 GFDDFRGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAMRGGEQQAIA 261
Query: 241 DTVGEYATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
D +Y DL + + V+ ++ DKP FLY H N N
Sbjct: 262 DITPKYMEDLDQRWMEYGVKFLDKMAKSDKPFFLYYGTRGCHFDN----------YPNAK 311
Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
+ P R +Y + +++D + AL++ G L+N++I+F SDNG P E
Sbjct: 312 YAGSSPARTSYGDCMVEMNDVFANLYKALEKNGQLDNTLIVFTSDNG-PEAEVPPH---- 366
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
P+RG K + WEGGV+VP + W IQ PR S ++ ++D PT AG
Sbjct: 367 ---GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPG 421
Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
+++ ++L+ T + IDG+DQ +S L T + N + + AA
Sbjct: 422 AKV----------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAA 465
Query: 476 VRLDSWKLVLGTQE--NGTMDGYYG 498
VR+D +K + Q+ T GY G
Sbjct: 466 VRMDEFKYHVLIQQPYAYTQSGYQG 490
>gi|340367643|ref|XP_003382363.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 178/367 (48%), Gaps = 46/367 (12%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSR 132
+AA K + ++V +G+ D+ F I +PN D LA G++LN Y C+PSR
Sbjct: 15 CFAATAKVPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCSPSR 73
Query: 133 ASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
AS +TG++P H P + G L LP L+ Y+T +GKWHLGFF Y
Sbjct: 74 ASFLTGRWPHHAHQWNPLM--DNMIGTNLNMTMLPAKLKAANYATHMVGKWHLGFFDPRY 131
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATD 249
P+ RGF++ G+L G DH ++++ +++ N D R A++ Y D
Sbjct: 132 LPINRGFDTSTGFLGGG---EDH-MNEKSGCSIDYWKNNAPDPRNGTYDAYN----YRDD 183
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYA 308
L ++ + D PLF YL H L+AP+E ++ + T R
Sbjct: 184 L-----TDVMNNHNADNPLFFYLPLHNVHT-----PLQAPKEWLDIYPANSTCEKRHILQ 233
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
AMV D+ G ++ L++KGM +N+I++ +DNG GSNYP +G K
Sbjct: 234 AMVSVADNVTGHLVELLKKKGMWDNTIMVISADNGGAPCS----------GSNYPLKGCK 283
Query: 369 NTLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLN 421
+T +EGG++ A + + ++ R + +HI+DW T AG D T + P
Sbjct: 284 STFFEGGIRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFP-- 341
Query: 422 IDGLDQW 428
+DGLD W
Sbjct: 342 VDGLDVW 348
>gi|156368526|ref|XP_001627744.1| predicted protein [Nematostella vectensis]
gi|156214663|gb|EDO35644.1| predicted protein [Nematostella vectensis]
Length = 157
Score = 161 bits (407), Expect = 1e-36, Method: Composition-based stats.
Identities = 66/124 (53%), Positives = 90/124 (72%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+ +H ++ TPNID LA G++L N Y QP+CTP+R + ++G+YPIHTG+Q I
Sbjct: 33 GWSDVGYHNITDLKTPNIDRLAGEGVVLENYYVQPICTPARGTFLSGRYPIHTGLQHSNI 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
EP G+PL LP+ L++ GYST A+GKWHLGFF +EYTPLYRGF++ FGY +G
Sbjct: 93 HETEPFGLPLDFTLLPQKLKKAGYSTHAVGKWHLGFFEKEYTPLYRGFDTFFGYYSGSED 152
Query: 212 YYDH 215
+Y H
Sbjct: 153 HYTH 156
>gi|332666885|ref|YP_004449673.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
gi|332335699|gb|AEE52800.1| N-acetylgalactosamine-6-sulfatase [Haliscomenobacter hydrossis DSM
1100]
Length = 443
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 117/347 (33%), Positives = 163/347 (46%), Gaps = 43/347 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT--GMQG 148
G+ DLS +G + TPNID LA GI N Y A P+CTP+R + MTG+YP T G+
Sbjct: 42 GYGDLSGYGRKDFLTPNIDKLAAQGIKFVNAYSAAPLCTPTRTAFMTGRYPARTPVGLME 101
Query: 149 PPIWGAEPRGVPLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
P V LT F + +R GY T IGKWHLGF + +P+ GF+ FG
Sbjct: 102 PLTPSKRDSTVGLTAAFPSVATLMRASGYETALIGKWHLGFLPQN-SPVKNGFDYFFGIH 160
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
+G Y H R HD+ N + Y TDLFT++AV ++ Q +K
Sbjct: 161 SGAADYISHKTGPAGRRI-----HDLYENDQAVYPE--GYLTDLFTQKAVTFLK-QKHNK 212
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR------TYAAMVKKLDDSVGT 320
P FL L + AAH P + N Y+ + R YAAM+ +LD+ +G
Sbjct: 213 PFFLTLTYNAAHW---------PWQGPNDKPYVDSVDLRIGGSAAVYAAMMNRLDEGIGE 263
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
+I L + + +I+IF +DNG + N K LWEGG++VPA
Sbjct: 264 IIKTLDEEQLSNQTIVIFTNDNGG-----------EKYSDNGGLAKAKGALWEGGIRVPA 312
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR-LPLNIDGLD 426
I+ P + Q+ DW T+ +A G + PL DG+D
Sbjct: 313 IVKWPGKITAGSSTQQVAITMDWTKTILSAGGAKMDKNFPL--DGID 357
>gi|403260898|ref|XP_003922887.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Saimiri boliviensis
boliviensis]
Length = 482
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G + N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +E+ LPE L+E GY TK +GKWHLG R ++ PL GF+
Sbjct: 59 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKEAGYVTKIVGKWHLG-HRPQFHPLKHGFD 117
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q +P FLY A A HA ++ R Y V+++D
Sbjct: 174 LDFIKRQARRRPFFLYWAVDATHA-----------PVYASKPFLGTSRRGRYGDAVREMD 222
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DS+G ++ LQ + +++ + F SDNGA + E GSN P+ K T +EGG
Sbjct: 223 DSIGKMLQLLQDLRVADSTFVFFTSDNGAALISAPEQG-----GSNSPFLCGKQTTFEGG 277
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL+ +LL
Sbjct: 278 MREPALAWWPGHITPGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPALL 333
>gi|340378605|ref|XP_003387818.1| PREDICTED: hypothetical protein LOC100637044 [Amphimedon
queenslandica]
Length = 2318
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 109/351 (31%), Positives = 164/351 (46%), Gaps = 34/351 (9%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L+ +G+ D SF I TPN L NG+ILN Y C+PSRAS +TG++P H
Sbjct: 1144 VFVLVDDWGFADASFRNP-AIKTPNFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHH 1202
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
P G + +T LP L+ GY+T +GKWH G + Y P+ RGF++
Sbjct: 1203 VHQWNPTPLGMVGTNINMT--MLPAKLKNAGYATHMVGKWHQGLYDPAYLPINRGFDTSS 1260
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G+L +++ + V+ +D + D+ ++ K+ + +
Sbjct: 1261 GFLQAEEGHFNQTI----GCAVDFWKNDAPDTRNGTCDSY------IYNKDLTTVFNEHD 1310
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTVI 322
KPLFLYL HA +EAPQE ++ + T RR AMV D+ G V+
Sbjct: 1311 ASKPLFLYLPLHNVHA-----PIEAPQEWLDLYPANSTCDTRRILQAMVSVADNVTGHVV 1365
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
+ L+ M +N++++ +DNG + GSNYP +G K T +EGGV+V A
Sbjct: 1366 NLLRENDMWDNTLMVVSADNGGADCK----------GSNYPLKGSKYTFFEGGVRVIAFA 1415
Query: 383 WSPQIQQNPR--VSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGLDQW 428
I R + +HI+DW T AG D+ + +DGLD W
Sbjct: 1416 SGGLIPAGRRGKSTDGFIHIADWYTTFCKMAGVDHNDSGKGKFPVDGLDVW 1466
>gi|340367649|ref|XP_003382366.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 495
Score = 161 bits (407), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 173/363 (47%), Gaps = 40/363 (11%)
Query: 76 YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
+ A K + ++V +G+ D+ F I +PN D LA G++LN Y C PSRA
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
SL+TG++P H + G L P L+ YST +GKWH GFF Y
Sbjct: 76 SLLTGRWPHHVYQWN--LATDATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133
Query: 194 PLYRGFESHFGYLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
P+ RGF++ G+L G + + I + Y + N D R G Y ++
Sbjct: 134 PINRGFDTSSGFLCGSEDHMTQNAICAIDYWKN---NAPDPRN---------GTYDAYIY 181
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAM 310
+ +I D+PLFLYL H L+AP+E ++ + + ++R T AM
Sbjct: 182 RDDLTDIINSHNTDEPLFLYLPLHNVHLP-----LQAPKEWLDIYPANSTCDKRHTLQAM 236
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V D+ G ++ L++KG+ +N+I++ +DNG GSNYP RG K T
Sbjct: 237 VSVADNVTGHLVELLKKKGLWDNTIMVISADNGGAHCS----------GSNYPLRGCKET 286
Query: 371 LWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGL 425
L+EGGV+ A + + ++ R + +HI+DW T AG D+ ++DGL
Sbjct: 287 LFEGGVRSLAFVNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFSVDGL 346
Query: 426 DQW 428
D W
Sbjct: 347 DVW 349
>gi|221068581|ref|ZP_03544686.1| sulfatase [Comamonas testosteroni KF-1]
gi|220713604|gb|EED68972.1| sulfatase [Comamonas testosteroni KF-1]
Length = 440
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 111/340 (32%), Positives = 163/340 (47%), Gaps = 30/340 (8%)
Query: 80 TKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTG 138
T + L GW DL +G+ + TP++D LA G+ N YA VC+ +R +L+TG
Sbjct: 3 TPNIVFILADDLGWADLGVYGATDFKTPHLDHLASQGVRFNQAYANSAVCSATRIALITG 62
Query: 139 KYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+Y + G++ P G+P LP L++ GY T IGKWHLG Y P
Sbjct: 63 RYQYRLQAGLEEPLARHGAQLGLPADHPTLPSLLKQAGYDTALIGKWHLGK-PPAYGPQR 121
Query: 197 RGFESHFGYLNGVISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
G++ FG +G I Y+ H + +Q+S + + R Y T + E
Sbjct: 122 SGYDYFFGNHSGAIDYFTHKPGVGEQFSPDLWQGNEPVERT---------GYYTYILGDE 172
Query: 255 AVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKK 313
A + + E + DKP FL L A H G E +I + + + Y +V+
Sbjct: 173 ATRYVHERKDQDKPFFLSLHFTAPHWPWEGPGDEHVARSIKDLFHYDGGSLKKYGEIVEA 232
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD +VG V+ AL G +N+I+IF SDNG + +P+ G K L E
Sbjct: 233 LDKAVGQVLQALDDTGQADNTIVIFTSDNGG-----------ERFSKTWPFTGQKTELLE 281
Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
GG++VP +L WS +I+ P+V Q+ DWLPTL AAG
Sbjct: 282 GGIRVPTLLRWSARIR--PQVQEQVTASFDWLPTLLAAAG 319
>gi|410030097|ref|ZP_11279927.1| sulfatase [Marinilabilia sp. AK2]
Length = 476
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 120/356 (33%), Positives = 183/356 (51%), Gaps = 48/356 (13%)
Query: 73 RRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPV 127
RR+ A+ K + ++I++ G++D+ G+++I TP++D LA G+ N Y AQ V
Sbjct: 18 RRSDEAVEKQSLPNIIIIFTDDQGYHDVGVFGASDIATPHLDQLAAEGVQFTNFYVAQAV 77
Query: 128 CTPSRASLMTGKYP----IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
C+ SRA+L+TG Y IH + +G P E + + L+ LGY+T GKW
Sbjct: 78 CSASRAALLTGVYSNRLGIHGALDHMSRYGLHPE-----EATIADILKPLGYATAMFGKW 132
Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
HLG + E+ P +GF+ + G + +H + Y + L +D + + T W+
Sbjct: 133 HLGHYP-EFLPTNQGFDEYLGIPYSNDMWPNHPQTKDYYPPLPLYQND--KVIDTIWNDQ 189
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
+ T LFT++++ IE D+P FLYLAH H P ++F+ N
Sbjct: 190 SMFTT-LFTEKSIDFIERNK-DRPFFLYLAHPMPHV---------PLFVSDKFK--GKSN 236
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R Y ++ ++D SVG ++ L+ + EN+IIIF+SDNG P + Y S GS YP
Sbjct: 237 RGLYGDVIMEIDWSVGQIMETLKHYDLDENTIIIFLSDNG-PWLSYSGHS-----GSAYP 290
Query: 364 YRGVKNTLWEGGVKVPAIL-W-----SPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
R K T W+GGVKVP I+ W S +Q NP +S+ D LPTL G
Sbjct: 291 LREGKGTSWDGGVKVPVIMRWKGTWASGLVQPNPAMSI------DILPTLAYLTGA 340
>gi|343086704|ref|YP_004775999.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342355238|gb|AEL27768.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 462
Score = 160 bits (406), Expect = 1e-36, Method: Compositional matrix adjust.
Identities = 130/402 (32%), Positives = 189/402 (47%), Gaps = 49/402 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DL +G+ ++ T NID LA G+ YA PVC+PSRA+LMTGKY + G+ G
Sbjct: 41 GYKDLGSYGAKDLVTTNIDRLAEEGVKFTQFYAAAPVCSPSRAALMTGKYNFNAGLFGNV 100
Query: 149 -PPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
PP E + G+P E + E + GY T IGKWHLG E P +GF+ FG+
Sbjct: 101 SPPHSDPEGKSGLPTEEITMAEMFKSNGYRTGLIGKWHLGH-TPEKLPNGQGFDYFFGHQ 159
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
G I + H Y L HD+ RN + + GE+ DL +E Q I ++ ++
Sbjct: 160 RGCIDNFSHFF---YWHGPNL--HDLYRNETEIYRP-GEFFGDLMVQEMKQFISEK-SEE 212
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQ 326
P F Y A H G + ++ + + P R+ YAA V +D+ VG + L+
Sbjct: 213 PFFTYWAINMPHYPYQGD-----PKWLDYYADLESP-RKEYAAFVSTIDEMVGEIYHHLK 266
Query: 327 RKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQ 386
G LEN+II++ SD+G +VE R + G P G K ++ EGG++VPAI+ P
Sbjct: 267 AIGELENTIIVYQSDHGY-SVEERA---FFGGGDAGPLSGAKFSMLEGGIRVPAIIRFPS 322
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGL 446
+ + DW+PTL G + P+ + NIDG
Sbjct: 323 MLPKDEARHHWANSIDWMPTLAELTG--------------------IKIPASQ--NIDGK 360
Query: 447 DQWSSLL-LNTPSRRNSVLINI---DEKKRTAAVRLDSWKLV 484
+L N PS ++ + KK AVR WKL+
Sbjct: 361 SLLPLILDANVPSPHETMFWGTGDPENKKHAWAVRKGPWKLL 402
>gi|380796101|gb|AFE69926.1| N-acetylgalactosamine-6-sulfatase precursor, partial [Macaca
mulatta]
Length = 503
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 16 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 75
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 76 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 131
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 132 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 187
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 188 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 236
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDSVG ++ L + +N+ + F SDNGA + E + GSN P+ K
Sbjct: 237 DAVREIDDSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGK 291
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D T AG R IDGL+
Sbjct: 292 QTTFEGGMREPALAWWPGHIAAGQVSHQLGSIMDLFTTSLALAGLTPPR-DRAIDGLNLL 350
Query: 429 SSLL 432
+LL
Sbjct: 351 PALL 354
>gi|338213632|ref|YP_004657687.1| arylsulfatase [Runella slithyformis DSM 19594]
gi|336307453|gb|AEI50555.1| Arylsulfatase [Runella slithyformis DSM 19594]
Length = 535
Score = 160 bits (406), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 128/415 (30%), Positives = 195/415 (46%), Gaps = 78/415 (18%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
T AA + L+ G++D+ +G E+ TPNID +A NGI L + Y C P+RAS
Sbjct: 37 TQAAKRPNILYILVDDMGFSDIGCYGG-EVNTPNIDQMAANGIKLRSFYNNARCCPTRAS 95
Query: 135 LMTGKYPIHTGMQ---GPPIWGAEPRGVP--LTERF--LPEYLRELGYSTKAIGKWHLGF 187
L+TG+YP GM P +P L +R+ + E L++ GY T +GKWH+G
Sbjct: 96 LLTGQYPHTVGMGLMVTMPNAAIQPGSYQGFLDDRYPTIAEQLKKTGYHTYMLGKWHVG- 154
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA 247
R ++ PL RGF+++FG ++G SYY+ I +++ R + L+ + + G Y
Sbjct: 155 ERPQHWPLKRGFDNYFGLISGASSYYEIIPAEKGKRFMVLDDKEFTP------PSDGFYV 208
Query: 248 TDLFTKEAVQLIEDQ---PVDKPLFLYLAHLAAH----------------------AGNA 282
TD FT AVQ + Q DKP F+YLA+ A H +
Sbjct: 209 TDAFTDYAVQYLNKQKQEAADKPFFMYLAYTAPHFPLHAYESDIAKYEKLYAQGWDVTRS 268
Query: 283 GKHLEAPQ--------------ETINQFQYITDPNR-----RTYAAMVKKLDDSVGTVIS 323
++ + Q + I ++ TD +R YAAM+ ++D ++G +I
Sbjct: 269 KRYQKMKQLGFIDKRYPLTPRPKDIPDWESATDKDRWIRKMAVYAAMIDRMDQNIGRLIK 328
Query: 324 ALQRKGMLENSIIIFMSDNGA------------PTVEYRETSNYRNWG------SNYPYR 365
L+ G EN++I+FMSDNG+ PT + + +Y + SN P+R
Sbjct: 329 TLKANGQYENTLIVFMSDNGSSNENMESRKLNDPTKKIGDKGSYVTYDTPWANVSNTPFR 388
Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
K L EGG+ P I+ W IQ + H+ D LPT AG + LP
Sbjct: 389 KYKRFLHEGGMITPCIMQWPRTIQPKAGFVDGIGHVMDLLPTGLELAGIPDNALP 443
>gi|426383228|ref|XP_004058189.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gorilla gorilla
gorilla]
Length = 528
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 48 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 104
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +E+ LPE L++ GY +K +GKWHLG R ++ PL GF+
Sbjct: 105 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 163
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 164 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 219
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q P FLY A A HA ++ R Y V+++D
Sbjct: 220 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 268
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K T +EGG
Sbjct: 269 DSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 323
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL+ +LL
Sbjct: 324 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPTLL 379
>gi|13278373|gb|AAH04002.1| Galactosamine (N-acetyl)-6-sulfate sulfatase [Mus musculus]
Length = 520
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 122/365 (33%), Positives = 173/365 (47%), Gaps = 40/365 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 91 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+ +EA+ I Q + P FLY A A HA Q++ R Y
Sbjct: 206 YLQEALDFIRTQHARQGPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG +++ LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 255 AVREIDDSVGKILNLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQ 427
T +EGG++ PAI W P +VS Q+ I D T + AG + R+ IDGLD
Sbjct: 310 TTFEGGMREPAIAWWPGHIAAGQVSHQLGSIMDLFTTSLSLAGLKPPSDRV---IDGLDL 366
Query: 428 WSSLL 432
++L
Sbjct: 367 LPTML 371
>gi|301608268|ref|XP_002933701.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Xenopus
(Silurana) tropicalis]
Length = 520
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 163/577 (28%), Positives = 234/577 (40%), Gaps = 145/577 (25%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 32 LLLMDDMGWGDLGVFGEPSRETPNLDKMASEGMLFPNFYTANPLCSPSRAALLTGRLPIR 91
Query: 144 TGMQGP-----------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G I G G+P +E PE L++ GY K IGKWHLG R Y
Sbjct: 92 NGFYTTNSHARNAYTPQEIMG----GIPDSEVLFPELLKKAGYVNKIIGKWHLG-HRPSY 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEYATD- 249
PL RGF+ FG N YD + R N+ WD VG Y D
Sbjct: 147 HPLRRGFDEWFGSPNCHFGPYD---------------NKQRPNIPVYRDWDMVGRYYEDF 191
Query: 250 -------------LFTKEAVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQ 295
++ +EA++ I Q D +P FLY + A HA
Sbjct: 192 NIDHKTGESNLTQIYLQEALEFINRQAGDQQPFFLYWSIDATHA-----------PVYAS 240
Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
++ R Y V+++D S+G ++ L +K +++++ + F SDNGA + S
Sbjct: 241 RPFLGTSRRGLYGDAVREIDFSIGKILGLLNQKDIIKDTFVFFTSDNGAALI-----SAP 295
Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
GSN P+ K T +EGG++VP I W P +VS Q+ I D T + AG
Sbjct: 296 MQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLAG--- 352
Query: 416 SRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAA 475
+ P R IDG+D +L + + R +E A
Sbjct: 353 -----------------VEIPKDR--AIDGIDLSPVILESKETERPIFYYRGNE---LMA 390
Query: 476 VRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPIS 534
VRL K T N S ++LSQ I F P
Sbjct: 391 VRLGQHKAHYWTWSN------------------------------SWEELSQGINFCPGQ 420
Query: 535 NIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS---S 591
NI + + Q+ NP LF+LG DP E+ +I+ + S
Sbjct: 421 NITGVTTHVQEEH---SINP-------------LLFHLGRDPGEKYSISVLSTEYEAALS 464
Query: 592 QLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
++ +++ HR ++VP +P L D N WSP
Sbjct: 465 RISPVVQQHRNSMVPG---EPQLNVCDLAVMN--WSP 496
>gi|397468273|ref|XP_003805816.1| PREDICTED: N-acetylgalactosamine-6-sulfatase isoform 1 [Pan
paniscus]
Length = 482
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +E+ LPE L++ GY +K +GKWHLG R ++ PL GF+
Sbjct: 59 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 117
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q P FLY A A HA ++ R Y V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K T +EGG
Sbjct: 223 DSIGKMLELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 277
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL+ +LL
Sbjct: 278 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLLPTLL 333
>gi|340380159|ref|XP_003388591.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 500
Score = 160 bits (405), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/352 (31%), Positives = 167/352 (47%), Gaps = 36/352 (10%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
L+ +G+ D SF I TP+ L NG+ILN Y C+PSRAS +TG++P H
Sbjct: 31 VFVLVDDWGFADASFRNP-AIKTPSFQYLVDNGLILNRHYVFKYCSPSRASFLTGRFPHH 89
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
P G + +T LP L+ GYST +GKWH G + Y P+ RGF++
Sbjct: 90 VHQWNPTPPGMVGTNINMT--MLPAKLKTAGYSTHMVGKWHQGLYDPAYLPVNRGFDTSS 147
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQ 262
G+L +++ + D +N A DT G Y + ++ K+ +
Sbjct: 148 GFLQAGEGHFNQTIGCAV---------DFWKN--HAPDTRNGTYDSYIYNKDLTTIFSKH 196
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMVKKLDDSVGTV 321
KPLFLYL HA +EAPQE ++ + T RR AMV D+ G V
Sbjct: 197 DASKPLFLYLPLHNVHAP-----IEAPQEWLDLYPVNSTCATRRVLQAMVSVADNVTGHV 251
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
++ L+ M +N+I++ +DNG + GSNYP +G K+T +EGGV+V A
Sbjct: 252 VNLLKENDMWDNTIMVISADNGGADCK----------GSNYPLKGSKHTFFEGGVRVIAF 301
Query: 382 LWSPQI--QQNPRVSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGLDQW 428
I + + + +H++DW T AG D+ + +DGLD W
Sbjct: 302 ASGGLIPAARRGKSTDGFIHVADWYTTFCKMAGVDHNDSGKGKFPVDGLDVW 353
>gi|423226077|ref|ZP_17212543.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
gi|392630595|gb|EIY24583.1| hypothetical protein HMPREF1062_04729 [Bacteroides cellulosilyticus
CL02T12C19]
Length = 483
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 186/374 (49%), Gaps = 49/374 (13%)
Query: 75 TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
T A TKS + +Y G++D+S +G TPNIDALA GI + YA P+ +
Sbjct: 24 TEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISS 83
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
PSRA +TG+YP G+QG + G+ E + E L+ GY+T IGKWHLG R
Sbjct: 84 PSRAGFLTGRYPARMGIQG-VFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SR 141
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
+Y PL +GF+ +FG I Y + + + Y R E+ + N T
Sbjct: 142 EKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRGNEVEEFHIDIN----------NVTK 186
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EAV I + D+P FL+LAH H P ++F + Y
Sbjct: 187 KYTEEAVDYIR-RKADQPFFLFLAHSMMH---------VPIYVSDEFAGKSGAG--IYGD 234
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V ++D SVG ++ L+ G+ +N++++F SDNG P ++ G P R K
Sbjct: 235 AVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLG-----GRALPLREGKT 288
Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQ 427
T +EGGV+VP I W QI+ P V+ ++ + DW PT+ +GG LP + +DG D
Sbjct: 289 TAFEGGVRVPCIAYWKGQIK--PVVNTDVVSLLDWFPTVTALSGG---ILPDVRLDGYDL 343
Query: 428 WSSLLLNTPSRRNS 441
+ +LN +R S
Sbjct: 344 --TAVLNGTGKRAS 355
>gi|431797495|ref|YP_007224399.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788260|gb|AGA78389.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 477
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 112/330 (33%), Positives = 166/330 (50%), Gaps = 27/330 (8%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
+L+++ G D + +GS ++ TPN+D +A+ G YA PVC+PSRASLMTGK+
Sbjct: 44 ILVIFADDQGTFDTNAYGSKDLETPNLDRIAHEGTRFTQFYAAAPVCSPSRASLMTGKFN 103
Query: 142 IHTGMQG---PPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
H + G PP E + G+P E + E + GY T IGKWHLG E P +
Sbjct: 104 HHARLFGNVAPPHADPEFKSGLPTDEITMAEIFKADGYKTGLIGKWHLGH-SPEKLPNGQ 162
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GF+ FG+ G I + H N HD+ RN + G++ DL E +
Sbjct: 163 GFDYFFGHQRGCIDNFSHFFF-----WAGPNMHDLYRNEEEVY-YPGQFFGDLMVDEVKK 216
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
+ D P F+Y A H G+ + + ++ + P RR YAA V D+
Sbjct: 217 FTAPETED-PFFVYWAINMPHYPYQGR-----PKWLEHYKDLPSP-RREYAAFVSTYDEM 269
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+G V+ L+ G L+N+II++ SD+G T E + G+N Y G K + EGG++
Sbjct: 270 IGEVLDHLKATGQLDNTIIVYQSDHGHST----EERAFFGGGNNGVYHGAKFCMLEGGIR 325
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
VPA++ P+ +V QM + DWLPTL
Sbjct: 326 VPAMIRYPKAVPANQVRNQMANSIDWLPTL 355
>gi|311745970|ref|ZP_07719755.1| arylsulfatase [Algoriphagus sp. PR1]
gi|311302452|gb|EAZ80458.2| arylsulfatase [Algoriphagus sp. PR1]
Length = 476
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 120/377 (31%), Positives = 182/377 (48%), Gaps = 62/377 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL F G I TPNID LA G+ ++ Y+ VC PSR+S +TG + HT ++G
Sbjct: 30 GYGDLGFLGQEYIETPNIDRLAKEGMFFSDFYSGSTVCAPSRSSFLTGMHTGHTPIRGNS 89
Query: 151 IWGAEPRG-VPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+P G P+ + L + +++ GY T A GKW LGF P +GFE +GY
Sbjct: 90 E--VQPEGQFPMPDSVLTVAKVMKQAGYVTGAFGKWGLGFIGSTGEPASQGFEEFYGY-- 145
Query: 208 GVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
Y H Y ++L G+D W T G+YA DL ++ ++ I D+
Sbjct: 146 -NCQRYAHRYYPAYLWHNDEKIDLPGND--------WTTKGDYAPDLIQEKTLEFI-DEN 195
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR------------------ 305
+ P FL++ + HA L AP + I + P+ +
Sbjct: 196 KENPFFLFMPIVTPHA-----ELAAPDDEIMEKYRAMFPDEKPYVAGNGADYGPDMRIPG 250
Query: 306 ---------TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNY 355
T+AAMV+++D VG V+ L+ G+ EN++IIF SDNGA +RE ++
Sbjct: 251 YQSQPYPHATFAAMVERIDRYVGEVLQKLESNGLSENTVIIFTSDNGA----HREGGADP 306
Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
+ SN +RG K L+EGGV+ P I W P + + + D LPT + A GG+
Sbjct: 307 EFFDSNGKFRGFKRDLYEGGVRAPMIAWWPNKIKAGSTTDHVSAFWDLLPT-FAAIGGE- 364
Query: 416 SRLPLNIDGLDQWSSLL 432
++ +IDG+ +LL
Sbjct: 365 -KITKSIDGISFLPTLL 380
>gi|167520280|ref|XP_001744479.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163776810|gb|EDQ90428.1| predicted protein [Monosiga brevicollis MX1]
Length = 314
Score = 160 bits (404), Expect = 2e-36, Method: Compositional matrix adjust.
Identities = 104/331 (31%), Positives = 168/331 (50%), Gaps = 35/331 (10%)
Query: 88 LIVYGWNDLSFHGSN--------EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGK 139
+ V G+ D+S+ IPTP+ID LA +G+ L + Y +C+P+R SL++G+
Sbjct: 4 VAVLGFTDVSYKNQTYPDVAFGPMIPTPSIDELALSGVRLESYYVNQLCSPTRTSLLSGR 63
Query: 140 YPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
Y GM I P +PLT + + E+L+E G++T A GKW G TP RGF
Sbjct: 64 YAYTLGMNAEVIVDGHPDQLPLTVKTIGEHLQEGGWATFAGGKWDAGMTSWGCTPTCRGF 123
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
++ G+ N Y+ H + + ++L HD +L+ T G Y T+L T Q I
Sbjct: 124 DTFSGFYNAFNDYFTHRVGNY----LDLR-HDFAPDLADDNHT-GVYMTELLTSRVQQFI 177
Query: 260 EDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT----DPNRRTYAAMVKKL 314
++ F Y+AH A HA N + P + + T P RR M++ +
Sbjct: 178 TTAAETEQSTFAYVAHQAVHAPN-----QVPMSYLEGYCLDTIPEDRPTRRILCGMMRAV 232
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D+SV + + ++ G+ ++++IF +DNG N GSNYP RG K T +EG
Sbjct: 233 DESVRNITATYKQLGLWNDTVLIFTTDNGG---------NPETGGSNYPLRGQKATTFEG 283
Query: 375 GVKVPAILWSPQIQQNPR--VSLQMMHISDW 403
G++ + SP + ++ R +S +++H+SDW
Sbjct: 284 GMRGVGFVNSPLLNESQRGFISDELIHVSDW 314
>gi|432943494|ref|ZP_20140329.1| arylsulfatase [Escherichia coli KTE196]
gi|433045335|ref|ZP_20232807.1| arylsulfatase [Escherichia coli KTE117]
gi|431466713|gb|ELH46730.1| arylsulfatase [Escherichia coli KTE196]
gi|431551968|gb|ELI25931.1| arylsulfatase [Escherichia coli KTE117]
Length = 551
Score = 160 bits (404), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 139/440 (31%), Positives = 213/440 (48%), Gaps = 69/440 (15%)
Query: 87 LLIVYGWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LL GW D+ F+G PTP+IDA+A G+IL + Y+QP +P+RA+++TG+Y IH
Sbjct: 92 LLDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSIH 151
Query: 144 TGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
G+ PP++G +P G+ LT LP+ L + GY T+AIGKWH+G +E P GF+
Sbjct: 152 HGILMPPMYG-QPGGLQGLTT--LPQLLHDQGYVTQAIGKWHMG-ENKESQPQNVGFDDF 207
Query: 203 FGYLNGVISYYD-----HI-----LSDQYSRTVEL------NGHDMRRNLSTA-WDTVGE 245
G+ N V Y H+ LS S ++ + H +R A D +
Sbjct: 208 RGF-NSVSDMYTEWRDVHVNPEVALSPDRSEYIKQLPFSKDDVHAVRGGEQQAIADITPK 266
Query: 246 YATDL---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 301
Y DL + + V++++ DKP FLY H N N +
Sbjct: 267 YMEDLDQRWMEYGVKILDKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSS 316
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
P R +Y + +++D + AL++ G L+N++I+F SDNG P E
Sbjct: 317 PARTSYGDCMVEMNDVFANLYKALEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GR 368
Query: 362 YPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
P+RG K + WEGGV+VP + W IQ PR S ++ ++D PT AG +++
Sbjct: 369 TPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGIVDLADLFPTALDLAGHPGAKV-- 424
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
++L+ T + IDG+DQ +S L T + N + + AAVR+D
Sbjct: 425 --------ANLVPKT-----TFIDGVDQ-TSFFLGTNGQSNRKAEHYFLNGKLAAVRMDE 470
Query: 481 WKLVLGTQE--NGTMDGYYG 498
+K + Q+ T GY G
Sbjct: 471 FKYHVLIQQPYAYTQSGYQG 490
>gi|154250816|ref|YP_001411640.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
gi|154154766|gb|ABS61983.1| Steryl-sulfatase [Parvibaculum lavamentivorans DS-1]
Length = 553
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 126/382 (32%), Positives = 178/382 (46%), Gaps = 70/382 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--- 147
G+ND+S G +PTPNID++A G + Y+ C PSRA +MTG+Y TG +
Sbjct: 82 GFNDISHFGGGIVPTPNIDSIARGGANFTSAYSGTAACAPSRAMIMTGRYGTRTGFEFTP 141
Query: 148 -------------------------------GPPIWGAEPRGVPLTERFLPEYLRELGYS 176
PP +G+P +E L E L+ GY
Sbjct: 142 TPPGMTRIVDMFYNDGTRTHEMLVDREAAAKAPPF---REQGLPGSEITLAEALKPKGYH 198
Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---- 232
IGKWHLG E+ P +GF+ +G+ D D + + + D
Sbjct: 199 NIHIGKWHLGN-APEFLPNAQGFDESVMLESGLFLPEDS--PDVVNAKLPFDPIDQFLWA 255
Query: 233 RRNLST-----AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
R +T AW Y TD +T EA++ IE ++P FLYLAH H L+
Sbjct: 256 RMQYATSYNGSAWFEPKGYLTDFYTDEAIKAIEAN-RNRPFFLYLAHWGVHT-----PLQ 309
Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
A + + +I D R YAAM+ LD SVG V+ +L+ G+ EN+++IF SDNGAP
Sbjct: 310 ASKADYDALSHIEDERLRVYAAMIVALDRSVGRVLQSLKENGLEENTLVIFSSDNGAPG- 368
Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPT 406
Y + N PYRG K T +EGG++VP W +I + + H+ D PT
Sbjct: 369 -YIGLPDV-----NKPYRGWKLTFFEGGIRVPFFAKWPARIPAGTERTTPVAHL-DMFPT 421
Query: 407 LYTAAGGDTSRLPLN--IDGLD 426
+ AAGG+ LP + IDG+D
Sbjct: 422 IVAAAGGE---LPADRVIDGID 440
Score = 44.3 bits (103), Expect = 0.20, Method: Compositional matrix adjust.
Identities = 26/74 (35%), Positives = 37/74 (50%), Gaps = 6/74 (8%)
Query: 6 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 65
+AW Y TD +T EA++ IE ++P FLYLAH H L+A + +
Sbjct: 265 GSAWFEPKGYLTDFYTDEAIKAIEAN-RNRPFFLYLAHWGVHT-----PLQASKADYDAL 318
Query: 66 QYITDPNRRTYAAL 79
+I D R YAA+
Sbjct: 319 SHIEDERLRVYAAM 332
Score = 40.8 bits (94), Expect = 2.2, Method: Compositional matrix adjust.
Identities = 21/53 (39%), Positives = 28/53 (52%), Gaps = 4/53 (7%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH----RRTLVPQSHEQPDLV 615
+LFNL DP EQNN+A P+ ++L L++ H R L P E P V
Sbjct: 482 TWLFNLKTDPTEQNNVADENPEKVAELKALVEAHNATQREPLFPAVAEMPVTV 534
>gi|323452121|gb|EGB07996.1| hypothetical protein AURANDRAFT_64538 [Aureococcus anophagefferens]
Length = 1591
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 114/364 (31%), Positives = 173/364 (47%), Gaps = 34/364 (9%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPN-IDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
L + G NDL S ++P P + L +G+ L + Y Q +C+P+RA+LMTGK+
Sbjct: 1098 VLVFMDDVGLNDL--WRSTDLPKPTEMSKLVRDGVELTSYYGQSLCSPARATLMTGKFAH 1155
Query: 143 HTGM---QGP--PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
G QG + VPL LP+ ++ LGY T AIGKW++G +Y P R
Sbjct: 1156 KIGFSDQQGGVREVTAYSNFSVPLGHDMLPQGMKRLGYQTHAIGKWNIGHCNVKYMPWQR 1215
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GF++ GY I Y DH+ SD + T +N + N S + G Y T LFT+ A +
Sbjct: 1216 GFDTFVGYFTDGIGYTDHV-SDT-ANTYTVNDGGLAFNGS---EYEGTYTTALFTERAEK 1270
Query: 258 LIEDQPVDKPLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
++ D P D PLF++LA+ H A + + + I + R +
Sbjct: 1271 VLHDAPEDAPLFMWLAYHGMHDNDGVTDADTCAQTGDDDDDAIFMERGHLPFARFNFGCG 1330
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
++ +D VG V +AL + + +++ SDNG Y GSNYPYRG K
Sbjct: 1331 LRAIDRGVGQVRTALDDRA--RDYVLVVTSDNGG------FACGYHCQGSNYPYRGAKFY 1382
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQ---MMHISDWLPTLYTAAGGDTSRLPL---NIDG 424
+EGG+++P +++SP R +MH DW T GGD + L + D
Sbjct: 1383 EFEGGLRLPGVVYSPSRVPARRRGAAYDGLMHHVDWFGTFVHLGGGDVAALTTFDSSFDT 1442
Query: 425 LDQW 428
++ W
Sbjct: 1443 MNHW 1446
>gi|255014752|ref|ZP_05286878.1| arylsulfatase A [Bacteroides sp. 2_1_7]
gi|410102839|ref|ZP_11297764.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
gi|409237966|gb|EKN30761.1| hypothetical protein HMPREF0999_01536 [Parabacteroides sp. D25]
Length = 459
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 45/362 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPNID +A G+ L Y V TPSRA+LMTG+ P+ G+ G
Sbjct: 42 GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ P G+ E + + L++ GY+T +GKWHLG F Y P GF+++FG
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
I Y + + V+ G R ST G+ T +T++AV
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPSTPLILDGKQIESEPDQGELTRRYTEKAVSF 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I+D +P FLY AH H P T +F+ + R Y +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V+ AL+ G+ EN+ +IF SDNG E + N GS P + K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
PAI W P + NP ++ ++M D PT + AG + + L +DG++Q L S
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369
Query: 439 RN 440
R+
Sbjct: 370 RD 371
>gi|241574524|ref|XP_002403353.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215500210|gb|EEC09704.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 255
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 104/283 (36%), Positives = 151/283 (53%), Gaps = 37/283 (13%)
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV LD+S+G V AL G+LEN+II+ SDNG + ++ N G N+P RG K
Sbjct: 1 MVDALDESIGEVFQALGDAGILENTIIVLSSDNGGLPIGFQS-----NRGFNFPLRGGKG 55
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
TLWEGG + +WSP +++ S QMMH++DWLPTLY+AAG D PL++
Sbjct: 56 TLWEGGCRATGFIWSPLLKRKGVASDQMMHVTDWLPTLYSAAGED----PLSL------- 104
Query: 430 SLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE 489
+DG D W+ L L S R +L NID +A+R ++KLVLG
Sbjct: 105 -----------GKLDGTDMWAPLSLGYASPRYEILYNIDPINNISALRYTNYKLVLGRTN 153
Query: 490 NGTMDGYYGQTRSNKVPLLNFNAI-VESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATI 548
+G D + T ++ P + + + V+S + L+ L L + R++AT+
Sbjct: 154 DGQYDPRFLTTGGSR-PYDDLDLLMVQSTAARVLRDLRNKGDLQFPD-----QWREKATL 207
Query: 549 HCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISS 591
CG N P + + P YLF+L +DPCE NNIASS+ ++S
Sbjct: 208 DCG-NKVP--KNFVSLDPPYLFDLASDPCEINNIASSQAGVTS 247
>gi|262383796|ref|ZP_06076932.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294694|gb|EEY82626.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 459
Score = 159 bits (403), Expect = 3e-36, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 179/362 (49%), Gaps = 45/362 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPNID +A G+ L Y V TPSRA+LMTG+ P+ G+ G
Sbjct: 42 GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ P G+ E + + L++ GY+T +GKWHLG F Y P GF+++FG
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
I Y + + V+ G R ST G+ T +T++AV
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPSTPLILDGKQIESEPDQGELTRRYTEKAVSF 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I+D +P FLY AH H P T +F+ + R Y +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V+ AL+ G+ EN+ +IF SDNG E + N GS P + K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
PAI W P + NP ++ ++M D PT + AG + + L +DG++Q L S
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369
Query: 439 RN 440
R+
Sbjct: 370 RD 371
>gi|149177301|ref|ZP_01855906.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
gi|148843826|gb|EDL58184.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Planctomyces
maris DSM 8797]
Length = 501
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 127/455 (27%), Positives = 192/455 (42%), Gaps = 99/455 (21%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGM---- 146
G+ DL GS EI TP++D LA G L + Y P CTPSR SL+TG+YP G+
Sbjct: 49 GYRDLGSFGSEEIMTPHLDRLAKEGAKLTSFYVTWPACTPSRGSLLTGRYPQRNGIYDMI 108
Query: 147 -QGPPIWGAEPR------------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
P +G + + G+ + E+ LP L+ GY + GKW LG +R +
Sbjct: 109 RNEAPDFGHKYKPAEYEVTFERIGGMDVREKLLPALLKPAGYVSAIYGKWDLGIHKR-FL 167
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
PL RGF+ +G+ N I Y+ H ++Y M RN + G Y T LF +
Sbjct: 168 PLARGFDDFYGFTNTGIDYFTH---ERYGVP------SMYRNNQPTEEDKGTYCTYLFQR 218
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQETINQFQYITD------- 301
EAV+ I++ KP FLYL A H ++ +AP++ N + ++ D
Sbjct: 219 EAVRFIKENH-QKPFFLYLPFNAPHGASSLDPRIRGGAQAPEKYKNMYPHLKDTLVTKKK 277
Query: 302 ----------------------PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
R Y A + +DD++G V+ L + +N+I++F
Sbjct: 278 TGRYEFRERPDGPVIHQGVSASKRRLEYVASITCMDDAIGEVLGLLDEYQIADNTIVVFF 337
Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
SDNG N P +G K ++EGG++VP ++ P + V+ +++
Sbjct: 338 SDNGG-----------SGGADNSPLKGKKGMMFEGGIRVPCLVRYPAKIKPGTVNDELLT 386
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
+ +PT A P N IDG D L+ T S
Sbjct: 387 SLELVPTFLKEAA----------------------IPLPENVVIDGYDMLPVLMGKTTSP 424
Query: 460 RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
RN + E K A R+ WK V + +G D
Sbjct: 425 RNEMYWQRREDK---AARVGHWKWVESEKGSGLFD 456
>gi|355710478|gb|EHH31942.1| hypothetical protein EGK_13112 [Macaca mulatta]
Length = 482
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 170/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +E+ LPE L++ GY +K +GKWHLG R ++ PL GF+
Sbjct: 59 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 117
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q P FLY A A HA ++ R Y V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DSVG ++ L + +N+ + F SDNGA + E + GSN P+ K T +EGG
Sbjct: 223 DSVGKMLELLHDLHVADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGKQTTFEGG 277
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL+ +LL
Sbjct: 278 MREPALAWWPGHIAAGQVSHQLGSIMD-LFTTSLALAGLTPPRDRAIDGLNLLPALL 333
>gi|298706919|emb|CBJ29746.1| Formylglycine-dependent sulfatase [Ectocarpus siliculosus]
Length = 616
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 118/383 (30%), Positives = 182/383 (47%), Gaps = 64/383 (16%)
Query: 85 LTLLIVYGWNDLSFHGSN--EIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPI 142
L+ G ND+ + ++ E+ TP +D+L+ G+ L Y +CTPSRASLMTG+
Sbjct: 124 FILIDDVGTNDIGYQSTDLWEL-TPFMDSLSSEGVRLTKYYTNQLCTPSRASLMTGRDTF 182
Query: 143 HTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
TGMQ + + G+PL E L E + GKWHLG + + P RGF++
Sbjct: 183 RTGMQYEVVEDSGAWGLPLEEVTLAERFK--------TGKWHLGMYSDAHYPFARGFDTF 234
Query: 203 FGYLNGVISYYDH-------ILSDQYSRTVELNGHDMRRNLSTAWDT-------VGEYAT 248
GY+ V Y H +YS + D ++ +T VG Y+T
Sbjct: 235 LGYMGAVRGYSSHEGCNTPTFEGGEYSCFKDFGYGDKDGYINHITNTTRQGPSFVGNYST 294
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-------D 301
+ T A+++ ++ D P FLY++H A H+ P + + ++ T D
Sbjct: 295 TIITDRAIEVAKEHGED-PFFLYVSHQAVHSPVD----PPPDDYFTEEEHATLARVVSED 349
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-PTVEYRETSNYRNWGS 360
R+ +A ++ LD + + L G L+N++++ SDNGA PT GS
Sbjct: 350 AYRQRFARVILFLDKEMRRLHDELDALGALDNTVLVVASDNGACPTA----------GGS 399
Query: 361 NYPYRGVKNTLWEGGVKVPAILWS------PQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
NYP RG K+T++EGGV+VPA ++S P+ + R S MMH +DW PT G
Sbjct: 400 NYPLRGYKHTIFEGGVRVPAFVYSKSTDLIPEEARGTRYS-GMMHSTDWTPTF----GSV 454
Query: 415 TSRLPLN-----IDGLDQWSSLL 432
LPL + G D W +++
Sbjct: 455 VPDLPLGEMGKELSGFDHWDAII 477
>gi|340367651|ref|XP_003382367.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 494
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 170/354 (48%), Gaps = 42/354 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
L+ +G+ D+ F I +PN D LA G++LN Y C+PSRASL+TG++P HT
Sbjct: 27 FVLVDDWGFADVGFKNP-AISSPNFDHLAKTGLVLNRHYVYMYCSPSRASLLTGRWPHHT 85
Query: 145 GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
+ G L +P L+ Y+T +GKWH GFF Y P+ RGF++ G
Sbjct: 86 HQWN--LGNNSTAGTNLAMTMIPAKLKAANYATHMVGKWHQGFFDPRYLPINRGFDTSSG 143
Query: 205 YLNGVISYY--DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+L G + + I + Y + N D R G Y ++ + +I
Sbjct: 144 FLCGSEDHMTQNAICAIDYWKN---NAPDPRN---------GTYDAYIYRDDLTDIINSH 191
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAMVKKLDDSVGTV 321
++PLFLYL H L+AP+E ++ + T R T AM+ D+ G +
Sbjct: 192 NTNEPLFLYLPLHNVHLP-----LQAPKEWLDIYPANSTCEKRHTLQAMISVADNVTGHL 246
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ L++KG+ +N+I++ +DNG GSNYP RG K TL+EGGV+ A
Sbjct: 247 VELLKKKGIWDNTIMVISADNGGAHCS----------GSNYPLRGCKETLFEGGVRSLAF 296
Query: 382 LWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLNIDGLDQW 428
+ + ++ R + +HI+DW T AG D T + P +DGLD W
Sbjct: 297 VNGGLLPESRRGQSTDGFIHIADWYTTFCKLAGVDPDDSGTGKFP--VDGLDVW 348
>gi|404447701|ref|ZP_11012695.1| sulfatase [Indibacter alkaliphilus LW1]
gi|403766287|gb|EJZ27159.1| sulfatase [Indibacter alkaliphilus LW1]
Length = 486
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 118/351 (33%), Positives = 180/351 (51%), Gaps = 44/351 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND+ GS +I TPN+D +A G+ +N Y AQ VC+ SRA+L+TG Y G+ G
Sbjct: 52 GYNDVGVFGSKDISTPNLDNMAKEGVRFSNFYVAQAVCSASRAALLTGTYSNRLGIHGA- 110
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ + G+ E + E L+ LGY T GKWHLG E+ P+ +GF+ +FG
Sbjct: 111 LDHSSNHGLNPDETTIAEMLKPLGYQTAIFGKWHLGH-HPEFLPVQQGFDEYFGIPYSND 169
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE---YATDLFTKEAVQLIEDQPVDKP 267
+ +H + Y + L +D DT+ T + T++++ IE + P
Sbjct: 170 MWPNHPETKNYYPPLPLYEND------EVVDTLHHDQSMLTTVLTEKSLDFIERNK-ETP 222
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FLYLAH H P ++F+ ++ R Y ++ ++D SVG V+ +L+
Sbjct: 223 FFLYLAHPMPHV---------PLFVSDKFEGKSE--RGLYGDVIMEIDWSVGQVLESLKS 271
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQ 386
KG+ EN+++IF+SDNG P + Y + GS YP + K T W+GGVK+P+I+ W +
Sbjct: 272 KGLEENTLVIFLSDNG-PWLSYSLHA-----GSAYPLKEGKGTSWDGGVKIPSIMQWKGK 325
Query: 387 I-----QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
I Q NP +S+ D LPT+ G LP IDG W +L
Sbjct: 326 IPEGKVQTNPAMSI------DILPTIAEITGAKLPELP--IDGKSIWKMIL 368
>gi|168701882|ref|ZP_02734159.1| Arylsulfatase [Gemmata obscuriglobus UQM 2246]
Length = 467
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 122/378 (32%), Positives = 186/378 (49%), Gaps = 47/378 (12%)
Query: 71 PNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
P+ R A + L + G +L +G +I TP+ID LA G Y+ PVC
Sbjct: 16 PSGRAADAPKPNIVLIVADDLGCFELGCYGQTKIKTPHIDKLAQGGAKFTRFYSGSPVCA 75
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSR LMTGK+ H ++ A+P G + + + + L+ GY+T A+GKW LG
Sbjct: 76 PSRCVLMTGKHSGHATVRNN--VEAKPEGQFPIRAEDVTVADALKAHGYATGAMGKWGLG 133
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY--SRTVELNGHDMRRNLSTAWDTVG 244
F +PL GF+ FGY N + H + Y + VEL G+D + T
Sbjct: 134 MFDTAGSPLKHGFDLFFGY-NCQRHAHSHYPTYIYRNDKRVELKGNDGK--------TGK 184
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDP 302
++ DLF +EA+ IE KP FLYL H ++ P++++N++ Q DP
Sbjct: 185 QFTQDLFEEEALGFIEANKA-KPFFLYLPFTVPHVA-----VQVPEDSLNEYKGQLGDDP 238
Query: 303 N-------------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPT--V 347
YAAMV ++D SVG V+ L G+ +N++++F SDNG PT V
Sbjct: 239 AYDGKKGYQPHPAPHAGYAAMVTRMDRSVGRVVEKLNALGLEKNTLVLFTSDNG-PTHNV 297
Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
++S + + G RG+K +++EGG++VP I + P + S ++ D LPTL
Sbjct: 298 GGADSSFFNSAGK---LRGLKGSVYEGGIRVPFIAYQPGTIKAGTESDAPLYFPDVLPTL 354
Query: 408 YTAAGGDTSRLPLNIDGL 425
AG ++ P IDG+
Sbjct: 355 CAFAG---TKAPSAIDGI 369
>gi|449692603|ref|XP_004213101.1| PREDICTED: arylsulfatase B-like, partial [Hydra magnipapillata]
Length = 181
Score = 159 bits (402), Expect = 4e-36, Method: Compositional matrix adjust.
Identities = 85/192 (44%), Positives = 118/192 (61%), Gaps = 12/192 (6%)
Query: 183 WHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT 242
WHLGFF ++YTP YRGF+S++G G Y++H ++ YS G D+ N + +
Sbjct: 1 WHLGFFAKQYTPTYRGFDSYYGSYLGKGDYWNHSNTETYS------GLDLHDNENGVFSQ 54
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQETINQFQYITD 301
G Y+T+++T EA+ I + +PLFLYLA+ A H+ N + L+APQE I++F YI
Sbjct: 55 DGNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQEWIDKFSYIKH 114
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
RR YAAM+ +D VG V AL K ML+NSIIIF +DNG P + NW +N
Sbjct: 115 EQRRKYAAMLGYMDYGVGRVHDALAEKKMLDNSIIIFTTDNGGPANGFD-----YNWANN 169
Query: 362 YPYRGVKNTLWE 373
+P RGVK TL+E
Sbjct: 170 FPLRGVKATLFE 181
Score = 65.1 bits (157), Expect = 1e-07, Method: Compositional matrix adjust.
Identities = 32/80 (40%), Positives = 49/80 (61%), Gaps = 1/80 (1%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH-LEAPQ 59
+ N + + G Y+T+++T EA+ I + +PLFLYLA+ A H+ N + L+APQ
Sbjct: 44 LHDNENGVFSQDGNYSTEMYTAEAISCINNHNSSEPLFLYLAYQAVHSANTEEDPLQAPQ 103
Query: 60 ETINQFQYITDPNRRTYAAL 79
E I++F YI RR YAA+
Sbjct: 104 EWIDKFSYIKHEQRRKYAAM 123
>gi|372221524|ref|ZP_09499945.1| sulfatase [Mesoflavibacter zeaxanthinifaciens S86]
Length = 461
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 121/373 (32%), Positives = 183/373 (49%), Gaps = 46/373 (12%)
Query: 70 DPNRRTYAALTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP 126
D ++ +L K LL++ GWND+ +HGS +I TP +D+LA NG L Y P
Sbjct: 22 DKKIKSSNSLPKEQPNILLVIADDAGWNDVGYHGS-KIKTPVLDSLANNGAKLERFYVAP 80
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
C+PSRA+L+TG G+ P I G +P + LP+ +++LGY T GKWHLG
Sbjct: 81 TCSPSRAALLTGIPASRLGIVAP-IAGKSKIALPDSLVTLPKAMKKLGYRTALFGKWHLG 139
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
E P GF++ +G+L+G I Y H +Y +N +W G++
Sbjct: 140 L-TPENGPQAYGFDTSYGFLHGQIDQYTH----EY------------KNGDPSWHKNGKF 182
Query: 247 ------ATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQY 298
TDL T A+ ++ + P F+ LA+ A H + + P TI
Sbjct: 183 LKEDGHVTDLLTDAAIAYFNQETKTETPSFVTLAYSAPHFPLQEEERYKRPYRTI----- 237
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA-----PTVEYR-ET 352
TD +R YAA + LD + G ++++L+R G L+N++I+FMSDNGA P +Y+ +
Sbjct: 238 FTDSSRVDYAAAMTHLDIAFGRLLNSLKRSGKLDNTLILFMSDNGAMKNWYPKDQYQGKH 297
Query: 353 SNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
G N P + K T +EG ++VPA + W N + + + D LPTL
Sbjct: 298 GPNTTLGDNTPLKDYKTTNYEGAIRVPAFVYWKNHF--NGKTLDDFIGVIDVLPTLVELT 355
Query: 412 GGDTSRLPLNIDG 424
G P I+G
Sbjct: 356 GASVK--PNTIEG 366
>gi|224537481|ref|ZP_03678020.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
gi|224520919|gb|EEF90024.1| hypothetical protein BACCELL_02360 [Bacteroides cellulosilyticus
DSM 14838]
Length = 525
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 125/374 (33%), Positives = 185/374 (49%), Gaps = 49/374 (13%)
Query: 75 TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
T A TKS + +Y G++D+S +G TPNIDALA GI + YA P+ +
Sbjct: 66 TEATPTKSEKPNFVFIYMDDMGYSDVSCYGETRWTTPNIDALAAEGIKFTDCYAASPISS 125
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
PSRA +TG+YP G+QG + G+ E + E L+ GY+T IGKWHLG R
Sbjct: 126 PSRAGFLTGRYPARMGIQG-VFYPDSYTGMAPEEVTMAEVLKVQGYATACIGKWHLG-SR 183
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATD 249
+Y PL +GF+ +FG I Y + + + Y R E+ + N T
Sbjct: 184 EKYLPLQQGFDEYFG-----IPYSNDMSAQVYLRGNEVEEFHIDIN----------NVTK 228
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EAV I + D+P FL+LAH H P ++F Y
Sbjct: 229 KYTEEAVDYIR-RKADQPFFLFLAHSMMH---------VPIYVSDEFA--GKSGAGIYGD 276
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V ++D SVG ++ L+ G+ +N++++F SDNG P ++ G P R K
Sbjct: 277 AVLEVDWSVGRIMETLRELGLDDNTLVVFTSDNG-PWLQEGPLG-----GRALPLREGKT 330
Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQ 427
T +EGGV+VP I W QI+ P V+ ++ + DW PT+ +GG LP + +DG D
Sbjct: 331 TAFEGGVRVPCIAYWKGQIK--PVVNTDVVSLLDWFPTVTALSGG---ILPDVRLDGYDL 385
Query: 428 WSSLLLNTPSRRNS 441
+ +LN +R S
Sbjct: 386 --TAVLNGTGKRAS 397
>gi|355757044|gb|EHH60652.1| hypothetical protein EGM_12064 [Macaca fascicularis]
Length = 482
Score = 159 bits (402), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 119/357 (33%), Positives = 169/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G + N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 2 GWGDLGVYGEPSRETPNLDRMAAEGTLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 58
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+P +E+ LPE L++ GY +K +GKWHLG R ++ PL GF+
Sbjct: 59 TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKVVGKWHLG-HRPQFHPLKHGFD 117
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 118 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 173
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q P FLY A A HA ++ R Y V+++D
Sbjct: 174 LDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAVREID 222
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DSVG ++ L + +N+ + F SDNGA + E + GSN P+ K T +EGG
Sbjct: 223 DSVGKMLELLHDLHIADNTFVFFTSDNGAALISAPEQA-----GSNGPFLCGKQTTFEGG 277
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL+ +LL
Sbjct: 278 MREPALAWWPGHIAAGQVSHQLGSIMD-LFTTSLALAGLTPPRDRAIDGLNLLPALL 333
>gi|386287174|ref|ZP_10064349.1| sulfatase family protein [gamma proteobacterium BDW918]
gi|385279706|gb|EIF43643.1| sulfatase family protein [gamma proteobacterium BDW918]
Length = 555
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 125/383 (32%), Positives = 188/383 (49%), Gaps = 70/383 (18%)
Query: 92 GWNDLSFHG----SNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM 146
G ND+S +G + ++ TPNID LA +G + Y+ C PSRA L+TG+YP TG
Sbjct: 76 GINDISTYGGGLANGQVQTPNIDELAASGANFSQAYSGAGTCAPSRAMLLTGRYPTRTGF 135
Query: 147 QGPP--------------------------------IWGAEPRGVPLTERFLPEYLRELG 174
+ P I + +G+P +E L E LR+ G
Sbjct: 136 EFTPTPDGMGAMATMIASSHDNGLPPSIYNKAGADKIPSFDEQGLPSSEITLAEILRDRG 195
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM-- 232
Y T IGKWHLG + ++P +GFE +G+ DH D + V+ + D
Sbjct: 196 YYTAHIGKWHLG-RGKGFSPEEQGFEQSLLMHSGLYLPVDH--PDVVNAKVDFDPIDQFL 252
Query: 233 --RRNLSTAWDTVG------EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
+ + A++ G Y TD +T E++++I+ ++P FLYLAH AH
Sbjct: 253 WAKMRYAAAFNDGGPVFEPKSYLTDYWTDESIKIIKANK-NRPFFLYLAHWGAHT----- 306
Query: 285 HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
L+A +E I R YAAMV+ +D SVG +++ L+ +G+ +N+I++F SDNG
Sbjct: 307 PLQATREDYEAVGDIKPHRARVYAAMVRAVDRSVGRILATLKEEGLSDNTIVVFSSDNGG 366
Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDW 403
Y + N PYRG K T++EGG++VP + W I +N + HI D+
Sbjct: 367 --AGYIGIPDI-----NAPYRGWKLTMFEGGIRVPLFMRWPAHIARNTSIDTPAAHI-DF 418
Query: 404 LPTLYTAAGGDTSRLPLN--IDG 424
LPTL AA ++LP + IDG
Sbjct: 419 LPTLVAAA---QAKLPEDRIIDG 438
>gi|161502378|ref|YP_001569490.1| hypothetical protein SARI_00412 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:- str. RSK2980]
gi|160863725|gb|ABX20348.1| hypothetical protein SARI_00412 [Salmonella enterica subsp.
arizonae serovar 62:z4,z23:-]
Length = 550
Score = 159 bits (401), Expect = 5e-36, Method: Compositional matrix adjust.
Identities = 145/463 (31%), Positives = 214/463 (46%), Gaps = 75/463 (16%)
Query: 66 QYITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEI---PTPNIDALAYNGIILN 120
Q + D ++T K + + I+ GW D+ F+G PTP+IDA+A G+IL
Sbjct: 73 QKLADIEKKT----GKKPNIVVFILDDVGWMDVGFNGGGVAVGNPTPDIDAVASQGLILT 128
Query: 121 NMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVP-LTERFLPEYLRELGYSTKA 179
+ Y+QP +P+RA++MTG+Y +H G+ PP++G P G+ LT LP+ L E GY T+A
Sbjct: 129 SAYSQPSSSPTRATIMTGQYSVHHGILMPPMYGM-PGGLEGLTT--LPQLLHEQGYVTQA 185
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY----------DHILSD---QYSRTVE 226
IGKWH+G P GF+ G+ N V Y + LS QY + +
Sbjct: 186 IGKWHMG-ENTGSQPQNVGFDDFRGF-NSVSDMYTEWRDPNMNPEVALSPSRYQYIKNLP 243
Query: 227 LNGHDMRRNLSTAWDTVGE----YATDL---FTKEAVQLIEDQPV-DKPLFLYLAHLAAH 278
+ +D+ + V E Y DL + K V I DKP FLY H
Sbjct: 244 FDKNDVHAVRGGKQEAVAEITPKYMEDLDQRWMKYGVDFINKMAKNDKPFFLYYGTRGCH 303
Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
N A + P R +Y+ + +++D + AL+ G L+N++I+F
Sbjct: 304 FDNYPNAHYAGRS----------PARTSYSDCIVEMNDVFANLYRALETSGQLDNTLIVF 353
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQM 397
SDNG P E P+RG K + WEGGV+VP + W IQ PR S +
Sbjct: 354 TSDNG-PEAEVPPH-------GRTPFRGAKGSTWEGGVRVPTFVYWKGMIQ--PRKSDGI 403
Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP 457
+ ++D PT + AG + L + PS+ + IDG+DQ SS L T
Sbjct: 404 VDLADLFPTSLSLAGHPGAELAKLV-------------PSK--TFIDGVDQ-SSFFLGTN 447
Query: 458 SRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE--NGTMDGYYG 498
+ N + +AVR+D +K L Q+ T GY G
Sbjct: 448 GQSNRKAEHYFLNGELSAVRIDEFKYHLLIQQPFAFTQTGYQG 490
>gi|430745365|ref|YP_007204494.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430017085|gb|AGA28799.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 476
Score = 159 bits (401), Expect = 6e-36, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 185/364 (50%), Gaps = 55/364 (15%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L L G+ DLS +G+ ++ TPNIDAL +G+ + YA PVC+P+RA+L+TG YP
Sbjct: 35 LILADDLGYGDLSSYGAADLKTPNIDALVASGVRFDRFYANSPVCSPTRAALLTGCYPDL 94
Query: 144 TGMQG------PPIWGA-EPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G+ G WG P+ V LP+ L+ GY T +GKWHLG P
Sbjct: 95 VGVPGVIRTHPDDSWGVLSPQAV-----LLPQVLKGAGYHTALVGKWHLGL-SGASLPSR 148
Query: 197 RGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
RGF+ G+L ++ Y++H +G + R D G +ATDLF++ A
Sbjct: 149 RGFDLFHGFLGDMMDDYHNH----------RRHGINYMRRDDREIDPKG-HATDLFSQWA 197
Query: 256 VQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT---DPNRRTYAAMV 311
+ + E + D+P FL LA+ H ++ PQE + + + DP R A++
Sbjct: 198 IDFLNESKGQDRPFFLELAYNVPHT-----PVQPPQEWLEKVRRRAPNLDPKRARLVALI 252
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+ LD +G V++AL+ G E ++I+F SDNG + + G PYRG K +
Sbjct: 253 EHLDHGIGQVLAALRANGQAERTLIVFTSDNGG------QLDAGAHCG---PYRGGKQDM 303
Query: 372 WEGGVKVP-AILWSPQIQ---QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
+EGG++VP +W +I+ ++ RV++ M D PTL AAG +R +DG+
Sbjct: 304 YEGGIRVPLCAVWPGRIEPGTRSDRVAVSM----DLFPTLCEAAG---ARFEHPVDGISL 356
Query: 428 WSSL 431
S+
Sbjct: 357 LSTF 360
>gi|326927530|ref|XP_003209945.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Meleagris
gallopavo]
Length = 485
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 158/562 (28%), Positives = 239/562 (42%), Gaps = 123/562 (21%)
Query: 89 IVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ 147
+ GW DL G TPN+D +A G++ + YA P+C+PSRA+L+TG+ P+ G
Sbjct: 1 MTMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAASPLCSPSRAALLTGRLPVRNGFY 60
Query: 148 GPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
A R G+ +E LPE L++ GY+ K IGKWHLG R ++ PL
Sbjct: 61 ---TTNAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHLGH-RPQFHPLKH 116
Query: 198 GFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFT 252
GF+ FG N YD+ + + R E+ G D + +L T + T ++
Sbjct: 117 GFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLKTGEANL----TQIYL 172
Query: 253 KEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA+ I + Q +P FLY A A HA A KH ++ R Y
Sbjct: 173 QEALDFISKQQASQQPFFLYWAIDATHAPVYASKH------------FLGTSQRGRYGDA 220
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG ++ LQ+ G+ EN+ + F SDNGA + S + GSN P+ K T
Sbjct: 221 VREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQT 275
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P VS Q+ + D T + G
Sbjct: 276 TFEGGMREPAIAWWPGHIPAGGVSRQLGSVMDLFTTSLSLVG------------------ 317
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
L PS R IDG+D ++L R +E AVR+ +K T N
Sbjct: 318 --LQPPSDR--QIDGIDLLPAILQGKLIDRPIFYYRGNE---MMAVRVGLYKAHYWTWSN 370
Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDKMRSTRQQATIH 549
S ++ S+ I F P N+ + + Q+
Sbjct: 371 ------------------------------SWEEYSKGIDFCPGQNVSGVTTHLQEEH-- 398
Query: 550 CGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPD---ISSQLYELLKYHRRTLVP 606
+ P+ LF+LG DP E+ ++ + + + Q+ +++ H+ T+VP
Sbjct: 399 ---SSLPL-----------LFHLGRDPGEKYPLSFASDEYQGVMRQISAVVQQHKETMVP 444
Query: 607 QSHEQPDLVQADPKRFNDTWSP 628
P L D N WSP
Sbjct: 445 GV---PQLNVCDKAVMN--WSP 461
>gi|240977207|ref|XP_002402636.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215491217|gb|EEC00858.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 159
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 79/162 (48%), Positives = 108/162 (66%), Gaps = 5/162 (3%)
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+T+ AV LI +PLFLYL++ A H G LEAP+E + +F +I + NR TYA M
Sbjct: 1 YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V LD S+GT++ ALQ ++ N+I++F SDNGA + + SN G N+P RG K T
Sbjct: 61 VDALDQSIGTIMEALQEADLIGNTIVVFTSDNGA--LPWGVKSNR---GYNWPLRGGKFT 115
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
LWEGG++ +WSP ++ + RVS QMMHI+DWLPTLY+AAG
Sbjct: 116 LWEGGLRTTTFIWSPLLEMSRRVSNQMMHITDWLPTLYSAAG 157
Score = 56.6 bits (135), Expect = 4e-05, Method: Compositional matrix adjust.
Identities = 26/63 (41%), Positives = 38/63 (60%)
Query: 20 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAL 79
+T+ AV LI +PLFLYL++ A H G LEAP+E + +F +I + NR TYA +
Sbjct: 1 YTERAVSLIRSHNPSQPLFLYLSYQAPHVGTGPSLLEAPEENMKKFLHIPEKNRTTYAGM 60
Query: 80 TKS 82
+
Sbjct: 61 VDA 63
>gi|114326198|ref|NP_001041585.1| N-acetylgalactosamine-6-sulfatase precursor [Canis lupus
familiaris]
gi|122138594|sp|Q32KH5.1|GALNS_CANFA RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|81158068|tpe|CAI85008.1| TPA: galactosamine (N-acetyl)-6-sulfate sulfatase [Canis lupus
familiaris]
Length = 522
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 124/368 (33%), Positives = 175/368 (47%), Gaps = 48/368 (13%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGIYGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM-----------QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G I G G+P E LPE L+E GY +K +GKWHLG R ++
Sbjct: 94 NGFYTTNRHARNAYTPQEIVG----GIPDQEHVLPELLKEAGYVSKIVGKWHLG-HRPQF 148
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYA 247
PL GF+ FG N YD+ R E+ G + NL T +
Sbjct: 149 HPLKHGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLKTGEANL---- 204
Query: 248 TDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T ++ +EA+ I+ Q +P FLY A A HA ++ R
Sbjct: 205 TQVYLQEALDFIKRQQAAQRPFFLYWAIDATHA-----------PVYASRPFLGTSQRGR 253
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y V+++D+SVG ++S LQ + EN+ + F SDNGA + S GSN P+
Sbjct: 254 YGDAVREIDNSVGKILSLLQDLRISENTFVFFTSDNGAALI-----SAPNQGGSNGPFLC 308
Query: 367 VKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDG 424
K T +EGG++ PAI W P RVS Q+ I D T + AG + R+ IDG
Sbjct: 309 GKQTTFEGGMREPAIAWWPGRIPAGRVSHQLGSIMDLFTTSLSLAGLAPPSDRV---IDG 365
Query: 425 LDQWSSLL 432
LD ++L
Sbjct: 366 LDLLPAML 373
>gi|332263239|ref|XP_003280658.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Nomascus leucogenys]
Length = 528
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 118/357 (33%), Positives = 168/357 (47%), Gaps = 41/357 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI G
Sbjct: 48 GWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIRNGFY--- 104
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R GVP +E+ LPE L++ GY +K +GKWHLG R ++ PL GF+
Sbjct: 105 TTNAHARNAYTPQEIVGGVPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLKHGFD 163
Query: 201 SHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEA 255
FG N YD+ R E+ G + NL T + T ++ +EA
Sbjct: 164 EWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----TQIYLQEA 219
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
+ I+ Q P FLY A A HA ++ R Y ++++D
Sbjct: 220 LDFIKRQARRHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYGDAIREID 268
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
DS+G ++ L + +N+ + F SDNGA + E GSN P+ K T +EGG
Sbjct: 269 DSIGKMLELLHNLHIADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGKQTTFEGG 323
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
++ PA+ W P +VS Q+ I D L T A G T IDGL +LL
Sbjct: 324 MREPALAWWPGHVAAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLSLLPTLL 379
>gi|50753991|ref|XP_414208.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Gallus gallus]
Length = 513
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 122/374 (32%), Positives = 179/374 (47%), Gaps = 44/374 (11%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
A + L L+ GW DL G TPN+D +A G++ + YA P+C+PSRA+L
Sbjct: 17 CAAPPNVVLLLMDDMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYAANPLCSPSRAAL 76
Query: 136 MTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHL 185
+TG+ P+ G A R G+ +E LPE L++ GY+ K IGKWHL
Sbjct: 77 LTGRLPVRNGFYTT---NAHARNAYTPQDIVGGIQDSEILLPELLKKAGYTNKIIGKWHL 133
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAW 240
G R ++ PL GF+ FG N YD+ + + R E+ G D + +L T
Sbjct: 134 GH-RPQFHPLKHGFDEWFGSPNCHFGPYDNRALPNIPVYRDWEMIGRYYEDFKIDLRTGE 192
Query: 241 DTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAG-NAGKHLEAPQETINQFQY 298
+ T ++ +EA+ I + Q +P FLY A A HA A KH +
Sbjct: 193 ANL----TQIYLQEALDFISKQQASQQPFFLYWAIDATHAPVYASKH------------F 236
Query: 299 ITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
+ R Y V+++DDSVG ++ LQ+ G+ EN+ + F SDNGA + S +
Sbjct: 237 LGTSQRGRYGDAVREIDDSVGKILKHLQKLGISENTFVFFTSDNGAALI-----SAPKQG 291
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
GSN P+ K T +EGG++ PAI W P VS Q+ + D T + G
Sbjct: 292 GSNGPFLCGKQTTFEGGMREPAIAWWPGHIPAGSVSRQLGSVMDLFTTSLSLVGLQPPS- 350
Query: 419 PLNIDGLDQWSSLL 432
IDG+D ++L
Sbjct: 351 DRQIDGIDLLPAIL 364
>gi|72161757|ref|YP_289414.1| N-acetylgalactosamine-6-sulfatase [Thermobifida fusca YX]
gi|71915489|gb|AAZ55391.1| n-acetylgalactosamine-6-sulfate sulfatase [Thermobifida fusca YX]
Length = 471
Score = 158 bits (400), Expect = 7e-36, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 171/357 (47%), Gaps = 45/357 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +GS I TPN+D LA GI + YA P C+ +R SL TG+YP + G++
Sbjct: 32 GWADLGCYGSTTIRTPNLDQLAAQGIRFTHGYAGSPWCSSTRISLYTGRYPGRLQAGLEE 91
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + + G+P L L E GY+T GKWH G+ Y+PL GFE+ FG +G
Sbjct: 92 PLVTRSPENGIPEGHPTLSSLLVEAGYATAMFGKWHCGWLPW-YSPLRIGFETFFGNFDG 150
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+ Y++H+ D + G T + VG Y T++ ++ A + I ++P
Sbjct: 151 ALDYFEHV--DTLGKADLYEG-------ETPVEEVG-YYTEIISERAAEYITAH-RNRPF 199
Query: 269 FLYLAHLAAHAGNAG-KHLEAPQETINQFQ---------YITDPNRRTYAAMVKKLDDSV 318
++ L + A H G E QE ++Q ++ + Y +V+ +D +
Sbjct: 200 YVQLNYTAPHWPWEGPDDHEVGQEIRRRYQQRWEHSPLMHLDGGSIAKYGELVEAMDAGI 259
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V++AL R G +N+I++F SDNG W N+P+ G K L EGG++V
Sbjct: 260 GQVLAALDRAGAADNTIVVFSSDNGG-----------ERWSKNWPFVGEKGDLTEGGIRV 308
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD---------TSRLPLNIDGLD 426
P I+ P+ +VS + DW TL AAG + LP +DG D
Sbjct: 309 PLIVAWPEAIAGNQVSDHPVITMDWTATLLAAAGTEPHPDWPLDGVDLLPWLVDGAD 365
>gi|119713178|gb|ABL97246.1| sulfatase [uncultured marine bacterium EB0_50A10]
Length = 544
Score = 158 bits (400), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 123/394 (31%), Positives = 185/394 (46%), Gaps = 79/394 (20%)
Query: 85 LTLLIVYGWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGK 139
L L G+ND+S H + T NIDALA +GI+ YA C PSRAS+MTGK
Sbjct: 64 LVLADDMGYNDISIHNGGAADGTLQTKNIDALAKSGILFTRGYAANATCAPSRASIMTGK 123
Query: 140 YPIHTGMQGPPI---------WGAEP----------------------RGVPLTERFLPE 168
YP G + PI W AE +G+P + + E
Sbjct: 124 YPTRFGYEFTPIPAFGRTVLGWLAEEDNFELKQRIDREVVSNMPPFMEQGMPTEQITIAE 183
Query: 169 YLRELGYSTKAIGKWHLGFFRREY--TPLYRGFESHFGYLNGVISYYDH--ILSDQYSRT 224
LR+ GY T IGKWHLG EY P+ +GF+ G + + DH +++ ++
Sbjct: 184 VLRDAGYYTAHIGKWHLG---HEYGMDPMSQGFQDSLGLVGPLYLPEDHPDVVNAKFDTR 240
Query: 225 VE------------LNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYL 272
++ NG D+ +Y TD +T EA+++IE+ ++P FLYL
Sbjct: 241 IDKMIWGMGQYSANFNGGDLF--------APDKYVTDYYTDEALKVIENNK-NRPFFLYL 291
Query: 273 AHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLE 332
+H A H L+A + Q ++ N + Y+ M+ LD SVG +I L+ +
Sbjct: 292 SHWAIH-----NPLQALRSDFEQMSHMHGHNLQVYSGMINSLDRSVGKIIEKLKELDIYG 346
Query: 333 NSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR 392
++IIF SDNG Y E ++ N PYRG K + ++GG++VP I+ P +
Sbjct: 347 KTLIIFTSDNGG--ANYIELNDI-----NKPYRGWKISFFDGGIRVPYIISWPDEINPGK 399
Query: 393 VSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
S +H D PT+ AAG +++ +DG+D
Sbjct: 400 KSENAVHHFDIFPTILKAAGIESTN---ELDGVD 430
Score = 40.8 bits (94), Expect = 2.5, Method: Compositional matrix adjust.
Identities = 22/69 (31%), Positives = 38/69 (55%), Gaps = 6/69 (8%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y TD +T EA+++IE+ ++P FLYL+H A H L+A + Q ++ N
Sbjct: 265 KYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRSDFEQMSHMHGHNL 318
Query: 74 RTYAALTKS 82
+ Y+ + S
Sbjct: 319 QVYSGMINS 327
>gi|431907857|gb|ELK11464.1| Arylsulfatase B [Pteropus alecto]
Length = 274
Score = 158 bits (400), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 81/209 (38%), Positives = 124/209 (59%), Gaps = 22/209 (10%)
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
IHTG+Q IW +P VPL E+ LP+ L+E GY+T +GKWHLG +R+E P RGF++
Sbjct: 22 IHTGLQHQIIWPCQPSCVPLDEKLLPQLLKEAGYTTHMVGKWHLGMYRKECLPTRRGFDT 81
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRR---------NLSTAWDTVGEYATDLFT 252
+FGYL G YY H R +N ++ R ++T ++ + Y+T++FT
Sbjct: 82 YFGYLLGSEDYYSH------ERCTLINALNVTRCALDFRDGEEVATGYENM--YSTNIFT 133
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR YA MV
Sbjct: 134 QRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRHYYAGMVS 188
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSD 341
+D++VG V +AL+ G+ N++ IF ++
Sbjct: 189 LMDEAVGNVTAALKSHGLWNNTVFIFSTE 217
Score = 55.1 bits (131), Expect = 1e-04, Method: Compositional matrix adjust.
Identities = 36/111 (32%), Positives = 58/111 (52%), Gaps = 14/111 (12%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y+T++FT+ A LI + P +KPLFLYLA + H + L+ P+E + + +I D NR
Sbjct: 127 YSTNIFTQRATALITNHPPEKPLFLYLALQSVH-----EPLQVPEEYLKPYDFIQDKNRH 181
Query: 75 TYAALTK------STTLTLLIVYG-WNDLSFHGSNEIPTPNIDALAYNGII 118
YA + L +G WN+ F S E+ + + +A++G I
Sbjct: 182 YYAGMVSLMDEAVGNVTAALKSHGLWNNTVFIFSTELLS--VLVVAFDGAI 230
>gi|343082868|ref|YP_004772163.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342351402|gb|AEL23932.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 462
Score = 158 bits (399), Expect = 8e-36, Method: Compositional matrix adjust.
Identities = 112/372 (30%), Positives = 183/372 (49%), Gaps = 56/372 (15%)
Query: 86 TLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
LLI+ G++DL G+ +I TP +D LA G+ L N Y CTPSR+ L+TG+YP
Sbjct: 27 VLLIISDDQGYHDLGSFGNKDILTPYLDQLAAEGVRLTNFYVTSSACTPSRSGLLTGRYP 86
Query: 142 -------------IHTGMQGPPI-WGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWH 184
++ G Q + P G L E F+ E L+E GY+ IGKW
Sbjct: 87 QRNGTFELFRNNMVNYGHQYTDYEYSVSPERILGTDLREIFISELLKEGGYTNGYIGKWD 146
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
LG +R Y P+ +G++ ++G+ N I Y+ H ++Y ++G+ + + + G
Sbjct: 147 LGQLKR-YLPMQQGYDKYYGFANTGIDYFTH---ERYGSPSMIDGNQL------SLEDKG 196
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYI---- 299
Y T LF +EA++ I + KP FL LA A H A N ++ + ++ +
Sbjct: 197 RYTTALFEREALEFI-NGSQGKPFFLTLAFNAPHSASNLDPEIKGTVQATEKYLKMYPKS 255
Query: 300 ---TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
+R Y A V ++D S+GT++ ++ G+ EN+++IF+SDNG ++
Sbjct: 256 NTKRGEKQRAYKAAVTQMDQSIGTILERIKEMGLEENTLVIFLSDNGGMSL--------- 306
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
NYP +G K +EGGV+VP ++ P+ + + + + P + AAG
Sbjct: 307 --ADNYPLKGGKAQFFEGGVRVPCLVKWPEKIDAGQTNDAFLSSLELFPMIVAAAG---L 361
Query: 417 RLPLNI--DGLD 426
LP ++ DG D
Sbjct: 362 ELPKDVIYDGFD 373
>gi|149195951|ref|ZP_01873007.1| sulfatase [Lentisphaera araneosa HTCC2155]
gi|149140798|gb|EDM29195.1| sulfatase [Lentisphaera araneosa HTCC2155]
Length = 464
Score = 158 bits (399), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 112/354 (31%), Positives = 176/354 (49%), Gaps = 40/354 (11%)
Query: 95 DLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGM----QGPP 150
D++ +GS ++ TPN+D LA +G+ YA VC P+RA LMTG++P + + QG
Sbjct: 46 DVNCYGSKDLYTPNMDKLAEDGVRFTQAYAHQVCCPARAMLMTGRHPQRSNVNHWTQGDA 105
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G + R + L E L E L++ GY T GKWHLG +Y P +GF+ +G G I
Sbjct: 106 K-GPKTRNMNLEEYTLAEALKDSGYKTALFGKWHLG-AHLDYGPTKQGFDEFYGIRGGFI 163
Query: 211 SYYDHILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
Y+H L+G HD+ +D G+Y +L T A+ I D+ + P
Sbjct: 164 DNYNHYF---------LHGEGFHDLYEGTKEVFDE-GKYFPNLVTDRALNFI-DRNKNNP 212
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FL+LA H +A + +++ + P R++YA M+ DD +G ++S LQ
Sbjct: 213 FFLFLAFNIPHYPE-----QADPKFDERYKNMKMP-RQSYAKMISTTDDHMGQIMSKLQE 266
Query: 328 KGMLENSIIIFMSDNG-APTVEYRETSNYRN-WGSNYPY------------RGVKNTLWE 373
G+ +N+IIIFMSDNG + + + N+++ N Y RG K+ +E
Sbjct: 267 HGIYDNTIIIFMSDNGHSRERNHIKFDNHKSGLAKNTKYGALGGGGNTGKWRGNKSNFYE 326
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
GG++VPAI+ P V Q + DW+PT+ + ++ + L Q
Sbjct: 327 GGIRVPAIITFPNKLPKGAVRDQAITAMDWMPTVLELCNIEPPKIKFDGKSLTQ 380
>gi|301788958|ref|XP_002929896.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like, partial
[Ailuropoda melanoleuca]
Length = 519
Score = 158 bits (399), Expect = 9e-36, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 176/364 (48%), Gaps = 40/364 (10%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 90
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P E LPE L+ GY++K +GKWHLG R ++ PL
Sbjct: 91 NGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 149
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 150 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANL----TQVY 205
Query: 252 TKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA+ ++ Q V +P FLY A HA ++ R Y
Sbjct: 206 LQEALDFMKRQQVAQRPFFLYWAIDGTHA-----------PVYASRPFLGTSRRGRYGDA 254
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG ++S L+ + EN+ + F SDNGA + S + GSN P+ K T
Sbjct: 255 VREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGKQT 309
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
+EGG++ PAI W P +VS Q+ + D T AG + R+ IDGLD
Sbjct: 310 TFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV---IDGLDLL 366
Query: 429 SSLL 432
++L
Sbjct: 367 PAML 370
>gi|403072042|pdb|4FDI|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072043|pdb|4FDI|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A
gi|403072044|pdb|4FDJ|A Chain A, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
gi|403072045|pdb|4FDJ|B Chain B, The Molecular Basis Of Mucopolysaccharidosis Iv A, Complex
With Galnac
Length = 502
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 172/364 (47%), Gaps = 41/364 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+ +PSRA+L+TG+ PI
Sbjct: 9 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGLLFPNFYSANPLXSPSRAALLTGRLPIR 68
Query: 144 TGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
G A R G+P +E+ LPE L++ GY +K +GKWHLG R ++
Sbjct: 69 NGFY---TTNAHARNAYTPQEIVGGIPDSEQLLPELLKKAGYVSKIVGKWHLG-HRPQFH 124
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYAT 248
PL GF+ FG N YD+ R E+ G + NL T + T
Sbjct: 125 PLKHGFDEWFGSPNCHFGPYDNKARPNIPVYRDWEMVGRYYEEFPINLKTGEANL----T 180
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 181 QIYLQEALDFIKRQARHHPFFLYWAVDATHA-----------PVYASKPFLGTSQRGRYG 229
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ + +N+ + F SDNGA + E GSN P+ K
Sbjct: 230 DAVREIDDSIGKILELLQDLHVADNTFVFFTSDNGAALISAPEQG-----GSNGPFLCGK 284
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PA+ W P +VS Q+ I D L T A G T IDGL+
Sbjct: 285 QTTFEGGMREPALAWWPGHVTAGQVSHQLGSIMD-LFTTSLALAGLTPPSDRAIDGLNLL 343
Query: 429 SSLL 432
+LL
Sbjct: 344 PTLL 347
>gi|313246966|emb|CBY35811.1| unnamed protein product [Oikopleura dioica]
Length = 388
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 103/305 (33%), Positives = 164/305 (53%), Gaps = 17/305 (5%)
Query: 127 VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
+C+PSRA +TG+Y G+ PI P G+ E+ LPEYL+E+GY T A+GKWHLG
Sbjct: 20 LCSPSRAQFLTGRYAFRYGLGSDPISFENPIGMSTKEKLLPEYLKEVGYETHAVGKWHLG 79
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+ + P RGF++ G+ G + Y+ H + Q + LN H + D E+
Sbjct: 80 YCNESFQPHNRGFDTFLGHYGGGVDYHTH--ATQGALGSYLN-HFLNGEPHIPEDGF-EF 135
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPN- 303
A+ ++ +++ + DKP F+YLA A H + + APQ+ I + + Y P
Sbjct: 136 ASYAWSNRTRKVLREN-TDKPNFVYLAFNAPH-----EKVAAPQDLIEEAKSMYPGIPGT 189
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
RR A VK +D ++ +VI + K + +I++F +DNG + Y + G +YP
Sbjct: 190 RRLQLAAVKSIDVAMQSVIE--EAKTLERETIVVFHTDNGGALLAYAGGFDGPR-GCSYP 246
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
Y+G K+ L EGG P ++S + Q + R M+HI D+ PT+ AG D +P ++D
Sbjct: 247 YKGYKSVLAEGGTLSPTWVYSTKRQFHSRYIDGMIHIMDFFPTILRWAGYDKP-MPRDLD 305
Query: 424 GLDQW 428
G+DQ+
Sbjct: 306 GIDQY 310
>gi|149175125|ref|ZP_01853748.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
gi|148846103|gb|EDL60443.1| N-acetylgalactosamine-6-sulfate sulfatase [Planctomyces maris DSM
8797]
Length = 413
Score = 158 bits (399), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 136/473 (28%), Positives = 213/473 (45%), Gaps = 89/473 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +GS TP++D LA NGI + ++ VC+P+RA L+TG+Y G+ G
Sbjct: 6 GYGDLSCYGSQNCNTPHLDRLAANGIRFTDFHSSGAVCSPTRAGLLTGRYQQRAGIDG-- 63
Query: 151 IWGAEPR-----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
+ A P+ G+ E L + L++ GY T GKWHLG+ +R+Y P +RGF+ GY
Sbjct: 64 VVYANPKKNRHHGLQKNEITLAQCLQDAGYQTGMFGKWHLGY-QRQYNPTFRGFQQFVGY 122
Query: 206 LNGVISYYDHILSD---QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
++G + Y+ H+ + ELN + Y T L A++ I Q
Sbjct: 123 VSGNVDYFAHLDGTGVFDWWHNAELNREEQ------------GYVTHLINDHALEFIRQQ 170
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMVKKLDDSV 318
+KP F+Y+AH A H+ G H + ++ I R+ Y M ++D +
Sbjct: 171 Q-EKPFFVYIAHEAVHSPYQGPHDQPMRKEGG--GDIKSAKRKDIANAYREMNTEMDKGI 227
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G ++ L+ + E + I F+SDNGA GSN RG K +LWEGG +V
Sbjct: 228 GQIVDVLKEVNLTEKTFIFFLSDNGA-----------NKNGSNGKLRGFKGSLWEGGHRV 276
Query: 379 PAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
PAI W +I + ++ I D +PT+ A +++P
Sbjct: 277 PAIACWPGRIPEGTVCDEPVISI-DLMPTILELA---NAKIPA----------------- 315
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINID-EKKRTAAVRLDSWKLVLGTQENGTMDGY 496
+DG+ SL+ R++ V I E +A+R WKLVL
Sbjct: 316 --GHKLDGV----SLVSLLKDRKSLVPRQIFWEYNGKSAMRQGHWKLVL----------- 358
Query: 497 YGQTRSNKVPLLNFNA-IVESKTY-----QSLQQLSQNIFLPISNIDKMRSTR 543
QTR + L + + ESK Q +QQ+ + S++ K +T+
Sbjct: 359 -NQTRKEPIELYDLTRDMSESKNLADNQPQRVQQMQSALAAWKSDVQKTATTQ 410
>gi|414070344|ref|ZP_11406330.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
gi|410807261|gb|EKS13241.1| Arylsulfatase [Pseudoalteromonas sp. Bsw20308]
Length = 470
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 109/358 (30%), Positives = 169/358 (47%), Gaps = 32/358 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G++D F GS + TP ID LA ++ Y VC PSRA L TGKY G +
Sbjct: 38 GYHDFGFQGSEVMQTPTIDKLASQSVVFEQAYVTAAVCGPSRAGLYTGKYQQRFGFEENN 97
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+P T+ + E+L+ELGY T GKWH G +Y P RGF++
Sbjct: 98 VPGYMSKSGFTGDKMGLPFTQVTMAEHLKELGYHTGLFGKWHQG-NHDDYHPTKRGFDNF 156
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+G+ G Y+ + +Q + + M R+ + G Y TD + I
Sbjct: 157 YGFREGARGYFAYSNEEQQAYPSQ----KMERDFKHYIEHEG-YLTDALATQTSHFIGQS 211
Query: 263 PVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
V+K P F L+ A HA ++A + QF ++T R+ AAM +D ++ V
Sbjct: 212 VVNKQPFFAVLSFSAVHAP-----MQATNADLAQFPHLTG-KRKILAAMNFAMDRAIAVV 265
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ L + +N++++F++DNG PT + N+ +NYP G K EGG++VP +
Sbjct: 266 LDKLNALKISDNTLVVFINDNGGPTDQ--------NYANNYPLSGTKANHLEGGIRVPML 317
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
L W ++ N + D LPT AA G + + +DG+D LL + R
Sbjct: 318 LKWPKRLSSNNTRYTYPVSSLDLLPTFIHAANGKVTE-QMQLDGVDLMPFLLSKSTQR 374
>gi|449138314|ref|ZP_21773584.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
gi|448883087|gb|EMB13631.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
europaea 6C]
Length = 485
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 125/402 (31%), Positives = 186/402 (46%), Gaps = 70/402 (17%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
LLIV G+NDL G N I TP +D LA G L N Y A P CTPSRASL+TG+YP
Sbjct: 33 LLIVSDDQGYNDLGQLG-NGIITPALDRLANEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91
Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
G+ P +G G+ E +P L+ GY + GKW L
Sbjct: 92 RNGIYDMIRNEAPDYGHRYTPDQYAVTFERIGGMDEREVIIPAVLKRAGYKSGIYGKWDL 151
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G RR P RGF+ +G++N I Y+ H ++Y M RNL + G
Sbjct: 152 GALRR-MLPNSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEEDKGT 201
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA-----GKHLEAPQE--------- 291
Y T LF +EA++ +++ ++P FLY+ A H ++ ++AP+
Sbjct: 202 YCTYLFQREALRFLDEHAGEEPFFLYVPFNAPHNSSSLDPKIRSSVQAPERFKAMYPPVE 261
Query: 292 ----TINQFQYIT-------DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
++++Y T RR Y A V +D ++G ++ L+ K MLE++I++F S
Sbjct: 262 SETRVTDRYRYGTPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLEDTIVVFFS 321
Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
DNG N P RG K WEGG++VP I+ W+ V+ + +
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCIVRWTAGKIPAGAVNDEFLT 370
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
+ LP+ +AAG + + +DG D W +L + S R
Sbjct: 371 SLELLPSFASAAGLEPP-ADIVLDGFDWWPTLRGESASPREE 411
>gi|260062070|ref|YP_003195150.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783632|gb|EAR14803.1| arylsulphatase A [Robiginitalea biformata HTCC2501]
Length = 459
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 124/344 (36%), Positives = 178/344 (51%), Gaps = 41/344 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS G+ ++ +PNIDALA NG+ N YA VC+PSRA+L+TG+YP G+ G
Sbjct: 53 GYGDLSCQGATDLQSPNIDALAANGMRFTNFYANSTVCSPSRAALLTGRYPDLVGVPGVI 112
Query: 151 IWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
E L + +P L GY T IGKWHLG + TP RGF G+L
Sbjct: 113 RQNPENNWGNLADDAVLIPSELNPAGYHTGIIGKWHLGLEEPD-TPNDRGFTYFKGFLGD 171
Query: 209 VIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDK 266
++ Y+DH R +N + R D G +ATDLFT + + E Q ++
Sbjct: 172 MMDDYWDH-------RRGGINWMRLNRE---EIDPKG-HATDLFTDWTIDFLKERQGEEQ 220
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMVKKLDDSVGTVI 322
P FLYLA+ A H ++ P+E +++ + +PN R A V+ LD SVG V+
Sbjct: 221 PFFLYLAYNAPHF-----PIQPPREWLDKVRE-REPNLTEKRAKNVAFVEHLDYSVGRVM 274
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL+ G+ EN++++F+SDNG + Y + SN P RG K ++EGG++VPAI
Sbjct: 275 EALKTTGLEENTLVVFVSDNGG-ALWYAQ--------SNGPLRGGKQDMYEGGIRVPAIF 325
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
W +I S + D PT AG + P N+DG+
Sbjct: 326 YWKGKIAPG-TTSDNTALLMDLFPTFCELAG---RKPPENVDGI 365
>gi|323450479|gb|EGB06360.1| hypothetical protein AURANDRAFT_13369, partial [Aureococcus
anophagefferens]
Length = 304
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 104/321 (32%), Positives = 153/321 (47%), Gaps = 35/321 (10%)
Query: 128 CTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
CTPSR SL++G+YP+ G+Q + A G+ R LPE L+ GY T +GKWHLG
Sbjct: 1 CTPSRGSLLSGRYPVRLGLQRGVVSAASKWGLDPAIRLLPETLQTAGYRTALVGKWHLGH 60
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT----- 242
+ TP RGF +G+L G + ++ H+ + E D L W
Sbjct: 61 YNESLTPARRGFGESYGFLGGFVDHFTHVAEAPPACASETPATDGTCALD--WQVNGLPR 118
Query: 243 ---VGEYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY 298
G YA A +L+ + ++PLFLY A H P + + +
Sbjct: 119 PRATGVYAAYACRDAARRLVASHEGKEEPLFLYYALPTPH------EPITPPDDVETYDL 172
Query: 299 I----TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
+ RR YAA+V D ++G + +ALQ+ + N++++ +SDNGA
Sbjct: 173 LPKLDVAGRRRDYAAVVAATDGAIGDLQAALQKHALWPNTLLVLLSDNGASAAG------ 226
Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ--MMHISDWLPTLYTAAG 412
GSN P RG K +EGGV+ A+L + QN R S ++H++DW PTL AAG
Sbjct: 227 ----GSNAPLRGAKGGYFEGGVRTRALLAGSLLPQNVRGSSHEGLVHVADWAPTLLNAAG 282
Query: 413 --GDTSRLPLNIDGLDQWSSL 431
D +DG+DQW +L
Sbjct: 283 LRPDLVAGFDAMDGVDQWDAL 303
>gi|423293952|ref|ZP_17272079.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
gi|392677173|gb|EIY70592.1| hypothetical protein HMPREF1070_00744 [Bacteroides ovatus
CL03T12C18]
Length = 453
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 135/424 (31%), Positives = 208/424 (49%), Gaps = 68/424 (16%)
Query: 79 LTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
+ K+T LLI+ G DLS + ++ TPNID + G+ L+N YA V +PSRA+
Sbjct: 26 VDKATPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAA 85
Query: 135 LMTGKYPIHTGMQG---PPI---WG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
L+TG++P G+ G P I WG +P V + PE L+ GY T IGKWHLG+
Sbjct: 86 LLTGRFPAMVGVPGVIRPSIDQNWGYFDPSAVTM-----PEVLKNGGYRTALIGKWHLGW 140
Query: 188 FRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+ P RGFE G+L ++ YY H Q + LN ++ T +
Sbjct: 141 -KSPNLPNERGFEHFHGFLADMMDDYYTH--QRQGGNYMYLNDKEI---------TPKGH 188
Query: 247 ATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP--- 302
AT+LFT +V I+ + +K P FLYLA+ A H+ L+ P E +N+ Q
Sbjct: 189 ATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPV 243
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R A+++ LDD++G V+ AL+ G L N+++IF SDNG + + +N
Sbjct: 244 KRARLIALIEHLDDNIGKVMQALEESGQLSNTLVIFASDNGG---------DRGSMANNG 294
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL-- 420
P RG K ++EGG+ VP L P + + R + + D +PT+ D +P+
Sbjct: 295 PTRGAKGDMFEGGIHVPCALNMPGVFEGGRRDNHFVVMMDLMPTIC-----DFVNVPVRH 349
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDS 480
IDG+ +L+ R+ N + D++ L N + K +AVR +
Sbjct: 350 EIDGIS-----VLDAIKRKKQNTE--DRFVFWLRNEGGPQFG-------GKSQSAVRYNE 395
Query: 481 WKLV 484
+KL+
Sbjct: 396 YKLL 399
>gi|423330335|ref|ZP_17308119.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
gi|409231951|gb|EKN24799.1| hypothetical protein HMPREF1075_00132 [Parabacteroides distasonis
CL03T12C09]
Length = 459
Score = 157 bits (398), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 122/362 (33%), Positives = 177/362 (48%), Gaps = 45/362 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPNID +A G+ L Y V TPSRA+LMTG+ P+ G+ G
Sbjct: 42 GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ P G+ E + + L++ GY+T +GKWHLG F Y P GF+++FG
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
I Y + + V+ G R T G+ T +T++AV
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I+D +P FLY AH H P T +F + R Y +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFAGTS--KRGLYGDVVEEIDWSV 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V+ AL+ G+ EN+ +IF SDNG E + N GS P + K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
PAI W P + NP ++ ++M D PT + AG + + L +DG++Q L S
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369
Query: 439 RN 440
R+
Sbjct: 370 RD 371
>gi|47522740|ref|NP_999120.1| N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
gi|75054309|sp|Q8WNQ7.1|GALNS_PIG RecName: Full=N-acetylgalactosamine-6-sulfatase; AltName:
Full=Chondroitinsulfatase; Short=Chondroitinase;
AltName: Full=Galactose-6-sulfate sulfatase; AltName:
Full=N-acetylgalactosamine-6-sulfate sulfatase;
Short=GalNAc6S sulfatase; Flags: Precursor
gi|18028088|gb|AAL55968.1|AF322917_1 N-acetylgalactosamine-6-sulfatase precursor [Sus scrofa]
Length = 522
Score = 157 bits (397), Expect = 1e-35, Method: Compositional matrix adjust.
Identities = 120/362 (33%), Positives = 173/362 (47%), Gaps = 36/362 (9%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ + YA P+C+PSRA+L+TG+ PI
Sbjct: 34 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPSFYAANPLCSPSRAALLTGRLPIR 93
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG G P+ G+P E LPE L+ GY++K +GKWHLG R ++ PL
Sbjct: 94 TGFYTTNGHARNAYTPQEIVGGIPDPEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 152
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 153 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRFYEEFPINLKTGESNL----TQIY 208
Query: 252 TKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA+ I+ Q P FLY A A HA ++ R Y
Sbjct: 209 LQEALDFIKRQQATHHPFFLYWAIDATHA-----------PVYASRAFLGTSQRGRYGDA 257
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG ++ L+ + N+ + F SDNGA V S + GSN P+ K T
Sbjct: 258 VREIDDSVGRIVGLLRDLKIAGNTFVFFTSDNGAALV-----SAPKQGGSNGPFLCGKQT 312
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+EGG++ PAI W P +VS Q+ + D T + AG + IDGLD +
Sbjct: 313 TFEGGMREPAIAWWPGHIPAGQVSHQLGSVMDLFTTSLSLAGLEPPS-DRAIDGLDLLPA 371
Query: 431 LL 432
+L
Sbjct: 372 ML 373
>gi|325285326|ref|YP_004261116.1| arylsulfatase [Cellulophaga lytica DSM 7489]
gi|324320780|gb|ADY28245.1| Arylsulfatase [Cellulophaga lytica DSM 7489]
Length = 466
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 103/347 (29%), Positives = 173/347 (49%), Gaps = 37/347 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS E TP +D LA + Y + VC PSRA ++TG+Y G +
Sbjct: 38 GYADFGFQGSTEFKTPELDKLAKKSVKFTQAYVSAAVCGPSRAGMLTGQYQQKFGFEENN 97
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+P + + +YL++ GY+T GKWH+G + P RGF+
Sbjct: 98 VPGYMSSSGLLGDQMGLPTNQLTIADYLKKQGYTTALFGKWHMGN-ADLFHPTKRGFDEF 156
Query: 203 FGYLNGVISY--YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
+G+ G SY YD + + N + + + G Y TD EA+ I
Sbjct: 157 YGFRGGSRSYLPYDK------NNVLTKNEDRLEKGFGNYLEHKG-YLTDKLANEAISFI- 208
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
++ P F+YL+ A H +EA +E + F ++T R+T AAM +D ++GT
Sbjct: 209 NKNKKTPFFIYLSFNAVHT-----PMEATKEDLKYFSHLTG-KRKTLAAMTLAMDRAIGT 262
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V+ L+++G+ +N++++F +DNG P+ N +N P G K EGG++VP
Sbjct: 263 VLDELKKQGLDKNTLVVFTNDNGGPSDA--------NQSNNNPLSGTKANHLEGGIRVPF 314
Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
++ W ++++N + + + D PT A G++S+L +DG+D
Sbjct: 315 LMSWPGKLKKNTTYNYPISTL-DLFPTFLNVANGNSSKLK-QLDGVD 359
>gi|281346853|gb|EFB22437.1| hypothetical protein PANDA_020197 [Ailuropoda melanoleuca]
Length = 520
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 120/364 (32%), Positives = 175/364 (48%), Gaps = 38/364 (10%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 30 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYSANPLCSPSRAALLTGRLPIR 89
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P E LPE L+ GY++K +GKWHLG R ++ PL
Sbjct: 90 NGFYTTNGHARNAYTPQEIVGGIPDGEHLLPELLKGAGYASKIVGKWHLG-HRPQFHPLK 148
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDLF 251
GF+ FG N YD+ R E+ G + NL T + T ++
Sbjct: 149 HGFDEWFGSPNCHFGPYDNRARPNIPVYRDWEMVGRYYEEFPINLQTGEANL----TQVY 204
Query: 252 TKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAM 310
+EA+ ++ Q V +P FLY A HA ++ R Y
Sbjct: 205 LQEALDFMKRQQVAQRPFFLYWAIDGTHA-----------PVYASRPFLGTSRRGRYGDA 253
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V+++DDSVG ++S L+ + EN+ + F SDNGA + + GSN P+ K T
Sbjct: 254 VREIDDSVGKILSLLRDLRITENTFVFFTSDNGAALISAPKQGQC---GSNGPFLCGKQT 310
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG--GDTSRLPLNIDGLDQW 428
+EGG++ PAI W P +VS Q+ + D T AG + R+ IDGLD
Sbjct: 311 TFEGGMREPAIAWWPGRIPAGQVSHQLGSVMDLFTTSLALAGLAPPSDRV---IDGLDLL 367
Query: 429 SSLL 432
++L
Sbjct: 368 PAML 371
>gi|340373733|ref|XP_003385394.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 389
Score = 157 bits (397), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 180/367 (49%), Gaps = 43/367 (11%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYN-GIILNNMYAQPVCTPSRA 133
A + L ++V +G+ D+SF I +P+ ++LA G+IL+ Y C+PSRA
Sbjct: 17 ATVNAKPNLVFVLVDDWGFADVSFRNP-AISSPHFESLATKEGLILDRHYVFKYCSPSRA 75
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
S +TG++P H PP G + +T +P L+ GY T +GKWH GF+ +++
Sbjct: 76 SFLTGRFPHHAHQWNPPQSGLVGANINMT--MIPAKLKTAGYKTHMVGKWHEGFYLKKFL 133
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV-GEYATDLFT 252
P+ RGF++ G+L G DH+ + D +N A DT G Y +
Sbjct: 134 PINRGFDTMSGFLGGG---EDHMTQEIICAV------DYWKN--DAIDTRNGTYDDYTYV 182
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-ITDPNRRTYAAMV 311
++ +++ + P FLYL HA L+AP E +N + T +RRT AMV
Sbjct: 183 EDLSKILTNHDPSDPFFLYLPLHNVHAP-----LQAPDEFLNLYPVNSTCSDRRTIQAMV 237
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
D+ G V+ L+ KGM +N++I+ +DNG E GSNYP +G K T
Sbjct: 238 SVADNVTGFVVKMLKDKGMWDNTLIVVSADNGGAPCE----------GSNYPLKGSKMTF 287
Query: 372 WEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGDT-----SRLPLNIDG 424
+EGGV+ A + + + R + +HI+DW T AG D+ + P +DG
Sbjct: 288 YEGGVRSLAFVSGGLLPSDRRGKKTEGFIHIADWYTTFCQLAGVDSDDSGAGKFP--VDG 345
Query: 425 LDQWSSL 431
++ W L
Sbjct: 346 MNVWPIL 352
>gi|298375848|ref|ZP_06985804.1| sulfatase family protein [Bacteroides sp. 3_1_19]
gi|298266885|gb|EFI08542.1| sulfatase family protein [Bacteroides sp. 3_1_19]
Length = 459
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 121/362 (33%), Positives = 178/362 (49%), Gaps = 45/362 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPNID +A G+ L Y V TPSRA+LMTG+ P+ G+ G
Sbjct: 42 GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ P G+ E + + L++ GY+T +GKWHLG F Y P GF+++FG
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQNGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
I Y + + V+ G R T G+ T +T++AV
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I+D +P FLY AH H P T +F+ + R Y +V+++D SV
Sbjct: 210 IKDHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V+ AL+ G+ EN+ +IF SDNG + + N GS P + K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGP------WLTKHENGGSAGPLKDGKGTWWEGGFRV 311
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
PAI W P + NP ++ ++M D PT + AG + + L +DG++Q L S
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369
Query: 439 RN 440
R+
Sbjct: 370 RD 371
>gi|302370951|ref|NP_001180574.1| N-acetylgalactosamine-6-sulfatase isoform 2 precursor [Mus
musculus]
gi|26329565|dbj|BAC28521.1| unnamed protein product [Mus musculus]
Length = 440
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 153/316 (48%), Gaps = 35/316 (11%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 31 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 90
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 91 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 146
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 147 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 205
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 206 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 254
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 255 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 309
Query: 370 TLWEGGVKVPAILWSP 385
T +EGG++ PAI W P
Sbjct: 310 TTFEGGMREPAIAWWP 325
>gi|410635995|ref|ZP_11346602.1| arylsulfatase [Glaciecola lipolytica E3]
gi|410144672|dbj|GAC13807.1| arylsulfatase [Glaciecola lipolytica E3]
Length = 499
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 171/365 (46%), Gaps = 50/365 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP----IHTGM 146
G+ D+ G + + TPN+D +A G++L + Y P+C+PSRA LMTG YP + TG
Sbjct: 56 GYEDVGVFGGDHVLTPNLDKMAEEGLMLTDFYVPSPLCSPSRAGLMTGSYPRRVDMATGS 115
Query: 147 QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
P + A+ +G+ E + E L+ +GY+T GKWHLG + E+ P +GF+ FG
Sbjct: 116 NFPVLLAADTKGLNPAEITIAEVLKSVGYATGIFGKWHLG-DQPEFLPTRQGFDEFFGL- 173
Query: 207 NGVISYYDHILSDQYSRTVELNGHD---MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
Y H ++ + R D M + EY T T+ A+ IE
Sbjct: 174 -----PYSHDIAPTHKRQAHFKFPDLPLMENENVIELNPNPEYLTRRITERAIDFIERHQ 228
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-------------DPNRRTYAAM 310
D P FLYL H H L QE +N T + + Y +
Sbjct: 229 -DAPFFLYLPHPMPHG-----PLHVSQEFLNTVDKATLARLNISDADIAANSKKAMYPLV 282
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW-GSNYPYRGVKN 369
+ +LD++VG ++ L+R + N+I+IF SDNG S RN G+ G K
Sbjct: 283 IGELDNAVGNILDTLKRLNLDSNTIVIFTSDNGP--------SGRRNSDGTERYLSGHKT 334
Query: 370 TLWEGGVKVPAILWSP-QIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGLD 426
EGG++VP ++W P QI N VS + D LPT+ AG D T R+ IDG +
Sbjct: 335 LTQEGGMRVPTVVWGPSQIPAN-TVSQALTTSMDILPTVAFLAGADVPTDRV---IDGKN 390
Query: 427 QWSSL 431
W L
Sbjct: 391 IWPIL 395
>gi|148679746|gb|EDL11693.1| galactosamine (N-acetyl)-6-sulfate sulfatase, isoform CRA_a [Mus
musculus]
Length = 462
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 109/316 (34%), Positives = 153/316 (48%), Gaps = 35/316 (11%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 53 VLLLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 112
Query: 143 HTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
G A R G+P +E LPE L++ GY+ K +GKWHLG R ++
Sbjct: 113 RNGFY---TTNAHARNAYTPQEIMGGIPNSEHLLPELLKKAGYTNKIVGKWHLGH-RPQF 168
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDL 250
PL GF+ FG N YD+ R E+ G T T L
Sbjct: 169 HPLKHGFDEWFGSPNCHFGPYDNKAKPNIPVYRDWEMVGR-FYEEFPINRKTGEANLTQL 227
Query: 251 FTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA+ I+ Q + P FLY A A HA Q++ R Y
Sbjct: 228 YTQEALDFIQTQHARQSPFFLYWAIDATHA-----------PVYASRQFLGTSLRGRYGD 276
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++S LQ G+ +N+ + F SDNGA + S GSN P+ K
Sbjct: 277 AVREIDDSVGKILSLLQNLGISKNTFVFFTSDNGAALI-----SAPNEGGSNGPFLCGKQ 331
Query: 370 TLWEGGVKVPAILWSP 385
T +EGG++ PAI W P
Sbjct: 332 TTFEGGMREPAIAWWP 347
>gi|340367647|ref|XP_003382365.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 490
Score = 157 bits (396), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 113/366 (30%), Positives = 169/366 (46%), Gaps = 46/366 (12%)
Query: 76 YAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
+ A K + ++V +G+ D+ F I +PN D LA G++LN Y C PSRA
Sbjct: 17 FVATAKLPNIVFVLVDDWGFADVGFRNP-AISSPNFDQLAKTGLVLNRHYVFKYCAPSRA 75
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
SL+TG++P H + G L P L+ YST +GKWH GFF Y
Sbjct: 76 SLLTGRWPHHVYQWN--LATDATAGTNLNMTMFPAKLKAANYSTHMVGKWHQGFFDPRYL 133
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATDL 250
P+ RGF++ GYL G + +++ + V+ N D R G Y +
Sbjct: 134 PINRGFDTSSGYLCGWVDHFNQ----KQGCAVDCWKNNAPDPRN---------GTYDSYY 180
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAA 309
+ + ++ + + PLF+YL H LEAP+E ++ + T R T A
Sbjct: 181 YRDDLTNIVNNHDANNPLFIYLPLHNVHTP-----LEAPKEWLDIYPANSTCKTRHTLQA 235
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
MV D+ G ++ L+ K M +N+I++ +DNG SNYP RG K
Sbjct: 236 MVSVADNVTGHLVELLKEKEMWDNTIMVISADNGGDQCV----------ASNYPLRGCKY 285
Query: 370 TLWEGGVKVPAILWSPQIQQNPR--VSLQMMHISDWLPTLYTAAGGD-----TSRLPLNI 422
T +EGGV+ A + + ++ R + +HISDW T AG D T + P +
Sbjct: 286 TFFEGGVRSLAFVNGGLLPESRRGQSTDGFIHISDWYTTFCKLAGVDPDDSGTGKFP--V 343
Query: 423 DGLDQW 428
DGLD W
Sbjct: 344 DGLDVW 349
>gi|340619477|ref|YP_004737930.1| sulfatase [Zobellia galactanivorans]
gi|339734274|emb|CAZ97651.1| Sulfatase, family S1-20 [Zobellia galactanivorans]
Length = 488
Score = 156 bits (395), Expect = 2e-35, Method: Compositional matrix adjust.
Identities = 119/377 (31%), Positives = 183/377 (48%), Gaps = 52/377 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-P 149
G+ DLS +G + TPNID LA +GI+ Y+ VC PSR++L+TG + HT ++G
Sbjct: 46 GYGDLSSYGQQKFSTPNIDKLAKDGILFTQHYSGSTVCAPSRSALLTGMHTGHTVVRGNK 105
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN-- 207
I +P L E +++ GY T A GKW LGF E PL +GF++ FGY
Sbjct: 106 EILPEGQYPLPDNTYTLAEAMKKAGYITGAFGKWGLGFPGSEGDPLNQGFDAFFGYNCQR 165
Query: 208 -GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
G Y H+ +++ S +E N + YA +L + ++ IE D
Sbjct: 166 LGHNYYPRHLWANKDSLVLEGNSGFKKET----------YAPELIHDKTLEFIEMNK-DN 214
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----------------------- 303
P FLY+A + HA L AP+E + +++ P
Sbjct: 215 PFFLYVASIIPHA-----ELTAPEEIMEKYKGKFAPETVYEGVDDGPEYKLGPYGSQKEP 269
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
+AAMV LD VG +++ L++ G+ +N++++F SDNG P E Y N S+ P
Sbjct: 270 HAAFAAMVHLLDAQVGEIMATLEKLGIADNTLVVFTSDNG-PHTEGGADPEYFN--SSGP 326
Query: 364 YRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
+RG K L+EGG++VP + W +I+ R L + D PT AG +T P +
Sbjct: 327 FRGTKRDLYEGGIRVPMVAAWPGKIKPGSRTEL-VSAFWDVFPTFSDIAGIET---PEGL 382
Query: 423 DGLDQWSSLLLNTPSRR 439
DG+ +LL +T ++
Sbjct: 383 DGISFLPTLLGDTDQQK 399
>gi|423220023|ref|ZP_17206519.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
CL03T12C61]
gi|392623855|gb|EIY17953.1| hypothetical protein HMPREF1061_03292 [Bacteroides caccae
CL03T12C61]
Length = 455
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 123/406 (30%), Positives = 188/406 (46%), Gaps = 60/406 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G DL +G +I TPNID LA GI+ + Y+ V PSR LMTGK+ H+ ++G
Sbjct: 39 GIGDLGCYGQTKIKTPNIDKLADEGILFTDHYSGSAVSAPSRCCLMTGKHTGHSYVRGNK 98
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--G 208
+P E + + L++ GY T +GKW LG TPL +GF+ +GYL+
Sbjct: 99 SDNGFDLALPAEEVTVAQVLKQKGYKTMCVGKWGLG---GPGTPLQKGFDYFYGYLSQRE 155
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
YY L + + V+L G Y+ L + + I + D P
Sbjct: 156 AHRYYPERLFEN-DKEVKLGGK--------------AYSHFLIMDKGLDFIR-KNADNPF 199
Query: 269 FLYLAHLAAHAG--------NAGKHLEAP--QETINQFQYITDPNRRTYAAMVKKLDDSV 318
F Y + HA +G+ E P +E + F+ +P + TYA+MV ++D +V
Sbjct: 200 FAYFSITLPHADLDYPDISEYSGEFPEKPYGREKVEAFKMQLNP-KATYASMVSEIDRNV 258
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G +I L+ +G+ EN+II F SDNG V +Y+ + SN P+RG K L+EGG++
Sbjct: 259 GQIIQLLKEEGIWENTIIFFSSDNG---VHLTGGHDYKFFNSNGPFRGHKRDLYEGGIRA 315
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
P I+ P I + RV+ + D++PT+ G D DG+ S L L
Sbjct: 316 PFIVSWPMIIKEKRVTDHLSAFWDFMPTVAEIVGADVKS-----DGI---SYLPLLKGRE 367
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
+N P + + + E+ ++R D WKLV
Sbjct: 368 KN----------------PQQHDYIYHEFYEQGGKQSIRKDGWKLV 397
>gi|150008924|ref|YP_001303667.1| arylsulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937348|gb|ABR44045.1| arylsulfatase A [Parabacteroides distasonis ATCC 8503]
Length = 459
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 121/362 (33%), Positives = 178/362 (49%), Gaps = 45/362 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G+ I TPNID +A G+ L Y V TPSRA+LMTG+ P+ G+ G
Sbjct: 42 GYGDLSCYGNPTIRTPNIDRMACEGMKLTQFYVGAGVSTPSRAALMTGRLPVRNGLYGDR 101
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ P G+ E + + L++ GY+T +GKWHLG F Y P GF+++FG
Sbjct: 102 VAVLFPNSKAGLGQDEVTIAKVLQQSGYATGCVGKWHLGAFS-PYLPTDHGFDTYFG--- 157
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA---------TDLFTKEAVQL 258
I Y + + V+ G R T G+ T +T++AV
Sbjct: 158 --IPYSNDM------SPVQNKGAHARNFPPTPLIVDGKQIESEPDQGELTRRYTEKAVSF 209
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I++ +P FLY AH H P T +F+ + R Y +V+++D SV
Sbjct: 210 IKNHS-KEPFFLYFAHTFPHI---------PLYTNARFEGTS--KRGLYGDVVEEIDWSV 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V+ AL+ G+ EN+ +IF SDNG E + N GS P + K T WEGG +V
Sbjct: 258 GEVLKALRENGLDENTFVIFTSDNGPWLTE------HENGGSAGPLKDGKGTWWEGGFRV 311
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
PAI W P + NP ++ ++M D PT + AG + + L +DG++Q L S
Sbjct: 312 PAICWMPG-KINPAINDEIMTSMDLYPTFLSMAGIEQPK-DLVLDGVNQTGLLFEEKHSA 369
Query: 439 RN 440
R+
Sbjct: 370 RD 371
>gi|196231892|ref|ZP_03130748.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196224014|gb|EDY18528.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 486
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 130/417 (31%), Positives = 202/417 (48%), Gaps = 49/417 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYP--IHTGM--Q 147
GW+DL +G++ TPNID A + + YA VC+PSR++LMTGK+ +H + +
Sbjct: 37 GWSDLGCYGADLHETPNIDRFASGAVRFTSAYAMSVCSPSRSTLMTGKHAARLHFTIWAE 96
Query: 148 GPPIWGAEPR---------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
G GA+ R +P +E+ + YL+ GY T IGKWHLG + E+ P G
Sbjct: 97 GAQEGGAKNRELREAESIWNLPNSEKTIATYLKSAGYLTALIGKWHLGDW--EHYPEAHG 154
Query: 199 FESHFGYLN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
F+ + G N G + S + +G + R + GEY TD T EA++
Sbjct: 155 FDINIGGTNWGAPQTFWWPYSGSGT-----HGPEFRYIPHLEYGHPGEYLTDRLTDEAIK 209
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRR--TYAAMVKK 313
+I D D+P F+YLAH A H +EA + I F +Y N R YAAM K+
Sbjct: 210 VI-DHAGDQPFFVYLAHHAVHTP-----IEAKADDIQHFDAKYRDGMNHRHTIYAAMNKE 263
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD++VG V+ L+ +G+ +N+++IF SDNG + + +N P R K L+E
Sbjct: 264 LDENVGRVLEHLKERGLDKNTVVIFASDNGGYIGVDKVSGKNMPVTNNAPLRSGKGALYE 323
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GG++VP I+ P + N + + ++D L T G + DG+D S LL
Sbjct: 324 GGIRVPLIIRWPGVTPNGATCDEPVILTDMLQTFLHITGQPPATDA--TDGMD--ISPLL 379
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
PS + + +L + P ++ +A+R WKL+ ++N
Sbjct: 380 KDPSAKLNR-------DALFFHYPHYYHTT-------TPVSAIRARDWKLLEFYEDN 422
Score = 44.3 bits (103), Expect = 0.19, Method: Compositional matrix adjust.
Identities = 30/73 (41%), Positives = 39/73 (53%), Gaps = 10/73 (13%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITD 70
GEY TD T EA+++I D D+P F+YLAH A H +EA + I F +Y
Sbjct: 196 GEYLTDRLTDEAIKVI-DHAGDQPFFVYLAHHAVHTP-----IEAKADDIQHFDAKYRDG 249
Query: 71 PNRR--TYAALTK 81
N R YAA+ K
Sbjct: 250 MNHRHTIYAAMNK 262
>gi|423219918|ref|ZP_17206414.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
gi|392624181|gb|EIY18274.1| hypothetical protein HMPREF1061_03187 [Bacteroides caccae
CL03T12C61]
Length = 463
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 154/301 (51%), Gaps = 36/301 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND F GS E+ TPNIDAL G++ + + A V +PSRA L+TG+Y G +
Sbjct: 43 GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECN- 101
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+PL E + E + GY T AIGKWHLG R E P RGF+ +G G
Sbjct: 102 -LSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLGS-RDEQHPNNRGFDLFYGMKAGGR 159
Query: 211 SY-YDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
Y Y+ SD+ R + LN ++ +Y TD F+++AV+ I + +P
Sbjct: 160 DYFYNEKKSDRPGDERNLLLNDRQVK---------FEKYLTDAFSEKAVEFINESS--QP 208
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+YLA+ A H ++A E + +F+ P R+ AAM LD VGTVI L+
Sbjct: 209 FMMYLAYNAVHT-----PMQATDEDMAKFE--GHP-RQKLAAMTYALDRGVGTVIRGLKD 260
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +N++I F+SDNG T N SNYP +G K +EGG +VP ++W
Sbjct: 261 SGKFDNTLIFFLSDNGGATT---------NQSSNYPLKGFKGNKFEGGHRVPYFVVWKNG 311
Query: 387 I 387
I
Sbjct: 312 I 312
>gi|373956421|ref|ZP_09616381.1| sulfatase [Mucilaginibacter paludis DSM 18603]
gi|373893021|gb|EHQ28918.1| sulfatase [Mucilaginibacter paludis DSM 18603]
Length = 510
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 122/386 (31%), Positives = 177/386 (45%), Gaps = 71/386 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ--- 147
G+NDLS +G+ I TPNIDALA G+ Y + P+C+PSRA L+TG+Y G +
Sbjct: 41 GYNDLSSYGNTLINTPNIDALAKEGVNFTQGYVSAPICSPSRAGLITGRYQQRFGYEFFA 100
Query: 148 -GPPIWGAE--------------------------------PRGVPLTERFLPEYLRELG 174
P WG + P+G+P E + L++ G
Sbjct: 101 ATPATWGVQDAAHAEASKNALRKYGAYYTIDGLALETYNKTPQGLPANEITIAGLLKKQG 160
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----YDHILSD----QYSRTV 225
Y T IGKW+LG ++ P GF+ H+G+L+G Y D ++ D ++ +
Sbjct: 161 YKTAVIGKWNLGE-TDDFIPEKYGFDYHYGFLSGGSRYGSTDDVDFVVKDLPHLYWNAQI 219
Query: 226 ELNGH---DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNA 282
G +R+N T T EY T F EA IE ++P FLY+ A H
Sbjct: 220 IKYGKGPVKLRKNAGTVLTT--EYLTTRFGNEAADFIEANK-NRPFFLYVPFNAPH---- 272
Query: 283 GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
A +E + D RR Y AMVK LDD+VG + L+ + +N++IIF SDN
Sbjct: 273 -DPFMAKKEDFAAVTTVKDSTRRVYQAMVKSLDDAVGVITKKLKELNLDKNTLIIFTSDN 331
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
G + Y + N P RG K T +EGG++VP I+ P V + + D
Sbjct: 332 GG--------AMYTHALDNKPLRGGKATHFEGGIRVPFIIKYPGGIPAGEVFNKPVSTLD 383
Query: 403 WLPTLYTAAGGDTSRLPLNI--DGLD 426
PT+ G + LP N+ DG++
Sbjct: 384 LFPTIAAVTG---ANLPGNVTYDGVN 406
>gi|153807102|ref|ZP_01959770.1| hypothetical protein BACCAC_01379 [Bacteroides caccae ATCC 43185]
gi|149130222|gb|EDM21432.1| arylsulfatase [Bacteroides caccae ATCC 43185]
Length = 463
Score = 156 bits (395), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 109/301 (36%), Positives = 154/301 (51%), Gaps = 36/301 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ND F GS E+ TPNIDAL G++ + + A V +PSRA L+TG+Y G +
Sbjct: 43 GYNDFGFMGSKEMQTPNIDALTSEGVVFTDAHVAATVSSPSRACLITGRYGHRFGYECN- 101
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+PL E + E + GY T AIGKWHLG R E P RGF+ +G G
Sbjct: 102 -LSDRTNGLPLEEETIAEVFKTNGYRTAAIGKWHLGS-RDEQHPNNRGFDLFYGMKAGGR 159
Query: 211 SY-YDHILSDQ--YSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
Y Y+ SD+ R + LN ++ +Y TD F+++AV+ I + +P
Sbjct: 160 DYFYNEKKSDRPGDERNLLLNDRQVK---------FEKYLTDAFSEKAVEFINESS--QP 208
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
+YLA+ A H ++A E + +F+ P R+ AAM LD VGTVI L+
Sbjct: 209 FMMYLAYNAVHT-----PMQATDEDMAKFE--GHP-RQKLAAMTYALDRGVGTVIRGLKD 260
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +N++I F+SDNG T N SNYP +G K +EGG +VP ++W
Sbjct: 261 SGKFDNTLIFFLSDNGGATT---------NQSSNYPLKGFKGNKFEGGHRVPYFVVWKNG 311
Query: 387 I 387
I
Sbjct: 312 I 312
>gi|338212321|ref|YP_004656376.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
gi|336306142|gb|AEI49244.1| N-acetylgalactosamine-6-sulfatase [Runella slithyformis DSM 19594]
Length = 446
Score = 156 bits (394), Expect = 3e-35, Method: Compositional matrix adjust.
Identities = 120/368 (32%), Positives = 190/368 (51%), Gaps = 52/368 (14%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPI 142
+ L+ GW ++ +GS+ TP++D LA G+ N YA VC+P+RA+LMTG+YP
Sbjct: 30 VMILVDDMGWGEVGIYGSHYCQTPHLDNLAKQGMRFTNFYANSTVCSPTRAALMTGRYPD 89
Query: 143 HTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
G+ G WG P+ V L P+ L+ GY T +GKWHLG E P
Sbjct: 90 LVGVPGVIRGNVENSWGYFSPKAVTL-----PQVLKTAGYRTAMVGKWHLGL-EPENHPN 143
Query: 196 YRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
GF G+L ++ YY H+ + + LNG + D G +ATDLFT+
Sbjct: 144 RHGFTHFHGFLEDMMDDYYTHLREGK--NWMRLNGQVI--------DPSG-HATDLFTQW 192
Query: 255 AVQLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAA 309
A++ + Q + +P FLYLA+ A H ++ P+E + + + +PN R A
Sbjct: 193 AIEYLNTQKKNPQPFFLYLAYNAPHF-----PVQPPREWLEKVKK-RNPNLSEKRAKLTA 246
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
+++ LD+S+G V+ AL++ G +N++I+F SDNG + + N ++G K
Sbjct: 247 LIEHLDESIGKVMEALRQNGQADNTLIVFSSDNGGLLSDEAD---------NGIWKGGKQ 297
Query: 370 TLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T++EGG++VP I +W +I + + + + D PT AAG ++ P +DG +
Sbjct: 298 TMYEGGIRVPTIAVWKNKIIPGTKTDFRALTM-DLFPTFCEAAG---TKPPQAVDG-HSF 352
Query: 429 SSLLLNTP 436
SLLL P
Sbjct: 353 LSLLLGQP 360
>gi|148223113|ref|NP_001088386.1| galactosamine (N-acetyl)-6-sulfate sulfatase precursor [Xenopus
laevis]
gi|54038351|gb|AAH84661.1| LOC495239 protein [Xenopus laevis]
Length = 520
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 163/581 (28%), Positives = 236/581 (40%), Gaps = 140/581 (24%)
Query: 79 LTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
+T+ L LL+ GW DL G TPN+D +A G++ N Y A P+C+PSRA+L+
Sbjct: 25 VTRPNILLLLMDDMGWGDLGVFGEPSRETPNLDKMASEGMLFPNFYTANPLCSPSRAALL 84
Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
TG+ PI G + R G+ +E PE L++ GY K IGKWHLG
Sbjct: 85 TGRLPIRNGFFTT---NSHARNAYTPQEIVGGISDSEILFPELLKKAGYINKIIGKWHLG 141
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
R Y PL GF+ FG N YD N L W+ VG Y
Sbjct: 142 H-RPSYHPLRHGFDEWFGSPNCHFGPYD-------------NKQIPNIPLYRDWNMVGRY 187
Query: 247 --------------ATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
T ++ +EA++ I Q V +P FLY + A HA
Sbjct: 188 YEEFNINHKTGESNLTQIYLQEALEFIYSQAVAQQPFFLYWSIDATHA-----------P 236
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
++ NR Y V+++D S+G +++ L G+ +++ + F SDNGA +
Sbjct: 237 VYASRPFLGTSNRGLYGDAVREIDFSIGKILNLLNLTGIAKDTFVFFTSDNGAALI---- 292
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
S GSN P+ K T +EGG++VP I W P +VS Q+ I D T + A
Sbjct: 293 -SAPMQGGSNGPFLCGKQTTFEGGMRVPGIAWWPGHVAPGQVSYQLGSIMDLFTTSLSLA 351
Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK 471
G L P R+ IDG+D +L + + R +E
Sbjct: 352 G--------------------LEIP--RDRAIDGIDLSPVILESKETERPIFYYRGNE-- 387
Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-F 530
AVRL K T N S ++LSQ I F
Sbjct: 388 -LMAVRLGQHKAHYWTWSN------------------------------SWEELSQGINF 416
Query: 531 LPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNI---ASSRP 587
P NI + + Q + NP LF+LG DP E+ +I ++
Sbjct: 417 CPGQNITGVTTHMQ---VEHSINP-------------LLFHLGRDPGEKYSISVLSTEYE 460
Query: 588 DISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
D +++ +++ HR ++VP +P L D N WSP
Sbjct: 461 DAVNRISYVVQQHRDSMVPG---EPQLNVCDLAVMN--WSP 496
>gi|410912979|ref|XP_003969966.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Takifugu
rubripes]
Length = 519
Score = 156 bits (394), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 124/376 (32%), Positives = 178/376 (47%), Gaps = 50/376 (13%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
+ L+ GW DL G TPN+DA+A G++L N Y A P+C+PSRA+L+TG+ P+
Sbjct: 31 IMLMDDMGWGDLGAFGQPSKETPNLDAMAAQGMLLLNFYTANPLCSPSRAALLTGRLPVR 90
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+ E LP+ L++ GY K +GKWHLG R +Y PL
Sbjct: 91 NGFYTTNGHARNAYTPQEIVGGISKDEILLPQMLKKRGYFNKIVGKWHLGH-RPQYLPLE 149
Query: 197 RGFE-------SHFGYLNGVIS-----YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
GF+ HFG N + Y + + +Y +++ NL
Sbjct: 150 HGFDEWFGAPNCHFGPYNNSVRPNIPVYRNSWMLGRYYEEFKIDKKTGESNL-------- 201
Query: 245 EYATDLFTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
T ++ E + I+ Q KP FLY A A HA + A ++ + + Q
Sbjct: 202 ---TQMYLLEGLDFIQSQAEAQKPFFLYWAPDATHA-----PVYASKDFLGKSQ------ 247
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R Y V +LD SVG ++S L+R G+ N+ + F SDNGA + S GSN P
Sbjct: 248 RGRYGDAVMELDYSVGQILSLLRRLGIDSNTFVFFTSDNGAALMSGPNES-----GSNGP 302
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
+ K T +EGG++ PAI W P + VS Q+ ++ D L T A G + ID
Sbjct: 303 FLCGKETTFEGGMREPAIAWWPGHIREGTVSFQLANVMD-LFTTSLALAGMSPPDDRTID 361
Query: 424 GLDQWSSLLLNTPSRR 439
G+D LL NT R
Sbjct: 362 GMDLTPVLLHNTVLNR 377
>gi|410029536|ref|ZP_11279368.1| sulfatase [Marinilabilia sp. AK2]
Length = 486
Score = 155 bits (393), Expect = 4e-35, Method: Compositional matrix adjust.
Identities = 126/370 (34%), Positives = 181/370 (48%), Gaps = 52/370 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL F G I TPN+D LA G+ +N YA VC PSR++LMTG + HT ++G
Sbjct: 45 GYGDLGFLGQKIIETPNLDRLAAEGMFFSNHYAGATVCAPSRSALMTGLHTGHTPVRGN- 103
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ +P G +P T +P +E GY+T A GKW LGF E P ++GF+ FGY
Sbjct: 104 -FEIQPEGQYPLPDTILTIPRLFKEAGYATGAFGKWGLGFVGTEGDPNHQGFDQFFGYNC 162
Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
I+ YY L + + V L G+D + ++ YA D+ +E+++ I D +
Sbjct: 163 QRIAHRYYPEYLWEN-AEKVFLEGNDWIQKIT--------YAPDIIHQESLKFIGDN-AN 212
Query: 266 KPLFLYLAHLAAHA-----------------GNAGKHLEAP------QETINQFQYITDP 302
P +++ + HA G H+ AP E IN Y ++P
Sbjct: 213 NPFLMFIPSVMPHAELAVPDGELLAYYREKIGEEKPHI-APVGWDYGDEPINIPGYQSNP 271
Query: 303 N-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
+ +YAAMV LD VG ++ L G+ E +IIIF SDNG P E ++ N SN
Sbjct: 272 YPKASYAAMVALLDRQVGEIMDKLDELGLTEQTIIIFTSDNG-PHQEGGNDPDFFN--SN 328
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
+RG K L+EGG++VP I+ P + Q S + D+LPT G N
Sbjct: 329 GIFRGYKRDLYEGGIRVPMIVKWPGVIQGGAQSGHVSAFWDFLPTFADLLGEK------N 382
Query: 422 IDGLDQWSSL 431
+D LD S L
Sbjct: 383 LDKLDGVSFL 392
>gi|332663784|ref|YP_004446572.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332598|gb|AEE49699.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 580
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 150/492 (30%), Positives = 212/492 (43%), Gaps = 119/492 (24%)
Query: 74 RTYAALTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
+T A T S +LI+ G++D +GS EI TPNID LAY G+ L Y +C P
Sbjct: 23 KTTPAPTPSKPNIILIMVDDLGYSDFGAYGS-EIQTPNIDKLAYGGLRLKEFYNNSICAP 81
Query: 131 SRASLMTGKYPIHTGMQ------GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
+RASL+TG+YP G+ G P + LT E L++ GY+T GKWH
Sbjct: 82 TRASLITGQYPHKAGLGYFNTNLGLPAYQGWLNQESLT---FGEVLQQGGYNTYLTGKWH 138
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
+G Y P RGF +G++ G +YYD + + VEL ++ R NL+ G
Sbjct: 139 VG-NDSLYWPNQRGFNKFYGFIGGASNYYDISPYPEKAPPVELVENNQRINLAP-----G 192
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ------- 297
+Y TD T A+ I ++ DKP FLYLA A H L+AP E I +++
Sbjct: 193 KYLTDEITNHALSYI-NESKDKPFFLYLAFNAPHW-----PLQAPAEDIAKYKGKYKIGW 246
Query: 298 --------------YITDPNRR---------------------------TYAAMVKKLDD 316
I DP ++ YAAMV +D
Sbjct: 247 DSLRAQRYRRQLALGIADPRQKVAERDPEVVAWESLTFDEKELWQRKMEVYAAMVDHVDQ 306
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAP----------------------TVEYRETSN 354
SVG V+ L++ +N++I+ +SDNGA + +Y+E
Sbjct: 307 SVGRVVEELKKLKKDDNTLIVLISDNGAQGGLNQLGQRRRQRNSGPIGSAGSYDYQE--- 363
Query: 355 YRNWG--SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+NW SN P+R K T EGG+ P I W P+ + ++ H+ D PT Y AG
Sbjct: 364 -QNWAYVSNTPFRNYKATSHEGGISSPFIAWFPKKIKAGTITKGTGHLIDLAPTFYDVAG 422
Query: 413 GDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKR 472
P N G+ + L S +N +G D L+ N P E+
Sbjct: 423 ---VNYPTNYRGV---QTNTLPGVSLKNLLFEGND----LVRNEPI--------FWERAG 464
Query: 473 TAAVRLDSWKLV 484
AVR WKLV
Sbjct: 465 NRAVRKGKWKLV 476
>gi|443700719|gb|ELT99563.1| hypothetical protein CAPTEDRAFT_110993 [Capitella teleta]
Length = 339
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 98/305 (32%), Positives = 152/305 (49%), Gaps = 36/305 (11%)
Query: 91 YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
YG+ D + S +I TPNID L +GI N Y+ C+PSR++ ++G+Y +GMQ
Sbjct: 42 YGYQDAGYRNS-DIHTPNIDKLVADGISFTNAYSAQQCSPSRSAFLSGRYAYTSGMQHGV 100
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
I + + L F+ +YL+EL Y+T A GKWHLG+ +E TP YRGF++ G +G
Sbjct: 101 IGDTKAHCMDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEG 160
Query: 211 SYYDH-ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
YY+H Y NG ++ R+ S G ++ D+ + + + D+ PLF
Sbjct: 161 EYYEHTTFQGMYDWH---NGTELDRSAS------GIHSQDMIEADLLNQL-DKNDGTPLF 210
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITD-----PNRRTYAAMVKKLDDSVGTVISA 324
Y A H +P + + + D P RR Y +V +D +G V++
Sbjct: 211 YYAAFHNTH---------SPMHPKPEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVTK 261
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
L+ K M +N+ I F SDNG E G N P RG K++L++GG + S
Sbjct: 262 LKEKDMFDNTYIFFSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASS 311
Query: 385 PQIQQ 389
P +++
Sbjct: 312 PLLKK 316
>gi|405970955|gb|EKC35816.1| N-acetylgalactosamine-6-sulfatase [Crassostrea gigas]
Length = 511
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 123/369 (33%), Positives = 181/369 (49%), Gaps = 38/369 (10%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL G TP +D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 24 LMLMDDMGWGDLGVFGEPNKETPYLDQMAAEGMLFPDFYSANPLCSPSRAALLTGRLPIR 83
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P E LPE L++ GY +K +GKWHLG + +Y PL
Sbjct: 84 NGFYTTNGHARNAYTPQNIVGGIPDEEILLPELLQKAGYKSKLVGKWHLG-HQAKYLPLK 142
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEY-ATDLFTK 253
GF+ FG N YD++ + R E+ G R + GE T L+TK
Sbjct: 143 HGFDEWFGAPNCHFGPYDNVHTPNIPVYRNEEMAG---RYYQDFKIEKNGESNLTQLYTK 199
Query: 254 EAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
EAV+ I KP FLY A A H + L A + ++ NR Y +V+
Sbjct: 200 EAVEFITRMHNKSKPFFLYWAVDATH-----EPLYASKP------FLGTSNRGLYGDVVR 248
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
+LD +VG ++++L+ + N+++ F SDNG T + GSN P+ K T +
Sbjct: 249 ELDSAVGKILASLRSLKIDNNTLVFFSSDNGGATYAKQHG------GSNGPFLCGKETTF 302
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSRLPLNIDGLDQWSS 430
EGG++ P I W P + +VS Q+ + D T + T R+ IDG+ +S
Sbjct: 303 EGGMREPTIAWWPGHIKKGQVSHQVGSLMDLYTTFVDFAEVAMPTDRV---IDGISLRAS 359
Query: 431 LLLNTPSRR 439
LL NT +R
Sbjct: 360 LLNNTNVQR 368
>gi|410617069|ref|ZP_11328045.1| arylsulfatase B [Glaciecola polaris LMG 21857]
gi|410163338|dbj|GAC32183.1| arylsulfatase B [Glaciecola polaris LMG 21857]
Length = 482
Score = 155 bits (393), Expect = 5e-35, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 172/350 (49%), Gaps = 43/350 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS+ + TPN+D LA G + Y + VC PSRA ++TGKY G +
Sbjct: 49 GYADFGFQGSDVMRTPNLDKLASQGTVFTQAYVSAAVCGPSRAGILTGKYQQRFGYEENN 108
Query: 151 IWG--------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
+ G + G+PL ++ + +YLRE GY T IGKWH G R + P RGF+
Sbjct: 109 VPGYMSQSGLTGDDMGLPLDQKTMADYLRERGYKTALIGKWHQGNADR-FHPTKRGFDEF 167
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
+G+ G SY+ + S + + R + ++ Y T+ E V+ I+ +
Sbjct: 168 YGFRGGARSYFGFGAQNPVSYPED----KLERGFAHFQES-KRYLTEALATETVEFIK-R 221
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
P F++L+ A H +EA + QF + R+ AAM +D +G V+
Sbjct: 222 NQKHPFFVFLSFNAVHT-----PMEAKPADLAQFSNLKG-KRQQLAAMTLSMDREIGKVL 275
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
+ L G+ +N++++F +DNG P+ N +N P G K EGG++VP I+
Sbjct: 276 NTLDELGLSDNTLVVFTNDNGGPSDT--------NASNNGPLSGTKANHLEGGIRVPFIM 327
Query: 383 WSP--QIQQ----NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P ++ Q NP +L D LPT ++AAGG+ +L IDG+D
Sbjct: 328 RWPAGKVSQGEYPNPISTL------DLLPTFFSAAGGNEQQLAA-IDGVD 370
>gi|334145289|ref|YP_004538499.1| sulfatase [Novosphingobium sp. PP1Y]
gi|333937173|emb|CCA90532.1| sulfatase [Novosphingobium sp. PP1Y]
Length = 461
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 114/359 (31%), Positives = 174/359 (48%), Gaps = 52/359 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
G+ DL +G + TP ID +A G++ YA VC+ SR +L+TG+Y + G++
Sbjct: 32 GYADLGCYGRTDYATPAIDRIAREGMLFRQAYANSAVCSASRTALITGRYQNRLPVGLEE 91
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G RG+P + +P LR GY T IGKWHLG + Y PL G++ +G G
Sbjct: 92 P--LGRRVRGLPPSHPTMPSLLRSQGYETTLIGKWHLGQLPK-YGPLQSGYDHFWGLRGG 148
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-----YATDLFTKEAVQLIEDQP 263
I Y+ H + NG + W+ E Y TDL + A+ I+D
Sbjct: 149 GIDYFSH----------DFNG------VPDLWEDDREIEQPGYMTDLLAERAMTAIDDYA 192
Query: 264 V-DKPLFLYLAHLAAHAGNAG-------KHLEAPQETINQFQYITDP-NRRTYAAMVKKL 314
+P F+ L A H G + L A + + F TD +R+TYA MV ++
Sbjct: 193 ARQQPFFMSLHFTAPHWPWEGPDDQTESQRLAASDKPFSMFD--TDGGSRKTYAEMVTRM 250
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D +G ++ L R G+ E++II+F SDNG + +PY G+K L EG
Sbjct: 251 DMEIGRILERLDRLGLAEDTIIVFTSDNGG-----------ERYSKTWPYNGLKTELLEG 299
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS-RLPLNIDGLDQWSSLL 432
G++VPA++ P+ + S Q++ DW+PT + R P+ DG+D ++LL
Sbjct: 300 GLRVPAVMRWPRRIAAGQESEQVIMSMDWMPTFLSLIQASPDPRYPM--DGMDLSATLL 356
>gi|149175233|ref|ZP_01853855.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
gi|148845842|gb|EDL60183.1| Twin-arginine translocation pathway signal [Planctomyces maris DSM
8797]
Length = 459
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 177/346 (51%), Gaps = 29/346 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL+ +G+ ++ TP+ID LA + + + + A +CTP+RA+++TG+Y G Q
Sbjct: 46 GYGDLACYGNKQVKTPHIDRLAASALKFTDFHSAGAMCTPTRAAMLTGQYQQRFGRQFES 105
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+ G+P + E L++ GY+T GKWHLG+ + + P +GF+ G +
Sbjct: 106 ALSGKSNHDIGLPHQAVTMAELLKQQGYATACFGKWHLGY-QPPWLPTNQGFDLFRGLTS 164
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
G ++ H+ N M + Y DL +K +V +E +P
Sbjct: 165 GDGDHHTHVDRSGNEDWWHNNEISMEKG----------YTADLLSKYSVAFMEANRT-RP 213
Query: 268 LFLYLAHLAAHAGNAGK----HLEAPQE-TINQFQYITDPNRRT--YAAMVKKLDDSVGT 320
FLY+ HLA H G H +A Q+ ++ I DP + AM++ LD SVG
Sbjct: 214 FFLYVPHLAIHFPWQGPQDPPHRKAGQDYHAGKWGIIPDPGNVSPHTTAMIESLDQSVGK 273
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
++SAL+R + +N+++IF SDNG + Y + N++N SN P RG K TL+EGG +VP
Sbjct: 274 ILSALKRLDLEQNTLVIFTSDNGG-YLTYGK--NFQNISSNGPLRGQKATLYEGGHRVPC 330
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
++ P + V+ Q H D LPTL AAG S DGLD
Sbjct: 331 LISWPGV-ITAGVTDQTAHSVDLLPTLAQAAG--ISATNFQTDGLD 373
>gi|320105193|ref|YP_004180784.1| sulfatase [Isosphaera pallida ATCC 43644]
gi|319752475|gb|ADV64235.1| sulfatase [Isosphaera pallida ATCC 43644]
Length = 481
Score = 155 bits (392), Expect = 6e-35, Method: Compositional matrix adjust.
Identities = 128/409 (31%), Positives = 201/409 (49%), Gaps = 70/409 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G+ +I TP ID+LA +G L++ ++ PVCTP+RA+L+TG++P G++
Sbjct: 76 GYADLGCYGAPDIATPRIDSLARDGARLSHFHSPGPVCTPTRAALLTGRWPQRVGLE--- 132
Query: 151 IWG-----AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
W EP G+P+ E L L+E+GY T +GKWHLG+ R E+ P GF+ FG
Sbjct: 133 -WALSASDTEP-GLPVEEPILSRPLKEVGYRTVMVGKWHLGY-RPEFGPNAHGFDEFFGL 189
Query: 206 LNGVISYYDHILSDQYSRTVELNG-HDMRRNLSTAWDTVGEYATDLFTKEAVQLIED--- 261
L+G + +Y H E+NG D N V Y+TDL + AV I+
Sbjct: 190 LSGNVDHYSH---------REINGKEDWYENTKPV--RVEGYSTDLLSDRAVAAIQKTAA 238
Query: 262 QPVDK--PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVG 319
QP D+ PL+LY+A+ A H + PQ+ ++ + R Y MV+ +D VG
Sbjct: 239 QPPDQRQPLWLYVAYNAVHW--PFQPPGRPQDIRDRSTWFNG-TRADYVKMVESIDAGVG 295
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
++ AL+ GM +++++IF +DNG + N P+ K T+WEGG +VP
Sbjct: 296 RILDALEAGGMADHTLVIFTNDNGGERLS-----------DNGPFFHHKGTVWEGGHRVP 344
Query: 380 AIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
++ W +I + + D T+ AA ++P N P R
Sbjct: 345 CLIRWPGRIAAGTEFGQPTIGM-DLTATILAAA-----KIPPN--------------PER 384
Query: 439 RNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVL 485
+DG++ L P ++ ID +K+ AA+R WK V+
Sbjct: 385 ---PLDGVNLVPILRGEVPCPDRALFWRIDRPNRKQKAALR-GRWKYVV 429
>gi|256841083|ref|ZP_05546590.1| arylsulphatase A [Parabacteroides sp. D13]
gi|256736926|gb|EEU50253.1| arylsulphatase A [Parabacteroides sp. D13]
Length = 452
Score = 155 bits (392), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHCRFGNNYMRENLKEI-DPQG-HATEVF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W QI+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|437818613|ref|ZP_20842917.1| arylsulfatase, partial [Salmonella enterica subsp. enterica serovar
Enteritidis str. SARB17]
gi|435307850|gb|ELO82888.1| arylsulfatase, partial [Salmonella enterica subsp. enterica serovar
Enteritidis str. SARB17]
Length = 555
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 147/487 (30%), Positives = 222/487 (45%), Gaps = 82/487 (16%)
Query: 53 KHLEAPQETI--NQFQYITDPN-----RRTYAALTKST------TLTLLIVYGWNDLSFH 99
+++ AP I N I+ P R+ A L K T + +L GW D+ F+
Sbjct: 50 QYIAAPTTKIADNLMPVISHPGQDKETRQKLADLEKKTGKKPNIVVFILDDVGWMDMGFN 109
Query: 100 GSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEP 156
G PTP+IDA+A G+IL + Y+QP +P+RA++MTG+Y +H G+ PP++G P
Sbjct: 110 GGGVAVGNPTPDIDAIASQGLILTSAYSQPSSSPTRATIMTGQYSVHHGILMPPMYGM-P 168
Query: 157 RGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY-- 213
G+ LT LP+ L + GY T+AIGKWH+G P GF+ G+ N V Y
Sbjct: 169 GGLEGLTT--LPQLLHDQGYVTQAIGKWHMG-ENTGSQPQNVGFDDFRGF-NSVSDMYTE 224
Query: 214 --------DHILSD---QYSRTVELNGHDMRRNLSTAWDTVGE----YATDL---FTKEA 255
+ LS QY + + + +D+ + + + Y DL + K
Sbjct: 225 WRDPNVNPEVALSPSRLQYIKNLPFDKNDVHAVRGGKQEAIADITPKYMEDLDQRWMKYG 284
Query: 256 VQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
V I DKP FLY H N N + P R +Y+ + ++
Sbjct: 285 VDFINKMAKNDKPFFLYYGTRGCHFDN----------YPNAYYAGRSPARTSYSDCIVEM 334
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
+D + AL+ G L+N++I+F SDNG P E P+RG K + WEG
Sbjct: 335 NDVFANLYKALETSGQLDNTLIVFTSDNG-PEAEVPPH-------GRTPFRGAKGSTWEG 386
Query: 375 GVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLL 433
GV+VP + W IQ PR S ++ ++D PT + AG + L + L+
Sbjct: 387 GVRVPTFVYWKGMIQ--PRKSDGLVDLADLFPTSLSLAGYPGAEL----------AKLVP 434
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQE--NG 491
T + IDG+DQ SS + T + + + +AVR+D +K L Q+
Sbjct: 435 KT-----TFIDGIDQ-SSFFIGTNGQSSRKAEHYFLNGELSAVRIDEFKYHLLIQQPYAF 488
Query: 492 TMDGYYG 498
T GY G
Sbjct: 489 TQTGYQG 495
>gi|282900986|ref|ZP_06308919.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
raciborskii CS-505]
gi|281194077|gb|EFA69041.1| Twin-arginine translocation pathway signal [Cylindrospermopsis
raciborskii CS-505]
Length = 434
Score = 155 bits (391), Expect = 7e-35, Method: Compositional matrix adjust.
Identities = 112/336 (33%), Positives = 167/336 (49%), Gaps = 35/336 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQG-P 149
GW DLS G TP +D LA +GI N Y+ PVCTP+R TG+YP +
Sbjct: 23 GWGDLSIDGQKNYQTPYLDQLARDGIRFNQAYSDSPVCTPTRIGFFTGRYPGRLAIGNYE 82
Query: 150 PIWG----AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
P+ + G+P + LR+ GY T +GKWH G+ R Y+PL GF FG
Sbjct: 83 PLLSFQQIGDSVGLPPEHPTIASLLRDNGYETVLVGKWHCGYLPR-YSPLKSGFNKFFGN 141
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
+G I Y+ H V+ NG T + +G Y TD+FT+ AV I+ V+
Sbjct: 142 FSGAIDYFRH---------VDTNGKPDLWEADTPIEKIG-YVTDIFTERAVDFIKGPHVN 191
Query: 266 KPLFLYLAHLAAHAGNAG-KHLEAPQETI---NQFQYITDPNRRTYAAMVKKLDDSVGTV 321
P +L L + A H G + +E + I N +I + +Y A+V+ LD+ VG V
Sbjct: 192 -PFYLSLHYTAPHWPWQGPEDVELSNDLIGRDNLENWINTGTKESYKAVVQSLDNGVGKV 250
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ G+ +I+IF SDNG + N+G ++G K L+EGG++VP +
Sbjct: 251 LQALEDVGIANRTIVIFASDNGG--------ERFSNFGL---FQGKKGNLYEGGIRVPTL 299
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
+ WS I N ++S Q++ D T+ A +T+
Sbjct: 300 IRWSGVIAPN-QISDQVIITHDITATILAATKTNTN 334
>gi|167519809|ref|XP_001744244.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163777330|gb|EDQ90947.1| predicted protein [Monosiga brevicollis MX1]
Length = 328
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 113/337 (33%), Positives = 160/337 (47%), Gaps = 35/337 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALA-YNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL + + I TPNID LA G+IL+N Y C+PSRAS +TG+ PIH P
Sbjct: 11 GYYDLGYRNPDSI-TPNIDQLATQEGVILDNAYGYRYCSPSRASFLTGRVPIHVHQGNPG 69
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ A + T +P LR GY T +GKWH G E P+ RGF++ FGYL+G
Sbjct: 70 LAAAGCTNLNYT--MIPAQLRRAGYRTAMVGKWHQGASLPECLPVNRGFDTSFGYLSG-- 125
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
DH+ DQ + + N D + A G Y++ + V +I+ ++PL L
Sbjct: 126 -EEDHM--DQTTNGGQCNVTDFWLDSGPAIRRNGTYSSFQYNDAIVDIIQQHAPEQPLML 182
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQY-----------ITDPNRRTYAAMVKKLDDSVG 319
Y A H E P+ +F + R AMV LDD VG
Sbjct: 183 YAALQNVHG-----PYEVPERYRAKFPADKNCTETGDNPTSQCTRDFMLAMVNVLDDIVG 237
Query: 320 TVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
+ AL+ GM N++++F SDNG P + N N+P RG K +EGGV+
Sbjct: 238 NITRALKANGMWNNTVLVFSSDNGGAVPGAPQHGSMN------NHPLRGGKVAYFEGGVR 291
Query: 378 VPAILWSPQIQQNPRVS--LQMMHISDWLPTLYTAAG 412
A + SP + ++ R + ++HISDW T AG
Sbjct: 292 TAAFVASPLLPKSIRGTELRGLIHISDWYATFCHLAG 328
>gi|393719701|ref|ZP_10339628.1| sulfatase [Sphingomonas echinoides ATCC 14820]
Length = 563
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 128/413 (30%), Positives = 186/413 (45%), Gaps = 83/413 (20%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNE----IPTPNIDALAYNGIILNNMYA-QPVCTP 130
A T+ + L++ G+ND+SF+G + TPNIDAL ++G+ + YA C P
Sbjct: 56 AGTRPPNIVLIVADDLGYNDISFNGGGVAGGLVKTPNIDALGHDGVTFADGYAGNATCAP 115
Query: 131 SRASLMTGKYPIHTGMQ------GPPIW-------------------------------- 152
SRA+LMTG+YP G + PP W
Sbjct: 116 SRAALMTGRYPTRFGYEFTPADTHPPAWFPSSLIRTDAMFSRTIANFPTTWEHKTIFDED 175
Query: 153 ------GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
P+GVP +E + E LR GY T +GKWHLG + P +GF+ G+L
Sbjct: 176 AARQPASTAPKGVPSSEVTIAELLRGKGYHTMHLGKWHLGE-AKGMRPEDQGFDESLGFL 234
Query: 207 NGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVG-------EYATDLFTKEAVQ 257
G Y D + + D + NL + G Y TD T +A
Sbjct: 235 IGGQMYLPEASPDVENSKQPFDPIDKFLWANLPFSVQYNGGERFHPDRYMTDYLTVQAKA 294
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I ++P FLYLA+ A H L+A + + ITD R YAAM++ LD
Sbjct: 295 AITAN-RNRPFFLYLAYNAPHT-----PLQASKADYDALPQITDHRLRVYAAMIRALDRG 348
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGV 376
VG V++ L+++G+ +N+++IF SDNG +NY N PYRG K T +EGGV
Sbjct: 349 VGQVMATLKQQGLDQNTLVIFTSDNGG--------ANYIGLPDINRPYRGWKATFFEGGV 400
Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
KVP + W +I+ R++ + H TA + LP + +DG+D
Sbjct: 401 KVPFFMRWPARIKPGTRIAGPVSHFD----IFATAGDAGHASLPRDRALDGVD 449
>gi|340369113|ref|XP_003383093.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 374
Score = 155 bits (391), Expect = 8e-35, Method: Compositional matrix adjust.
Identities = 105/316 (33%), Positives = 152/316 (48%), Gaps = 36/316 (11%)
Query: 84 TLTLLIVYGWNDLSFH---GSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+ L+ +GW ++ +H + E+ TPNID L G+ LN Y C+PSR+SL++G+
Sbjct: 28 VMMLVDDWGWANVGYHRNPPTCEVVTPNIDKLVKEGLELNQHYVYKFCSPSRSSLISGRL 87
Query: 141 PIHTGMQG---------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
PIH Q P+ G +P + E +R GY+T +GKW G +
Sbjct: 88 PIHVNDQNLLPTNYNPDDPVSGFS--AIPRNMTGIAEKMRGAGYATHQVGKWDAGMATSD 145
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG----EYA 247
+TP RGF++ FGY + YY S +RT + D+ A G Y
Sbjct: 146 HTPKGRGFQTSFGYFHHANDYYKETAS-ACNRTQIV---DLWETDKPAHGINGTGPDNYE 201
Query: 248 TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
LF + + ++ + PLFLY A H L+ P + F +I D +R+ Y
Sbjct: 202 EGLFKERVLDVVSNHDPSTPLFLYYAPHIVHT-----PLQVPDRYLKLFSFINDHDRQYY 256
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS-NYPYRG 366
AMV LDD VG + AL+ KGM +N + + SDNG P Y G+ NYP RG
Sbjct: 257 HAMVNYLDDVVGEITDALKNKGMWDNLLFVTSSDNGGPV--------YPGGGANNYPLRG 308
Query: 367 VKNTLWEGGVKVPAIL 382
K T W+GGV+V A +
Sbjct: 309 GKVTDWQGGVRVNAFV 324
>gi|414344125|ref|YP_006985646.1| sulfatase [Gluconobacter oxydans H24]
gi|411029460|gb|AFW02715.1| sulfatase [Gluconobacter oxydans H24]
Length = 478
Score = 155 bits (391), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 119/358 (33%), Positives = 170/358 (47%), Gaps = 36/358 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
G+ D+S G+ TP+ID +A+ GI L YA VC+ +R +L+TG+Y I G++
Sbjct: 56 GYADVSCFGTPGFTTPSIDRIAHEGIKLTRAYANSAVCSATRTALITGRYQDRIACGLEE 115
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + G+P LPE LR GY T +GKWHLG E+ PL G++ +G G
Sbjct: 116 PIASSSGKIGLPPGLPTLPEQLRRAGYKTALVGKWHLGN-PPEFGPLKSGYDHFYGISGG 174
Query: 209 VISYYDHILS---DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
Y+ H DQ+ R L HD Y T L EAV+L+E +
Sbjct: 175 AADYFTHQGKPGEDQFYRDDHLI-HDH------------GYLTGLLGDEAVRLVETFASE 221
Query: 266 -KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN-RRTYAAMVKKLDDSVGTVIS 323
P FL L A H G EA + + + D R+TY M+K +DD +G V++
Sbjct: 222 TSPFFLSLHFNAPHWPWEGPEDEAESKRLKGKIFDYDGGTRKTYGRMIKAMDDQIGRVLA 281
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L + EN+I++F SDNG + +P+ G+K L EGG++VPA+L
Sbjct: 282 TLDSYHLTENTIVVFTSDNGG-----------ERFSDTWPFTGMKGELLEGGLRVPAVLR 330
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
P S Q + DWLP+ + AA G T DG+ +L PS+R S
Sbjct: 331 WPARVAAGSESDQTLITMDWLPS-FLAATGATPDATFPSDGISLLPFVL--DPSKRQS 385
>gi|393784381|ref|ZP_10372546.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
gi|392666157|gb|EIY59674.1| hypothetical protein HMPREF1071_03414 [Bacteroides salyersiae
CL02T12C01]
Length = 456
Score = 154 bits (390), Expect = 9e-35, Method: Compositional matrix adjust.
Identities = 120/353 (33%), Positives = 181/353 (51%), Gaps = 46/353 (13%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L ++ G DLS + ++ TPNID L + G+ ++N YA V +PSRA L+TG +P
Sbjct: 40 LIMVDDLGLGDLSCQYAKDVQTPNIDRLFHQGVRMDNFYANSSVSSPSRAGLLTGCFPDM 99
Query: 144 TGMQGPPIWGAEPRG----VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
G+ P + +P+G + LPE +++ GY T IGKWHLG P RGF
Sbjct: 100 VGV--PGVIRTDPKGSWGYLSPDATLLPEMMKKAGYQTAIIGKWHLGL-ESPNLPNERGF 156
Query: 200 ESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI 259
+ G+L ++ D Y+ + + MR N + D G +ATDLFT A I
Sbjct: 157 DFFHGFLGDMM--------DDYNTHLRRGFNYMRLN-TEEIDPQG-HATDLFTDWASDYI 206
Query: 260 -EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLD 315
+ + DKP FLYLA+ A H+ L+ P E + + R A+++ +D
Sbjct: 207 FKARKQDKPFFLYLAYNAPHS-----PLQPPVEWEKKVRKRHPNISETRGKLVALIEHMD 261
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
D VG VI++L++ G L+N++IIF SDNG + ++ +N P RG K +++GG
Sbjct: 262 DGVGRVINSLEKSGQLDNTLIIFCSDNGG---------DRKSEANNGPVRGDKGDMYDGG 312
Query: 376 VKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
+KV L W ++ +L MM SD PTL D +LP+N IDG+
Sbjct: 313 IKVACSLYWKGHLEHRRVNNLVMM--SDIFPTLC-----DLVQLPVNHRIDGI 358
>gi|340369799|ref|XP_003383435.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Amphimedon
queenslandica]
Length = 523
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 115/361 (31%), Positives = 173/361 (47%), Gaps = 56/361 (15%)
Query: 76 YAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
++ T + L+ GW DL +G TPN+D +A G++L + Y A P+C+PSRA+
Sbjct: 28 FSPTTPNMIFMLMDDMGWGDLGVYGHPVKETPNLDKMALEGMLLPDFYSANPLCSPSRAA 87
Query: 135 LMTGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWH 184
++TG+ PI G A R G+P +E PE L++ GY+T IGKWH
Sbjct: 88 MLTGRLPIRNGFY---TTNAHARNAYTPQDIVGGIPDSEILYPELLQKNGYATMIIGKWH 144
Query: 185 LGFFRREYTPLYRGFE-------SHFGYLNG-----VISYYDHILSDQYSRTVELNGHDM 232
LG + Y PL GF+ HFG +G + Y + ++ +Y + +N
Sbjct: 145 LGQ-QTHYHPLKHGFDEFFGSTNCHFGPFDGKEQPNMPVYRNATMAGRYYQDFPINHKTG 203
Query: 233 RRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQE 291
NL+ +T+EA++ I +K P FLY A H L A ++
Sbjct: 204 ESNLTVE-----------YTQEAIKFINKNAANKKPFFLYWTPDATHT-----PLFASKD 247
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
+ Q R Y VK+LD SVG ++ L+ G+ +++++IF SDNG T Y +
Sbjct: 248 FLGTSQ------RGLYGDAVKELDYSVGQILDTLKTLGIDKDTLVIFSSDNGGAT--YAK 299
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
S GSN P+ K T +EGG++ P I W P Q +VS Q+ + DW T A
Sbjct: 300 ESG----GSNSPFLCGKETTFEGGMREPTIAWWPGTIQPGQVSHQLGSLMDWYSTALDLA 355
Query: 412 G 412
G
Sbjct: 356 G 356
>gi|284039849|ref|YP_003389779.1| sulfatase [Spirosoma linguale DSM 74]
gi|283819142|gb|ADB40980.1| sulfatase [Spirosoma linguale DSM 74]
Length = 533
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 122/398 (30%), Positives = 181/398 (45%), Gaps = 78/398 (19%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQ---G 148
G++D+ +G E+ TPN+D LA GI L + Y C P+RASL+TG+YP GM
Sbjct: 52 GFSDIGCYGG-EVNTPNLDKLAAGGIKLRSFYNNARCCPTRASLLTGQYPHTVGMGLMVT 110
Query: 149 PPIWGAEPRGVP--LTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
P +P L R+ + E L+E GYST +GKWH+G R E+ PL RGFE +FG
Sbjct: 111 MPNAAIQPGSYQGFLDARYPTIAERLKETGYSTYMLGKWHVGE-RPEHWPLKRGFEHYFG 169
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP- 263
++G SYY+ I +++ R + L+ + G Y TD FT AVQ + Q
Sbjct: 170 LISGASSYYEIIPAEKGKRFIVLDDKEFTPPAD------GFYMTDAFTDYAVQYLNQQKQ 223
Query: 264 --VDKPLFLYLAHLAAH----------------------AGNAGKHLEAPQETINQFQYI 299
DKP F+YLA+ A H ++ + Q + +Y
Sbjct: 224 EQADKPFFMYLAYTAPHFPLHAYESDIAKYEKLYAQGWDVTRTKRYQKMQQLGLIDKRYQ 283
Query: 300 TDP-------------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
P YAAM+ ++D ++G +I L+ G +N++I+FMS
Sbjct: 284 LTPRPANVPAWNSATDKAQWIRKMAVYAAMIDRMDQNIGRLIKTLKANGQYDNTLIVFMS 343
Query: 341 DNGA------------PTVEYRETSNYRNWGSNY------PYRGVKNTLWEGGVKVPAIL 382
DNG+ PT + E +Y + + + P+R K L EGG+ P I+
Sbjct: 344 DNGSSNENMESRKLNDPTKKIGERGSYVTYDTPWANVSVTPFRKYKRFLHEGGMITPCIM 403
Query: 383 -WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
W I+ + H+ D LPT AG + LP
Sbjct: 404 QWPRNIRPAAGYVDGIGHVMDLLPTSLELAGLSANDLP 441
>gi|440749357|ref|ZP_20928605.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
gi|436482362|gb|ELP38485.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
Length = 484
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 126/393 (32%), Positives = 183/393 (46%), Gaps = 53/393 (13%)
Query: 72 NRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTP 130
+R+ ++ + L G+ DL G +I TPN+D LA G++ N Y+ VC P
Sbjct: 25 DRQINKSVKPNIIFILADDLGYGDLGVTGQTKIETPNLDKLAREGMLFTNHYSGATVCAP 84
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGF 187
SRA+LMTG + HT ++G +P G +P + LP+ ++ GY+T A GKW LGF
Sbjct: 85 SRAALMTGLHTGHTPIRGNK--EIQPEGQHPMPDSVLTLPKMMKRAGYATGAFGKWGLGF 142
Query: 188 FRREYTPLYRGFESHFGYLNGVISYYDHILSDQY----SRTVELNGHDMRRNLSTAWDTV 243
P+ +GF+ FGY Y H +Y S V L G+D W
Sbjct: 143 VGTSGDPINQGFDQFFGY---NCQRYAHRYFPEYLWLNSEKVFLPGND--------WTNK 191
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAG--------KHLE-------- 287
+A D+ K A++ IE P F+Y+ + HA A K+ E
Sbjct: 192 ATFAPDVIQKAALEFIEANS-GNPFFMYVPMVIPHAELAADEESPIFKKYRERFGDEREF 250
Query: 288 --------APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
P I +Q + P R TYA MV+KLD VG +I+ L+ G+ EN+II+F
Sbjct: 251 VAPAGWDYGPGINIPGYQSVKHP-RATYATMVEKLDAHVGEIIAKLEVLGIRENTIIVFA 309
Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
SDNG P E ++ + SN YRG K L+EGG+K I P+ + VS +
Sbjct: 310 SDNG-PHQEGGNDPDF--FDSNGIYRGYKRDLYEGGIKTSLIANWPRKIKAGSVSDHVSA 366
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
D +PT AG + +DG+ +LL
Sbjct: 367 FWDLMPTFAELAG---VSIDAELDGISMVPTLL 396
>gi|150008923|ref|YP_001303666.1| secreted sulfatase [Parabacteroides distasonis ATCC 8503]
gi|149937347|gb|ABR44044.1| putative secreted sulfatase precursor [Parabacteroides distasonis
ATCC 8503]
Length = 452
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W QI+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|332529144|ref|ZP_08405108.1| sulfatase [Hylemonella gracilis ATCC 19624]
gi|332041367|gb|EGI77729.1| sulfatase [Hylemonella gracilis ATCC 19624]
Length = 454
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 109/329 (33%), Positives = 159/329 (48%), Gaps = 36/329 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +G+++ TPN+D LA G+ YA VC+ +R +L+TG+Y + G++
Sbjct: 23 GWADLGVYGASDFATPNLDRLAAQGVRFTQAYANSAVCSATRIALITGRYQYRLPAGLEE 82
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P G+P LP LRE GY T IGKWHLG Y PL G++ FG + G
Sbjct: 83 P--IARSDIGLPPEHPTLPSLLREAGYDTALIGKWHLGK-PPTYGPLKSGYDRFFGNIGG 139
Query: 209 VISYYDHI--LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA---VQLIEDQP 263
+ Y+ H + Q R + + R Y T++ EA V+ ED+
Sbjct: 140 ALDYFTHKPGVGAQVPRDLWEGDVPVERT---------GYYTNILGDEASAYVRAREDE- 189
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
KP FL L A H G EA I + N Y +V+ LD +VG V++
Sbjct: 190 -KKPFFLSLHFTAPHWPWEGPGDEAVSREITDLFHYDGGNLAKYGELVEALDAAVGQVLA 248
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
AL G +N++++F SDNG + +P+ G K L EGG++VP +L
Sbjct: 249 ALDETGQADNTLVVFTSDNGG-----------ERFSKTWPFTGQKTELLEGGLRVPTLLR 297
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
W +++ P+V Q+ +DWLPTL AA
Sbjct: 298 WPARVR--PQVQAQVTITADWLPTLLAAA 324
>gi|149199736|ref|ZP_01876767.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149137141|gb|EDM25563.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 585
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/389 (30%), Positives = 184/389 (47%), Gaps = 55/389 (14%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
AA K + + + GW DLS +G+ +I TPNID+LA++G + N Y QPVC+P+RA
Sbjct: 16 AAENKKPNIIIFLTDDQGWGDLSINGNKDISTPNIDSLAHDGALFENFYVQPVCSPTRAE 75
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
L+TG+Y +G++ G L E+ + + ++ GY+T A GKWH G + Y P
Sbjct: 76 LLTGRYAFRSGVRSTSEGGER---FNLDEQTIADVFKKAGYATGAFGKWHSG-MQYPYHP 131
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
RGF+ +G+ +G Y YS +E NG ++ + D FT +
Sbjct: 132 NGRGFDEFYGFCSGHWGNY-------YSPMLEHNGKIVKGQ---------GFCVDDFTSK 175
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY------- 307
A+ IE+ +KP +YL + H+ ++ P ++Q T + TY
Sbjct: 176 AMDFIEENK-EKPFLVYLPYNTPHS-----PMQVPDRFWEKYQDKTLTMKNTYNKKWGHG 229
Query: 308 -------AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
AM + +D +VG V++ L + +N+I+++ SDNG N W
Sbjct: 230 EDFVKAALAMCENIDWNVGRVLNKLDELQIADNTIVLYFSDNGP---------NSPRW-- 278
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
N +G K ++ EGGV+ P L PQ +V ++ D LPTL + AG +
Sbjct: 279 NGEMKGKKGSVDEGGVRSPLHLRWPQKVAQGQVIKEVTGAIDLLPTLSSMAGIEFQS-KK 337
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
+DGLD +SSLL N L W
Sbjct: 338 KLDGLD-FSSLLTGKEKNWNDQRTMLAYW 365
>gi|323456753|gb|EGB12619.1| hypothetical protein AURANDRAFT_70521 [Aureococcus anophagefferens]
Length = 913
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 103/331 (31%), Positives = 160/331 (48%), Gaps = 39/331 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT---GMQG 148
GW+D+ +H + + TP + ALA +G++L+ YA C+PSR+SL++G+YP+H M G
Sbjct: 422 GWHDVPWHNPS-LKTPTLAALAADGVVLDRFYAYRFCSPSRSSLLSGRYPMHVNQYNMAG 480
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ G GV + + + L+ GY+T +GKWH G + P RGF++ GYLNG
Sbjct: 481 DALGG----GVHVNMTTIAKKLKGAGYATHQLGKWHAGQSSADLVPAARGFDTSLGYLNG 536
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
DH + + V N D+ A+ G Y ++ A+ +I D PL
Sbjct: 537 A---EDHWTQARPACGVG-NFVDLYATDGPAFGKNGTYGAQIYHDAALDIIADHDASVPL 592
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYI---TDPN----RRTYAAMVKKLDDSVGTV 321
F+Y A H AP + + + + DP R TY AM DD V
Sbjct: 593 FVYFAFQINH---------APMQVPDAYARVYPCDDPKTCGVRSTYQAMTAMADDVVRNA 643
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+AL+ KGM ++++I+ SDNG P+ ++SN N+P RG K + EGGV+ +
Sbjct: 644 TAALKAKGMWDDTLIVLTSDNGGPSGTDADSSN------NFPLRGGKYSDLEGGVRAAGL 697
Query: 382 LWSPQIQQNPR-----VSLQMMHISDWLPTL 407
+ + R + +HI DW T
Sbjct: 698 VAGGFLPDAARGRTLDGAGAYVHICDWYATF 728
>gi|325110588|ref|YP_004271656.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324970856|gb|ADY61634.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 458
Score = 154 bits (390), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 111/355 (31%), Positives = 169/355 (47%), Gaps = 34/355 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM-QGP 149
G+ DLS +G TP +D +A G+ ++ ++ VC+P+RA+L+TG+Y G+ +
Sbjct: 48 GYGDLSCYGCERYQTPFLDQMAAEGMKWSDFHSNGNVCSPTRAALLTGQYQQRWGVDRVI 107
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
G RG+PL E + GY+T GKWH G+ R + P+ +GF+ GY++G
Sbjct: 108 TAAGHRDRGLPLEAVTFAETFSQHGYATAIYGKWHQGYQAR-FNPVRQGFDHFVGYVSGN 166
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+ + HI DQ + + ++ Y T L T+ AV+ IED ++P
Sbjct: 167 VDFQSHI--DQAGQADWWHNDELTEEAG--------YVTHLITQYAVKFIEDNK-EQPFC 215
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LYL H H G + A + +F D YA M+ ++D +G V+ LQ+
Sbjct: 216 LYLPHETPHYPYQGPNDLADRTVGGKFVNHGSRDDKAAAYAEMMIEMDRGIGEVLQTLQK 275
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
G+ E ++++F SDNG ++ G N RG K + WEGG +VP I W+P+
Sbjct: 276 HGLAEQTLVLFFSDNGGTSL-----------GDNGGLRGTKGSDWEGGHRVPMIAWNPET 324
Query: 388 QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL--LNTPSRRN 440
R Q+ D PTL D + LPL D SLL L P R N
Sbjct: 325 VPAGRDCDQLAITHDIFPTL-----ADYADLPLPKDQSLDGISLLPYLEDPKREN 374
>gi|146275662|ref|YP_001165822.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
gi|145322353|gb|ABP64296.1| sulfatase [Novosphingobium aromaticivorans DSM 12444]
Length = 462
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 113/357 (31%), Positives = 177/357 (49%), Gaps = 32/357 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP--IHTGMQG 148
G+ D S GS I TP ID++ G++L Y+ P+C+P+R +L+TG Y G++
Sbjct: 46 GYADTSATGSRHIRTPAIDSIGAGGVMLRQGYSSTPICSPTRTALLTGCYAQRFAIGVE- 104
Query: 149 PPIWGAEPRG--VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
P+ P G VPL + ++ LGY T +GKWHLG + PL G++ G +
Sbjct: 105 EPLGPNAPAGIGVPLDRPTIASVMKALGYRTSLVGKWHLGE-PPAHGPLKHGYDHFLGIV 163
Query: 207 NGVISYYDH--ILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
G Y+ H ++S + + V L D + + + Y TD+F EAV++IE+
Sbjct: 164 EGGADYFVHRMVMSGKPA-GVGLAEDDAQTDRTG-------YLTDIFGDEAVRVIEEGG- 214
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
++P FL L A H G+ E + + N Y MV+ +D +V V++A
Sbjct: 215 NQPFFLSLHFTAPHWPWEGREDEKLARALPSSFHYEGGNLAKYREMVETMDQNVAKVLAA 274
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
+ R G +N++++F SDNG + +P+ G K + EGGV+VP ++ W
Sbjct: 275 IDRSGKADNTVVVFTSDNGG-----------ERFSDTWPFVGHKGEVLEGGVRVPLMVRW 323
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+I+ R S Q+M D+LPTL AGGD +R+ DG D + L P R
Sbjct: 324 PRRIKAGSR-SEQVMVSMDFLPTLLGMAGGDAARIG-RFDGADLSAQLAGAAPVTRT 378
>gi|343086062|ref|YP_004775357.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354596|gb|AEL27126.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 444
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 130/410 (31%), Positives = 188/410 (45%), Gaps = 84/410 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G+ + TP++D LA GI N Y VCTPSRA L+TG+YP + +
Sbjct: 45 GYADLGVYGAEDFETPHLDQLASEGIRFTNFYVPATVCTPSRAGLLTGQYPKRSNLHEAV 104
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
++ G+ + E L+ GYST IGKWHLG + EY P +GF++ +G Y N
Sbjct: 105 LFPYSEGGLSPQAFTMAELLKGAGYSTACIGKWHLG-HKDEYMPYNQGFDTFYGVPYSND 163
Query: 209 VISYYDHILSDQ------YSRT-VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIED 261
+ +YY + Q Y T V NG D R Y T +T+E V+ I++
Sbjct: 164 MDNYYYKNIDFQSPPLPFYENTKVIENGSDQR------------YLTKRYTEETVKRIKN 211
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ +KP F+YLAH H P F+ Y ++ +LD S G +
Sbjct: 212 RG-EKPFFIYLAHNMPH---------TPLFASPAFE--GKSKNGLYGDVIMELDWSAGEI 259
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
I+AL+ +G+ EN+I +F SDNG GS P RG K WEGG +VP I
Sbjct: 260 INALKEEGIYENTIFVFTSDNGPAK------------GSAKPLRGKKAQTWEGGQRVPGI 307
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP--LNIDGLDQWSSLLLNTPSRR 439
+ P V+ + + D PT A ++++P + IDG+D S+ L+N +
Sbjct: 308 ITWPGSIPRGVVTDEFVSTLDLFPTFAKLA---STKIPEEIKIDGMD-ISAFLMNP---K 360
Query: 440 NSNIDGLDQWSSLLLNTPSR------RNSVLINIDEKKRTAAVRLDSWKL 483
N N+ P R RN L A+RL WKL
Sbjct: 361 NENL-------------PERPFYFYARNGEL---------EAIRLGKWKL 388
>gi|298375849|ref|ZP_06985805.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|423330336|ref|ZP_17308120.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
gi|298266886|gb|EFI08543.1| arylsulfatase [Bacteroides sp. 3_1_19]
gi|409231952|gb|EKN24800.1| hypothetical protein HMPREF1075_00133 [Parabacteroides distasonis
CL03T12C09]
Length = 452
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W QI+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|325109298|ref|YP_004270366.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324969566|gb|ADY60344.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 463
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 118/372 (31%), Positives = 185/372 (49%), Gaps = 49/372 (13%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASL 135
AA + L L+ G+ DLS +G+ ++ +PNID L G+ N YA PVC+P+RA++
Sbjct: 30 AADKPNVLLILVDDLGYGDLSCYGATDLQSPNIDKLVSRGLKFTNFYANCPVCSPTRAAI 89
Query: 136 MTGKYPIHTGMQG------PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
++GKYP G+ G WG P E LP L+ GY + IGKWHLG
Sbjct: 90 LSGKYPDRVGVPGVIRTHADNSWGYL---APEAE-LLPSLLQPAGYHSAIIGKWHLGL-E 144
Query: 190 REYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
P RGF+ GYL ++ YYDH +G + R D G +AT
Sbjct: 145 APNRPNDRGFDHFKGYLGDMMDDYYDH----------RRHGINYMRENEQEIDPEG-HAT 193
Query: 249 DLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQ---ETINQFQYITDPNR 304
DLFT+ + +++Q ++P FLYLA+ A H ++ PQ E + + + R
Sbjct: 194 DLFTEWSCDYLKEQADNEQPFFLYLAYNAPHT-----PIQPPQDWLEKVKKREAGIGEQR 248
Query: 305 RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY-P 363
A+++ +DD +G V++ L++ G +++++IF SDNG N G+N
Sbjct: 249 AKLVALIEHMDDGIGQVLTCLEQTGQADDTLVIFTSDNGGQL----------NVGANNGA 298
Query: 364 YRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
R K +++EGG+KVP A++W +I+ Q + + D+ T+ AG + P +
Sbjct: 299 TRDGKQSMYEGGLKVPTAVVWPGKIEAGQTTEFQALSM-DFCSTILDVAGVEQ---PSDC 354
Query: 423 DGLDQWSSLLLN 434
DG +LL N
Sbjct: 355 DGRSILPTLLGN 366
>gi|241836593|ref|XP_002415110.1| arylsulfatase B, putative [Ixodes scapularis]
gi|215509322|gb|EEC18775.1| arylsulfatase B, putative [Ixodes scapularis]
Length = 254
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 106/289 (36%), Positives = 152/289 (52%), Gaps = 52/289 (17%)
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V +LD SVG V+ +L R+GMLENS+++F SDNGA R N GSN+P RG K+T
Sbjct: 8 VYELDQSVGMVMESLHRRGMLENSVVVFSSDNGA-----RPLGLGANAGSNWPLRGGKST 62
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
LWEGGV+ A +WSP + + R+S QMMHISDWLPT YTAAGG+ SRL
Sbjct: 63 LWEGGVRGAAFVWSPLLSRVGRLSDQMMHISDWLPTFYTAAGGNLSRL------------ 110
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
+IDG D W +L N S R+ +L+NID +A+ ++ K++LG
Sbjct: 111 ----------GDIDGKDMWDALSKNLASPRHEILLNIDPIYNYSALIFENRKVILGD--- 157
Query: 491 GTMDGYY-------GQTRSNKVPLLNFN-AIVESKTYQSLQQLSQNIFLPISNIDKMRST 542
T G Y G TR P+ + ++ S+T L+ F + + +
Sbjct: 158 -TAGGLYSLRTPVPGGTR----PVDGLDQMMLNSRTGAVLRD-----FYKVQQVPVRPNW 207
Query: 543 RQQATIHCG-ANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDIS 590
RQ+ ++C NP + N Y F++ DPCE +N+A++ +S
Sbjct: 208 RQEVAVNCSYYNP---RNNFSDNVSLYYFDVKQDPCELDNLAATNVTVS 253
>gi|381160679|ref|ZP_09869911.1| arylsulfatase A family protein [Thiorhodovibrio sp. 970]
gi|380878743|gb|EIC20835.1| arylsulfatase A family protein [Thiorhodovibrio sp. 970]
Length = 609
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 150/527 (28%), Positives = 228/527 (43%), Gaps = 86/527 (16%)
Query: 54 HLEAPQETINQFQYITDPN-----RRTYAALTKSTT------LTLLIVYGWNDLSFHGSN 102
HL+A + N + I P R A L K T + LL GW D F+G
Sbjct: 72 HLKAEKPADNMYPVIAHPEQENAAREKLAELEKKTGKKPNVLIFLLDDVGWMDPGFNGGG 131
Query: 103 EI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGV 159
TPN+D LA +G++L + Y+ P C+PSRA++ TG+ P+H G+ PP++G EP G+
Sbjct: 132 VAVGNATPNMDELAQDGLLLTSAYSTPSCSPSRATIHTGQNPLHHGILRPPMYG-EPGGL 190
Query: 160 PLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYY------ 213
LP+ L++ GY T+ +GKWH+G + P G++ + G+L GV Y
Sbjct: 191 D-GAITLPKLLKKHGYVTQGVGKWHMG-ENKGSLPQNVGYDDYVGFL-GVSDMYTEWRDL 247
Query: 214 ----DHILSDQYSRTVELNGHDMRRNLSTAWDTVG---------EYATDLFT--KEAVQL 258
+ LS + ++ +G D TA DT +Y DL T K+
Sbjct: 248 YFNPEVALSPARFKMMQEDGFDHHEVHCTADDTEQCKDGKLIDLDYIKDLDTHWKDVSLA 307
Query: 259 IED--QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
D + D+P FLY A H N P E + + T Y+ + ++DD
Sbjct: 308 FLDKMKDSDEPFFLYHATRGCHFDNY-----PPDEYLGKSAAHT-----VYSDCIVQMDD 357
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
G ++ L+ G LEN++I+ SDNG P E P+RG K + WEGGV
Sbjct: 358 IFGALVDKLEETGQLENTLILLTSDNG-PECEIPPH-------GRTPFRGCKGSSWEGGV 409
Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNT 435
+VP W I R S + +D LPT+ + G + G D L
Sbjct: 410 RVPTFAYWKGMIPS--RRSDGLFDFADILPTVMSMTG---------VSGAD------LAG 452
Query: 436 PSRRNSNIDGLDQWSSLLLNTP-SRRNSVLINIDEKKRTAAVRLDSWKLVL------GTQ 488
+ IDG+DQ LL + S R S I++ +R+D +KLV G
Sbjct: 453 EFSDVTYIDGVDQLPFLLADDGLSSRRSRPYTINQ--YFGGIRIDEFKLVWTGEIENGVV 510
Query: 489 ENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISN 535
+ G G+ G + + FN + S+ +++P+
Sbjct: 511 QKGDFGGFSGSVFTESGGAIMFNLYTNPQEDVSIGVRHIPMYVPLEG 557
>gi|406661473|ref|ZP_11069592.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
gi|405554747|gb|EKB49822.1| Arylsulfatase precursor [Cecembia lonarensis LW9]
Length = 478
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 117/345 (33%), Positives = 172/345 (49%), Gaps = 39/345 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL G+++I TPNID +A GI + + A PVC+PSRA L+TG+ P G+
Sbjct: 58 GYGDLGCFGASDIATPNIDRIAAEGIKFTSFLSASPVCSPSRAGLLTGRMPQRMGIN-TV 116
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G+ E + E L+ GY T +GKWHLG R + PL +GF +FG I
Sbjct: 117 FFPESLTGMDPEEITIAEILKTKGYRTGIVGKWHLGHLER-FLPLNQGFYEYFG-----I 170
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y + + S Y MR N A+ Y T +T+E+++ I D D+P FL
Sbjct: 171 PYSNDMASVVY----------MRGNEVEAYHVDQRYMTRTYTEESLKFI-DASGDQPFFL 219
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLAH H P +F+ + NR Y ++++LD SVG V+ L+ KG+
Sbjct: 220 YLAHNMPHV---------PIYASPEFEGTS--NRGLYGDVIQELDWSVGEVLRKLEEKGI 268
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI-LWSPQIQQ 389
LEN++I+F SDNG P + + + GS R K +EGGV+VP + +W +I+
Sbjct: 269 LENTLIVFSSDNG-PWLVMED-----HGGSAGHLREGKQFTFEGGVRVPTVAMWKGKIEP 322
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
+V + DW PT G + + + IDG D L N
Sbjct: 323 G-QVYEDLATQMDWFPTFSQLVGAEIPQDRI-IDGKDLGPVLFEN 365
>gi|260063387|ref|YP_003196467.1| arylsulfatase [Robiginitalea biformata HTCC2501]
gi|88783481|gb|EAR14653.1| putative arylsulfatase [Robiginitalea biformata HTCC2501]
Length = 561
Score = 154 bits (389), Expect = 1e-34, Method: Compositional matrix adjust.
Identities = 124/375 (33%), Positives = 180/375 (48%), Gaps = 62/375 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G + TPNID LA G++ Y+ VC PSRA+L+TG + HT ++G
Sbjct: 126 GYGDLGCYGQEKFNTPNIDRLAQEGLLFTQHYSGSTVCAPSRAALLTGMHTGHTVVRG-- 183
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
+P G +P + L E RE GY+T A GKW LG+ E P +GF++ FGY
Sbjct: 184 NREVQPEGQHPIPDSTYTLAEMFRENGYATGAFGKWGLGYPGSEGDPNAQGFDTFFGYNC 243
Query: 206 -LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
G Y H+ ++ S +E N ++ G YA +L +E + IE+
Sbjct: 244 QRMGHHYYPHHLWYNRDSIVLEENAGKLK----------GRYAPELIHRETLNFIEEN-R 292
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-----------------QYITDP----- 302
+ P FLY+A + HA L AP+ET+ +F QY P
Sbjct: 293 ENPFFLYVASIIPHA-----ELAAPEETLQRFRGRYPPEKPYEGTDQGPQYRKGPYESQQ 347
Query: 303 -NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
R +AAM+ LD VG ++ ++ G+ EN++I+F SDNG P +E +Y + SN
Sbjct: 348 KPRAAFAAMISILDRQVGEIMEKVRELGLEENTLIVFTSDNG-PHLEGGADPDY--FDSN 404
Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHIS---DWLPTLYTAAGGDTSR 417
P RG K L+EGG++VP + W I R HIS D PT G
Sbjct: 405 GPLRGYKRDLYEGGIRVPMLARWPGWIPPGTRTD----HISAFWDVFPTFAEILG---VA 457
Query: 418 LPLNIDGLDQWSSLL 432
P+ +DG+ +LL
Sbjct: 458 PPVEMDGISFLPTLL 472
>gi|255014751|ref|ZP_05286877.1| putative secreted sulfatase precursor [Bacteroides sp. 2_1_7]
gi|410102840|ref|ZP_11297765.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
gi|409237967|gb|EKN30762.1| hypothetical protein HMPREF0999_01537 [Parabacteroides sp. D25]
Length = 452
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W QI+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|343086301|ref|YP_004775596.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354835|gb|AEL27365.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 481
Score = 154 bits (389), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 148/526 (28%), Positives = 223/526 (42%), Gaps = 109/526 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL G I TPN+D +A+ G N Y A PVCTPSRA L+TG+ PI +GM
Sbjct: 44 GYGDLGVFGHPSIKTPNLDKMAFEGQKWTNFYVAAPVCTPSRAGLLTGRLPIRSGMSSDD 103
Query: 151 ---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--Y 205
++ G+P +E + + L+ GY T AIGKWHLG + Y P GF+S+FG Y
Sbjct: 104 RRVLFPDSDGGLPQSEITIAKALKGNGYQTAAIGKWHLGH-KSPYLPTDHGFDSYFGIPY 162
Query: 206 LNGV--ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
N + I DH + N M+ T T +T+EAV I+
Sbjct: 163 SNDMDKIDKTDHFTLTDNEKFDAYNVPLMKDAEIAERPTDQRTLTKRYTEEAVSKIKAFK 222
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR---TYAAMVKKLDDSVGT 320
D+P F+YLAH H I F+ ++ Y +++++D SVG
Sbjct: 223 -DEPFFIYLAHNLPH--------------IPLFRSAAFKDQSLGGIYGDVIEEIDWSVGQ 267
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
++S L+ +G+ EN++++F SDNG P ++ + G+ RG K +EGG++ P
Sbjct: 268 ILSTLKEEGIAENTLVVFTSDNG-PWHVFK-----THGGTAGLLRGAKGGTFEGGMREPT 321
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
+ W P Q P V + M D LPT +G + LP +
Sbjct: 322 VFWWPA-QIKPGVVMDMGTTMDLLPTFCAISG---TELP-------------------DD 358
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
DG D L S R +V +R A+R +K TQ YG
Sbjct: 359 RIYDGYDISGLLKGTGKSERETVFYY--RGQRVYAIRKGDYKAHFITQLE------YGNP 410
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPS 560
++ V TR + + N A + +
Sbjct: 411 TAHPV------------------------------------TRPEVVVE---NKATVLET 431
Query: 561 PCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
P L+N+ DP E+ NIA P+I +++ E+L H+ ++VP
Sbjct: 432 PL------LYNVNVDPGERFNIAEDHPEIIAEIREVLAAHQASIVP 471
>gi|301311961|ref|ZP_07217883.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|423339483|ref|ZP_17317224.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
gi|300830063|gb|EFK60711.1| putative arylsulfatase [Bacteroides sp. 20_3]
gi|409230864|gb|EKN23725.1| hypothetical protein HMPREF1059_03149 [Parabacteroides distasonis
CL09T03C24]
Length = 452
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 124/362 (34%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALEQNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W QI+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNQIR--PAVRDNFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|390944774|ref|YP_006408535.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
gi|390418202|gb|AFL85780.1| arylsulfatase A family protein [Belliella baltica DSM 15883]
Length = 470
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 116/371 (31%), Positives = 180/371 (48%), Gaps = 46/371 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G D+ +G + TPN+ LA GI+ + YA VC PSRASLMTG++ ++G
Sbjct: 43 GVGDIEPYGQSIFETPNLSRLAQEGIMFKDFYAGSTVCAPSRASLMTGQHTGKIRIRGNG 102
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ +P + LPE L+E GY+ GKW LG E +P RG++ G+L+ V
Sbjct: 103 EFPLDP-----DVKILPEMLKENGYTNLMFGKWGLGLQESESSPEKRGWDFFLGHLHHVS 157
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+++ Q +++L + L G YA D FT+ A+ IE ++P F+
Sbjct: 158 AHF------QKPDSLDLF---LNNTLGRVKVPEGTYANDWFTEGAINYIETSSNEEPFFI 208
Query: 271 YLAHLAAHAGNAGKHLEAP----QETINQFQYITDPN----------------RRTYAAM 310
YL++ HA L P ++ +N+ PN + YAA+
Sbjct: 209 YLSYTIPHA-----ELLVPDRYFKDFVNENGQSVFPNEIAWPDGKHYGQQNYPKAAYAAL 263
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V LDD VG V+ AL+ KG+ +N+I+IF SDNG R + + S+ YRGVK
Sbjct: 264 VHSLDDYVGQVMDALESKGIADNTILIFTSDNGTHIEGGRRMEDVDYFQSSGQYRGVKRD 323
Query: 371 LWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
L++GG+KVP I+ W +I+QN S+ S W +Y G + DG+
Sbjct: 324 LYDGGIKVPLIIKWPDKIKQN---SVTEHRASFW--DVYNTFGDIVGNIDSTRDGISFLP 378
Query: 430 SLLLNTPSRRN 440
+LL + +++
Sbjct: 379 TLLGKSKQKQH 389
>gi|149197520|ref|ZP_01874571.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
gi|149139538|gb|EDM27940.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Lentisphaera araneosa
HTCC2155]
Length = 446
Score = 154 bits (388), Expect = 2e-34, Method: Compositional matrix adjust.
Identities = 103/339 (30%), Positives = 170/339 (50%), Gaps = 19/339 (5%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASL 135
AA + L GW D+++HG + TP IDA+A G+ YA VC PSRA +
Sbjct: 16 AADKPNIVLVFADDMGWGDVAYHGVEDAQTPAIDAIAKGGVWFEQGYAAASVCGPSRAGI 75
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG+Y + G G +G+P +++ + E L+ GY + A GKWHLG + ++ P
Sbjct: 76 LTGRY---QQLFGVVTNGDADKGIPKSQKNIAELLKPAGYKSGAFGKWHLGSKKGQF-PN 131
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
RGF++ +G+ G YY +D+ + + N G+Y T+ T A
Sbjct: 132 DRGFDTFYGFHFGAHDYY---RADKKLNKKKKGYAPIYFNQDIVDYKEGDYLTEKITDHA 188
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLD 315
V+ IE+ D+P F+Y+A+ + H+ + P E + + RR + AMV +D
Sbjct: 189 VEFIEENK-DQPFFMYVAYNSVHS-----PWQVPDEYLARIPESVPAYRRLFLAMVLAMD 242
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE--TSNYRNWGSNYPYRGVKNTLWE 373
D VG + + L+ + EN+I +F +DNG+P + ++ YR S +RG K +E
Sbjct: 243 DGVGRIRAKLKELNLDENTIFVFTTDNGSPKIGNKKPNEGQYRMSMSQ-GFRGYKGDTYE 301
Query: 374 GGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
GG++VP + W +I+ + ++ D PT +AA
Sbjct: 302 GGIRVPFCMSWPKKIKSGNKFEAPVI-AYDLAPTFLSAA 339
>gi|443694992|gb|ELT96000.1| hypothetical protein CAPTEDRAFT_113428 [Capitella teleta]
Length = 521
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 115/384 (29%), Positives = 187/384 (48%), Gaps = 55/384 (14%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
+ L++ + ++++ GW DL +G TPN+D +A G++ + Y A +C+PSRA
Sbjct: 21 SCLSRKPNIIVMLMDDMGWGDLGSYGHPARETPNLDRMAAEGMLFTDFYSAAGICSPSRA 80
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPR-------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
S+MTG+ PI G + G+ +E PE L++ GY K +GKWHLG
Sbjct: 81 SVMTGRLPIRNGFYSTNQFAKNSYLPQDVMGGISDSEVLFPELLKQNGYRNKIVGKWHLG 140
Query: 187 FFRREYTPLYRGFESHFG-------YLNGVIS----YYDHILSDQYSRTVELNGHDMRRN 235
R ++ PL GF+ FG Y + V+ Y D ++ +Y ++ D++
Sbjct: 141 H-REQFLPLQHGFDEWFGAPDCHYKYDDKVMPNIPVYKDSLMQGRYYDHFKI---DIKTG 196
Query: 236 LSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ 295
LS T L+ +EA+ +E + D+P FLY A A H P +
Sbjct: 197 LSNL--------TQLYIQEALDFLEKE-RDRPFFLYWAPDATHT---------PLYASEK 238
Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
F+ ++ R Y V +LD VG +++ L+ G+ N+ F SDNGA T + E S
Sbjct: 239 FRGKSE--RGLYGDAVMELDSGVGKILNKLKELGIENNTFAFFSSDNGAATYAFTEVSII 296
Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
+ GSN P+ K T +EGG++ PA+ W P + +S Q+ ++ D T+
Sbjct: 297 ISGGSNGPFLCGKQTTFEGGMREPALAWWPTKIRPGSISKQVGNLMDIFSTVLK-----I 351
Query: 416 SRLP----LNIDGLDQWSSLLLNT 435
+ +P L +DG+D SS+L N+
Sbjct: 352 TSIPHPKGLYLDGVD-LSSVLFNS 374
>gi|313241546|emb|CBY33792.1| unnamed protein product [Oikopleura dioica]
Length = 336
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 107/304 (35%), Positives = 152/304 (50%), Gaps = 23/304 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW+D+S++ TP + L + I L++ Y+ CTPSRASL+TGKY G+ PI
Sbjct: 41 GWSDVSWNNKKIKATPFLGQLEKHSITLSSSYSTHRCTPSRASLLTGKYAWRFGLGTDPI 100
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G+ L E+ LPE LR+ GYST +GKWHLG Y P RGF++ +G+ GV++
Sbjct: 101 DANTAAGLDLKEKLLPEILRKNGYSTHHVGKWHLGHCNSSYLPHNRGFDTFYGHTGGVLN 160
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y+ H + + + D N + + G Y+T F A +L +D P F+Y
Sbjct: 161 YFQHNRAVGNCKYL-----DYFENDTPIHEKTGVYSTFDFGDHARKLY--NKIDDPKFIY 213
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRR-TYAAMVKKLDDSVGTVISALQRK 328
LA A H L AP+ I + + Y P R TY AMVK +D + ++ + K
Sbjct: 214 LALNAPHGP-----LMAPEHMIEEMRVLYPDSPRTRLTYLAMVKAIDLEMEKLLGTIWMK 268
Query: 329 GMLENSII--IFMSDNGAPTV-----EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
++II +F SDNG P + + S YR NYPY G K L EGG P +
Sbjct: 269 NEKRDTIIATVFQSDNGGPLYMNGRPKDNDPSRYRQ-ACNYPYLGEKTLLTEGGTLSPTL 327
Query: 382 LWSP 385
P
Sbjct: 328 GTDP 331
>gi|326331907|ref|ZP_08198194.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
gi|325950404|gb|EGD42457.1| twin-arginine translocation pathway signal [Nocardioidaceae
bacterium Broad-1]
Length = 463
Score = 153 bits (387), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 111/337 (32%), Positives = 163/337 (48%), Gaps = 43/337 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP--IHTGMQG 148
GW DLS +GS +I TP++D LA +G+ + Y A VC+P+R L TG+YP + G+
Sbjct: 60 GWADLSAYGSPDIKTPHLDRLAASGVRFTDAYSASSVCSPTRFGLYTGRYPGRLAGGLPE 119
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + G+PL L L+E GY+T GKWH G+ Y+P G++ FG +G
Sbjct: 120 PISAPNKLHGIPLDHPTLASLLKEQGYATALFGKWHCGYLPW-YSPTRVGWDEFFGNFSG 178
Query: 209 VISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ Y+ I D Y VE+ D+R Y T++ T+ AV IE +
Sbjct: 179 GLDYFSKINHNGAHDLYEDEVEV--EDLR------------YYTEIVTERAVSFIE-RDH 223
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQ--------FQYITDPNR-RTYAAMVKKLD 315
D P L L H G +A + + + TD TY MV+ LD
Sbjct: 224 DTPWLLNLNFTTPHWPWEGPEDKAVSDELTARIEGGEKGVLFHTDGGSLATYKTMVEDLD 283
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
S+G VI AL+R G L ++++ F SDNG + + +P+ G K+ L EGG
Sbjct: 284 QSIGQVIKALERTGQLRDTVVFFASDNGG-----------ERFSNVWPFTGAKSGLNEGG 332
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
++VP IL P I + +VS + + +DW T AG
Sbjct: 333 IRVPTILSWPGILRGRQVSDEPVVTTDWTATFLELAG 369
>gi|444523022|gb|ELV13435.1| Arylsulfatase J [Tupaia chinensis]
Length = 383
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 115/405 (28%), Positives = 183/405 (45%), Gaps = 69/405 (17%)
Query: 227 LNGHDMRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
+ G+D+ N AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+
Sbjct: 1 MCGYDLYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHSP----- 55
Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA- 344
L+AP ++ I + NRR YAAM+ LD+++ V AL+ G +NSI+I+ SDNG
Sbjct: 56 LQAPGRYFEHYRSIININRRRYAAMLACLDEAIHNVTLALKTYGFYDNSILIYSSDNGGQ 115
Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWL 404
PT GSN+P RG K T WEGG++ + SP ++ V +++HI+DW
Sbjct: 116 PTAG----------GSNWPLRGSKGTYWEGGIRAVGFVHSPLLKNRGTVCRELVHITDWY 165
Query: 405 PTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL 464
PTL + A G + +DG D W ++ + R+ +D L + + +
Sbjct: 166 PTLISLAEGQIDE-DIQLDGYDIWETI---SEGLRSPRVDILHNIDPIYTKAKNGSWAAG 221
Query: 465 INIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQ 524
I +A+R+ WKL+ G GY S+ VP F
Sbjct: 222 YGIWNTAIQSAIRVQHWKLLTGNP------GY-----SDWVPPQAF-------------- 256
Query: 525 LSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIAS 584
SN+ R ++ T+ G + +LFN+ DP E+ +++S
Sbjct: 257 ---------SNLGPTRWHNERVTLSTGKS-------------VWLFNITADPYERVDLSS 294
Query: 585 SRPDISSQLYELLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSPW 629
P I QL L +T VP + P +++P+ W PW
Sbjct: 295 RYPGIVKQLLRRLSQFNKTAVPVRYP-PKDPRSNPRLNGGVWGPW 338
Score = 48.9 bits (115), Expect = 0.009, Method: Compositional matrix adjust.
Identities = 27/80 (33%), Positives = 43/80 (53%), Gaps = 6/80 (7%)
Query: 1 MRRNLSTAWDTV-GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 59
+ N AWD G Y+T ++T+ Q++ KP+FLY+A+ A H+ L+AP
Sbjct: 6 LYENDHAAWDQDNGIYSTQMYTQRVQQILASHNPAKPIFLYIAYQAVHS-----PLQAPG 60
Query: 60 ETINQFQYITDPNRRTYAAL 79
++ I + NRR YAA+
Sbjct: 61 RYFEHYRSIININRRRYAAM 80
>gi|149199917|ref|ZP_01876945.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149136986|gb|EDM25411.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 499
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 114/352 (32%), Positives = 169/352 (48%), Gaps = 37/352 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ DL +G I TPNID +A G+ + YA+ C+P+RASLMTG + G+ G +
Sbjct: 34 GYGDLGCYGHPIIKTPNIDKMADRGVRFTDFYARHKCSPARASLMTGAFNFRVGV-GSIV 92
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL---NG 208
+ G+ +PE L+E GY+T IGKWHLG Y P +GF+ +FG +G
Sbjct: 93 YPNSTTGLIKEVVTIPEMLKEKGYTTALIGKWHLGH-TAGYLPRDQGFDYYFGVPGTNHG 151
Query: 209 VISYYDHILSDQYSRTVELNGHD----------------MRRNLSTAWDTVGEYATDLFT 252
+ +++ + + E D M+ + W T T +T
Sbjct: 152 DAKTHKLPVAEGFKPSGEFTIEDYWADKGKGVHGNSTILMKNDNVIEWPTDITQLTKRYT 211
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
+AV+ I++ DKP FLY AH H P F+ +D Y M++
Sbjct: 212 HDAVRYIKENK-DKPFFLYFAHGTPHH---------PYTVDAAFRGKSD--HGLYGDMIE 259
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
++D SVG VI ALQ G+ + +II F SDNGA + +E + GSN P +G K +
Sbjct: 260 EIDWSVGEVIKALQENGIEKKTIIAFTSDNGADSKPNKE---HAEKGSNLPLKGWKGSSE 316
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
EGGV+VP +L P + + ++ + D PT Y A G +P IDG
Sbjct: 317 EGGVRVPFVLSWPGTLPEGKKTNEIASLMDIFPT-YAALAGIEPEVPQKIDG 367
>gi|149198313|ref|ZP_01875359.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
gi|149138609|gb|EDM27016.1| iduronate-sulfatase or arylsulfatase A [Lentisphaera araneosa
HTCC2155]
Length = 476
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 116/366 (31%), Positives = 179/366 (48%), Gaps = 43/366 (11%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
++I++ G+ DLS G + TP ID +A G L + Y A PVCTPSRA+LMTG YP
Sbjct: 34 IIIIFTDDQGYADLSCFGGTHVSTPRIDQMAAEGAKLTSFYVAAPVCTPSRAALMTGTYP 93
Query: 142 ----IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
+ G + + +G+ E + E L+ +GY T GKWHLG + E+ P +
Sbjct: 94 KRIDMARGSNFVVLLAGDKKGLNPKEITIAEVLKAVGYKTGMFGKWHLGD-QPEFLPTRQ 152
Query: 198 GFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
GF+ FG Y + + Y+ ++ L+G ++ D +Y T FT+ A
Sbjct: 153 GFDEFFGLPYSHDIHPYHPQQSHFKFPSLPLLDGEEV-----IEMDPDADYLTKRFTERA 207
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHA---GNAGKHLEAPQETINQFQ----YITDPNR-RTY 307
VQ IE D+P FLY+ H H + + P+E + YI R +
Sbjct: 208 VQFIEKNK-DQPFFLYMPHPIPHTPLHASPAYMKDLPEEIKAKLSKENNYIDYKTRTQLL 266
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
+ + ++D SVG ++ AL++ G+ +N+I+IF SDNG P + G P RG
Sbjct: 267 SRAIGEVDWSVGQILDALKKNGIDDNTIVIFSSDNG-PAI-----------GKATPLRGR 314
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD--TSRLPLNIDGL 425
KN EGG++VP ++ P + ++M D LPT AG + + R+ IDG
Sbjct: 315 KNQTLEGGMRVPTVIHWPGKIPAGIANDEIMTTMDLLPTFAKLAGAEIPSDRV---IDGK 371
Query: 426 DQWSSL 431
D W+ +
Sbjct: 372 DIWAVI 377
>gi|262383795|ref|ZP_06076931.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
gi|262294693|gb|EEY82625.1| arylsulfatase A [Bacteroides sp. 2_1_33B]
Length = 452
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 123/362 (33%), Positives = 187/362 (51%), Gaps = 47/362 (12%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGS-NEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
AL S + L++V G+ DLS G+ +I TP+ID L G+ N +A PV +PSRA
Sbjct: 23 ALPDSPNILLILVDDLGYGDLSCQGAAKDIQTPHIDKLLNEGVRFTNFHANCPVSSPSRA 82
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLGFFRRE 191
SL+TG+YP G+ G E L+E LP+ L++ GY +GKW+LG
Sbjct: 83 SLLTGRYPDMVGVPGVIRTHKEDSWGYLSEDAVLLPQMLKKRGYHNAMVGKWNLGL-ESP 141
Query: 192 YTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLF 251
TP RGF+ + G+L ++ D Y + MR NL D G +AT++F
Sbjct: 142 NTPTERGFDFYRGFLGDMM--------DDYYTHRRFGNNYMRENLKEI-DPQG-HATEIF 191
Query: 252 TKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRT 306
+ A++ + D + +P FLYLA+ A H ++ PQE + + + +P+ R
Sbjct: 192 SDWAIRYLSDMKQKQEPFFLYLAYNAPHT-----PIQPPQEWLEKVKK-REPSLPEKRAK 245
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
A+++ LD +VG V AL++ G LEN+IIIF SDNG +N P+RG
Sbjct: 246 IVALIEHLDYNVGRVYEALKKNGQLENTIIIFTSDNGGQD---------DAGANNGPFRG 296
Query: 367 VKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--ID 423
K ++EGG++V I W +I+ P V + +SD PTL D + +P++ ID
Sbjct: 297 AKQDMYEGGIRVAGGIYWKNRIR--PAVRENFVMLSDMFPTLC-----DLTAVPVSHEID 349
Query: 424 GL 425
G+
Sbjct: 350 GI 351
>gi|424046834|ref|ZP_17784396.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Vibrio cholerae HENC-03]
gi|408884894|gb|EKM23622.1| type I phosphodiesterase / nucleotide pyrophosphatase family
protein [Vibrio cholerae HENC-03]
Length = 500
Score = 153 bits (386), Expect = 3e-34, Method: Compositional matrix adjust.
Identities = 129/429 (30%), Positives = 197/429 (45%), Gaps = 66/429 (15%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG+ P+ TGM + G P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLNQNTPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--GVISYYD--------- 214
LPE L+ELGY T GK HLG R E+ P GF+ ++G+L + Y +
Sbjct: 113 LPELLKELGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDEA 171
Query: 215 ----------HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
H +D + ++++ +G + T D V +A D F + AV+
Sbjct: 172 FQKFAPRNVIHAKADGKGGQSIKDDGALTIERMRTLDDEVNRHAID-FIERAVK------ 224
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+KP F + H HL E + + ++K LDD VG +++
Sbjct: 225 AEKPFFTWYCPSRGHVWT---HLSPKYEAM-----LGTNGWGLQEVVMKDLDDHVGEMLA 276
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL- 382
L+ G+ +N+I+IF +DNG + + + G P+ G K T WEGGV+ P ++
Sbjct: 277 KLEELGVADNTIVIFTADNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPFLIS 329
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
W +I N +VS M D LPTL AAGG P ++ +L ++
Sbjct: 330 WPEKIPAN-KVSNGMFDGMDLLPTLVEAAGG-----PADL------KQQMLKGYKGHKAH 377
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRS 502
+DG +Q L N S RN + E+ + AVR+ WK Q + G+ G
Sbjct: 378 LDGYNQMEMLTNNGKSNRNEIFYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEE 431
Query: 503 NKVPLLNFN 511
PLL FN
Sbjct: 432 LNAPLL-FN 439
>gi|421614584|ref|ZP_16055638.1| arylsulfatase A [Rhodopirellula baltica SH28]
gi|408494671|gb|EKJ99275.1| arylsulfatase A [Rhodopirellula baltica SH28]
Length = 1012
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 148/513 (28%), Positives = 223/513 (43%), Gaps = 80/513 (15%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
++ +R A TK + +L G+ DLS G+ + TP ID +A G L + Y A
Sbjct: 557 VSPADREKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 616
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
PVCTPSRA LMTG YP M +G +P+G+ E + E L+ GY T G
Sbjct: 617 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 676
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
KWHLG + E+ P +GF+ FG Y H + + R + + + +
Sbjct: 677 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPPLPLLQNDTVI 729
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
D ++ T T++AV IE D+P FLYL H HA G A + A
Sbjct: 730 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 788
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
++ Y T N + + ++D SVG ++ AL+ G+ E ++++F SDNG P
Sbjct: 789 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 843
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
+ + Y + G RG K T +EGG++ P ++ W QI + ++M D LPT
Sbjct: 844 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAE-HQNDELMTAMDLLPTF 897
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
AG TP+ R IDG D W +L T + ++ +
Sbjct: 898 AKLAGA--------------------ATPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 934
Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
+ AAVR WKL + NG Y KV ++ N V K L++
Sbjct: 935 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQHQLKEF 990
Query: 526 SQNIFLPISNIDKMRSTRQQATIHCGANPAPMT 558
+ +I ++R A I ANP P++
Sbjct: 991 AADI---------ASNSRPAAFI---ANPKPLS 1011
Score = 108 bits (271), Expect = 7e-21, Method: Compositional matrix adjust.
Identities = 140/574 (24%), Positives = 223/574 (38%), Gaps = 127/574 (22%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
T A + + L+ V G+ DLS +G+ ++ TPNID LA G + + A VCTPS
Sbjct: 32 TSVAAERPPNVVLIFVDDLGYGDLSCYGATKLSTPNIDRLAAKGRRFTDAHSASAVCTPS 91
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
R L+TG+YP+ M G IWG P L + + + + GY+T +GKWHLGF
Sbjct: 92 RYGLLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKWHLGF 150
Query: 188 FRREYT--------PLYRGFESHFGY-LNGVISYYDHILSDQYSRTVELNGHDMRRNLST 238
P GF+ +FG L S Y ++ D + G+D L
Sbjct: 151 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDS------IFGYDPNDPLVY 204
Query: 239 AWDTVG-----------------------------EYATDLFTKEAVQLIEDQPVDKPLF 269
V E L T+ AV+ I ++ D+P F
Sbjct: 205 GGKPVSPTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFF 263
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
LY A H P +F+ + Y V +LD VG ++ +L+ G
Sbjct: 264 LYFATPNIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNG 312
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQ 388
+ +N++++F SDNGA + N G K +WEGG +VP I W +I+
Sbjct: 313 LTDNTLVLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK 372
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLD 447
AG + +L +D +S+L PS D ++
Sbjct: 373 ----------------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSIN 408
Query: 448 QWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPL 507
+LL + + L+ + R A+R W L +G + +G +G Q + P
Sbjct: 409 MLPALLDDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNGSKPQHHAWGGP- 466
Query: 508 LNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGANPAPMTPSPCTNGPC 567
A V+ +++ S+I R + AP P
Sbjct: 467 ----AAVQFSGHKN------------SDIVNGRIKKN----------AP---------PA 491
Query: 568 YLFNLGNDPCEQNNIASSRPDISSQLYELLKYHR 601
L++L ND + N+ P++ ++ LL+ +R
Sbjct: 492 QLYDLENDRSQTTNVYREHPEVVEEMKALLESYR 525
>gi|410631840|ref|ZP_11342513.1| arylsulfatase B [Glaciecola arctica BSs20135]
gi|410148741|dbj|GAC19380.1| arylsulfatase B [Glaciecola arctica BSs20135]
Length = 544
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 130/392 (33%), Positives = 186/392 (47%), Gaps = 69/392 (17%)
Query: 82 STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKY 140
+ L L G ND+S G I TPNID LA G + N Y+ C PSRA+L+TG+
Sbjct: 60 NVVLILTDDMGINDVSTFGGGMINTPNIDKLAAKGALFTNGYSGHANCAPSRAALLTGRD 119
Query: 141 PIHTGMQGPPI----------------WGAEP----------------RGVPLTERFLPE 168
TG PI G P RG+P E + E
Sbjct: 120 ASRTGFDVTPIPDGFGRMIAMIENKNSKGRPPVEYFPEIDKNNPTFNNRGLPGEEITIAE 179
Query: 169 YLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELN 228
+++ GY T IGKWHLG E P +GF+ +G+ DH + + VE +
Sbjct: 180 LMQDAGYRTLHIGKWHLGR-TDEMMPNAQGFDESLMMASGLYLPVDH--PEAVNAQVEFS 236
Query: 229 GHD----MRRNLSTAWDTVGE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
G D + + W+ GE Y TD FT+EA + IE ++P FLYLAH H
Sbjct: 237 GLDRFIWATMDYAVTWNG-GERFAPRGYLTDYFTEEAERAIEAN-ANRPFFLYLAHWGPH 294
Query: 279 AGNAGKHLEAPQETINQFQYITDPNR-RTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
N + ++ E + Q P+R R YAAM++ LD SV V++ L+ +G+ +N+I+I
Sbjct: 295 --NPVQAKKSDYEAVGDIQ----PHRKRVYAAMLRSLDRSVERVMAKLEEQGIADNTIVI 348
Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQ 396
SDNG +Y N N PYRG K+T +EGG++VP +I W +I+ +
Sbjct: 349 LSSDNGG--ADYVAIDNL-----NKPYRGWKSTFFEGGIRVPFSITWPGEIEAGLIIDEP 401
Query: 397 MMHISDWLPTLYTAAGGDTSRLPLN--IDGLD 426
+ H D +PT+ A G +LP + +DG D
Sbjct: 402 VNHF-DLMPTIVNAGQG---KLPQDRILDGRD 429
Score = 39.7 bits (91), Expect = 4.8, Method: Compositional matrix adjust.
Identities = 27/69 (39%), Positives = 38/69 (55%), Gaps = 8/69 (11%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR- 73
Y TD FT+EA + IE ++P FLYLAH H N + ++ E + Q P+R
Sbjct: 263 YLTDYFTEEAERAIEAN-ANRPFFLYLAHWGPH--NPVQAKKSDYEAVGDIQ----PHRK 315
Query: 74 RTYAALTKS 82
R YAA+ +S
Sbjct: 316 RVYAAMLRS 324
>gi|442317872|ref|YP_007357893.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
gi|441485514|gb|AGC42209.1| sulfatase family protein [Myxococcus stipitatus DSM 14675]
Length = 559
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 122/450 (27%), Positives = 203/450 (45%), Gaps = 64/450 (14%)
Query: 87 LLIVYG-----WNDLSF-HGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKY 140
+L+++G WN ++ G TPNID +A G ++ + Y Q CT RA+ +TG
Sbjct: 30 ILVIWGDDIGLWNISAYNQGMMGYRTPNIDRIAKQGALMTDCYGQQSCTAGRAAFITGMN 89
Query: 141 PIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
P+ TG+ + GA+ G+ ++ + E L+ LGY+ GK HLG R Y P GF+
Sbjct: 90 PLRTGLTTIGMPGAD-YGLQDSDPTIAEMLKPLGYTCGQFGKNHLGDSNR-YLPTVHGFD 147
Query: 201 SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-------------VGEYA 247
G L + + + D Y + H R + +W T VG+
Sbjct: 148 EFHGNLYHLNAENEPECPD-YPKDPAFKDHFGPRGVLHSWATDRDDPTEEHRWGRVGKQK 206
Query: 248 ---TDLFTKEAVQLIEDQPV-------------DKPLFLYLAHLAAHAGNAGKHLEAPQE 291
T TK+ ++ ++++ + DKP F++ H
Sbjct: 207 VVDTGPLTKKRMETVDEEFLASSLAFMEQAVKSDKPFFIWHNTTRTHV-----------W 255
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
T Q +Y + YA + +LDD VGT+++ L G+ +N+I++F +DNG + + +
Sbjct: 256 TFLQEKYRNKTGKGLYADAMTELDDIVGTLLAKLDELGIADNTIVVFSTDNGVEKMSWPD 315
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
G N P+RG K + WEGGV+VP ++ P + + V + DW+PTL AA
Sbjct: 316 -------GGNAPFRGEKGSTWEGGVRVPCVVRWPGVIEPGTVINDLFAHEDWMPTLVAAA 368
Query: 412 GGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKK 471
GG T D +++ + + ++DG DQ L P +R+ + +D
Sbjct: 369 GGPT-------DLVEKCKQGYSASGKKFRVHLDGYDQRGLLSGKEPGQRHEFVYVLDSGN 421
Query: 472 RTAAVRLDSWKLVLGTQENGTMDGYYGQTR 501
AAVR D WK++ Q+ D ++ R
Sbjct: 422 -IAAVRYDDWKVIFAYQDGHGPDMWFSGKR 450
>gi|404486643|ref|ZP_11021833.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
YIT 11860]
gi|404336461|gb|EJZ62922.1| hypothetical protein HMPREF9448_02276 [Barnesiella intestinihominis
YIT 11860]
Length = 478
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 131/443 (29%), Positives = 195/443 (44%), Gaps = 75/443 (16%)
Query: 77 AALTKSTTLTLLIVY------GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCT 129
+ TKS + ++Y G+ DL +G I TPNID LA G++ Y+ V
Sbjct: 18 SCTTKSKKQKMNVIYILADDLGYGDLGCYGQKIIKTPNIDRLAEEGMLFTQHYSGCTVSA 77
Query: 130 PSRASLMTGKYPIHT---GMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSR+ LMTG++ HT G Q G EP +P L +++ GY+T A GKW LG
Sbjct: 78 PSRSCLMTGQHTGHTYIRGNQSHGTEGEEP--LPGNTYTLARMMKDAGYATGAFGKWGLG 135
Query: 187 FFRREYTPLYRGFESHFGY--LNGVISYYD-HILSDQYSRTVELNGHDMRRNLSTAWDTV 243
+ E P GF+ FGY YY H+ +Q V L G++ + +
Sbjct: 136 YPCSEGDPTNLGFDEFFGYNCQRQAHHYYPYHLWHNQ--EKVMLPGNEGSKTET------ 187
Query: 244 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAG----------------NAGKHLE 287
YA DL ++A+Q I D KP FLYL ++ HA GK E
Sbjct: 188 --YAQDLIQEKALQFIVDNQ-SKPFFLYLPYILPHAELVSPEDSILAMYKGKIEEGKPYE 244
Query: 288 APQETINQF----QYITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDN 342
+ N + Y + N +A+MV + D VG ++ L+R G+ +N+I+ F SDN
Sbjct: 245 GVDDIKNPYYKYGGYCSSENPHADFASMVTRFDAYVGEIMQTLKRLGLDKNTIVFFASDN 304
Query: 343 GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISD 402
G P E ++ N S P+RG+K ++EGG++VP I W+P + S + D
Sbjct: 305 G-PHQEGGADPDFFN--SYGPFRGIKRDMYEGGIRVPLIAWAPGKVEAGTCSDHISAFWD 361
Query: 403 WLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNS 462
+PTL AGG+ DG+ +L +++
Sbjct: 362 VMPTLADIAGGEI------------------------KGECDGISFAPTLFGGKGQKQHE 397
Query: 463 VLI-NIDEKKRTAAVRLDSWKLV 484
L E+ AVR+ +WK V
Sbjct: 398 YLYWEFHERGGRVAVRMGNWKGV 420
>gi|32475143|ref|NP_868137.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
gi|32445684|emb|CAD78415.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH 1]
Length = 485
Score = 152 bits (385), Expect = 4e-34, Method: Compositional matrix adjust.
Identities = 127/401 (31%), Positives = 177/401 (44%), Gaps = 68/401 (16%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
LLIV G+NDL G N I TP +D LA G L N Y A P CTPSRASL+TG+YP
Sbjct: 33 LLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91
Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
G+ P +G G+ E LP LR GY + GKW L
Sbjct: 92 RNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGKWDL 151
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G +R P RGF+ +G++N I Y+ H ++Y M RNL G
Sbjct: 152 GALQR-MLPTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGT 201
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH--------------AGNAGKHLEAPQE 291
Y T LF +EA++ +++ ++P FLY+ A H A + K + P E
Sbjct: 202 YCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLVPTIRSSVQAPDQFKAMYPPVE 261
Query: 292 T----INQFQY-----ITDPN--RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
++++Y + P RR Y A V +D ++G ++ L+ K ML+ +I++F S
Sbjct: 262 VETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEILDRLEAKQMLDETIVVFFS 321
Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHI 400
DNG N P RG K WEGG++VP ++ P Q V
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGQIPAGVVNDEFLT 370
Query: 401 SDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
S L + AA G + +DG D W +L S R
Sbjct: 371 SLELLPSFAAAAGVEPPPGVVLDGFDWWPTLRGEAESPRKE 411
>gi|167523060|ref|XP_001745867.1| hypothetical protein [Monosiga brevicollis MX1]
gi|163775668|gb|EDQ89291.1| predicted protein [Monosiga brevicollis MX1]
Length = 221
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 84/201 (41%), Positives = 123/201 (61%), Gaps = 8/201 (3%)
Query: 77 AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
AA TK L ++I G++DL F G ++I TPNIDAL G++ MY Q VC+PSRAS+
Sbjct: 25 AAGTKPHVLFIVIDDLGYDDLYFRG-HQIRTPNIDALQEEGLLFTQMYMQDVCSPSRASI 83
Query: 136 MTGKYPIHTGMQG--PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
++G+Y +H G+ PP + G+ L + L + +RE GY T+A+GKWH+GF++ YT
Sbjct: 84 LSGRYAMHHGVTDWIPP---RDSYGLMLNDTTLADKMREAGYDTRAVGKWHMGFYKWAYT 140
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
P +RGF S GY +G Y+ H + Y + H AWD G+Y+T +F++
Sbjct: 141 PTFRGFNSFLGYYSGGEDYFTHETDNAYDMHRDEGRHCGPNCSIPAWDLKGQYSTTIFSE 200
Query: 254 EAVQLI-EDQPVDKPLFLYLA 273
EA+++I + Q D PLFLYLA
Sbjct: 201 EAIRIINQRQAADPPLFLYLA 221
Score = 43.1 bits (100), Expect = 0.51, Method: Compositional matrix adjust.
Identities = 19/36 (52%), Positives = 28/36 (77%), Gaps = 1/36 (2%)
Query: 8 AWDTVGEYATDLFTKEAVQLI-EDQPVDKPLFLYLA 42
AWD G+Y+T +F++EA+++I + Q D PLFLYLA
Sbjct: 186 AWDLKGQYSTTIFSEEAIRIINQRQAADPPLFLYLA 221
>gi|375143552|ref|YP_005005993.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
gi|361057598|gb|AEV96589.1| Cerebroside-sulfatase [Niastella koreensis GR20-10]
Length = 482
Score = 152 bits (384), Expect = 5e-34, Method: Compositional matrix adjust.
Identities = 125/424 (29%), Positives = 198/424 (46%), Gaps = 64/424 (15%)
Query: 78 ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLM 136
A+ + L G+ DL +G I TPN+DA+A G YA VC PSR+S +
Sbjct: 32 AVKPNIIFILADDLGYGDLGCYGQKLIQTPNLDAMAKQGTRFTQFYAGTAVCAPSRSSFL 91
Query: 137 TGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
TG++ HT ++G G +P G +P + + E L++ GY+T GKW LG
Sbjct: 92 TGQHTGHTPIRGNK--GVQPEGQWPIPDSAVTIAEVLKKAGYATGDFGKWGLGPVASSGD 149
Query: 194 PLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLST-AWDTVGEYATD 249
P+ +GF++ +GY ++ Y DH+ + D R + + + +Y+ D
Sbjct: 150 PIKQGFDAFYGYNCQTLAHDYYPDHLWEN-----------DTRIDFAANTPEHPTDYSAD 198
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYITDPN----- 303
L ++A+Q I DQ P F++L++ HA L+ P +++ +++ + +
Sbjct: 199 LIHQKALQFI-DQHKGNPFFVFLSYTLPHAA-----LQVPDDSLFEKYKKLLNEKPIPVP 252
Query: 304 -------------RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
R YAAMV +LD VG V+ L+ G+ +N+++IF SDNG P E
Sbjct: 253 KWNGKNYAPQAYPRAAYAAMVSRLDVYVGQVLQKLKEAGIDKNTLVIFSSDNG-PHKEGG 311
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
+Y N SN P RG+K L+EGG++ P I W P + S + D+LPT
Sbjct: 312 YDPDYFN--SNGPLRGIKRDLYEGGMREPMIAWWPGKIKAGATSDYVGAFWDFLPTFAEL 369
Query: 411 AGGDTSRLPLNIDGLDQWSSLL--LNTPS----------RRNSNIDGLDQWSSLLLNTPS 458
A + P NIDG+ +LL N PS + + +W + LN S
Sbjct: 370 A---KALKPANIDGISIVPALLGQKNAPSHPWLYWEFHEQGGKQAVRMGKWKGIKLNVTS 426
Query: 459 RRNS 462
+ N
Sbjct: 427 QPNG 430
>gi|323454325|gb|EGB10195.1| hypothetical protein AURANDRAFT_62802 [Aureococcus anophagefferens]
Length = 2048
Score = 152 bits (384), Expect = 6e-34, Method: Composition-based stats.
Identities = 119/391 (30%), Positives = 191/391 (48%), Gaps = 54/391 (13%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSN-EIPTPNIDALAYNGIILNNMYAQPVCTPSRA 133
+A T+ + L++ G+ND+ + ++ TP +D+LA G+ L + Y P CTP+R+
Sbjct: 191 SATTQQQNILLILADDMGYNDVGYQSTDLGAATPTLDSLASRGVELASYYTLPSCTPARS 250
Query: 134 SLMTGKYPIHTGM--QGPPIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SL T Y + GM GP + + G+PL + E L+ GY+T
Sbjct: 251 SLFTANYAANNGMGYDGPGTYVIDSSYGLPLGATLMSEVLKGRGYAT------------- 297
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
+ Y E++ N SY D + R+ ++G + + EY+TDL
Sbjct: 298 -HIETYFSHEAYGTDPNNGTSYCDLM------RSNSVDG------VKVGECYLDEYSTDL 344
Query: 251 FTKEAVQLIEDQPV--DKPLFLYLAHLAAHA---GNAGKHLEAPQET-INQFQYITDP-- 302
+ + V ++ + D PLFLY A A HA + APQE + D
Sbjct: 345 YARRLVDVVGEHATTSDSPLFLYFAAQAPHAPLEEPPPMNATAPQEAMLAALVGAGDSWG 404
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
RRT+A MV LD++V V+ AL +L+N+++ F SDNG R+ GSN+
Sbjct: 405 KRRTFAKMVMNLDNNVRKVVDALAAAKLLQNTVVAFASDNGGCP---------RDGGSNW 455
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL--QMMHISDWLPTLYTAAGGDTSRLPL 420
P RG K + +EGGV+VPA LWS +Q+ ++ + H++DW PTL +AG
Sbjct: 456 PLRGTKFSQYEGGVRVPAFLWSSTLQKRGAGTVFEGLFHVADWFPTLVASAG--VRATVG 513
Query: 421 NIDGLDQWSSLLLNTPSR-RNSNIDGLDQWS 450
IDG++Q+ +L T + R+ + L++W+
Sbjct: 514 KIDGVNQYDALFDATQNAPRDEVLIHLNKWT 544
>gi|149190919|ref|ZP_01869182.1| sulfatase, putative [Vibrio shilonii AK1]
gi|148835274|gb|EDL52248.1| sulfatase, putative [Vibrio shilonii AK1]
Length = 500
Score = 152 bits (383), Expect = 6e-34, Method: Compositional matrix adjust.
Identities = 126/421 (29%), Positives = 192/421 (45%), Gaps = 50/421 (11%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG++P+ TGM + G+ P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFITGQFPVRTGMHTVGLPGS-PVGLNAETPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN--GVISY-------YDHI 216
LPE L+ LGY+T GK HLG R E P GF+ ++G+L + Y D
Sbjct: 113 LPEILKTLGYTTGQFGKNHLG-DRDEVLPTMNGFDEYWGWLYHLNAMEYTEDPDWPKDEA 171
Query: 217 LSDQYSRTVELNGHDMRRNLSTAWD---TVGEYAT--DLFTKEAVQLIEDQ-PVDKPLFL 270
R V + D + + D T+ T D K A+ IE+ DKP F
Sbjct: 172 FQKFAPRNVIHSKSDGKGGQTIKDDGPLTIERMRTLDDEVNKHAINFIENAVKQDKPFFT 231
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+ G+ HL E + + ++K LDD VG +++ L+ G+
Sbjct: 232 W---YCPSRGHVWTHLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMLAKLEELGV 283
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+N+I+IF +DNG + + + G P+ G K + WEGGV+ P ++ P
Sbjct: 284 ADNTIVIFTADNGPEIMTWPD-------GGMTPFHGEKGSTWEGGVRAPLLVSWPGKIPA 336
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWS 450
++ M DWLPTL AAGG + + G + + +++DG +Q S
Sbjct: 337 GKIHNGMFDGMDWLPTLVEAAGGPSDLKQQMLTGYEGFK-----------AHLDGYNQIS 385
Query: 451 SLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNF 510
L PS R ++ E+ R AVR+ WK Q + G+ G PLL F
Sbjct: 386 MLTELAPSNRKEIIYY--ERDRLQAVRVGDWKAHFVVQNH----GWSGPKEELNAPLL-F 438
Query: 511 N 511
N
Sbjct: 439 N 439
>gi|86147009|ref|ZP_01065327.1| sulfatase family protein [Vibrio sp. MED222]
gi|85835259|gb|EAQ53399.1| sulfatase family protein [Vibrio sp. MED222]
Length = 500
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 127/428 (29%), Positives = 193/428 (45%), Gaps = 64/428 (14%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG+ P+ TGM + G P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHSVGLPGG-PVGLSADTPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL----------------NGV 209
LPE L+ +GY T GK HLG R E+ P GF+ ++G+L +G
Sbjct: 113 LPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDGS 171
Query: 210 ISYYD-----HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
+ + + SD + +T+E +G + T D V ++A + F + AV+
Sbjct: 172 LDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDEVNKHAIN-FIERAVE------ 224
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
DKP F + H HL E + + ++K LDD VG +++
Sbjct: 225 ADKPFFTWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMMA 276
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
++ G+ +N+IIIF +DNG + + + G PY G K T WEGGV+ PA++
Sbjct: 277 KMEELGIADNTIIIFTADNGPEIMTWPD-------GGMTPYHGEKGTTWEGGVRAPALVS 329
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
P V + DWLPTL AAGG T + G D + +++
Sbjct: 330 WPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKLLKGHDGF-----------KAHL 378
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
DG +Q L S R + E+ + AVR+ WK Q + G+ G
Sbjct: 379 DGYNQVDMLTEKGESNRKEIYYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEEL 432
Query: 504 KVPLLNFN 511
PLL FN
Sbjct: 433 NAPLL-FN 439
>gi|424864690|ref|ZP_18288593.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
gi|400759436|gb|EJP73618.1| steryl-sulfatase [SAR86 cluster bacterium SAR86B]
Length = 549
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 117/379 (30%), Positives = 182/379 (48%), Gaps = 63/379 (16%)
Query: 92 GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G+ND+S H + T NIDALA +GI+ YA C PSRAS+MTGKY G
Sbjct: 76 GYNDISIHNGGAADGTLKTKNIDALADSGILFTRGYAANATCAPSRASIMTGKYSTSFGY 135
Query: 147 QGPPI---------WGAEP----------------------RGVPLTERFLPEYLRELGY 175
+ PI W AE +G+P + + E LR+ GY
Sbjct: 136 EFTPIPALGRTVLGWLAEEDDSELKQRIDRDVVSNMPAFMDQGMPTEQITIAEILRDAGY 195
Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH--ILSDQYSRTVE-----LN 228
T IGKWHLG + PL +GF+ G + DH +++ ++ +++ +
Sbjct: 196 YTAHIGKWHLGH-GQGMDPLSQGFQDSLGLDGPLYLPEDHPDVVNAKFDSSIDKMIWSIG 254
Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
+ N + + +Y TD +T EA+++IE+ ++P FLYL+H A H L+A
Sbjct: 255 QYSATFNGGSVF-APDKYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQA 307
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
+ Q ++ N + Y+ M+ LD SVG ++ L+ + ++IIF SDNG
Sbjct: 308 LRTDYEQMSHMHGHNLQVYSGMINALDRSVGKIVEKLKDLNIYGKTLIIFTSDNGG--AN 365
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
Y E + N PYRG K + +EGG++VP I+ W +I + + + H D PT+
Sbjct: 366 YIELDDI-----NKPYRGWKISFFEGGIRVPYIVSWPSEISPGKKSDIAVHHF-DIFPTI 419
Query: 408 YTAAGGDTSRLPLNIDGLD 426
AAG +++ +DG D
Sbjct: 420 AKAAGIESAHA---LDGRD 435
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 27/92 (29%), Positives = 48/92 (52%), Gaps = 7/92 (7%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y TD +T EA+++IE+ ++P FLYL+H A H L+A + Q ++ N
Sbjct: 270 KYVTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRTDYEQMSHMHGHNL 323
Query: 74 RTYAALTKSTTLTL-LIVYGWNDLSFHGSNEI 104
+ Y+ + + ++ IV DL+ +G I
Sbjct: 324 QVYSGMINALDRSVGKIVEKLKDLNIYGKTLI 355
>gi|336413800|ref|ZP_08594149.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
gi|335934817|gb|EGM96800.1| hypothetical protein HMPREF1017_01257 [Bacteroides ovatus
3_8_47FAA]
Length = 465
Score = 152 bits (383), Expect = 7e-34, Method: Compositional matrix adjust.
Identities = 115/350 (32%), Positives = 163/350 (46%), Gaps = 38/350 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G DL +G +I TPNID LA G++ N Y+ V PSR SL+TGK+ H ++G
Sbjct: 40 GIGDLGCYGQKKIKTPNIDKLAEKGVLFTNHYSGSTVSAPSRCSLLTGKHTGHAYIRGNK 99
Query: 151 IWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
+E +P E + E L+ GY+T +GKW LG +P+ +GF+ FGYL
Sbjct: 100 GIKSEEGFFDLHLPSDEVTVAEILKRKGYATMCVGKWGLGGPHTTGSPVRKGFDYFFGYL 159
Query: 207 N--GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ YY L + + V LN L + E D K A
Sbjct: 160 SQGAAHRYYPEYLYENEDK-VMLNKKVYSHFL------IMEKGLDFIRKNA--------- 203
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR---------RTYAAMVKKLD 315
P F Y A HA L ++ + YI + + YA+MV ++D
Sbjct: 204 GHPFFAYFAITPPHADLDYPDLSQYEDAFPETPYINNKQKGFKTQMKPKAAYASMVSEID 263
Query: 316 DSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGG 375
+VG +I L+ KG+LEN+IIIF SDNG V E + SN PYRG K L+EGG
Sbjct: 264 KNVGQIIQLLKEKGILENTIIIFSSDNGVHCVGGHEPDFFD---SNGPYRGYKRDLYEGG 320
Query: 376 VKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
V+ P I+ P++ + R + D+LPT+ G D + IDG+
Sbjct: 321 VRAPFIVSWPKMIKKKRTVEHITTFWDFLPTVTELVGADMVK---GIDGI 367
>gi|407071736|ref|ZP_11102574.1| arylsulfatase [Vibrio cyclitrophicus ZF14]
Length = 500
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 119/402 (29%), Positives = 178/402 (44%), Gaps = 47/402 (11%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG+ P+ TGM + G P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLSADTPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
LPE L+ +GY T GK HLG R EY P GF+ ++G+L ++ ++ + +
Sbjct: 113 LPEVLKTMGYVTGQFGKNHLG-DRDEYLPTMHGFDEYWGWLY-HLNAMEYTEDPDWPKDG 170
Query: 226 ELNGHDMRRNLSTAWDTVGEYAT---------------DLFTKEAVQLIEDQ-PVDKPLF 269
L+ R + + D G + D K A+ IE DKP F
Sbjct: 171 SLDAFAPRNVIYSKSDGKGGQSIEDDGALNIERMRTLDDEVNKHAINFIERAVEADKPFF 230
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
+ H HL E + + ++K LDD VG +++ + G
Sbjct: 231 TWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMLAKIDELG 282
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +N+IIIF SDNG + + + G P+ G K T WEGGV+ PA++ P
Sbjct: 283 IADNTIIIFTSDNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPALVSWPGKIP 335
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V + DWLPTL AAGG T + G D + +++DG +Q
Sbjct: 336 AGSVGNGIFDGMDWLPTLVAAAGGPTDLKEKMLKGHDGF-----------KAHLDGYNQV 384
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENG 491
L S RN + E+ + AVR+ WK Q G
Sbjct: 385 DMLTEKGESNRNEIYYY--ERDKLQAVRVGDWKAHFIVQNRG 424
>gi|149177395|ref|ZP_01855999.1| arylsulfatase A [Planctomyces maris DSM 8797]
gi|148843728|gb|EDL58087.1| arylsulfatase A [Planctomyces maris DSM 8797]
Length = 474
Score = 151 bits (382), Expect = 8e-34, Method: Compositional matrix adjust.
Identities = 124/384 (32%), Positives = 188/384 (48%), Gaps = 43/384 (11%)
Query: 75 TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
+ A L + ++++Y G+ DL G I TP +D +A G+ Y A PVCT
Sbjct: 27 SVADLNAAPGPNVIVIYADDLGYGDLGCFGHPTIKTPALDQMAAEGMKFTQFYSAAPVCT 86
Query: 130 PSRASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLG 186
PSRA+L+TG+YPI +GM ++ G+P +E L E L+ GY T +GKWHLG
Sbjct: 87 PSRAALLTGRYPIRSGMCSDKRRVLFPNSGGGIPASEVTLAEALKAAGYKTACVGKWHLG 146
Query: 187 FFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT-V 243
++ P GF+S+FG Y N + D + ++ R++ L N+ +T V
Sbjct: 147 HL-PQFLPTNNGFDSYFGIPYSNDM----DRVADRKHGRSIFLKPEVKFWNVPLMRNTEV 201
Query: 244 GEYATD------LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
E D +T+EA++LI+ Q +P F+YLAH H P +FQ
Sbjct: 202 VELPADQTTITKRYTEEAIKLIQ-QNKQQPFFIYLAHNMPH---------VPLFRSPEFQ 251
Query: 298 YITDPNRR-TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
D +RR Y +++++D SVG V+ L+ +G+ +N+I+ F SDNG P + + E
Sbjct: 252 ---DKSRRGLYGDVIEEIDWSVGEVLKELKTQGLDQNTIVWFCSDNG-PWLIFNEQG--- 304
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
GS R K + WEGG++ P I W P + VS ++ D T AGG
Sbjct: 305 --GSAGLLRDGKGSTWEGGMREPTIAWWPGHIPSGSVSQELGSTMDIYTTSIKLAGGSVP 362
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRN 440
+ +DG+D LL S R
Sbjct: 363 TDQV-VDGVDLTPVLLKKENSPRE 385
>gi|440716877|ref|ZP_20897380.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
gi|436438070|gb|ELP31646.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SWK14]
Length = 485
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 130/455 (28%), Positives = 193/455 (42%), Gaps = 94/455 (20%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
LLIV G+NDL G N I TP +D LA G L N Y A P CTPSRASL+TG+YP
Sbjct: 33 LLIVSDDQGYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQ 91
Query: 143 HTGM-----QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
G+ P +G G+ E LP LR GY + GKW L
Sbjct: 92 RNGIYDMIRNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAVLRPAGYKSGIFGKWDL 151
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G +R P RGF+ +G++N I Y+ H ++Y M RNL G
Sbjct: 152 GALQR-MLPTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGT 201
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDP-- 302
Y T LF +EA++ +++ ++P FLY+ A H + + + + + +QF+ + P
Sbjct: 202 YCTYLFQREALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVE 261
Query: 303 ----------------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMS 340
RR Y A V +D ++G ++ L+ K ML+N+I++F S
Sbjct: 262 VETRVTDRYRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDNTIVVFFS 321
Query: 341 DNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMH 399
DNG N P RG K WEGG++VP ++ W V+ + +
Sbjct: 322 DNGG-----------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNNEFLT 370
Query: 400 ISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSR 459
+ LP+ AAG + P +DG + W +L S
Sbjct: 371 SLELLPSFAIAAG--------------------VEPPP--GVVLDGFNWWPTLRGEAESP 408
Query: 460 RNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMD 494
R + +++ R+ WK V ++ G D
Sbjct: 409 RKEMFW---KRRDQIGARVGQWKWVQMGEQGGLFD 440
>gi|432851909|ref|XP_004067102.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 1
[Oryzias latipes]
Length = 525
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 52/370 (14%)
Query: 80 TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
T S + ++++ GW DL G TPN+DA+A GI+ + Y A P+C+PSRA+L+
Sbjct: 28 TSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALL 87
Query: 137 TGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
TG+ PI G G P+ G+ E LP+ L+E GY K +GKWHLG R
Sbjct: 88 TGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLGH-R 146
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL---------SDQYSRTVELNGHDMRRNLSTAW 240
+Y PL GF+ FG N Y++ + S+ R E D + S
Sbjct: 147 PQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGESNLT 206
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
E D +++A +P FLY A A H+ ++
Sbjct: 207 QMYLEAGLDFISRQA-------EAKRPFFLYWAADATHS-----------PVYASKPFLG 248
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
R Y V +LD SVG +++ LQR G+ N+ + F SDNGA + S GS
Sbjct: 249 KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALM-----SGPNEGGS 303
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG----GDTS 416
N P+ K T +EGG++ PAI W P + VS Q ++ D T AG GD
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRV 363
Query: 417 RLPLNIDGLD 426
+DGLD
Sbjct: 364 -----LDGLD 368
>gi|340367689|ref|XP_003382386.1| PREDICTED: arylsulfatase B-like [Amphimedon queenslandica]
Length = 493
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 108/363 (29%), Positives = 166/363 (45%), Gaps = 42/363 (11%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRAS 134
+A + ++V +G+ D+SF I +PN + LA G+IL+ Y C+PSRAS
Sbjct: 16 SAFSTKPNFVFVLVDDWGYADVSFRNP-AIHSPNFEKLAKEGLILDRHYVFKYCSPSRAS 74
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
+TG++P H PP G L +P L+ Y T IGKWH G ++ Y P
Sbjct: 75 FLTGRWPHHAHQWNPP--EDALVGANLKMTMIPAKLKLARYKTHMIGKWHEGLYKEAYLP 132
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVEL---NGHDMRRNLSTAWDTVGEYATDLF 251
+ RGF++ G+L G ++ ++ Q + +G D R G Y +
Sbjct: 133 INRGFDTMSGFLGGGENH----MNQQVGCATDFWKNDGPDSRN---------GSYDAYTY 179
Query: 252 TKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF-QYITDPNRRTYAAM 310
+ +I + P FLYL H L+AP E +N + T NRR Y AM
Sbjct: 180 RDDLTDIITNHNPSDPFFLYLPLHNVHT-----PLQAPDEWLNIYPANSTCQNRRIYQAM 234
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
V D+ G ++ LQ GM +N++++ +DNG GSNYP +G K T
Sbjct: 235 VSVADNVTGHLVKLLQDNGMWDNTLLVISADNGGAPCG----------GSNYPLKGSKGT 284
Query: 371 LWEGGVKVPAILWSPQIQQN--PRVSLQMMHISDWLPTLYTAAG---GDTSRLPLNIDGL 425
+EGGV+ A + + N + + +HI+DW T AG D+ +DG+
Sbjct: 285 FYEGGVRSLAFVNGGLLPTNRKGKSTQGFIHIADWYTTFCQLAGIDHDDSGSGKFPVDGM 344
Query: 426 DQW 428
+ W
Sbjct: 345 NVW 347
>gi|84386700|ref|ZP_00989726.1| sulfatase family protein [Vibrio splendidus 12B01]
gi|84378506|gb|EAP95363.1| sulfatase family protein [Vibrio splendidus 12B01]
Length = 500
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 126/428 (29%), Positives = 193/428 (45%), Gaps = 64/428 (14%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG+ P+ TGM + G P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHSVGLPGG-PVGLSADTPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL----------------NGV 209
LPE L+ +GY T GK HLG R E+ P GF+ ++G+L +G
Sbjct: 113 LPEILKTMGYVTGQFGKNHLGD-RDEFLPTMHGFDEYWGWLYHLNAMEYTEDPDWPKDGS 171
Query: 210 ISYYD-----HILSD-QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
+ + + SD + +T+E +G + T D V ++A + F + AV+
Sbjct: 172 LDAFAPRNVIYARSDGKGGQTIEDDGALSIERMRTLDDEVNKHAIN-FIERAVE------ 224
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
DKP F + H HL E + + ++K LDD VG +++
Sbjct: 225 ADKPFFTWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLDDHVGEMMA 276
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
++ G+ +N+IIIF +DNG + + + G P+ G K T WEGGV+ PA++
Sbjct: 277 KMEELGIADNTIIIFTADNGPEIMTWPD-------GGMTPFHGEKGTTWEGGVRAPALVS 329
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
P V + DWLPTL AAGG T + G D + +++
Sbjct: 330 WPGKIPAGTVGNGIFDGMDWLPTLVAAAGGPTDLKEKMLKGHDGF-----------KAHL 378
Query: 444 DGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSN 503
DG +Q L S R + E+ + AVR+ WK Q + G+ G
Sbjct: 379 DGYNQVDMLTEKGESNRKEIYYY--ERDKLQAVRIGDWKAHFVVQNH----GWSGPKEEL 432
Query: 504 KVPLLNFN 511
PLL FN
Sbjct: 433 NAPLL-FN 439
>gi|406834305|ref|ZP_11093899.1| arylsulfatase [Schlesneria paludicola DSM 18645]
Length = 592
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 142/496 (28%), Positives = 217/496 (43%), Gaps = 91/496 (18%)
Query: 63 NQFQYITDPNRRTYAALTKSTTLT-------LLIVYG-----WNDLSFH-GSNEIPTPNI 109
N F + + P AL + T T +L++ G WN ++H G TPN+
Sbjct: 62 NPFCFTSKPQNERVLALANAQTPTGSSKKPNILVIMGDDVGIWNIGAYHRGMMAGRTPNL 121
Query: 110 DALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEY 169
D LA G++ + YA+ CT RA+ +TG+ P TGM G+ G+P +
Sbjct: 122 DKLAAQGMLFTDYYAEASCTAGRANFITGQLPKRTGMTTVGQAGSA-IGLPAQAPTIATI 180
Query: 170 LRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV--EL 227
L+ LGY+T GK HLG R E+ P GF+ FGYL Y+ + D + + EL
Sbjct: 181 LKSLGYATGQFGKNHLG-DRNEFLPTVHGFDEFFGYL-----YHLDAMEDPFHKNYPPEL 234
Query: 228 NGHDMRRNLSTAWDTVGEYATD--LFTKEAVQLIED---------QPVD----------- 265
RN+ +W T + AT+ + K Q IED + VD
Sbjct: 235 RDSVGPRNMLHSWATDKDDATEQPRWGKIGKQKIEDAGTLPPKRMETVDDEILEHTLKFI 294
Query: 266 -------KPLFLYL----AHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
KP F++L H+ H +++ PQ T A + +L
Sbjct: 295 DKAKKDEKPFFVWLNPTRIHIVTHLSEKYENMRTPQN-----------GWATVEAGMAQL 343
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLW 372
DD VG+++ L+ +GM +N+I++F +DNG T N+ W G P+ G K T+
Sbjct: 344 DDIVGSMMKKLKDEGMEDNTIVMFTTDNG--------TENF-TWPDGGQTPFAGGKGTVM 394
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG + PA++ P +V ++ DWLPT AAG P +D L + L
Sbjct: 395 EGGFRAPAMVRWPNHIPAGKVENGIISGMDWLPTFVAAAGN-----PKIVDELKEGKKL- 448
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
+ ++DG +Q L TPS R+ + + AVR+D +K Q +
Sbjct: 449 --GDATYKVHLDGYNQLDLLTGKTPSARHEIYYFAE--GTLGAVRIDDFKYRFIDQPS-- 502
Query: 493 MDGYYGQTRSNKVPLL 508
G++G T VP+L
Sbjct: 503 --GWFGGTVKVDVPIL 516
>gi|305665421|ref|YP_003861708.1| arylsulfatase A [Maribacter sp. HTCC2170]
gi|88710176|gb|EAR02408.1| arylsulfatase A [Maribacter sp. HTCC2170]
Length = 469
Score = 151 bits (382), Expect = 9e-34, Method: Compositional matrix adjust.
Identities = 117/380 (30%), Positives = 184/380 (48%), Gaps = 45/380 (11%)
Query: 72 NRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQP 126
N + T +I++ G+ DLS +G+ I TPN+D +A+ G N Y
Sbjct: 18 NVDIFGQKANQQTPNFIIIFADDLGYGDLSTYGNPTIHTPNLDRMAHEGQKWTNFYVGAS 77
Query: 127 VCTPSRASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
VCTPSRA+L+TG+ P+ +GM ++ G+P E L E L+ GYST AIGKW
Sbjct: 78 VCTPSRAALLTGRLPVRSGMASSKTRVLFPNSKNGLPANEITLAEQLKTAGYSTAAIGKW 137
Query: 184 HLGFFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMR---RNLST 238
HLG + +Y P GF+ +FG Y N + D+I Y + N D++ N+
Sbjct: 138 HLGH-KEQYLPTNNGFDYYFGIPYSNDM----DYIGKANYKNLAQNNPSDIKIENYNVPL 192
Query: 239 AWDTV-------GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
+T T +T+E ++ I++ + P F+YLAH H P
Sbjct: 193 LQNTKIIERPTNQNTITKRYTQETIKFIKENKRE-PFFVYLAHSLPHI---------PLF 242
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
T +F+ Y +V+++D VG +++AL+ +G+ +N+I++F SDNG P + ++
Sbjct: 243 TSKEFE--GHSKNGIYGDVVEEIDHGVGEILNALEEEGLDQNTIVVFTSDNG-PWLPFK- 298
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
+ GS R K + WEGG++ P I W P + + V+ M D T T A
Sbjct: 299 ----THGGSAGLLRAGKGSTWEGGMREPGIFWGPGLVKKGLVT-DMGSTMDLFTTFSTMA 353
Query: 412 GGDTSRLPLNIDGLDQWSSL 431
G + + IDG D +L
Sbjct: 354 GVEIPNDRI-IDGHDLSKTL 372
>gi|440715400|ref|ZP_20895947.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
gi|436439744|gb|ELP33158.1| iduronate-2-sulfatase [Rhodopirellula baltica SWK14]
Length = 1004
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 141/484 (29%), Positives = 211/484 (43%), Gaps = 68/484 (14%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
++ R A TK + +L G+ DLS G+ + TP ID +A G L + Y A
Sbjct: 549 VSPAGREKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 608
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
PVCTPSRA LMTG YP M +G +P+G+ E + E L+ GY T G
Sbjct: 609 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 668
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
KWHLG + E+ P +GF+ FG Y H + + R + + + +
Sbjct: 669 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPSLPLLQNDTVI 721
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
D ++ T T++AV IE D+P FLYL H HA G A + A
Sbjct: 722 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 780
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
++ Y T N + + ++D SVG ++ AL+ G+ E ++++F SDNG P
Sbjct: 781 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 835
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
+ + Y + G RG K T +EGG++ P ++ W QI + ++M D LPT
Sbjct: 836 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 889
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
AG TP+ R IDG D W +L T + ++ +
Sbjct: 890 AKLAGA--------------------ATPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 926
Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
+ AAVR WKL + NG Y KV ++ N V K + L+
Sbjct: 927 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQRQLKDF 982
Query: 526 SQNI 529
+ +I
Sbjct: 983 AADI 986
Score = 106 bits (265), Expect = 3e-20, Method: Compositional matrix adjust.
Identities = 119/459 (25%), Positives = 185/459 (40%), Gaps = 79/459 (17%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
A + + L+ V G+ DL +G+ ++ TPNID LA G + + A VCTPSR
Sbjct: 27 AAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPSRYG 86
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGFFRR 190
L+TG+YP+ M G IWG P L + + + + GY+T +GKWHLGF
Sbjct: 87 LLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGFKEE 145
Query: 191 EYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR--- 233
P GF+ +FG +N Y D I S + G +
Sbjct: 146 PCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVSPTP 205
Query: 234 -----------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHLAAH 278
S A Y + L T+ AV+ I ++ D+P FLY A H
Sbjct: 206 MFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFFLYFATPNIH 264
Query: 279 AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIF 338
P +F+ + Y V +LD VG ++ +L+ G+ +N++++F
Sbjct: 265 ---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTLVLF 313
Query: 339 MSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM 397
SDNGA + N G K +WEGG +VP I W +I+
Sbjct: 314 TSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK--------- 364
Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLLLNT 456
AG + +L +D +S+L PS D ++ +LL +
Sbjct: 365 -------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALLDDP 409
Query: 457 PSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+ L+ + R A+R W L +G + +G +G
Sbjct: 410 NEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 447
>gi|284035278|ref|YP_003385208.1| sulfatase [Spirosoma linguale DSM 74]
gi|283814571|gb|ADB36409.1| sulfatase [Spirosoma linguale DSM 74]
Length = 474
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 120/363 (33%), Positives = 179/363 (49%), Gaps = 50/363 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ +G N I TPNID +A GI YA VC PSR++LMTG++ HT ++G
Sbjct: 37 GYGDIGINGQNLIKTPNIDRMAAQGIQFRQFYAGTTVCAPSRSALMTGQHTGHTYIRGNR 96
Query: 151 IWGAEPRG-VPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-L 206
+P G P+ + + E L++ GY+T A GKW LG E PL +GF+ +GY
Sbjct: 97 EL--QPEGQFPIADSVMTMAEILKQAGYTTAAFGKWGLGPVGSEGDPLKQGFQRFYGYNC 154
Query: 207 NGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
G+ YY + L D + V +NL+ E+A DL K+A++ ++ +
Sbjct: 155 QGLAHRYYPNHLWDNDKKVVLA----ANKNLTQ----TSEFAPDLIQKQALRFLDKRTGS 206
Query: 266 KPLFLYLAHLAAHA-----------GNAGKHLEAPQETINQFQYITDPNRRT-------- 306
+P FL+L + HA GK E P + PN +T
Sbjct: 207 QPFFLFLPYTLPHAELVPPDDSLFASYKGKFTEKPYTGAD-----YGPNAKTGGYASQAY 261
Query: 307 ----YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
YAAMV +LD VG V++ L+ KG+ +N+++IF SDNG P VE + N SN
Sbjct: 262 PHAAYAAMVARLDLYVGQVLAKLKAKGLDKNTLVIFSSDNG-PHVEGGGDPTFFN--SNG 318
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI 422
+RGVK L+EGG++ P + P + + S + D LPT AG +++
Sbjct: 319 GFRGVKRDLYEGGIREPFVARWPGVIKPGTQSDFVGAFWDLLPTFAELAG---AKVAART 375
Query: 423 DGL 425
DG+
Sbjct: 376 DGI 378
>gi|432851911|ref|XP_004067103.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like isoform 2
[Oryzias latipes]
Length = 523
Score = 151 bits (382), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 118/370 (31%), Positives = 167/370 (45%), Gaps = 52/370 (14%)
Query: 80 TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
T S + ++++ GW DL G TPN+DA+A GI+ + Y A P+C+PSRA+L+
Sbjct: 28 TSSPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAQGILFPDFYTANPLCSPSRAALL 87
Query: 137 TGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
TG+ PI G G P+ G+ E LP+ L+E GY K +GKWHLG R
Sbjct: 88 TGRLPIRNGFYTTNGHARNAYTPQEIVGGISKDEILLPQLLKEKGYVNKIVGKWHLGH-R 146
Query: 190 REYTPLYRGFESHFGYLNGVISYYDHIL---------SDQYSRTVELNGHDMRRNLSTAW 240
+Y PL GF+ FG N Y++ + S+ R E D + S
Sbjct: 147 PQYLPLENGFDEWFGAPNCHFGPYNNTVRPNIPVYNNSEMLGRYFEEFKIDKKTGESNLT 206
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
E D +++A +P FLY A A H+ ++
Sbjct: 207 QMYLEAGLDFISRQA-------EAKRPFFLYWAADATHS-----------PVYASKPFLG 248
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
R Y V +LD SVG +++ LQR G+ N+ + F SDNGA + S GS
Sbjct: 249 KSQRGRYGDAVMELDYSVGRILTKLQRLGIDNNTFVFFTSDNGAALM-----SGPNEGGS 303
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG----GDTS 416
N P+ K T +EGG++ PAI W P + VS Q ++ D T AG GD
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGRIKGGTVSFQPANVMDLFTTSLALAGIGPPGDRV 363
Query: 417 RLPLNIDGLD 426
+DGLD
Sbjct: 364 -----LDGLD 368
>gi|305667515|ref|YP_003863802.1| N-acetylgalactosamine 6-sulfatase [Maribacter sp. HTCC2170]
gi|88709563|gb|EAR01796.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Maribacter sp. HTCC2170]
Length = 596
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/349 (32%), Positives = 167/349 (47%), Gaps = 54/349 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW DLSF+G+ + TPNIDA+A NG N Y QPVC+P+RA L+TGKY G+
Sbjct: 48 GWGDLSFNGNTNLSTPNIDAIAKNGASFQNFYVQPVCSPTRAELLTGKYAARLGVYSTST 107
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G E + E ++ GY T A GKWH G + Y P RGF+ ++G+ +G
Sbjct: 108 GGERFNS---KETTIAEIFKKAGYKTTAYGKWHSG-MQPPYHPNSRGFDDYYGFTSGHWG 163
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-YATDLFTKEAVQLIEDQPVDKPLFL 270
Y +S +E NG ++ GE + D T + + I + + P FL
Sbjct: 164 NY-------FSPMLEHNGEIVK----------GEGFLVDDLTNKGLDFITENK-NNPFFL 205
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR------------RTYAAMVKKLDDSV 318
YL + H+ ++ P E +F+ R R AMV+ +D ++
Sbjct: 206 YLPYNTPHS-----PMQVPNEYWERFEKKKLDMRYQGNEEESENFTRAALAMVENIDFNM 260
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G + + L+ G+ EN+II+++SDNG W N RG K + EGGV+
Sbjct: 261 GRLTNKLKELGLEENTIIVYLSDNGP-----------NGWRWNGGMRGRKGSTDEGGVRS 309
Query: 379 P-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P I W I +N ++S Q+ D LPTL + AG + + +IDG D
Sbjct: 310 PFFIQWKNTIPKNKKIS-QIAGAIDILPTLTSLAGINQPTIK-SIDGKD 356
>gi|71280931|ref|YP_269082.1| sulfatase [Colwellia psychrerythraea 34H]
gi|71146671|gb|AAZ27144.1| sulfatase family protein [Colwellia psychrerythraea 34H]
Length = 492
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 122/373 (32%), Positives = 179/373 (47%), Gaps = 45/373 (12%)
Query: 66 QYITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA 124
Q + P++ T +K + LL+ +G DLS +GSN TPNID LA +G+ +N YA
Sbjct: 19 QAVATPDKST----SKPNVVMLLVDDFGRQDLSTYGSNFYETPNIDQLAADGMKFDNAYA 74
Query: 125 -QPVCTPSRASLMTGKYPIHTGM-QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGK 182
P C PSR ++ +G YP G+ QG + +PL+ E+L+E GY T IGK
Sbjct: 75 AHPRCVPSRVAIFSGSYPTRYGVPQGERV---GKHHLPLSAVTFGEHLKEAGYQTGYIGK 131
Query: 183 WHLGFFRREYTPLYRGFESHF--GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
WHLG + P +GF+S G+ SYY + T +++ + +
Sbjct: 132 WHLG--KEGGDPTKQGFDSSIMAGHWGAPPSYY-------FPYT-KMSKSGKNKGFAKVE 181
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA--------------------- 279
+ EY TD T EA+ IE Q D+P L LAH A H
Sbjct: 182 GSEEEYLTDRLTDEALTFIE-QKKDQPFLLVLAHYAVHTPIEGKPALVKKYKTKMKKLGI 240
Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
NAG +A + + T N YAAMV+ +D SVG + L+R G+ +N+III
Sbjct: 241 ANAGPKSDADLIKDSTGYHKTIQNNPDYAAMVESVDISVGRIEQQLKRLGLEDNTIIILT 300
Query: 340 SDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMH 399
SD+G + + SN SN PYR K +++GG +VP I+ P+ + +S +
Sbjct: 301 SDHGGLSSRGLK-SNRVLATSNNPYRHGKGWIYDGGTRVPLIVKWPEKVKAGSISQVQVT 359
Query: 400 ISDWLPTLYTAAG 412
+D PT+ AG
Sbjct: 360 GTDHYPTILQMAG 372
>gi|410638700|ref|ZP_11349253.1| sulfatase [Glaciecola lipolytica E3]
gi|410141228|dbj|GAC16458.1| sulfatase [Glaciecola lipolytica E3]
Length = 522
Score = 151 bits (381), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 133/446 (29%), Positives = 204/446 (45%), Gaps = 69/446 (15%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
AA + L+ G D+ +G I TPN+ +A G++ + YA PVC PSRA+L
Sbjct: 40 AAKKPNVIYILVDDLGIGDIEPYGQELINTPNLQRMADQGMVFSQHYAGNPVCAPSRAAL 99
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERF-----------LPEYLRELGYSTKAIGKWH 184
MTG++ H ++G E G E F + + +++ GY+T IGKW
Sbjct: 100 MTGQHSGHNQIRG----NFELGGFADEEEFGQMPLQPGTETVAKLMKKAGYTTALIGKWG 155
Query: 185 LGFFRREYTPLYRGFESHFGYLNGVIS---YYDHI--------LSDQYSRTVELNGHDMR 233
LG P +GF+ FGYL+ + Y H+ L+++Y + G
Sbjct: 156 LGGPGSYGVPTKQGFDYFFGYLDQKQAHNHYPTHLWQNETWFPLNNEYLDPHKGLGKHDD 215
Query: 234 RNLSTAWDTV--GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
N +++ G++A T EA+ I D D+P FLYL++ A HA L+AP E
Sbjct: 216 PNDEASYEKFKRGDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDE 269
Query: 292 TINQFQYITD------------PN---RRTYAAMVKKLDDSVGTVISALQRKGMLENSII 336
I ++ + D P+ R T AAM+ +D+ VG V+ L++ G+ EN+++
Sbjct: 270 EIAKYNHFEDTPYDGVADGGYVPSFRPRATRAAMISHIDEGVGLVLDKLKKLGLDENTLV 329
Query: 337 IFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ 396
IF SDNG P+ E ++ + SN YRG K L++GG+++P I P +VS
Sbjct: 330 IFSSDNG-PSFE--GGADLEFFDSNAQYRGFKRDLYDGGIRMPTIARWPTKVPAGKVSDH 386
Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLN- 455
+ D LPTL P NIDG+ +LL DG Q SL
Sbjct: 387 VSAFWDVLPTLADLV---EIEAPKNIDGISFLPTLLAE---------DGQQQHDSLYWEF 434
Query: 456 ---TPSRRNSVLINIDEKKRTAAVRL 478
S + +V I ++ A+R
Sbjct: 435 HHWNGSHKQAVRIQAEKNADWKAIRF 460
Score = 39.7 bits (91), Expect = 5.3, Method: Compositional matrix adjust.
Identities = 22/58 (37%), Positives = 32/58 (55%), Gaps = 6/58 (10%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD 70
G++A T EA+ I D D+P FLYL++ A HA L+AP E I ++ + D
Sbjct: 228 GDFAQQRLTDEALSYI-DNKKDEPFFLYLSYAAPHAA-----LQAPDEEIAKYNHFED 279
>gi|332663783|ref|YP_004446571.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
gi|332332597|gb|AEE49698.1| sulfatase [Haliscomenobacter hydrossis DSM 1100]
Length = 550
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 145/486 (29%), Positives = 216/486 (44%), Gaps = 109/486 (22%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
A TK + L++V G++DL +GS EI TPNID LAY G+ L Y +C P+RASL
Sbjct: 25 AQTKKPNIILIMVDDMGYSDLGAYGS-EIQTPNIDKLAYEGLRLREFYNNSICAPTRASL 83
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFL-------PEYLRELGYSTKAIGKWHLGFF 188
+TG+YP G+ + G+P + +L E LR+ GYST GKWH+G
Sbjct: 84 ITGQYPHKAGVGYFDV----NLGIPPYQGYLNKESLTFGEVLRQAGYSTLLSGKWHVGND 139
Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHI---LSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
+ P RGF+ FG + G +Y+D L QY V L + ++ + ++
Sbjct: 140 SLHW-PKQRGFDRFFGVIGGGSNYFDAEPMPLGRQYP-VVILEDNQRQKPKANSY----- 192
Query: 246 YATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAH----------AGNAGKH--------- 285
Y TD T AVQ +++Q +DKP FLYLA+ A H A GK+
Sbjct: 193 YFTDEITNHAVQFLDEQNKMDKPFFLYLAYTAPHWPLQALPEDIAKYRGKYDTGWDALRK 252
Query: 286 ------------------------LEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVG 319
+E P F+ + YAAM+ + D VG
Sbjct: 253 ERLARQVKLGIITQAQADAAAARDVEVPDWDNLTFEEKQLWKAKMEVYAAMLDRADQGVG 312
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPT--VEYRETSNY-RNWG----------------- 359
V++ L+ +N++IIF+SDNGAP + +R+ ++ RN G
Sbjct: 313 KVLAKLKELKKDDNTLIIFISDNGAPAEDMAHRDKNHAGRNTGPVGTAGSFESQGKPWSY 372
Query: 360 -SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
SN P+R K+ +EGG+ P I W P + ++ H+ D PT Y AG +
Sbjct: 373 VSNTPFRSFKSFAYEGGISSPFIAWFPGKIKAGSIAKGTAHLIDLAPTFYELAG---LKY 429
Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
P +G ++T + + ++ S+LL T + I E+ AVR
Sbjct: 430 PSTYNG--------VSTNALQGKSL------SNLLFKTEAWERGEPI-FWERAGNRAVRQ 474
Query: 479 DSWKLV 484
WKLV
Sbjct: 475 GKWKLV 480
>gi|383114003|ref|ZP_09934770.1| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
gi|382948682|gb|EFS34104.2| hypothetical protein BSGG_4804 [Bacteroides sp. D2]
Length = 496
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 113/320 (35%), Positives = 164/320 (51%), Gaps = 40/320 (12%)
Query: 103 EIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPL 161
+I TPNID LA +G+ Y + +P+RA LMTGKY G G + G+P
Sbjct: 60 KIVTPNIDKLAASGVQCMQGYTSGHLSSPTRAGLMTGKYQQSFGFYG--LSTPHVGGIPQ 117
Query: 162 TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY 221
++ L EYL E GY+T IGKWHLG + R + P RGF++ FG++NG+ YYD ++ +
Sbjct: 118 DQKLLSEYLVENGYNTACIGKWHLGDYIRSH-PNNRGFQTFFGFINGLHDYYDPLVGGSW 176
Query: 222 SRTVELNGHDMRRNLSTAWD-----TVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLA 276
NG L+ D T EY+T +TK AV I+ D P FLYL + A
Sbjct: 177 DGV--YNG------LAFTLDNMEPVTEMEYSTYEYTKRAVDFIQKN-ADHPFFLYLPYNA 227
Query: 277 AHAGNAGKHLEAPQETINQF----QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLE 332
H+ L+AP+E I + Q I + AM LD VG V+ L++ G+ +
Sbjct: 228 IHS-----PLQAPEELIGELAINPQEIGKDD--IARAMTFALDQGVGKVVETLEQLGLRD 280
Query: 333 NSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPR 392
N+II ++SDNGA VEY + + +RG K + +EGG++VP I+ P
Sbjct: 281 NTIIFYLSDNGA--VEYSD---------KWEFRGRKGSYYEGGIRVPFIVSYPAKLAKGT 329
Query: 393 VSLQMMHISDWLPTLYTAAG 412
+ + + D PT+ AG
Sbjct: 330 IYNKPVMSIDIAPTVMELAG 349
>gi|254294928|ref|YP_003060951.1| sulfatase [Hirschia baltica ATCC 49814]
gi|254043459|gb|ACT60254.1| sulfatase [Hirschia baltica ATCC 49814]
Length = 487
Score = 150 bits (380), Expect = 1e-33, Method: Compositional matrix adjust.
Identities = 112/392 (28%), Positives = 188/392 (47%), Gaps = 41/392 (10%)
Query: 69 TDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-Q 125
T P + A K + ++ G+ D+S +G E TP ID+LA+ G+ Y+
Sbjct: 41 TSPASHSNIAQKKRPNIIFIMADDLGYADVSCYGRREYKTPAIDSLAHTGVKFTQAYSNS 100
Query: 126 PVCTPSRASLMTGKYP--IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKW 183
PVC+ +R S++TG+Y + G++ P + G+P T LP L++ GY T +GKW
Sbjct: 101 PVCSATRTSVITGRYQYRLPVGLEEPLTF--RDVGLPTTHPTLPGQLKKSGYQTALVGKW 158
Query: 184 HLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTV 243
HLG + Y PL G++ +G G + Y+ H D ++ +L ++ + +
Sbjct: 159 HLGALPK-YGPLKSGYDHFWGVRGGSVDYFTH---DSMAKQNDLWDGEVEIHET------ 208
Query: 244 GEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ-----FQ 297
EY T L ++++LIE ++P F+ L + A H G + + + Q
Sbjct: 209 -EYLTKLIGGKSIELIEKMNENEEPFFMSLHYTAPHWPWEGPEDKTESDRLKQGGPFSLL 267
Query: 298 YITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN 357
++ + TYAAMV +D+ +G +++ L G+ E ++I+F SDNG
Sbjct: 268 HLDGGSLETYAAMVTSMDNQIGRLLNKLDELGITEETLIVFTSDNGG-----------ER 316
Query: 358 WGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
+ +P+ G+K L EGG++VP I+ P I + S Q+ D +PTL AG
Sbjct: 317 FSDTWPFSGIKTELLEGGLRVPLIIRWPGITKANTTSDQVSMSMDLMPTLL-KAGQAAPH 375
Query: 418 LPLNIDGLDQWSSLLLNTPSRRN-----SNID 444
+DG+D +L+ S R SN+D
Sbjct: 376 PDFPLDGIDLTEALVSERTSTRQIYFRYSNLD 407
>gi|395804316|ref|ZP_10483556.1| sulfatase [Flavobacterium sp. F52]
gi|395433415|gb|EJF99368.1| sulfatase [Flavobacterium sp. F52]
Length = 539
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 119/400 (29%), Positives = 190/400 (47%), Gaps = 69/400 (17%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRA 133
+ + K T + +L+ G D+S +G PTP ID+LA +G+ + YA +C+PSRA
Sbjct: 57 SIIGKKTNIIILLADDLGKYDISLYGGKSTPTPQIDSLATSGVTFTDGYASAAICSPSRA 116
Query: 134 SLMTGKYPIHTGMQGPP----------IWG-----------------------AEPRGVP 160
L+TG+Y G + P +G + +G+P
Sbjct: 117 GLITGRYQERFGHEYQPGDRYPKNNLEYYGFKYLINTDGWRLNDKINYPNEASIKTQGLP 176
Query: 161 LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-------- 212
+E + + GYST IGKWHLG + + PL RGF+ H+G+ Y
Sbjct: 177 KSEITFADLAKRQGYSTGIIGKWHLGH-NKGFFPLDRGFDHHYGFYQAFSLYAPEDDNPD 235
Query: 213 -YDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
+H D +T+ NG +RR+ ST D Y T+ F +EA I D+ +KP
Sbjct: 236 IINHHHKDFTDKTIWGNGRVGIGQIRRD-STIIDEKA-YLTEKFAEEAEAFI-DKNKNKP 292
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY+ A H + ++ ++F + D N+R Y AMV LDD++G + +++
Sbjct: 293 FLLYIPFNAPHT-----PFQVRKKYYDRFPNVKDENKRVYFAMVSALDDAIGRIREKVKK 347
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
+G+ +N++IIF SDNG ++Y +N P +G K + +EGG+ VP A+ W +
Sbjct: 348 EGLEDNTLIIFASDNGG--------ADYTFATTNAPLKGGKFSHFEGGINVPFALSWKGK 399
Query: 387 IQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
I+ N V + D T+ TA G D + + DG+D
Sbjct: 400 IKPN-TVYKNPVSTLDIFTTIATAIGSDLPKDRI-YDGVD 437
>gi|344292944|ref|XP_003418184.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Loxodonta
africana]
Length = 513
Score = 150 bits (380), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 123/363 (33%), Positives = 175/363 (48%), Gaps = 36/363 (9%)
Query: 84 TLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPI 142
L L+ GW DL +G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ PI
Sbjct: 34 VLMLMDDMGWGDLGVNGEPSRETPNLDRMAAEGMLFPSFYSANPLCSPSRAALLTGRLPI 93
Query: 143 HTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
G G P+ G+P +E LPE L++ Y+TK +GKWHLG R ++ PL
Sbjct: 94 RNGFYTTNGHARNAYTPQDIVGGIPDSEHLLPELLKKANYATKIVGKWHLG-HRPQFHPL 152
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTVGEYATDL 250
GF+ FG N YD+ R E+ G + NL T + T +
Sbjct: 153 KHGFDEWFGSPNCHFGPYDNRAKPNIPVYRDWEMVGRFYEEFPINLKTGEANL----TQI 208
Query: 251 FTKEAVQLIE-DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+ +EA+ I+ Q +P FLY A A HA ++ R Y
Sbjct: 209 YLQEALDFIKRQQSQQRPFFLYWAIDATHA-----------PVYASKPFLGTSQRGRYGD 257
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V+++DDSVG ++ L+ G+ E++ I F SDNGA V S + GSN P+ K
Sbjct: 258 AVREVDDSVGQLLQLLRGLGIAEHTFIFFTSDNGAALV-----SAPKEAGSNGPFLCGKQ 312
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWS 429
T +EGGV+ PAI W P +VS Q+ + D T + AG + IDGLD
Sbjct: 313 TTFEGGVREPAIAWWPGHIPAGQVSHQLGSLMDLFTTSLSLAGLEPPS-DREIDGLDLLP 371
Query: 430 SLL 432
LL
Sbjct: 372 VLL 374
>gi|374369744|ref|ZP_09627765.1| sulfatase [Cupriavidus basilensis OR16]
gi|373098668|gb|EHP39768.1| sulfatase [Cupriavidus basilensis OR16]
Length = 433
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 113/330 (34%), Positives = 159/330 (48%), Gaps = 32/330 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
G+ DL +G E +P +D+LA NG++L YA PVC+P+R +LMT +Y G
Sbjct: 14 GYADLGCYGGREPVSPVLDSLAANGLMLTQGYANSPVCSPTRFALMTARYQYRLRGAAEE 73
Query: 150 PI----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
PI G+ G+P LP LR GY T IGKWHLGF + PL G+++ FG
Sbjct: 74 PINSKSRGSSALGLPPAHPTLPSLLRAQGYRTALIGKWHLGF-APAFGPLLSGYQAFFGP 132
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
+ G + Y+ H D R HD+ N A Y TDL ++ AV + +
Sbjct: 133 MAGGVDYFTHC--DSSGR------HDLWHN--DAEHQEPGYLTDLLSRRAVDYVTRMSGE 182
Query: 266 -KPLFLYLAHLAAHA--GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
P FL L + A H APQ + N F ++ N TY M+ +D+ +G ++
Sbjct: 183 GAPFFLSLHYTAPHWPWETRDDAALAPQVSNNLF-HLDGGNIHTYRRMISHMDEGIGWLV 241
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
AL++ G L+N++I+F SDNG + N+P G K L EGG++VP I
Sbjct: 242 QALRQLGQLDNTLIVFTSDNGG-----------ERFSDNWPLVGGKMDLTEGGIRVPWIA 290
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
P + S Q DW TL AAG
Sbjct: 291 HWPAMIAPGGTSAQHCMTMDWSATLLDAAG 320
>gi|114798452|ref|YP_760375.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
gi|114738626|gb|ABI76751.1| sulfatase family protein [Hyphomonas neptunium ATCC 15444]
Length = 508
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 132/414 (31%), Positives = 185/414 (44%), Gaps = 82/414 (19%)
Query: 92 GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G ND+S G + TPNID LA G + + Y+ C PSRA LMTG+YP TG
Sbjct: 30 GINDISTFGGGMADGRVQTPNIDRLAAEGALFSTAYSGTGTCAPSRAMLMTGRYPTRTGF 89
Query: 147 QGPPI----------------WGAEP----------------RGVPLTERFLPEYLRELG 174
+ P G P +G+P E L E L++ G
Sbjct: 90 EYTPTPPGMSRIVPMFANDMKTGLPPTEQVKENEKLMPPFAEQGLPTEEVTLAEVLKDRG 149
Query: 175 YSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV---------------ISYYDHILSD 219
Y T IGKWHLG + P +GF+ +G+ D L
Sbjct: 150 YHTVHIGKWHLG-NTSPFRPNDQGFDESLDMASGLFLPPGDPRGVEARLDFDPIDKFLWA 208
Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
+ NG D W G Y TD +T E++++I D ++P FLYLAH H
Sbjct: 209 RMDFAASYNGSD--------WFEPGGYLTDYWTDESLKVI-DANKNRPFFLYLAHWGVHT 259
Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
L+A +E I R YAAM++ +D SVG ++ L+ +G+ +N+II+F
Sbjct: 260 -----PLQATKEDYEAVGDIEPHRLRVYAAMLRAVDRSVGRIMEKLEEEGLADNTIIVFS 314
Query: 340 SDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM 397
SDN GA + E N PYRG K TL+EGG++VP + W IQ + +
Sbjct: 315 SDNGGAGYIGIPEV--------NAPYRGWKITLFEGGIRVPLFMKWPGTIQPGTVIDEPV 366
Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSS 451
HI D +PTL A G +RLP +++ +D + L L T + + D WSS
Sbjct: 367 AHI-DLMPTLAAATG---ARLPSSVE-IDGQNLLPLATGTGPITRPDDALFWSS 415
Score = 39.7 bits (91), Expect = 5.5, Method: Compositional matrix adjust.
Identities = 20/50 (40%), Positives = 28/50 (56%), Gaps = 4/50 (8%)
Query: 567 CYLFNLGNDPCEQNNIASSRPDISSQLYELLKYH----RRTLVPQSHEQP 612
+LFNL DP EQ N+A+S P+ ++L L+ H R L P + E P
Sbjct: 437 SWLFNLNEDPTEQTNLAASEPEKLAELQALIDAHWKDAREPLYPYTIESP 486
Score = 39.3 bits (90), Expect = 6.8, Method: Compositional matrix adjust.
Identities = 25/75 (33%), Positives = 37/75 (49%), Gaps = 6/75 (8%)
Query: 9 WDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 68
W G Y TD +T E++++I D ++P FLYLAH H L+A +E I
Sbjct: 221 WFEPGGYLTDYWTDESLKVI-DANKNRPFFLYLAHWGVHT-----PLQATKEDYEAVGDI 274
Query: 69 TDPNRRTYAALTKST 83
R YAA+ ++
Sbjct: 275 EPHRLRVYAAMLRAV 289
>gi|198432447|ref|XP_002128343.1| PREDICTED: similar to galactosamine (N-acetyl)-6-sulfate sulfatase
[Ciona intestinalis]
Length = 513
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 110/343 (32%), Positives = 159/343 (46%), Gaps = 49/343 (14%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L+ GW DL +G TPN+D +A G + + Y A P+C+PSRA+L+TG+ PI
Sbjct: 25 FMLMDDMGWGDLGINGQPSKETPNLDNMAKEGTLFTDFYSANPLCSPSRAALLTGRLPIR 84
Query: 144 TGMQGPPIWGAE---PR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P E + E L GY+ K IGKWHLG + +Y PL
Sbjct: 85 NGFYTSNYHGHNGYTPQHIVGGIPDHEILVSELLSSAGYTNKLIGKWHLGQ-QEQYLPLK 143
Query: 197 RGFESHFGYLNGVISYYDHI------------LSDQYSRTVELNGHDMRRNLSTAWDTVG 244
GF FG N YD + +Y + H N+
Sbjct: 144 HGFHEWFGSPNCHFGPYDDKTTPNIPVYNNTEMVGRYYEEFAIESHKYLSNM-------- 195
Query: 245 EYATDLFTKEAVQLIED-QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN 303
T + +EA+ IE + +KP FLY A A H+ P + F+ +
Sbjct: 196 ---TQYYIQEALDFIERMERNEKPFFLYWAPDATHS---------PVYSSPMFRGAS--R 241
Query: 304 RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYP 363
R Y V +LD VG +I L++ G+ +N++++F SDNGA + S + N GSN P
Sbjct: 242 RGPYGDAVMELDYGVGVIIQKLKQLGLDKNTLVLFSSDNGAAMI----GSAFEN-GSNGP 296
Query: 364 YRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
+ K T +EGG++ P I W P Q +VS Q+ + DW T
Sbjct: 297 FLCGKQTTFEGGIREPTIAWGPGFVQQGQVSHQVGSLMDWFAT 339
>gi|343084600|ref|YP_004773895.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342353134|gb|AEL25664.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 472
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 116/355 (32%), Positives = 175/355 (49%), Gaps = 49/355 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL +GS TPNID L G+ Y A PVC+PSRAS++TGK P + G G
Sbjct: 40 GWKDLGCYGSEFYETPNIDKLRDQGMKFTAAYSASPVCSPSRASILTGKNPANIGFTGHI 99
Query: 151 I------WGAEPRGVP--------LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
+ E R +P L E+ +PE L + GY++ +IGKWH+G ++ P +
Sbjct: 100 TAIGKHRYPEEGRIIPPDDYMHVSLEEKMIPEILLQSGYTSASIGKWHVG-EEEKFFPTH 158
Query: 197 RGFESHF-GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
+GF + GY +G Y + S + D R G+Y T+ T EA
Sbjct: 159 QGFAINIAGYEHGSPPTYWGPFESEKSWNPVIKNLDNREE--------GQYLTNRLTDEA 210
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI----TDPNRRTYAAMV 311
+ I D+ + P FLYL+H A H LEAP I +++ + YAAM+
Sbjct: 211 INFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYELKLEGQMEQKNAIYAAMI 264
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
+ +D +VG ++ +L G+ N+I+I SDNG E R T +N P R K +
Sbjct: 265 ENMDWNVGRLLKSLDSLGLEGNTIVILGSDNGG---EGRVT-------NNVPLREGKGYI 314
Query: 372 WEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
+EGG++VP ++ W +++ N + ++ D LPT+ AG S +IDG+
Sbjct: 315 YEGGIRVPLVIKWPGKVKPNSSSDVPVI-TDDMLPTIVEMAG--LSNENQDIDGV 366
Score = 39.7 bits (91), Expect = 5.0, Method: Compositional matrix adjust.
Identities = 21/54 (38%), Positives = 30/54 (55%), Gaps = 6/54 (11%)
Query: 13 GEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
G+Y T+ T EA+ I D+ + P FLYL+H A H LEAP I +++
Sbjct: 199 GQYLTNRLTDEAINFI-DENKEGPFFLYLSHYAVHT-----PLEAPDSLIKKYE 246
>gi|254482499|ref|ZP_05095738.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
gi|214037190|gb|EEB77858.1| sulfatase domain protein [marine gamma proteobacterium HTCC2148]
Length = 602
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 115/354 (32%), Positives = 169/354 (47%), Gaps = 45/354 (12%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHT 144
L L +G+NDL+ + ++ PTP +DA+A G+ YA+ CT SR +L+TG+YP
Sbjct: 20 LLALDDFGYNDLAINNGSDSPTPRLDAIAAQGVRFTRHYAESSCTASRVALLTGRYPARV 79
Query: 145 GMQGPPIWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESH 202
GA P G+ LP+ L GY +GKWH G RE P Y+GF+
Sbjct: 80 --------GAHPYLNGIDHELMTLPDALGSEGYIRHMVGKWHTGDSHRESRPEYQGFDHW 131
Query: 203 FGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
FG++N + H S Y R + N + TD+ T A+ +I+ +
Sbjct: 132 FGFINQLYLRGPH-RSANYRRGKPTYINPWLENELGDLQQYEGHLTDILTDRALDVIKRE 190
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNRRTYAAMVKKLDDSVGT 320
P FLYL++ A H P E +F +Y DP R Y AM +LD ++G
Sbjct: 191 --QNPWFLYLSYYAPH---------TPIEPAARFSERYADDPAGR-YQAMKDQLDSNIGR 238
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
+I L G ++N++II +SDNG ++W SN P+ G K T EGGV+ P
Sbjct: 239 IIDWLTESGEIDNTMIIVVSDNGGTA---------KSWPSNLPFYGSKATYTEGGVRTPL 289
Query: 381 IL-WS---PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
+L W P QQ+ ++++ I D PT+ A G P +DG D + S
Sbjct: 290 LLSWPGHWPVGQQDDQIAM----IFDLYPTILAALG---KAQPEGLDGADLFDS 336
>gi|325109705|ref|YP_004270773.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969973|gb|ADY60751.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 443
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 109/343 (31%), Positives = 173/343 (50%), Gaps = 34/343 (9%)
Query: 77 AALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRA 133
A + + + L++V G+ DL +GS I TP +D +A +G+ L + YA P+CTP+RA
Sbjct: 28 AEIQQQPNVVLILVDDLGYGDLGCYGSESIRTPRLDRMAASGMKLTSFYAAAPICTPTRA 87
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+LMTG Y G+ P+ + G+ +E L E +++ GY T +GKWHLG R Y
Sbjct: 88 ALMTGCYATRVGLP-TPLHVYDEIGINESEFTLGEAMQQCGYETVCVGKWHLGHQPRFY- 145
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
P GF ++G G + + + T +L D R + D T+ T+
Sbjct: 146 PTEHGFNHYWGTPLGHM-FNRPAVGKAIGDTSDLFLDDT-REIPFPEDA---DLTERLTE 200
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR-TYAAMVK 312
+AV+ IE + D+P FL+LAH P E + + D ++ Y +V+
Sbjct: 201 KAVEFIEAK-RDRPFFLFLAH------------PMPHEPLAASEKFADRSQAGLYGDVVE 247
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
LD S G V+ AL+R G EN+++ F+SDNG + G+ RG K+ +
Sbjct: 248 CLDWSTGEVLDALERTGQAENTLVAFLSDNGPK----------KGHGTTAGLRGFKHDPY 297
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
EGG++VP I+ + + + S ++ ++ DW PTL AGG T
Sbjct: 298 EGGMRVPCIIAAQGLIPAGQTSDEICNVMDWYPTLVNLAGGTT 340
>gi|340619489|ref|YP_004737942.1| sulfatase [Zobellia galactanivorans]
gi|339734286|emb|CAZ97663.1| Sulfatase, family S1-16 [Zobellia galactanivorans]
Length = 483
Score = 150 bits (379), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 120/388 (30%), Positives = 178/388 (45%), Gaps = 68/388 (17%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
+A K + ++V GW D++ +GS TPNID LA GI YA P+C+P+
Sbjct: 22 VFAQTDKQPNIVFILVDDLGWMDIAANGSTFYETPNIDQLAKEGIRFTKAYAASPICSPT 81
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEP------RGVPLTERFLPEYLRELGYSTKAIGKWHL 185
RAS++TGK P + W P R +PL E PE L++ GY T +GKWHL
Sbjct: 82 RASILTGKNPARIDLTQ---WIGGPGNPDYLRNLPLEEVLFPELLQDAGYKTAFMGKWHL 138
Query: 186 GFFRRE--YTPLYRGFESHF-GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDT 242
E + P +GF+ + G+ G + + S +E NG +
Sbjct: 139 NNVAGEEKFWPDKQGFDVNVAGHFRGGLYIKNKYFSPWNIPNLE-NGPE----------- 186
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQ------- 295
GEY TD KEA+ I D+ P LY + + HA +AP + + +
Sbjct: 187 -GEYMTDRLAKEAIDFI-DENSKAPFLLYFSLYSVHAP-----FDAPADRVEKYEKKKKA 239
Query: 296 ------------------FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
F+Y + TYAAMV+ +D +VG ++ LQ KG+ +N+++I
Sbjct: 240 LALTDKERFAEEKNAQKPFKYRIKQDHPTYAAMVESMDMAVGKILGKLQEKGIADNTVVI 299
Query: 338 FMSDNGA-PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQ 396
F SDNG T E T +N P R K L+EGG++ PAI+ P + VS
Sbjct: 300 FFSDNGGLSTSEGIPT-------ANTPLRAGKGWLYEGGIREPAIIRWPGTIKPGTVSDA 352
Query: 397 MMHISDWLPTLYTAAGGDTSRLPLNIDG 424
++ D+ PT+ G R L++DG
Sbjct: 353 VITSMDFYPTILEMT-GQALRPDLHVDG 379
>gi|146302377|ref|YP_001196968.1| sulfatase [Flavobacterium johnsoniae UW101]
gi|146156795|gb|ABQ07649.1| sulfatase [Flavobacterium johnsoniae UW101]
Length = 539
Score = 150 bits (378), Expect = 2e-33, Method: Compositional matrix adjust.
Identities = 136/513 (26%), Positives = 227/513 (44%), Gaps = 98/513 (19%)
Query: 73 RRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCT 129
+ T AA K + +L+ G D+S +G PTP ID+LA +G+ + Y + +C+
Sbjct: 53 KDTSAASEKKPNIIILLADDLGKYDISLYGGKSTPTPQIDSLAASGVTFTDGYVSSSICS 112
Query: 130 PSRASLMTGKYPIHTGMQGPP---------------------IWGAEPR----------- 157
PSRA L+TG+Y G + P W P+
Sbjct: 113 PSRAGLLTGRYQERFGHEYQPGDRYPKNNLEYYAFKYLLNTNSWRLNPKIEYPNDASIAT 172
Query: 158 -GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY---- 212
G+P +E + ++ GYST IGKWHLG + + PL RGF+ H+G+ +
Sbjct: 173 QGLPKSEITFADLAKKQGYSTAIIGKWHLGH-TKGFFPLDRGFDYHYGFYQAFSLFAPED 231
Query: 213 -----YDHILSDQYSRTVELNGH----DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
+H +D +T+ NG +RR+ ST D +Y T+ F +EA I D+
Sbjct: 232 NNPDIINHHHTDFTDKTIWGNGRVGTGQIRRD-STIIDE-KKYLTEKFAEEAEAFI-DKN 288
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+KP LY+ A H + ++ ++F + D N+R Y AM+ LDD++G + +
Sbjct: 289 KNKPFLLYVPFNAPHT-----PFQVRKKYYDRFPNVKDENKRVYFAMISALDDAIGLIRA 343
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AIL 382
++++G+ EN++I F SDNG Y T N P +G K + +EGGV VP A+
Sbjct: 344 KVKKEGLEENTLIFFASDNGGADYTYATT--------NAPLKGGKFSHFEGGVNVPFALS 395
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR--RN 440
W +I+ + T+Y + LD +S++ T S ++
Sbjct: 396 WKGKIKPH---------------TIYKTP----------VSSLDIFSTIAAVTHSGLPKD 430
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQT 500
DG+D + N + +N + D K A+R WKL++ + + T +
Sbjct: 431 RVYDGVDLVDVVNNNKQAHQNLYWRSGDAK----AIRSGDWKLIISGKTHETWLYNLAKD 486
Query: 501 RSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
+S L + N + +LQ + + P+
Sbjct: 487 KSETTDLASKNPEKVKELQTALQNWEKGLIKPL 519
>gi|296122982|ref|YP_003630760.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296015322|gb|ADG68561.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 466
Score = 150 bits (378), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 119/369 (32%), Positives = 181/369 (49%), Gaps = 37/369 (10%)
Query: 82 STTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
ST +++++ G+ DL +GS I TP++D +A G+ + Y A VCTPSRA+L+
Sbjct: 33 STKPNIVVIFADDLGYGDLGCYGSPTIRTPHLDEMAAEGLRFTDFYSAAEVCTPSRAALL 92
Query: 137 TGKYPIHTGMQGP--PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
TG+ PI +GM G ++ G+P E + E L+E GY+T IGKWHLG P
Sbjct: 93 TGRLPIRSGMCGARRVLFPNSKGGLPPAEITIAEALKEKGYATAQIGKWHLGIHPGS-RP 151
Query: 195 LYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVGEYA-TD 249
L +GF+ FG Y N + + D S L+G + + RN + T
Sbjct: 152 LDQGFDQSFGLPYSNDMDARADLPKGSTGSPNPPLDGWNVALLRNGEVVEQPANQTTLTK 211
Query: 250 LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAA 309
+T+EA++ I ++ + P FLY+ H H P F+ + Y
Sbjct: 212 RYTEEAIKFITEK-KNVPFFLYMPHTFPHV---------PMFASQDFKGKSRAG--IYGD 259
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKN 369
V++LD SVG V+ AL R+G+ EN+++ F SDNG + + GS + K
Sbjct: 260 AVEELDWSVGQVLGALCREGIAENTLVFFSSDNGPWLIMGDQG------GSAGLLKDGKG 313
Query: 370 TLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNI--DGLDQ 427
+ WEGG++VP I W P + P V+ Q+ D PT AG + LP ++ DG+D
Sbjct: 314 STWEGGMRVPGIAWMPS-RIKPGVTSQLASAMDVFPTALALAG---ASLPKDVVFDGVDL 369
Query: 428 WSSLLLNTP 436
L + P
Sbjct: 370 APLLFESRP 378
Score = 40.4 bits (93), Expect = 3.1, Method: Compositional matrix adjust.
Identities = 15/41 (36%), Positives = 28/41 (68%)
Query: 566 PCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTLVP 606
P LF+LG DP E+ N+A++ P++ ++ E +K H+ ++P
Sbjct: 420 PELLFHLGRDPSEKRNVAAAHPEVLIRIQEAVKAHQSQVIP 460
>gi|255035933|ref|YP_003086554.1| sulfatase [Dyadobacter fermentans DSM 18053]
gi|254948689|gb|ACT93389.1| sulfatase [Dyadobacter fermentans DSM 18053]
Length = 497
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 127/426 (29%), Positives = 191/426 (44%), Gaps = 65/426 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ +L +G +I TPN+D LA GI Y PVC P+RA LMTGK+ H+ ++G
Sbjct: 38 GYGELGCYGQQKIKTPNLDRLAKEGIRFTQHYTGTPVCAPARAMLMTGKHAGHSAIRGNF 97
Query: 151 IWGA----EPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
G E RG +P E + E L++ GY+T GKW +G E TP +GF+ ++
Sbjct: 98 ELGGFRDEEERGQMPLPANELTVAELLKQKGYATALTGKWGMGMNNTEGTPTRQGFDYYY 157
Query: 204 GYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG--------EYATDLFT 252
GYL+ + Y H+ + T+ D+ R L A T EYA T
Sbjct: 158 GYLDQKQAHNLYPSHLWENDRWDTLAQPWQDIHRKLDPAKATDADFESFKGKEYAPAKMT 217
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF--QYITDPNR------ 304
++A+ I D+ P FLY+ + H L+AP E + ++ Q+ P
Sbjct: 218 EKALAFI-DRSKAGPFFLYMPYTLPHVS-----LQAPDEYVKKYIGQFDEKPYYGEKNYA 271
Query: 305 ------RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
TYA+M+ LDD VG ++ L+ G+ +N+I++F SDNGA + N + +
Sbjct: 272 STKYPLSTYASMITFLDDQVGIILDKLKALGLDDNTIVMFSSDNGA---TFNGGVNPQFF 328
Query: 359 GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
S RG+K ++EGG++ P I+ P + RVS + D +PTL G +
Sbjct: 329 NSVAGLRGLKMDVYEGGIREPFIVRWPGKIKPGRVSDHVSAQFDLMPTLAELTGQASP-- 386
Query: 419 PLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRL 478
DG+ LL T N + + EK AVR+
Sbjct: 387 --PTDGISFLPELLGQT-------------------NRQKKHEFLYFEYPEKGGQIAVRM 425
Query: 479 DSWKLV 484
WK V
Sbjct: 426 GDWKGV 431
>gi|440289645|ref|YP_007342410.1| arylsulfatase A family protein [Enterobacteriaceae bacterium strain
FGI 57]
gi|440049167|gb|AGB80225.1| arylsulfatase A family protein [Enterobacteriaceae bacterium strain
FGI 57]
Length = 550
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 130/435 (29%), Positives = 200/435 (45%), Gaps = 69/435 (15%)
Query: 92 GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
GW D+ F+G PTP+IDA+A G+IL + Y+QP +P+RA+++TG+Y +H G+
Sbjct: 97 GWMDVGFNGGGVAVGNPTPDIDAVASQGLILTSAYSQPSSSPTRATILTGQYSVHHGILM 156
Query: 149 PPIWGAEPRGVP-LTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
PP++G P G+ LT LP+ L + GY T+AIGKWH+G P GF+ G+ N
Sbjct: 157 PPMYGM-PGGLEGLTT--LPQLLHDQGYVTQAIGKWHMG-ENEGSQPQNVGFDDFRGF-N 211
Query: 208 GVISYY----------DHILS---DQYSRTVELNGHDMRR----NLSTAWDTVGEYATDL 250
V Y + LS +Y + + + D+ D +Y DL
Sbjct: 212 SVSDMYTEWRDVNVNPEVALSPARSEYIKKLPFSKDDVHAVRGGKQQAVADITPKYMEDL 271
Query: 251 ---FTKEAVQLIEDQP-VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+ K V I DKP FLY H N N + P R +
Sbjct: 272 DQRWMKYGVDFINKMAKSDKPFFLYYGTRGCHFDN----------YPNAKYAGSSPARTS 321
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y + +++D + L++ G L+N++I+F SDNG P E P+RG
Sbjct: 322 YGDCMVEMNDVFANLYKTLEKNGQLDNTLIVFTSDNG-PEAEVPPH-------GRTPFRG 373
Query: 367 VKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K + WEGGV+VP + W +Q PR S ++ ++D PT + AG + L
Sbjct: 374 AKGSTWEGGVRVPTFVYWKGMVQ--PRKSDGIVDLADLFPTALSLAGHPGAEL------- 424
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWK--L 483
+ L+ T + IDG+DQ +S + + N + + AA+R+D +K +
Sbjct: 425 ---AKLVPKT-----TFIDGVDQ-TSFFVGDNGQSNRKAEHYFLNGQLAAIRMDEFKYHM 475
Query: 484 VLGTQENGTMDGYYG 498
V+ T GY G
Sbjct: 476 VIQQPYAFTQSGYQG 490
>gi|171910115|ref|ZP_02925585.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 460
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 115/343 (33%), Positives = 171/343 (49%), Gaps = 29/343 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGP- 149
G+ DL +GS I TP++D +A G+ + Y A VCTPSRA+L+TG+YP+ +GM G
Sbjct: 41 GYGDLGCYGSPTIATPHLDQMAAEGLRFTDFYVASEVCTPSRAALLTGRYPVRSGMYGKR 100
Query: 150 -PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YL 206
++ G+P E LPE L+ GY+T +GKWHLG PL +GF+ FG Y
Sbjct: 101 RVLFPNSTGGLPAGEITLPEALKARGYATAHVGKWHLGIHEGS-RPLDQGFDQSFGLPYS 159
Query: 207 NGVISYYDHILSDQYSRTVELNGHD--MRRNLSTAWDTVGE-YATDLFTKEAVQLIEDQP 263
N + + D S T ++G + + RN + + T +T+EAV+ I+ Q
Sbjct: 160 NDMDARPDLPKGSTGSPTPPIDGWNVPLLRNGEVVEKPADQVHLTGHYTEEAVKFIQ-QK 218
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
+P FLY+AH H P F+ + Y V++LD SVG V+
Sbjct: 219 KSQPFFLYMAHSFPH---------VPLFASENFKGRSRAG--IYGDTVEELDWSVGQVLQ 267
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
L+++G+ EN+++ F SDNG + + GS P + K + WEGG++VP I W
Sbjct: 268 TLRKEGLSENTLVFFTSDNGPWLIMGAQG------GSAGPLKDGKGSHWEGGMRVPGIAW 321
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P P VS + D T AG + + + IDG D
Sbjct: 322 WPG-HIKPGVSRTPANSMDLFVTSVALAGAEVPKDRV-IDGTD 362
>gi|260816809|ref|XP_002603280.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
gi|229288598|gb|EEN59291.1| hypothetical protein BRAFLDRAFT_126970 [Branchiostoma floridae]
Length = 377
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 86/223 (38%), Positives = 121/223 (54%), Gaps = 19/223 (8%)
Query: 87 LLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIH 143
LLIV GW+D+S++ + TPN+ LA G+I N YAQP C+PSR +L+TGK+P
Sbjct: 28 LLIVADDLGWSDVSWNNPYVV-TPNLHTLATTGVIFNQTYAQPTCSPSRTALLTGKFPFR 86
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
GMQ + +P G+PL E LP+ L++LGY+T +GKWHLG + EYTP RGF+S +
Sbjct: 87 LGMQRV-MDSKKPHGLPLDEELLPQKLKKLGYATHMVGKWHLGSCKWEYTPTERGFDSFY 145
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
GY +G YY H G D ++ D G Y+T+ F A +I
Sbjct: 146 GYHHGSQDYYTH---------KSARGLDFWDGKTSISDQNGVYSTESFATRAENIISQHD 196
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+ PLFLYL + H + + P + F I D NR++
Sbjct: 197 PNTPLFLYLPFQSVHTPH-----QVPSSYLQTFSTIQDDNRKS 234
>gi|224064555|ref|XP_002192706.1| PREDICTED: N-acetylgalactosamine-6-sulfatase [Taeniopygia guttata]
Length = 549
Score = 149 bits (377), Expect = 3e-33, Method: Compositional matrix adjust.
Identities = 118/350 (33%), Positives = 171/350 (48%), Gaps = 35/350 (10%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
AA T + L L+ GW DL G TPN+D +A G++ + Y A P+C+PSRA+L
Sbjct: 53 AATTTNIVLLLMDDMGWGDLGAFGEPSKETPNLDQMAAEGMLFLDFYTASPLCSPSRAAL 112
Query: 136 MTGKYPIHTGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFF 188
+TG+ P+ G G P+ G+P +E LPE L++ GY+ K IGKWHLG
Sbjct: 113 LTGRLPVRNGFYTTNGHARNAYTPQDIVGGIPDSELLLPELLKKAGYTNKIIGKWHLGH- 171
Query: 189 RREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWDTV 243
R ++ PL GF+ FG N Y+ R E+ G D + +L T +
Sbjct: 172 RAQFHPLKHGFDEWFGSPNCHFGPYNSTERPNIPVYRDWEMVGRYYEDFKIDLKTGEANL 231
Query: 244 GEYATDLFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
T L+ +EA+ I + Q +P FLY A A HA Q++
Sbjct: 232 ----TQLYLQEALDFISKQQASQQPFFLYWAIDATHA-----------PVYASKQFLGTS 276
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R Y V+++DDSVG ++ LQ+ G+ E++ + F SDNGA + S + GSN
Sbjct: 277 QRGLYGDAVREIDDSVGKILKHLQQLGISESTFVFFTSDNGAALI-----SAPKQGGSNG 331
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
P+ K T +EGG++ PAI W P VS Q+ + D T + AG
Sbjct: 332 PFLCGKQTTFEGGMREPAIAWWPGHIPAGGVSHQLGSVMDLFSTSLSLAG 381
>gi|383780433|ref|YP_005464999.1| putative sulfatase [Actinoplanes missouriensis 431]
gi|381373665|dbj|BAL90483.1| putative sulfatase [Actinoplanes missouriensis 431]
Length = 464
Score = 149 bits (376), Expect = 4e-33, Method: Compositional matrix adjust.
Identities = 118/338 (34%), Positives = 165/338 (48%), Gaps = 34/338 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT-GMQGP 149
GW DLS +GS EI TPN+D LA +G+ Y A VC+P+R +L TG+ P T G
Sbjct: 59 GWADLSSYGSPEIRTPNLDRLARSGVRFTQGYSAGAVCSPTRVALYTGRLPGRTPGGLPE 118
Query: 150 PIWGAEPR-GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
PI R G+P T L L++ GY+T IGKWH GF ++PL G++ FG +G
Sbjct: 119 PIGSVNERDGIPATHPTLASLLKKTGYATALIGKWHGGFLP-NFSPLRTGWDYFFGNYSG 177
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLI-EDQPVDKP 267
+ YY +YS T + +D+ + D G Y TD+ +A I + + KP
Sbjct: 178 GVDYYS-----KYSHTGK---YDLFEGETEVPDD-GRYYTDVIADKAEDWITKAKKQGKP 228
Query: 268 LFLYLAHLAAH----AGNAGKHLEAPQETIN----QFQYITDPNR-RTYAAMVKKLDDSV 318
L L + H A K E I Q + D TY MV+ LD +
Sbjct: 229 WLLNLNFTSPHWPWEAPGDRKISEEITAKIKAGNTQAIFHNDGGSLETYTKMVENLDARI 288
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V++AL+R G EN++++F SDNG Y N+P G K +L EGG++V
Sbjct: 289 GQVLNALRRSGQEENTLVVFASDNGGERFSY-----------NWPLTGNKGSLSEGGIRV 337
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
P IL P ++ +VS + + DW TL AG S
Sbjct: 338 PTILRWPARIRSGQVSHEPVVTHDWTATLLEIAGAAPS 375
>gi|220919599|ref|YP_002494902.1| sulfatase [Methylobacterium nodulans ORS 2060]
gi|219952019|gb|ACL62410.1| sulfatase [Methylobacterium nodulans ORS 2060]
Length = 482
Score = 149 bits (376), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 105/327 (32%), Positives = 164/327 (50%), Gaps = 29/327 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
G+ D+ +G ++ TP+ID +A G+ YA VC+ +R +L+TG+Y + G++
Sbjct: 48 GYADVGCYGRPDLKTPHIDRIAARGVRFTQGYANSAVCSATRTALITGRYQYRLPVGLE- 106
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P+ G+P LP LR GY T +GKWHLG + + PL RG+++ +G+ G
Sbjct: 107 EPVTTRFTGGLPAEHPTLPGLLRRAGYRTALVGKWHLGSVAK-FGPLQRGYDTFWGFRGG 165
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ-PVDKP 267
I Y+ H +D +L D+R + VG YAT L + AV+ I +P
Sbjct: 166 TIDYFSHRGTDDKP---DLWDGDVRV------EEVG-YATHLIGRRAVETIGTAVRAGRP 215
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETI--NQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
FL L A H EA I + ++ + TY +V ++D +G V+ AL
Sbjct: 216 FFLSLHFNAPHWPWETPEDEAESRRIAGSNLRHFDGGTQATYHRIVAEMDAQIGRVLDAL 275
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP 385
++ G+ +++I++F SDNG + +P+ G K L EGG+++PAI+ P
Sbjct: 276 EQHGIAQDTIVVFTSDNGG-----------ERFSDTWPFTGRKTELLEGGLRIPAIVSWP 324
Query: 386 QIQQNPRVSLQMMHISDWLPTLYTAAG 412
+ VS Q M DWLPTL AAG
Sbjct: 325 RRLPAGVVSAQPMITMDWLPTLLAAAG 351
>gi|440750638|ref|ZP_20929879.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
gi|436480856|gb|ELP37068.1| Arylsulfatase [Mariniradius saccharolyticus AK6]
Length = 478
Score = 149 bits (375), Expect = 5e-33, Method: Compositional matrix adjust.
Identities = 117/354 (33%), Positives = 179/354 (50%), Gaps = 39/354 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G+ +I TPN+D +A NG+ N Y +Q VC+ SRASL+TG Y G+ G
Sbjct: 44 GYGDLGAYGAEDIATPNLDQMAENGVKFTNFYVSQAVCSASRASLLTGTYANRLGIHGAL 103
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
A+ G+ E + E L+E GY T GKWHLG + P +GF+ +G
Sbjct: 104 DHTAK-HGLNPEEITIAEMLKEQGYQTAMFGKWHLG-HHPPFLPTKQGFDEFYGIPYSND 161
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+ H + Y + L +D + T + T+L T+ AV I+ + D+P FL
Sbjct: 162 MWPQHPQTKNYYPPLPLFFND--NVVDTLHNDQSRLTTEL-TERAVNFIQRKK-DQPFFL 217
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLAH H P +F+ R Y ++ ++D SVG ++ L+ G+
Sbjct: 218 YLAHPMPH---------VPLFVSEKFR--GKSKRGLYGDVLMEIDWSVGQILKTLRDNGL 266
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQI-- 387
E +++IF+SDNG P + + + GS YP R K T W+GGVKVPA++ W +I
Sbjct: 267 EEKTLVIFISDNG-PWLSFSGHA-----GSAYPLREGKGTSWDGGVKVPALMYWKGKIPA 320
Query: 388 ---QQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSR 438
Q NP +++ M LPT+ D +LP IDG D S+++L+ ++
Sbjct: 321 GTVQTNPAMTIDM------LPTIAKLTRADLPKLP--IDGKDI-STMILDAKAK 365
>gi|154501502|ref|ZP_02039203.1| hypothetical protein BACCAP_04855 [Bacteroides capillosus ATCC
29799]
gi|150269818|gb|EDM97356.1| arylsulfatase [Pseudoflavonifractor capillosus ATCC 29799]
Length = 494
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 137/440 (31%), Positives = 195/440 (44%), Gaps = 105/440 (23%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP 141
++++Y G+ DL G+ I TPNIDALA G++L N YA P+C+ SRA L+TG+YP
Sbjct: 73 VVVIYVDDMGYGDLGCTGATAISTPNIDALAEGGVLLTNYYAPAPICSASRAGLLTGRYP 132
Query: 142 IHT----------GMQG------PPIWGAEP---RGVPLTERFLPEYLRELGYSTKAIGK 182
I T G+ G + G P G+P E LPE L++ GY T +GK
Sbjct: 133 IRTLTSGAYMNTEGLSGHLANLLEVVKGTYPYQNDGLPTDEILLPEVLQQAGYETALVGK 192
Query: 183 WHLGFFRREYTPLYRGFESHFGYLNG------VISYYDHILSDQYSRTVELNGHDMRRNL 236
WHLG R E P RGF+ +G L I + D ++ D+
Sbjct: 193 WHLGI-REEERPYNRGFDLFYGALYSDDNDPHRIYHNDEVVHDE---------------- 235
Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQ 295
+D G T T+ A Q I+D D P FLY A H NA +
Sbjct: 236 --PYDQSG--MTKELTQVAKQFIDDNQ-DGPFFLYYASPFPHWPSNASE----------- 279
Query: 296 FQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
+++ Y ++++D SVG ++ L+ G+LEN+++IF SDNG
Sbjct: 280 -EWLGTSQAGIYGDCMQEVDWSVGEIMDTLEENGLLENTLVIFTSDNGP----------- 327
Query: 356 RNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG 413
W G+ RG K+T + GG VP I + P V +M D PT+
Sbjct: 328 --WYDGATGGQRGRKDTNYNGGSHVPFIAYMPGTIPEGEVYDGLMSGVDVFPTI------ 379
Query: 414 DTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
LN+ G++ P R IDG+D W L + S R + +N D K T
Sbjct: 380 ------LNLLGIE--------LPQDRV--IDGMDMWPFLTGQSDSPRTELFLNKD--KDT 421
Query: 474 AAVRLDSWK-LVLGTQENGT 492
A+ D++K L ENGT
Sbjct: 422 FALIEDNFKYLERSYSENGT 441
>gi|87309459|ref|ZP_01091594.1| arylsulphatase A [Blastopirellula marina DSM 3645]
gi|87287767|gb|EAQ79666.1| arylsulphatase A [Blastopirellula marina DSM 3645]
Length = 457
Score = 149 bits (375), Expect = 6e-33, Method: Compositional matrix adjust.
Identities = 116/356 (32%), Positives = 172/356 (48%), Gaps = 54/356 (15%)
Query: 77 AALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASL 135
AA TK + +LI G D +G+ TP+ID LA G+ + YA PVC+P+RASL
Sbjct: 27 AAPTKPNIVFILIDDMGCKDAGCYGATNFSTPHIDRLANQGMRFTDAYAAPVCSPTRASL 86
Query: 136 MTGKYP--IHTGMQGPPIWGAEPRG----------VPLTERFLPEYLRELGYSTKAIGKW 183
MTGK+P +H P I P G +PL E+ + + L GY IGKW
Sbjct: 87 MTGKHPARLHLTNFIPQIGRQLPAGKLIPPGFNHVLPLDEKTIAQELHADGYQCAMIGKW 146
Query: 184 HLGFFRR-EYTPLYRGFE-----SHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
HLG EY P RGF+ H G N +Y + ++ L G+
Sbjct: 147 HLGEEHGPEYRPQNRGFDRVVLSEHHGIFN---YFYPFVDQQKWPYAGPLPGNP------ 197
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
G+Y D T EA+ + + ++P FLYL+H + H G++ AP+ I +++
Sbjct: 198 ------GDYLPDRLTDEAIDFVREN-RERPFFLYLSHWSVH----GRYF-APESLIAKYR 245
Query: 298 YITDPNR-RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
R YAAM++ +D+SVG +++ L + +N++ +FMSDNG +
Sbjct: 246 ERGLEERPAIYAAMMETVDNSVGRLMATLDELNLADNTLFVFMSDNGGERIT-------- 297
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHIS-DWLPTLYTAA 411
S P RG K +L+EGGV+VP I+ P + + P + + IS D PT A
Sbjct: 298 ---SMAPLRGSKGSLYEGGVRVPLIVRYPGVVK-PNTTCSVPVISHDLFPTFLDFA 349
>gi|326802120|ref|YP_004319939.1| sulfatase [Sphingobacterium sp. 21]
gi|326552884|gb|ADZ81269.1| sulfatase [Sphingobacterium sp. 21]
Length = 471
Score = 149 bits (375), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 114/354 (32%), Positives = 173/354 (48%), Gaps = 48/354 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ F+G I TPN+D LA GI N YA VC PSR+SLM+G++ HT ++G
Sbjct: 37 GYGDVGFNGQRLINTPNLDRLAKEGIRFNQFYAGTAVCAPSRSSLMSGQHTGHTFIRGN- 95
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
G EP G +P + + E L++ GY T A GKW LG E P +GF+ +GY
Sbjct: 96 -LGVEPEGQYPIPDSVETIAETLKKAGYQTGAFGKWGLGPVGSEGDPNKQGFDRFYGYNC 154
Query: 208 GVIS--YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
++ YY L D + V D+ + +YA DL ++A+ IE+ D
Sbjct: 155 QSLAHRYYPTHLWDNSEKVVLTENGDLEYHK--------QYAPDLIQEQALGFIEEHK-D 205
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQF---QYITDPNR------------------ 304
+P FL+L ++ HA L P ++I Q+ ++ P +
Sbjct: 206 RPFFLFLPYILPHA-----ELLVPDDSIFQYYKGKFEEKPFKGFDYGKGAKSGGYASQQY 260
Query: 305 --RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
T+AAMV +LD VG + L+ G+ EN++I+F SDNG P E N+ + S+
Sbjct: 261 PHSTFAAMVARLDLYVGQIEEKLRSLGLDENTLIVFTSDNG-PHKEGGADPNF--FQSSG 317
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
RG+K L+EGG++VP + P+ + + S + D PT G T+
Sbjct: 318 GLRGIKRDLYEGGIRVPFVAKWPKAIRAGQQSEFIGAFWDIFPTFAELGGAATA 371
>gi|170727990|ref|YP_001762016.1| sulfatase [Shewanella woodyi ATCC 51908]
gi|169813337|gb|ACA87921.1| sulfatase [Shewanella woodyi ATCC 51908]
Length = 484
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 114/346 (32%), Positives = 164/346 (47%), Gaps = 63/346 (18%)
Query: 80 TKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPV-CTPSRAS 134
+K ++I+Y G D +GS + PTPNID LA +G+ YA C PSRAS
Sbjct: 33 SKLKQANVVIIYVDDLGIMDTGIYGSAQYPTPNIDKLANSGVRFTQAYANAANCAPSRAS 92
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLP---------------EYLRELGYSTKA 179
LMTG P G+ G+ RG + +P + ++ GY+T
Sbjct: 93 LMTGLTPAEHGIL---TVGSSERGESQYRKLIPVTNNTELNPDLTTIADLFKQQGYATAV 149
Query: 180 IGKWHLGFFRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
IGKWHLG + P GF++ +L SY+ + S + + L ++
Sbjct: 150 IGKWHLG----KTAPTEYGFDTAIAASHLGHPPSYF-YPYSKGKRKLIGLEEGGLK---- 200
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF- 296
EY ++ T+EAV I Q +P FLYL A H +EAP+E +NQ
Sbjct: 201 ------DEYLSNRITREAVNYISSQ--RQPFFLYLPFYAVHTP-----IEAPKEWVNQHN 247
Query: 297 --QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
Q + YAAM+ LD VG ++ AL + G EN++++F SDNGA
Sbjct: 248 ARQQAGEIKSAAYAAMIANLDRDVGKLLQALDKSGQRENTLVVFASDNGA---------- 297
Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNP--RVSLQM 397
Y S+ PYRG K++L+EGG+K+P +L W QI N R +QM
Sbjct: 298 YDPATSSLPYRGYKSSLFEGGIKIPLVLSWPKQIPPNSQNRTPVQM 343
>gi|372210887|ref|ZP_09498689.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 476
Score = 148 bits (374), Expect = 7e-33, Method: Compositional matrix adjust.
Identities = 133/453 (29%), Positives = 203/453 (44%), Gaps = 85/453 (18%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G DL +G +I TPN+DALA GI Y+ VC PSR+SLMTG++ HT ++G
Sbjct: 42 GIGDLGTYGQKKIKTPNLDALAAGGIKFTQHYSGSAVCAPSRSSLMTGQHTGHTPIRGNM 101
Query: 149 --PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
I G +P +P + L E L++ GY+T GKW LGF E P +GF+ +GY
Sbjct: 102 ELKGIGGQQP--LPESSITLAEMLKKAGYTTGMFGKWGLGF--GEGDPNKQGFDEFYGY- 156
Query: 207 NGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDK 266
D L+ +Y T + ++ W + YA D ++ ++ I+ DK
Sbjct: 157 ------NDQKLAHRYYPTHLNHNQEIDSLEGNDWTSTEVYAQDKIQQKTLEFIQTNK-DK 209
Query: 267 PLFLYLAHLAAHAGNAGKHLEAPQETINQ-------------FQYITD--PNR------- 304
P F Y+ + HA L +PQ++I Q Y++D PN
Sbjct: 210 PFFAYVPMVLPHA-----ELISPQDSILQQYVGKFDEVPYKNHPYLSDYGPNMVYYKYCS 264
Query: 305 -----RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
YA+MV ++D +G ++S L++ G+ N+II+F SDNGA +N +
Sbjct: 265 QETPYAVYASMVTRMDAYIGQIVSKLKKLGIDNNTIIMFSSDNGAHAA---GGANPEFFD 321
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
S RG+K L+EGG++ P I+ P + V+ + D +PTL G + P
Sbjct: 322 STDGLRGIKRDLYEGGIRAPFIVAWPGTIPSGVVTDHISAFWDVMPTLADIVG---IKAP 378
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
DG+ +LL +++ NI W NI ++ AVRL
Sbjct: 379 KQSDGISFLPTLL----NQKQQNIHEYMYWE--------------YNIKGGRK--AVRLG 418
Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNA 512
WK V+ Y + +K+ L N A
Sbjct: 419 KWKGVI----------YKANNKKSKIELYNLEA 441
>gi|212379227|gb|ACJ24843.1| n-acetylgalactosamine-6-sulfate sulfatase-like protein
[Streptomyces pactum]
Length = 524
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 109/342 (31%), Positives = 160/342 (46%), Gaps = 43/342 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
GW DLS +GS I TP++D LA G+ + YA C+P+R SL TG++P T G
Sbjct: 116 GWADLSSYGSPHIHTPHLDRLARQGVRFTHAYAGSSTCSPTRFSLYTGRFPGRTPGGLHE 175
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
PI G G+P L LR GY+T IGKWH G+ +++P G+E+ FG G
Sbjct: 176 PIPGGSDAGLPPNHPTLASLLRGAGYATALIGKWHCGYLP-DHSPTKSGWETFFGNFGGA 234
Query: 210 ISYYDHI-LSDQYS-RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
+ YY + L+ +Y E++ D+R Y T + T+ A + I + KP
Sbjct: 235 LEYYSKLGLTGEYDLYEGEVSHQDLR------------YYTRIITERAAEYI-GRDHRKP 281
Query: 268 LFLYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
L L H + ++A Q + + Y +V+ LD
Sbjct: 282 WLLNLNFTTPHWPWIAEGDTAESARVTARIKAGQR--GALNHRDGGSLEKYRELVEDLDR 339
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
SVG V++AL+R G EN++++F SDNG Y +P G K TL EGG+
Sbjct: 340 SVGEVLAALRRSGQEENTLVVFASDNGGERFSY-----------QWPLSGEKFTLLEGGI 388
Query: 377 KVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
+VP I+ W +I +VS + ++ DW TL G R
Sbjct: 389 RVPTIVRWPARIDGGAQVSHEPVYTPDWTATLLEVGGARPDR 430
>gi|418459428|ref|ZP_13030547.1| n-acetylgalactosamine-6-sulfate sulfatase [Saccharomonospora azurea
SZMC 14600]
gi|359740510|gb|EHK89351.1| n-acetylgalactosamine-6-sulfate sulfatase [Saccharomonospora azurea
SZMC 14600]
Length = 447
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 167/350 (47%), Gaps = 37/350 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYP--IHTGMQG 148
GW DL +GS I TPN+D LA G+ + YA P C+ +R SL TG+YP + G++
Sbjct: 10 GWKDLGCYGSTAIRTPNLDRLAAEGLRFTHGYAGSPWCSSTRISLYTGRYPGRLPAGLEE 69
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
P + +E G+P LP L G+ T GKWH G+ Y+PL GF++ FG L+G
Sbjct: 70 PLVTRSEGHGIPADHPTLPSLLAGAGHRTAMFGKWHCGWLPW-YSPLRIGFQTFFGNLDG 128
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+ Y++H+ D R G T + VG Y TDL + A + + P
Sbjct: 129 ALDYFEHV--DTLGRPDLYEGE-------TPVEQVG-YYTDLISDRAAEYVAAD-HGAPF 177
Query: 269 FLYLAHLAAH-------AGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
+L L + A H + G+ + A + T + ++ + Y +V+ +D +
Sbjct: 178 YLQLNYTAPHWPWEGPGDSDVGRDIRARYDREWTRSPLMHLDGGSVAKYGELVEAMDAGI 237
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V++AL G +N+I++F SDNG W N+P+ G K L EGG++V
Sbjct: 238 GRVLNALAAAGRSDNTIVVFTSDNGG-----------ERWSKNWPFVGEKGDLTEGGIRV 286
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
P +L P + S Q + D TL A G + + LPL+ L W
Sbjct: 287 PLLLRWPAALAPGQESDQPVVTMDLTATLLDACGVEPA-LPLDGVSLAPW 335
>gi|325109524|ref|YP_004270592.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969792|gb|ADY60570.1| Steryl-sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 486
Score = 148 bits (374), Expect = 8e-33, Method: Compositional matrix adjust.
Identities = 113/363 (31%), Positives = 174/363 (47%), Gaps = 39/363 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ DL+ +G+ +I TP ID +A G+ N+ Y VC+P+RASLMTG + I G+ G +
Sbjct: 41 GYGDLACYGAKDIATPAIDRMATEGVKCNSFYVSAVCSPTRASLMTGSHSIRVGIGG-VM 99
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
+ G+ E LPE L++ GY+T IGKWHLG + P+ GF+ +G
Sbjct: 100 FPRNNHGLNPDEITLPELLKDQGYATAIIGKWHLG-NEDMFQPMNHGFDYWYGTPASNNQ 158
Query: 212 YY---------DHILSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYATDLFTKEAVQ 257
+Y D + + Y+R L +R N+ T +T+E ++
Sbjct: 159 FYYPTIKKYAADCVFREGYTRNGILTRETAACPLIRDNVVIEVPADQSQFTQRYTRETIR 218
Query: 258 LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDS 317
I + ++P F+YLAH H L A + +++ +R Y +++LD S
Sbjct: 219 FITENH-EQPFFIYLAHNMPHI-----PLHASE------RFVGSSDRGIYGDTIQELDWS 266
Query: 318 VGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVK 377
G ++ AL+ G+ N+++IF SDNG N GS P +G K + +EGGV+
Sbjct: 267 TGEILRALKELGLDSNTLVIFTSDNG---------PNTGKGGSAGPLKGGKGSTYEGGVR 317
Query: 378 VPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPS 437
VP + P S + M D PTL AGGD + IDG D S LL P+
Sbjct: 318 VPFVARWPGAIPAGIESDEAMTGMDLPPTLTRLAGGDVPDDRV-IDGKD-ISPLLTGKPN 375
Query: 438 RRN 440
++
Sbjct: 376 AKS 378
>gi|374374259|ref|ZP_09631918.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
gi|373233701|gb|EHP53495.1| Cerebroside-sulfatase [Niabella soli DSM 19437]
Length = 476
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 125/397 (31%), Positives = 188/397 (47%), Gaps = 56/397 (14%)
Query: 77 AALTK-STTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRAS 134
AA TK + L L G+ D+ +G +I TP+IDALA G+ Y VC PSR+S
Sbjct: 23 AAQTKPNIVLILADDLGYGDVGVYGQQKIKTPHIDALAKEGLRFTRFYCGTAVCAPSRSS 82
Query: 135 LMTGKYPIHTGMQGPPIWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
L+TG++ HT ++G G +P G +P + L+ GYST GKW LGF
Sbjct: 83 LLTGQHTGHTPIRGN--IGVKPEGQWPLPGNTVTIATLLKSAGYSTGCFGKWGLGFIGTS 140
Query: 192 YTPLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
PL +GF+ +GY ++ + DH+ +++ + VEL+ ++ + Y+
Sbjct: 141 GDPLKQGFDRFYGYNCQTLAHDYFPDHLWNNE--QRVELSNTHEKQTI---------YSA 189
Query: 249 DLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR--- 305
L ++A+Q I Q +P FL LA+ HA L+ P++ Y N +
Sbjct: 190 QLIHEQAMQFIR-QHQRQPFFLLLAYTLPHAA-----LQLPKDDTLYNYYKKLFNEKPVP 243
Query: 306 ------------------TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
YAAMV KLD VG V L+ G+ +N++I+F SDNG
Sbjct: 244 VAEHWNGKGYEPQAYPHAAYAAMVGKLDAYVGEVQQQLKVLGLDKNTMILFASDNGP--- 300
Query: 348 EYRETSNY-RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPT 406
+RE N + SN RG+K L+EGG++ P I W P Q S Q+ D LPT
Sbjct: 301 -HREGGNDPAFFNSNGGLRGIKRDLYEGGIRSPFIAWWPGTIQAGTQSNQVGAFWDLLPT 359
Query: 407 LYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNI 443
AG +P N DG+ +LL +++ ++
Sbjct: 360 FAALAG---IGMPANNDGISMLPTLLARGKQQQHDHL 393
>gi|86142609|ref|ZP_01061048.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
gi|85830641|gb|EAQ49099.1| arylsulfatase A [Leeuwenhoekiella blandensis MED217]
Length = 477
Score = 148 bits (373), Expect = 9e-33, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 170/367 (46%), Gaps = 40/367 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G I TPN+D +A G N Y VCTPSRA++MTG+ P+ GM
Sbjct: 43 GYGDLSSYGHPTIKTPNLDKMAMEGQKWTNFYVGSSVCTPSRAAIMTGRLPVRNGMTSKV 102
Query: 151 IWGAEP---RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
+P G+P +E + E L++ GY T A GKWHLG + EY P+ GF+ +G
Sbjct: 103 HRVLQPWSTNGLPQSEITIAEQLKKGGYGTAAFGKWHLGD-KEEYLPVNNGFDYWYGVPY 161
Query: 206 ---------LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAV 256
+N Y++ S++ N MR T +T EA+
Sbjct: 162 SNDMDITLEMNEPQDYWNFWKSEERKEISSFNIPLMRNTEIIERPANQHTITKRYTNEAI 221
Query: 257 QLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
+ I++ + P F+YLA+ H P +F+ R Y +V+++D
Sbjct: 222 EYIKEHK-ENPFFIYLAYNLPHV---------PLFASEEFE--GKSKRGLYGDVVEEIDH 269
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+VG V+ AL+ +G+ EN+I++F SDNG V E GS R K + WEGG+
Sbjct: 270 NVGRVMEALKEQGIAENTIVVFTSDNGPWLVMDEEG------GSAGLLRDGKGSTWEGGM 323
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQWSSLLLN 434
+ P I WSP P + ++ D T AG LP + +DG D S+L
Sbjct: 324 REPGIFWSPG-NLKPGIVTELGSTMDLFTTFSKLAG---VELPDDRILDGEDLSSTLFEG 379
Query: 435 TPSRRNS 441
S R +
Sbjct: 380 KESPRET 386
>gi|398828648|ref|ZP_10586848.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
gi|398217506|gb|EJN04023.1| arylsulfatase A family protein [Phyllobacterium sp. YR531]
Length = 470
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 102/352 (28%), Positives = 168/352 (47%), Gaps = 38/352 (10%)
Query: 69 TDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-Q 125
T + + L K + ++ G+ DLS +G I TP ID + +G+ L Y+
Sbjct: 22 TGASGKAAVKLNKKPNIIFILADDLGYADLSSYGHPTIRTPAIDKIGNDGVRLLQAYSNS 81
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
PVC+ +R ++MTG+Y + G+P ++ LP L++ GY T IGKWHL
Sbjct: 82 PVCSATRTAIMTGQYQYRLALGLEEPLAGRDIGLPPSQTTLPSLLKQAGYETTLIGKWHL 141
Query: 186 GFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE 245
G + + Y PL G++ +G+ +SYY+H G D + + + G
Sbjct: 142 GAYPK-YGPLKSGYDHFYGFRGSALSYYNH-------------GKDFWEDDAPV-EKAG- 185
Query: 246 YATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAH-----AGNAGKHLEAPQETINQFQYI 299
Y TDL + V+LI+ ++P F + A H G GK A ++ N
Sbjct: 186 YFTDLLGDKTVELIQKSDSCERPFFASVHFNAPHWPWEAPGEQGKKEAARKK--NSLAAF 243
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
+ Y +V+++D VG ++ AL K + EN+I++F SDNG +
Sbjct: 244 DAGTQDVYRQIVERMDFQVGRILQALTDKKIRENTIVVFTSDNGG-----------ERFS 292
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
+ +P+ G K+ L EGG+++P+++ P + + S Q+ DWLPTL AA
Sbjct: 293 NTWPFSGKKSELLEGGIRIPSLICWPAKIPSGQTSEQVTLSMDWLPTLLEAA 344
>gi|423214668|ref|ZP_17201196.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
gi|392692574|gb|EIY85811.1| hypothetical protein HMPREF1074_02728 [Bacteroides xylanisolvens
CL03T12C04]
Length = 453
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 116/345 (33%), Positives = 175/345 (50%), Gaps = 47/345 (13%)
Query: 79 LTKSTTLTLLIVY---GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
+ KST LLI+ G DLS + ++ TPNID + G+ L+N YA V +PSRA+
Sbjct: 26 VDKSTPNVLLILVDDLGLGDLSCQYAKDLSTPNIDRIFETGVRLDNFYANSSVSSPSRAA 85
Query: 135 LMTGKYPIHTGMQG---PPI---WG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
L+TG++P G+ G P + WG +P V +PE L+ GY T IGKWHLG
Sbjct: 86 LLTGRFPAMVGVPGVIRPSVDQNWGYFDPSAVT-----MPEVLKNGGYRTALIGKWHLG- 139
Query: 188 FRREYTPLYRGFESHFGYLNGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEY 246
+ P RGF+ G+L ++ YY H Q + LN ++ D G +
Sbjct: 140 WESPNLPNERGFDHFHGFLADMMDDYYTH--RRQGGNYMYLNDKEI--------DPKG-H 188
Query: 247 ATDLFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP--- 302
AT+LFT +V I+ + +K P FLYLA+ A H+ L+ P E +N+ Q
Sbjct: 189 ATELFTSWSVDYIKKEAKEKNPFFLYLAYNAPHS-----PLQPPVEWVNKVQERDKSLPV 243
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R A+++ LD ++G VI +L+ G L N+++IF SDNG + + +N
Sbjct: 244 KRARLIALIEHLDYNIGKVIQSLEESGQLNNTLVIFASDNGG---------DRGSMANNG 294
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
P RG K ++EGG+ V L P + + R + + D +PT+
Sbjct: 295 PTRGAKGDMFEGGIHVACALNMPGVFEGGRRDNHFVVMMDLMPTI 339
>gi|119475675|ref|ZP_01616028.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
gi|119451878|gb|EAW33111.1| arylsulfatase A [marine gamma proteobacterium HTCC2143]
Length = 479
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 123/410 (30%), Positives = 188/410 (45%), Gaps = 58/410 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ +G I +PN+D +A GI N YA VCTPSRA L+TG+ P+ +GM
Sbjct: 49 GYGDIGAYGHPTIRSPNLDQMAAEGIKWTNFYAASSVCTPSRAGLLTGRLPVRSGMAHDQ 108
Query: 151 IWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY-- 205
I P G+P TE + + L+E Y T +GKWHLG + PL GF+ +FG
Sbjct: 109 IRVLFPTSTGGLPTTEITIAKALKEKDYRTALVGKWHLGHLPG-FQPLDHGFDEYFGIPY 167
Query: 206 -----LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
L +SY I +++ + N M+ T +T+EAV I+
Sbjct: 168 SNDHDLKKELSYIQTI---THAKDGDFNVPLMQNRSIIERPANQNTITKRYTQEAVSFIK 224
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGT 320
++P FLYLAH H P +QF+ +D R Y +++++D SVG
Sbjct: 225 KNS-NQPFFLYLAHSMPHV---------PLFASDQFRGSSD--RGLYGDVIEEIDWSVGQ 272
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
V+S L +G+ EN++++F SDNG + + GS + K T +EGG++ PA
Sbjct: 273 VLSTLSEQGISENTLVVFTSDNGPWLIM------GAHGGSAGLLKSGKGTSYEGGMREPA 326
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRN 440
I W P+ + P V+ D PT+ + AG ++ PS R
Sbjct: 327 IFWWPE-KIKPAVAHNTASTLDLFPTIMSIAG--------------------IDMPSDR- 364
Query: 441 SNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
+ DG D ++ + R ++ +K AVR WK+ T N
Sbjct: 365 -SYDGYDLSPTMFEQKSNERKNIFYYHGDK--IFAVRQGDWKVHFKTVAN 411
>gi|294053911|ref|YP_003547569.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
gi|293613244|gb|ADE53399.1| sulfatase [Coraliomargarita akajimensis DSM 45221]
Length = 469
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 110/336 (32%), Positives = 161/336 (47%), Gaps = 30/336 (8%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL + GS IPTPNID LA G+ Y C PSRA +TG+Y G + P
Sbjct: 43 GYGDLGYTGSKHIPTPNIDRLANEGVECTYGYVTHQYCGPSRAGFLTGRYQQRFGFETNP 102
Query: 151 IWGAEP--RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ GVP +ER E L+ +GY T +GKWH+G ++ + +
Sbjct: 103 PYDRHNTIAGVPASERLFAERLQAVGYKTGIVGKWHIG----SHSIHHPNNRGFDFFFGF 158
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+ +D D E M RN S+ V Y T T EA+ IE D P
Sbjct: 159 LGGGHDFFRVDTREPMDEGYLDPMMRNGSSV--DVEGYLTTQLTDEAIGFIERNEKD-PF 215
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
FL+L++ A HA L+AP+E+I +F ++ RR Y+AMV ++D +G ++S L+ +
Sbjct: 216 FLFLSYNAPHAP-----LQAPEESIAKFSHVEGKERRVYSAMVYEMDLEIGRILSMLEDR 270
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSP--- 385
+ E +II F+SDNG P + +Y SN +RG K ++GGV VP + + P
Sbjct: 271 DLAEETIIFFLSDNGGPPHWDKSQDSYT---SNGAFRGYKGDTYDGGVHVPFLAYWPGTL 327
Query: 386 ---QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
+ +P VSL + T GGD S +
Sbjct: 328 PAGKQFTSPVVSLDIAR------TAVALGGGDESEM 357
Score = 47.8 bits (112), Expect = 0.016, Method: Compositional matrix adjust.
Identities = 31/87 (35%), Positives = 45/87 (51%), Gaps = 8/87 (9%)
Query: 1 MRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 60
M RN S+ V Y T T EA+ IE D P FL+L++ A HA L+AP+E
Sbjct: 182 MMRNGSSV--DVEGYLTTQLTDEAIGFIERNEKD-PFFLFLSYNAPHAP-----LQAPEE 233
Query: 61 TINQFQYITDPNRRTYAALTKSTTLTL 87
+I +F ++ RR Y+A+ L +
Sbjct: 234 SIAKFSHVEGKERRVYSAMVYEMDLEI 260
>gi|32477899|ref|NP_870893.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
1]
gi|32448456|emb|CAD77971.1| iduronate-sulfatase or arylsulfatase A [Rhodopirellula baltica SH
1]
Length = 1012
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 140/484 (28%), Positives = 209/484 (43%), Gaps = 68/484 (14%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
++ R A TK + +L G+ DLS G+ + TP ID +A G L + Y A
Sbjct: 557 VSPAGREKTAETTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 616
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
PVCTPSRA LMTG YP M +G +P+G+ E + E L+ GY T G
Sbjct: 617 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLKTAGYRTGMFG 676
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
KWHLG + E+ P +GF+ FG Y H + + R + + + +
Sbjct: 677 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNHYHFPPLPLLQNDTVI 729
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
D ++ T T++AV IE D+P FLYL H HA G A + A
Sbjct: 730 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEGVADDVIAA 788
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
++ Y T N + + ++D SVG ++ AL+ G+ E ++++F SDNG P
Sbjct: 789 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 843
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
+ + Y + G RG K T +EGG++ P ++ W QI + ++M D LPT
Sbjct: 844 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 897
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
AG P+ R IDG D W +L T + ++ +
Sbjct: 898 AKLAGA--------------------AIPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 934
Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
+ AAVR WKL + NG Y KV ++ N V K L+
Sbjct: 935 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQHQLKDF 990
Query: 526 SQNI 529
+ +I
Sbjct: 991 AADI 994
Score = 105 bits (262), Expect = 8e-20, Method: Compositional matrix adjust.
Identities = 119/462 (25%), Positives = 186/462 (40%), Gaps = 79/462 (17%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
T A + + L+ V G+ DL +G+ ++ TPNID LA G + + A VCTPS
Sbjct: 32 TSVAAERPPNVVLIFVDDLGYGDLGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPS 91
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
R L+TG+YP+ M G IWG P L + + + + GY+T +GKWHLGF
Sbjct: 92 RYGLLTGQYPVRA-MGGQGIWGPLPTTSGLIIDTNTKTIGKVFKNKGYATACLGKWHLGF 150
Query: 188 FRREYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR 233
P GF+ +FG +N Y D I S + G +
Sbjct: 151 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVS 210
Query: 234 --------------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHL 275
S A Y + L T+ AV+ I ++ ++P FLY A
Sbjct: 211 PTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KNEPFFLYFATP 269
Query: 276 AAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSI 335
H P +F+ + Y V +LD VG ++ +L+ G+ +N++
Sbjct: 270 NIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTL 318
Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVS 394
++F SDNGA + N G K +WEGG +VP I W +I+
Sbjct: 319 VLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK------ 372
Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLL 453
AG + +L +D +S+L PS D ++ +LL
Sbjct: 373 ----------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALL 414
Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+ + L+ + R A+R W L +G + +G +G
Sbjct: 415 DDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 455
>gi|116621986|ref|YP_824142.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
gi|116225148|gb|ABJ83857.1| sulfatase [Candidatus Solibacter usitatus Ellin6076]
Length = 461
Score = 148 bits (373), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 118/343 (34%), Positives = 162/343 (47%), Gaps = 49/343 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +GS I TPNID LA G + Y A PVC+PSRA+LMTG+YP T ++ P
Sbjct: 39 GYGDLGCYGS-PIATPNIDRLAEEGARFTSFYSASPVCSPSRAALMTGRYP--TRVEVPV 95
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ G G+P +E + + L+ GY T IGKWH+G Y P RGF+ FG +
Sbjct: 96 VLGPGDAGLPDSEITMAQVLKSAGYRTSCIGKWHIGS-TPGYLPTNRGFDEFFG-----V 149
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
Y I R + ++ A D T FT+EA+ + + D P FL
Sbjct: 150 PYSADITPCPLMRGSSV--------VAPAVDC--STLTSSFTQEALDFMR-RAQDNPFFL 198
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
YLAH A H A A Q + YA +V++LD S G V++AL+ G+
Sbjct: 199 YLAHTAPHLPLAASPRFAGQSGLGM-----------YADVVQELDWSTGQVMAALKATGL 247
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNW--GSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
N++++F SDNG W GS RG K +EGG++ P + P +
Sbjct: 248 DSNTLVMFSSDNGP-------------WYQGSQGKLRGRKGETYEGGMREPFLARYPGVI 294
Query: 389 QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+ + D LPTL AG T PL DG+D W L
Sbjct: 295 PSGIGCAGLATTMDLLPTLARLAGAQTPSNPL--DGVDIWPVL 335
>gi|160891517|ref|ZP_02072520.1| hypothetical protein BACUNI_03968 [Bacteroides uniformis ATCC 8492]
gi|317478374|ref|ZP_07937538.1| sulfatase [Bacteroides sp. 4_1_36]
gi|156858924|gb|EDO52355.1| arylsulfatase [Bacteroides uniformis ATCC 8492]
gi|316905533|gb|EFV27323.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 448
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 133/410 (32%), Positives = 186/410 (45%), Gaps = 83/410 (20%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G+ I TPNID +A G+ Y V TPSR++LMTG+ P+ G+ G
Sbjct: 43 GYGDLGCYGNPTIHTPNIDRMASEGMKFTQFYVGASVSTPSRSALMTGRLPVRNGLYGDK 102
Query: 151 ---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
++ G+ E + + L++ GY+T +GKWHLG F Y P GF+ FG
Sbjct: 103 RDVLFPDSKAGLGQDEVTVAKLLQQNGYATACVGKWHLGHFS-PYLPTDHGFDDFFG--- 158
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA------TDLFTKEAVQLIED 261
I Y +DMR D+V E T +T+ AV I+
Sbjct: 159 --IPY----------------SNDMRPASLMRGDSVLEIKPEQGELTRRYTEYAVDYIKK 200
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
++P FLYLAH H P T +F+ + NR Y +V++LD SVG V
Sbjct: 201 NK-NRPFFLYLAHTFPHT---------PLHTNERFEGRS--NRGLYGDVVEELDWSVGEV 248
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ G+ EN+++IF SDNG V+ N GS R K+T WEGG++VPA+
Sbjct: 249 LKALKENGLDENTLVIFTSDNGPWLVQ------RLNGGSAGALRQGKSTCWEGGLRVPAV 302
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL----NIDGLDQWSSLLLNTPS 437
P + P MM D LPT AG +PL +DG+DQ L
Sbjct: 303 FRMPG-RIAPCTQQGMMATMDILPTFLNMAG-----IPLPEGVTLDGMDQTPMLW----E 352
Query: 438 RRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGT 487
+R S+ D + W R S L+ AVR WK+ T
Sbjct: 353 QRESSRDEMYYW----------RGSHLL---------AVRKGPWKIHFST 383
>gi|87307904|ref|ZP_01090047.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
gi|87289518|gb|EAQ81409.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Blastopirellula
marina DSM 3645]
Length = 459
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 113/346 (32%), Positives = 176/346 (50%), Gaps = 27/346 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL G +I TPN+D +A G+ L + YA VC PSR LMTG + H ++G
Sbjct: 26 GYGDLGCFGQQKISTPNLDQMAQEGMRLTDFYAGCTVCAPSRCVLMTGLHTGHCFIRGNA 85
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG-- 208
P V + E L+ GY T +GKW LG P +GF+ +GYL+
Sbjct: 86 RDNLRPSDVTVAE-----VLKAAGYQTALMGKWGLGHEGSNGVPTRQGFDYFYGYLDQGH 140
Query: 209 VISYYDHILSDQYSRT----VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+YY L R V LN + + + A + V EY+ DL +EA+ ++
Sbjct: 141 AHNYYTTFLMRNEERVALKNVPLNEDESGKGV--AKEKV-EYSHDLIMEEALAFVDRAAK 197
Query: 265 -DKPLFLYLAHLAAHAGN-AG-KHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
D P FLYL+ HA N AG K +E P + +P ++ +AAM+ ++D +GT+
Sbjct: 198 KDAPFFLYLSLTIPHANNQAGDKGMEVPDYGQYADKDWPEP-QKGHAAMISRMDRDLGTL 256
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG-SNYPYRGVKNTLWEGGVKVPA 380
+ L++ G+ +N+++IF SDNG ++E N ++ SN P +G+K +L EGG++VP
Sbjct: 257 FAQLKQDGIDDNTLVIFTSDNGP----HKEGGNDPDFADSNGPLQGIKRSLHEGGIRVPT 312
Query: 381 IL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
I+ W ++ + D +PTL A G + +P IDG+
Sbjct: 313 IVRWPGHVKADSESDFAGA-FWDVMPTL-AAVAGVSKEVPAGIDGI 356
>gi|348516447|ref|XP_003445750.1| PREDICTED: N-acetylgalactosamine-6-sulfatase-like [Oreochromis
niloticus]
Length = 525
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 119/383 (31%), Positives = 175/383 (45%), Gaps = 42/383 (10%)
Query: 78 ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
A T + + L+ GW DL G TPN+DA+A G++L N Y A P+C+PSRA+L+
Sbjct: 28 APTPNIIIMLMDDMGWGDLGVFGQPSKETPNLDAMAAEGMLLPNFYTANPLCSPSRAALL 87
Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
TG+ PI G A R G+ E LP+ L+ GY +K +GKWHLG
Sbjct: 88 TGRLPIRNGFYTT---NAHARNAYTPQEIVGGISKDEILLPQLLKTKGYVSKIVGKWHLG 144
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVEL-NGHDM----RRNLSTAWD 241
R +Y PL GF+ FG N Y+ DQ + + N +M + +
Sbjct: 145 H-RPQYLPLKNGFDEWFGSPNCHFGPYN----DQSKPNIPVYNNSEMLGRFYEDFKIDRN 199
Query: 242 TVGEYATDLFTKEAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
T T ++ E + I Q +P FLY A A HA +++
Sbjct: 200 TGESNLTQIYLMEGLDFILRQTKAQQPFFLYWAVDATHA-----------PVYASKRFLG 248
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
R Y V +LD S+G ++ L+ G+ N+ + F SDNGA + S GS
Sbjct: 249 KSQRGRYGDAVMELDYSIGQILKWLRTLGIDNNTFVFFTSDNGAAVMSGPNES-----GS 303
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
N P+ K T +EGG++ PAI W P + V+ Q+ ++ D T AG + +
Sbjct: 304 NGPFLCGKETTFEGGMREPAIAWWPGHIKEGSVNFQLANVMDLFTTSLVLAGINPPDDRI 363
Query: 421 NIDGLDQWSSLLLNTPSRRNSNI 443
+DGLD LL ++ +N I
Sbjct: 364 -LDGLDLTPVLLNSSQPLKNRPI 385
>gi|436835023|ref|YP_007320239.1| sulfatase [Fibrella aestuarina BUZ 2]
gi|384066436|emb|CCG99646.1| sulfatase [Fibrella aestuarina BUZ 2]
Length = 485
Score = 147 bits (372), Expect = 1e-32, Method: Compositional matrix adjust.
Identities = 151/532 (28%), Positives = 224/532 (42%), Gaps = 103/532 (19%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASL 135
AA + L + G+ DLS G+ TPN+D +A G N + AQPVCT SRASL
Sbjct: 33 AAQKPNVVLFFMDDLGYGDLSCTGAMGYLTPNLDRMAAEGTRFTNFLAAQPVCTASRASL 92
Query: 136 MTGKYPIHTGMQGPPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
+TG YP G+ G +G + G+ E L E L+E GY+T GKWHLG ++ + P
Sbjct: 93 LTGCYPNRIGIAG--AYGPSSTVGLNPDEETLAELLKEQGYATGIFGKWHLG-SKQAFLP 149
Query: 195 LYRGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
L GF+ +FG Y + + ++ +Y + G+ + D GE T T
Sbjct: 150 LQHGFDEYFGVPYSHDMWPHHPDQAHAKYPPLYLIEGNQPTNEVKYLTD-AGEL-TAAIT 207
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
AV I Q P FLY+ H H P QF+ + R + ++
Sbjct: 208 DRAVSFIR-QHRKAPFFLYIPHPLPH---------VPLAASPQFRGKS--RRGLFGDVLM 255
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
+LD SVG V+ L+ +G+ +N++++FM DNG P + Y + + GS R K T +
Sbjct: 256 ELDWSVGQVMQELKAQGLDKNTLVLFMGDNG-PWLNYGDHA-----GSTAGLREGKGTTY 309
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG +VP ++ P + RVS +++ D LPT+ G + P IDG+D W++LL
Sbjct: 310 EGGNRVPCLVRWPGVVPAGRVSNKLLTTMDVLPTVAKLCGARQPKRP--IDGVD-WTALL 366
Query: 433 LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGT 492
++ TP R K AVR WKLV
Sbjct: 367 KGDDTQ-----------------TPRDR---FYYYYRKNNLEAVRKGDWKLVFAH----- 401
Query: 493 MDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPISNIDKMRSTRQQATIHCGA 552
+TY+ L +N
Sbjct: 402 ----------------------PGRTYEGF-PLGRN-----------------------G 415
Query: 553 NPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKYHRRTL 604
P P T + N P L++L +DP E+ ++ + PDI L +L R L
Sbjct: 416 QPGPSTEN--FNFPTALYDLAHDPGERYDVQAQHPDIVQSLQKLADEARADL 465
>gi|16125424|ref|NP_419988.1| sulfatase [Caulobacter crescentus CB15]
gi|13422494|gb|AAK23156.1| sulfatase family protein [Caulobacter crescentus CB15]
Length = 521
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 158/350 (45%), Gaps = 77/350 (22%)
Query: 92 GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G+ND++ +G +PTP ID++A G+ N Y+ C PSRA++MTG+Y G
Sbjct: 44 GYNDITLNGGGVAGGTVPTPAIDSIAKEGVTFANGYSGNATCAPSRAAIMTGRYATRFGF 103
Query: 147 QGPPIWGAEPR------------------------------GVPLTERFLPEYLRELGYS 176
+ P A R VP E + E L+ GY
Sbjct: 104 EFTPTPVAFSRVVGGHAGDPLHPSRFNQAEVKNMPKDENVLAVPAAEVTIAEALKTKGYH 163
Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----------------YDHILSD 219
T +GKWHLG + P +GF+ G++ G + D L
Sbjct: 164 TIHLGKWHLGGVKGS-RPEDQGFDESLGFMAGAALFAPVGDPGVVESRQDWDPIDKFLWG 222
Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
V+ NG + Y TD T EAV+ I D ++P F+YLA+ A H
Sbjct: 223 AAPFAVQFNGGKLF--------NPSHYMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT 273
Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
L+AP+ + +I D R YAAMV+ LD +VG V+ AL+ +G+ +N+++IF
Sbjct: 274 -----PLQAPKADYDALSHIKDHRMRVYAAMVRNLDRNVGKVLQALKDRGLDDNTLVIFT 328
Query: 340 SDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
SDNG +NY N PYRG K T +EGG+KVP +L W Q+
Sbjct: 329 SDNGG--------ANYIGLPDINKPYRGWKATFFEGGIKVPFLLRWPEQL 370
Score = 43.5 bits (101), Expect = 0.32, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD T EAV+ I D ++P F+YLA+ A H L+AP+ + +I D R
Sbjct: 241 YMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT-----PLQAPKADYDALSHIKDHRMR 294
Query: 75 TYAALTKS 82
YAA+ ++
Sbjct: 295 VYAAMVRN 302
>gi|291232668|ref|XP_002736267.1| PREDICTED: galactosamine (N-acetyl)-6-sulfate sulfatase-like
[Saccoglossus kowalevskii]
Length = 518
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 122/372 (32%), Positives = 173/372 (46%), Gaps = 41/372 (11%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L L+ GW DL G+ TPN+D +A G ++ + YA P+C+PSRASL+TG+ PI
Sbjct: 31 LMLMDDMGWGDLGVLGNPAKETPNLDRMASEGALMTDFYAPNPLCSPSRASLLTGRLPIR 90
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G P+ G+P +E LPE L + GY +K IGKWHLG + +Y PL
Sbjct: 91 NGFYTTNDHARCSYTPQYIVGGIPDSEIVLPELLNKAGYRSKIIGKWHLG-HQTQYHPLK 149
Query: 197 RGFESHFGYLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEY-ATDLFTK 253
GF+ +FG N + YD+ R ++ G R D GE T +F +
Sbjct: 150 HGFDEYFGAPNCHVGPYDNKKQPNIPVYRDADMIG---RYYEEFKIDKSGESNLTQMFIE 206
Query: 254 EAVQLIEDQ-PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
EA+ IE Q + FLY A+H +P F + R Y V
Sbjct: 207 EAIAFIEKQHQTGEQFFLYWTPDASH---------SPHYASKSF--LGTSQRELYGDAVM 255
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLW 372
+LD VG ++ L+ + N+ + F SDNGA S GSN P+ K+T +
Sbjct: 256 ELDYGVGQILGKLKELQIENNTFVFFSSDNGA------ALSGGAYVGSNGPFLCGKHTTF 309
Query: 373 EGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL 432
EGG++ P+I W P + +VS Q+ I D T AG + L D + +SLL
Sbjct: 310 EGGMREPSIAWWPGRIKPGKVSHQLGTIMDLFTTALDIAG-----VTLPTDRIIDGTSLL 364
Query: 433 LNTPSRRNSNID 444
P+ N ID
Sbjct: 365 ---PALVNGKID 373
>gi|323452998|gb|EGB08871.1| hypothetical protein AURANDRAFT_71524 [Aureococcus anophagefferens]
Length = 495
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 108/361 (29%), Positives = 167/361 (46%), Gaps = 48/361 (13%)
Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI----------WGAEP 156
P++ ALA +G+ L N Y Q CTP+RA+LM+GK+ TG + WG
Sbjct: 16 PHMLALAQDGVRLTNYYGQAFCTPARAALMSGKFGHRTGFANVDVDSAATLEISAWGNFS 75
Query: 157 RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHI 216
+ FL + L GY +GKW+LG + P RGF+S+ GY I Y H
Sbjct: 76 LANAGDQIFLSQRLATAGYEVHGVGKWNLGHCNAAFLPPSRGFKSYLGYYGAGIGYVSHE 135
Query: 217 LSDQYSRTVELNGH-------DMRR-----NLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ + H DM+R LS A +GEY+T LFT E+++ I+ V
Sbjct: 136 VEQSGNVENSFTRHFRDYSLVDMQRCTPQGCLSDA-SVIGEYSTILFTNESLRRIDGFSV 194
Query: 265 DKPLFLYLAHLAAH---AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+ P +LY+A+ H NA L+ P + + RR + + +D VG +
Sbjct: 195 ENPTYLYVAYHGVHDDKQSNASG-LDYPAAALEALE----ERRRNFGIALHAVDHGVGLL 249
Query: 322 ISALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
L+++ + +++ SDN G+P Y ++ SN PYRG+K +EGG+K+PA
Sbjct: 250 QQRLEKRS--SDYVLVVHSDNGGSPCGTYCDS-------SNAPYRGMKFFDFEGGLKLPA 300
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG---DTSRLPLNIDGLDQWSSLLLNTPS 437
++SP + +MH DW T G D S + D D WS++ N+
Sbjct: 301 FVYSPTRLASRGTYDGLMHHVDWTATFLRGLAGTLLDCS----DCDSRDHWSAISSNSDD 356
Query: 438 R 438
+
Sbjct: 357 K 357
>gi|325111172|ref|YP_004272240.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324971440|gb|ADY62218.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 474
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 120/392 (30%), Positives = 192/392 (48%), Gaps = 42/392 (10%)
Query: 64 QFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIIL 119
+F ++ AL + T +IV+ G+ D+ G+ TPN+D +A G
Sbjct: 2 RFHFVILALLLLAPALATAETPNFVIVFTDDQGYADVGCFGAEGYETPNLDRMAQEGRKF 61
Query: 120 NNMY-AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTK 178
+ Y +Q VC+ SR SL+TG Y + G+ G + G+ E L E +++ GY+T
Sbjct: 62 TSFYVSQAVCSASRTSLLTGCYNVRVGINGA-LSHKSTVGISTEEMTLGELVKQQGYTTA 120
Query: 179 AIGKWHLGFFRREYTPLYRGFESHFG--YLNGVISYY--------DHILSDQYSRTVELN 228
A+GKWHLG+ ++++ PL GF+ +FG Y N + Y+ + Y +
Sbjct: 121 AVGKWHLGY-QQKFLPLQHGFDRYFGLPYSNDMWPYHPRLEGMTLEEARVKGYPDLPLYS 179
Query: 229 GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 288
G+++ T D T +T++A+ I+D DKP LYLAH H
Sbjct: 180 GNEIIDAQVTPEDQ--RQLTTQYTEQALDFIDDNH-DKPFLLYLAHSMPHV--------- 227
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
P ++FQ T+ R Y +++++D SVG V++ L++ G+ +N+++IF SDNG P +
Sbjct: 228 PLFVSDKFQDKTEQGR--YGDVIQEIDWSVGQVLARLKKHGLDDNTLVIFTSDNG-PWLS 284
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
Y + GS P+R K T WEGGV+VP I+ W +I ++ D PT+
Sbjct: 285 YGNHA-----GSAGPFREGKGTSWEGGVRVPCIMRWPGKIPAGTSCD-ELAATIDIFPTV 338
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRR 439
AA D IDG D SL+ +P +
Sbjct: 339 --AALADAPLPDHKIDGKD-IRSLMFESPCAK 367
>gi|398810130|ref|ZP_10568960.1| arylsulfatase A family protein [Variovorax sp. CF313]
gi|398083821|gb|EJL74525.1| arylsulfatase A family protein [Variovorax sp. CF313]
Length = 438
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/338 (33%), Positives = 163/338 (48%), Gaps = 44/338 (13%)
Query: 92 GWNDLSFHGSNEIP----TPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH-TG 145
G+ DL +G + +P +D LA NG+ L YA PVC+P+R ++TG+Y G
Sbjct: 15 GYADLGCYGGRDAVFGPVSPVLDGLAANGLKLTQGYANSPVCSPTRFGMITGRYQYRLRG 74
Query: 146 MQGPPI----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
PI G+ G+P LP L+ GYST IGKWHLG+ + PL G++
Sbjct: 75 AAEEPINSRSRGSATLGLPTDHPTLPSLLKASGYSTALIGKWHLGY-PPSFGPLRSGYDE 133
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE-YATDLFTKEAVQLIE 260
FG ++G + Y+ H D R HD+ S D V E Y TD+ +K AV+ +E
Sbjct: 134 FFGPMSGGVDYFTHC--DSTGR------HDL---WSGEEDKVEEGYLTDILSKRAVEYVE 182
Query: 261 DQPV-DKPLFLYLAHLAAH-----AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
+ P FL L + A H +A K AP N F + N Y M+ +
Sbjct: 183 RMAKQEAPFFLSLHYTAPHWPWETRDDAAK---APAVKDNLFD-LAGGNIHVYRRMIHHM 238
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D+ +G +++ALQ+ GM +N++++F SDNG + N+P G K L EG
Sbjct: 239 DEGIGWIMAALQKHGMADNTLVVFTSDNGG-----------ERFSDNWPLVGGKMDLTEG 287
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
G++VP I P + S Q+ DW T+ AAG
Sbjct: 288 GIRVPWIAHWPAVIGKGGESTQLCMTMDWSATMLDAAG 325
>gi|325108958|ref|YP_004270026.1| sulfatase [Planctomyces brasiliensis DSM 5305]
gi|324969226|gb|ADY60004.1| sulfatase [Planctomyces brasiliensis DSM 5305]
Length = 499
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 121/364 (33%), Positives = 175/364 (48%), Gaps = 49/364 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--G 148
G+ +L +G I TP +D LA GI L Y+ VC PSR LMTGK+P H ++ G
Sbjct: 56 GYQELGCYGQKWIRTPFVDQLAAEGIKLTQFYSGNAVCAPSRCCLMTGKHPGHAWVRNNG 115
Query: 149 PPIWGAEPRG-----------VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYR 197
P E R +P +E + E L++ GY+T AIGKW LG F P +
Sbjct: 116 DPKLPEELREKYQWEFAGQFPLPESEVTVAELLKQKGYATAAIGKWGLGHFGTTGAPDRQ 175
Query: 198 GFESHFGYLNGVISYYDHILSDQYSRTVELN-------GHDMRRNLSTAWDTVGEYATDL 250
G + +G+ N ++H Y R + N G+D N T Y+ D
Sbjct: 176 GIDLFYGF-NSQRHAHNH-----YPRFLWRNDQKEMQPGNDRTLNGET-------YSQDR 222
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAH-------AGNAGKHLEAPQETINQFQYITDPN 303
FT+ A++ I DQ D+P FLYL H + A E P+E Y+ P
Sbjct: 223 FTEVALEFI-DQNKDQPFFLYLPFAIPHLSIQVPESSLAEYTDEIPEEEYKHRGYLKHPK 281
Query: 304 RRT-YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
R YAAM+ +D +G +++ L G+ +N++IIF SDNG PT + S+ + S
Sbjct: 282 PRAGYAAMITHMDRDIGKIMARLNEYGLTDNTLIIFTSDNG-PTYDRLGGSDSDFFESAG 340
Query: 363 PYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
RG+K +L+EGG++VP + W QI L + + D PT+ AAG + P
Sbjct: 341 KMRGLKGSLYEGGIRVPCVARWPGQIPAGNTSDL-ISALWDLFPTIAEAAGVEP---PEE 396
Query: 422 IDGL 425
IDG+
Sbjct: 397 IDGI 400
>gi|149196206|ref|ZP_01873261.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149140467|gb|EDM28865.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 454
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/362 (30%), Positives = 180/362 (49%), Gaps = 44/362 (12%)
Query: 76 YAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTP 130
+A + K+ ++I++ G+ D+S +G+ I TP++D +AY G N Y A PVCTP
Sbjct: 12 FALVAKAEKPNVIIIFADDMGYGDMSCNGNPSIKTPHLDRMAYEGQKWTNFYVAAPVCTP 71
Query: 131 SRASLMTGKYPIHTGM---QGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF 187
SRA L+TG+ PI GM + ++ G+P +E L E + GY T +GKWHLG
Sbjct: 72 SRAGLLTGRLPIRNGMCSNKRRVLFPDSQGGLPQSEYTLAEMFKSAGYETGMVGKWHLG- 130
Query: 188 FRREYTPLYRGFESHFG--YLNGVISYYDHILSDQYSRTVELN-GHDMRRNLSTAW---- 240
+ E+ P+ GF+S +G Y N + + + I + +++ + G S W
Sbjct: 131 HKAEFLPIKHGFDSWYGIPYSNDMDANRELINKVKKDKSLPWHKGKHWEEPKSEYWEVPL 190
Query: 241 ---DTVGEYA------TDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
+TV E A T +T+++ Q I DKP FLYLAH H P
Sbjct: 191 MQGETVLERAPNQELLTKTYTEKSQQFIRVNK-DKPFFLYLAHSMPH---------VPLF 240
Query: 292 TINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE 351
N+F+ ++ Y ++++LD SVG V+ ++ G+ + +I++F SDNG P + ++
Sbjct: 241 RSNKFKDMS--TMGLYGDVIEELDWSVGQVLKTVKEMGLEQQTIVVFTSDNG-PWLSFKT 297
Query: 352 TSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAA 411
G P R K + WEGG++ P ++W P + P + + D + T +
Sbjct: 298 QG-----GIAGPLRDGKGSTWEGGMREPTVIWGPGYVK-PGIIHDLGSTLDMMATFAKMS 351
Query: 412 GG 413
GG
Sbjct: 352 GG 353
>gi|221234167|ref|YP_002516603.1| sulfatase family protein [Caulobacter crescentus NA1000]
gi|220963339|gb|ACL94695.1| sulfatase family protein [Caulobacter crescentus NA1000]
Length = 551
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 110/350 (31%), Positives = 158/350 (45%), Gaps = 77/350 (22%)
Query: 92 GWNDLSFHGSN----EIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G+ND++ +G +PTP ID++A G+ N Y+ C PSRA++MTG+Y G
Sbjct: 74 GYNDITLNGGGVAGGTVPTPAIDSIAKEGVTFANGYSGNATCAPSRAAIMTGRYATRFGF 133
Query: 147 QGPPIWGAEPR------------------------------GVPLTERFLPEYLRELGYS 176
+ P A R VP E + E L+ GY
Sbjct: 134 EFTPTPVAFSRVVGGHAGDPLHPSRFNQAEVKNMPKDENVLAVPAAEVTIAEALKTKGYH 193
Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISY-----------------YDHILSD 219
T +GKWHLG + P +GF+ G++ G + D L
Sbjct: 194 TIHLGKWHLGGVKGS-RPEDQGFDESLGFMAGAALFAPVGDPGVVESRQDWDPIDKFLWG 252
Query: 220 QYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA 279
V+ NG + Y TD T EAV+ I D ++P F+YLA+ A H
Sbjct: 253 AAPFAVQFNGGKLF--------NPSHYMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT 303
Query: 280 GNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFM 339
L+AP+ + +I D R YAAMV+ LD +VG V+ AL+ +G+ +N+++IF
Sbjct: 304 -----PLQAPKADYDALSHIKDHRMRVYAAMVRNLDRNVGKVLQALKDRGLDDNTLVIFT 358
Query: 340 SDNGAPTVEYRETSNYRNWGS-NYPYRGVKNTLWEGGVKVPAIL-WSPQI 387
SDNG +NY N PYRG K T +EGG+KVP +L W Q+
Sbjct: 359 SDNGG--------ANYIGLPDINKPYRGWKATFFEGGIKVPFLLRWPEQL 400
Score = 43.5 bits (101), Expect = 0.34, Method: Compositional matrix adjust.
Identities = 25/68 (36%), Positives = 37/68 (54%), Gaps = 6/68 (8%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 74
Y TD T EAV+ I D ++P F+YLA+ A H L+AP+ + +I D R
Sbjct: 271 YMTDYLTDEAVKAI-DANKNRPFFMYLAYNAVHT-----PLQAPKADYDALSHIKDHRMR 324
Query: 75 TYAALTKS 82
YAA+ ++
Sbjct: 325 VYAAMVRN 332
>gi|392413324|ref|YP_006449931.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
gi|390626460|gb|AFM27667.1| arylsulfatase A family protein [Desulfomonile tiedjei DSM 6799]
Length = 566
Score = 147 bits (371), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 120/418 (28%), Positives = 195/418 (46%), Gaps = 68/418 (16%)
Query: 92 GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
GW D F+G E PTP +D LA +G++L + Y+ P CTP+RA++ TG+ P+H G+
Sbjct: 108 GWMDPGFNGGGESVGNPTPTMDRLARDGLVLTSAYSTPSCTPTRATIHTGQNPLHHGLTR 167
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
PP++G EP G+ + L++LGY T+ +GKWH+G P G+++++G+L G
Sbjct: 168 PPMYG-EPGGLD-GAVTVASVLQKLGYVTQGVGKWHMG-ENEASLPHNVGYDNYYGFL-G 223
Query: 209 VISYYDHILSDQYSRTVELN------------GHDMRRNLSTAWDTVGEYATDLFTKEAV 256
V Y ++ + L+ H+ R + A Y DL T + +
Sbjct: 224 VSDMYTEWRDTYFNPEIALSPGRFALMQKLPFNHNNVRCVKGAKSCENLYEIDLTTIKDL 283
Query: 257 ---------QLIEDQPVD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
+ I D + KP FLY A H N +N P R
Sbjct: 284 DQDWAAYSEKFIRDMKDNTKPWFLYHATRGCHFDN----------YLNDKYAGKSPARTV 333
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRG 366
Y+ + ++DD + ++ L+ G LEN++I+ SDNG P E P+RG
Sbjct: 334 YSDCMVEMDDILARLVKTLEETGQLENTLILLTSDNG-PECEIPPH-------GRTPFRG 385
Query: 367 VKNTLWEGGVKVPA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K + WEGGV+VP + W ++ PR S + ++D T + AG + + +
Sbjct: 386 CKGSNWEGGVRVPTFVYWKGMVK--PRRSDGLFDLADLFNTCISLAGSPGAEVAKFV--- 440
Query: 426 DQWSSLLLNTPSRRNSNIDGLDQWSSLLLNT-PSRRNSVLINIDEKKRTAAVRLDSWK 482
P+ R +DG+DQ S L+ + S R S++ +++ A VR+D +K
Sbjct: 441 ----------PADR--YVDGIDQASFLIGDQGQSNRRSIIYTLNQ--FVAGVRVDEFK 484
>gi|260062069|ref|YP_003195149.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88783631|gb|EAR14802.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
Length = 492
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 133/448 (29%), Positives = 197/448 (43%), Gaps = 72/448 (16%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
+IV+ G+ DLS G I T N+D +A G N Y A VCTPSRA L+TG+ P
Sbjct: 52 FIIVFADDLGYGDLSSFGHPTIHTKNLDRMAAEGQKWTNFYVAASVCTPSRAGLLTGRLP 111
Query: 142 IHTGMQGPPIWGAEP---RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
+ G+ I P G+P +E L E L++ GY+T +GKWHLG + EY P G
Sbjct: 112 VRNGLTSNEIGVFFPDSHNGMPASEITLAEQLKKAGYATGMVGKWHLGH-KEEYLPPNHG 170
Query: 199 FESHFG--------YLNGVISYYDHI--LSDQYS--RTVELNGHDMRRNLSTAWDTVGEY 246
F+ +FG + SY D+ +++Y +T E N +R
Sbjct: 171 FDDYFGIPYSNDMDFTGQFTSYQDYFGRYTERYESLKTEEYNVPLIRGTEEIERPVNQNT 230
Query: 247 ATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRT 306
T + EAV+ I + D+P F+YLAH H P T ++F+ + R
Sbjct: 231 ITKRYNDEAVKWIREHK-DEPFFMYLAHSLPHV---------PLFTSDEFRGTS--ARGL 278
Query: 307 YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPY 364
Y +V+++D VG ++ L+ +G+ EN+I++F SDNG PT + GS
Sbjct: 279 YGDVVEEIDHGVGQIMELLEAEGLAENTIVVFTSDNGPWLPTG--------ISGGSAGLL 330
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
R K T WEGG++ P I W+P + +V + M D T + AG
Sbjct: 331 REGKGTTWEGGMREPTIFWAPGMLP-AKVVMDMGSTLDLFNTFSSLAG------------ 377
Query: 425 LDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
P + +DG+D L + S R + + AVRL ++K
Sbjct: 378 ----------VPMPDDREMDGVDLSPILFGDAESPRKEMFYY--QGADLYAVRLGAYKAH 425
Query: 485 LGTQENGTMDGYYGQTRSNKVPLLNFNA 512
T+E M G R P L +N
Sbjct: 426 FYTKEAYVM----GAERVEHNPPLLYNV 449
>gi|410635289|ref|ZP_11345904.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
gi|410145262|dbj|GAC13109.1| N-acetylgalactosamine-6-sulfatase [Glaciecola lipolytica E3]
Length = 493
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/371 (30%), Positives = 173/371 (46%), Gaps = 56/371 (15%)
Query: 60 ETINQFQYITDPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIIL 119
ET+ + ++ AAL + + G+ D+S G++ I TPNID++ G
Sbjct: 21 ETLKETSTESNKGSVQKAALKPNIVIIFADDLGYGDISSFGADGIRTPNIDSIGQEGFTS 80
Query: 120 NNMYAQP-VCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGV-PLTERF------------ 165
+ + VC+PSRASL+TG+YP+ GM P V PL+E+
Sbjct: 81 RDFFIPANVCSPSRASLLTGRYPMRNGM---------PVAVNPLSEKHVSSHFGLHPDEI 131
Query: 166 -LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRT 224
+PE L+ GY + +GKWHLGF ++ PL GF+ H G L +YY +D
Sbjct: 132 TIPEMLKPAGYRSLMVGKWHLGFQQKGSHPLDAGFDEHLGLLG---NYYKARENDPRYPI 188
Query: 225 VELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGK 284
++ N R + + + E T +T E + IE + D P F+Y AH H
Sbjct: 189 LKDNQTLYRGHEAVKEEIELEEVTQRYTDEVISFIEREK-DGPFFVYFAHNIVH------ 241
Query: 285 HLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
+P +FQ ++ + Y + ++D S G +++ L+R + EN+++IF SDNGA
Sbjct: 242 ---SPHLPRKEFQGTSEKGK--YGDFIHEMDYSTGRILATLKRLNLEENTLVIFTSDNGA 296
Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI---QQNPRVSLQMMHIS 401
NY G+N GVK T EGG VPA+L P + Q+ + +L M
Sbjct: 297 ---------NYA--GNNGELAGVKYTTMEGGHIVPAMLKWPSVIPAGQDSKTTLTSM--- 342
Query: 402 DWLPTLYTAAG 412
D P AG
Sbjct: 343 DLFPLFADIAG 353
>gi|440714613|ref|ZP_20895192.1| arylsulfatase A [Rhodopirellula baltica SWK14]
gi|436440809|gb|ELP34113.1| arylsulfatase A [Rhodopirellula baltica SWK14]
Length = 470
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 114/381 (29%), Positives = 185/381 (48%), Gaps = 50/381 (13%)
Query: 81 KSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASL 135
KS +++++ G+NDL +GS I TPN+D LA G + Y A VC+PSRA+L
Sbjct: 24 KSDKPNVIVIFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAAL 83
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
+TG YP G+ ++ G+ E + ++L+ GY+T +GKWHLG +E P
Sbjct: 84 LTGCYPKRVGLHQHVLFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPT 142
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGE 245
GF+S++G I Y + + R +++ D + S+A W+T + E
Sbjct: 143 SNGFDSYYG-----IPYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIE 197
Query: 246 YATD------LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYI 299
D +T A++ +E DKP FLYL H H L P++ Y
Sbjct: 198 LPVDQRTVTRRYTDRAIEFVEANQ-DKPFFLYLPHSMPHI-----PLYVPEDV-----YD 246
Query: 300 TDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG 359
DP + Y +++ +D VG ++ ++ G+ E ++I++ SDNG P ++++ + G
Sbjct: 247 PDP-QNAYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGG 299
Query: 360 SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSR 417
S P R K T +EGG +VP I+W+P S D LPT+ +T + R
Sbjct: 300 SAGPLRAGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDR 359
Query: 418 LPLNIDGLDQWSSLLLNTPSR 438
IDG+D S+ + +R
Sbjct: 360 ---KIDGIDLTSTFTSDESAR 377
>gi|421612498|ref|ZP_16053605.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
gi|408496794|gb|EKK01346.1| arylsulfatase A, partial [Rhodopirellula baltica SH28]
Length = 487
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 112/364 (30%), Positives = 177/364 (48%), Gaps = 42/364 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+NDL +GS I TPN+D LA G + Y A VC+PSRA+L+TG YP G+
Sbjct: 58 GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 117
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++ G+ E + ++L+ GY+T +GKWHLG +E P GF+S++G I
Sbjct: 118 LFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 171
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
Y + + R +++ D + S+A W+T + E D +T
Sbjct: 172 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDR 231
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
A++ +E DKP FLYL H H L P++ Y DP + Y +++ +
Sbjct: 232 AIEFVEAN-QDKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHV 279
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D VG +I ++ G+ E ++I++ SDNG P ++++ + GS P R K T +EG
Sbjct: 280 DTEVGRLIQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 333
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G +VP I+W+P S D LPT+ + G + IDG+D S+ +
Sbjct: 334 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGV-ALKNDRKIDGVDLTSTFTSD 392
Query: 435 TPSR 438
+R
Sbjct: 393 ESAR 396
>gi|223940145|ref|ZP_03632007.1| sulfatase [bacterium Ellin514]
gi|223891162|gb|EEF57661.1| sulfatase [bacterium Ellin514]
Length = 508
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 186/375 (49%), Gaps = 47/375 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D+ G +I TPNID +A G+ Y+ PVC PSR LMTGK+ H+ ++
Sbjct: 49 GYADVGCFGQKKIHTPNIDRIATEGMKFTQHYSGSPVCAPSRCVLMTGKHSGHSAVRD-- 106
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+P G +P + L++ GY T A GKW LG PL +GF FGY
Sbjct: 107 NRELKPEGQFPLPANTITVARLLQQNGYITGAFGKWGLGGPESSGKPLDQGFTRFFGYNC 166
Query: 208 GVISY--YDHILSDQYSRTVELN---GHDMR------RNLSTAWD--TVGEYATDLFTKE 254
+++ + L D R N G D + N ++ T YA DL+ ++
Sbjct: 167 QRVAHNLFPTYLWDDNHRLALDNPPIGEDQKLPADADSNDPASYKAFTGKSYAPDLYAEQ 226
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-------------YITD 301
A++ I D D P FL+ + H L+ P++++ +++ Y+
Sbjct: 227 ALRFIRDN-KDHPFFLFFPTIVPHVA-----LQVPEDSLKEYEGKLPETPYTGGKGYL-- 278
Query: 302 PNR---RTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNW 358
PNR YAAM+ ++D +G +++ ++ + +++I +F SDNG P + ++ + +
Sbjct: 279 PNRTPHAAYAAMITRMDRDLGRMLALIKELNLDDDTIFVFTSDNG-PAPQDMGGTDTKFF 337
Query: 359 GSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSR 417
S+ P+R K +++EGG+++P I+ W +IQ N S ++ DWLPTL +G S
Sbjct: 338 NSSGPFRSGKTSIYEGGMRIPLIVRWHGKIQPN-STSDRVTGFEDWLPTLLELSGNKKS- 395
Query: 418 LPLNIDGLDQWSSLL 432
+P IDGL S+LL
Sbjct: 396 VPTGIDGLSFASTLL 410
>gi|343086300|ref|YP_004775595.1| sulfatase [Cyclobacterium marinum DSM 745]
gi|342354834|gb|AEL27364.1| sulfatase [Cyclobacterium marinum DSM 745]
Length = 454
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/348 (33%), Positives = 179/348 (51%), Gaps = 54/348 (15%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYP 141
+LI++ G++D+S++G+ +I TP+IDA+ +G+ + Y PVC P+RASLM+G+YP
Sbjct: 34 ILIIFTDDQGYHDVSYYGTEDIQTPHIDAIRNDGMQFDYFYTNSPVCAPTRASLMSGRYP 93
Query: 142 IHTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
H G+ G WG P + LP L++ GY T IGKW+LG P
Sbjct: 94 DHVGVPGLVRSNPANNWGYLNPETI-----LLPAMLKKGGYHTAHIGKWNLGLMHPN-LP 147
Query: 195 LYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
GF+ G+L ++ Y T +G + R D G +ATDLFT+
Sbjct: 148 NDHGFDYFHGWLEDMMDDY---------LTHRRHGKNYMRLNEEIIDPKG-HATDLFTQW 197
Query: 255 AVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ---ETINQFQYITDPNRRTYAAM 310
+V I E + ++P FLYLA+ A H ++ P+ E +N+ + R A
Sbjct: 198 SVNYIKEREKEERPFFLYLAYNAPHFP-----VQPPKSWVEKVNKREPGLPEKRANLIAF 252
Query: 311 VKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNT 370
++ LDD +G VI+AL+ G EN++IIF SDNG + + +N R K +
Sbjct: 253 IEHLDDGIGQVINALKETGQYENTLIIFTSDNGG---------HLPDLANNGLLRDGKQS 303
Query: 371 LWEGGVKVP-AILWSPQIQQ---NPRVSLQMMHISDWLPTLYTAAGGD 414
++EGG++VP AI W +I+ + +V+L M D PTL AG D
Sbjct: 304 MYEGGLRVPTAISWPAKIKAGSTSDQVNLSM----DIFPTLTEIAGVD 347
>gi|421613760|ref|ZP_16054831.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
gi|408495346|gb|EKJ99933.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Rhodopirellula
baltica SH28]
Length = 485
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 117/394 (29%), Positives = 175/394 (44%), Gaps = 67/394 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGM---- 146
G+NDL G N I TP +D LA G L N Y A P CTPSRASL+TG+YP G+
Sbjct: 41 GYNDLGQLG-NGIITPALDRLAKEGTRLTNFYVAWPACTPSRASLLTGRYPQRNGIYDMI 99
Query: 147 -QGPPIWG------------AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
P +G G+ E LP LR GY + GKW LG +R
Sbjct: 100 RNEAPDYGHRYTPEEYAVTFERIGGMDEREVILPAMLRPAGYKSGIYGKWDLGALQR-ML 158
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTK 253
P RGF+ +G++N I Y+ H ++Y M RNL G Y T LF +
Sbjct: 159 PTSRGFDDFYGFVNTGIDYFTH---ERYGVPC------MVRNLEPTEADKGTYCTYLFQR 209
Query: 254 EAVQLIEDQPVDKPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDP---------- 302
EA++ +++ ++P FLY+ A H + + + + + +QF+ + P
Sbjct: 210 EALRFLDEHAGNEPFFLYVPFNAPHNSSSLDPTIRSSVQAPDQFKAMYPPVEVETRVTDR 269
Query: 303 --------------NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
RR Y A V +D ++G ++ L+ K ML+ ++++F SDNG
Sbjct: 270 YRYGSPATVATPQARRRDYRAAVTCMDAAIGEMLDRLEAKQMLDETMVVFFSDNGG---- 325
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
N P RG K WEGG++VP ++ W V+ + + + LP+
Sbjct: 326 -------SGGADNSPLRGHKAQTWEGGIRVPCLVRWPAGKIPAGVVNDEFLTSLELLPS- 377
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNS 441
+ AA G + +DG + W +L S R
Sbjct: 378 FAAAAGVEPPPGVVLDGFNWWPTLRGEAESPRKE 411
>gi|327289638|ref|XP_003229531.1| PREDICTED: n-acetylgalactosamine-6-sulfatase-like [Anolis
carolinensis]
Length = 521
Score = 147 bits (370), Expect = 2e-32, Method: Compositional matrix adjust.
Identities = 157/573 (27%), Positives = 239/573 (41%), Gaps = 122/573 (21%)
Query: 78 ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLM 136
A + + L L+ GW DL G TP++D +A G++ + Y A P+C+PSRA+L+
Sbjct: 25 ATSPNLVLLLMDDMGWGDLGVFGQPSKETPHLDQMAAEGMLFPSFYTANPLCSPSRAALL 84
Query: 137 TGKYPIHTGMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLG 186
TG+ PI G A R G+P +E LPE L++ GY+ K +GKWHLG
Sbjct: 85 TGRLPIRNGFY---TTNAHARNAYTPQDITGGIPDSELLLPELLKKAGYANKIVGKWHLG 141
Query: 187 FFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYS--RTVELNG---HDMRRNLSTAWD 241
+ ++ PL GF+ FG N YD+ + R E+ G D++ N T
Sbjct: 142 H-QPQFHPLRHGFDEWFGSPNCHFGPYDNKAAPNIPVYRDWEMIGRYYEDIKINHKTGES 200
Query: 242 TVGEYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT 300
+ T ++ +EA+ + Q +P FLY A A HA ++
Sbjct: 201 NL----TQMYLQEALDFMSSQQAKQQPFFLYWAIDATHA-----------PVYASRAFLG 245
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
R Y V+++DDSVG ++S L++ G+ E++ + F SDNGA V E + GS
Sbjct: 246 SSQRGRYGDAVREIDDSVGKILSHLRKLGISEDTFVFFTSDNGAALVSAPEQA-----GS 300
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSP-QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
N P+ K T +EGG++ PAI W P I VS Q+ + D T AG
Sbjct: 301 NGPFLCGKQTTFEGGMREPAIAWWPGHIPAGQLVSHQLGSVMDLFATSLALAG------- 353
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLD 479
P + IDG+D +L R +E AVR+
Sbjct: 354 ---------------LPPPGDREIDGIDLSPVVLQGKLIDRPIFYYRGNE---MMAVRVG 395
Query: 480 SWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNI-FLPISNIDK 538
+K T N S ++ Q I F P N+
Sbjct: 396 LYKAHYWTWSN------------------------------SWEEFGQGIDFCPGQNVSG 425
Query: 539 MRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRP---DISSQLYE 595
+ S Q+ + P+ LF+LG DP E+ I+ S D+ ++
Sbjct: 426 VTSHDQEEH-----SKLPL-----------LFHLGKDPGERFPISFSSAEYQDVMGRISP 469
Query: 596 LLKYHRRTLVPQSHEQPDLVQADPKRFNDTWSP 628
+++ H+ T++P QP L D N W+P
Sbjct: 470 VVEQHKETMLPG---QPQLNVCDKAVMN--WAP 497
>gi|311746665|ref|ZP_07720450.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|311302556|gb|EAZ82500.2| sulfatase family protein [Algoriphagus sp. PR1]
Length = 465
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 113/347 (32%), Positives = 173/347 (49%), Gaps = 32/347 (9%)
Query: 78 ALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSR 132
+L +S +I++ G+ D+ G I TPN+D +A G N Y A VCTPSR
Sbjct: 22 SLAQSQKPNFIIIFTDDQGYGDVGTFGHPTIKTPNLDQMAMEGQKWTNFYVAANVCTPSR 81
Query: 133 ASLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFR 189
+++MTG+ P+ TGM ++ G+P TE + + L+ GYST AIGKWHLG
Sbjct: 82 SAIMTGRLPVRTGMYSNTRRVLFPDSGGGLPATENTIAKLLKTSGYSTAAIGKWHLGHLP 141
Query: 190 REYTPLYRGFESHFG--YLNGVISYYDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGE 245
EY P GF+++FG Y N + D + ++ + N MR
Sbjct: 142 -EYLPTSHGFDTYFGIPYSNDMDRINDVTAQEAFASPKPEYFNVPLMRDKEIIERPADQT 200
Query: 246 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
T +T+EAV I+ D+P F+YLAH H L A ++ ++T R
Sbjct: 201 TITKRYTEEAVSYIKANK-DQPFFIYLAHSLPHI-----PLFASED------FLTKSERG 248
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
Y +++++D SVG ++S L+ +G+ +N+ +IF SDNG P + Y E
Sbjct: 249 LYGDVIEEIDWSVGQILSTLKSEGIDKNTYVIFTSDNG-PWLVYNEQGGSSGGLFG---- 303
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
K T +EGGV+VP I+W P P+V ++ D LPT+ + +G
Sbjct: 304 -GKGTSYEGGVRVPTIIWGPG-NIKPQVVSKIGSTLDLLPTISSLSG 348
>gi|449134034|ref|ZP_21769542.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448887354|gb|EMB17735.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 728
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 128/432 (29%), Positives = 189/432 (43%), Gaps = 62/432 (14%)
Query: 77 AALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASL 135
+A + L L+ G+ DL +G+ E+ TP ID +A GI + YA P+C+PSRA L
Sbjct: 2 SAEKPNIVLILVDDQGYYDLGCYGATEVKTPRIDEMAGGGIRFTDYYAAAPICSPSRAGL 61
Query: 136 MTGKYPIHTGMQGPPIW---GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
+TG YP G +W G+ E L E ++ GY T IGKWHLG F +
Sbjct: 62 LTGCYPRRVGNH---VWVHRADSNTGIHSDELTLAELFKDNGYKTACIGKWHLG-FHEPF 117
Query: 193 TPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFT 252
P +GF+ +FG L+ + D ++ + ++R + T L+T
Sbjct: 118 LPQNQGFDHYFGLLHNLDPVETVYFEDVGGVPLQRDRDVVKRPADP------DELTKLYT 171
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVK 312
EA+ IE + P LYL H H L +E Q+ Y ++
Sbjct: 172 NEAIDFIEANK-EGPFLLYLPHTMLH-----NPLGVSEEFKGTSQW------GEYGDAIQ 219
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN--YPYRGVKNT 370
+LD VG +I L+R + +N+++++ SDNG R G N RG K +
Sbjct: 220 ELDHHVGRIIDVLKRLNIDDNTVVVYASDNG------------RGPGRNPQQKIRGRKLS 267
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSL-QMMHISDWLPTLYTAAGGDTSRLPLNIDGLD--- 426
+EGG++VPAI W P + +S ++ DW PTL T A + + IDG D
Sbjct: 268 TYEGGIRVPAIAWGPGLGLRAGISSPAVVRAMDWYPTLATLADIEIPSERV-IDGRDLSP 326
Query: 427 -------------QWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRT 473
SL P RR DG +W SL+ ++ N +
Sbjct: 327 LLRGETDFVPPPGLKKSLNAGVPLRRRWEPDG--EWGSLI--NRNQYNDAFFYHGSQGAL 382
Query: 474 AAVRLDSWKLVL 485
AAVR +WKL L
Sbjct: 383 AAVRWRNWKLYL 394
>gi|260060774|ref|YP_003193854.1| arylsulfatase A [Robiginitalea biformata HTCC2501]
gi|88784904|gb|EAR16073.1| arylsulfatase A (precursor) [Robiginitalea biformata HTCC2501]
Length = 526
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 111/341 (32%), Positives = 167/341 (48%), Gaps = 25/341 (7%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G++D+ +G+ +IPTPN+DA+A +G++L N Y AQPVC+ SRA L+TG YP G+
Sbjct: 85 GYSDVGVYGARDIPTPNLDAMAADGLLLTNFYAAQPVCSASRAGLLTGCYPNRVGIHN-A 143
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+ P G+ E L E LR+ GY T GKWHLG ++ P GF+ FG
Sbjct: 144 LMPNSPVGLNPAEETLAELLRQQGYRTGIFGKWHLG-DHPDFLPTRHGFDEFFGIPYSND 202
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+ H L L ++ R + T D T T+ +V I ++ ++P FL
Sbjct: 203 MWPLHPLQGPVFDFGPLPLYEQERVVDTLEDQ--RLLTRQITERSVDFI-NRHKEEPFFL 259
Query: 271 YLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
Y+ H H P + F+ + R Y ++ ++D SVG V+ AL+ G+
Sbjct: 260 YVPHPQPH---------VPLFVSDAFRGKS--GRGLYGDVIMEIDWSVGQVLGALEDNGL 308
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
+++ +IF SDNG P + Y S G P R K T WEGGV+ P I+ P
Sbjct: 309 TDDTWVIFTSDNG-PWLAYGNHS-----GRAEPLREGKGTNWEGGVREPCIMKFPGRLPR 362
Query: 391 PRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+V + + D LPT+ + G + + IDG + W L
Sbjct: 363 GKVLDEPLMAIDLLPTIASVTG--SPQPGREIDGKNAWGLL 401
>gi|196232399|ref|ZP_03131252.1| sulfatase [Chthoniobacter flavus Ellin428]
gi|196223471|gb|EDY17988.1| sulfatase [Chthoniobacter flavus Ellin428]
Length = 452
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 179/364 (49%), Gaps = 60/364 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQG-- 148
G + S G+ +I TPNID L + G+ +N YA VC+PSRA+LMTG +P G+ G
Sbjct: 39 GRGEYSDFGTKDIQTPNIDRLCHEGLTFDNFYANSCVCSPSRAALMTGCFPNRVGVPGVI 98
Query: 149 -----PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
WG +GV R P+ L+ GY + +GKWHLG TP RGF+
Sbjct: 99 REEDQDNNWGWLAQGV----RLFPQLLKPAGYQSAIVGKWHLGI-NSPNTPTERGFDRFD 153
Query: 204 GYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
G+L ++ Y+ H+ Q + MR N + +ATD+FT A++ ++++
Sbjct: 154 GFLGDMMDDYWTHLRHGQ---------NFMRHNQEVV--SPEGHATDVFTDWALRYLDER 202
Query: 263 PVDK-PLFLYLAHLAAHAGNAGK-----HLEAPQETINQFQYITDPNRRTYAAMVKKLDD 316
K P FLYLA+ A H K ++A + +++ R A+++ LDD
Sbjct: 203 SKSKEPFFLYLAYNAPHDPIQPKPDWLAKVKAREPGLSE-------KRAKLVALIEHLDD 255
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
+G V++ L G+ +N+I++F SDNG N +N P+R K ++EGG+
Sbjct: 256 GIGKVLAKLDETGLAKNTIVLFSSDNGG---------VLSNEANNGPWRSGKTHMYEGGL 306
Query: 377 KVPAIL-W----SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+VP ++ W +P N RV+L M D LPT G P NIDG+ +L
Sbjct: 307 RVPCMVRWPGHVAPGTHSN-RVALLM----DILPTALEIVG---QTPPPNIDGVSFLPTL 358
Query: 432 LLNT 435
L T
Sbjct: 359 LGET 362
>gi|313232711|emb|CBY19381.1| unnamed protein product [Oikopleura dioica]
Length = 453
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 136/522 (26%), Positives = 218/522 (41%), Gaps = 117/522 (22%)
Query: 136 MTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
MTG+YP+ G+Q I EP G+PL E LP+ RE GYST GKWHLG++ + + P
Sbjct: 1 MTGRYPMRYGLQRGVIRPDEPEGIPLREILLPQGFRECGYSTLMSGKWHLGYYTKAHRPQ 60
Query: 196 YRGFESHFGYLNGVISYYDHIL-----------SDQYSRTVELNGHDMRRNLSTAWDTVG 244
RGF+ +G+ G +Y H + YS E+ HD AW G
Sbjct: 61 NRGFDRFYGFYVGSQDFYFHRTGFKSDGASFWSEEGYSTGEEIIRHD-------AW---G 110
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAP---QETINQFQYITD 301
Y+T L +A++ + ++ PLF YL+ HA + P +++I F +
Sbjct: 111 HYSTTLLVDDAIKQLRNRNPGNPLFHYLSIQDPHAPE-----QRPLRLEQSIKHFALRGE 165
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLE--NSIIIFMSDNGAPTVEYRETSNYRNWG 359
R+ + VK +D +G + L+R + + N+I+IF SDNG P R G
Sbjct: 166 --RKIFVTKVKSMDLEIGRYLDVLKRIYIPDYNNTIVIFSSDNGGP----------RKLG 213
Query: 360 -SNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRL 418
SN+P RG K +++G + A++ P ++ + M H DW+PTL +
Sbjct: 214 ASNWPLRGGKGCIFDGAFRSRALIHIPGMKH--KNYPHMFHAVDWMPTLMEMLSCKLPKD 271
Query: 419 PLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLIN---IDEKKRTA 474
L +DG+ ++L + + R + +D L R+ + D + R A
Sbjct: 272 HLPLDGVSHMKTILGMTDRAPREEMLLDIDPEEVLQKTASDARDFSYLKKSGFDSRMR-A 330
Query: 475 AVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPIS 534
A+R WK++ G +GT G +Q ++N +
Sbjct: 331 ALRYKEWKIITGPPCSGTKCG--------------------------VQSENRNYYKAEE 364
Query: 535 NIDKMRSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLY 594
+ K RS R L+N+ DP E +++ P+++ L
Sbjct: 365 YVSKSRSVR-------------------------LYNITADPRESYDLSDKYPEVTKAL- 398
Query: 595 ELLKYHRRTLVPQSHEQPDLV-------QADPKRFNDTWSPW 629
LLK + HE+ L +A P+ TW PW
Sbjct: 399 -LLK------LSVYHEEQSLFMDRMRDKKALPETSEHTWFPW 433
>gi|32476258|ref|NP_869252.1| arylsulfatase A [Rhodopirellula baltica SH 1]
gi|32446802|emb|CAD76638.1| arylsulfatase A [Rhodopirellula baltica SH 1]
Length = 489
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/366 (30%), Positives = 178/366 (48%), Gaps = 46/366 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+NDL +GS I TPN+D LA G + Y A VC+PSRA+L+TG YP G+
Sbjct: 58 GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 117
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++ G+ E + ++L+ GY+T +GKWHLG +E P GF+S++G I
Sbjct: 118 LFPQSTYGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 171
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
Y + + R +++ D + S+A W+T + E D +T
Sbjct: 172 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTVTRRYTDR 231
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
A++ +E DKP FLYL H H L P++ Y DP + Y +++ +
Sbjct: 232 AIEFVEANQ-DKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHI 279
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D VG ++ ++ G+ E ++I++ SDNG P ++++ + GS P R K T +EG
Sbjct: 280 DTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 333
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL--YTAAGGDTSRLPLNIDGLDQWSSLL 432
G +VP I+W+P S D LPT+ +T + R IDG+D S+
Sbjct: 334 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASFTGVALENDR---KIDGIDLTSTFT 390
Query: 433 LNTPSR 438
+ +R
Sbjct: 391 SDESAR 396
>gi|297180066|gb|ADI16291.1| arylsulfatase a and related enzymes [uncultured bacterium
HF0010_16H03]
Length = 543
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 121/409 (29%), Positives = 193/409 (47%), Gaps = 61/409 (14%)
Query: 62 INQFQYITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFH----GSNEIPTPNIDALAYNG 116
I++ Q I P + TK + +L G+ND+SF+ + TPNID LA G
Sbjct: 38 ISENQEINWPIVESSVITTKPNIILILADDLGFNDVSFYNGGAADGSLMTPNIDNLAKEG 97
Query: 117 I-ILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI------------------------ 151
+ LN A PVC+PSRA++MTG+Y G + P
Sbjct: 98 VAFLNGYAASPVCSPSRAAIMTGRYSSRYGFEFTPYPAQAAKIMNLLREDDELETIYLED 157
Query: 152 -----WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYL 206
G G+P E + E L++ GY T IGKWHLG F P +GF+ L
Sbjct: 158 VQLDEVGLTIGGLPDKEVTVAEMLQKNGYYTAHIGKWHLGGFTDGMKPNDQGFDDSL-ML 216
Query: 207 NGVISYYDHILSDQYSRTVELNGHDM---RRNLSTAWDTV-----GEYATDLFTKEAVQL 258
N + Y D + ++ + DM + +++ G Y TD +T EA+++
Sbjct: 217 NSSL-YLPKNHPDIVNAKIDSSIEDMVWASSQFAASFNGSEPFKPGGYITDYYTDEAIKV 275
Query: 259 IEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
I + D+P FLYL H A H L++ ++ + ++ + + Y+ M++ LD S+
Sbjct: 276 INNNK-DRPFFLYLGHFAPH-----NPLQSLRKDYEKHNHMENHTLQVYSGMIEALDRSI 329
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G +++ L+ G+ EN++IIF SDNG Y N N PYRG K T +EGG+ +
Sbjct: 330 GKIVTTLEENGLTENTLIIFTSDNGG--AGYIGLDNI-----NKPYRGWKLTHFEGGMHI 382
Query: 379 PAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
P W +I+++ R ++ H +D T+ AA + + + IDG++
Sbjct: 383 PFFAKWPAKIKKDMRYDKRIHH-TDIFSTILGAASIEPPK-EIKIDGIN 429
Score = 39.3 bits (90), Expect = 5.7, Method: Compositional matrix adjust.
Identities = 29/80 (36%), Positives = 39/80 (48%), Gaps = 7/80 (8%)
Query: 540 RSTRQQATIHCGANPAPMTPSPCTNGPCYLFNLGNDPCEQNNIASSRPDISSQLYELLKY 599
++ QA IH N + S N YL+NL DP EQNN+A S + S L+E L
Sbjct: 449 KNASYQAVIH---NDWKLLRSKFPNEQEYLYNLKKDPYEQNNLAMSELTLKSLLHEKLDT 505
Query: 600 HRRTLV----PQSHEQPDLV 615
H ++ PQS P +V
Sbjct: 506 HIESMPEPNWPQSVFMPVVV 525
>gi|336413808|ref|ZP_08594157.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
3_8_47FAA]
gi|335934825|gb|EGM96808.1| hypothetical protein HMPREF1017_01265 [Bacteroides ovatus
3_8_47FAA]
Length = 472
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 139/457 (30%), Positives = 204/457 (44%), Gaps = 71/457 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G DL +G N I TP IDALA G+ Y+ V PSR L+TGK+ H+ ++G
Sbjct: 48 GIGDLGCYGQNRIKTPAIDALAAQGMKFTQHYSGSTVSAPSRCVLLTGKHTGHSYIRGNK 107
Query: 151 IWGAEP-RGVPLT----ERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
+ AE R L E + E + Y T +GKW LG E P +GF+ FGY
Sbjct: 108 GYKAEDGRSYDLNLADEEITIGEMFKTKNYVTACVGKWGLGGPSAEGHPNKQGFDYFFGY 167
Query: 206 LN-GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
L G Y + + + V LN H + Y+ D+ +A++ I D+
Sbjct: 168 LGQGNAHNYYPVQLFENNTPVMLN-HKV-------------YSHDMIMDKALEFI-DKNA 212
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETI----NQFQYITDPN---------RRTYAAMV 311
DKP F+YL HA + P + +F+ + P R T+AAMV
Sbjct: 213 DKPFFVYLTPTIPHA-----DIIVPNNELFDYDGKFEEVPYPGKGYRPQEKPRATFAAMV 267
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS-NYRNWGSNYPYRGVKNT 370
+LD V VI L KG+L+N+IIIF SDNG ++E + R + SN P+RG+K
Sbjct: 268 TRLDRDVQRVIDLLADKGVLDNTIIIFTSDNGT----HKEGGHDPRYFDSNGPFRGMKRD 323
Query: 371 LWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSS 430
L+EGG++ P I+ P + VS Q+ D++PT+ G + P NIDGL +
Sbjct: 324 LYEGGIRTPFIVKWPGVIPAGSVSFQVSAFWDFMPTMCELIG---APAPANIDGLSYLPT 380
Query: 431 LLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
L +++ + + E+ AV D WKL L Q N
Sbjct: 381 LTGKGKQKQH--------------------DYLYFEFHEQGGKQAVIKDGWKL-LHLQVN 419
Query: 491 GTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQ 527
+Y N P N N + ++ Q +Q+L +
Sbjct: 420 NPQKEHYELYNLNADPGENANVL--NQYPQKVQELEK 454
>gi|417301368|ref|ZP_12088525.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327542298|gb|EGF28785.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 470
Score = 146 bits (369), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 112/373 (30%), Positives = 181/373 (48%), Gaps = 46/373 (12%)
Query: 87 LLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYP 141
+++V+ G+NDL +GS I TPN+D LA G + Y A VC+PSRA+L+TG YP
Sbjct: 30 VIVVFTDDQGYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYP 89
Query: 142 IHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFES 201
G+ ++ G+ E + ++L+ GY+T +GKWHLG +E P GF+S
Sbjct: 90 KRVGLHQHVLFPQSKHGLHPDEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDS 148
Query: 202 HFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD-- 249
++G I Y + + R +++ D + S+A W+T + E D
Sbjct: 149 YYG-----IPYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQR 203
Query: 250 ----LFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR 305
+T A++ +E DKP FLYL H H L P++ Y DP +
Sbjct: 204 TVTRRYTDRAIEFVEANQ-DKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QN 251
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
Y +++ +D VG ++ ++ G+ E ++I++ SDNG P ++++ + GS P R
Sbjct: 252 AYKCVIEHIDTEVGRLVQTVRDLGLSEKTLIVYTSDNG-PWLQFKN-----HGGSAGPLR 305
Query: 366 GVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
K T +EGG +VP I+W+P S D LPT+ + G IDG+
Sbjct: 306 AGKGTTFEGGQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGVALEN-DRKIDGI 364
Query: 426 DQWSSLLLNTPSR 438
D S+ + +R
Sbjct: 365 DLTSTFTSDESAR 377
>gi|149195908|ref|ZP_01872964.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
gi|149140755|gb|EDM29152.1| arylsulfatase A [Lentisphaera araneosa HTCC2155]
Length = 490
Score = 146 bits (368), Expect = 3e-32, Method: Compositional matrix adjust.
Identities = 114/351 (32%), Positives = 172/351 (49%), Gaps = 30/351 (8%)
Query: 75 TYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTP 130
Y K T ++ + G L G + TP ID L ++ + P CTP
Sbjct: 18 VYCEELKGTKPNIIFILTDDQGKGQLGCEGHPWLETPYIDKLFSESSYFSDFHMSPTCTP 77
Query: 131 SRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
SRA+LMTG YP G+ GA R + L LPE L+ LGY+T GKWHLG +
Sbjct: 78 SRAALMTGNYPFRNGVT--HTGGARAR-MTLKIETLPEALKSLGYTTGIFGKWHLG-YEE 133
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDL 250
Y P RGF+ F + G I + +++Y + +R N T T G + TD+
Sbjct: 134 AYQPHSRGFDEVFIHGYGGIGQPMDVPNNKYENPI------IRHN-ETFVKTEG-FCTDV 185
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYIT-DPNRRTYAA 309
F +A+ I+ + DKP F Y++ A H G ++ AP++ +F+ + +
Sbjct: 186 FFDQALAWIK-KSKDKPFFAYISTNAPH----GPYI-APKDKREKFKKLGFKSGDAGFYG 239
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETS-NYRNWGSNYPY---- 364
M++ +DD+VG +++ L++ + EN+++IFMSDNG ++ E RN Y Y
Sbjct: 240 MIENIDDNVGRLMTKLKQWDLDENTLVIFMSDNGFASLVINEKKLGQRNGEDLYSYQAGL 299
Query: 365 RGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGD 414
RG K T +EGG VPA W + + ++S HI D PTL AGG+
Sbjct: 300 RGYKKTPYEGGTLVPAFFRWKGKFKAGNKISGLSAHI-DLFPTLVELAGGE 349
>gi|149179345|ref|ZP_01857904.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Planctomyces maris DSM
8797]
gi|148841817|gb|EDL56221.1| N-acetylgalactosamine 6-sulfatase (GALNS) [Planctomyces maris DSM
8797]
Length = 461
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 118/366 (32%), Positives = 178/366 (48%), Gaps = 50/366 (13%)
Query: 75 TYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSR 132
T A T+ L +L+ G+ DLS +G+ ++ +P+ID L G+ +N YA PVC+P+R
Sbjct: 26 TTAQQTRPNVLVILVDDLGYGDLSSYGATDLKSPHIDELLNRGMKFSNFYANCPVCSPTR 85
Query: 133 ASLMTGKYPIHTGMQG------PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHL 185
A+L+TG Y G+ G WG +P V L + F GY T IGKWHL
Sbjct: 86 AALLTGHYQDMVGVPGVIRTHPENSWGYLKPSAVTLADVF-----HSAGYQTAIIGKWHL 140
Query: 186 GFFRREYTPLYRGFESHFGYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
G TP RGF+ G+L ++ YY H R +N MRRN T D G
Sbjct: 141 GL-ESPNTPNERGFDLFRGFLGDMMDDYYLH-------RRHGVN--YMRRNQKTV-DPQG 189
Query: 245 EYATDLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYIT 300
+ATDLFT + ++ Q + P FLYLA+ A H ++ P++ + Q +
Sbjct: 190 -HATDLFTDWTCEYLKQQATSESPFFLYLAYNAPHT-----PIQPPEDWLGKVKQRETGI 243
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
DP+R A+++ LD +G VI L + +N++I+F SDNG +
Sbjct: 244 DPDRARLVALIEHLDAGIGKVIQTLDETKLSDNTLIVFSSDNGG---------QLGVGAN 294
Query: 361 NYPYRGVKNTLWEGGVKVP-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
N R K +++EGG+KVP ++W I + M + D PT+ A G ++P
Sbjct: 295 NGALRDGKQSMYEGGLKVPTGVVWKKHIAPHTETDFMAMSM-DIFPTVCEAGG---IKVP 350
Query: 420 LNIDGL 425
+D +
Sbjct: 351 SGLDAV 356
>gi|427388367|ref|ZP_18884250.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
gi|425724950|gb|EKU87824.1| hypothetical protein HMPREF9447_05283 [Bacteroides oleiciplenus YIT
12058]
Length = 457
Score = 146 bits (368), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 117/350 (33%), Positives = 171/350 (48%), Gaps = 54/350 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G DLS + ++ TPNID L +G+ LNN YA V +PSRA L+TG YP G+ G
Sbjct: 48 GLGDLSCQYARDLQTPNIDKLFSDGVRLNNFYANSSVSSPSRAGLLTGCYPDMVGVPGVI 107
Query: 149 ----PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
WG P + LP+ +++ Y+T IGKWHLG P RGF+
Sbjct: 108 RTETTASWGYLSPEAI-----LLPQMMKKAEYNTAIIGKWHLGL-EEPNLPNSRGFDFFH 161
Query: 204 GYLNGVI-SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
G+L ++ +YY H+ GH+ R + G +ATDLF+ A+ +
Sbjct: 162 GFLGDMMDNYYTHLR----------RGHNYMRLNEEVIEPTG-HATDLFSSWALDYFDTM 210
Query: 263 PVDK-PLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
K P FLYLA+ A H L+ P E + + + R A+++ LD ++
Sbjct: 211 RKSKVPFFLYLAYNAPHVP-----LQPPTEWLERVREREPGISEKRAKLVALIEHLDYNI 265
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V AL+ G L+N++IIF SDNG + +N P RG K ++EGG+KV
Sbjct: 266 GRVYKALEESGQLDNTLIIFCSDNGG---------DAGAEANNGPVRGSKGDMYEGGIKV 316
Query: 379 P-AILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGL 425
P A+ W ++ PR ++ +SD PTL D +P+N IDG+
Sbjct: 317 PCAVYWRNHLK--PRSIDDLVIMSDIFPTL-----CDLVSIPVNHRIDGI 359
>gi|87312329|ref|ZP_01094424.1| arylsulfatase A [Blastopirellula marina DSM 3645]
gi|87284951|gb|EAQ76890.1| arylsulfatase A [Blastopirellula marina DSM 3645]
Length = 477
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 114/345 (33%), Positives = 171/345 (49%), Gaps = 33/345 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G+ D+ GS TPN++A+A G+ L YA PVC+PSRA+LMTG YP +
Sbjct: 42 GYADIEPFGSEVNRTPNLNAMADEGMKLTCFYAAPVCSPSRAALMTGCYPKRALTIPHVL 101
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNGV 209
+ G+ E + E ++E GY+T IGKWHLG + ++ P +GF+ ++G Y N +
Sbjct: 102 FPGNAEGMSPNEVTIAELMKEQGYATAIIGKWHLG-DQPDFLPTRQGFDYYYGLPYSNDM 160
Query: 210 ISYYDHILSDQYSRTVELNGHD-----MRRNLSTAWDTVGEYATDL---FTKEAVQLIED 261
D + S+ + + G + RN + + + T+L +T+EA+Q I D
Sbjct: 161 GPAADGVKSNYGAPIPQRKGKGQPPLPLLRNETVLQRVLAKDQTELVTNYTEEAIQFIRD 220
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
+KP FLYL H A H P + F+ + Y V+++D SVG V
Sbjct: 221 HQ-EKPFFLYLPHSAVH---------FPMYPGDAFR--GKNSHGLYNDWVEEVDWSVGQV 268
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ G+ + +++IF SDNG T R N P R K T +EGG++VP I
Sbjct: 269 LQALKDLGLDQRTLVIFTSDNGGQT---------RFGAVNKPLRAGKATTYEGGMRVPTI 319
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLD 426
+ P S ++ + D LPTL AGG T+ IDG D
Sbjct: 320 VRWPGKVPAGSSSDAVVGMIDVLPTLVKLAGG-TTPTDRKIDGAD 363
>gi|171912352|ref|ZP_02927822.1| arylsulfatase A [Verrucomicrobium spinosum DSM 4136]
Length = 491
Score = 145 bits (367), Expect = 4e-32, Method: Compositional matrix adjust.
Identities = 122/361 (33%), Positives = 177/361 (49%), Gaps = 37/361 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL G+ TPN+D +A G+ Y AQPVC+ SR +LMTG YP G++G
Sbjct: 45 GYGDLGCFGAKGQATPNLDRMAAEGVKFERFYVAQPVCSASRMALMTGCYPNRVGIKGAL 104
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
GA+ G+ E L E +++ GY+T A GKWHLG ++ P+ GF+ + G Y N
Sbjct: 105 GPGAK-VGISKEETTLAELVKQNGYATAAFGKWHLGD-DPQFLPVRHGFDEYLGLPYSND 162
Query: 209 VISYYDHILSDQYSRTVELNGH------DMRRNLSTAWDTVGE-YATDLFTKEAVQLIED 261
+ Y+ +++ + + G D R + TV + T +T+ AV+ I
Sbjct: 163 MWPYHPELVNLTPEQRKKRRGFPALPLVDGDRIILPEVTTVEQTRLTTWYTQRAVKFINT 222
Query: 262 QPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTV 321
DKP LYLAH H +Y + Y +++++D SVG V
Sbjct: 223 NK-DKPFLLYLAHSMPHV-----------PLFVSDKYKGNSANGVYGDVIEEIDWSVGEV 270
Query: 322 ISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
+ AL+ G+ EN+++IF SDNG P + Y S GS P R K T+WEGGV+ P +
Sbjct: 271 LKALKENGLDENTLVIFTSDNG-PWLSYGTHS-----GSPGPLREGKGTVWEGGVREPFV 324
Query: 382 L-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRR 439
W +I V+ M I D LPT+ G S+LP IDG + W LL T +
Sbjct: 325 ARWPGKIPSGKVVTEPAMTI-DILPTVAALTG---SKLPDHKIDGRNIW-PLLSATDGAK 379
Query: 440 N 440
N
Sbjct: 380 N 380
>gi|410447290|ref|ZP_11301386.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
gi|409979565|gb|EKO36323.1| type I phosphodiesterase/nucleotide pyrophosphatase [SAR86 cluster
bacterium SAR86E]
Length = 548
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 117/370 (31%), Positives = 168/370 (45%), Gaps = 60/370 (16%)
Query: 92 GWNDLSFH----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGM 146
G+ND+S H + T NIDALA +G++ YA C PSRAS+MTGKYP G
Sbjct: 73 GYNDISMHNGGAADGTLQTQNIDALAKSGMLFTRGYAANATCAPSRASIMTGKYPTRFGY 132
Query: 147 QGPPI---------WGAEP----------------------RGVPLTERFLPEYLRELGY 175
+ P+ W A +G+P E + E LR+ GY
Sbjct: 133 EFTPVPATGRLIMNWLANESDEELKARIDREVASKIPPFMQQGMPTEEITIAEVLRDAGY 192
Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRN 235
T IGKWHLG PL +GF+ + + DH D + + M
Sbjct: 193 YTAHIGKWHLG-HAYGMDPLSQGFQDSLSMVGPLYLPEDH--PDVVNAKFDTRIDQMIWG 249
Query: 236 LSTAWDTVGE--------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLE 287
+ T E Y TD +T EA+++IE+ ++P FLYL+H A H L+
Sbjct: 250 IGQYSATFNEGELFAPDKYLTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQ 303
Query: 288 APQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTV 347
A + + Q ++ N + Y+ M+ LD SVG VI L+ + ++IIF SDNG
Sbjct: 304 ALRSDVEQMSHMEGHNLQVYSGMIAALDRSVGKVIQKLKDLDIHGKTLIIFTSDNGG--A 361
Query: 348 EYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTL 407
Y E + N PYRG K + +EGG++VP I+ P + S +H D T+
Sbjct: 362 NYIELDDI-----NKPYRGWKISFFEGGIRVPYIVSWPDEINPAQKSNSAVHHFDIFSTI 416
Query: 408 YTAAGGDTSR 417
+AAG +S
Sbjct: 417 ASAAGIQSSH 426
Score = 43.5 bits (101), Expect = 0.36, Method: Compositional matrix adjust.
Identities = 26/92 (28%), Positives = 48/92 (52%), Gaps = 7/92 (7%)
Query: 14 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 73
+Y TD +T EA+++IE+ ++P FLYL+H A H L+A + + Q ++ N
Sbjct: 267 KYLTDYYTDEALKVIENNK-NRPFFLYLSHWAIH-----NPLQALRSDVEQMSHMEGHNL 320
Query: 74 RTYAALTKSTTLTL-LIVYGWNDLSFHGSNEI 104
+ Y+ + + ++ ++ DL HG I
Sbjct: 321 QVYSGMIAALDRSVGKVIQKLKDLDIHGKTLI 352
>gi|223936859|ref|ZP_03628768.1| sulfatase [bacterium Ellin514]
gi|223894428|gb|EEF60880.1| sulfatase [bacterium Ellin514]
Length = 460
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 121/391 (30%), Positives = 184/391 (47%), Gaps = 63/391 (16%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRA 133
+AA + + L G+ DL +G I TPN+D +A GI + Y A VCTPSRA
Sbjct: 23 VFAAKAPNIVIILADDLGYGDLGCYGHPSIRTPNLDRMAAEGIRFTDFYVAANVCTPSRA 82
Query: 134 SLMTGKYPIHTGMQGPP---IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRR 190
LMTG++PI +GM G + G+P E + L+ GY+T IGKWHLG
Sbjct: 83 GLMTGRWPIRSGMAGSTNDVLRVFSMGGLPTNEITIAAALKSKGYATACIGKWHLGH-EL 141
Query: 191 EYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTA-------W--- 240
++ P + GF+ +G ++S +E + +N S++ W
Sbjct: 142 QFLPTHHGFDYFYGL--------------RFSNDMEPVRGKIPKNASSSLHPKLEWWNSA 187
Query: 241 ----DTVGEYATDL------FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQ 290
D + E TDL +T+EA++ I Q +P FLY H H P
Sbjct: 188 LLQNDKILEQPTDLSTLTRRYTEEAIKFIH-QNKKQPFFLYFPHTFPHV---------PL 237
Query: 291 ETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYR 350
+ F+ R Y +V++LD SVG V++ L+++G+ EN+++ F SDNG + R
Sbjct: 238 FASDAFK--EHSARGLYGDVVEELDWSVGQVLNTLRKEGLAENTLVFFTSDNGPWLI--R 293
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
+ + GS P + K + WEGG++ PAI W P + P ++ +++ D T
Sbjct: 294 DLAG----GSAGPLKDGKGSTWEGGMREPAIAWWPG-KIKPAINHELVCSLDLFTTSLCL 348
Query: 411 AGGDTSRLPLN--IDGLDQWSSLLLNTPSRR 439
AG LP + IDGLD L S+R
Sbjct: 349 AG---VALPPDRVIDGLDMRPMLFGTGHSQR 376
>gi|434394677|ref|YP_007129624.1| N-acetylgalactosamine-6-sulfatase [Gloeocapsa sp. PCC 7428]
gi|428266518|gb|AFZ32464.1| N-acetylgalactosamine-6-sulfatase [Gloeocapsa sp. PCC 7428]
Length = 469
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 120/400 (30%), Positives = 189/400 (47%), Gaps = 55/400 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G +I TP++D +A G+ N Y+ VC PSR +LMTG + H ++G
Sbjct: 44 GYGDLGCYGQKQIKTPHLDQMASEGMRFTNFYSGSTVCAPSRCTLMTGYHTGHCRIRGNN 103
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
GA+ + + + + L+ GY+T GKW LG P +GF+ FGYLN
Sbjct: 104 --GADDISLHAEDITVAQVLKAAGYTTAMFGKWGLGGPDSPGEPHLQGFDEWFGYLN--- 158
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
H+ + Y + + RN G Y+ +LFT+ A+ I+ + D+P FL
Sbjct: 159 ----HVHAHDYYPSY------LWRNQQRVAIAPGTYSHNLFTQAALDFIKAKQ-DEPFFL 207
Query: 271 YLAHLAAHAGN--AGKHLEAPQETINQFQYITDP---NRRTYAAMVKKLDDSVGTVISAL 325
YL + HA N + ++ P + Y +P ++ YAAM+ LD VG +++ L
Sbjct: 208 YLPYTIPHANNELGSEGMQVPSDA----PYSDEPWPQQQKNYAAMISLLDADVGKILALL 263
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WS 384
G+ E++I+ F SDNG P E T+ + + SN P G+K L++GGV+VP I+ W+
Sbjct: 264 AELGLEEDTIVFFSSDNG-PHEEGGATAEF--FDSNGPLNGIKRDLYDGGVRVPMIVRWT 320
Query: 385 PQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNID 444
IQ + + Q + D+LPT AG R ID
Sbjct: 321 GTIQPK-QENHQPYALWDFLPTAAEIAGVSAPR------------------------GID 355
Query: 445 GLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLV 484
G+ +LL ++ N + E+ A+R+ WK V
Sbjct: 356 GVSILPALLGQPANKHNFLYWEFHERGFEQAIRMGKWKAV 395
>gi|449282487|gb|EMC89320.1| N-acetylgalactosamine-6-sulfatase, partial [Columba livia]
Length = 488
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 118/364 (32%), Positives = 176/364 (48%), Gaps = 42/364 (11%)
Query: 86 TLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHT 144
LL GW DL G TPN+D +A G++ + Y A P+C+PSRA+L+TG+ P+
Sbjct: 1 VLLTQMGWGDLGAFGEPSKETPNLDQMASEGMLFLDFYTANPLCSPSRAALLTGRLPVRN 60
Query: 145 GMQGPPIWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTP 194
G A R G+P +E LPE L++ GY+ K IGKWHLG R ++ P
Sbjct: 61 GFY---TTNAHARNAYTPQDIVGGIPDSELLLPELLKKAGYTNKIIGKWHLGH-RPQFHP 116
Query: 195 LYRGFESHFGYLNGVISYYDH--ILSDQYSRTVELNG---HDMRRNLSTAWDTVGEYATD 249
L GF+ FG N +D+ + + R E+ G D + +L T + T
Sbjct: 117 LKHGFDEWFGSPNCHFGPFDNRALPNIPLYRDWEMIGRYYEDFKIDLKTGEANL----TQ 172
Query: 250 LFTKEAVQLI-EDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
++ +EA+ I + Q +P FLY A A HA Q++ R Y
Sbjct: 173 IYLQEALDFISKQQASQQPFFLYWAIDATHA-----------PVYASKQFLGTSQRGLYG 221
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
V+++DDS+G ++ LQ+ G+ N+ + F SDNGA + S + GSN P+ K
Sbjct: 222 DAVREIDDSIGKILKHLQKLGISNNTFVFFTSDNGAALI-----SAPKQGGSNGPFLCGK 276
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG++ PAI W P VS Q+ ++ D T + AG IDG+D
Sbjct: 277 QTTFEGGMREPAIAWWPGHIPAGGVSHQLGNVMDLFTTSLSLAGLQPPS-DRQIDGIDLL 335
Query: 429 SSLL 432
++L
Sbjct: 336 PAIL 339
>gi|218262868|ref|ZP_03477199.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
gi|218223078|gb|EEC95728.1| hypothetical protein PRABACTJOHN_02879 [Parabacteroides johnsonii
DSM 18315]
Length = 461
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 101/308 (32%), Positives = 153/308 (49%), Gaps = 36/308 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS +I TPNID LA G I + + A V +PSR+ ++TG+Y G +
Sbjct: 42 GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+P E LP L+ GY T IGKWHLG P +GF++ +G L G
Sbjct: 101 -LDKPGDGIPDDEELLPALLKRYGYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158
Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
SY YD SD+ + + NG + + Y TD +A Q + + ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNGQKL---------SFDGYFTDELASKARQFVAES--EQP 207
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY++ A H+ N EA +E + +F+ R+ YAAM+ LD VG ++ L+
Sbjct: 208 FMLYMSFTAPHSPN-----EAAEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +++II F+SDNG T N SN P +G K +EGG +VP ++W +
Sbjct: 260 AGKFDDTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310
Query: 387 IQQNPRVS 394
+++ R +
Sbjct: 311 FKRDQRFT 318
>gi|390958067|ref|YP_006421824.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390958408|ref|YP_006422165.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390412985|gb|AFL88489.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
gi|390413326|gb|AFL88830.1| arylsulfatase A family protein [Terriglobus roseus DSM 18391]
Length = 470
Score = 145 bits (367), Expect = 5e-32, Method: Compositional matrix adjust.
Identities = 115/363 (31%), Positives = 173/363 (47%), Gaps = 43/363 (11%)
Query: 74 RTYAALTKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCT 129
+++AA TK + I+ G+ D S +G + TP+ID LA G+ YA PVC
Sbjct: 29 KSFAAQTKRRPNIIFILADDMGYADTSVYGQRKFKTPHIDRLAAEGVRFAQAYAGAPVCA 88
Query: 130 PSRASLMTGKYPIHTGMQGPPIWGA---------EPRGVPLTE--RFLPEYLRELGYSTK 178
PSR+ LMTG HT ++ A E R LT+ + +YL GY T
Sbjct: 89 PSRSVLMTGLNTGHTRVRDNFALAAGHVGFKGKEEIRRASLTDADHTIADYLARSGYKTA 148
Query: 179 AIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY--SRTVELNGHDMRRNL 236
GKWHL + TP GF++ G+L S + + +Y R ++L
Sbjct: 149 LFGKWHLDGYDPAATPTRHGFQTFKGWLTQTASTQGYFPTQRYDNERLIDLP-------- 200
Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
A G Y T + T ++V I+ Q D P FLY+A+ A H+ P +Q
Sbjct: 201 ENANGKQGRYDTTMITDDSVNYIQ-QSSDTPFFLYVAYDAPHS---------PYIAPDQG 250
Query: 297 QYITDP---NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNG---APTVEYR 350
Y +P + + YAAM+ LD+ VG +++AL++KG+ E++I+ F SDNG PT +
Sbjct: 251 SYAKEPWDEDEKIYAAMIGHLDNGVGKILAALKQKGIDEDTIVFFASDNGPRSEPTPQQT 310
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
+ N+ + SN +G K ++EGG++ P I P +VS + D LPT
Sbjct: 311 KVVNF--FDSNGALQGYKRDMYEGGIREPFIARWPGHIPAGKVSQVPIFFPDVLPTALDL 368
Query: 411 AGG 413
AG
Sbjct: 369 AGA 371
>gi|431838572|gb|ELK00504.1| N-acetylgalactosamine-6-sulfatase [Pteropus alecto]
Length = 456
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 131/412 (31%), Positives = 190/412 (46%), Gaps = 86/412 (20%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIH 143
L L+ GW DL +G TPN+D +A G++ N Y A P+C+PSRA+L+TG+ PI
Sbjct: 37 LLLMDDMGWGDLGVYGEPSRETPNLDRMAAEGMLFPNFYTANPLCSPSRAALLTGRLPIR 96
Query: 144 TGM---QGPPIWGAEPR----GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
G G P+ G+P TE LPE L++ GY +K +GKWHLG R ++ PL
Sbjct: 97 NGFYTTNGQARNAYTPQEVVGGIPDTELLLPELLKKAGYVSKIVGKWHLG-HRPQFHPLR 155
Query: 197 RGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLST--AWDTVGEY-------- 246
GF+ FG N YD + R N+ W+ VG +
Sbjct: 156 HGFDEWFGSPNCHFGPYD---------------NRARPNIPVYRDWEMVGRFYEEFPIDL 200
Query: 247 ------ATDLFTKEAVQLIE-DQPVDKPLFLYLAHLAAHA-----------------GNA 282
T ++ +EA+ I+ Q +P FLY A A HA G+A
Sbjct: 201 RTGEANLTQIYLQEALDFIKRQQAAQRPFFLYWAIDATHAPVYASKPFLGTSQRGRYGDA 260
Query: 283 --------GKHLEAPQE---TINQFQYITD---------PNRRTYAAMVKKLDDSVGTVI 322
GK L Q+ T + F + T P + Y V+++DDSVG ++
Sbjct: 261 VREIDDSVGKILSLLQDLKITKDTFVFFTSDNGAALISAPKQGRYGDAVREIDDSVGKIL 320
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL 382
S LQ + +++ + F SDNGA + + GSN P+ K T +EGG++ PAI+
Sbjct: 321 SLLQDLKITKDTFVFFTSDNGAALISAPKQGRC---GSNGPFLCGKQTTFEGGMREPAIV 377
Query: 383 WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLDQWSSLL 432
W P +VS Q+ ++ D T + AG LP + IDGLD S+L
Sbjct: 378 WWPGHIAAGQVSHQLGNVMDLFTTSLSLAG---LELPSDRVIDGLDLLPSML 426
>gi|296121751|ref|YP_003629529.1| sulfatase [Planctomyces limnophilus DSM 3776]
gi|296014091|gb|ADG67330.1| sulfatase [Planctomyces limnophilus DSM 3776]
Length = 536
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 122/384 (31%), Positives = 183/384 (47%), Gaps = 57/384 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
GW ++ G ++IPTPNID LA G+ L Y+ P C PSR LMTGK+ H ++G
Sbjct: 50 GWGEVGCFGQSKIPTPNIDRLASRGVKLTRHYSGAPTCAPSRCVLMTGKHLGHAEIRGNQ 109
Query: 149 ------PPIWGAEPRGVPLTERFL--PEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
P + PL+++ L ++ GY+T A GKW LG P +GF+
Sbjct: 110 QAKVKLPQFTEGQ---HPLSDKALTIARQFQKAGYATGAFGKWGLGPVGSTGEPNRQGFD 166
Query: 201 SHFGYLNGVI--SYYDHILSDQYSRTVE----LNGHDMRRNLSTAWDTV-GE-YATDLFT 252
FGY + SY+ L V + GH + + GE YA L
Sbjct: 167 EFFGYNCQALAHSYFPKALWKNAESIVNNEKPVPGHKKQPEGEVTMEAYQGENYAPRLIM 226
Query: 253 KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ-------------YI 299
EA+ I D+ +P FLYL H ++ P + + +F Y+
Sbjct: 227 AEALSFI-DRHHQQPFFLYLPFTEPHVA-----MQPPPKIVEEFPVEWDERVYRGDGGYL 280
Query: 300 TDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE-----YRETS 353
P R YAAM++ LD+ VG VI++L++ G+LE ++I+F SDNGA + +
Sbjct: 281 PHPRPRAAYAAMIRDLDNHVGDVITSLEKHGLLEKTLIVFTSDNGATHASANPDFHVGGA 340
Query: 354 NYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQM-MHISDWLPTLYTAA 411
+ + S +G K +++EGG++VPAI+ W QI P ++ + DW PTL A
Sbjct: 341 DPLFFNSTRELKGFKGSIYEGGLRVPAIVSWPGQIP--PATTINTPSYFPDWFPTLCNA- 397
Query: 412 GGDTSRLPLNIDGLDQWSSLLLNT 435
++LPL +GLD + L L T
Sbjct: 398 ----TQLPLP-EGLDGVNLLPLLT 416
>gi|410097286|ref|ZP_11292268.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
CL02T12C30]
gi|409224604|gb|EKN17536.1| hypothetical protein HMPREF1076_01446 [Parabacteroides goldsteinii
CL02T12C30]
Length = 446
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 107/355 (30%), Positives = 175/355 (49%), Gaps = 55/355 (15%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY 123
++ P + A K+ ++I+Y G+ D+ G +I TPN+D +A +G+ N Y
Sbjct: 12 LSVPLCQAAAKENKNDRPNIVIIYCDDMGYGDIGVTGHPDIKTPNLDRMALDGMRFTNYY 71
Query: 124 -AQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEP---RGVPLTERFLPEYLRELGYSTKA 179
A P T SR SL+TG+YP+ G + W P RG+ E + E L+E GY+T
Sbjct: 72 SASPASTASRYSLLTGRYPVRAGFR----WVLSPDAERGIHPRELTIAELLKEQGYATAI 127
Query: 180 IGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHD-MRRNLST 238
GKWHLG ++EY PL GF+ + G + Y + ++ +Y + G+D +R N
Sbjct: 128 YGKWHLGSTKKEYLPLQNGFDEYVG-----LPYSNDMIPPKYPDIALMCGNDTLRMNPDQ 182
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY----LAHLAAHAGNAGKHLEAPQETIN 294
+ T L+T++A+ I+ + F+Y + H+ + G A
Sbjct: 183 S------ELTALYTEKAISFIKKHKKEN-FFVYVPYAMPHVPLYPGKA------------ 223
Query: 295 QFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSN 354
+I R TY V+++D VG ++SAL+++G+ +N+I+ FMSDNG P + E
Sbjct: 224 ---FIGKSKRGTYGDAVEEIDWGVGEILSALKKEGIDKNTIVWFMSDNG-PWLPRNEEG- 278
Query: 355 YRNWGSNYPYRGVKNTLWEGGVKVPAILWSP-----QIQQNPRVSLQMMHISDWL 404
GS +R K + WEGGV+VP + P Q+ + S+ + S W+
Sbjct: 279 ----GSAGIFRDGKGSTWEGGVRVPCFVSWPDHLNGQVNEEVITSMDVFATSVWM 329
>gi|305665425|ref|YP_003861712.1| arylsulfatase A [Maribacter sp. HTCC2170]
gi|88710180|gb|EAR02412.1| arylsulfatase A (precursor) [Maribacter sp. HTCC2170]
Length = 489
Score = 145 bits (366), Expect = 6e-32, Method: Compositional matrix adjust.
Identities = 120/348 (34%), Positives = 173/348 (49%), Gaps = 43/348 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ--- 147
G+ D+ GS I TP++D +A G+ L + Y AQ VC+ SRA L+TG YP G+
Sbjct: 54 GYQDVGVFGSPNIKTPHLDRMASEGVKLTSYYSAQAVCSASRAGLLTGCYPNRIGIHNAL 113
Query: 148 GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--Y 205
GP G+ +E + E L+ GY T GKWHLG E+ P GF+ FG Y
Sbjct: 114 GP----GNTHGINSSETTMAEMLKNNGYKTAIFGKWHLGH-HPEFMPNKHGFDEFFGIPY 168
Query: 206 LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYA--TDLFTKEAVQLIEDQP 263
N + Y+ Q + N T DT+ E + T T+ +V I ++
Sbjct: 169 SNDMWPYHP-----QQGPIFNFPDLPLYEN-ETVIDTLTEQSNLTTQITERSVDFI-NRN 221
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVIS 323
D P FLY+ H H P +F+ +D R Y ++ ++D SVG +I
Sbjct: 222 KDNPFFLYVPHPQPHV---------PLFVSEKFKGKSD--RGLYGDVIMEIDWSVGQIIE 270
Query: 324 ALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILW 383
AL++ G+ EN+I+IF SDNG P + Y + GS P+R K T WEGG + P I+
Sbjct: 271 ALKKNGLEENTIVIFTSDNG-PWLSYGNHA-----GSALPFREGKGTAWEGGQREPFIMK 324
Query: 384 SP-QIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWS 429
P +++ + + MM I D LPTL A S+LP L+IDG + W+
Sbjct: 325 YPNKLKAGEVIDVPMMAI-DLLPTL---ASLTNSKLPELSIDGKNVWN 368
>gi|373956153|ref|ZP_09616113.1| sulfatase [Mucilaginibacter paludis DSM 18603]
gi|373892753|gb|EHQ28650.1| sulfatase [Mucilaginibacter paludis DSM 18603]
Length = 468
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 117/341 (34%), Positives = 173/341 (50%), Gaps = 36/341 (10%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGP- 149
G+ D+ +G +I TPNID LA G L + YA PVC+PSR L+TG H ++G
Sbjct: 46 GYGDVGVYGQTKILTPNIDLLAREGTKLTDFYAGAPVCSPSRGVLLTGLNTGHATIRGNM 105
Query: 150 PIWG--AEPRGVPLTER--FLPE------YLRELGYSTKAIGKWHLGFFRREYTPLYRGF 199
I G A +G R LP+ L+ GY+T IGKWH+ F TP GF
Sbjct: 106 AIKGGLAGNKGSDKIRRAGLLPQDSTIGDLLQRSGYTTGLIGKWHVDGFDTLATPQRHGF 165
Query: 200 ESHFGYLNGVISYYDHILS----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
+ +G+L +SY S D + R +L +N + Y +++ T +A
Sbjct: 166 DYFYGWL---VSYPQTYASTYWPDTWYRDGKLVAVPQNQNGQKNY-----YTSEIITDDA 217
Query: 256 VQ-LIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
++ L + + KP FL + H H+ L+AP TI + + T +++TYAAMV L
Sbjct: 218 IRYLAKHKKSKKPFFLMVNHSNPHS-----PLDAPHNTIYEGKDWTS-DQKTYAAMVTYL 271
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNG---APTVEYRETSNYRNWGSNYPYRGVKNTL 371
D SVG + L G+ +N+I+IF SDNG PT S++ + S+ P RG K +
Sbjct: 272 DSSVGKIRKYLIESGLDKNTIVIFTSDNGPRSEPTTSLTAVSDF--FHSSGPLRGYKRDV 329
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
EGG++VP I+W+ ++ VS + +D LPTL AG
Sbjct: 330 TEGGIRVPMIVWNKNRIKSDAVSHTPGYFADILPTLAGIAG 370
>gi|126663704|ref|ZP_01734700.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
gi|126624287|gb|EAZ94979.1| putative arylsulfatase [Flavobacteria bacterium BAL38]
Length = 468
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 118/380 (31%), Positives = 186/380 (48%), Gaps = 46/380 (12%)
Query: 80 TKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLM 136
TK + ++ G+N+L +G I TPNID LA G+ +N Y +C PSR +LM
Sbjct: 26 TKKPNIVFILADDMGYNELGSYGGKIIETPNIDQLAKEGMKFSNHYCGSNICAPSRGTLM 85
Query: 137 TGKYPIHTGMQGP---PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
TGK+ H ++ P G EP +P +E + E L+ GY+T A GKW LG+ E +
Sbjct: 86 TGKHTGHAYIRDNKPLPYEGNEP--IPASEITVAEILKTAGYTTGAFGKWGLGYPASEGS 143
Query: 194 PLYRGFESHFGYLNGVISYYDHILSDQYSRT---VELNGHDMRRNLSTAWDTVGEYATDL 250
P +GF+ +GY NG I +++ S Y R VELN N+ + Y+ D+
Sbjct: 144 PNNQGFDQFYGY-NGQIHAHNYFTS--YLRKNDLVELNA-----NIDAPYSV---YSADI 192
Query: 251 FTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQY-----ITDPNRR 305
A++ +E + P FLY H + + +T+ + I D +
Sbjct: 193 IKDRALEFVEVNK-NNPFFLYFCPTLPH----NPYHQPDDKTLEYYAKKTGFPIGDAHSE 247
Query: 306 -----TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
YAA+ +LD VG +++ L+ +L+N++IIF SDNG+ +E +Y G
Sbjct: 248 EFSVPKYAALSSRLDQQVGEIMAKLKELNLLDNTLIIFASDNGSALT--KEEDSYLRTGG 305
Query: 361 NYPYRGVKNTLWEGGVKVPAI-LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
+ RG K+ ++EGG+K P I W +I + HIS + L T A ++ P
Sbjct: 306 D--LRGRKSEVYEGGIKSPLIAFWKGKIIPGSSSN----HISAFWDFLPTCAEIVKAKTP 359
Query: 420 LNIDGLDQWSSLLLNTPSRR 439
NIDG+ +LL T +++
Sbjct: 360 DNIDGISYLPTLLGKTDNQK 379
>gi|284041117|ref|YP_003391047.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820410|gb|ADB42248.1| sulfatase [Spirosoma linguale DSM 74]
Length = 486
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 128/429 (29%), Positives = 211/429 (49%), Gaps = 53/429 (12%)
Query: 78 ALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRASLM 136
A + L + G+ DLS G+ + TPN+D +A G N + AQ VC+ SRA+L+
Sbjct: 35 ATPPNVVLFFMDDLGYGDLSVTGALDYTTPNLDKMAAEGTRFTNFLAAQAVCSASRAALL 94
Query: 137 TGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLY 196
TG YP G+ G + P G+ E L E L+E GY+T GKWHLG +++ P+
Sbjct: 95 TGCYPNRLGLYGA-LGPNSPIGLNPNEETLAELLKERGYATGMFGKWHLGD-NKQFLPMQ 152
Query: 197 RGFESHFG--YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKE 254
+GF+ ++G Y + + + +Y ++G++ + D T T++
Sbjct: 153 QGFDEYYGVPYSHDMWPLHPAQAQAKYPPLRWIDGNEPGPEIKDLNDA--GKITGTITEK 210
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
AV I + KP FLY+ H H P T +F+ R + ++ +L
Sbjct: 211 AVSFIRNHK-KKPFFLYVPHPLPHV---------PLATSARFK--GQSARGIFGDVLTEL 258
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D SVG +++ L+++G+ +N+++IF+SDNG P + Y + + GS+ +R K T +EG
Sbjct: 259 DWSVGQIMNELKQQGLDKNTLVIFISDNG-PWLNYGDHA-----GSSGGFREGKGTSFEG 312
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN-IDGLDQWSSLLL 433
G +VP ++ P + RVS +++ D LPT+ G +RLP IDG+D W +LL
Sbjct: 313 GHRVPCLVRWPGVVPAGRVSNKLLTALDILPTVANVCG---ARLPKQRIDGVD-WVALL- 367
Query: 434 NTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL---GTQEN 490
+ ++++ D++ R+NS+ AVR WKLV G
Sbjct: 368 ----KGDNSVTPRDKFYYYY-----RKNSL----------EAVRQGDWKLVFAHPGRTYE 408
Query: 491 GTMDGYYGQ 499
G + G G+
Sbjct: 409 GFLPGQGGK 417
>gi|448409003|ref|ZP_21574483.1| sulfatase [Halosimplex carlsbadense 2-9-1]
gi|445673783|gb|ELZ26342.1| sulfatase [Halosimplex carlsbadense 2-9-1]
Length = 496
Score = 145 bits (366), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 120/396 (30%), Positives = 174/396 (43%), Gaps = 56/396 (14%)
Query: 41 LAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAALTKSTTLTLLIVY----GWNDL 96
L A +G AG+ P T N Y +++VY GW D
Sbjct: 16 LGAGVALSGCAGRRETTPSAT----------NAPDYPGAPLPERPNIVVVYVDDLGWLDT 65
Query: 97 SFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKY-PIH-----TGMQGP 149
F GS TPN+DALA G++ + YA P C PSRA TG+Y P H T
Sbjct: 66 GFQGSQYYDTPNLDALADEGVVATSGYANAPNCAPSRACFNTGQYTPRHGVGTVTDPPPR 125
Query: 150 PIWGAEPRGVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
P +ER + E L + GY+T +GKWHLG + PL RGF+ +
Sbjct: 126 RPEARRLETPPNSERLGTDNATIAEVLSDAGYATGFVGKWHLGEADSRHGPLARGFDRNV 185
Query: 204 GYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
G D R G+ L D G Y D T +A++ + D
Sbjct: 186 G-------------GDIVGRPPA--GYFSPYRLPNVTDD-GGYLPDRLTDDAIEFVADHA 229
Query: 264 VD-KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR---TYAAMVKKLDDSVG 319
P FL + + H L+AP + ++ ++ + N + TY AMV LD +VG
Sbjct: 230 GGPDPFFLLFSTYSVHG-----PLQAPDDAVDAYRDVDCWNGQCNPTYGAMVSNLDANVG 284
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWG---SNYPYRGVKNTLWEGGV 376
+++AL +G+ E ++++F SDNG +Y E R+W S P RG K TL+EGG+
Sbjct: 285 RLLAALDERGIREETLVVFTSDNGG-LGDYDEIGLPRDWKRVTSQGPLRGGKRTLYEGGI 343
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+VP ++ P + R S + +D+ PT AG
Sbjct: 344 RVPTVVSWPGVVPGGRRSSTPIAGTDFFPTFVDLAG 379
>gi|87308971|ref|ZP_01091109.1| Arylsulfatase [Blastopirellula marina DSM 3645]
gi|87288314|gb|EAQ80210.1| Arylsulfatase [Blastopirellula marina DSM 3645]
Length = 482
Score = 145 bits (365), Expect = 7e-32, Method: Compositional matrix adjust.
Identities = 118/365 (32%), Positives = 178/365 (48%), Gaps = 36/365 (9%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQ--- 147
G+ +L +G I TPNID LA GI + Y+ VC PSR LMTGK+P H ++
Sbjct: 35 GYAELGCYGQKWIKTPNIDRLAAEGIRFTDFYSGNAVCAPSRCCLMTGKHPGHAYVRNNF 94
Query: 148 ------------GPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPL 195
G W + R + E + E L++ GY+T AIGKW LG F P
Sbjct: 95 AQKKLPAESLALGLTDWTGQ-RPLASEEVTIAEVLKQRGYATAAIGKWGLGQFGTTGDPN 153
Query: 196 YRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
+GF+ +G+ N ++H Y + + N + + T ++ D FTK A
Sbjct: 154 QQGFDLFYGF-NCQTHAHNH-----YPKFLMRNREKEVQPGNNRGATGETFSQDQFTKVA 207
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAG------NAGKHLEA-PQETINQFQYITDPN-RRTY 307
++ I DQ DKP FLYL H + ++ A P+E YI P R Y
Sbjct: 208 LEFI-DQNKDKPFFLYLPFAIPHLSIQVPDESTAEYTNAIPEEKYVHKGYIQHPTPRAGY 266
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
AAM+ +D +G ++S ++ G+ ++++IF SDNGA T + S+ + S RG+
Sbjct: 267 AAMITHMDRDIGKILSKIETLGLDGDTLVIFTSDNGA-TYDRLGGSDSDFFESVANLRGL 325
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
K +L+EGGV+VP + P ++ +VS + D LPT+ A ++ P IDGL
Sbjct: 326 KGSLYEGGVRVPLVARWPGKIKSGQVSHVPSALWDMLPTI---ADITETKAPQMIDGLSL 382
Query: 428 WSSLL 432
+LL
Sbjct: 383 APTLL 387
>gi|405952520|gb|EKC20320.1| Arylsulfatase B [Crassostrea gigas]
Length = 500
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 92/267 (34%), Positives = 143/267 (53%), Gaps = 34/267 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
G ND+ ++ E+ TPN+D LA NG+IL + Y PVC+PSRA+ MTG+Y G Q P+
Sbjct: 41 GNNDIGYNNP-EVDTPNLDNLANNGVILESNYVYPVCSPSRAAFMTGRYAHKIGFQRGPV 99
Query: 152 WGAEPRGVPLTERFLPEYL-RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
+P + + + E L GY+ IGKWHLG+ + TP RGF+S +G+ G
Sbjct: 100 EHKQPAYIESNYKTVAEKLTTNYGYAAHMIGKWHLGYCKDAVTPTNRGFDSFYGFYGGQE 159
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAW--------DTVGEYATDLFTKEAVQLIEDQ 262
+YY + S +Y D R NL+ + V Y+T + K A++++ +
Sbjct: 160 NYYTYT-SARYK--------DFRDNLTAVTPQNPNYPREDVDGYSTFEYKKRAIEIVGNH 210
Query: 263 PVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ---YITD-------PNRRTYAAMVK 312
PLFLYLA A H+ L+APQE +++ TD +R T+ AMV
Sbjct: 211 DKSVPLFLYLAFQAPHS-----PLQAPQELKDKYPGDWTYTDWDGSTKTTSRATFYAMVT 265
Query: 313 KLDDSVGTVISALQRKGMLENSIIIFM 339
+D +VG V + ++ +G+L+N+II+ +
Sbjct: 266 AVDQAVGEVYTEMETQGLLDNAIIVLL 292
Score = 42.7 bits (99), Expect = 0.59, Method: Compositional matrix adjust.
Identities = 26/82 (31%), Positives = 44/82 (53%), Gaps = 17/82 (20%)
Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLL 454
+ ++H SDWLPT + AGG ++ L +ID ++IDG+DQ LL
Sbjct: 289 IVLLHASDWLPTFISLAGG-SADLTTDID----------------LTSIDGVDQTQMLLN 331
Query: 455 NTPSRRNSVLINIDEKKRTAAV 476
N S R+ +++NI + +R + +
Sbjct: 332 NGESARSKIILNIYDLERPSTI 353
>gi|431797524|ref|YP_007224428.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
gi|430788289|gb|AGA78418.1| arylsulfatase A family protein [Echinicola vietnamensis DSM 17526]
Length = 442
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 110/331 (33%), Positives = 158/331 (47%), Gaps = 39/331 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D SF + TPNID +A G N Y VC+PSRA L++G G
Sbjct: 35 GYADFSFQSTKLTTTPNIDRIAQEGAKFTNAYVTASVCSPSRAGLLSGTNQPEFGHIYNY 94
Query: 151 IWGAE------PRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG 204
I G E G+P + + EYL+ LGY+T A GKWH GF E+ P +GF+ +G
Sbjct: 95 IKGVEYSIQQHEYGIPHNIKTIGEYLQPLGYATAAFGKWHEGF-AEEFQPEQKGFDHFWG 153
Query: 205 YLNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+L G Y+ + + D+ + T Y TD T EA+
Sbjct: 154 FLWGANHYHT-------GKAI-----DVIDDGKPVDPTTIPYMTDAITDEALSFASAHEE 201
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ--YITDPNRRTYAAMVKKLDDSVGTVI 322
D P FLY+A+ A H+ LEA E I ++ + DP R AAM LD++VG +
Sbjct: 202 D-PFFLYIAYNAVHSP-----LEAKPEHIALYEGKFPDDPKREILAAMTHSLDENVGKIF 255
Query: 323 SALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AI 381
+ L+ G L+N+II F++DNG + S N P +G K L+EGG++VP A+
Sbjct: 256 NRLEAMGELDNTIIFFINDNGGS----KHIS-----ADNSPLKGYKGQLYEGGIRVPFAV 306
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
W +I + V + D LP++ A G
Sbjct: 307 RWPGKI-KGKTVCHSITSSLDILPSILDAVG 336
>gi|417305609|ref|ZP_12092563.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327538042|gb|EGF24732.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 1004
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 139/484 (28%), Positives = 209/484 (43%), Gaps = 68/484 (14%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQ 125
++ +R A TK + +L G+ DLS G+ + TP ID +A G L + Y A
Sbjct: 549 VSPADRAKTAGTTKPNFIVILTDDQGYGDLSCFGAKHVDTPRIDQMAAEGSRLTSFYVAA 608
Query: 126 PVCTPSRASLMTGKYPIHTGMQGPPIWG----AEPRGVPLTERFLPEYLRELGYSTKAIG 181
PVCTPSRA LMTG YP M +G +P+G+ E + E L GY T G
Sbjct: 609 PVCTPSRAGLMTGCYPKRIDMAMGSNFGVLLAGDPKGLHPDEITIAEVLNTAGYRTGMFG 668
Query: 182 KWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM---RRNLST 238
KWHLG + E+ P +GF+ FG Y H + + R + + + +
Sbjct: 669 KWHLGD-QPEFLPTKQGFDEFFGIP------YSHDIHPFHPRQNXYHFPSLPLLQNDTVI 721
Query: 239 AWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHA----------GNAGKHLEA 288
D ++ T T++AV IE D+P FLYL H HA A + A
Sbjct: 722 EMDPDADFLTKRLTEQAVSFIERN-KDQPFFLYLPHPIPHAPLHASPPFMEDVADDVIAA 780
Query: 289 PQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
++ Y T N + + ++D SVG ++ AL+ G+ E ++++F SDNG P
Sbjct: 781 IEKEDGNIDYATRAN--LFRQAIAEIDWSVGQILDALRSNGLDEKTMVLFTSDNGPP--- 835
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
+ + Y + G RG K T +EGG++ P ++ W QI + ++M D LPT
Sbjct: 836 --KNTLYASPGE---LRGHKGTTFEGGMREPTVVRWPGQIPAG-HQNDELMTAMDLLPTF 889
Query: 408 YTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINI 467
AG P+ R IDG D W +L T + ++ +
Sbjct: 890 AKLAGA--------------------AIPTDR--VIDGKDIWPTLKGETQTPHDAFFYH- 926
Query: 468 DEKKRTAAVRLDSWKLVLGTQENGTMDGYYG--QTRSNKVPLLNFNAIVESKTYQSLQQL 525
+ AAVR WKL + NG Y KV ++ N V K + L+
Sbjct: 927 -RGNQLAAVRSGKWKLHV---NNGVAKQLYDLENDLGEKVNVIETNPEVVKKLQRQLKDF 982
Query: 526 SQNI 529
+ +I
Sbjct: 983 AADI 986
Score = 105 bits (263), Expect = 5e-20, Method: Compositional matrix adjust.
Identities = 119/462 (25%), Positives = 186/462 (40%), Gaps = 79/462 (17%)
Query: 75 TYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPS 131
T A + + L+ V G+ D+ +G+ ++ TPNID LA G + + A VCTPS
Sbjct: 24 TSVAAERPPNVVLIFVDDLGYGDVGCYGATKLSTPNIDRLAAEGRRFTDAHSASAVCTPS 83
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPL----TERFLPEYLRELGYSTKAIGKWHLGF 187
R L+TG+YP+ M G IWG P L + + + + GY+T +GKWHLGF
Sbjct: 84 RYGLLTGQYPVRA-MGGKGIWGPLPTTSGLIIDTNTQTIGKVFKNKGYATACLGKWHLGF 142
Query: 188 FRREYT--------PLYRGFESHFGY--LNGVISYY----DHILSDQYSRTVELNGHDMR 233
P GF+ +FG +N Y D I S + G +
Sbjct: 143 KEEPCDWQVPLRPGPQDVGFDHYFGVPLVNSGSPYVYVNDDSIFGYDPSDPLVYGGKPVS 202
Query: 234 --------------RNLSTAWDTVGEYATD----LFTKEAVQLIEDQPVDKPLFLYLAHL 275
S A Y + L T+ AV+ I ++ D+P FLY A
Sbjct: 203 PTPMFPEEASVKSPNRFSGALKAHEIYDDEKTGTLLTERAVKWITEK-KDEPFFLYFATP 261
Query: 276 AAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSI 335
H P +F+ + Y V +LD VG ++ +L+ G+ +N++
Sbjct: 262 NIH---------HPFTPAPRFKGTSQCG--LYGDFVHELDWMVGEIVQSLEDNGLTDNTL 310
Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVS 394
++F SDNGA + N G K +WEGG +VP I W +I+
Sbjct: 311 VLFTSDNGAMLNRAGRDAIKAGHQPNGELLGFKFGVWEGGHRVPLIAKWPGKIK------ 364
Query: 395 LQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLL-LNTPSRRNSNIDGLDQWSSLL 453
AG + +L +D +S+L PS D ++ +LL
Sbjct: 365 ----------------AGTQSDQLISQVDLFATFSALTEQEMPSSEQK--DSINMLPALL 406
Query: 454 LNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDG 495
+ + L+ + R A+R W L +G + +G +G
Sbjct: 407 DDPNEPLRTELVLAPRQPRNLAIRKGKW-LYIGARGSGGFNG 447
>gi|449136530|ref|ZP_21771910.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
gi|448884847|gb|EMB15319.1| N-acetylgalactosamine-4-sulfatase [Rhodopirellula europaea 6C]
Length = 480
Score = 145 bits (365), Expect = 8e-32, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 163/339 (48%), Gaps = 48/339 (14%)
Query: 78 ALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRAS 134
A++ L ++I G+ + G+ EIPTP IDALA +G+ + Y C+PSRA
Sbjct: 30 AVSSQPNLVVIIADDLGYGETGMMGNAEIPTPAIDALARSGVRCTSGYVTSSYCSPSRAG 89
Query: 135 LMTGKYPIHTGMQGPPIWGAEPR---GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRRE 191
M+G+Y G P G+P ++ E+L+ GY T IGKWHLG R
Sbjct: 90 FMSGRYQSRFGYDLNPTGERNNHPNAGLPPQQKTFVEHLQSAGYHTSLIGKWHLGT-RPP 148
Query: 192 YTPLYRGFESHFGYLNGVISY-----YDHI---LSDQY--SRTVELNGHDMRRNLST--- 238
P +GF+ FG+L+ Y Y+++ L D + E N +R N +
Sbjct: 149 QVPTSKGFDRFFGFLHEGHFYVPGPPYENVWTMLRDNSLPAGQFETNQRTIRGNYARINE 208
Query: 239 -AWDTV------GE------YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKH 285
A+DT GE Y TD T +AV I Q P + +++ A H+
Sbjct: 209 PAYDTGNPVLDGGEPIDDWNYLTDTITDKAVDTIS-QAASNPFAMVVSYNAVHS-----P 262
Query: 286 LEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
++A E +I DP RR +A M+ LD VG +I L R+ + + ++++F SDNG P
Sbjct: 263 MQASLEDHAAMDHIADPQRRIFAGMLIALDRGVGRIIKKLDRQKLRQETLVVFFSDNGGP 322
Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWS 384
T E SN P RG K +L+EGGV++P I WS
Sbjct: 323 TAELTS--------SNAPLRGGKGSLYEGGVRIPMI-WS 352
>gi|302535739|ref|ZP_07288081.1| arylsulfatase [Streptomyces sp. C]
gi|302444634|gb|EFL16450.1| arylsulfatase [Streptomyces sp. C]
Length = 495
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 108/336 (32%), Positives = 153/336 (45%), Gaps = 42/336 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
GW DLS +GS I TPN+D LA G+ + Y+ C+P+R SL TG+YP T G
Sbjct: 91 GWADLSSYGSPHIKTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPARTRGGLAE 150
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
PI + G+ T L L+E GY+T IGKWH G+ +Y+P G+++ FG G
Sbjct: 151 PI-ADKSAGLDPTHPTLASLLKEAGYATALIGKWHAGYLP-DYSPTKSGWDTFFGNFGGA 208
Query: 210 ISYYDHI----LSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
+ YY + D Y E D+R Y T + T+ A + +
Sbjct: 209 LEYYSKLGLGGEYDLYEGDAEYK--DLR------------YYTRIITERASEYVSRDHGG 254
Query: 266 KPLFLYLAHLAAH-----AGNAGKHLEAPQET----INQFQYITDPNRRTYAAMVKKLDD 316
+P L L H G+ + E + + + R Y MV+ LD
Sbjct: 255 RPWLLNLNFTTPHWPWIADGDTAESAEIVRRIKAGDARALWHQDGGSVRKYKQMVEDLDR 314
Query: 317 SVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGV 376
SVG V+ AL+R G EN++++F SDNG Y N+P G K +L EGG+
Sbjct: 315 SVGEVLQALRRSGQEENTLVVFSSDNGGERFSY-----------NWPLSGNKASLQEGGI 363
Query: 377 KVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
+VP I+ P +VS + DW TL AG
Sbjct: 364 RVPNIVRWPARLDGGQVSRVPVFSPDWTATLLEVAG 399
>gi|32471441|ref|NP_864434.1| arylsulfatase [Rhodopirellula baltica SH 1]
gi|32443282|emb|CAD72113.1| Arylsulfatase [Rhodopirellula baltica SH 1]
Length = 538
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 113/375 (30%), Positives = 173/375 (46%), Gaps = 46/375 (12%)
Query: 70 DPNRRTYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVC 128
+PN + + L + G+ +L +G +I TP +D LA GI L N Y+ VC
Sbjct: 63 EPNAKDATVSRPNIVLIVADDLGYGELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVC 122
Query: 129 TPSRASLMTGKYPIHTGMQ--GPPIWGAEPRG-----------VPLTERFLPEYLRELGY 175
PSR LMTGK+P H ++ G P R +P+ E + EYL+ +GY
Sbjct: 123 APSRCCLMTGKHPGHAHVRNNGDPKIDPAVREALKLEFPGQYPLPVDEVTIAEYLKSVGY 182
Query: 176 STKAIGKWHLGFFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMR 233
T A GKW LG F P +GF+ +G+ ++Y + L + V+ G+D
Sbjct: 183 RTGAFGKWGLGHFGTTGDPNEQGFDLFYGFNCQRHAHNHYPNFLWRNRVKEVQ-PGNDRT 241
Query: 234 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQ 290
+ T Y+ D F EA + I + +P F YL H ++ P+
Sbjct: 242 LHGET-------YSQDQFVNEACEFIRQSVAEDKTQPFFAYLPFAVPHLS-----IQVPE 289
Query: 291 ETINQFQ------------YITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
E ++ + Y+ P R YAAMV ++D+ VG V+ + G+ EN++I+
Sbjct: 290 EEVDAYDGVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQVVDLVDSLGLGENTLIM 349
Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQM 397
F SDNG PT + S+ + S +G+K L EGG++VP I + R S +
Sbjct: 350 FTSDNG-PTYDRLGGSDSDYFNSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWI 408
Query: 398 MHISDWLPTLYTAAG 412
D+LPT+ AAG
Sbjct: 409 GAWWDFLPTITDAAG 423
>gi|449138176|ref|ZP_21773471.1| arylsulfatase [Rhodopirellula europaea 6C]
gi|448883200|gb|EMB13738.1| arylsulfatase [Rhodopirellula europaea 6C]
Length = 516
Score = 145 bits (365), Expect = 9e-32, Method: Compositional matrix adjust.
Identities = 115/371 (30%), Positives = 174/371 (46%), Gaps = 49/371 (13%)
Query: 80 TKSTTLTLLIV---YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASL 135
T S +LIV G+++L +G +I TP +D LA GI L N Y+ VC PSR L
Sbjct: 48 TASRPNIVLIVADDLGYSELGCYGQTKIRTPRLDQLAAEGIKLTNFYSGNAVCAPSRCCL 107
Query: 136 MTGKYPIHTGMQ--GPPIWGAEPRG-----------VPLTERFLPEYLRELGYSTKAIGK 182
MTGK+P H ++ G P R +P+ E + EYL+ +GY T A GK
Sbjct: 108 MTGKHPGHAYVRNNGDPKIDPAVREELKLEFPGQYPLPVDEVTIAEYLKSVGYRTGAFGK 167
Query: 183 WHLGFFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAW 240
W LG F P +GF+ +G+ ++Y L + V+ G+D + T
Sbjct: 168 WGLGHFGTTGDPNEQGFDLFYGFNCQRHAHNHYPKFLWRNRVKEVQ-PGNDRTLHGET-- 224
Query: 241 DTVGEYATDLFTKEAVQLIEDQPVD---KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
Y+ D F EA + I + +P F YL H ++ P+E ++ ++
Sbjct: 225 -----YSQDQFVNEACEFIRQSVAEDTTQPFFAYLPFAVPHLS-----IQVPEEEVDAYE 274
Query: 298 ------------YITDPN-RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA 344
Y+ P R YAAMV ++D+ VG ++ + G+ EN++I+F SDNG
Sbjct: 275 GVIEEADYEHHGYLKHPRPRAGYAAMVTRMDEGVGQIVDLIDSMGLGENTLILFTSDNG- 333
Query: 345 PTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWL 404
PT + S+ + S +G+K L EGG++VP I + R S + D+L
Sbjct: 334 PTYDRLGGSDSDYFDSASGMKGLKGQLDEGGIRVPMIARQTGVVPAGRTSDWIGAWWDFL 393
Query: 405 PTLYTAAGGDT 415
PT+ AAG D
Sbjct: 394 PTITDAAGVDV 404
>gi|294633612|ref|ZP_06712170.1| twin-arginine translocation pathway signal [Streptomyces sp. e14]
gi|292830254|gb|EFF88605.1| twin-arginine translocation pathway signal [Streptomyces sp. e14]
Length = 490
Score = 145 bits (365), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 157/334 (47%), Gaps = 38/334 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
GW DLS +G+ I TPN+D LA G+ + Y+ C+P+R SL TG+YP T G
Sbjct: 85 GWADLSCYGAPHIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPGRTKGGLAE 144
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
PI G+ +G+ + L+ GY+T IGKWH G+ +Y+P G++ FG G
Sbjct: 145 PI-GSRTQGLDPHHPTVASLLKRAGYATALIGKWHCGWLP-DYSPTKSGWDEFFGNFGGA 202
Query: 210 ISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLF 269
+ Y+ S+ +L +D+ +T D Y T + T+ AV+ + + D+P
Sbjct: 203 LEYF--------SKLGQLGDYDLYEGDATYQDL--RYYTTVLTERAVEYVGRRH-DRPWL 251
Query: 270 LYLAHLAAH-----------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSV 318
L L H A + A + + + Y MV+ LD S+
Sbjct: 252 LNLNFTTPHWPWLTEDDEETAAGIAADIRA-GNVLGALMHTDGGSVEKYKEMVENLDASI 310
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G V++AL+R G EN++++F SDNG Y+ +P G K+ L EGG++V
Sbjct: 311 GEVLAALRRSGQEENTVVVFASDNGGERFSYQ-----------WPLSGGKSELLEGGIRV 359
Query: 379 PAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
P +L P +VS ++ DW TL G
Sbjct: 360 PTVLRWPAAVDGHQVSRTPVYSPDWTATLLELGG 393
>gi|423348778|ref|ZP_17326457.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
gi|409212738|gb|EKN05774.1| hypothetical protein HMPREF1060_04129 [Parabacteroides merdae
CL03T12C32]
Length = 461
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 104/308 (33%), Positives = 156/308 (50%), Gaps = 36/308 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F G+ +I TPNID LA G I + + A V +PSR+ ++TG+Y G +
Sbjct: 42 GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+P E LP L+ Y T IGKWHLG P +GF++ +G L G
Sbjct: 101 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158
Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
SY YD SD+ + + NG R LS +D+ Y TD +A Q + + ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNG----RKLS--FDS---YFTDELASKAQQFVTES--EQP 207
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY++ A H+ N EA +E + +F+ R+ YAAM+ LD VG ++ L+
Sbjct: 208 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +N+II F+SDNG T N SN P +G K +EGG +VP ++W +
Sbjct: 260 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310
Query: 387 IQQNPRVS 394
+++ R +
Sbjct: 311 FKRDQRFT 318
>gi|149196941|ref|ZP_01873994.1| putative arylsulfatase [Lentisphaera araneosa HTCC2155]
gi|149140051|gb|EDM28451.1| putative arylsulfatase [Lentisphaera araneosa HTCC2155]
Length = 469
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 109/370 (29%), Positives = 177/370 (47%), Gaps = 61/370 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DLS +G + TPNID + G++ + Y+ VC PSRA+LMTGK+ H ++G
Sbjct: 32 GYGDLSLYGQKKFSTPNIDRIGKEGMVFTDHYSGSTVCAPSRAALMTGKHSGHGLVRGNY 91
Query: 149 ---PPIWGAEPRGVPL--TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
P +G E +PL + L E ++ GY+T IGKW +G P +GF+ +
Sbjct: 92 EVGPHGFGGE---LPLRPEDVSLAEVMKSAGYATGLIGKWGMGMDGTTGEPRKKGFDYSY 148
Query: 204 GYLNGVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIE 260
G+LN +++ ++I + + N D R G Y +D F ++ ++ +E
Sbjct: 149 GFLNQAHAHHYYPEYIYENGEKLMIPENKDDAR----------GLYISDTFAEKGIEFVE 198
Query: 261 DQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ----------------------- 297
+ DKP FL+ A + HA L P +++N+F+
Sbjct: 199 ENK-DKPFFLFWAFVTPHA-----ELLVPDDSLNEFKGKWPETPFVMGKQGGDGTDNPFG 252
Query: 298 -YIT-DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNY 355
Y + D R ++ M+ +LD VG + L+ G+ +N+II+F SDNG P E ++
Sbjct: 253 VYASQDHPRAAFSGMITRLDKRVGDLFDKLEELGIDDNTIIMFSSDNG-PHKEGGADPDF 311
Query: 356 RNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDT 415
+ SN G K L EGG++VP ++ P + + S D +PT+ A D+
Sbjct: 312 --FDSNAELTGYKRDLTEGGIRVPFMVRWPNVVKARSKSSHASAFWDVMPTIAEIANTDS 369
Query: 416 SRLPLNIDGL 425
P +IDGL
Sbjct: 370 ---PEDIDGL 376
>gi|372208819|ref|ZP_09496621.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 459
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/382 (30%), Positives = 180/382 (47%), Gaps = 71/382 (18%)
Query: 81 KSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMT 137
K T + L +V GW D S GS+ TP++D L +G++ N YA VC+P+RAS++T
Sbjct: 12 KKTNVILFLVDDMGWTDASCLGSDLYKTPHLDKLVSDGMMFTNGYASSTVCSPTRASIVT 71
Query: 138 GKYPIH-------TGMQGP------PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWH 184
GKYP G Q P P W + + + E + E ++ Y T +GKWH
Sbjct: 72 GKYPASLRLTDWIEGHQKPFAKMKIPKWNQK---LDIEEVTIAEMFKKNHYKTIHLGKWH 128
Query: 185 LGFFRREYTPLYRGFESHFG-------YLNGVISYYDHILSDQYSRTVELNGHDMRRNLS 237
LG +Y P ++GF+ + G +G Y+ Y NG+
Sbjct: 129 LGE-DPKYWPEHQGFDINIGGYSAGSPKADGGQGYFS-----PYHNPRMSNGNK------ 176
Query: 238 TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
GEY TD EAV I+ KP F+ L H L+A +E + +++
Sbjct: 177 ------GEYLTDRLANEAVSFIKKNKT-KPFFMNLWLYNVHMP-----LQAKKEVVQKYK 224
Query: 298 YITDP----NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGA---PTVEYR 350
+ N TYAAM++ +DD++G V+ AL+ G+ N+I++F SDNG PT
Sbjct: 225 SKINKGMTHNNATYAAMIEHMDDALGAVVQALKETGLENNTIVVFHSDNGGHLKPT---- 280
Query: 351 ETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTA 410
SN P R K L+EGGV+VP + P + VS ++ +D +PTL +
Sbjct: 281 ---------SNAPLRSGKGDLYEGGVRVPLVFKWPLKIKAGAVSNELAISADIMPTLLSL 331
Query: 411 AGGDTSRLPLNIDGLDQWSSLL 432
+ ++P +DG+D +LL
Sbjct: 332 V-NNKYKMPKGVDGIDLSPALL 352
>gi|227540350|ref|ZP_03970399.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
gi|227239674|gb|EEI89689.1| arylsulfatase [Sphingobacterium spiritivorum ATCC 33300]
Length = 473
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/430 (30%), Positives = 203/430 (47%), Gaps = 67/430 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G +I TPNID LA G+ YA VC PSR+SLMTG++ HT ++G
Sbjct: 40 GYGDLGAYGQKKIQTPNIDRLAAEGMKFEQFYAGTSVCAPSRSSLMTGQHTGHTYIRGNR 99
Query: 151 IWGAEPRGV-PLTE--RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
EP G PL + + + L++ GY+T A GKW LG P +GF+ +GY
Sbjct: 100 --EIEPEGQQPLADSVQTFVQLLQKAGYATGAFGKWGLGMVGTTGAPDQKGFDEFYGYNC 157
Query: 208 GVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
S+ H+ + + VEL G+D++ + +YA L ++ ++ I D
Sbjct: 158 QRQSHRYFPTHLWHN--DQRVELQGNDLKNKV--------QYAPALIQEQTLRFI-DAHK 206
Query: 265 DKPLFLYLAHLAAHAGNAG---KHLE----------------APQETINQFQYITDPNRR 305
D+P FL++ + HA +G +H + P T+ + + P R
Sbjct: 207 DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFEETPHKGNDYGPGATVTGYASVEKP-RA 265
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNYRNWGSNYPY 364
T+A+MV ++D VG V+ L G+ EN+I+IF SDNGA +RE ++ + S+
Sbjct: 266 TFASMVSRMDAYVGQVLKKLDDLGLTENTIVIFSSDNGA----HREGGADPVFFNSSGGL 321
Query: 365 RGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDG 424
RG K L+EGG++ P I+ P ++ S D +PTL G +R DG
Sbjct: 322 RGYKRDLYEGGIRTPFIVKWPHKVKSHSTSEFTGAFWDVMPTLLDITGAGKARY---TDG 378
Query: 425 LDQWSSLLLNTPSRRNS-------NIDGLDQ------WSSLLLNTPSRRNSV--LINID- 468
+ +LL T ++ DG Q W ++ L +++ +V L N+
Sbjct: 379 VSLVPTLLGKTNEQQQHPYLYWEFQEDGGRQAVRAGDWKAVKLQVKNKQKTVTELYNLKK 438
Query: 469 ---EKKRTAA 475
EKK AA
Sbjct: 439 DPAEKKNVAA 448
>gi|325107642|ref|YP_004268710.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
gi|324967910|gb|ADY58688.1| N-acetylgalactosamine-6-sulfatase [Planctomyces brasiliensis DSM
5305]
Length = 749
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 124/417 (29%), Positives = 187/417 (44%), Gaps = 62/417 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G+ E+ TP IDALA G + YA P+C+PSRA L+TG YP G
Sbjct: 40 GYYDLGCYGATEVETPEIDALAAEGTRFTDYYAAAPICSPSRAGLLTGCYPRRVGNH--- 96
Query: 151 IW---GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
IW G+ E L E + GY+T IGKWHLG F + P +GF+ +FG L+
Sbjct: 97 IWVHRADSDTGIHPNELTLAELFHQNGYATACIGKWHLG-FHEPFLPQNQGFDHYFGLLH 155
Query: 208 GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
+ +Q + N ++R A T +T EA+ +E Q D+P
Sbjct: 156 NLDPVETVYFEEQGGVPLLRNDQVVQRPADPA------ELTKQYTDEAISWME-QHRDQP 208
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
FLYL H H P ++F+ N Y +++LD ++G ++ L+R
Sbjct: 209 FFLYLPHTMLH---------NPLGVSDEFR--GSSNWGEYGDAIQELDHNMGRLMQTLKR 257
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQI 387
+ +N+++++ SDNG + +G K + WE G++VPAI W P +
Sbjct: 258 LEIDKNTVVVYASDNGRGPGRTDQQK----------LQGRKLSTWEAGIRVPAIAWGPGL 307
Query: 388 Q-QNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN--IDGLD----------------QW 428
Q+ VS ++ DW PTL + AG + +P + IDG D Q
Sbjct: 308 GLQSGAVSSTVVSAMDWYPTLASLAGFE---VPADRVIDGRDFSPLLKGESKIVPPAGQK 364
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVL 485
S+ P +R+ N G +W L+ + + AAVR WKL L
Sbjct: 365 LSINAAVPLQRSWNPAG--EWKDLV--NRYEYSEAFFYHGAQGELAAVRWRDWKLFL 417
>gi|430746414|ref|YP_007205543.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
gi|430018134|gb|AGA29848.1| arylsulfatase A family protein [Singulisphaera acidiphila DSM
18658]
Length = 590
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/334 (30%), Positives = 156/334 (46%), Gaps = 51/334 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW DLS HG+ + TPNID+LA +G + Y PVC P+RA +TG+Y G++G
Sbjct: 35 GWGDLSVHGNTNLKTPNIDSLARDGALFERFYVCPVCAPTRAEFLTGRYHPRGGVRGVTS 94
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
G + L E+ + E + GY+T A GKWH G + Y P RGF+ ++G+ +G
Sbjct: 95 GGER---LDLNEKTIAETFKSAGYATGAFGKWHNG-TQFPYHPNARGFDEYYGFTSGHWG 150
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
Y + R V+ NG + TD T A+ I+ D+P F Y
Sbjct: 151 EYFDPPLEHNGRPVQGNG----------------FITDDLTDHAISFIKASK-DRPFFCY 193
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYIT------DPNR------RTYAAMVKKLDDSVG 319
L H+ ++ P ++F+ D R R AM + +D++VG
Sbjct: 194 LPFNTPHS-----PMQVPDRFYDKFKNAALKLRAHDAEREDLMMTRAALAMCENIDENVG 248
Query: 320 TVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP 379
V+ L+ + + +I+++ SDNG +W N RG K + EGGV+ P
Sbjct: 249 RVLKTLEDLSLDQKTIVLYFSDNGP-----------NSWRWNGGMRGRKGSTDEGGVRSP 297
Query: 380 AIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
++ W QI+ RV+ ++ D LPTL AG
Sbjct: 298 LLIRWPKQIRPETRVA-KISAAIDLLPTLTDLAG 330
>gi|154493323|ref|ZP_02032643.1| hypothetical protein PARMER_02660 [Parabacteroides merdae ATCC
43184]
gi|423346574|ref|ZP_17324262.1| hypothetical protein HMPREF1060_01934 [Parabacteroides merdae
CL03T12C32]
gi|423723129|ref|ZP_17697282.1| hypothetical protein HMPREF1078_01342 [Parabacteroides merdae
CL09T00C40]
gi|154086533|gb|EDN85578.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
gi|409219725|gb|EKN12685.1| hypothetical protein HMPREF1060_01934 [Parabacteroides merdae
CL03T12C32]
gi|409241554|gb|EKN34322.1| hypothetical protein HMPREF1078_01342 [Parabacteroides merdae
CL09T00C40]
Length = 483
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 128/426 (30%), Positives = 196/426 (46%), Gaps = 74/426 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G + TPNID LA +G+ Y+ V PSR+ L+TG + HT ++G
Sbjct: 39 GYGDLSCYGQEKFETPNIDKLAQSGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTPVRGN- 97
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+P G +P R + E +++ GY T A GKW LG+ P +G ++ +GY
Sbjct: 98 -LELDPEGQFPLPDDARTIFEVMKDAGYKTSAFGKWGLGYIGSTGDPKNQGCDTFYGYNC 156
Query: 208 GVI--SYY-DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
++ SYY DH+ + + VEL + NL+ + G Y+ DL +A++ +++
Sbjct: 157 QLLAHSYYPDHLWDN--DKRVELEDN----NLNVQYGK-GTYSQDLIHGKALEYLDNMNP 209
Query: 265 DKPLFLYLAHLAAHAGNAGKHLEAPQETINQ-FQYI--------TDPN------------ 303
D+P F++ + HA L P+++I Q F+ + T+P
Sbjct: 210 DEPFFMWYPTIIPHA-----ELIVPEDSIIQKFRGMYPEKPYKGTEPGSPAFRKGGYCSQ 264
Query: 304 ---RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
T+AAMV +LD VG +I L+ KG+ +N+IIIF SDNG P +E ++ N S
Sbjct: 265 FHPHATFAAMVYRLDVYVGQIIQKLKDKGLYDNTIIIFASDNG-PHMEGGADPDFFN--S 321
Query: 361 NYPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP 419
N YRG K L+EGG++VP I+ W +Q N M D LPT
Sbjct: 322 NGIYRGYKRDLYEGGIRVPMIISWPGHVQPNTETDF-MCSFWDVLPTFEE---------- 370
Query: 420 LNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL-INIDEKKRTAAVRL 478
+ P + +DG+ L + + L E AVR
Sbjct: 371 -------------IIHPKAKQKEMDGVSMLPLLENRKGQKEHEFLYFEFQELNGRQAVRK 417
Query: 479 DSWKLV 484
WKLV
Sbjct: 418 GPWKLV 423
>gi|47216038|emb|CAG11369.1| unnamed protein product [Tetraodon nigroviridis]
Length = 474
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 118/406 (29%), Positives = 184/406 (45%), Gaps = 71/406 (17%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G TPN+D LA G+ + Y PVC+PSRASL+TG+Y +G+
Sbjct: 33 GFGDLGCYGHPTSLTPNLDGLAAGGLRFTDFYCTSPVCSPSRASLLTGRYQTRSGVYPGV 92
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++ G+PL E + E L+ GY+T A+GKWHLG + T + FG L V
Sbjct: 93 LYPGSRGGLPLNETTIAEVLKPRGYATAAVGKWHLGGPCQNLTCFPPDVKC-FG-LCDVG 150
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFL 270
+ ++ D+ + +N D+ + S ++A D T A + +P FL
Sbjct: 151 TVTVPLMHDEVIKQQPVNFLDLEKAYS-------DFAKDFITTSAKR-------KQPFFL 196
Query: 271 YLAHLAAH------AGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISA 324
Y H G AGK L P + + + D ++G++++
Sbjct: 197 YFPSHHTHYPQYAGPGAAGKSLRGP-----------------FGDALLEFDQTIGSLLAT 239
Query: 325 LQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAIL-W 383
L+R G++ N++I F SDNG E G+ P R K T +EGG++ PAI W
Sbjct: 240 LERTGVINNTLIFFTSDNGP------ELMRMSRGGNAGPLRCGKGTTYEGGMREPAIAYW 293
Query: 384 SPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLP-LNIDGLDQWSSLLLNTPSRRNSN 442
IQ P V+ +M D LPT + AG ++LP + +DG+D + L S+R +
Sbjct: 294 QGLIQ--PGVTHEMASTLDILPTFASLAG---AKLPQVMLDGVDMTNILFSQGKSKREAM 348
Query: 443 IDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
+ PS +N + A+RL+ +K TQ
Sbjct: 349 M--------FYPTDPSEKNGLF----------AIRLEKYKAHFYTQ 376
>gi|386381787|ref|ZP_10067486.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
gi|385670744|gb|EIF93788.1| sulfatase, partial [Streptomyces tsukubaensis NRRL18488]
Length = 426
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/339 (32%), Positives = 155/339 (45%), Gaps = 48/339 (14%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHT-GMQGP 149
GW DLS +G+ EI TPN+D LA G+ + Y+ C+P+R SL TG+YP T G
Sbjct: 23 GWADLSSYGAPEIRTPNLDRLARQGVRFTDAYSGSATCSPTRFSLYTGRYPGRTKGGLAE 82
Query: 150 PIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGV 209
PI G+ T L LR+ GY T IGKWH G+ +Y+P G+E FG G
Sbjct: 83 PI-ADRSVGLEPTHPTLASLLRDAGYRTALIGKWHCGYLP-DYSPTRSGWEEFFGNFGGA 140
Query: 210 ISYYDHILS----DQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
+ YY + S D Y E D+R Y T + T+ AV+ +
Sbjct: 141 LEYYSKLSSGGEYDLYEGDAEY--RDLR------------YYTRILTERAVEYVGRDHGR 186
Query: 266 KPLFLYLAHLAAH-AGNAGKHLEAPQETINQFQYITDPNRRT-----------YAAMVKK 313
P L L H A E E + + I +RR Y +V+
Sbjct: 187 TPWLLNLNFTTPHWPWIADGDEETSAEIVRR---IKAGDRRALWHQDGGSLAKYRQLVED 243
Query: 314 LDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWE 373
LD SVG V++AL+R G E++++ F SDNG Y N+P G K +L +
Sbjct: 244 LDRSVGKVLAALKRSGQEEDTLVFFASDNGGERFSY-----------NWPLSGNKGSLLD 292
Query: 374 GGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
GG++VP+IL P + +VS ++ DW TL G
Sbjct: 293 GGIRVPSILRWPARIEGGQVSGVPVYSPDWTATLLDVGG 331
>gi|307726728|ref|YP_003909941.1| sulfatase [Burkholderia sp. CCGE1003]
gi|307587253|gb|ADN60650.1| sulfatase [Burkholderia sp. CCGE1003]
Length = 527
Score = 144 bits (364), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/440 (29%), Positives = 196/440 (44%), Gaps = 62/440 (14%)
Query: 92 GWNDLSFHGSNEI---PTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQG 148
GW DL +G + PTPN+D +A G+ N Y Q CT RAS +TG+ PI T +
Sbjct: 49 GWGDLGSYGGGTMRGAPTPNLDRMAAEGMRFLNYYGQASCTAGRASFITGRQPIRTSLSS 108
Query: 149 PPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF---GY 205
+ G +P G+ + E+L++ GY+T +GKWHLG + E P GF+ + Y
Sbjct: 109 VLVPG-DPNGLTKQTPTVAEFLKKAGYTTVQLGKWHLG-DKPENYPTAHGFDEMYDMLAY 166
Query: 206 LNGVISYYDHIL-SDQYSRTVELNGHDMRRNLSTAWDTVGE---------------YATD 249
GV +Y + L +D + R N+S G+ A D
Sbjct: 167 YAGVYAYSNRDLHADWPAHDAAFQKVWDRLNVSMVEQKAGQPVKVVKQKFTYEDLATADD 226
Query: 250 LFTKEAVQLIEDQPVDK-PLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYA 308
+AV I+ D+ P F+YL L H N +P+ P Y
Sbjct: 227 NMRAKAVDWIKAHAKDQHPFFMYLNFLKVHNPNN----PSPRWKGKS------PGGGNYL 276
Query: 309 AMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVK 368
+ +LDD+ G ++ A++ G+ EN+++I+ +DNGA V+ + Y P+RG K
Sbjct: 277 DSLMELDDNSGQIVQAIRDLGIAENTLVIWTTDNGA-WVDAWPDAGYT------PFRGEK 329
Query: 369 NTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
T +EGG +VPAI W P + V+ M DW PT + + GG+T P + D
Sbjct: 330 GTPFEGGFRVPAIAWWPGHIKPGSVNTDMFSHMDWWPT-FASLGGET---PPPHEWKDNE 385
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQ 488
++ DG+D SLL P +R + + AVR+ S+K V Q
Sbjct: 386 GKPII---------FDGIDLSDSLLGKGPGKRTDFIYF--SGQSFGAVRVKSFKAVYTAQ 434
Query: 489 ENGTMDGYYGQTRSNKVPLL 508
D + G RS K P +
Sbjct: 435 -----DTWLGPQRSMKAPAI 449
>gi|423341845|ref|ZP_17319560.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
gi|409219938|gb|EKN12897.1| hypothetical protein HMPREF1077_00990 [Parabacteroides johnsonii
CL02T12C29]
Length = 461
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 102/306 (33%), Positives = 154/306 (50%), Gaps = 40/306 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F GS +I TPNID LA G I + + A V +PSR+ ++TG+Y G +
Sbjct: 42 GYADFGFMGSADIQTPNIDRLAAEGRIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGF--FRREYTPLYRGFESHFGYLNG 208
G+P E LP L+ GY T IGKWHLG F+R P +GF++ +G L G
Sbjct: 101 -LDKPGDGIPDDEELLPALLKRYGYRTGCIGKWHLGSKPFQR---PNAKGFDTFYGLLAG 156
Query: 209 VISY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
SY YD SD+ + + NG + + Y TD +A Q + + +
Sbjct: 157 HRSYFYDPETSDKDGNLQQYQYNGQKL---------SFDGYFTDELASKARQFVAES--E 205
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISAL 325
+P LY++ A H+ N EA +E + +F+ R+ YAAM+ LD VG ++ L
Sbjct: 206 QPFMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDEL 257
Query: 326 QRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWS 384
+ G +++II F+SDNG T N SN P +G K +EGG +VP ++W
Sbjct: 258 KAVGKFDDTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWG 308
Query: 385 PQIQQN 390
+ +++
Sbjct: 309 DRFKRD 314
>gi|417303299|ref|ZP_12090357.1| arylsulfatase A [Rhodopirellula baltica WH47]
gi|327540271|gb|EGF26857.1| arylsulfatase A [Rhodopirellula baltica WH47]
Length = 616
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 99/317 (31%), Positives = 149/317 (47%), Gaps = 39/317 (12%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI 151
GW DL+ HG+ +I TP +DALA L+ Y PVC P+RA+L+TG+YP +G+ G
Sbjct: 73 GWGDLAAHGNPKISTPTLDALANKSARLDRFYVSPVCAPTRAALLTGRYPERSGVAGVTG 132
Query: 152 WGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVIS 211
R +T L E R GY T GKWH G + P +GF+ FG+ G +
Sbjct: 133 RREVMRAEEIT---LAELYRSAGYVTGCFGKWHNG-AQMPLHPNGQGFDEFFGFCGGHFN 188
Query: 212 YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLY 271
YD L ++ V+ NG Y TD+ T AV+ I++ D+P F Y
Sbjct: 189 LYDDALLERNGTPVQTNG----------------YITDVLTDAAVEFIQNHH-DRPFFCY 231
Query: 272 LAHLAAHAGNAGKHLEAPQETINQFQYIT-DPNRRTYAAMVKKLDDSVGTVISALQRKGM 330
+ A H + Q+ +++ + D AMV+ +D +V ++ L +
Sbjct: 232 VPFNAPHG-----PFQVRQDLFDRYNDGSIDEKTAAVYAMVQNIDTNVSRLLKCLSDHSL 286
Query: 331 LENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQN 390
E +I++F++DNG + N RG K ++ EGG +VP + P Q
Sbjct: 287 DEETIVVFLTDNGPNGKRF-----------NGGMRGTKGSVHEGGCRVPCFIRWPGNIQP 335
Query: 391 PRVSLQMMHISDWLPTL 407
+S +HI D LPTL
Sbjct: 336 QTISQVAVHI-DLLPTL 351
>gi|393784512|ref|ZP_10372675.1| hypothetical protein HMPREF1071_03543 [Bacteroides salyersiae
CL02T12C01]
gi|392665493|gb|EIY59017.1| hypothetical protein HMPREF1071_03543 [Bacteroides salyersiae
CL02T12C01]
Length = 483
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 111/343 (32%), Positives = 166/343 (48%), Gaps = 40/343 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G + TPNID LA NG+ Y+ V PSR+ L+TG + HT ++G
Sbjct: 45 GYGDLSCYGQEKFDTPNIDQLAQNGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTAIRGNK 104
Query: 151 IWGAEPRG---VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+P G +P R + ++ GY T A GKW LGF P +G + +GY
Sbjct: 105 --ELDPEGQFPLPADARTIFHLFKDAGYKTAAFGKWGLGFIGSTGDPNKQGIDYFYGYNC 162
Query: 208 GVI--SYY-DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
++ SYY DH+ + + +EL + N+ G Y+ DL +A+ ++ +
Sbjct: 163 QLLAHSYYPDHLWEN--DQRIEL-----KDNVLEVEYGKGTYSQDLIHSKALSFLDARKQ 215
Query: 265 DKPLFLYLAHLAAHA-----------GNAGKHLEAPQE-------TINQFQYITDPN-RR 305
++P FL+ + HA GK E P + + Y T N R
Sbjct: 216 EEPFFLFYPTIIPHAELIVPEDSIIKKFRGKFPEVPYKGFEPGSPAFRKGGYCTQLNPRA 275
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYR 365
T+AAM+ +LD VG ++ L+ KGM EN++IIF SDNG P E ++ N SN +R
Sbjct: 276 TFAAMIYRLDVYVGQLVQKLKEKGMYENTLIIFASDNG-PHQEGGADPDFFN--SNGIWR 332
Query: 366 GVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTL 407
G K L+EGG++VP I+ W +Q M D +PT
Sbjct: 333 GYKRDLYEGGIRVPMIVSWPGHVQSGTETDF-MCSFWDLMPTF 374
>gi|300771164|ref|ZP_07081040.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
spiritivorum ATCC 33861]
gi|300761834|gb|EFK58654.1| N-acetylgalactosamine 6-sulfate sulfatase [Sphingobacterium
spiritivorum ATCC 33861]
Length = 473
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 130/431 (30%), Positives = 205/431 (47%), Gaps = 69/431 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DL +G +I TPNID LA G+ YA VC PSR+SLMTG++ HT ++G
Sbjct: 40 GYGDLGAYGQKQIQTPNIDRLAAEGMKFEQFYAGASVCAPSRSSLMTGQHTGHTYIRGNR 99
Query: 151 IWGAEPRGV-PLTERF--LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
EP G PL + + L++ GY+T A GKW LG P +GF+ +GY
Sbjct: 100 --EIEPEGQQPLADSVQTFVQLLQKAGYATGAFGKWGLGMVGTTGAPDQKGFDEFYGYNC 157
Query: 208 GVISYY---DHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
S+ H+ + + VEL+G+D++ + +YA L ++ ++ I D
Sbjct: 158 QRQSHRYFPTHLWHN--DQRVELHGNDLKNKV--------QYAPALIQEQTLRFI-DVHK 206
Query: 265 DKPLFLYLAHLAAHAGNAG---KHLE----------------APQETINQFQYITDPNRR 305
D+P FL++ + HA +G +H + P T+ + + P R
Sbjct: 207 DEPFFLFIPTVLPHAELSGIDDEHYQKYASRFQETPHKGNDYGPGATVTGYASVEKP-RA 265
Query: 306 TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-TSNYRNWGSNYPY 364
T+A+MV ++D VG V+ L G+ EN+I+IF SDNGA +RE ++ + S+
Sbjct: 266 TFASMVSRMDAYVGQVLKKLDDLGLTENTIVIFSSDNGA----HREGGADPVFFNSSGGL 321
Query: 365 RGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNID 423
RG K L+EGG++ P I+ W +++ N D +PTL G +R D
Sbjct: 322 RGYKRDLYEGGIRTPFIVKWPHKVKPNSTSEFAGA-FWDVMPTLLDITGAGKARY---TD 377
Query: 424 GLDQWSSLLLNTPSRRNS-------NIDGLDQ------WSSLLLNTPSRRNSV--LINID 468
G+ +LL T ++ + DG Q W ++ L +++ +V L N+
Sbjct: 378 GVSLVPALLGKTKEQQQHPYLYWEFHEDGGRQAVRAGDWKAVKLQVKNKQKTVTELYNLK 437
Query: 469 ----EKKRTAA 475
EKK AA
Sbjct: 438 KDPAEKKNVAA 448
>gi|408528003|emb|CCK26177.1| hypothetical protein BN159_1798 [Streptomyces davawensis JCM 4913]
Length = 588
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 119/365 (32%), Positives = 175/365 (47%), Gaps = 46/365 (12%)
Query: 77 AALTKSTTLTLLIVY----GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPS 131
A + S L +++ GW +L +G I TP +DALA G+ + YA PVC PS
Sbjct: 44 AVRSGSGRLNFVVILADDLGWGELGSYGQKLIRTPRLDALAAEGLRFTDAYATAPVCAPS 103
Query: 132 RASLMTGKYPIHTGMQGPPIWGAEPRGVPLTER--FLPEYLRELGYSTKAIGKWHLG--F 187
R SL+TG + H+ ++ P WG +G LT+R E LR LGY T IGKW G
Sbjct: 104 RCSLLTGLHAGHSTVRENP-WGPGGQGA-LTDRDFTFAEALRSLGYRTGIIGKWGFGPEL 161
Query: 188 FRREYTPLYRGFESHFGYLNGVIS---YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
+ P RGFE +GY+ + + DH+ + + + N + R
Sbjct: 162 PDQPSHPNSRGFEEFYGYIGHRHAHDYFPDHLWHNGTRQEIPANRGEARE---------- 211
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLA----HLAAHAGNAGKHLEAPQETINQFQYIT 300
YA DL + A+ + D +P LYL H +HA + G++ + P
Sbjct: 212 VYAPDLLKERALAFL-DAHRAEPFLLYLTPTLPHAPSHAPDQGEYADRPWP--------- 261
Query: 301 DPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGS 360
P+RR +AA + D VG V+ L G+ ++++ SDNG P E R +N + +
Sbjct: 262 GPDRR-HAAQITTFDSLVGEVVDRLTALGVAGRTLVLVTSDNG-PHEEGR--TNPELFDA 317
Query: 361 NYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
N P RG K L+EGGV+VP I WSP+ + + ++D LPTL AG P
Sbjct: 318 NGPLRGYKRNLYEGGVRVPLIAWSPR-HVPVGTTDRPTPLTDLLPTLADLAGAPA---PT 373
Query: 421 NIDGL 425
++DGL
Sbjct: 374 DVDGL 378
>gi|372210171|ref|ZP_09497973.1| sulfatase [Flavobacteriaceae bacterium S85]
Length = 651
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 116/407 (28%), Positives = 174/407 (42%), Gaps = 77/407 (18%)
Query: 104 IPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQ-GPPIWGAEPRGVPL 161
IPTP +D LA NGII N + A P C PSRA+LMTG P G+Q P G+PL
Sbjct: 58 IPTPELDQLANNGIICTNGHVAHPFCGPSRAALMTGVQPSRIGVQYNLPNDINTSLGIPL 117
Query: 162 TERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQY 221
E + P+ L++ Y T A GKWHLGF + +Y PL RGF+ FG+L G +Y++ D Y
Sbjct: 118 EETYFPKILQQNNYHTAAFGKWHLGFTQGKYQPLDRGFDYFFGFLGGGKAYFEREYEDLY 177
Query: 222 SR--------TVELNGHDMRRNLSTAWDTVG--EYATDLFT------------------- 252
R T E R+ A D EY TD+ T
Sbjct: 178 YRRLGGSNPVTNEYQDPLQRQRDYVAKDEFNQDEYLTDILTDEAINYIAENKTKSDPFFM 237
Query: 253 ---------------KEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 297
+E Q E+ P + L +L +E + + +
Sbjct: 238 YVAYNAPHTPLQAPAEEIAQFKEENPNFESLVRNSDYLKESGAYELDKVERELKKVKDYD 297
Query: 298 Y-----------ITD----------PNRRTYAAMVKKLDDSVGTVISALQR-KGMLENSI 335
Y + D +R YA MV +D ++G +++ L++ + + +N++
Sbjct: 298 YNSLSSSEKEEKVADYREHLLDKFTESRIVYATMVSNMDKNIGRIVAELKKDQKVFDNTM 357
Query: 336 IIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSL 395
I+F+SDNG T Y N+P +K +++EGG +VP + P ++P+
Sbjct: 358 IVFLSDNGGYT--------YSKGAVNHPLYALKGSVFEGGHRVPFFIHWPNKIKSPKKYK 409
Query: 396 QMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSN 442
+ D PTL AG +DGL +L N R N++
Sbjct: 410 YQISSLDLYPTLVDLAGASVPSTK-KLDGLSFMDKVLDNKQIRPNND 455
>gi|323449079|gb|EGB04970.1| hypothetical protein AURANDRAFT_66827 [Aureococcus anophagefferens]
Length = 918
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 106/357 (29%), Positives = 164/357 (45%), Gaps = 46/357 (12%)
Query: 107 PNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPI----------WGAEP 156
P++ ALA +G+ L N Y Q CTP+RA+LM+GK+ TG + WG
Sbjct: 102 PHMLALAQDGVRLTNYYGQAFCTPARAALMSGKFGHRTGFANVDVDSAATLEISAWGNFS 161
Query: 157 RGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHI 216
+ FL + L GY +GKW+LG + P RGF+S+ GY I Y H
Sbjct: 162 LANAGDQIFLSQRLATAGYEVHGVGKWNLGHCNAAFLPPSRGFKSYLGYYGAGIGYVSHE 221
Query: 217 LSDQYSRTVELNGH-------DMRR-----NLSTAWDTVGEYATDLFTKEAVQLIEDQPV 264
+ + H DM+R LS A +GEY+T LFT E+++ I+ V
Sbjct: 222 VEQSGNVENSFTRHFRDYSLVDMQRCTPQGCLSDA-SVIGEYSTILFTNESLRRIDGFSV 280
Query: 265 DKPLFLYLAHLAAHAGNA--GKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVI 322
+ P +LY+A+ H L+ P + + RR + + +D VG +
Sbjct: 281 ENPTYLYVAYHGVHDDKQSNASGLDYPAAALEALE----ERRRNFGIALHAVDHGVGLLQ 336
Query: 323 SALQRKGMLENSIIIFMSDN-GAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAI 381
L+++ + +++ SDN G+P Y ++ SN PYRG+K +EGG+K+PA
Sbjct: 337 QRLEKRS--SDYVLVVHSDNGGSPCGTYCDS-------SNAPYRGMKFFDFEGGLKLPAF 387
Query: 382 LWSPQIQQNPRVSLQMMHISDWLPTLYTAAGG---DTSRLPLNIDGLDQWSSLLLNT 435
++SP + +MH DW T G D S + D D WS++ N+
Sbjct: 388 VYSPTRLASRGTYDGLMHHVDWTATFLRGLAGTLLDCS----DCDSRDHWSAISSNS 440
>gi|148978065|ref|ZP_01814612.1| sulfatase, putative [Vibrionales bacterium SWAT-3]
gi|145962749|gb|EDK28023.1| sulfatase, putative [Vibrionales bacterium SWAT-3]
Length = 500
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 121/422 (28%), Positives = 182/422 (43%), Gaps = 52/422 (12%)
Query: 106 TPNIDALAYNGIILNNMYAQPVCTPSRASLMTGKYPIHTGMQGPPIWGAEPRGVPLTERF 165
TPNID++A G++L + YAQP CT R++ +TG+ P+ TGM + G P G+
Sbjct: 54 TPNIDSIAEKGMLLTDYYAQPSCTAGRSAFLTGQLPVRTGMHTVGLPGG-PVGLHADTPT 112
Query: 166 LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTV 225
LPE L+ +GY T GK HLG R E+ P GF+ ++G+L ++ ++ + +
Sbjct: 113 LPEILKTMGYVTGQFGKNHLG-DRDEFLPTMHGFDEYWGWLY-HLNAMEYTEDPDWPKDG 170
Query: 226 ELNGHDMRRNLSTAWDTVG---------------EYATDLFTKEAVQLIEDQ-PVDKPLF 269
L+ R + D G D K A+ IE DKP F
Sbjct: 171 SLDAFAPRNVIYAKTDGKGGQIIEDDGALTIERMRTLDDEVNKHAINFIERAVEADKPFF 230
Query: 270 LYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKG 329
+ H HL E + + ++K L D VG +++ ++ G
Sbjct: 231 TWYCPSRGHVWT---HLSPEYEAM-----LGQNGWGLQEVVMKDLYDHVGEMMAKMEELG 282
Query: 330 MLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQ 389
+ +N+IIIF +DNG + + + G PY G K T WEGGV+ PA++ P
Sbjct: 283 IADNTIIIFTADNGPEIMTWPD-------GGMTPYHGEKGTTWEGGVRAPALVSWPGKIP 335
Query: 390 NPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQW 449
V + DWLPTL AAGG + G D + +++DG +Q
Sbjct: 336 AGSVGNGIFDGMDWLPTLVAAAGGPADLKEKMLKGYDGF-----------KAHLDGYNQV 384
Query: 450 SSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLN 509
L S R + E+ + AVR+ WK Q + G+ G PLL
Sbjct: 385 DMLTEKGESNRKEIYYY--ERDKLQAVRVGDWKAHFVVQNH----GWSGPKEELNAPLL- 437
Query: 510 FN 511
FN
Sbjct: 438 FN 439
>gi|311746014|ref|ZP_07719799.1| sulfatase family protein [Algoriphagus sp. PR1]
gi|126576230|gb|EAZ80508.1| sulfatase family protein [Algoriphagus sp. PR1]
Length = 489
Score = 144 bits (363), Expect = 1e-31, Method: Compositional matrix adjust.
Identities = 114/356 (32%), Positives = 171/356 (48%), Gaps = 54/356 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGM---- 146
GW D+ GS+ TPNI LA GI+ N YA +C+P+RA+L+TGKYP G+
Sbjct: 54 GWADIEPLGSDFYETPNITKLAKEGILFTNSYAAAAICSPTRAALLTGKYPARLGITDWI 113
Query: 147 ---------QGPP----IWGAEPRGVPLTERFLP-------EYLRELGYSTKAIGKWHLG 186
G P ++ +P P + FLP E ++ GY T +GKWHLG
Sbjct: 114 RAKFNQNSTSGLPGEYEVFENKPLKTPKIQGFLPLEEITIAERMKAHGYGTLHVGKWHLG 173
Query: 187 FFRREYTPLYRGFESHFGY--LNGVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVG 244
+ P +GF+ + G L SY+D L + E+ R+ G
Sbjct: 174 --EEGFYPEDQGFDVNIGGNDLGQPPSYFDPYLPAKPREFYEITTLKPRKE--------G 223
Query: 245 EYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNR 304
E+ TD E V I++Q K F++ A A H GK + + +++ N+
Sbjct: 224 EFLTDREGDEVVNYIQNQK-GKKFFVHWAPYAVHTPIMGK-----PDLVEKYEQKEPGNQ 277
Query: 305 R--TYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRN-WGSN 361
R YAA+V+ +D +VG V+S L+R G+ EN+++IF SDNG NY N +N
Sbjct: 278 RNPVYAALVESVDQNVGKVLSELERMGLRENTLVIFTSDNGG------LIGNYDNPITNN 331
Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
YP + K +EGG+++P I+ W +I Q V + DW+PT+ G D +
Sbjct: 332 YPLKSQKGYPYEGGIRIPTIVSWPGKIPQG-FVDETPIITMDWIPTILDFMGEDPT 386
>gi|149196404|ref|ZP_01873459.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
gi|149140665|gb|EDM29063.1| N-acetylgalactosamine 6-sulfate sulfatase (GALNS) [Lentisphaera
araneosa HTCC2155]
Length = 456
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/345 (31%), Positives = 174/345 (50%), Gaps = 38/345 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ L +G I TP +D +A G+ L + YA VC PSR SLMTG++ HT ++G
Sbjct: 31 GYGQLGSYGQKMIKTPRLDQMAKEGLRLTDYYAGTAVCAPSRCSLMTGQHVGHTYIRGNK 90
Query: 151 IW--GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNG 208
+ G EP +P + E ++E GY+T IGKW LG+ E P +GF+ FGY N
Sbjct: 91 EYPTGQEP--IPAETITVAEKMKEAGYATALIGKWGLGYPGSEGEPNKQGFDYFFGY-ND 147
Query: 209 VISYYDH----ILSDQYSRTVELN-GHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQP 263
++H +L ++ + T++ N G ++ EY+ + T EA I+
Sbjct: 148 QKHAHNHFPKFLLRNEETLTLKNNSGKEI------------EYSQYMLTDEAKGFIKKNK 195
Query: 264 VDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITD---PNRRTYAAMVKKLDDSVGT 320
D P FLYLA++ H+ L+ P + QY + ++ +A M+ +LD VG+
Sbjct: 196 -DNPFFLYLAYVIPHS-----RLQIPGDDECYLQYKDESWPEKQKKHAGMISRLDKDVGS 249
Query: 321 VISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPA 380
++ L+ + EN++++F SDNGA + + G P G+K +++EGGV+VP
Sbjct: 250 ILDLLKEMNLAENTLVVFTSDNGAHREGGARPEFFNDSG---PLSGIKRSMYEGGVRVPF 306
Query: 381 ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGL 425
I P + + +VS + D +PT G + P IDG+
Sbjct: 307 IAHWPGVIKPGQVSNHIGAHWDLMPTACELGG---VQPPEGIDGI 348
>gi|149174163|ref|ZP_01852791.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
gi|148847143|gb|EDL61478.1| putative secreted sulfatase ydeN precursor [Planctomyces maris DSM
8797]
Length = 470
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 121/379 (31%), Positives = 177/379 (46%), Gaps = 51/379 (13%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-A 124
IT P T+AA K + +V GW DL +GS+ +PNID LA G+ Y A
Sbjct: 22 ITQP---THAADEKPWNVVFFLVDDLGWTDLGCYGSDFYQSPNIDQLAAEGMKFTQNYSA 78
Query: 125 QPVCTPSRASLMTGKYPIHTGMQG-PPIWGAEPRGVPLT---------ERF--LPEYLRE 172
C+P+R +L+TG YP T + P W PL +++ LPE LR
Sbjct: 79 CNACSPTRGALLTGMYPARTHLTDWIPGWAKSYTDFPLKPPEWKKHLDQKYTTLPEALRT 138
Query: 173 LGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDM 232
GY T +GKWHLG R P GF+ + N + H G M
Sbjct: 139 AGYQTFHVGKWHLG--GRGNLPQDHGFDVNISGTNRGLPRSYHF---------PYGGDAM 187
Query: 233 RRNLS-TAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQE 291
+ + S T + Y TD EAV LI Q DKP FLY + + H+ G+ +
Sbjct: 188 KWDSSLTEAERQDRYLTDRMADEAVALIRQQQ-DKPFFLYCSFYSVHSPIQGR-----PD 241
Query: 292 TINQFQYITDPNRRT---YAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVE 348
+ +++ + R YAAM++ +D+++G V + L+ G+ + ++I+F SDNG
Sbjct: 242 LVKKYKGLPAGKRHKNPEYAAMIQSVDEAIGRVRAQLKESGIADRTLIVFTSDNGGVR-- 299
Query: 349 YRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLY 408
R+TSN N P RG K WEGG +VPAI+ P + V + + D+ PT+
Sbjct: 300 -RKTSN------NDPLRGEKGQHWEGGTRVPAIVLWPGVTPAGSVCAEPIITMDFYPTIL 352
Query: 409 --TAAGGDTSRLPLNIDGL 425
T G+T ++DGL
Sbjct: 353 NITGVAGNTEH-NQSVDGL 370
>gi|265752714|ref|ZP_06088283.1| sulfatase [Bacteroides sp. 3_1_33FAA]
gi|263235900|gb|EEZ21395.1| sulfatase [Bacteroides sp. 3_1_33FAA]
Length = 461
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 141/467 (30%), Positives = 206/467 (44%), Gaps = 71/467 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA-SLMTGKYPIHTGMQ--G 148
GW DL GS+ TPNID LA GI+ +N YA + +MTG Y G+
Sbjct: 31 GWTDLGVMGSDYYETPNIDRLATEGILFDNAYAAAANSAPSRACMMTGMYTPRHGVYTVS 90
Query: 149 PPIWGAEPR----GVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
PP G + +P E + E L+E GY IGKWHLG PL +G
Sbjct: 91 PPDRGDRTKRKYIAIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQG 150
Query: 199 FESHFGYLNGVISY---YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F + G Y Y + L D+ V L +GEY TD T+EA
Sbjct: 151 FIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLEE-----------GILGEYLTDRLTEEA 199
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMV 311
V I+ + P FL+L+H A H L+AP IN+++ T P + YAAM+
Sbjct: 200 VSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYRNKT-PGKYHKNPIYAAMI 252
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKN 369
+KLDDSVG + ++ G+ + +I+IF SDNG P + NYP G K
Sbjct: 253 EKLDDSVGRICQVIKTLGIADRTIVIFYSDNGGSEPVTD------------NYPLNGGKG 300
Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
+EGG +VP I+ W+ +I+ R S+ + + D+ PT T A G ++P N+DG D +
Sbjct: 301 MPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFVTLAQG---KIPANLDGKDIF 356
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLG 486
+L+ N + R+ L + P+ S L + K +++R WKL+
Sbjct: 357 -TLINNNETERD-----------LFWHFPAYLESYLNGGRDFRAKPYSSIRSGDWKLIYH 404
Query: 487 TQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
++ L N + + YQ L + Q PI
Sbjct: 405 YEDKSMELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPI 451
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 6/55 (10%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
+GEY TD T+EAV I+ + P FL+L+H A H L+AP IN+++
Sbjct: 187 LGEYLTDRLTEEAVSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYR 235
>gi|284040883|ref|YP_003390813.1| sulfatase [Spirosoma linguale DSM 74]
gi|283820176|gb|ADB42014.1| sulfatase [Spirosoma linguale DSM 74]
Length = 453
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 108/335 (32%), Positives = 167/335 (49%), Gaps = 46/335 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DLS +G+ +I TP+ID+L G+ ++ YA VC+PSRA+L++G+YP G+ G
Sbjct: 40 GYGDLSSYGAPDIRTPHIDSLVRAGMRFSHFYANSSVCSPSRAALLSGRYPEQVGVPGVI 99
Query: 149 ----PPIWG-AEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
WG P V LP L++ GY T +GKWHLG P RGF+
Sbjct: 100 RTMPDDNWGYLSPSAV-----LLPSILKKNGYYTALVGKWHLGL-EPPNLPNDRGFDLFH 153
Query: 204 GYLNGVIS-YYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQ 262
G+ ++ YY H+ D+ + MR N T +ATDLFT+ A +E +
Sbjct: 154 GFEGDMMDDYYTHLRHDR---------NYMRLNRQTI--NPQGHATDLFTQWATDYLEQR 202
Query: 263 P-VDKPLFLYLAHLAAHAGNAGKHLEAPQE---TINQFQYITDPNRRTYAAMVKKLDDSV 318
P FLYLA+ A H ++ P + + Q R +++ +DD +
Sbjct: 203 AGQSNPFFLYLAYNAPH-----DPIQPPADWLAKVKARQPGISEKRAKLVGLIEHMDDGI 257
Query: 319 GTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKV 378
G VI L+ KG+ EN++I+F+SDNG + +N P R K ++EGG+++
Sbjct: 258 GKVIQTLRAKGLYENTLIVFVSDNGGKLFD---------GATNGPLRSGKGHMYEGGIRI 308
Query: 379 PA-ILWSPQIQQNPRVSLQMMHISDWLPTLYTAAG 412
PA ++W ++ + S Q + + D PTL A G
Sbjct: 309 PACVVWPGKVAAQSQ-SQQPLLLMDIFPTLAEATG 342
>gi|149198283|ref|ZP_01875329.1| N-acetylgalactosamine-6-sulfate sulfatase [Lentisphaera araneosa
HTCC2155]
gi|149138579|gb|EDM26986.1| N-acetylgalactosamine-6-sulfate sulfatase [Lentisphaera araneosa
HTCC2155]
Length = 443
Score = 144 bits (363), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 120/429 (27%), Positives = 195/429 (45%), Gaps = 68/429 (15%)
Query: 75 TYAALTKSTTLTLLIV-YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ-PVCTPSR 132
T A K + ++I +G+ D +G+ +I TP I+ LA +G+ N YA PVC+P+R
Sbjct: 13 TLVAQDKPNIVFIIIDDFGYADSEPYGAKDIKTPGINELAKDGLKFTNFYANAPVCSPTR 72
Query: 133 ASLMTGKYPIHTGMQGPPIWGA--------------EPRGVPL--TERFLPEYLRELGYS 176
+ +TG++ +G + +G + G+ L + LP+ L++ GY
Sbjct: 73 CAFITGRWQQRSGFEWALGYGGTNSQLKNGQYEAVTDIHGIGLLPEKNHLPKLLKKAGYK 132
Query: 177 TKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGHDMRRNL 236
T A GKWHLG + ++ P++ GF+ ++G L G H Y + +R
Sbjct: 133 TGAFGKWHLGS-QDKFNPIHHGFDEYYGPLLG------HCDYYTYKYYDDTY--TLREGA 183
Query: 237 STAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQF 296
D+ Y T + AV I D+ DKP F+Y+ H+A H+ + Q T
Sbjct: 184 KVIKDS--GYLTTNINERAVDFI-DRHADKPFFMYVPHMAVHSPYQSADKKPKQITKTN- 239
Query: 297 QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYR 356
+ D NR YAAMV+++D V +I+ L+ K + ++ + SDNG
Sbjct: 240 --LNDGNRADYAAMVEEVDKGVEMIIAKLKEKKIFHKTLFVVSSDNGGA----------- 286
Query: 357 NWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTS 416
++ N P K TL+EGG++VP I+ P+ VS Q+ D T AG D
Sbjct: 287 HFSDNAPLFHRKTTLFEGGIRVPCIMHWPEKIGKGVVSDQIAITMDLSKTFLALAGIDEP 346
Query: 417 RLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAV 476
+ DG++ LL + +N+ ++ W S N+ +RR AV
Sbjct: 347 ----SYDGIN-----LLPMMTDKNNKVERTLFWRS---NSKARRQK------------AV 382
Query: 477 RLDSWKLVL 485
R+ WK +L
Sbjct: 383 RMGKWKYIL 391
>gi|189230138|ref|NP_001121393.1| galactosamine (N-acetyl)-6-sulfate sulfatase [Xenopus (Silurana)
tropicalis]
gi|158253505|gb|AAI54124.1| Galns protein [Danio rerio]
gi|183986312|gb|AAI66093.1| LOC100158481 protein [Xenopus (Silurana) tropicalis]
Length = 488
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 115/367 (31%), Positives = 164/367 (44%), Gaps = 57/367 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
GW DL G TP +D +A G++ N Y A P+C+PSRA+L+TG+ P+ G
Sbjct: 7 GWGDLGVFGEPSKETPYLDLMAAQGMLFPNFYTANPLCSPSRAALLTGRLPVRNGFY--- 63
Query: 151 IWGAEPR----------GVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFE 200
A R G+ E LPE+L+ Y +K +GKWHLG R +Y PL GF+
Sbjct: 64 TTNAHARNAYTPQEIVGGISADEILLPEHLKNKHYISKIVGKWHLGH-RTQYLPLKHGFD 122
Query: 201 -------SHFGYLN-----GVISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYAT 248
HFG N + Y + + +Y E+N NL T
Sbjct: 123 EWFGAPNCHFGPYNDSSRPNIPVYNNSEMKGRYYEEFEINVKTGESNL-----------T 171
Query: 249 DLFTKEAVQLIEDQPV-DKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTY 307
L+ KE + I Q + +P FLY A A HA ++ R Y
Sbjct: 172 QLYLKEGLDFISQQAMAQRPFFLYWAPDATHA-----------PVYASKPFLGKSQRGRY 220
Query: 308 AAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGV 367
V +LDDS+G +++ L G+ ++++ F SDNGA + S GSN P+
Sbjct: 221 GDAVMELDDSIGQILAHLVSLGIQNDTLVFFTSDNGAALM-----SGPLQSGSNGPFLCG 275
Query: 368 KNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQ 427
K T +EGG++ PAI W P VS Q+ + D T + AG + IDG+D
Sbjct: 276 KETTFEGGMREPAIAWWPGQIPAGTVSHQLASVMDLFSTSLSVAGVSPPDDRV-IDGVD- 333
Query: 428 WSSLLLN 434
S +L N
Sbjct: 334 LSPVLFN 340
>gi|443724469|gb|ELU12468.1| hypothetical protein CAPTEDRAFT_223869 [Capitella teleta]
Length = 441
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 122/435 (28%), Positives = 194/435 (44%), Gaps = 64/435 (14%)
Query: 159 VPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDH-IL 217
+ L F+ +YL+EL Y+T A GKWHLG+ +E TP YRGF++ G +G YY+H
Sbjct: 1 MDLKYNFISDYLKELKYNTHASGKWHLGYCNKECTPTYRGFDTFSGGYSGEGEYYEHTTF 60
Query: 218 SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAA 277
Y NG ++ R+ S G ++ D+ + + + D+ PLF Y A
Sbjct: 61 QGMYDWH---NGTELDRSAS------GIHSQDMIEADLLNQL-DKNDGTPLFYYAAFHNT 110
Query: 278 HAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIII 337
H+ K P+ P RR Y +V +D +G V++ L+ K M +N+ I
Sbjct: 111 HSPMHPK----PEHEALYDDLDAPPTRRKYLGLVSGMDAVIGKVVAKLKEKDMFDNTYIF 166
Query: 338 FMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQM 397
F SDNG E G N P RG K++L++GG + SP +++ + +
Sbjct: 167 FSSDNGGDVHE----------GDNSPKRGAKSSLFDGGCHAHSWASSPLLKKKGVEADGL 216
Query: 398 MHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTP 457
+HI+DWLPT+ AGG+ +P IDG++Q + PS+R I +D L TP
Sbjct: 217 LHITDWLPTIVKLAGGE---VPDGIDGIEQVDLITNAGPSKRTDMIYNIDM--ELETGTP 271
Query: 458 SRRNSVLINIDEKKRTAAVRLDSWKLVLGTQENGTMDGYYGQTRSNKVPLLNFNAIVESK 517
T A R +KL+ G + G DGY ++ I E
Sbjct: 272 DF------------GTMAARDLRYKLIWGFE--GRSDGYG----------MDAQFIYE-- 305
Query: 518 TYQSLQQLSQNIFLPISNIDKM-----RSTRQQATI--HCGANPAPMTPSPCTNGPCYLF 570
+ Q+ + + ++DK +T+Q + + + GP LF
Sbjct: 306 -VDRVIQVVKEADAAVEDVDKRGGPYPMNTQQHHILRTYLQTKLVTLDDIEAGTGPMMLF 364
Query: 571 NLGNDPCEQNNIASS 585
+L DP E ++++S
Sbjct: 365 DLQEDPNETTDLSAS 379
>gi|410631133|ref|ZP_11341814.1| arylsulfatase B [Glaciecola arctica BSs20135]
gi|410149335|dbj|GAC18681.1| arylsulfatase B [Glaciecola arctica BSs20135]
Length = 454
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 105/322 (32%), Positives = 153/322 (47%), Gaps = 44/322 (13%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQP-VCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +GS +I TP ID LA G YA +C+PSRA LMTG+YPI T +
Sbjct: 48 GFGDLSINGSGKIKTPAIDQLARTGFNFTQFYASANICSPSRAGLMTGRYPIRTSLGYNV 107
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFG--YLNG 208
I + G+P TE L E Y T IGKWHLG F E+ P+ GFE+ +G + N
Sbjct: 108 ITAGDTHGLPKTEDTLAELALRADYRTMLIGKWHLGRF-PEHAPMKHGFETFYGVPHSND 166
Query: 209 VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPL 268
+ ++ + + + VE + +R +T+++V I D P
Sbjct: 167 MANFALYSGDNIVEQPVEQSTLTLR-----------------YTEKSVSFI-DANAKHPF 208
Query: 269 FLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQRK 328
L+++H H P F +D Y +V++LD S G +++AL+R
Sbjct: 209 LLFVSHTMPH---------IPLYASEDFAGKSDAG--VYGDVVEELDYSTGKIVAALKRN 257
Query: 329 GMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQ 388
G+ +NSIII SDNG P E G+ +G K + +EG +VP I+ P
Sbjct: 258 GIFDNSIIIITSDNG-PFFE----------GATAGLKGGKGSTYEGAYRVPLIISWPAGI 306
Query: 389 QNPRVSLQMMHISDWLPTLYTA 410
NP V+ + D LPT+ A
Sbjct: 307 ANPAVTDAIAMNIDILPTIAEA 328
>gi|410096843|ref|ZP_11291828.1| hypothetical protein HMPREF1076_01006 [Parabacteroides goldsteinii
CL02T12C30]
gi|409225460|gb|EKN18379.1| hypothetical protein HMPREF1076_01006 [Parabacteroides goldsteinii
CL02T12C30]
Length = 488
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 125/425 (29%), Positives = 191/425 (44%), Gaps = 72/425 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ DLS +G + TPNID LA NG+ Y+ V PSR+ L+TG + HT ++G
Sbjct: 48 GYGDLSCYGQEKFETPNIDQLAQNGMRFTQCYSGTTVSAPSRSCLLTGTHSGHTPVRGNK 107
Query: 151 IWGAEPRG-VPLTE--RFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLN 207
+P G PL E R + +++ GY T A GKW LG+ P +G E+ +GY
Sbjct: 108 --ELDPEGQFPLPEGTRTIFHVMQDAGYKTSAFGKWGLGYIGTSGDPKNQGCETFYGYNC 165
Query: 208 GVI--SYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVD 265
++ SYY L D R +++ N+ G Y+ DL +A+ +++ D
Sbjct: 166 QLLAHSYYPDHLWDNDQRV------ELKDNVLEVQYGKGTYSQDLIHSKALDYLDNLKED 219
Query: 266 KPLFLYLAHLAAHAGNAGKHLEAPQETI-NQFQYI--------TDPN------------- 303
+P F++ + HA L P+++I +F+ + T+P
Sbjct: 220 EPFFMWYPTILPHA-----ELIVPEDSIIKKFRGMYPEKAYKGTEPGSPAFRKGGYCTQL 274
Query: 304 --RRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
T+AAM+ +LD VG ++ L+ KG+ +N+IIIF SDNG P +E ++ N SN
Sbjct: 275 HPHATFAAMIYRLDVYVGQIVQKLKDKGLYDNTIIIFASDNG-PHMEGGADPDFFN--SN 331
Query: 362 YPYRGVKNTLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPL 420
YRG K L+EGG++VP I+ W IQ + M D LPT
Sbjct: 332 GMYRGYKRDLYEGGIRVPMIISWPGHIQPGTETNF-MCSFWDVLPTFEE----------- 379
Query: 421 NIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVL-INIDEKKRTAAVRLD 479
+ +P + +DG+ L + + L E AVR
Sbjct: 380 ------------IISPKAKQKEMDGVSMLPLLQNRKGQKEHEFLYFEFQEMNGRQAVRQG 427
Query: 480 SWKLV 484
WKLV
Sbjct: 428 PWKLV 432
>gi|449138001|ref|ZP_21773306.1| arylsulfatase A [Rhodopirellula europaea 6C]
gi|448883380|gb|EMB13908.1| arylsulfatase A [Rhodopirellula europaea 6C]
Length = 470
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 111/364 (30%), Positives = 176/364 (48%), Gaps = 42/364 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+NDL +GS I TPN+D LA G + Y A VC+PSRA+L+TG YP G+
Sbjct: 39 GYNDLGCYGSPNIKTPNLDRLASEGRRYTSFYSACSVCSPSRAALLTGCYPKRVGLHQHV 98
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
++ G+ E + ++L+ GY+T +GKWHLG +E P GF+S++G I
Sbjct: 99 LFPQSNYGLHPEEVTIADHLKSAGYATACVGKWHLG-HHKETLPTSNGFDSYYG-----I 152
Query: 211 SYYDHILSDQYSRTVELNGHDMRRNLSTA---WDT-------VGEYATD------LFTKE 254
Y + + R +++ D + S+A W+T + E D +T
Sbjct: 153 PYSNDMNHPDNKRLGKMSSDDRWTDQSSAVTLWNTPLVQDEEIIELPVDQRTITRRYTDR 212
Query: 255 AVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKL 314
A++ +E DKP FLYL H H L P++ Y DP + Y +++ +
Sbjct: 213 AIEFVEAN-QDKPFFLYLPHSMPHIP-----LYVPEDV-----YDPDP-QNAYKCVIEHI 260
Query: 315 DDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEG 374
D VG ++ ++ G+ + ++I++ SDNG P ++++ + GS P R K T +EG
Sbjct: 261 DTEVGRLVQTVRDLGLSKKTLIVYTSDNG-PWLQFKN-----HGGSAGPLRAGKGTTFEG 314
Query: 375 GVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSLLLN 434
G +VP I+W+P S D LPT+ + G IDG+D S+ +
Sbjct: 315 GQRVPCIMWAPGRIPAGTSSNAFATNMDLLPTIASVTGVALEN-DRKIDGVDLTSTFTSD 373
Query: 435 TPSR 438
SR
Sbjct: 374 ESSR 377
>gi|154495003|ref|ZP_02034008.1| hypothetical protein PARMER_04049 [Parabacteroides merdae ATCC
43184]
gi|154085553|gb|EDN84598.1| arylsulfatase [Parabacteroides merdae ATCC 43184]
Length = 464
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 36/308 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F G+ +I TPNID LA G I + + A V +PSR+ ++TG+Y G +
Sbjct: 45 GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 103
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+P E LP L+ Y T IGKWHLG P +GF++ +G L G
Sbjct: 104 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 161
Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
SY YD SD+ + + NG R LS Y TD +A Q + + ++P
Sbjct: 162 SYFYDPETSDKDGNLQQYQYNG----RKLS-----FDGYFTDELASKAQQFVTES--EQP 210
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY++ A H+ N EA +E + +F+ R+ YAAM+ LD VG ++ L+
Sbjct: 211 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 262
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +N+II F+SDNG T N SN P +G K +EGG +VP ++W +
Sbjct: 263 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 313
Query: 387 IQQNPRVS 394
+++ R +
Sbjct: 314 FKRDQRFT 321
>gi|392391665|ref|YP_006428268.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
gi|390522743|gb|AFL98474.1| arylsulfatase A family protein [Ornithobacterium rhinotracheale DSM
15997]
Length = 510
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 119/442 (26%), Positives = 200/442 (45%), Gaps = 72/442 (16%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTGMQG-- 148
G+ DL+ +G + TPNID LA G++ YA PVC PSRA++MTG++ HT ++G
Sbjct: 38 GYGDLACYGQKHVETPNIDKLAKQGMMFTQAYAGAPVCAPSRATIMTGQHTGHTHVRGNK 97
Query: 149 ------PPIW----------GAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREY 192
P I G EP + LPE +++ GY+T GKW G+
Sbjct: 98 EYWQKSPKIMYGQNEEYSLVGQEPYDA--NQVILPEVMKKNGYTTGMFGKWAGGYEGSHS 155
Query: 193 TPLYRGFESHFGYLNG--VISYYDHILSDQYSRTVELNGHDMRRNLSTAWDTVGE----- 245
P RG + +GY+ SYY + L+ + + + + + + GE
Sbjct: 156 VPEKRGVDEFYGYICQFQAHSYYPNFLNRYSQKEGDTQTIRVVMDENVKYPMTGEDYYKR 215
Query: 246 --YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETI-----NQF-- 296
Y+ D+ +A++ +++Q KP + + + HA L P +++ QF
Sbjct: 216 PQYSADMIHHKALEWLDEQNGKKPFYGFFTYTLPHA-----ELMQPNDSLLKKYKKQFFE 270
Query: 297 -----------QYITDPNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAP 345
+ T +AAM+ +LD VG ++ L+ KG+ +N+I++F SDNG P
Sbjct: 271 DKTWGGSDGSRYHATTHTHAQFAAMITRLDSYVGEIMEKLKEKGLDKNTIVVFSSDNG-P 329
Query: 346 TVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLP 405
E + +G + RG+K +EGG++VP I+ P + VS + D +P
Sbjct: 330 HEEGGADPEF--FGRDGKLRGIKRQCYEGGIRVPFIVKWPGKIKPGTVSDHQIAFYDIMP 387
Query: 406 TLYTAAGGDTSRLPLNIDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRR--NSV 463
T +I G ++ N S +NS+IDG+ +LL + ++ + +
Sbjct: 388 TF------------ADIAGDKKFPKRYFNKKS-QNSSIDGISFLPTLLGDDQKQKAHDHL 434
Query: 464 LINIDEKKRTAAVRLDSWKLVL 485
E + AVR WKL++
Sbjct: 435 YWEFHETNQI-AVRKGDWKLIV 455
>gi|423725143|ref|ZP_17699283.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
gi|409234771|gb|EKN27595.1| hypothetical protein HMPREF1078_03177 [Parabacteroides merdae
CL09T00C40]
Length = 461
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 103/308 (33%), Positives = 153/308 (49%), Gaps = 36/308 (11%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMY-AQPVCTPSRASLMTGKYPIHTGMQGPP 150
G+ D F G+ +I TPNID LA G I + + A V +PSR+ ++TG+Y G +
Sbjct: 42 GYADFGFMGATDIQTPNIDRLAAEGCIFTDAHVAATVSSPSRSMMLTGRYGQRYGYECN- 100
Query: 151 IWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVI 210
G+P E LP L+ Y T IGKWHLG P +GF++ +G L G
Sbjct: 101 -LDKPGDGLPDDEELLPALLKRYDYRTGCIGKWHLGS-EPSQRPNAKGFDTFYGLLAGHR 158
Query: 211 SY-YDHILSDQYS--RTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKP 267
SY YD SD+ + + NG R LS Y TD +A Q + + ++P
Sbjct: 159 SYFYDPETSDKDGNLQQYQYNG----RKLS-----FDGYFTDELASKAQQFVTES--EQP 207
Query: 268 LFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRRTYAAMVKKLDDSVGTVISALQR 327
LY++ A H+ N EA +E + +F+ R+ YAAM+ LD VG ++ L+
Sbjct: 208 FMLYMSFTAPHSPN-----EATEEDLARFE---GQPRQKYAAMMYALDRGVGKIVDELKA 259
Query: 328 KGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTLWEGGVKVP-AILWSPQ 386
G +N+II F+SDNG T N SN P +G K +EGG +VP ++W +
Sbjct: 260 AGKFDNTIIFFLSDNGGSTT---------NQSSNLPLKGFKGNKFEGGQRVPFFVVWGDR 310
Query: 387 IQQNPRVS 394
+++ R +
Sbjct: 311 FKRDQRFT 318
>gi|408672294|ref|YP_006872042.1| sulfatase [Emticicia oligotrophica DSM 17448]
gi|387853918|gb|AFK02015.1| sulfatase [Emticicia oligotrophica DSM 17448]
Length = 478
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 135/445 (30%), Positives = 203/445 (45%), Gaps = 67/445 (15%)
Query: 75 TYAALTKSTTLTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNN-MYAQPVCTPSRA 133
T L+ + L + G+ DLS +G+ + TPNID LA G+ N + AQ VC+ SRA
Sbjct: 18 TKKELSPNIILIFMDDMGYGDLSCYGAIDYRTPNIDKLASEGVRFTNFLAAQAVCSASRA 77
Query: 134 SLMTGKYPIHTGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYT 193
+L+TG YP G+ G + G+ E L E L++ GY+T GKWHLG + +
Sbjct: 78 ALLTGCYPNRIGISGA-LMPTSKNGINPEETTLAEMLKQKGYATGIFGKWHLGH-HKTFL 135
Query: 194 PLYRGFESHFG--YLNGV-ISYYD--------HILSDQYSRTVELNGHDMRRNLSTAWDT 242
PL GF+ + G Y N + YYD H+ Y + +D + + T D
Sbjct: 136 PLQHGFDEYVGLPYSNDMWPVYYDGKPATSTQHM--SAYPVLPLIQNNDKKEEIKTLDDQ 193
Query: 243 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDP 302
T L+T++AV+ I+ KP FLYL H H P ++F+
Sbjct: 194 AK--LTTLYTEKAVEFIKKNN-KKPFFLYLPHSMPHV---------PIAVSDKFK--GKS 239
Query: 303 NRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNY 362
+ YA ++ ++D SVG ++ L+ G+ +N+++IF SDNG P + + + GS+
Sbjct: 240 KQGLYADLMMEIDWSVGEIMKTLKENGLDKNTLVIFTSDNG-PWLNFGNHA-----GSSG 293
Query: 363 PYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN- 421
+R K T +EGG +VP I+ V Q+ D LPT+ G + LP N
Sbjct: 294 GFREGKGTSFEGGQRVPCIMRWKGTTPEGVVCNQLASTIDVLPTIARITGAE---LPKNK 350
Query: 422 IDGLDQWSSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSW 481
IDG++ S + + L Q RRNS+ AVR+ W
Sbjct: 351 IDGVNILSLIKGD-----------LSQSPRKTFYYYYRRNSL----------EAVRMGHW 389
Query: 482 KLVLGTQENGTM------DGYYGQT 500
KLV + M DGY G+
Sbjct: 390 KLVFEHKGRSYMNQLPGNDGYAGKA 414
>gi|223936271|ref|ZP_03628184.1| sulfatase [bacterium Ellin514]
gi|223895133|gb|EEF61581.1| sulfatase [bacterium Ellin514]
Length = 488
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 132/427 (30%), Positives = 196/427 (45%), Gaps = 63/427 (14%)
Query: 85 LTLLIVYGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIH 143
L L G+ DL +G +I TPNID LA +G+ + YA VC PSRA+LMTGK H
Sbjct: 47 LILADDLGYGDLGCYGQTQIKTPNIDKLAEDGMKFTSFYAGSTVCAPSRATLMTGKNTGH 106
Query: 144 TGMQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHF 203
++G G LT + + L+ GY+T IGKW LG P +GF+ +
Sbjct: 107 VNIRGNADLSLN--GEELT---IAKILKLAGYATGCIGKWGLGNEGSPGLPGRQGFDEYL 161
Query: 204 GYLNGVIS---YYDHIL---SDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEAVQ 257
GYL+ V + Y H+ S + L +D D G Y+ D FT+ A+
Sbjct: 162 GYLDQVQAHDYYPTHLFRSDSKGEESKIALTENDA--------DHKGLYSNDFFTQSALN 213
Query: 258 LIEDQPVDKP--------LFLYLAHLAAHAGN-----AGKHLEAPQE---TINQFQYITD 301
+ ++KP FLYL + HA N G +E P T Q+ +
Sbjct: 214 YLR---INKPSKLNKHRSFFLYLPYTLPHANNELGNRTGNGMEVPSTEPYTNEQWPQV-- 268
Query: 302 PNRRTYAAMVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSN 361
+ AAM+ +LD VG ++ L++ + EN+++IF SDNG P E Y N S
Sbjct: 269 --EKNKAAMITRLDHYVGEIMDYLKKSKLDENTVVIFASDNG-PHKEGGVNPKYFN--SA 323
Query: 362 YPYRGVKNTLWEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLN 421
RG+K L+EGG++VP I+ P + +S + D+LP TAA + P N
Sbjct: 324 GGLRGIKRDLYEGGIRVPFIVRWPARVKAGSISDAPLAFWDFLP---TAAEIARTSSPTN 380
Query: 422 IDGLDQWSSLLLNTPSRRNSNI------DGLDQ------WSSLL--LNTPSRRNSVLINI 467
IDG+ +LL + R+ + G DQ W ++ +N P ++ ++
Sbjct: 381 IDGISFLPTLLGKAQTNRHQYLYWEFHEQGFDQAVRMGDWKAVRHGINGPIELYNLKTDV 440
Query: 468 DEKKRTA 474
EK A
Sbjct: 441 SEKDNVA 447
>gi|317479852|ref|ZP_07938971.1| sulfatase [Bacteroides sp. 4_1_36]
gi|316903981|gb|EFV25816.1| sulfatase [Bacteroides sp. 4_1_36]
Length = 541
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 126/397 (31%), Positives = 175/397 (44%), Gaps = 88/397 (22%)
Query: 68 ITDPNRRTYAALTKSTTLTLLIV--YGWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQ 125
+ D RT + + L++ G++D+ +G EIPTPNID LA G+ Y
Sbjct: 22 MADETIRTSKETDERPNIVLIMADDMGYSDVGCYGG-EIPTPNIDRLAQKGVRYTQFYNS 80
Query: 126 PVCTPSRASLMTGKYPIHTGM----QGPPIWGAEP----RGVPLTERFL-------PEYL 170
P+RASL+TG YP G+ + P I E RGV FL E L
Sbjct: 81 GRSCPTRASLLTGLYPQQAGIGAMSEDPGIKKGEKHPENRGVHGYMGFLNRNCVTIAEVL 140
Query: 171 RELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGYLNGVISYYDHILSDQYSRTVELNGH 230
+E GY T GKWH+G +E PL RGFE +G L G SY L Q R + L+
Sbjct: 141 KEAGYHTYMTGKWHVGMHGKEKWPLQRGFEHFYGILAGASSY----LKPQGGRGLTLDNT 196
Query: 231 DMRRNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH------------ 278
++ S Y TD FT A++ I++Q D P FLYLA+ A H
Sbjct: 197 NLPAPQSPY------YTTDAFTDYAIRFIDEQTDDNPFFLYLAYNAPHWPLHAKDKDLEK 250
Query: 279 -AGNAGKHLEAPQE----------TIN-----------QFQYITDPNRRT-------YAA 309
AG K EA +E +N + +T+ R + YAA
Sbjct: 251 FAGKYDKGWEAVREERYRKMVELGIVNPDWKLAEWESRAWNELTEAERDSSSLRMSVYAA 310
Query: 310 MVKKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRE-----TSNYRN------- 357
V +D ++G +I L+ +G L+N++I+F+SDNGA Y E TS
Sbjct: 311 QVYCMDYNIGKLIDYLEEEGKLDNTLILFLSDNGACAEPYSEKGFGSTSEINKLDSWVHP 370
Query: 358 -----WG--SNYPYRGVKNTLWEGGVKVPAILWSPQI 387
W SN PYR K +EGG+ P I+ P++
Sbjct: 371 SYGLPWAQVSNTPYRKYKVRAYEGGIATPLIISWPEV 407
Score = 39.3 bits (90), Expect = 5.9, Method: Compositional matrix adjust.
Identities = 17/33 (51%), Positives = 22/33 (66%)
Query: 15 YATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAH 47
Y TD FT A++ I++Q D P FLYLA+ A H
Sbjct: 206 YTTDAFTDYAIRFIDEQTDDNPFFLYLAYNAPH 238
>gi|150003728|ref|YP_001298472.1| sulfatase [Bacteroides vulgatus ATCC 8482]
gi|212692493|ref|ZP_03300621.1| hypothetical protein BACDOR_01989 [Bacteroides dorei DSM 17855]
gi|237709187|ref|ZP_04539668.1| sulfatase [Bacteroides sp. 9_1_42FAA]
gi|345514223|ref|ZP_08793736.1| sulfatase [Bacteroides dorei 5_1_36/D4]
gi|423230607|ref|ZP_17217011.1| hypothetical protein HMPREF1063_02831 [Bacteroides dorei
CL02T00C15]
gi|423240766|ref|ZP_17221880.1| hypothetical protein HMPREF1065_02503 [Bacteroides dorei
CL03T12C01]
gi|423244316|ref|ZP_17225391.1| hypothetical protein HMPREF1064_01597 [Bacteroides dorei
CL02T12C06]
gi|149932152|gb|ABR38850.1| sulfatase [Bacteroides vulgatus ATCC 8482]
gi|212664968|gb|EEB25540.1| arylsulfatase [Bacteroides dorei DSM 17855]
gi|229456883|gb|EEO62604.1| sulfatase [Bacteroides sp. 9_1_42FAA]
gi|345456018|gb|EEO47281.2| sulfatase [Bacteroides dorei 5_1_36/D4]
gi|392630751|gb|EIY24737.1| hypothetical protein HMPREF1063_02831 [Bacteroides dorei
CL02T00C15]
gi|392642497|gb|EIY36263.1| hypothetical protein HMPREF1064_01597 [Bacteroides dorei
CL02T12C06]
gi|392643728|gb|EIY37477.1| hypothetical protein HMPREF1065_02503 [Bacteroides dorei
CL03T12C01]
Length = 473
Score = 144 bits (362), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 141/467 (30%), Positives = 206/467 (44%), Gaps = 71/467 (15%)
Query: 92 GWNDLSFHGSNEIPTPNIDALAYNGIILNNMYAQPVCTPSRA-SLMTGKYPIHTGMQ--G 148
GW DL GS+ TPNID LA GI+ +N YA + +MTG Y G+
Sbjct: 43 GWTDLGVMGSDYYETPNIDRLATEGILFDNAYAAAANSAPSRACMMTGMYTPRHGVYTVS 102
Query: 149 PPIWGAEPR----GVPLTERF------LPEYLRELGYSTKAIGKWHLGFFRREYTPLYRG 198
PP G + +P E + E L+E GY IGKWHLG PL +G
Sbjct: 103 PPDRGDRTKRKYIAIPNVEDVCADFVTMAEALQEQGYQCGHIGKWHLGDDEDGTGPLSQG 162
Query: 199 FESHFGYLNGVISY---YDHILSDQYSRTVELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
F + G Y Y + L D+ V L +GEY TD T+EA
Sbjct: 163 FIWNVGGNRAGAPYSYFYPYCLPDKSKCHVGLEE-----------GILGEYLTDRLTEEA 211
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPNRR----TYAAMV 311
V I+ + P FL+L+H A H L+AP IN+++ T P + YAAM+
Sbjct: 212 VSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYRNKT-PGKYHKNPIYAAMI 264
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGA--PTVEYRETSNYRNWGSNYPYRGVKN 369
+KLDDSVG + ++ G+ + +I+IF SDNG P + NYP G K
Sbjct: 265 EKLDDSVGRICQVIKTLGIADRTIVIFYSDNGGSEPVTD------------NYPLNGGKG 312
Query: 370 TLWEGGVKVPAIL-WSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQW 428
+EGG +VP I+ W+ +I+ R S+ + + D+ PT T A G ++P N+DG D +
Sbjct: 313 MPYEGGSRVPLIIRWTGKIEGGIRSSVPITGV-DFYPTFVTLAQG---KIPANLDGKDIF 368
Query: 429 SSLLLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDE--KKRTAAVRLDSWKLVLG 486
+L+ N + R+ L + P+ S L + K +++R WKL+
Sbjct: 369 -TLINNNETERD-----------LFWHFPAYLESYLNGGRDFRAKPYSSIRSGDWKLIYH 416
Query: 487 TQENGTMDGYYGQTRSNKVPLLNFNAIVESKTYQSLQQLSQNIFLPI 533
++ L N + + YQ L + Q PI
Sbjct: 417 YEDKSMELFNLKNDLGESQDLSGSNPVKRGELYQKLMKWIQETHAPI 463
Score = 40.8 bits (94), Expect = 2.1, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 32/55 (58%), Gaps = 6/55 (10%)
Query: 12 VGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQ 66
+GEY TD T+EAV I+ + P FL+L+H A H L+AP IN+++
Sbjct: 199 LGEYLTDRLTEEAVSFIKSHS-EGPFFLHLSHHAVHT-----VLQAPDSLINKYR 247
>gi|392965484|ref|ZP_10330903.1| sulfatase [Fibrisoma limi BUZ 3]
gi|387844548|emb|CCH52949.1| sulfatase [Fibrisoma limi BUZ 3]
Length = 477
Score = 143 bits (361), Expect = 2e-31, Method: Compositional matrix adjust.
Identities = 127/419 (30%), Positives = 191/419 (45%), Gaps = 55/419 (13%)
Query: 92 GWNDLSFH-----GSNEIPTPNIDALAYNGIILNNMYA-QPVCTPSRASLMTGKYPIHTG 145
G+ DL + G I TPN+D +A G+ YA VC PSR++LMTG + HT
Sbjct: 48 GYGDLGCYRNTGSGQQTIRTPNLDRMAREGMRFTQHYAGSTVCAPSRSTLMTGLHTGHTR 107
Query: 146 MQGPPIWGAEPRGVPLTERFLPEYLRELGYSTKAIGKWHLGFFRREYTPLYRGFESHFGY 205
++G +P ++ + EYL+ GY+T IGKW LG P +GF+ +GY
Sbjct: 108 IRGNARVPLQP-----DDKTVAEYLKGAGYTTALIGKWGLGEDGSTGIPRQQGFDYFYGY 162
Query: 206 LNGVIS--YYDHILSDQYSRT--------VELNGHDMRRNLSTAWDTVGEYATDLFTKEA 255
LN + +Y L ++ VE G + + +A E+ DLFT EA
Sbjct: 163 LNQTHAHNHYPDFLWRNETKVKLPNEVIYVENGG---VKGIGSAATKRVEWTQDLFTNEA 219
Query: 256 VQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEAPQETINQFQYITDPN----RRTYAAMV 311
+Q IE Q +P FLYL++++ HA N K + A + + + ++ +AA +
Sbjct: 220 LQFIE-QSKQQPFFLYLSYVSPHANNEHKLVAAHGIEVPDYGPYANEEFPDVQKAFAASI 278
Query: 312 KKLDDSVGTVISALQRKGMLENSIIIFMSDNGAPTVEYRETSNYRNWGSNYPYRGVKNTL 371
LD VG V++ L + +N+++IF SDNG P E Y N SN RG+K L
Sbjct: 279 TYLDAHVGKVLAKLHELKLDDNTLVIFSSDNG-PHAEGGNDPAYFN--SNGGLRGIKRDL 335
Query: 372 WEGGVKVPAILWSPQIQQNPRVSLQMMHISDWLPTLYTAAGGDTSRLPLNIDGLDQWSSL 431
+EGG++VP + P + +S D+LPT AG P +DGL +L
Sbjct: 336 YEGGIRVPMLARWPGRIKAGTISNHPSAFWDFLPTACEVAG---VPQPAGLDGLSYLPAL 392
Query: 432 LLNTPSRRNSNIDGLDQWSSLLLNTPSRRNSVLINIDEKKRTAAVRLDSWKLVLGTQEN 490
L T + + E+ R AVR S+KLV ++N
Sbjct: 393 LGK--------------------GTQPQHPYLYWEFHEQGRKQAVRQGSYKLVYLLKDN 431
Score = 41.6 bits (96), Expect = 1.5, Method: Compositional matrix adjust.
Identities = 22/55 (40%), Positives = 33/55 (60%), Gaps = 1/55 (1%)
Query: 3 RNLSTAWDTVGEYATDLFTKEAVQLIEDQPVDKPLFLYLAHLAAHAGNAGKHLEA 57
+ + +A E+ DLFT EA+Q IE Q +P FLYL++++ HA N K + A
Sbjct: 198 KGIGSAATKRVEWTQDLFTNEALQFIE-QSKQQPFFLYLSYVSPHANNEHKLVAA 251
Database: nr
Posted date: Mar 3, 2013 10:45 PM
Number of letters in database: 999,999,864
Number of sequences in database: 2,912,245
Database: /local_scratch/syshi//blastdatabase/nr.01
Posted date: Mar 3, 2013 10:52 PM
Number of letters in database: 999,999,666
Number of sequences in database: 2,912,720
Database: /local_scratch/syshi//blastdatabase/nr.02
Posted date: Mar 3, 2013 10:58 PM
Number of letters in database: 999,999,938
Number of sequences in database: 3,014,250
Database: /local_scratch/syshi//blastdatabase/nr.03
Posted date: Mar 3, 2013 11:03 PM
Number of letters in database: 999,999,780
Number of sequences in database: 2,805,020
Database: /local_scratch/syshi//blastdatabase/nr.04
Posted date: Mar 3, 2013 11:08 PM
Number of letters in database: 999,999,551
Number of sequences in database: 2,816,253
Database: /local_scratch/syshi//blastdatabase/nr.05
Posted date: Mar 3, 2013 11:13 PM
Number of letters in database: 999,999,897
Number of sequences in database: 2,981,387
Database: /local_scratch/syshi//blastdatabase/nr.06
Posted date: Mar 3, 2013 11:18 PM
Number of letters in database: 999,999,649
Number of sequences in database: 2,911,476
Database: /local_scratch/syshi//blastdatabase/nr.07
Posted date: Mar 3, 2013 11:24 PM
Number of letters in database: 999,999,452
Number of sequences in database: 2,920,260
Database: /local_scratch/syshi//blastdatabase/nr.08
Posted date: Mar 3, 2013 11:25 PM
Number of letters in database: 64,230,274
Number of sequences in database: 189,558
Lambda K H
0.317 0.134 0.411
Lambda K H
0.267 0.0410 0.140
Matrix: BLOSUM62
Gap Penalties: Existence: 11, Extension: 1
Number of Hits to DB: 10,710,577,095
Number of Sequences: 23463169
Number of extensions: 474644220
Number of successful extensions: 1055192
Number of sequences better than 100.0: 1000
Number of HSP's better than 100.0 without gapping: 4727
Number of HSP's successfully gapped in prelim test: 4818
Number of HSP's that attempted gapping in prelim test: 1013310
Number of HSP's gapped (non-prelim): 24058
length of query: 632
length of database: 8,064,228,071
effective HSP length: 149
effective length of query: 483
effective length of database: 8,863,183,186
effective search space: 4280917478838
effective search space used: 4280917478838
T: 11
A: 40
X1: 16 ( 7.3 bits)
X2: 38 (14.6 bits)
X3: 64 (24.7 bits)
S1: 41 (21.7 bits)
S2: 80 (35.4 bits)